Computer Vision Chapter 4

Slides:



Advertisements
Similar presentations
Principles of Density Estimation
Advertisements

Naïve Bayes. Bayesian Reasoning Bayesian reasoning provides a probabilistic approach to inference. It is based on the assumption that the quantities of.
Component Analysis (Review)
Data Mining Classification: Alternative Techniques
Data Mining Classification: Alternative Techniques
Pattern Classification. Chapter 2 (Part 1): Bayesian Decision Theory (Sections ) Introduction Bayesian Decision Theory–Continuous Features.
Pattern recognition Professor Aly A. Farag
Pattern Recognition: Readings: Ch 4: , , 4.13
0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Chapter 2: Pattern Recognition
Data-intensive Computing Algorithms: Classification Ref: Algorithms for the Intelligent Web 6/26/20151.
1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.
Chapter 4 (part 2): Non-Parametric Classification
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
METU Informatics Institute Min 720 Pattern Classification with Bio-Medical Applications PART 2: Statistical Pattern Classification: Optimal Classification.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
ECSE 6610 Pattern Recognition Professor Qiang Ji Spring, 2011.
Principles of Pattern Recognition
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Data Mining Practical Machine Learning Tools and Techniques Chapter 4: Algorithms: The Basic Methods Section 4.6: Linear Models Rodney Nielsen Many of.
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
Pattern Recognition 1 Pattern recognition is: 1. The name of the journal of the Pattern Recognition Society. 2. A research area in which patterns in data.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 24 Nov 2, 2005 Nanjing University of Science & Technology.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
Optimal Bayes Classification
Chapter 3: Maximum-Likelihood Parameter Estimation l Introduction l Maximum-Likelihood Estimation l Multivariate Case: unknown , known  l Univariate.
Digital Image Processing Lecture 25: Object Recognition Prof. Charlene Tsai.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Computer and Robot Vision II Chapter 20 Accuracy Presented by: 傅楸善 & 王林農 指導教授 : 傅楸善 博士.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Fast Query-Optimized Kernel Machine Classification Via Incremental Approximate Nearest Support Vectors by Dennis DeCoste and Dominic Mazzoni International.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 04: GAUSSIAN CLASSIFIERS Objectives: Whitening.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Data-intensive Computing Algorithms: Classification Ref: Algorithms for the Intelligent Web 7/10/20161.
Machine Learning – Classification David Fenyő
Chapter 7. Classification and Prediction
Chapter 3: Maximum-Likelihood Parameter Estimation
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Ch9: Decision Trees 9.1 Introduction A decision tree:
Pattern Recognition Pattern recognition is:
Classification Techniques: Bayesian Classification
LECTURE 05: THRESHOLD DECODING
Course Outline MODEL INFORMATION COMPLETE INCOMPLETE
Hidden Markov Models Part 2: Algorithms
Computer Vision Chapter 4
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Instance Based Learning
Computer Vision Chapter 4
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Parametric Methods Berlin Chen, 2005 References:
Multivariate Methods Berlin Chen
LECTURE 05: THRESHOLD DECODING
Multivariate Methods Berlin Chen, 2005 References:
Computer and Robot Vision I
Computer and Robot Vision I
Computer and Robot Vision I
A task of induction to find patterns
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Computer Vision II Chapter 20 Accuracy
Presentation transcript:

Computer Vision Chapter 4 Statistical Pattern Recognition Presenter: 王夏果 Cell phone: 0937384214 E-mail: r94922103@ntu.edu.tw Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.

Introduction Units: Image regions and projected segments Each unit has an associated measurement vector Using decision rule to assign unit to class or category optimally DC & CV Lab. CSIE NTU

Introduction (Cont.) Feature selection and extraction techniques Decision rule construction techniques Techniques for estimating decision rule error DC & CV Lab. CSIE NTU

Simple Pattern Discrimination Also called pattern identification process A unit is observed or measured A category assignment is made that names or classifies the unit as a type of object The category assignment is made only on observed measurement (pattern) DC & CV Lab. CSIE NTU

Simple Pattern Discrimination (cont.) a: assigned category from a set of categories C t: true category identification from C d: observed measurement from a set of measurements D (t, a, d): event of classifying the observed unit P(t, a, d): probability of the event (t, a, b) DC & CV Lab. CSIE NTU

Economic Gain Matrix e(t, a): economic gain/utility with true category t and assigned category a A mechanism to evaluate a decision rule Identity gain matrix DC & CV Lab. CSIE NTU

An Instance DC & CV Lab. CSIE NTU

Another Instance P(g, g): probability of true good, assigned good, P(g, b): probability of true good, assigned bad, ... e(g, g): economic consequence for event (g, g), … e positive: profit consequence e negative: loss consequence DC & CV Lab. CSIE NTU

Another Instance (cont.) DC & CV Lab. CSIE NTU

Another Instance (cont.) DC & CV Lab. CSIE NTU

Another Instance (cont.) Fraction of good objects manufactured P(g) = P(g, g) + P(g, b) P(b) = P(b, g) + P(b, b) Expected profit per object E = DC & CV Lab. CSIE NTU

Conditional Probability DC & CV Lab. CSIE NTU

Conditional Probability (cont.) P(b|g): false-alarm rate P(g|b): misdetection rate Another formula for expected profit per object DC & CV Lab. CSIE NTU

Example 4.1 P(g) = 0.95, P(b) = 0.05 DC & CV Lab. CSIE NTU

Example 4.1 (cont.) DC & CV Lab. CSIE NTU

Example 4.2 P(g) = 0.95, P(b) = 0.05 DC & CV Lab. CSIE NTU

Example 4.2 (cont.) DC & CV Lab. CSIE NTU

Decision Rule Construction (t, a): summing (t, a, d) on every measurements d Therefore, Average economic gain DC & CV Lab. CSIE NTU

Decision Rule Construction (cont.) DC & CV Lab. CSIE NTU

Decision Rule Construction (cont.) We can use identity matrix as the economic gain matrix to compute the probability of correct assignment: DC & CV Lab. CSIE NTU

Fair Game Assumption Decision rule uses only measurement data in assignment; the nature and the decision rule are not in collusion In other words, P(a| t, d) = P(a| d) DC & CV Lab. CSIE NTU

Fair Game Assumption (cont.) From the definition of conditional probability DC & CV Lab. CSIE NTU

Fair Game Assumption (cont.) By fair game assumption, P(t, a, d) = By definition, = DC & CV Lab. CSIE NTU

Deterministic Decision Rule We use the notation f(a|d) to completely define a decision rule; f(a|d) presents all the conditional probability associated with the decision rule A deterministic decision rule: Decision rules which are not deterministic are called probabilistic/nondeterministic/stochastic DC & CV Lab. CSIE NTU

Expected Value on f(a|d) Previous formula By and => DC & CV Lab. CSIE NTU

Expected Value on f(a|d) (cont.) DC & CV Lab. CSIE NTU

Bayes Decision Rules Maximize expected economic gain Satisfy DC & CV Lab. CSIE NTU

Bayes Decision Rules (cont.) DC & CV Lab. CSIE NTU

Bayes Decision Rules (cont.) + + DC & CV Lab. CSIE NTU

Continuous Measurement For the same example, try the continuous density function of the measurements: and Prove that they are indeed density function DC & CV Lab. CSIE NTU

Continuous Measurement (cont.) Suppose that the prior probability of is and the prior probability of is When , a Bayes decision rule will assign an observed unit to t1, which implies => x: measurement DC & CV Lab. CSIE NTU

Continuous Measurement (cont.) .805 > .68, the continuous measurement has larger expected economic gain than discrete DC & CV Lab. CSIE NTU

Prior Probability The Bayes rule: Replace with The Bayes rule can be determined by assigning any categories that maximizes DC & CV Lab. CSIE NTU

Economic Gain Matrix Identity matrix Incorrect loses 1 A more balanced instance DC & CV Lab. CSIE NTU

Maximin Decision Rule Maximizes average gain over worst prior probability DC & CV Lab. CSIE NTU

Example 4.3 DC & CV Lab. CSIE NTU

Example 4.3 (cont.) DC & CV Lab. CSIE NTU

Example 4.3 (cont.) DC & CV Lab. CSIE NTU

Example 4.3 (cont.) The lowest Bayes gain is achieved when The lowest gain is 0.6714 DC & CV Lab. CSIE NTU

Example 4.3 (cont.) DC & CV Lab. CSIE NTU

Example 4.4 DC & CV Lab. CSIE NTU

Example 4.4 (cont.) DC & CV Lab. CSIE NTU

Example 4.4 (cont.) DC & CV Lab. CSIE NTU

Example 4.4 (cont.) DC & CV Lab. CSIE NTU

Example 4.5 DC & CV Lab. CSIE NTU

Example 4.5 (cont.) DC & CV Lab. CSIE NTU

Example 4.5 (cont.) DC & CV Lab. CSIE NTU

Decision Rule Error The misidentification errorαk The false-identification error βk DC & CV Lab. CSIE NTU

An Instance DC & CV Lab. CSIE NTU

Reserving Judgment The decision rule may withhold judgment for some measurements Then, the decision rule is characterized by the fraction of time it withhold judgment and the error rate for those measurement it does assign. It is an important technique to control error rate. DC & CV Lab. CSIE NTU

Nearest Neighbor Rule Assign pattern x to the closest vector in the training set The definition of “closest”: where is a metric or measurement space Chief difficulty: brute-force nearest neighbor algorithm computational complexity proportional to number of patterns in training set brute-force nearest neighbor:暴力法 DC & CV Lab. CSIE NTU

Binary Decision Tree Classifier Assign by hierarchical decision procedure DC & CV Lab. CSIE NTU

Major Problems Choosing tree structure Choosing features used at each non-terminal node Choosing decision rule at each non-terminal node DC & CV Lab. CSIE NTU

Decision Rules at the Non-terminal Node Thresholding the measurement component Fisher’s linear decision rule Bayes quadratic decision rule Bayes linear decision rule Linear decision rule from the first principal component DC & CV Lab. CSIE NTU

Error Estimation An important way to characterize the performance of a decision rule Training data set: must be independent of testing data set Hold-out method: a common technique construct the decision rule with half the data set, and test with the other half DC & CV Lab. CSIE NTU

Neural Network A set of units each of which takes a linear combination of values from either an input vector or the output of other units DC & CV Lab. CSIE NTU

Neural Network (cont.) Has a training algorithm Responses observed Reinforcement algorithms Back propagation to change weights DC & CV Lab. CSIE NTU

Summary Bayesian approach Maximin decision rule Misidentification and false-alarm error rates Nearest neighbor rule Construction of decision trees Estimation of decision rules error Neural network DC & CV Lab. CSIE NTU