Object Recognition by Parts

Slides:



Advertisements
Similar presentations
Active Appearance Models
Advertisements

Distinctive Image Features from Scale-Invariant Keypoints David Lowe.
Scale & Affine Invariant Interest Point Detectors Mikolajczyk & Schmid presented by Dustin Lennon.
Image Modeling & Segmentation
A Bayesian Approach to Recognition Moshe Blank Ita Lifshitz Reverend Thomas Bayes
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
Recognition by Probabilistic Hypothesis Construction P. Moreels, M. Maire, P. Perona California Institute of Technology.
Unsupervised Learning of Visual Object Categories Michael Pfeiffer
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Pedestrian Detection in Crowded Scenes Dhruv Batra ECE CMU.
Unsupervised Learning for Recognition Pietro Perona California Institute of Technology & Universita di Padova 11 th British Machine Vision Conference –
, Tim Landgraf Active Appearance Models AG KI, Journal Club 03 Nov 2008.
In Search of Objects: 50 years of wondering : Learning-Based Methods in Vision A. Efros, CMU, Spring 2009.
Model: Parts and Structure. History of Idea Fischler & Elschlager 1973 Yuille ‘91 Brunelli & Poggio ‘93 Lades, v.d. Malsburg et al. ‘93 Cootes, Lanitis,
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Mean-Shift Algorithm and Its Application Bohyung Han
Generic Object Recognition -- by Yatharth Saraf A Project on.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Transferring information using Bayesian priors on object categories Li Fei-Fei 1, Rob Fergus 2, Pietro Perona 1 1 California Institute of Technology, 2.
Object recognition Jana Kosecka Slides from D. Lowe, D. Forsythe and J. Ponce book, ICCV 2005 Tutorial Fei-Fei Li, Rob Fergus and A. Torralba.
Object recognition Jana Kosecka Slides from D. Lowe, D. Forsythe and J. Ponce book, ICCV 2005 Tutorial Fei-Fei Li, Rob Fergus and A. Torralba.
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
Object Class Recognition by Unsupervised Scale-Invariant Learning R. Fergus, P. Perona, and A. Zisserman Presented By Jeff.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Unsupervised Category Modeling, Recognition and Segmentation Sinisa Todorovic and Narendra Ahuja.
Statistical Shape Models Eigenpatches model regions –Assume shape is fixed –What if it isn’t? Faces with expression changes, organs in medical images etc.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
CS 485/685 Computer Vision Face Recognition Using Principal Components Analysis (PCA) M. Turk, A. Pentland, "Eigenfaces for Recognition", Journal of Cognitive.
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The.
Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Bayesian Parameter Estimation Liad Serruya. Agenda Introduction Bayesian decision theory Scale-Invariant Learning Bayesian “One-Shot” Learning.
CSE 185 Introduction to Computer Vision Face Recognition.
Paper Reading Dalong Du Nov.27, Papers Leon Gu and Takeo Kanade. A Generative Shape Regularization Model for Robust Face Alignment. ECCV08. Yan.
3D Face Recognition Using Range Images
Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
1 Kernel Machines A relatively new learning methodology (1992) derived from statistical learning theory. Became famous when it gave accuracy comparable.
Unsupervised Learning Part 2. Topics How to determine the K in K-means? Hierarchical clustering Soft clustering with Gaussian mixture models Expectation-Maximization.
Object Recognition by Parts
Presented by David Lee 3/20/2006
LECTURE 09: BAYESIAN ESTIMATION (Cont.)
University of Ioannina
Machine Learning Basics
Context-based vision system for place and object recognition
Object Recognition by Parts
Saliency, Scale and Image Description (by T. Kadir and M
A. Opelt, M. Fussenegger, A. Pinz, P. Auer
Finding Clusters within a Class to Improve Classification Accuracy
Lecture 26: Faces and probabilities
Face Recognition and Detection Using Eigenfaces
Object Recognition by Parts
SMEM Algorithm for Mixture Models
Brief Review of Recognition + Context
Unsupervised Learning of Models for Recognition
Unsupervised Learning II: Soft Clustering with Gaussian Mixture Models
Unsupervised learning of models for recognition
SIFT keypoint detection
Announcements Project 4 out today Project 2 winners help session today
Patch-Based Image Classification Using Image Epitomes
Where are we? We have covered: Project 1b was due today
Object Recognition by Parts
Object Recognition with Interest Operators
The “Margaret Thatcher Illusion”, by Peter Thompson
Presentation transcript:

Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to systems that extracted linear features. . - CAD-model-based vision works well for industrial. An “appearance-based approach” was first developed for face recognition and later generalized up to a point. The new interest operators have led to a new kind of recognition by “parts” that can handle a variety of objects that were previously difficult or impossible.

Object Class Recognition by Unsupervised Scale-Invariant Learning R. Fergus, P. Perona, and A. Zisserman Oxford University and Caltech CVPR 2003 won the best student paper award CVPR 2013 won the best 10-year award

Goal: Enable Computers to Recognize Different Categories of Objects in Images.

Difficulties: Size Variation,

Approach An object is a constellation of parts (from Burl, Weber and Perona, 1998). The parts are detected by an interest operator (Kadir’s). The parts can be recognized by appearance. Objects may vary greatly in scale. The constellation of parts for a given object is learned from training images

Components Model Learning Recognition Generative Probabilistic Model including Location, Scale, and Appearance of Parts Learning Estimate Parameters Via EM Algorithm Recognition Evaluate Image Using Model and Threshold

Model: Constellation Of Parts Fischler & Elschlager, 1973 Yuille, ‘91 Brunelli & Poggio, ‘93 Lades, v.d. Malsburg et al. ‘93 Cootes, Lanitis, Taylor et al. ‘95 Amit & Geman, ‘95, ‘99 Perona et al. ‘95, ‘96, ’98, ‘00

Parts Selected by Interest Operator Kadir and Brady's Interest Operator. Finds Maxima in Entropy Over Scale and Location

Representation of Appearance Projection onto PCA basis 11x11 patch Normalize 121 dimensions was too big, so they used PCA to reduce to 10-15. c15

Learning a Model An object class is represented by a generative model with P parts and a set of parameters . Once the model has been learned, a decision procedure must determine if a new image contains an instance of the object class or not. Suppose the new image has N interesting features with locations X, scales S and appearances A.

Probabilistic Model X is a description of the shape of the object (in terms of locations of parts) S is a description of the scale of the object A is a description of the appearance of the object θ is the (maximum likelihood value of) the parameters of the object h is a hypothesis: a set of parts in the image that might be the parts of the object H is the set of all possible hypotheses for that object in that image. For N features in the image and P parts in the object, its size is O(NP)

Appearance The appearance (A) of each part p has a Gaussian density with mean cp and covariance VP. Background model has mean cbg and covariance Vbg. Gaussian Part Appearance PDF Guausian Appearance PDF Object Background

Shape as Location Object shape is represented by a joint Gaussian density of the locations (X) of features within a hypothesis transformed into a scale-invariant space. Gaussian Shape PDF Uniform Shape PDF Object Background

Scale The relative scale of each part is modeled by a Gaussian density with mean tp and covariance Up. Prob. of detection Gaussian Relative Scale PDF 0.8 0.75 0.9 Log(scale)

Occlusion and Part Statistics This was very complicated and turned out to not work well and not be necessary, in both Fergus’s work and other subsequent works.

Learning Train Model Parameters Using EM: Optimize Parameters Optimize Assignments Repeat Until Convergence location occlusion scale appearance

Recognition Make this likelihood ratio: greater than a threshold.

RESULTS Initially tested on the Caltech-4 data set motorbikes faces airplanes cars Now there is a much bigger data set: the Caltech-101 http://www.vision.caltech.edu/archive.html

Motorbikes Equal error rate: 7.5%

It learns that these are NOT motorbikes. Background Images It learns that these are NOT motorbikes.

Frontal faces Equal error rate: 4.6%

Airplanes Equal error rate: 9.8%

Scale-Invariant Cats Equal error rate: 10.0%

Scale-Invariant Cars Equal error rate: 9.7%

Accuracy Initial Pre-Scaled Experiments