Object Recognition by Parts

Slides:



Advertisements
Similar presentations
Shape Matching and Object Recognition using Low Distortion Correspondence Alexander C. Berg, Tamara L. Berg, Jitendra Malik U.C. Berkeley.
Advertisements

Active Appearance Models
Distinctive Image Features from Scale-Invariant Keypoints David Lowe.
Image Modeling & Segmentation
CSCE643: Computer Vision Bayesian Tracking & Particle Filtering Jinxiang Chai Some slides from Stephen Roth.
A Bayesian Approach to Recognition Moshe Blank Ita Lifshitz Reverend Thomas Bayes
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
Recognition by Probabilistic Hypothesis Construction P. Moreels, M. Maire, P. Perona California Institute of Technology.
Unsupervised Learning of Visual Object Categories Michael Pfeiffer
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Pedestrian Detection in Crowded Scenes Dhruv Batra ECE CMU.
Unsupervised Learning for Recognition Pietro Perona California Institute of Technology & Universita di Padova 11 th British Machine Vision Conference –
, Tim Landgraf Active Appearance Models AG KI, Journal Club 03 Nov 2008.
Model: Parts and Structure. History of Idea Fischler & Elschlager 1973 Yuille ‘91 Brunelli & Poggio ‘93 Lades, v.d. Malsburg et al. ‘93 Cootes, Lanitis,
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Mean-Shift Algorithm and Its Application Bohyung Han
Generic Object Recognition -- by Yatharth Saraf A Project on.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Transferring information using Bayesian priors on object categories Li Fei-Fei 1, Rob Fergus 2, Pietro Perona 1 1 California Institute of Technology, 2.
Object recognition Jana Kosecka Slides from D. Lowe, D. Forsythe and J. Ponce book, ICCV 2005 Tutorial Fei-Fei Li, Rob Fergus and A. Torralba.
Project 4 out today –help session today –photo session today Project 2 winners Announcements.
Object recognition Jana Kosecka Slides from D. Lowe, D. Forsythe and J. Ponce book, ICCV 2005 Tutorial Fei-Fei Li, Rob Fergus and A. Torralba.
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
Object Class Recognition by Unsupervised Scale-Invariant Learning R. Fergus, P. Perona, and A. Zisserman Presented By Jeff.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
Unsupervised Category Modeling, Recognition and Segmentation Sinisa Todorovic and Narendra Ahuja.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
CS 485/685 Computer Vision Face Recognition Using Principal Components Analysis (PCA) M. Turk, A. Pentland, "Eigenfaces for Recognition", Journal of Cognitive.
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The.
Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Bayesian Parameter Estimation Liad Serruya. Agenda Introduction Bayesian decision theory Scale-Invariant Learning Bayesian “One-Shot” Learning.
CSE 185 Introduction to Computer Vision Face Recognition.
Topic Models Presented by Iulian Pruteanu Friday, July 28 th, 2006.
Paper Reading Dalong Du Nov.27, Papers Leon Gu and Takeo Kanade. A Generative Shape Regularization Model for Robust Face Alignment. ECCV08. Yan.
3D Face Recognition Using Range Images
Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.
Face detection and recognition Many slides adapted from K. Grauman and D. Lowe.
1 Kernel Machines A relatively new learning methodology (1992) derived from statistical learning theory. Became famous when it gave accuracy comparable.
Unsupervised Learning Part 2. Topics How to determine the K in K-means? Hierarchical clustering Soft clustering with Gaussian mixture models Expectation-Maximization.
Object Recognition by Parts
Presented by David Lee 3/20/2006
LECTURE 09: BAYESIAN ESTIMATION (Cont.)
University of Ioannina
Machine Learning Basics
Saliency, Scale and Image Description (by T. Kadir and M
Finding Clusters within a Class to Improve Classification Accuracy
Lecture 26: Faces and probabilities
Face Recognition and Detection Using Eigenfaces
Course Outline MODEL INFORMATION COMPLETE INCOMPLETE
Object Recognition by Parts
Object Recognition by Parts
SMEM Algorithm for Mixture Models
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Brief Review of Recognition + Context
Unsupervised Learning of Models for Recognition
Unsupervised learning of models for recognition
SIFT keypoint detection
Announcements Project 4 out today Project 2 winners help session today
Where are we? We have covered: Project 1b was due today
EM Algorithm and its Applications
Object Recognition by Parts
Object Recognition with Interest Operators
The “Margaret Thatcher Illusion”, by Peter Thompson
Presentation transcript:

Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to systems that extracted linear features. . - CAD-model-based vision works well for industrial. An “appearance-based approach” was first developed for face recognition and later generalized up to a point. The new interest operators have led to a new kind of recognition by “parts” that can handle a variety of objects that were previously difficult or impossible.

Object Class Recognition by Unsupervised Scale-Invariant Learning R. Fergus, P. Perona, and A. Zisserman Oxford University and Caltech CVPR 2003 won the best student paper award

Goal: Enable Computers to Recognize Different Categories of Objects in Images.

Difficulties: Size Variation,

Approach An object is a constellation of parts (from Burl, Weber and Perona, 1998). The parts are detected by an interest operator (Kadir’s). The parts can be recognized by appearance. Objects may vary greatly in scale. The constellation of parts for a given object is learned from training images

Components Model Learning Recognition Generative Probabilistic Model including Location, Scale, and Appearance of Parts Learning Estimate Parameters Via EM Algorithm Recognition Evaluate Image Using Model and Threshold

Model: Constellation Of Parts Fischler & Elschlager, 1973 Yuille, ‘91 Brunelli & Poggio, ‘93 Lades, v.d. Malsburg et al. ‘93 Cootes, Lanitis, Taylor et al. ‘95 Amit & Geman, ‘95, ‘99 Perona et al. ‘95, ‘96, ’98, ‘00

Parts Selected by Interest Operator Kadir and Brady's Interest Operator. Finds Maxima in Entropy Over Scale and Location

Representation of Appearance Projection onto PCA basis 11x11 patch Normalize 121 dimensions was too big, so they used PCA to reduce to 10-15. c15

Learning a Model An object class is represented by a generative model with P parts and a set of parameters . Once the model has been learned, a decision procedure must determine if a new image contains an instance of the object class or not. Suppose the new image has N interesting features with locations X, scales S and appearances A.

Generative Probabilistic Model Top-Down Formulation R is the likelihood ratio.  is the maximum likelihood value of the parameters of the object and bg of the background. h is the hypothesis as to which P of the N features in the image are the object, implemented as a vector of length P with values from 0 to N indicating which image feature corresponds to each object feature. H is the set of all hypotheses; Its size is O(NP). Bayesian Decision Rule

Appearance The appearance (A) of each part p has a Gaussian density with mean cp and covariance VP. Background model has mean cbg and covariance Vbg. Gaussian Part Appearance PDF Guausian Appearance PDF Object Background

Shape as Location Object shape is represented by a joint Gaussian density of the locations (X) of features within a hypothesis transformed into a scale-invariant space. Gaussian Shape PDF Uniform Shape PDF Object Background

Scale The relative scale of each part is modeled by a Gaussian density with mean tp and covariance Up. Prob. of detection Gaussian Relative Scale PDF 0.8 0.75 0.9 Log(scale)

Occlusion and Part Statistics This was very complicated and turned out to not work well and not be necessary, in both Fergus’s work and other subsequent works.

Learning Train Model Parameters Using EM: Optimize Parameters Optimize Assignments Repeat Until Convergence location occlusion scale appearance

Recognition Make this likelihood ratio: greater than a threshold.

RESULTS Initially tested on the Caltech-4 data set motorbikes faces airplanes cars Now there is a much bigger data set: the Caltech-101 http://www.vision.caltech.edu/archive.html

Motorbikes Equal error rate: 7.5%

It learns that these are NOT motorbikes. Background Images It learns that these are NOT motorbikes.

Frontal faces Equal error rate: 4.6%

Airplanes Equal error rate: 9.8%

Scale-Invariant Cats Equal error rate: 10.0%

Scale-Invariant Cars Equal error rate: 9.7%

Accuracy Initial Pre-Scaled Experiments