Learning object shape Gal Elidan Geremy Heitz Daphne Koller February 12 th, 2006 PAIL.

Slides:



Advertisements
Similar presentations
Shape Matching and Object Recognition using Low Distortion Correspondence Alexander C. Berg, Tamara L. Berg, Jitendra Malik U.C. Berkeley.
Advertisements

Applications of one-class classification
CVPR2013 Poster Modeling Actions through State Changes.
A Bayesian Approach to Recognition Moshe Blank Ita Lifshitz Reverend Thomas Bayes
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
Convex Point Estimation using Undirected Bayesian Transfer Hierarchies Gal Elidan Ben Packer Geremy Heitz Daphne Koller Stanford AI Lab.
Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.
Learning to estimate human pose with data driven belief propagation Gang Hua, Ming-Hsuan Yang, Ying Wu CVPR 05.
Modeling the Shape of People from 3D Range Scans
Object Recognition & Model Based Tracking © Danica Kragic Tracking system.
LOCUS (Learning Object Classes with Unsupervised Segmentation) A variational approach to learning model- based segmentation. John Winn Microsoft Research.
Hidden Variables, the EM Algorithm, and Mixtures of Gaussians Computer Vision CS 143, Brown James Hays 02/22/11 Many slides from Derek Hoiem.
Hidden Variables, the EM Algorithm, and Mixtures of Gaussians Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/15/12.
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
A Study of Approaches for Object Recognition
Features-based Object Recognition Pierre Moreels California Institute of Technology Thesis defense, Sept. 24, 2007.
Problem Sets Problem Set 3 –Distributed Tuesday, 3/18. –Due Thursday, 4/3 Problem Set 4 –Distributed Tuesday, 4/1 –Due Tuesday, 4/15. Probably a total.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Transferring information using Bayesian priors on object categories Li Fei-Fei 1, Rob Fergus 2, Pietro Perona 1 1 California Institute of Technology, 2.
3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views
CVPR 2006 New York City Granularity and Elasticity Adaptation in Visual Tracking Ming Yang, Ying Wu NEC Laboratories America Cupertino, CA 95014
Distinctive image features from scale-invariant keypoints. David G. Lowe, Int. Journal of Computer Vision, 60, 2 (2004), pp Presented by: Shalomi.
The Layout Consistent Random Field for Recognizing and Segmenting Partially Occluded Objects By John Winn & Jamie Shotton CVPR 2006 presented by Tomasz.
Recovering Articulated Object Models from 3D Range Data Dragomir Anguelov Daphne Koller Hoi-Cheung Pang Praveen Srinivasan Sebastian Thrun Computer Science.
Transfer Learning of Object Classes: From Cartoons to Photographs NIPS Workshop Inductive Transfer: 10 Years Later Geremy Heitz Gal Elidan Daphne Koller.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
Cue Integration in Figure/Ground Labeling Xiaofeng Ren, Charless Fowlkes and Jitendra Malik, U.C. Berkeley We present a model of edge and region grouping.
1 Outline Overview Integrating Vision Models CCM: Cascaded Classification Models Learning Spatial Context TAS: Things and Stuff Descriptive Querying of.
Learning Spatial Context: Can stuff help us find things? Geremy Heitz Daphne Koller April 14, 2008 DAGS Stuff (n): Material defined by a homogeneous or.
Unsupervised Category Modeling, Recognition and Segmentation Sinisa Todorovic and Narendra Ahuja.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.
Recognition using Regions (Demo) Sudheendra V. Outline Generating multiple segmentations –Normalized cuts [Ren & Malik (2003)] Uniform regions –Watershed.
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
#MOTION ESTIMATION AND OCCLUSION DETECTION #BLURRED VIDEO WITH LAYERS
Automated Detection and Classification Models SAR Automatic Target Recognition Proposal J.Bell, Y. Petillot.
Markov Random Fields Probabilistic Models for Images
Fast Direct Super-Resolution by Simple Functions
Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.
Max-Margin Classification of Data with Absent Features Presented by Chunping Wang Machine Learning Group, Duke University July 3, 2008 by Chechik, Heitz,
Feature based deformable registration of neuroimages using interest point and feature selection Leonid Teverovskiy Center for Automated Learning and Discovery.
Discovering Deformable Motifs in Time Series Data Jin Chen CSE Fall 1.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
Automatic Locating of Anthropometric Landmarks on 3D Human Models
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
Final Review Course web page: vision.cis.udel.edu/~cv May 21, 2003  Lecture 37.
CSE 185 Introduction to Computer Vision Feature Matching.
MIT AI Lab / LIDS Laboatory for Information and Decision Systems & Artificial Intelligence Laboratory Massachusetts Institute of Technology A Unified Multiresolution.
Lecture 9 Feature Extraction and Motion Estimation Slides by: Michael Black Clark F. Olson Jean Ponce.
Multi Scale CRF Based RGB-D Image Segmentation Using Inter Frames Potentials Taha Hamedani Robot Perception Lab Ferdowsi University of Mashhad The 2 nd.
Extracting Simple Verb Frames from Images Toward Holistic Scene Understanding Prof. Daphne Koller Research Group Stanford University Geremy Heitz DARPA.
Hmm, HID HMMs Gerald Dalley MIT AI Lab Activity Perception Group Group Meeting 17 April 2003.
Hidden Variables, the EM Algorithm, and Mixtures of Gaussians Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/22/11.
SUMMERY 1. VOLUMETRIC FEATURES FOR EVENT DETECTION IN VIDEO correlate spatio-temporal shapes to video clips that have been automatically segmented we.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Learning a Region-based Scene Segmentation Model
LOCUS: Learning Object Classes with Unsupervised Segmentation
Nonparametric Semantic Segmentation
Dynamical Statistical Shape Priors for Level Set Based Tracking
Object Recognition by Parts
Scott Tan Boonping Lau Chun Hui Weng
Reconstructing Shredded Documents
Brief Review of Recognition + Context
Matching Words with Pictures
Cascaded Classification Models
CSE 185 Introduction to Computer Vision
Object Recognition by Parts
Presentation transcript:

Learning object shape Gal Elidan Geremy Heitz Daphne Koller February 12 th, 2006 PAIL

Localization vs. Recognition Traditional question: “Is there an object of type X in this image?” Airplane? NO Human? YES Dog? YES Our question: “Where in this image is the object of type X?” MAN DOG The man is walking the dog

Outline Registration Shape Learning Shape and Appearance Learning ?

Outline Landmark-based shape model Registration as inference Learning Results 3D preview

Shape Model Set of landmarks Outline is defined via piecewise-linear contour Features of individual landmarks Features of pairs of landmarks tail nose

“Registering” the Model to an Image Task: Assign each landmark l  L to a pixel p l  P ? ? Basic tool: local matches Local score feature + p l

“Shape aware” features Shape Template Patch Appearance (Foreground/Background)

“Shape aware” features Shape template: vector of expected pixel locations relative to landmark good matchbad match

FG/BGappearance feature Histogram of various appearance components (A) Includes RGB, HSV, Texture components … a 1 = H a 2 = T FG … a 1 = H a 2 = T BG

FG/BGappearance feature Histogram of various appearance components (A) Includes RGB, HSV, Texture components … a 1 = H a 2 = T I … a 1 = H a 2 = T M

Pairwise Landmark Features plpl pmpm dXdX XX YY l m dYdY

Outline Landmark-based shape model Registration as inference Learning Results 3D preview

Registration Are local cues enough? Need to jointly consider all cues (features) “Correct” pixel is often not the best match! Inference using max-product

Registration Markov Random Field Random variables = Landmarks Domain = Image pixels Potentials = scores of features Inference using max-product Registration = Most Likely Assignment L nose L tail L under L cockpit

Registration Likelihood Given an assignment of landmarks to pixels The likelihood is defined as local features pairwise features rotations

Domain Pruning Can we use all pixels as the domain? Average Image = 60,000 pixels PW Potentials: 60K X 60K = 3.6B entries First consider only edge pixels Order of 1K Canny edge pixels per image Consider top matches of local features Correct match generally in top 50

Outline Landmark-based shape model Registration as inference Learning Results 3D preview

Bootstrap from simple instances where outlining is easy = cartoons / drawings Learning Challenge Hand LabelHidden Variables Costly, and time-consuming Where to start? Local optima problem no confusing background outline (shape) is easily recovered using snake ? ? ? ? ?

Learning from Cartoon Drawings Registration Shape Learning Shape and Appearance Learning + ?

Contour-Contour Registration Treat one contour as an image Build a model from the other contour (with prior on variances) Registration technique/code is the same

Model Construction Shape template: Sampled along contour, rotated to match, averaged over instances Appearance template Average inside/outside masks Location and pairwise parameters Maximum likelihood estimation of Gaussian moments = ModelInstance 1Instance 2 + Instance 1Instance 2 + OUT IN OUT IN = OUT IN Model

Phase I: Learning from Cartoons Extract high resolution contour using snake Create shape-based model from training contours Pairwise merging of models Selection of landmarks Registration Pyramid Final Shape Model

Learning from Cartoons Extract high resolution contour using snake Create shape-based model (with prior variances) Pairwise merging of models Selection of landmarks Final Shape Model

Outline Landmark-based shape model Registration as inference Learning from cartoon drawings Results 3D preview

Measure of success Typical prediction measure: has nothing to do with localization / outlining 0.77 automatic outlining true outline OMR

Localization Results training cartoons sample registration

Localization Results - 2

Cartoon vs. Hand Segmentation Number of Training Instances Mean Overlap Score Learned from Drawings Hand Constructed Human Inter-Observer cartoon hand segmented Learning shape from cartoons is competitive with hand segmentation!

Prediction False positive rate True positive rate objectrecognition car side86% cougar86% airplane86% buddha84% bass76% rooster73%  Comparable to constellation w/ 5 instances (Fei Fei et. Al)  Leading (discriminative) methods require many instances TP=FP rate

How hard is the problem? registration of shape appears absurd support of edges is not what we expect

Effect of different features shape only with appearancewith location prior

Learning Appearance No Appearance FG/BG Appearance Average overlap shape shape + appearance # images in phase II

Other approaches Other ApproachOur Approach Discriminative [Grauman/Darrell, Serre] Ignorant of shape Rely on image stats Many Instances Shape aware Rely on object stats Few instances Geometric [Fergus,Fei-Fei,Quattoni] Good recognition Localization: not great Good precise outlining Competitive recognition Landmark based [Berg] Match to template rather than model Leverage image stats Model shape, match to object in image Outlining [Cootes,Coughlan,Kumar] Limited scenarios Few D.O.F. Varied classes Flexible model

3D model to 2D images Capture all properties of an object Model projection invariants decimation

3D to 2D registration Projection 1P2P3 Given projection this is just 2D registration

Learning 3D to 2D invariants initial (prior) model P1P2P3 Probabilistic 3D model Distances: EM with least-squares problem Appearance: ???

Summary and Future Work  Flexible probabilistic shape model  Effective outlining of object  Novel learning from cartoons Develop a better appearance model Generalize to 3D models Learn projection invariants

Thanks! Gal Elidan Geremy Heitz Daphne Koller Stanford University

Training Set Selection high score low score Phase II: Learning from Images Correspond initial model to training images Select best correspondences as training instances Learn final shape- and appearance-based model Cartoon Phase Model Natural Image Model Transfer

Transfer of Object Shape Transfer of shape speeds up learning Benefit of shape transfer # images in phase II Average overlap transfer no transfer

Training Instance Selection AUTOPICKED AUTO HAND # images in phase II Average overlap PICKED

Training Instance Selection AUTO HAND PICKED # images in phase II Average overlap