1 Outline Overview Integrating Vision Models CCM: Cascaded Classification Models Learning Spatial Context TAS: Things and Stuff Descriptive Querying of.

Slides:



Advertisements
Similar presentations
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Advertisements

Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)
The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.
CVPR2013 Poster Modeling Actions through State Changes.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
A generic model to compose vision modules for holistic scene understanding Adarsh Kowdle *, Congcong Li *, Ashutosh Saxena, and Tsuhan Chen Cornell University,
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Part 4: Combined segmentation and recognition by Rob Fergus (MIT)
Scene Labeling Using Beam Search Under Mutex Constraints ID: O-2B-6 Anirban Roy and Sinisa Todorovic Oregon State University 1.
Mixture of trees model: Face Detection, Pose Estimation and Landmark Localization Presenter: Zhang Li.
Structural Human Action Recognition from Still Images Moin Nabi Computer Vision Lab. ©IPM - Oct
LOCUS (Learning Object Classes with Unsupervised Segmentation) A variational approach to learning model- based segmentation. John Winn Microsoft Research.
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.
Groups of Adjacent Contour Segments for Object Detection Vittorio Ferrari Loic Fevrier Frederic Jurie Cordelia Schmid.
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection CVPR2013 POSTER.
Biased Normalized Cuts 1 Subhransu Maji and Jithndra Malik University of California, Berkeley IEEE Conference on Computer Vision and Pattern Recognition.
Fast intersection kernel SVMs for Realtime Object Detection
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Unsupervised Learning of Categorical Segments in Image Collections *California Institute of Technology **Technion Marco Andreetto*, Lihi Zelnik-Manor**,
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Recognition Szeliski Chapter 14.
1 Integrating Vision Models for Holistic Scene Understanding Geremy Heitz CS223B March 4 th, 2009.
Learning object shape Gal Elidan Geremy Heitz Daphne Koller February 12 th, 2006 PAIL.
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Learning Spatial Context: Using stuff to find things Geremy Heitz Daphne Koller Stanford University October 13, 2008 ECCV 2008.
Region-based Voting Exemplar 1 Query 1 Exemplar 2.
Scene Understanding through Transfer Learning Stephen Gould Ben Packer Geremy Heitz Daphne Koller DARPA Update September 11, 2008.
TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Oxford Brookes Seminar Thursday 3 rd September, 2009 University College London1 Representing Object-level Knowledge for Segmentation and Image Parsing:
Learning 3D mesh segmentation and labeling Evangelos Kalogerakis, Aaron Hertzmann, Karan Singh University of Toronto Head Tors o Upper arm Lower arm Hand.
1 Segmentation with Scene and Sub-Scene Categories Joseph Djugash Input Image Scene/Sub-Scene Classification Segmentation.
Transfer Learning of Object Classes: From Cartoons to Photographs NIPS Workshop Inductive Transfer: 10 Years Later Geremy Heitz Gal Elidan Daphne Koller.
CONTEXT. Learned Satellite Clusters Results - Satellite Prior: Detector Only Posterior: TAS Model Region Labels.
Learning Spatial Context: Can stuff help us find things? Geremy Heitz Daphne Koller April 14, 2008 DAGS Stuff (n): Material defined by a homogeneous or.
What, Where & How Many? Combining Object Detectors and CRFs
Accuracy # Training Instances (per class) Giraffe NB CENT LOOPS GROUND.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Autonomous Learning of Object Models on Mobile Robots Xiang Li Ph.D. student supervised by Dr. Mohan Sridharan Stochastic Estimation and Autonomous Robotics.
Object Detection Sliding Window Based Approach Context Helps
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Leo Zhu CSAIL MIT Joint work with Chen, Yuille, Freeman and Torralba 1.
A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The.
Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.
Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Putting Context into Vision Derek Hoiem September 15, 2004.
Geodesic Saliency Using Background Priors
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
Associative Hierarchical CRFs for Object Class Image Segmentation
Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions Bangpeng Yao and Li Fei-Fei Computer Science Department, Stanford.
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework N 工科所 錢雅馨 2011/01/16 Li-Jia Li, Richard.
Extracting Simple Verb Frames from Images Toward Holistic Scene Understanding Prof. Daphne Koller Research Group Stanford University Geremy Heitz DARPA.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton ; University of Cambridge J. Jinn,
Recent developments in object detection
Krishna Kumar Singh, Yong Jae Lee University of California, Davis
Nonparametric Semantic Segmentation
ICCV Hierarchical Part Matching for Fine-Grained Image Classification
Object detection as supervised classification
Learning to Combine Bottom-Up and Top-Down Segmentation
Brief Review of Recognition + Context
Cascaded Classification Models
Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu
“Traditional” image segmentation
Presentation transcript:

1 Outline Overview Integrating Vision Models CCM: Cascaded Classification Models Learning Spatial Context TAS: Things and Stuff Descriptive Querying of Images LOOPS: Localizing Object Outlines using Probabilistic Shape Future Directions [Heitz et al. NIPS 2008b]

2 Image Queries on Objects Categorization Localization Descriptive What color is his tail? Where is his head? Is he… standing? sitting? bent over? What is he doing?

3 Related Work Refined Localization Coarse Fine Boosted Detectors “Parts” Models OUR WORK Localization Models [ Torralba et al., PAMI 2007 ] [ Opelt, ECCV 2006 ] [ Fei-Fei and Perona, CVPR 2005 ] [ Fergus et al., CVPR 2003 ] [ Leibe et al, ECCV 2004 ] [ Bar-Hillel et al, CVPR 2005 ] [ Winn & Shotton, CVPR 2006 ] [ Kumar et al., CVPR 2005 ] [ Cootes et al., CVIU 1995 ] [ Borenstein et al., CVPR 2004 ]

4 Shape Representation: Landmarks Set of “keypoint” landmarks Shape defined by connecting piecewise  linear contour Internal landmarks are allowed (but not shown here)

5 Training Data Images + Outlines

6 State-of-the-art Alternatives kAS Detector: Edge-based object detector Pro: No outline required Great at detection Con: No single outline OBJ CUT: Object-based segmentation Pro: Produces outlines Con: Appearance model based on internal texture [ Kumar et al., CVPR 2005 ] [ Prasad et al., CVPR 2006 ] [ Ferrari et al., CVPR 2007 ]

7 LOOPS Pipeline Localized Test Outlines LOOPS Model -1 std +1 std produce corresponded training data register model to images shape based classification Consistent Outlines learn shape & landmark detectors Descriptive Classification DownUp UP DOWN Localizing Object Outlines using Probabilistic Shape Images + Outlines

8 Corresponded Outlines produce corresponded training data Consistent Outlines Localizing Object Outlines using Probabilistic Shape Images + Outlines [ Hill & Taylor, BMVC 1996 ] Based on existing work in medical imaging Intuition: Arc length and curvature should remain consistent

9 Learning Shape & Detectors LOOPS Model -1 std +1 std Consistent Outlines learn shape & landmark detectors Localizing Object Outlines using Probabilistic Shape

10 Multivariate Gaussian over landmark locations Shape Model Neck Legs

11 Build on state-of-the-art discriminative methods for detecting “parts” or “objects” Build a detector for each landmark Landmark Detectors

12 Registration Localized Test Outlines LOOPS Model -1 std +1 std register model to images Localizing Object Outlines using Probabilistic Shape

13 “Registering” the Model to an Image Task: Assign each landmark l  L to a pixel p l  P ? ? L – Assignment of Landmarks to Pixels L* = argmax Score(L | I) = argmax ShapeScore(L) + ImageScore(L | I)

14 The LOOPS MRF shape score landmark detectors pairwise image score Registering = MAP Inference over L

15 Outlining Image Detectors Only Full LOOPS

16 Results GiraffeLlama Rhino

kAS Detector OBJ CUT LOOPS

kAS Detector OBJ CUT LOOPS

19 LOOPS Pipeline Localized Test Outlines LOOPS Model -1 std +1 std produce corresponded training data register model to images shape based classification Consistent Outlines learn shape & landmark detectors Descriptive Classification DownUp UP DOWN Localizing Object Outlines using Probabilistic Shape Images + Outlines

20 Descriptive Classification Localized Test Outlines shape based classification Descriptive Classification DownUp UP DOWN Localizing Object Outlines using Probabilistic Shape

21 RANDOM Descriptive Queries Goal: Classify based on shape characteristics Is the giraffe Or # Training Instances (per class) “True” shape Boosting Close this gap Accuracy

22 Mammals