Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.

Slides:



Advertisements
Similar presentations
Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)
Advertisements

Learning Shared Body Plans Ian Endres University of Illinois work with Derek Hoiem, Vivek Srikumar and Ming-Wei Chang.
Learning Semantics with Less Supervision
Diagnosing Error in Object Detectors Department of Computer Science University of Illinois at Urbana-Champaign (UIUC) Derek Hoiem Yodsawalai Chodpathumwan.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Limin Wang, Yu Qiao, and Xiaoou Tang
Ľubor Ladický1 Phil Torr2 Andrew Zisserman1
Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.
Face Alignment with Part-Based Modeling
Lecture 31: Modern object recognition
Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.
Multi-Local Feature Manifolds for Object Detection Oscar Danielsson Stefan Carlsson Josephine Sullivan
Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.
Knowing a Good HOG Filter When You See It: Efficient Selection of Filters for Detection Ejaz Ahmed 1, Gregory Shakhnarovich 2, and Subhransu Maji 3 1 University.
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
Large-Scale Object Recognition with Weak Supervision
DISCRIMINATIVE DECORELATION FOR CLUSTERING AND CLASSIFICATION ECCV 12 Bharath Hariharan, Jitandra Malik, and Deva Ramanan.
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Recognition: A machine learning approach
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Quantifying and Transferring Contextual Information in Object Detection Professor: S. J. Wang Student : Y. S. Wang 1.
1. Introduction Humanising GrabCut: Learning to segment humans using the Kinect Varun Gulshan, Victor Lempitksy and Andrew Zisserman Dept. of Engineering.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
Learning Spatial Context: Using stuff to find things Geremy Heitz Daphne Koller Stanford University October 13, 2008 ECCV 2008.
Object Recognizing We will discuss: Features Classifiers Example ‘winning’ system.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Spatial Pyramid Pooling in Deep Convolutional
Lecture 29: Recent work in recognition CS4670: Computer Vision Noah Snavely.
Generic object detection with deformable part-based models
A Scale and Rotation Invariant Approach to Tracking Human Body Part Regions in Videos Yihang BoHao Jiang Institute of Automation, CAS Boston College.
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Detecting Curved Symmetric Parts using a Deformable Disc Model Tom Sie Ho Lee, University of Toronto Sanja Fidler, TTI Chicago Sven Dickinson, University.
Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman Department of Computer Science, University of Texas at Austin.
Object Detection with Discriminatively Trained Part Based Models
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
A Codebook-Free and Annotation-free Approach for Fine-Grained Image Categorization Authors Bangpeng Yao et al. Presenter Hyung-seok Lee ( 이형석 ) CVPR 2012.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.
Pictorial Structures and Distance Transforms Computer Vision CS 543 / ECE 549 University of Illinois Ian Endres 03/31/11.
Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.
Recognition Using Visual Phrases
CSE 185 Introduction to Computer Vision Feature Matching.
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
Object Recognizing. Object Classes Individual Recognition.
Object Recognition by Discriminative Combinations of Line Segments and Ellipses Alex Chia ^˚ Susanto Rahardja ^ Deepu Rajan ˚ Maylor Leung ˚ ^ Institute.
Object Recognizing. Object Classes Individual Recognition.
Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,
Scene Parsing with Object Instances and Occlusion Ordering JOSEPH TIGHE, MARC NIETHAMMER, SVETLANA LAZEBNIK 2014 IEEE CONFERENCE ON COMPUTER VISION AND.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Strong Supervision from Weak Annotation: Interactive Training of Deformable Part Models S. Branson, P. Perona, S. Belongie.
Recent developments in object detection
Object Detection based on Segment Masks
Object detection with deformable part-based models
Data Driven Attributes for Action Detection
Krishna Kumar Singh, Yong Jae Lee University of California, Davis
Huazhong University of Science and Technology
Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.
Object detection as supervised classification
HOGgles Visualizing Object Detection Features
Figure 4. Testing minimal configurations with existing models for spatiotemporal recognition. (A-B) A binary classifier is trained to separate a positive.
“The Truth About Cats And Dogs”
Brief Review of Recognition + Context
Object Detection + Deep Learning
CornerNet: Detecting Objects as Paired Keypoints
RCNN, Fast-RCNN, Faster-RCNN
Presentation transcript:

Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign

Flexible Part Based Model Why? – Interested in learning a large number of object categories – Avoid learning new category from scratch when useful information can be borrowed from other categories

Flexible Part Based Model: Objectives Simple to train – Minimal manual initialization effort – Train each part independently – Simple spatial model 3 How Can We Adapt Existing Part Models to New Categories?

Boosted Collections of Parts 4 Simple to train – Minimal manual initialization effort – Train each part independently – Simple spatial model ECCV 2010 How Can We Adapt Existing Part Models to New Categories?

Part Refinement Retrain with new examples Train Part Detector Collect Consistent Positives Initialize with Single Exemplar 5

Compute expected part position from exemplar: Transfer to other examples: Encouraging Spatial Consistency 6 How Can We Adapt Existing Part Models to New Categories?

Only allow candidates with sufficient overlap with expected position Encouraging Consistency 7 GOOD BAD How Can We Adapt Existing Part Models to New Categories?

Learned Part Models 8 How Can We Adapt Existing Part Models to New Categories?

Learned Part Models 9 How Can We Adapt Existing Part Models to New Categories?

Part Evaluation: Discrimination 1.How discriminative are our parts? (mean AP) PlaneBikeBoatCatDogSofa Exemplar Refined: All-in Selective: Appearance Selective: App.+Spatial

Part Evaluation: Spatial Consistency 1.How discriminative are our parts? (mean AP) 2.How well can we localize Poselet Keypoint annotations? (mean best AP per keypoint type) PlaneBikeBoatCatDogSofa Exemplar Refined: All-in Selective: Appearance Selective: App.+Spatial

Pooling Part Detections Propose 500 candidate object regions per image (Endres and Hoiem 2010)

Pooling Part Detections Collect highest scoring response for each part:

Pooling Part Detections Collect highest scoring response for each part:

Scoring Object Candidates Classify vector of scores using boosted classifier

Post-Processing Without an explicit spatial model, overlapping regions tend to receive similar category scores Re-predict bounding box by having each part detector cast a weighted vote for a refined bounding box

Relocalization Loose spatial model: Good parts can be assigned to bad regions

Relocalization Loose spatial model: Good parts can be assigned to bad regions Solution 1: Region shape features to down-weight bad regions HOG of segmentation mask

Relocalization Loose spatial model: Good parts can be assigned to bad regions Solution 2: Use parts to repredict bounding box Each part votes for box Weighted average based on appearance score and learned reliability

Results: Beating state of the art Our Model Felzenszwalb et al. Aeroplane > 48.4 AP Cat > 36.9 AP Dog 8.5 -> 20.9 AP

Results: Competitive with state of the art Bicycle > 43.0 AP Boat 6.7 -> 5.0 AP Sofa > 14.1 AP Our Model Felzenszwalb et al.

Adapting Models: Finding analogous parts

Transfer

Adapting Parts: Refining Model Adapt each part’s appearance using object model transfer from e.g. Aytar and Zisserman,

Conclusion Goal: Recognition systems that can give as much detail about any object they encounter Consider supervised tasks that generalize across categories Capture shared similarities across categories and differences within categories 25