On Detection of Multiple Object Instances using Hough Transforms Olga Barinova Moscow State University Victor Lempitsky University of Oxford Pushmeet Kohli.

Slides:



Advertisements
Similar presentations
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Advertisements

The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Learning to Combine Bottom-Up and Top-Down Segmentation Anat Levin and Yair Weiss School of CS&Eng, The Hebrew University of Jerusalem, Israel.
Classification using intersection kernel SVMs is efficient Joint work with Subhransu Maji and Alex Berg Jitendra Malik UC Berkeley.
Fitting: The Hough transform. Voting schemes Let each feature vote for all the models that are compatible with it Hopefully the noise features will not.
Image Segmentation Image segmentation (segmentace obrazu) –division or separation of the image into segments (connected regions) of similar properties.
Mixture of trees model: Face Detection, Pose Estimation and Landmark Localization Presenter: Zhang Li.
Proportion Priors for Image Sequence Segmentation Claudia Nieuwenhuis, etc. ICCV 2013 Oral.
Segmentation and Fitting Using Probabilistic Methods
LOCUS (Learning Object Classes with Unsupervised Segmentation) A variational approach to learning model- based segmentation. John Winn Microsoft Research.
Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.
Segmentation (2): edge detection
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Pedestrian Detection in Crowded Scenes Dhruv Batra ECE CMU.
Model: Parts and Structure. History of Idea Fischler & Elschlager 1973 Yuille ‘91 Brunelli & Poggio ‘93 Lades, v.d. Malsburg et al. ‘93 Cootes, Lanitis,
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Fitting: The Hough transform
Learning to Detect A Salient Object Reporter: 鄭綱 (3/2)
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Robust Higher Order Potentials For Enforcing Label Consistency
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
Lecture 5: Learning models using EM
P 3 & Beyond Solving Energies with Higher Order Cliques Pushmeet Kohli Pawan Kumar Philip H. S. Torr Oxford Brookes University CVPR 2007.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Recovering Articulated Object Models from 3D Range Data Dragomir Anguelov Daphne Koller Hoi-Cheung Pang Praveen Srinivasan Sebastian Thrun Computer Science.
Graph Cut based Inference with Co-occurrence Statistics Ľubor Ladický, Chris Russell, Pushmeet Kohli, Philip Torr.
Measuring Uncertainty in Graph Cut Solutions Pushmeet Kohli Philip H.S. Torr Department of Computing Oxford Brookes University.
Fitting: The Hough transform
Object Recognition by Discriminative Methods Sinisa Todorovic 1st Sino-USA Summer School in VLPR July, 2009.
What, Where & How Many? Combining Object Detectors and CRFs
CSCI 347 / CS 4206: Data Mining Module 04: Algorithms Topic 06: Regression.
A k-Nearest Neighbor Based Algorithm for Multi-Label Classification Min-Ling Zhang
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
Surface Stereo with Soft Segmentation Michael Bleyer 1, Carsten Rother 2, Pushmeet Kohli 2 1 Vienna University of Technology, Austria 2 Microsoft Research.
Fitting: The Hough transform. Voting schemes Let each feature vote for all the models that are compatible with it Hopefully the noise features will not.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Multi-Output Learning for Camera Relocalization Abner Guzmán-Rivera UIUC Pushmeet Kohli Ben Glocker Jamie Shotton Toby Sharp Andrew Fitzgibbon Shahram.
Object Detection 01 – Advance Hough Transformation JJCAO.
Data Mining Practical Machine Learning Tools and Techniques Chapter 4: Algorithms: The Basic Methods Section 4.6: Linear Models Rodney Nielsen Many of.
Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the.
Class-Specific Hough Forests for Object Detection Zhen Yuan Hsu Advisor:S.J.Wang Gall, J., Lempitsky, V.: Class-specic hough forests for object detection.
Fitting: The Hough transform
Learning With Bayesian Networks Markus Kalisch ETH Zürich.
Stable Multi-Target Tracking in Real-Time Surveillance Video
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Approximate Inference: Decomposition Methods with Applications to Computer Vision Kyomin Jung ( KAIST ) Joint work with Pushmeet Kohli (Microsoft Research)
Associative Hierarchical CRFs for Object Class Image Segmentation
Linear Models for Classification
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Paper Reading Dalong Du Nov.27, Papers Leon Gu and Takeo Kanade. A Generative Shape Regularization Model for Robust Face Alignment. ECCV08. Yan.
Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.
Object Detection 01 – Basic Hough Transformation JJCAO.
Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al. Presented by Brandon Smith Computer Vision.
Lecture 08 27/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Recognition Using Visual Phrases
CVPR 2012 POSTER Mobile Object Detection through Client-Server based Vote Transfer.
Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge.
Learning Sequence Motifs Using Expectation Maximization (EM) and Gibbs Sampling BMI/CS 776 Mark Craven
Diagram Understanding in Geometry Questions Min Joon Seo 1, Hanna Hajishirzi 1, Ali Farhadi 1, Oren Etzioni 2 21.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Coherent Scene Understanding with 3D Geometric Reasoning Jiyan Pan 12/3/2012.
Hough Transform CS 691 E Spring Outline Hough transform Homography Reading: FP Chapter 15.1 (text) Some slides from Lazebnik.
Unsupervised Learning Part 2. Topics How to determine the K in K-means? Hierarchical clustering Soft clustering with Gaussian mixture models Expectation-Maximization.
Nonparametric Semantic Segmentation
Learning Sequence Motif Models Using Expectation Maximization (EM)
Learning to Combine Bottom-Up and Top-Down Segmentation
Machine Learning: Lecture 6
Machine Learning: UNIT-3 CHAPTER-1
Presentation transcript:

On Detection of Multiple Object Instances using Hough Transforms Olga Barinova Moscow State University Victor Lempitsky University of Oxford Pushmeet Kohli Microsoft Research Cambridge

On Detection of Multiple Object Instances Using Hough Transforms Hough transforms oObject detection → peaks identification in Hough images oBeyond lines!!!  Ballard 1983 – Other primitives  Lowe, ICCV 1999 – Object detection  Leibe, Schiele BMVC 2003 – Object class detection  Last CVPR: Maji& Malik, Gall& Lempitsky, Gu et al. …

On Detection of Multiple Object Instances Using Hough Transforms Example from Gall & Lempitsky CVPR 2009 Category-level object detection

On Detection of Multiple Object Instances Using Hough Transforms Category-level object detection ?

On Detection of Multiple Object Instances Using Hough Transforms Multiple lines detection oIdentifying the peaks in Hough images is highly nontrivial in case of multiple close objects oPostprocessing (e.g. non-maximum suppression) is usually used oOur framework is similar to Hough Transforms but doesn’t require finding local maxima and suppresses non-maxima automatically

On Detection of Multiple Object Instances Using Hough Transforms Our framework Hough space Elements space Voting elements Hypotheses

On Detection of Multiple Object Instances Using Hough Transforms Our framework y – labelling of hypotheses, binary variables: 1 = object is present, 0 = otherwise Hough space Elements space x – labelling of voting elements, xi = index of hypothesis, if element votes for hypothesis, xi = 0, if element votes for background

On Detection of Multiple Object Instances Using Hough Transforms Our framework x – labelling of voting elements, xi = index of hypothesis, if element votes for hypothesis, xi = 0, if element votes for background x 2 =1 x 3 =1 x 4 =2 x 5 =2 x 6 =2 x 7 =0 x 8 =2 x 1 =1 y 1 =1 y 2 =1 y 3 =0 Key idea : joint MAP-inference in x and y Hough space Elements space y – labelling of hypotheses, binary variables: 1 = object is present, 0 = otherwise

On Detection of Multiple Object Instances Using Hough Transforms Probabilistic derivation Likelihood Term oAssume that given the existing objects y and the hypotheses assignments x, the distributions of the descriptors of voting elements are independent oLess crude than the independence assumption implicitly made by the traditional Hough Prior Term oOccam razor (or MDL) prior penalizes the number of the active hypotheses

On Detection of Multiple Object Instances Using Hough Transforms Probabilistic derivation hypotheses how likely is that voting element belongs to an object Corresponds to the votes in standard Hough transform: Training stays the same! “MDL” prior: λ, if y h = 1 0, otherwise -∞ if x i = h, and y h = 0 0, otherwise voting elements Problem known as facility location [Delong et al. CVPR 2010] (today’s poster) looks at facility location with wider set of priors

On Detection of Multiple Object Instances Using Hough Transforms Probabilistic derivation hypotheses voting elements oTried different methods for MAP- inference  belief propagation  simulated annealing oThey work well but don’t allow using large number of hypotheses  graph becomes huge and dense  sparsification heuristics required

On Detection of Multiple Object Instances Using Hough Transforms Probabilistic derivation voting elements hypotheses oIf labeling of y is given the values of x i are independent oAfter maximizing out x we get: oLarge-clique, submodular oGreedy algorithm is as good as anything else (in terms of the approximation factor) oGreedy inference ~ iterative Hough voting

On Detection of Multiple Object Instances Using Hough Transforms Greedy maximization for our energy Greedily add detections starting from the empty set For each iteration 1.do the voting: oSet h 0 = the overall maximum of HoughImage 3.If HoughImage(h 0 ) > λ, add h 0 to detection set, else terminate Sum over all voting elements Maximum over Hough votes for the hypotheses g that have already been switched on, including ‘background’ “standard” Hough vote for element i

On Detection of Multiple Object Instances Using Hough Transforms Inference Using the Hough forest trained in [Gall&Lempitsky CVPR09] Datasets from [Andriluka et al. CVPR 2008] (with strongly occluded pedestrians added)

On Detection of Multiple Object Instances Using Hough Transforms Results for pedestrians detection White = correct detection Green = missing object Red = false positive Our framework Hough transform + non-maximum suppression

On Detection of Multiple Object Instances Using Hough Transforms Results for pedestrians detection Blue = Hough transform + non-maximum suppression Light-blue = our framework Precision Recall Precision Recall TUD-crossing TUD-campus

On Detection of Multiple Object Instances Using Hough Transforms Results for lines detection Our framework Hough + NMS York Urban DB, Elder&Estrada ECCV 2008 oour framework is able to discern very close yet distinct lines, and is in general much less plagued by spurious detections

On Detection of Multiple Object Instances Using Hough Transforms Conclusion oFramework for detecting multiple objects, greedy inference ~ iterated Hough transform oNo need to find local maxima and suppress non-maxima – just take the only global maximum oProbabilistic model allows for extensions (ECCV paper coming: lines + vanishing points + horizon + zenith) oTraining stays the same as for the recent Hough-based framework oCode available at the project page: science/research/machinelearning/hough Thank you for your attention!