Global and Efficient Self-Similarity for Object Classification and Detection CVPR 2010 Thomas Deselaers and Vittorio Ferrari.

Slides:



Advertisements
Similar presentations
Aggregating local image descriptors into compact codes
Advertisements

Three things everyone should know to improve object retrieval
Presented by Xinyu Chang
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Classification using intersection kernel SVMs is efficient Joint work with Subhransu Maji and Alex Berg Jitendra Malik UC Berkeley.
Multi-layer Orthogonal Codebook for Image Classification Presented by Xia Li.
Image and video descriptors
Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Data-driven Visual Similarity for Cross-domain Image Matching
Multi-Local Feature Manifolds for Object Detection Oscar Danielsson Stefan Carlsson Josephine Sullivan
Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.
Boundary Preserving Dense Local Regions
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
Special Topic on Image Retrieval Local Feature Matching Verification.
Groups of Adjacent Contour Segments for Object Detection Vittorio Ferrari Loic Fevrier Frederic Jurie Cordelia Schmid.
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection CVPR2013 POSTER.
Fast intersection kernel SVMs for Realtime Object Detection
More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Small Codes and Large Image Databases for Recognition CVPR 2008 Antonio Torralba, MIT Rob Fergus, NYU Yair Weiss, Hebrew University.
Fast and Compact Retrieval Methods in Computer Vision Part II A. Torralba, R. Fergus and Y. Weiss. Small Codes and Large Image Databases for Recognition.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
Generic object detection with deformable part-based models
Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.
Exercise Session 10 – Image Categorization
Bag-of-Words based Image Classification Joost van de Weijer.
Computer vision.
Object Bank Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 4 th, 2013.
Object Detection Sliding Window Based Approach Context Helps
Nonparametric Part Transfer for Fine-grained Recognition Presenter Byungju Kim.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman Department of Computer Science, University of Texas at Austin.
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
Locality-constrained Linear Coding for Image Classification
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
FACE DETECTION : AMIT BHAMARE. WHAT IS FACE DETECTION ? Face detection is computer based technology which detect the face in digital image. Trivial task.
BEYOND SLIDING WINDOW: Object Localization by Efficient Subwindow Search Christoph H. Lampert, Matthew B. Blaschko, and Thomas Hofmann.
Presented by David Lee 3/20/2006
Object Recognition by Discriminative Combinations of Line Segments and Ellipses Alex Chia ^˚ Susanto Rahardja ^ Deepu Rajan ˚ Maylor Leung ˚ ^ Institute.
More sliding window detection: Discriminative part-based models
Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
A. M. R. R. Bandara & L. Ranathunga
Object detection with deformable part-based models
Presented by David Lee 3/20/2006
Data Driven Attributes for Action Detection
An Additive Latent Feature Model
Learning Mid-Level Features For Recognition
Paper Presentation: Shape and Matching
Mixtures of Gaussians and Advanced Feature Encoding
By Suren Manvelyan, Crocodile (nile crocodile?) By Suren Manvelyan,
Object detection as supervised classification
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
HOGgles Visualizing Object Detection Features
An HOG-LBP Human Detector with Partial Occlusion Handling
CVPR 2014 Orientational Pyramid Matching for Recognizing Indoor Scenes
Rob Fergus Computer Vision
A Graph-Matching Kernel for Object Categorization
Presentation transcript:

Global and Efficient Self-Similarity for Object Classification and Detection CVPR 2010 Thomas Deselaers and Vittorio Ferrari

Conventional Image Descriptors Measure direct image properties gradients colors 2

Self-Similarity vs Conventional Descriptors [Shechtman, Irani CVPR 07] Assumption of conventional image descriptors There is a direct visual property shared by images of objects of the same class (e.g. colors, gradients, …). This property can be used to compare images. Self-similarity: Indirect property: geometric layout of repeating patches within an image More general property 3

Local Self-Similarity Descriptors 4 [Shechtman, Irani CVPR 07]

Using Local Self-Similarity Descriptors Applications: object recognition, image retrieval, action recognition Ensemble matching [Shechtman CVPR 07] Nearest neighbor matching [Boiman CVPR 08] Bag of local self-similarities [Gehler ICCV09, Vedaldi ICCV09, Hörster ACMM08, Lampert CVPR09, Chatfield ICCV09 WS] 1.Compute LSS descriptors for an image 2.Assign the LSS descriptors to a codebook 3.Represent the image as a histogram of LSS descriptors 5

Self-Similarity goes Global Capture long-range self-similarities and their spatial arrangement 6

Self-Similarity goes Global Capture long-range self-similarities and their spatial arrangement 7

compute self-similarity between all pairs of pixels Global Self-Similarity Tensor 4D self-similarity tensor Note: local self-similarities included 8

Problems with the GSS Tensor Computation time: Memory requirement: Aim: Reduce both 9 11 ∼ 80GB ∼ 20h

Outline Efficient global self-similarity tensor Global self-similarity descriptors – Bag of correlation surfaces – Self-similarity hypercubes Detection with self-similarity hypercubes – Efficient sliding window – Efficient subwindow search Experiments – Global self-similarity better than local self-similarity – Complementary to conventional descriptors – Object detection possible 10

Efficient Global Self-Similarity Tensor Find an efficient approximation to Quantize patches according to codebook If two patches are assigned to the same prototype, they are similar Reduces runtime to speedup:

Efficient Global Self-Similarity Two patches are only similar if they are assigned to the same prototype Reduces memory to reduction: 12

Patch Prototype Codebooks Remember: Self-similarity encodes image content indirectly Image-specific codebooks can be smaller than conventional ones see paper for more generic codebooks and extensive evaluation 13

Self-similarity hypercubes: now Bag of correlation surfaces: only in the paper Global Self-Similarity Descriptors So far: Compact GSS computed efficiently Now: Descriptors that can be used in machine learning classifiers Fixed dimensionality Compact representation 14

Self-Similarity Hybercubes SSH of size 15

SSHs for Detection Computing SSH naïvely requires operations Sliding windows has to evaluate many windows 16 operations

Efficient Computation of SSHs Compute integral self-similarity tensor: operations to compute SSH for an image window 17  ∼ 5000x speedup  can be obtained using 16 lookups in

Efficient Subwindow Search for SSH 18 Derive an upper bound on the score of a set of windows Section 5.2 in our paper Similar to [Lampert PAMI09]

Experiments: Object classification PASCAL 07 objects – 9608 cropped images of objects from PASCAL 07 – 20 classes Task: Classify each test image into one of 20 classes Model: Linear SVM Train: train+val Test: test 19

Classification on the PASCAL 07 objects set + GSS outperform LSS + Self-Similarity is truly complementary to conventional descriptors 20 classification accuracy [%]

Experiments: Object detection ETHZ Shape Classes – 255 images – 5 classes (apple logos, bottles, giraffes, mugs, swans) Task: Detect objects in images Detector: Linear SVM, sliding windows 21 e.g. [Ferrari CVPR07, Maji CVPR09]

Detection Results + SSH outperforms BOLSS + it is possible to use GSS for detection with good results 22 BoLSSSSH apple logos bottles giraffes mugs swans Average DR at FPPI 0.4 apple logos bottles giraffes mug s swans } } SSH BoLSS FPPI 0.4 Comparison results (avg): [Ferrari CVPR07]: 71.9 [Maji CVPR09]: 93.2 … many more DR at 0.5 PASCAL overlap

Runtimes for Computing Descriptors 200x200 image GSS tensor – directly: 5512s ( ∼ 1.5 hours) – using our method: 81s ( ∼ 1.5 minutes) Computing descriptors: few seconds Our method: 70x speedup For Reference: – GIST: 0.4s – BOLSS: 0.7s 23

Runtimes for Detection Given the prototype assignment map (80s) (once only) SSH sliding window: 30s/img (once per class) For Comparison – Computing direct GSS tensor for windows: 4 years/img Speedup: ∼ 1 million ⇒ Using our methods, GSS can be used for object detection For Reference: – Felzenszwalb PAMI 09: 5s. 24 June 2014

Global and Efficient Self-Similarity for Object Classification and Detection CVPR 2010 Thomas Deselaers and Vittorio Ferrari Feasible

Conclusion self-similarity should be considered globally – Global self-similarity performs better than local self-similarity truly complementary to conventional descriptors global self-similarity is feasible – efficient computation of self-similarity – two descriptors based on self-similarity global self-similarity for detection code will be available soon 26

Thank you for your attention