The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features Kristen Grauman Trevor Darrell MIT.

Slides:



Advertisements
Similar presentations
Distinctive Image Features from Scale-Invariant Keypoints
Advertisements

Distinctive Image Features from Scale-Invariant Keypoints David Lowe.
Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Zhimin CaoThe Chinese University of Hong Kong Qi YinITCS, Tsinghua University Xiaoou TangShenzhen Institutes of Advanced Technology Chinese Academy of.
Multiclass SVM and Applications in Object Classification
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Three things everyone should know to improve object retrieval
Object Recognition with Features Inspired by Visual Cortex T. Serre, L. Wolf, T. Poggio Presented by Andrew C. Gallagher Jan. 25, 2007.
RGB-D object recognition and localization with clutter and occlusions Federico Tombari, Samuele Salti, Luigi Di Stefano Computer Vision Lab – University.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
MIT CSAIL Vision interfaces Towards efficient matching with random hashing methods… Kristen Grauman Gregory Shakhnarovich Trevor Darrell.
Efficiently searching for similar images (Kristen Grauman)
Part 1: Bag-of-words models by Li Fei-Fei (Princeton)
Hebrew University1 Threading Local Kernel Functions: Local versus Holistic Representations Amnon Shashua Tamir Hazan School of Computer Science & Eng.
TP14 - Local features: detection and description Computer Vision, FCUP, 2014 Miguel Coimbra Slides by Prof. Kristen Grauman.
MIT CSAIL Vision interfaces Approximate Correspondences in High Dimensions Kristen Grauman* Trevor Darrell MIT CSAIL (*) UT Austin…
Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem
Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.
Semi-Supervised Hierarchical Models for 3D Human Pose Reconstruction Atul Kanaujia, CBIM, Rutgers Cristian Sminchisescu, TTI-C Dimitris Metaxas,CBIM, Rutgers.
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
Fast intersection kernel SVMs for Realtime Object Detection
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
Agenda Introduction Bag-of-words model Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
A Study of Approaches for Object Recognition
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2005 with a lot of slides stolen from Steve Seitz and.
CS294‐43: Visual Object and Activity Recognition Prof. Trevor Darrell Spring 2009 March 17 th, 2009.
Supervised Distance Metric Learning Presented at CMU’s Computer Vision Misc-Read Reading Group May 9, 2007 by Tomasz Malisiewicz.
1 Invariant Local Feature for Object Recognition Presented by Wyman 2/05/2006.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
Large Scale Recognition and Retrieval. What does the world look like? High level image statistics Object Recognition for large-scale search Focus on scaling.
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.
Step 3: Classification Learn a decision rule (classifier) assigning bag-of-features representations of images to different classes Decision boundary Zebra.
EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.
Recognition and Matching based on local invariant features Cordelia Schmid INRIA, Grenoble David Lowe Univ. of British Columbia.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Andrew Bender Alexander Cobian
SVM-KNN Discriminative Nearest Neighbor Classification for Visual Category Recognition Hao Zhang, Alex Berg, Michael Maire, Jitendra Malik.
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
21 June 2009Robust Feature Matching in 2.3μs1 Simon Taylor Edward Rosten Tom Drummond University of Cambridge.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Local invariant features Cordelia Schmid INRIA, Grenoble.
A feature-based kernel for object classification P. Moreels - J-Y Bouguet Intel.
CS 1699: Intro to Computer Vision Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh October 29, 2015.
Features, Feature descriptors, Matching Jana Kosecka George Mason University.
Methods for 3D Shape Matching and Retrieval
Local features: detection and description
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
3D Motion Data Mining Multimedia Project Multimedia and Network Lab, Department of Computer Science.
776 Computer Vision Jan-Michael Frahm Spring 2012.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
Local features: main components
TP12 - Local features: detection and description
Recognizing Deformable Shapes
Paper Presentation: Shape and Matching
Object detection as supervised classification
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Approximate Correspondences in High Dimensions
CS 1674: Intro to Computer Vision Scene Recognition
CAP 5415 Computer Vision Fall 2012 Dr. Mubarak Shah Lecture-5
Brief Review of Recognition + Context
Recognizing Deformable Shapes
Recognition and Matching based on local invariant features
Presentation transcript:

The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features Kristen Grauman Trevor Darrell MIT

Sets of features

examples under varying conditions local shape features invariant region descriptors

Problem How to build a discriminative classifier using the set representation? Kernel-based methods (e.g. SVM) are appealing for efficiency and generalization power… But what is an appropriate kernel? Each instance is unordered set of vectors Varying number of vectors per instance

Existing set kernels Fit (parametric) model to each set, compare with distance over models Kondor & Jebara, Moreno et al., Lafferty & Lebanon, Cuturi & Vert, Wolf & Shashua Restrictive assumptions Ignoring set statistics Compute pair-wise similarity between all vectors in each set Wallraven et al., Lyu, Boughhorbel et al. General family of algebraic functions combining local (vector) kernels Shashua & Hazan High complexity

Partial matching for sets of features Compare sets by computing a partial matching between their features. Robust to clutter, segmentation errors, occlusion…

Pyramid match optimal partial matching

Pyramid match overview Place multi-dimensional, multi-resolution grid over point sets Consider points matched at finest resolution where they fall into same grid cell Approximate similarity between matched points with worst case similarity at given level No explicit search for matches! Pyramid match kernel measures similarity of a partial matching between two sets:

Pyramid match kernel Number of newly matched pairs at level i Measure of difficulty of a match at level i Approximate partial match similarity

Feature extraction, Histogram pyramid: level i has bins of size 2 i

Counting matches Histogram intersection

Counting new matches Difference in histogram intersections across levels counts number of new pairs matched matches at this levelmatches at previous level Histogram intersection

Pyramid match kernel Weights inversely proportional to bin size Normalize kernel values to avoid favoring large sets measure of difficulty of a match at level i histogram pyramids number of newly matched pairs at level i

Efficiency For sets with m features of dimension d, and pyramids with L levels, computational complexity of Pyramid match kernel: Existing set kernel approaches: or

Example pyramid match Level 0

Example pyramid match Level 1

Example pyramid match Level 2

Example pyramid match pyramid match optimal match

100 sets with 2D points, cardinalities vary between 5 and 100 Trial number (sorted by optimal distance) [Indyk & Thaper] Matching output Approximation of the optimal partial matching

Building a classifier Train SVM by computing kernel values between all labeled training examples Classify novel examples by computing kernel values against support vectors One-versus-all for multi-class classification Convergence is guaranteed since pyramid match kernel is positive-definite.

Object recognition results ETH-80 database 8 object classes Features: –Harris detector –PCA-SIFT descriptor, d=10 KernelComplexityRecognition rate Match [Wallraven et al.] 84% Bhattacharyya affinity [Kondor & Jebara] 85% Pyramid match 84% Eichhorn and Chapelle 2004

Object recognition results Caltech objects database 101 object classes Features: –SIFT detector –PCA-SIFT descriptor, d=10 30 training images / class 43% recognition rate (1% chance performance) seconds per match

Localization Inspect intersections to obtain correspondences between features Higher confidence correspondences at finer resolution levels observationtarget

Pyramid match regression Pose estimation from contour features Train SVR with CG data Features: shape context histograms

Summary: Pyramid match kernel optimal partial matching between sets of features number of new matches at level idifficulty of a match at level i

Summary: Pyramid match kernel A new similarity measure based on implicit correspondences that approximates the optimal partial matching linear time complexity no independence assumption model-free insensitive to clutter positive-definite function fast, effective object recognition

Future work Geometric constraints Fast search of large databases with the pyramid match for image retrieval Use as a filter for a slower, explicit correspondence method Alternative feature types and classification domains