Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.

Slides:



Advertisements
Similar presentations
Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Advertisements

Presented by Xinyu Chang
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
TP14 - Local features: detection and description Computer Vision, FCUP, 2014 Miguel Coimbra Slides by Prof. Kristen Grauman.
MIT CSAIL Vision interfaces Approximate Correspondences in High Dimensions Kristen Grauman* Trevor Darrell MIT CSAIL (*) UT Austin…
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features Kristen Grauman Trevor Darrell MIT.
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
Object Recognition using Invariant Local Features Applications l Mobile robots, driver assistance l Cell phone location or object recognition l Panoramas,
CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
Lecture 28: Bag-of-words models
Agenda Introduction Bag-of-words model Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Dominic Rizzo and Giota Stratou.
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman ICCV 2003 Presented by: Indriyati Atmosukarto.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
CVR05 University of California Berkeley 1 Familiar Configuration Enables Figure/Ground Assignment in Natural Scenes Xiaofeng Ren, Charless Fowlkes, Jitendra.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2005 with a lot of slides stolen from Steve Seitz and.
CS294‐43: Visual Object and Activity Recognition Prof. Trevor Darrell Spring 2009 March 17 th, 2009.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Automatic Matching of Multi-View Images
1 Invariant Local Feature for Object Recognition Presented by Wyman 2/05/2006.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
Spatial Pyramid Pooling in Deep Convolutional
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
1 Interest Operators Find “interesting” pieces of the image Multiple possible uses –image matching stereo pairs tracking in videos creating panoramas –object.
Large Scale Recognition and Retrieval. What does the world look like? High level image statistics Object Recognition for large-scale search Focus on scaling.
Machine learning & category recognition Cordelia Schmid Jakob Verbeek.
Step 3: Classification Learn a decision rule (classifier) assigning bag-of-features representations of images to different classes Decision boundary Zebra.
Overcoming Dataset Bias: An Unsupervised Domain Adaptation Approach Boqing Gong University of Southern California Joint work with Fei Sha and Kristen Grauman.
Machine learning & category recognition Cordelia Schmid Jakob Verbeek.
Recognition and Matching based on local invariant features Cordelia Schmid INRIA, Grenoble David Lowe Univ. of British Columbia.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The.
Svetlana Lazebnik, Cordelia Schmid, Jean Ponce
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Geodesic Flow Kernel for Unsupervised Domain Adaptation Boqing Gong University of Southern California Joint work with Yuan Shi, Fei Sha, and Kristen Grauman.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
A feature-based kernel for object classification P. Moreels - J-Y Bouguet Intel.
Object-Graphs for Context-Aware Category Discovery Yong Jae Lee and Kristen Grauman University of Texas at Austin 1.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
Goggle Gist on the Google Phone A Content-based image retrieval system for the Google phone Manu Viswanathan Chin-Kai Chang Ji Hyun Moon.
3D Motion Data Mining Multimedia Project Multimedia and Network Lab, Department of Computer Science.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
Document Clustering with Prior Knowledge Xiang Ji et al. Document Clustering with Prior Knowledge. SIGIR 2006 Presenter: Suhan Yu.
Local features: main components
TP12 - Local features: detection and description
Video Google: Text Retrieval Approach to Object Matching in Videos
Paper Presentation: Shape and Matching
By Suren Manvelyan, Crocodile (nile crocodile?) By Suren Manvelyan,
Finding Clusters within a Class to Improve Classification Accuracy
Approximate Correspondences in High Dimensions
Object-Graphs for Context-Aware Category Discovery
CS 1674: Intro to Computer Vision Scene Recognition
Shape matching and object recognition using shape contexts
Brief Review of Recognition + Context
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
Video Google: Text Retrieval Approach to Object Matching in Videos
Recognition and Matching based on local invariant features
Presentation transcript:

Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas MSc (Engg), SERC, IISc

Goal Automatically recover categories from an unlabeled collection of images, and form predictive classifiers to label new images Tolerate clutter, occlusion, common transformations Allow optional, variable amount of supervision Efficiency

Motivation Majority of the object recognition and categorization requires ‘Manual Annotations’. ‘Manual Annotations’ : practical limit on number of classes and number of training examples per class. Introduces Biasing : effects the performance. Unsupervised recognition and categorization would be cost effective and reduce burden.

Motivation Majority of the object recognition and categorization requires ‘Manual Annotations’. ‘Manual Annotations’ : – practical limit on number of classes and number of training examples per class. Introduces Biasing : – effects the performance. Unsupervised recognition and categorization would be cost effective and reduce burden.

Related work Modeling scenes with local descriptors ICCV 2005, P. Quelhas, F. Monay, etc. Discovering Object Categories in Image Collections ICCV 2005, J. Sivic, B. Russell, A. Efros, etc. Learning Object Categories from Google’s Image Search ICCV 2005, R. Fergus, L. Fei-Fei, etc.

Related work (contd.) Local features – recognition and retrieval Unordered feature vectors and Vary in size Previous Approaches : – vector quantization – Build codebook of feature descriptors – Apply conventional clustering methods or latent semantic analysis (LSA) Issues : – Clutters – Occlusions – Addition of small amount of labeled data.

Approach Pair wise affinities reflecting partial-match feature correspondences are computed for all input images. (Optional) A variable amount of supervised labels (pairing constraints) are optionally collected from the user and affinity matrix are adjusted accordingly. Spectral clustering is used to recover the initial dominant clusters. Prototypical examples : by evaluating the typical “feature masks” contributing to each within-cluster partial matching Top-ranked prototypical examples from refined groupings compose the learned categories, which are used to train a set of predictive classifiers for labelling unseen examples.

Pyramid match kernel Number of newly matched pairs at level i Measure of difficulty of a match at level i Approximate partial match similarity

Sets of features

Feature extraction, Histogram pyramid: level i has bins of size 2 i

Pyramid match kernel Weights inversely proportional to bin size Normalize kernel values to avoid favoring large sets measure of difficulty of a match at level i histogram pyramids number of newly matched pairs at level i

Pyramid match kernel

Algorithm 1) Initial Grouping Feature Sets with partial correspondences. 2) Inferring Category Feature Masks. 3) Identifying Prototypes.

Initial Grouping Data Set U = {I 1, I 2, …, I N } – where I i are images. I i is denoted by feature set – X i = {f 1, …, f mi } where m i may vary across U – f i : description vectors. Authors uses SHIFT for local feature.

Initial Grouping (contd.) Derived Partial Correspondence using Pyramid Match Kernel.

Initial Grouping (contd.) Normalized cuts criterion to create preliminarily classes. Optionally Introduce of weak semi supervision in the form of pairwise constraints between the unlabeled images Adjust the weight of affinity matrix to highest value for “must group” and zero to “cannot – group”.

Initial Grouping (contd.) Background feature matches Multiple object matches

Inferring Category Feature Masks Confounding normalized cuts. Goal : Identify prototypical example. Intuition : Inlier uses similar partial portion of feature sets to form partial matches. Use Modified pyramid Kernel.

Identifying Prototypes Use the feature mask to re weight the individual feature. Sort the images based on how consistent they match with the remainder of sets. Examples within top threshold percentile are “prototype candidates” Optionally allow user constraints and adjust affinity matrix in case of disagreement

Inferring feature masks feature index contribution to match

Inferring feature masks weighted feature mask

Refining intra-cluster matches weighted feature mask

Refining intra-cluster matches weighted feature mask

Refining intra-cluster matches weighted feature mask

Refining intra-cluster matches weighted feature mask

Refining intra-cluster matches weighted feature mask

Selecting category prototypes

Unsupervised recovery of category prototypes Top percentile of prototypes Prototype accuracy / category Caltech-4 data set

Semi-supervised category labeling Caltech-4 data set Recognition rate / class Amount of supervisory information (number of “must-group” pairs)

Future Work Number of groupings are needed to specified during normalized cuts. (limitations) Finding of most critical local features and interest points. (performance) Using the algorithm for image retrival.

References Shape Matching and Object Recognition Using Shape Contexts S. Belongie, J. Malik, and J. Puzicha, IEEE Tranc. PAMI 2002 Shape Matching and Object Recognition using Low Distortion Correspondences A. Berg, T. Berg, and J. Malik, IEEE CVPR 2005 Distinctive Image Features from Scale Invariant Key points D Lowe, International Journal of Computer Vision 2004 Indexing Based on Scale Invariant Interest Points K. Mikolajczyk and C. Schmid, International Journal of Computer Vision 2004 The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features K. Grauman and T. Darrell, IEEE ICCV 2005 Normalized Cuts and Image Segmentation J. Shi and J. Malik, IEEE Tranc. PAMI 2000