Pyramids of Features For Categorization Greg Griffin and Will Coulter (see Lazebnik et al., CVPR 2006, too)

Slides:



Advertisements
Similar presentations
Max-Margin Additive Classifiers for Detection
Advertisements

Distinctive Image Features from Scale-Invariant Keypoints
Improving the Fisher Kernel for Large-Scale Image Classification Florent Perronnin, Jorge Sanchez, and Thomas Mensink, ECCV 2010 VGG reading group, January.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Three things everyone should know to improve object retrieval
Multi-layer Orthogonal Codebook for Image Classification Presented by Xia Li.
MIT CSAIL Vision interfaces Approximate Correspondences in High Dimensions Kristen Grauman* Trevor Darrell MIT CSAIL (*) UT Austin…
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features Kristen Grauman Trevor Darrell MIT.
CS395: Visual Recognition Spatial Pyramid Matching Heath Vinicombe The University of Texas at Austin 21 st September 2012.
Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
Ziming Zhang *, Ze-Nian Li, Mark Drew School of Computing Science, Simon Fraser University, Vancouver, B.C., Canada {zza27, li, Learning.
CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Discriminative and generative methods for bags of features
Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Robust and large-scale alignment Image from
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
First Law Of Geography:
Lecture 28: Bag-of-words models
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Bag-of-features models
Local Features and Kernels for Classification of Object Categories J. Zhang --- QMUL UK (INRIA till July 2005) with M. Marszalek and C. Schmid --- INRIA.
Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Real-time Action Recognition by Spatiotemporal Semantic and Structural Forest Tsz-Ho Yu, Tae-Kyun Kim and Roberto Cipolla Machine Intelligence Laboratory,
Step 3: Classification Learn a decision rule (classifier) assigning bag-of-features representations of images to different classes Decision boundary Zebra.
Final Exam Review CS485/685 Computer Vision Prof. Bebis.
Local invariant features Cordelia Schmid INRIA, Grenoble.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
1 Action Classification: An Integration of Randomization and Discrimination in A Dense Feature Representation Computer Science Department, Stanford University.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Handwritten digit recognition Jitendra Malik. Handwritten digit recognition (MNIST,USPS) LeCun’s Convolutional Neural Networks variations (0.8%, 0.6%
A Statistical Approach to Speed Up Ranking/Re-Ranking Hong-Ming Chen Advisor: Professor Shih-Fu Chang.
Svetlana Lazebnik, Cordelia Schmid, Jean Ponce
SVM-KNN Discriminative Nearest Neighbor Classification for Visual Category Recognition Hao Zhang, Alex Berg, Michael Maire, Jitendra Malik.
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Representations for object class recognition David Lowe Department of Computer Science University of British Columbia Vancouver, Canada Sept. 21, 2006.
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Methods for classification and image representation
Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs CVPR 2005 Another Descriptor.
A feature-based kernel for object classification P. Moreels - J-Y Bouguet Intel.
Boris Babenko, Steve Branson, Serge Belongie University of California, San Diego ICCV 2009, Kyoto, Japan.
CS 1699: Intro to Computer Vision Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh October 29, 2015.
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
Locally Linear Support Vector Machines Ľubor Ladický Philip H.S. Torr.
Presented by David Lee 3/20/2006
Goggle Gist on the Google Phone A Content-based image retrieval system for the Google phone Manu Viswanathan Chin-Kai Chang Ji Hyun Moon.
776 Computer Vision Jan-Michael Frahm Spring 2012.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
Recognizing specific objects Matching with SIFT Original suggestion Lowe, 1999,2004.
Does one size really fit all? Evaluating classifiers in a Bag-of-Visual-Words classification Christian Hentschel, Harald Sack Hasso Plattner Institute.
Another Example: Circle Detection
Lecture IX: Object Recognition (2)
Learning Mid-Level Features For Recognition
Paper Presentation: Shape and Matching
Digit Recognition using SVMS
By Suren Manvelyan, Crocodile (nile crocodile?) By Suren Manvelyan,
CS 1674: Intro to Computer Vision Scene Recognition
A Graph-Matching Kernel for Object Categorization
Presented by Xu Miao April 20, 2005
Boris Babenko, Steve Branson, Serge Belongie
Presentation transcript:

Pyramids of Features For Categorization Greg Griffin and Will Coulter (see Lazebnik et al., CVPR 2006, too)

Intuition: Approximates optimal partial matching U Adapted from

Intuition [cont’d]: Combine bags of features with spatial information

Example Pyramid Comparison

Disadvantages Same objects in different quadrants Objects sliced by bins

Possible Solutions Flipping / rotating image Sliding / shuffling histogram bins

Possible Solutions [cont’d] Split histogram in powers of three

Implementation Overview Image and Feature Extraction (SIFT on regular grid) ↓ Vocabulary Translation (200 words (k-means)) ↓ Histogram Generation (flips, slides, arbitrary mixing) ↓ Matching (full or partial pyramid) ↓ Decision (best match, voting, SVM)

Sanity Check 1 Graphical mini-confusion matrix (and rot. invariance)

Sanity Check 2

Scene Database

81.1% vs 67.4% (100)

Caltech % vs. 33.1% (30, 16)

Caltech 101 minaret windsor chair joshua tree okapi cougar body beaver crocodile ant 97.6%, 86.7% 94.6%, 80.0% 87.9%, 40.0% 87.8%, 33.3% 27.6%, 6.7% 27.5%, 13.3% 25.0%, 13.3% 25.0%, 0.0% (30, 16)

Caltech 101

Work in Progress 256 Performance –64 times more work scene database –6.4 times more work than 101 SVM –one-vs-all weighting issues –speed it up? –improve performance Improvements –Flip, Slide, Arbitrary Powers-of-3 histogram bins

Open Questions Performance of arbitrary match bins –Try random sampling? –Allow multiple best matches? Chess/pattern example Grid example Optimal kernel level weights

Implementation Details [block diagram] Images (288^2,b&w,squished) and feature extraction (-weak,-pca,+sift) Vocab generation (200 words, 20,000- small) Histogram(fliplr,flipud,slide,arbitrary,bag of features) match (full&partial pyramids) Decision (best match,voting,SVM)