Svetlana Lazebnik, Cordelia Schmid, Jean Ponce

Presentation transcript:

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
Svetlana Lazebnik, Cordelia Schmid, Jean Ponce
Presented by: Lubomir Bourdev
Many of the slides by: Svetlana Lazebnik

Key Idea
Pyramid Match Kernel (Grauman & Darrell): pyramid in feature space, ignore location
Spatial Pyramid (this work): pyramid in image space, quantize features
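The "pyramid in image space" idea can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes each feature has already been quantized to a visual word index, and the function name, the (x, y, word) tuple layout, and the toy values are all hypothetical.

```python
def spatial_pyramid_histogram(features, width, height, vocab_size, num_levels):
    """Concatenate per-cell visual-word histograms over a spatial pyramid.

    features: iterable of (x, y, word) with 0 <= x < width, 0 <= y < height,
              and word an integer index in range(vocab_size).
    Level l splits the image into 2**l x 2**l cells (an assumed layout).
    """
    pyramid = []
    for level in range(num_levels):
        cells = 2 ** level
        # One histogram per cell, stored in row-major order.
        hists = [[0] * vocab_size for _ in range(cells * cells)]
        for x, y, word in features:
            col = min(int(x * cells / width), cells - 1)
            row = min(int(y * cells / height), cells - 1)
            hists[row * cells + col][word] += 1
        for h in hists:
            pyramid.extend(h)
    return pyramid


# Toy example: 4x4 image, vocabulary of 2 words, 2 levels (1 + 4 cells).
feats = [(0.5, 0.5, 0), (3.5, 0.5, 1), (3.5, 3.5, 1)]
vec = spatial_pyramid_histogram(feats, width=4, height=4,
                                vocab_size=2, num_levels=2)
print(vec)  # [1, 2, 1, 0, 0, 1, 0, 0, 0, 1]
```

Level 0 is the ordinary bag-of-features histogram; the finer levels record where in the image each visual word occurred.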

Algorithm
1. Extract interest point descriptors (dense scan)
2. Construct visual word dictionary
3. Build spatial histograms
4. Create intersection kernels
5. Train an SVM
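The "intersection kernels" step can be sketched as below, assuming the standard histogram intersection (sum of element-wise minima). The function names and the toy histograms are hypothetical, and any per-level weighting of the pyramid is left out.

```python
def histogram_intersection(h1, h2):
    """Histogram intersection: sum of element-wise minima."""
    return sum(min(a, b) for a, b in zip(h1, h2))


def gram_matrix(histograms):
    """Pairwise intersection kernel values for a list of histograms."""
    n = len(histograms)
    return [[histogram_intersection(histograms[i], histograms[j])
             for j in range(n)]
            for i in range(n)]


# Toy histograms standing in for three images.
hists = [[1, 2, 0], [0, 2, 2], [3, 0, 1]]
K = gram_matrix(hists)
print(K)  # [[3, 2, 1], [2, 4, 1], [1, 1, 4]]
```

A matrix like K could then be handed to an SVM implementation that accepts a precomputed kernel (for example, scikit-learn's SVC with kernel="precomputed"); that pairing is a suggestion, not something the slides specify.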

Algorithm, step 1: Extract interest point descriptors (dense scan)
Descriptor options: Weak (edge orientations) OR Strong (SIFT)
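A dense scan can be pictured as placing patch centers on a regular grid. The sketch below only illustrates that idea: the function name, the patch size, and the step are hypothetical values, not ones given on the slides.

```python
def dense_grid(width, height, patch_size, step):
    """Centers of patches that fit fully inside the image, on a regular grid."""
    half = patch_size // 2
    return [(x, y)
            for y in range(half, height - half + 1, step)
            for x in range(half, width - half + 1, step)]


# Toy example: 32x32 image, 16x16 patches, one patch every 8 pixels.
centers = dense_grid(width=32, height=32, patch_size=16, step=8)
print(len(centers))  # 9
```

A descriptor (weak or strong) would then be computed for the patch around each center.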

Algorithm, step 2: Construct visual word dictionary
Vector quantization
Usually K-means clustering
Vocabulary size (16 to 400)
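Vector quantization with K-means can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions: the function names, the random initialization, the fixed iteration count, and the 2-D toy descriptors are all hypothetical (real descriptors such as SIFT have many more dimensions).

```python
import random


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_center(point, centers):
    """Index of the closest center, i.e. the visual word for this point."""
    return min(range(len(centers)),
               key=lambda k: squared_distance(point, centers[k]))


def kmeans(points, k, iterations=20, seed=0):
    """Plain Lloyd iterations; initial centers are k random input points."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest_center(p, centers)].append(p)
        for idx, members in enumerate(clusters):
            if members:  # keep the old center if a cluster went empty
                dim = len(members[0])
                centers[idx] = [sum(m[d] for m in members) / len(members)
                                for d in range(dim)]
    return centers


def quantize(descriptors, centers):
    """Map each descriptor to the index of its nearest visual word."""
    return [nearest_center(d, centers) for d in descriptors]


# Toy 2-D "descriptors" forming two obvious groups.
descriptors = [(0.0, 0.1), (0.2, 0.0), (5.0, 5.1), (5.2, 4.9)]
vocabulary = kmeans(descriptors, k=2)
words = quantize(descriptors, vocabulary)
print(words)
```

The number of centers k plays the role of the vocabulary size on the slide.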

My experiment: Butterfly Classification (example images: Peacock, Zebra)

Butterflies
Dataset from Lazebnik / Schmid / Ponce
70 train / 64 test
Images centered on the butterfly
Significant background clutter
Large pose/viewpoint variations
Scale variations: up to 4x

Butterfly Results

Spatial pyramid levels: 1 (no pyramid)
Features       Linear   Intersection   Dims
Weak (16)      82.6% (one value only)  16
Strong (200)   81.9%    89.5%          200

Spatial pyramid levels: 4
Features       Linear   Intersection   Dims
Weak (16)      88.6%    86.7%          1360
Strong (200)   84.8%    89.5%          17000
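The Dims figures are consistent with a pyramid whose level l has 2^l x 2^l cells, each holding one histogram of vocabulary size M, so a 4-level pyramid has 1 + 4 + 16 + 64 = 85 cells. The short check below assumes that layout; the function name is hypothetical.

```python
def pyramid_dims(vocab_size, num_levels):
    """Total histogram length: vocab_size bins in each of the 4**l cells per level."""
    return vocab_size * sum(4 ** level for level in range(num_levels))


print(pyramid_dims(16, 1), pyramid_dims(200, 1))  # 16 200
print(pyramid_dims(16, 4), pyramid_dims(200, 4))  # 1360 17000
```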