Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.

Slides:



Advertisements
Similar presentations
Three things everyone should know to improve object retrieval
Advertisements

Object recognition and scene “understanding”
Detecting Faces in Images: A Survey
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
Appearance-based recognition & detection II Kristen Grauman UT-Austin Tuesday, Nov 11.
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
Ivan Laptev IRISA/INRIA, Rennes, France September 07, 2006 Boosted Histograms for Improved Object Detection.
CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei Li,
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Detecting Pedestrians by Learning Shapelet Features
Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
The Viola/Jones Face Detector (2001)
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Fast and Compact Retrieval Methods in Computer Vision Part II A. Torralba, R. Fergus and Y. Weiss. Small Codes and Large Image Databases for Recognition.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Lecture 28: Bag-of-words models
Graz University of Technology, AUSTRIA Institute for Computer Graphics and Vision Fast Visual Object Identification and Categorization Michael Grabner,
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Object Recognition with Informative Features and Linear Classification Authors: Vidal-Naquet & Ullman Presenter: David Bradley.
Bag-of-features models
Learning and Vision: Discriminative Models
Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Slide credits for this chapter: Frank Dellaert, Forsyth & Ponce, Paul Viola, Christopher Rasmussen.
Visual Object Recognition Rob Fergus Courant Institute, New York University
Object Recognition by Discriminative Methods Sinisa Todorovic 1st Sino-USA Summer School in VLPR July, 2009.
Face Detection CSE 576. Face detection State-of-the-art face detection demo (Courtesy Boris Babenko)Boris Babenko.
Matthew Brown University of British Columbia (prev.) Microsoft Research [ Collaborators: † Simon Winder, *Gang Hua, † Rick Szeliski † =MS Research, *=MS.
Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.
Multiclass object recognition
Machine Learning Overview Tamara Berg Language and Vision.
CS 231A Section 1: Linear Algebra & Probability Review
Learning Based Hierarchical Vessel Segmentation
Classification Tamara Berg CSE 595 Words & Pictures.
EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.
Part 3: discriminative methods Antonio Torralba. Overview of section Object detection with classifiers Boosting –Gentle boosting –Weak detectors –Object.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Recognition using Boosting Modified from various sources including
Lecture 2 Overview on object recognition and one practical example Object Recognition and Scene Understanding
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The.
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Sign Classification Boosted Cascade of Classifiers using University of Southern California Thang Dinh Eunyoung Kim
Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.
Sharing features for multi-class object detection Antonio Torralba, Kevin Murphy and Bill Freeman MIT Computer Science and Artificial Intelligence Laboratory.
SVM-KNN Discriminative Nearest Neighbor Classification for Visual Category Recognition Hao Zhang, Alex Berg, Michael Maire, Jitendra Malik.
Visual Object Recognition
Classical Methods for Object Recognition Rob Fergus (NYU)
Object Recognition in Images Slides originally created by Bernd Heisele.
BAGGING ALGORITHM, ONLINE BOOSTING AND VISION Se – Hoon Park.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Lecture 6: Classification – Boosting and SVMs CAP 5415 Fall 2006.
Methods for classification and image representation
Machine Learning Overview Tamara Berg CS 560 Artificial Intelligence Many slides throughout the course adapted from Svetlana Lazebnik, Dan Klein, Stuart.
Lecture 09 03/01/2012 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
The Viola/Jones Face Detector A “paradigmatic” method for real-time object detection Training is slow, but detection is very fast Key ideas Integral images.
CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.
Object Detection Overview Viola-Jones Dalal-Triggs Deformable models Deep learning.
Visual Object Recognition
AdaBoost Algorithm and its Application on Object Detection Fayin Li.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Cascade for Fast Detection
Part 3: discriminative methods
Lit part of blue dress and shadowed part of white dress are the same color
Recognition using Nearest Neighbor (or kNN)
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT
Object detection as supervised classification
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei.
Presentation transcript:

Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based image retrieval Datasets & Conclusions

Classifier based methods Object detection and recognition is formulated as a classification problem. Bag of image patches … and a decision is taken at each window about if it contains a target object or not. Decision boundary Computer screen Background In some feature space Where are the screens? The image is partitioned into a set of overlapping windows

Discriminative methods 10 6 examples Nearest neighborNeural networks Support Vector Machines and Kernels Conditional Random Fields

Nearest Neighbors 10 6 examples Shakhnarovich, Viola, Darrell 2003 Difficult due to high intrinsic dimensionality of images - lots of data needed - slow neighbor lookup Torralba, Fergus, Freeman 2008

Multi-layer Hubel-Wiesel architectures Neural networks LeCun, Bottou, Bengio, Haffner 1998 Rowley, Baluja, Kanade 1998 Hinton & Salakhutdinov 2006 Ranzato, Huang, Boureau, LeCun 2007 Riesenhuber & Poggio 1999 Serre, Wolf, Poggio Mutch & Lowe 2006 Biologically inspired

Support Vector Machines Heisele, Serre, Poggio, 2001 Face detection Pyramid Match Kernel Combining Multiple Kernels Varma & Roy 2007 Bosch, Munoz, Zisserman 2007 Grauman & Darrell 2005 Lazebnik, Schmid, Ponce 2006

Conditional Random Fields Kumar & Hebert 2003 Quattoni, Collins, Darrell 2004 More in Segmentation section

A simple algorithm for learning robust classifiers –Freund & Shapire, 1995 –Friedman, Hastie, Tibshhirani, 1998 Provides efficient algorithm for sparse visual feature selection –Tieu & Viola, 2000 –Viola & Jones, 2003 Easy to implement, not requires external optimization tools. Boosting

A simple object detector with Boosting Download Toolbox for manipulating dataset Code and dataset Matlab code Gentle boosting Object detector using a part based model Dataset with cars and computer monitors

Boosting Boosting fits the additive model by minimizing the exponential loss Training samples The exponential loss is a differentiable upper bound to the misclassification error.

Weak classifiers The input is a set of weighted training samples (x,y,w) Regression stumps: simple but commonly used in object detection. Four parameters: b=E w (y [x>  ]) a=E w (y [x<  ]) x f m (x) 

From images to features: A myriad of weak detectors We will now define a family of visual features that can be used as weak classifiers (“weak detectors”) Takes image as input and the output is binary response. The output is a weak detector.

A myriad of weak detectors Yuille, Snow, Nitzbert, 1998 Amit, Geman 1998 Papageorgiou, Poggio, 2000 Heisele, Serre, Poggio, 2001 Agarwal, Awan, Roth, 2004 Schneiderman, Kanade 2004 Carmichael, Hebert 2004 …

Weak detectors Textures of textures Tieu and Viola, CVPR 2000 Every combination of three filters generates a different feature This gives thousands of features. Boosting selects a sparse subset, so computations on test time are very efficient. Boosting also avoids overfitting to some extend.

Haar wavelets Haar filters and integral image Viola and Jones, ICCV 2001 The average intensity in the block is computed with four sums independently of the block size.

Haar wavelets Papageorgiou & Poggio (2000) Polynomial SVM

Edges and chamfer distance Gavrila, Philomin, ICCV 1999

Edge fragments Weak detector = k edge fragments and threshold. Chamfer distance uses 8 orientation planes Opelt, Pinz, Zisserman, ECCV 2006

Histograms of oriented gradients Dalal & Trigs, 2006 Shape context Belongie, Malik, Puzicha, NIPS 2000 SIFT, D. Lowe, ICCV 1999

Weak detectors Part based: similar to part-based generative models. We create weak detectors by using parts and voting for the object center location Car model Screen model These features are used for the detector on the course web site.

Weak detectors First we collect a set of part templates from a set of training objects. Vidal-Naquet, Ullman, Nature Neuroscience 2003 …

Weak detectors We now define a family of “weak detectors” as: == Better than chance *

Weak detectors We can do a better job using filtered images Still a weak detector but better than before * * = = =

Example: screen detection Feature output

Example: screen detection Feature output Thresholded output Weak ‘detector’ Produces many false alarms.

Example: screen detection Feature output Thresholded output Strong classifier at iteration 1

Example: screen detection Feature output Thresholded output Strong classifier Second weak ‘detector’ Produces a different set of false alarms.

Example: screen detection + Feature output Thresholded output Strong classifier Strong classifier at iteration 2

Example: screen detection + … Feature output Thresholded output Strong classifier Strong classifier at iteration 10

Example: screen detection + … Feature output Thresholded output Strong classifier Adding features Final classification Strong classifier at iteration 200

We want the complexity of the 3 features classifier with the performance of the 100 features classifier: Cascade of classifiers Fleuret and Geman 2001, Viola and Jones 2001 Recall Precision 0% 100% 3 features 30 features 100 features Select a threshold with high recall for each stage. We increase precision using the cascade

Some goals for object recognition Able to detect and recognize many object classes Computationally efficient Able to deal with data starving situations: –Some training samples might be harder to collect than others –We want on-line learning to be fast

Shared features Is learning the object class 1000 easier than learning the first? Can we transfer knowledge from one object to another? Are the shared properties interesting by themselves? …

Shared features Screen detector Car detector Face detector Independent binary classifiers: Torralba, Murphy, Freeman. CVPR PAMI 2007 Screen detector Car detector Face detector Binary classifiers that share features:

50 training samples/class 29 object classes 2000 entries in the dictionary Results averaged on 20 runs Error bars = 80% interval Krempp, Geman, & Amit, 2002 Torralba, Murphy, Freeman. CVPR 2004 Shared features Class-specific features

Generalization as a function of object similarities 12 viewpoints 12 unrelated object classes Number of training samples per class Area under ROC K = 2.1 K = 4.8 Torralba, Murphy, Freeman. CVPR PAMI 2007

Sharing patches Bart and Ullman, 2004 For a new class, use only features similar to features that where good for other classes: Proposed Dog features

Sharing transformations Miller, E., Matsakis, N., and Viola, P. (2000). Learning from one example through shared densities on transforms. In IEEE Computer Vision and Pattern Recognition. Transformations are shared and can be learnt from other tasks.

Some references on multiclass Caruana 1997 Schapire, Singer, 2000 Thrun, Pratt 1997 Krempp, Geman, Amit, 2002 E.L.Miller, Matsakis, Viola, 2000 Mahamud, Hebert, Lafferty, 2001 Fink 2004 LeCun, Huang, Bottou, 2004 Holub, Welling, Perona, 2005 …