Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.

Slides:



Advertisements
Similar presentations
Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)
Advertisements

Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
Lecture 31: Modern object recognition
1 Building a Dictionary of Image Fragments Zicheng Liao Ali Farhadi Yang Wang Ian Endres David Forsyth Department of Computer Science, University of Illinois.
More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.
Discriminative and generative methods for bags of features
Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Recognition: A machine learning approach
Small Codes and Large Image Databases for Recognition CVPR 2008 Antonio Torralba, MIT Rob Fergus, NYU Yair Weiss, Hebrew University.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
EECS 442 – Computer vision Segments of this lectures are courtesy of Prof F. Li, R. Fergus and A. Zisserman Databases for object recognition and beyond.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Transferring information using Bayesian priors on object categories Li Fei-Fei 1, Rob Fergus 2, Pietro Perona 1 1 California Institute of Technology, 2.
Bag-of-features models
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Generative learning methods for bags of features
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Visual Object Recognition Rob Fergus Courant Institute, New York University
Extreme, Non-parametric Object Recognition 80 million tiny images (Torralba et al)
Recap: HOGgles Data, Representation, and Learning matter. – This work looked just at representation By creating a human-understandable HoG visualization,
Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
Lecture 29: Recent work in recognition CS4670: Computer Vision Noah Snavely.
Part I: Classification and Bayesian Learning
Discriminative and generative methods for bags of features
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Crash Course on Machine Learning
Computer vision: models, learning and inference Chapter 6 Learning and Inference in Vision.
Salient Object Detection by Composition
Exercise Session 10 – Image Categorization
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
Why Categorize in Computer Vision ?. Why Use Categories? People love categories!
Last part: datasets and object collections. CMU/MIT frontal facesvasc.ri.cmu.edu/idb/html/face/frontal_images cbcl.mit.edu/software-datasets/FaceData2.html.
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Image Classification 영상분류
Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman University of Texas at Austin Jingnan.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
UNBIASED LOOK AT DATASET BIAS Antonio Torralba Massachusetts Institute of Technology Alexei A. Efros Carnegie Mellon University CVPR 2011.
Bayesian Parameter Estimation Liad Serruya. Agenda Introduction Bayesian decision theory Scale-Invariant Learning Bayesian “One-Shot” Learning.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
11/26/2015 Copyright G.D. Hager Class 2 - Schedule 1.Optical Illusions 2.Lecture on Object Recognition 3.Group Work 4.Sports Videos 5.Short Lecture on.
Object detection, deep learning, and R-CNNs
CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Recognition Using Visual Phrases
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.
MACHINE LEARNING 3. Supervised Learning. Learning a Class from Examples Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1)
Max-Margin Training of Upstream Scene Understanding Models Jun Zhu Carnegie Mellon University Joint work with Li-Jia Li *, Li Fei-Fei *, and Eric P. Xing.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Machine learning & object recognition Cordelia Schmid Jakob Verbeek.
Object detection with deformable part-based models
Supervised Time Series Pattern Discovery through Local Importance
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT
HOGgles Visualizing Object Detection Features
Speaker: Lingxi Xie Authors: Lingxi Xie, Qi Tian, Bo Zhang
Recognizing and Learning Object categories
“The Truth About Cats And Dogs”
Brief Review of Recognition + Context
Faster R-CNN By Anthony Martinez.
Generic object recognition
Presentation transcript:

Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman

Object categorization: the statistical viewpoint vs. MAP decision:

Object categorization: the statistical viewpoint vs. Bayes rule: posterior likelihood prior MAP decision:

Object categorization: the statistical viewpoint Discriminative methods: model posterior Generative methods: model likelihood and prior posterior likelihood prior

Discriminative methods Direct modeling of Zebra Non-zebra Decision boundary

Model and Generative methods LowMiddle HighMiddle  Low

Generative vs. discriminative learning GenerativeDiscriminative Class densities Posterior probabilities

Generative vs. discriminative methods Generative methods + Can sample from them / compute how probable any given model instance is + Can be learned using images from just a single category – Sometimes we don’t need to model the likelihood when all we want is to make a decision Discriminative methods + Efficient + Often produce better classification rates – Require positive and negative training data – Can be hard to interpret

Steps for statistical recognition Representation –Specify the model for an object category –Bag of features, part-based, global, etc. Learning –Given a training set, find the parameters of the model –Generative vs. discriminative Recognition –Apply the model to a new test image

Generalization How well does a learned model generalize from the data it was trained on to a new test set? Underfitting: model is too “simple” to represent all the relevant class characteristics –High training error and high test error Overfitting: model is too “complex” and fits irrelevant characteristics (noise) in the data –Low training error and high test error Occam’s razor: given two models that represent the data equally well, the simpler one should be preferred

Images in the training set must be annotated with the “correct answer” that the model is expected to produce Contains a motorbike Supervision

Unsupervised “Weakly” supervised Fully supervised Definition depends on task

What task? Classification –Object present/absent in image –Background may be correlated with object Localization / Detection –Localize object within the frame –Bounding box or pixel- level segmentation

Datasets Circa 2001: 5 categories, 100s of images per category Circa 2004: 101 categories Today: thousands of categories, tens of thousands of images

Caltech 101 & 256 Griffin, Holub, Perona, 2007 Fei-Fei, Fergus, Perona,

The PASCAL Visual Object Classes Challenge ( ) 2008 Challenge classes: Person: person Animal: bird, cat, cow, dog, horse, sheep Vehicle: aeroplane, bicycle, boat, bus, car, motorbike, train Indoor: bottle, chair, dining table, potted plant, sofa, tv/monitor

The PASCAL Visual Object Classes Challenge ( ) Main competitions –Classification: For each of the twenty classes, predicting presence/absence of an example of that class in the test image –Detection: Predicting the bounding box and label of each object from the twenty target classes in the test image

The PASCAL Visual Object Classes Challenge ( ) “Taster” challenges –Segmentation: Generating pixel-wise segmentations giving the class of the object visible at each pixel, or "background" otherwise –Person layout: Predicting the bounding box and label of each part of a person (head, hands, feet)

Lotus Hill Research Institute image corpus Z.Y. Yao, X. Yang, and S.C. Zhu,

Labeling with games L. von Ahn, L. Dabbish, 2004; L. von Ahn, R. Liu and M. Blum,

Russell, Torralba, Murphy, Freeman, 2008 LabelMe

80 Million Tiny Images

Dataset issues How large is the degree of intra-class variability? How “confusable” are the classes? Is there bias introduced by the background? I.e., can we “cheat” just by looking at the background and not the object?

Caltech-101

Summary Recognition is the “grand challenge” of computer vision History –Geometric methods –Appearance-based methods –Sliding window approaches –Local features –Parts-and-shape approaches –Bag-of-features approaches Statistical recognition concepts –Generative vs. discriminative models –Generalization, overfitting, underfitting –Supervision Tasks, datasets