11/26/2015 Copyright G.D. Hager Class 2 - Schedule 1.Optical Illusions 2.Lecture on Object Recognition 3.Group Work 4.Sports Videos 5.Short Lecture on.

Slides:



Advertisements
Similar presentations
The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.
Advertisements

Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
Internet Vision - Lecture 3 Tamara Berg Sept 10. New Lecture Time Mondays 10:00am-12:30pm in 2311 Monday (9/15) we will have a general Computer Vision.
Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee.
Recognition by Probabilistic Hypothesis Construction P. Moreels, M. Maire, P. Perona California Institute of Technology.
LOCUS (Learning Object Classes with Unsupervised Segmentation) A variational approach to learning model- based segmentation. John Winn Microsoft Research.
Discriminative and generative methods for bags of features
Object Recognition. So what does object recognition involve?
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Recognition: A machine learning approach
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
How many object categories are there? Biederman 1987.
Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.
Object Recognition Szeliski Chapter 14.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
A Study of Approaches for Object Recognition
Object Recognition. So what does object recognition involve?
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce.
Object Recognition. So what does object recognition involve?
Object recognition Jana Kosecka Slides from D. Lowe, D. Forsythe and J. Ponce book, ICCV 2005 Tutorial Fei-Fei Li, Rob Fergus and A. Torralba.
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Tomasz Malisiewicz Smith Hall.
Distinctive Image Feature from Scale-Invariant KeyPoints
Object recognition Jana Kosecka Slides from D. Lowe, D. Forsythe and J. Ponce book, ICCV 2005 Tutorial Fei-Fei Li, Rob Fergus and A. Torralba.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Announcements Kevin Matzen office hours – Tuesday 4-5pm, Thursday 2-3pm, Upson 317 TA: Yin Lou Course lab: Upson 317 – Card access will be setup soon Course.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Visual Object Recognition Rob Fergus Courant Institute, New York University
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Jean-Francois Lalonde A521.
By Suren Manvelyan,
CS294‐43: Visual Object and Activity Recognition Prof. Trevor Darrell Spring 2009.
Face Recognition: An Introduction
Machine learning & category recognition Cordelia Schmid Jakob Verbeek.
The Beauty of Local Invariant Features
Crash Course on Machine Learning
COMPUTER VISION: SOME CLASSICAL PROBLEMS ADWAY MITRA MACHINE LEARNING LABORATORY COMPUTER SCIENCE AND AUTOMATION INDIAN INSTITUTE OF SCIENCE June 24, 2013.
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
Last part: datasets and object collections. CMU/MIT frontal facesvasc.ri.cmu.edu/idb/html/face/frontal_images cbcl.mit.edu/software-datasets/FaceData2.html.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Camera/Vision for Geo-Location & Geo-Identification John S. Zelek Intelligent Human Machine Interface Lab Dept. of Systems Design Engineering University.
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
Why is computer vision difficult?
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Bayesian Parameter Estimation Liad Serruya. Agenda Introduction Bayesian decision theory Scale-Invariant Learning Bayesian “One-Shot” Learning.
Handwritten digit recognition
Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.
A Brief Introduction on Face Detection Mei-Chen Yeh 04/06/2010 P. Viola and M. J. Jones, Robust Real-Time Face Detection, IJCV 2004.
Recognition Readings – C. Bishop, “Neural Networks for Pattern Recognition”, Oxford University Press, 1998, Chapter 1. – Szeliski, Chapter (eigenfaces)
Using the Forest to see the Trees: A computational model relating features, objects and scenes Antonio Torralba CSAIL-MIT Joint work with Aude Oliva, Kevin.
Introduction to Recognition CS4670/5670: Intro to Computer Vision Noah Snavely mountain building tree banner vendor people street lamp.
CS 558 Computer Vision John Oliensis. Today’s class What is vision What is computer vision How we can solve vision problems –Important tools –Overall.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
Li Fei-Fei, Stanford Rob Fergus, NYU Antonio Torralba, MIT Recognizing and Learning Object Categories: Year 2009 ICCV 2009 Kyoto, Short Course, September.
Machine learning & object recognition Cordelia Schmid Jakob Verbeek.
Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Lecture 25: Introduction to Recognition
Paper Presentation: Shape and Matching
Lecture 14: Introduction to Recognition
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT
Object detection as supervised classification
Lecture 25: Introduction to Recognition
Brief Review of Recognition + Context
Lecture: Object Recognition
Generic object recognition
Presentation transcript:

11/26/2015 Copyright G.D. Hager Class 2 - Schedule 1.Optical Illusions 2.Lecture on Object Recognition 3.Group Work 4.Sports Videos 5.Short Lecture on Rigid Transformations 6.Lab time

11/26/2015 Copyright G.D. Hager Object Recognition Techniques

11/26/2015 Copyright G.D. Hager Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15

11/26/2015 Copyright G.D. Hager

11/26/2015 Copyright G.D. Hager perceptible vision materialthing

11/26/2015 Copyright G.D. Hager

11/26/2015 Copyright G.D. Hager

11/26/2015 Copyright G.D. Hager Plato said… Ordinary objects are classified together if they `participate' in the same abstract Form, such as the Form of a Human or the Form of Quartz. Forms are proper subjects of philosophical investigation, for they have the highest degree of reality. Ordinary objects, such as humans, trees, and stones, have a lower degree of reality than the Forms. Fictions, shadows, and the like have a still lower degree of reality than ordinary objects and so are not proper subjects of philosophical enquiry.

11/26/2015 Copyright G.D. Hager Bruegel, 1564

11/26/2015 Copyright G.D. Hager How many object categories are there? Biederman 1987

11/26/2015 Copyright G.D. Hager Problems of Computer Vision: Recognition Given a database of objects and an image determine what, if any of the objects are present in the image.

11/26/2015 Copyright G.D. Hager Problems of Computer Vision: Recognition Given a database of objects and an image determine what, if any of the objects are present in the image.

11/26/2015 Copyright G.D. Hager Problems of Computer Vision: Recognition Given a database of objects and an image determine what, if any of the objects are present in the image.

11/26/2015 Copyright G.D. Hager Object Recognition: The Problem Given: A database D of “known” objects and an image I: 1. Determine which (if any) objects in D appear in I 2. Determine the pose (rotation and translation) of the object Segmentation (where is it 2D) Recognition (what is it) The object recognition conundrum Pose Est. (where is it 3D)

11/26/2015 Copyright G.D. Hager Object Recognition Issues How general is the problem? –2D vs. 3D –range of viewing conditions –available context –segmentation cues What sort of data is best suited to the problem? –local 2D features –3D surfaces –images How many objects are involved? –small: brute force search –large: ??

11/26/2015 Copyright G.D. Hager Object Recognition Approaches Geometry-based –Interpretation trees: use features compute “local constraints” valid under Euclidean or similarity group –Invariants: use features compute “global indices” that do not change over viewing conditions (i.e. invariant in the projective group) Image-based: –store information about views and match to views intensities histograms Semi-local: –use features detected using a stable (but not invariant) interest operator –use stable (but not invariant) measures on groups of features to index views

11/26/2015 Copyright G.D. Hager So what does object recognition involve?

11/26/2015 Copyright G.D. Hager Verification: is that a bus?

11/26/2015 Copyright G.D. Hager Detection: are there cars?

11/26/2015 Copyright G.D. Hager Identification: is that a picture of Mao?

11/26/2015 Copyright G.D. Hager Object categorization sky building flag wall banner bus cars bus face street lamp

11/26/2015 Copyright G.D. Hager Scene and context categorization outdoor city traffic …

11/26/2015 Copyright G.D. Hager Challenges 1: view point variation Michelangelo

11/26/2015 Copyright G.D. Hager Challenges 2: illumination slide credit: S. Ullman

11/26/2015 Copyright G.D. Hager Challenges 3: occlusion Magritte, 1957

11/26/2015 Copyright G.D. Hager Challenges 4: scale

11/26/2015 Copyright G.D. Hager Challenges 5: deformation Xu, Beihong 1943

11/26/2015 Copyright G.D. Hager Challenges 6: background clutter Klimt, 1913

11/26/2015 Copyright G.D. Hager History: single object recognition

11/26/2015 Copyright G.D. Hager History: single object recognition Lowe, et al. 1999, 2003 Mahamud and Herbert, 2000 Ferrari, Tuytelaars, and Van Gool, 2004 Rothganger, Lazebnik, and Ponce, 2004 Moreels and Perona, 2005 …

11/26/2015 Copyright G.D. Hager Challenges 7: intra-class variation

11/26/2015 Copyright G.D. Hager History: early object categorization

11/26/2015 Copyright G.D. Hager Turk and Pentland, 1991 Belhumeur et al Schneiderman et al Viola and Jones, 2000 Amit and Geman, 1999 LeCun et al Belongie and Malik, 2002 Schneiderman et al Argawal and Roth, 2002 Poggio et al. 1993

11/26/2015 Copyright G.D. Hager

11/26/2015 Copyright G.D. Hager OBJECTS ANIMALS INANIMATE PLANTS MAN-MADENATURAL VERTEBRATE ….. MAMMALS BIRDS GROUSEBOARTAPIR CAMERA

11/26/2015 Copyright G.D. Hager

11/26/2015 Copyright G.D. Hager Scenes, Objects, and Parts Features Parts Objects Scene E. Sudderth, A. Torralba, W. Freeman, A. Willsky. ICCV 2005.

11/26/2015 Copyright G.D. Hager Object categorization: the statistical viewpoint vs. Bayes rule: posterior ratio likelihood ratioprior ratio

11/26/2015 Copyright G.D. Hager Object categorization: the statistical viewpoint posterior ratio likelihood ratioprior ratio Discriminative methods model posterior Generative methods model likelihood and prior

11/26/2015 Copyright G.D. Hager Discriminative Direct modeling of Zebra Non-zebra Decision boundary

11/26/2015 Copyright G.D. Hager Model and Generative LowMiddle HighMiddle  Low

11/26/2015 Copyright G.D. Hager Three main issues Representation –How to represent an object category Learning –How to form the classifier, given training data Recognition –How the classifier is to be used on novel data

11/26/2015 Copyright G.D. Hager Representation –Generative / discriminative / hybrid

11/26/2015 Copyright G.D. Hager Representation –Generative / discriminative / hybrid –Appearance only or location and appearance

11/26/2015 Copyright G.D. Hager Representation –Generative / discriminative / hybrid –Appearance only or location and appearance –Invariances View point Illumination Occlusion Scale Deformation Clutter etc.

11/26/2015 Copyright G.D. Hager Representation –Generative / discriminative / hybrid –Appearance only or location and appearance –invariances –Part-based or global w/sub-window

11/26/2015 Copyright G.D. Hager Representation –Generative / discriminative / hybrid –Appearance only or location and appearance –invariances –Parts or global w/sub-window –Use set of features or each pixel in image

11/26/2015 Copyright G.D. Hager –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning Learning

11/26/2015 Copyright G.D. Hager –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –Methods of training: generative vs. discriminative Learning

11/26/2015 Copyright G.D. Hager –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels Learning Contains a motorbike

11/26/2015 Copyright G.D. Hager –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels –Batch/incremental (on category and image level; user- feedback ) Learning

11/26/2015 Copyright G.D. Hager –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels –Batch/incremental (on category and image level; user- feedback ) –Training images: Issue of overfitting Negative images for discriminative methods Priors Learning

11/26/2015 Copyright G.D. Hager –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels –Batch/incremental (on category and image level; user- feedback ) –Training images: Issue of overfitting Negative images for discriminative methods –Priors Learning

11/26/2015 Copyright G.D. Hager –Scale / orientation range to search over –Speed Recognition

11/26/2015 Copyright G.D. Hager State of The Art Current systems deal with simple nearly 2D situations or very restricted input On ~100 categories, reported accuracies in the 70-90% range, but with huge computational loads Not clear we have the right approach (yet)!

11/26/2015 Copyright G.D. Hager Example: Fergus 2003

11/26/2015 Copyright G.D. Hager Results, Fergus 2003

11/26/2015 Copyright G.D. Hager Summary Object recognition/categorization is a rapidly evolving area Current systems are getting to the point they may be useful in real applications. Much more remains to be done in understanding how to move to the next level of performance.