Li Fei-Fei, Stanford Rob Fergus, NYU Antonio Torralba, MIT Recognizing and Learning Object Categories: Year 2009 ICCV 2009 Kyoto, Short Course, September.

Slides:



Advertisements
Similar presentations
Copyright Jiawei Han, modified by Charles Ling for CS411a
Advertisements

INF1090 Special Project Photography.
Get Close to your subject Your subject should be the star of your photos, and the one way to make that happen is to be sure you are close enough to the.
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
Last week... why object recognition is difficult, the template model the feature recognition model, word recognition as a case study Today... Recognition.
Large dataset for object and scene recognition A. Torralba, R. Fergus, W. T. Freeman 80 million tiny images Ron Yanovich Guy Peled.
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Detection and Recognition
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Cognitive Processes PSY 334 Chapter 2 – Perception April 9, 2003.
Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
Cognitive Processes PSY 334 Chapter 2 – Perception June 30, 2003.
Recognising objects and faces. General problems Given that objects move on a surface, why do they not appear to change shape? How do we recognise objects.
Pattern Recognition Pattern - complex composition of sensory stimuli that the human observer may recognize as being a member of a class of objects Issue.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce.
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Visual Object Recognition Rob Fergus Courant Institute, New York University
HUMAN INFORMATION PROCESSING II STRUCTURE, SYSTEMS AND FUNCTION.
Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
Categorization: Scenes & Objects (P) Lavanya Sharan March 16th, 2011.
Cognitive Processes PSY 334 Chapter 2 – Perception.
Computer vision.
Edit by Dalton Lin Copyright © 2012 Elsevier Inc. All rights reserved.. Chapter 1 Vision, the Challenge.
Human abilities Presented By Mahmoud Awadallah 1.
Observations and Inferences Scientists use observations and inferences to understand phenomena in the real world.
Introduction to Textures and Skins Chapter 8 & 9 3D Game Programming All-in-One By Ken Finney.
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Local Non-Negative Matrix Factorization as a Visual Representation Tao Feng, Stan Z. Li, Heung-Yeung Shum, HongJiang Zhang 2002 IEEE Presenter : 張庭豪.
Yao, B., and Fei-fei, L. IEEE Transactions on PAMI(2012)
1 Perception, Illusion and VR HNRS 299, Spring 2008 Lecture 2 Introduction, Light Course webpage:
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
11/26/2015 Copyright G.D. Hager Class 2 - Schedule 1.Optical Illusions 2.Lecture on Object Recognition 3.Group Work 4.Sports Videos 5.Short Lecture on.
Object Lesson: Discovering and Learning to Recognize Objects Object Lesson: Discovering and Learning to Recognize Objects – Paul Fitzpatrick – MIT CSAIL.
Autonomous Robots Vision © Manfred Huber 2014.
Context-based vision system for place and object recognition Antonio Torralba Kevin Murphy Bill Freeman Mark Rubin Presented by David Lee Some slides borrowed.
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.
©Michael J. BlackCS143 Intro to Computer VisionBrown University Introduction to Computer Vision Michael J. Black Oct 2004 Object Recognition.
UNITY Principle of design that allows the viewer to see a complex combination of elements, principles, and media as a complete whole. (The invisible glue)
Recognition Readings – C. Bishop, “Neural Networks for Pattern Recognition”, Oxford University Press, 1998, Chapter 1. – Szeliski, Chapter (eigenfaces)
High-Level Vision Object Recognition.
Learning Styles. Numbers The numbers indicate how strongly you prefer one learning style over another. A 1 or 3 means you are pretty well balanced and.
How we actively interpret our environment..  Perception: The process in which we understand sensory information.  Illusions are powerful examples of.
Introduction to Recognition CS4670/5670: Intro to Computer Vision Noah Snavely mountain building tree banner vendor people street lamp.
1 What makes a Chair a Chair? Joydeep Bhadra Simon Fraser University Helmut Grabner, Jeurgen Gall, Luc Van Gool.
Another Example: Circle Detection
Cognitive Processes PSY 334
Lecture 25: Introduction to Recognition
Prof. Riyadh Al_Azzawi F.R.C.Psych
How to Draw What You See!.
Lecture 14: Introduction to Recognition
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT
Perceiving and Recognizing Objects
An opposition to: Context-Based Vision System for Place and Object Recognition Contextual Models for Object Detection Using BRFs.
Neural Correlates of Shape from Shading
Sensation/Perception AP PSYCH 42S (15) ~ Daniel B. & Hao-Yi S.
Lecture 25: Introduction to Recognition
ICCV 2009 Kyoto, Short Course, September 24
Brief Review of Recognition + Context
Prof. Riyadh Al_Azzawi F.R.C.Psych
Cognitive Processes PSY 334
COMPOSITION in photography
Cognitive Processes PSY 334
Prof. Riyadh Al_Azzawi F.R.C.Psych
Presentation transcript:

Li Fei-Fei, Stanford Rob Fergus, NYU Antonio Torralba, MIT Recognizing and Learning Object Categories: Year 2009 ICCV 2009 Kyoto, Short Course, September 24 Testimonials: “since I attended this course, I can recognize all the objects that I see”

Why do we care about recognition? Perception of function: We can perceive the 3D shape, texture, material properties, without knowing about objects. But, the concept of category encapsulates also information about what can we do with those objects. “We therefore include the perception of function as a proper –indeed, crucial- subject for vision science”, from Vision Science, chapter 9, Palmer.

The perception of function Direct perception (affordances): Gibson Flat surface Horizontal Knee-high … Sittable upon Chair Chair? Flat surface Horizontal Knee-high … Sittable upon Chair Mediated perception (Categorization)

Direct perception Some aspects of an object function can be perceived directly Functional form: Some forms clearly indicate to a function (“sittable-upon”, container, cutting device, …) Sittable-upon It does not seem easy to sit-upon this…

Direct perception Some aspects of an object function can be perceived directly Observer relativity: Function is observer dependent From

Limitations of Direct Perception The functions are the same at some level of description: we can put things inside in both and somebody will come later to empty them. However, we are not expected to put inside the same kinds of things… Objects of similar structure might have very different functions Not all functions seem to be available from direct visual information only.

Limitations of Direct Perception Propulsion system Strong protective surface Something that looks like a door Sure, I can travel to space on this object Visual appearance might be a very weak cue to function

How do we achieve Mediated perception? Well… this requires object recognition (for more details, see entire course)

Object recognition Is it really so hard? This is a chair Find the chair in this image Output of normalized correlation

Object recognition Is it really so hard? Find the chair in this image Pretty much garbage Simple template matching is not going to make it

Object recognition Is it really so hard? Find the chair in this image A “popular method is that of template matching, by point to point correlation of a model pattern with the image pattern. These techniques are inadequate for three- dimensional scene analysis for many reasons, such as occlusion, changes in viewing angle, and articulation of parts.” Nivatia & Binford, 1977.

Brady, M. J., & Kersten, D. (2003). Bootstrapped learning of novel objects. J Vis, 3(6), And it can get a lot harder

your visual system is amazing

is your visual system amazing?

Discover the camouflaged object Brady, M. J., & Kersten, D. (2003). Bootstrapped learning of novel objects. J Vis, 3(6),

Discover the camouflaged object Brady, M. J., & Kersten, D. (2003). Bootstrapped learning of novel objects. J Vis, 3(6),

Any guesses?

Outline 1. Introduction (15’) 2. Single object categories (1h15’) - Bag of words (rob) - Part-based (rob) - Discriminative (rob) - Detecting single objects in contexts (antonio) - 3D object classes (fei-fei) 15:30 – 16:00 Coffee break 3. Multiple object categories (1h30’) - Recognizing a large number of objects (rob) - Recognizing multiple objects in an image (antonio) - Objects and annotations (fei-fei) 4. Object-related datasets and challenges (30’)