An opposition to Window- Scanning Approaches in Computer Vision Presented by Tomasz Malisiewicz March 6, 2006 Advanced The Robotics Institute.

Slides:

Advertisements

Similar presentations

Putting Objects in Perspective Derek Hoiem Alexei A. Efros Martial Hebert Carnegie Mellon University Robotics Institute.

Advertisements

Learning Shared Body Plans Ian Endres University of Illinois work with Derek Hoiem, Vivek Srikumar and Ming-Wei Chang.

Object Detection Using Semi- Naïve Bayes to Model Sparse Structure Henry Schneiderman Robotics Institute Carnegie Mellon University.

Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.

3 Small Comments Alex Berg Stony Brook University I work on recognition: features – action recognition – alignment – detection – attributes – hierarchical.

Wrap Up. We talked about Filters Edges Corners Interest Points Descriptors Image Stitching Stereo SFM.

Tracking Learning Detection

Ivan Laptev IRISA/INRIA, Rennes, France September 07, 2006 Boosted Histograms for Improved Object Detection.

Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.

Recognition: A machine learning approach

Tracking Objects with Dynamics Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/21/15 some slides from Amin Sadeghi, Lana Lazebnik,

Training Regimes Motivation  Allow state-of-the-art subcomponents  With “Black-box” functionality  This idea also occurs in other application areas.

Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.

1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.

Object Recognition with Informative Features and Linear Classification Authors: Vidal-Naquet & Ullman Presenter: David Bradley.

Self-Supervised Segmentation of River Scenes Supreeth Achar *, Bharath Sankaran ‡, Stephen Nuske *, Sebastian Scherer *, Sanjiv Singh * * ‡

Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.

Lecture 17: Parts-based models and context CS6670: Computer Vision Noah Snavely.

CS 223B Assignment 1 Help Session Dan Maynes-Aminzade.

Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.

Visual Object Recognition Rob Fergus Courant Institute, New York University

Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.

Presenter: Stefan Zickler

Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.

Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Salient Object Detection by Composition

A Scale and Rotation Invariant Approach to Tracking Human Body Part Regions in Videos Yihang BoHao Jiang Institute of Automation, CAS Boston College.

Studying Visual Attention with the Visual Search Paradigm Marc Pomplun Department of Computer Science University of Massachusetts at Boston

Learning Based Hierarchical Vessel Segmentation

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

Feature and object tracking algorithms for video tracking Student: Oren Shevach Instructor: Arie nakhmani.

Object Detection Sliding Window Based Approach Context Helps

Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.

Object Detection Using the Statistics of Parts Presented by Nicholas Chan – Advanced Perception Robust Real-time Object Detection Henry Schneiderman.

Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.

Recovering Surface Layout from a Single Image D. Hoiem, A.A. Efros, M. Hebert Robotics Institute, CMU Presenter: Derek Hoiem CS 598, Spring 2009 Jan 29,

Marco Pedersoli, Jordi Gonzàlez, Xu Hu, and Xavier Roca

Face detection Slides adapted Grauman & Liebe’s tutorial

Summary Marie Yarbrough. Introduction History of Image Forgery Method Segmentation Classification Common-Sense Reasoning Conclusion.

Clustering Supervised vs. Unsupervised Learning Examples of clustering in Web IR Characteristics of clustering Clustering algorithms Cluster Labeling 1.

Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman University of Texas at Austin Jingnan.

Object Detection with Discriminatively Trained Part Based Models

1Session 6.1 Life’s BIG Questions Week 6 Session 1.

ECE 172A SIMPLE OBJECT DETECTOR WITH INDICATOR WHEN A NEW OBJECT HAS BEEN ADDED TO OR MISSING IN A ROOM Presented by by Hugo Groening.

UNBIASED LOOK AT DATASET BIAS Antonio Torralba Massachusetts Institute of Technology Alexei A. Efros Carnegie Mellon University CVPR 2011.

Putting Context into Vision Derek Hoiem September 15, 2004.

Tracking CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.

Face Detection Ying Wu Electrical and Computer Engineering Northwestern University, Evanston, IL

What are animals ? Animals are living things that move, eat, breathe grow, and change. Are you an animal ?

Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.

Project 3 Results.

Context-based vision system for place and object recognition Antonio Torralba Kevin Murphy Bill Freeman Mark Rubin Presented by David Lee Some slides borrowed.

Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.

Pictorial Structures and Distance Transforms Computer Vision CS 543 / ECE 549 University of Illinois Ian Endres 03/31/11.

Vision Overview  Like all AI: in its infancy  Many methods which work well in specific applications  No universal solution  Classic problem: Recognition.

Recognition Using Visual Phrases

Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.

Context Neelima Chavali ECE /21/2013. Roadmap Introduction Paper1 – Motivation – Problem statement – Approach – Experiments & Results Paper 2 Experiments.

Point Distribution Models Active Appearance Models Compilation based on: Dhruv Batra ECE CMU Tim Cootes Machester.

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

Carl Vondrick, Aditya Khosla, Tomasz Malisiewicz, Antonio Torralba Massachusetts Institute of Technology

SHAHAB iCV Research Group.

A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology

Tracking Objects with Dynamics

Presented by Minh Hoai Nguyen Date: 28 March 2007

Context-based vision system for place and object recognition

An opposition to: Context-Based Vision System for Place and Object Recognition Contextual Models for Object Detection Using BRFs.

Object detection as supervised classification

Brief Review of Recognition + Context

Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu

Presentation transcript:

An opposition to Window- Scanning Approaches in Computer Vision Presented by Tomasz Malisiewicz March 6, 2006 Advanced The Robotics Institute

2 Problems Does scanning windows across an image work? What types of objects does it work for?

What are window-scanning approaches missing? *Following Slides Borrowed From Derek Hoiem’s “Putting Context Into Vision” PresentationPutting Context Into Vision Context aka Top-Down Processing

Quick Question: What is this?

What is context? Any data or meta-data not directly produced by the presence of an object –Nearby image data Context

What is context? Any data or meta-data not directly produced by the presence of an object –Nearby image data –Scene information Context

What is context? Any data or meta-data not directly produced by the presence of an object –Nearby image data –Scene information –Presence, locations of other objects Tree

Clues for Function What is this?

Clues for Function What is this? Now can you tell?

Low-Res Scenes What is this?

Low-Res Scenes What is this? Now can you tell?

More Low-Res What are these blobs?

More Low-Res The same pixels! (a car)

Why is context useful? Objects defined at least partially by function –Trees grow in ground –Birds can fly (usually) –Door knobs help open doors

Why is context useful? Objects defined at least partially by function –Context gives clues about function Not rooted into the ground  not tree Object in sky  {cloud, bird, UFO, plane, superman} Door knobs always on doors

Why is context useful? Objects defined at least partially by function –Context gives clues about function Objects like some scenes better than others Toilets like bathrooms Fish like water

Why is context useful? Objects defined at least partially by function –Context gives clues about function Objects like some scenes better than others Many objects are used together and, thus, often appear together Kettle and stove Keyboard and monitor

The other* problem What types of objects does it work for? *Assuming we can just directly avoid the first problem

“Our goal is to develop a system that detects and recognizes many kinds of objects in photographs and video including everyday office objects, text captions in video, and various structures in biomedical imagery.” – Schneiderman and Kanade from Object Detection Using the Statistics of Parts How many different classifiers must one construct? A different classifier for each object? A different classifier for each pose of an object? How many poses do we need per object? “However, such approaches seem unlikely to scale up to the detection of hundreds or thousands of different object classes because each classifier is trained and run independently.” – Torralba and Murphy and Freeman from Sharing features: efficient boosting procedures for multiclass object detection

Too many windows Now imagine scanning a window and applying 100K independent classifiers at each window

Conclusion Without context, we can’t find all things we want to find. We need context to help constrain the search for objects. With independent classifiers per object (and per pose), we can’t detect a large number of objects. Should cow detectors and a horse detectors be built independently? Think along the lines of a horse and a cow are types of animals that often occur in similar contexts. Remember that complex and deformable objects would require many poses if are to adhere to the window-based classifier paradigm.

Thank you. *Pascal 2006 Visual Challenge Image