An opposition to: "Context-Based Vision System for Place and Object Recognition" and "Contextual Models for Object Detection Using BRFs".

Presentation transcript:

An opposition to: "Context-Based Vision System for Place and Object Recognition" and "Contextual Models for Object Detection Using BRFs". Authors: Antonio Torralba, Kevin P. Murphy, William T. Freeman, and Mark A. Rubin. Opponent: Carlos Vallespi.

Paper claims: Claims to recognize 63 different locations. Claims to categorize new environments. Claims to help object recognition by suggesting presence and location.

Place recognition: Is the classifier really doing anything? Temporal information is available. An HMM will help the classifier a lot. Only 2-3 choices are possible at a time, given the current state.
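The point about temporal information can be illustrated with a minimal sketch of one HMM filtering step. The five-place corridor, the uniform transition probabilities, and the names `forward_step`, `transition`, and `flat` are all assumptions made for illustration; none of them come from the paper or the slides. The observation model is deliberately uninformative, so whatever narrowing happens is due to the transition model alone.

```python
def forward_step(belief, transition, likelihood):
    """One HMM filtering step: predict with the transition model,
    then reweight by the observation likelihood and normalize."""
    n = len(belief)
    predicted = [sum(belief[i] * transition[i][j] for i in range(n))
                 for j in range(n)]
    unnormalized = [predicted[j] * likelihood[j] for j in range(n)]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]

# Hypothetical corridor of 5 places: from each place the camera can
# stay put or move to an adjacent place, with equal probability.
N = 5
transition = [[0.0] * N for _ in range(N)]
for i in range(N):
    neighbors = [j for j in (i - 1, i, i + 1) if 0 <= j < N]
    for j in neighbors:
        transition[i][j] = 1.0 / len(neighbors)

belief = [0.0, 0.0, 1.0, 0.0, 0.0]  # current state is known: place 2
flat = [1.0] * N                    # a classifier that says nothing at all

belief = forward_step(belief, transition, flat)
candidates = [j for j, p in enumerate(belief) if p > 0]
print(candidates)
```

Even with a classifier that carries no information, only the current place and its two neighbors remain possible after one step, which is the "2-3 choices" situation described above.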

Simple place recognition with SIFT: Database

Simple place recognition with SIFT: Test DB

Comparing with SIFT: 74 matches

Comparing with SIFT: Some correct matches

Comparing with SIFT: Correct (no matches)

Comparing with SIFT: No incorrect matches. Just one weak match (22 matches). With 9 locations provided, 100% accuracy on the test set.
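The slides do not say how the SIFT matches were counted, so the following is only a sketch of one common way to do it: nearest-neighbor descriptor matching with a ratio test, then picking the database place with the most matches. The toy 2-D tuples stand in for real 128-dimensional SIFT descriptors, and the names `match_count` and `recognize`, the `ratio=0.8` threshold, and the `min_matches=2` cutoff are assumptions chosen for illustration.

```python
import math

def match_count(query, reference, ratio=0.8):
    """Count query descriptors whose nearest reference descriptor is
    clearly closer than the second nearest (ratio test)."""
    count = 0
    for q in query:
        dists = sorted(math.dist(q, r) for r in reference)
        if len(dists) >= 2 and dists[0] < ratio * dists[1]:
            count += 1
    return count

def recognize(query, database, min_matches=2):
    """Return (label, scores): the place with the most matches, or
    None as the label when no place reaches min_matches."""
    scores = {label: match_count(query, descs)
              for label, descs in database.items()}
    best = max(scores, key=scores.get)
    label = best if scores[best] >= min_matches else None
    return label, scores

# Toy stand-ins for per-place descriptor sets (hypothetical data).
database = {
    "office": [(0, 0), (10, 0), (0, 10)],
    "corridor": [(50, 50), (60, 50), (50, 60)],
}
query_office = [(0.5, 0.2), (9.8, 0.1), (0.1, 10.3)]
query_unknown = [(200, 200), (205, 210), (190, 195)]

print(recognize(query_office, database))
print(recognize(query_unknown, database))
```

A query from a place that is not in the database yields too few matches for every entry and is rejected, which corresponds to the "no matches" case on the slides.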

Scene categorization: The paper claims to categorize 17 unseen scenarios. We have seen other methods in the past for scene categorization that also worked well (with up to 13 classes): bag-of-words approaches (using textons, for instance), histogram-based approaches, and Torralba's paper (using image frequencies). They use an average of local features over the image with a sliding window. In fact, this is just a sort of histogram approach (nothing new). The DB does not seem very generic. They do not compare with other methods. It performs poorly, except when the HMM is used.

Object presence and location: Their own images speak for themselves ;) ??? A file cabinet is expected to be seen in almost the entire image. Most of the objects that are highly expected to be found do not show up.

Object presence and location: Their own images speak for themselves ;) Except for the building (for which I am sure I could get something similar by averaging all the bounding boxes of buildings), all the others are wrong… even the sky.
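The bounding-box baseline mentioned above can be sketched in a few lines: average the training boxes for a class into a per-pixel map and use that as the expected location. The box coordinates, the image size, and the name `average_box_map` are hypothetical and only serve the illustration; no such experiment is reported in the slides.

```python
def average_box_map(boxes, width, height):
    """Per-pixel fraction of training boxes that cover each pixel.
    Boxes are (x0, y0, x1, y1) with exclusive upper bounds."""
    heat = [[0.0] * width for _ in range(height)]
    for x0, y0, x1, y1 in boxes:
        for y in range(y0, y1):
            for x in range(x0, x1):
                heat[y][x] += 1.0 / len(boxes)
    return heat

# Hypothetical building annotations on a 6x5 image grid.
building_boxes = [(0, 0, 4, 3), (1, 0, 5, 3), (2, 1, 6, 4)]
prior = average_box_map(building_boxes, width=6, height=5)

for row in prior:
    print(" ".join(f"{v:.2f}" for v in row))
```

Pixels covered by every training box get a value near 1, and pixels that no box covers stay at 0, so the map is a context-free location prior that uses no image features at all.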

Conclusions. Place recognition: It seems to be an easy problem that can be solved by simpler methods without temporal information. An HMM alone could have done similar work. Scene categorization: Suspicious DB. Only works because of the temporal information. Object presence and location: Just does not work.