Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.

Slides:



Advertisements
Similar presentations
Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)
Advertisements

By: Mani Baghaei Fard.  During recent years number of moving vehicles in roads and highways has been considerably increased.
Articulated People Detection and Pose Estimation: Reshaping the Future
Diagnosing Error in Object Detectors Department of Computer Science University of Illinois at Urbana-Champaign (UIUC) Derek Hoiem Yodsawalai Chodpathumwan.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
3 Small Comments Alex Berg Stony Brook University I work on recognition: features – action recognition – alignment – detection – attributes – hierarchical.
Ivan Laptev IRISA/INRIA, Rennes, France September 07, 2006 Boosted Histograms for Improved Object Detection.
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
1 Building a Dictionary of Image Fragments Zicheng Liao Ali Farhadi Yang Wang Ian Endres David Forsyth Department of Computer Science, University of Illinois.
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs Roozbeh Mottaghi 1, Sanja Fidler 2, Jian Yao 2, Raquel Urtasun 2, Devi Parikh 3 1 UCLA.
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Proceedings of the IEEE 2010 Antonio Torralba, MIT Jenny Yuen, MIT Bryan C. Russell, MIT.
Recognition: A machine learning approach
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.
EECS 442 – Computer vision Segments of this lectures are courtesy of Prof F. Li, R. Fergus and A. Zisserman Databases for object recognition and beyond.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Unsupervised discovery of visual object class hierarchies Josef Sivic (INRIA / ENS), Bryan Russell (MIT), Andrew Zisserman (Oxford), Alyosha Efros (CMU)
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Agenda Introduction Bag-of-words model Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Visual Object Recognition Rob Fergus Courant Institute, New York University
Spatial Pyramid Pooling in Deep Convolutional
Lecture 29: Recent work in recognition CS4670: Computer Vision Noah Snavely.
Programme 2pm Introduction –Andrew Zisserman, Chris Williams 2.10pm Overview of the challenge and results –Mark Everingham (Oxford) 2.40pm Session 1: The.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.
Salient Object Detection by Composition
3D LayoutCRF Derek Hoiem Carsten Rother John Winn.
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
Last part: datasets and object collections. CMU/MIT frontal facesvasc.ri.cmu.edu/idb/html/face/frontal_images cbcl.mit.edu/software-datasets/FaceData2.html.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Detection, Segmentation and Fine-grained Localization
Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.
Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman University of Texas at Austin Jingnan.
Object Detection with Discriminatively Trained Part Based Models
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
UNBIASED LOOK AT DATASET BIAS Antonio Torralba Massachusetts Institute of Technology Alexei A. Efros Carnegie Mellon University CVPR 2011.
11/26/2015 Copyright G.D. Hager Class 2 - Schedule 1.Optical Illusions 2.Lecture on Object Recognition 3.Group Work 4.Sports Videos 5.Short Lecture on.
Object detection, deep learning, and R-CNNs
CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.
Recognition Using Visual Phrases
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
A Discriminatively Trained, Multiscale, Deformable Part Model Yeong-Jun Cho Computer Vision and Pattern Recognition,2008.
PANDA: Pose Aligned Networks for Deep Attribute Modeling Ning Zhang 1,2 Manohar Paluri 1 Marć Aurelio Ranzato 1 Trevor Darrell 2 Lumbomir Boudev 1 1 Facebook.
Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.
Jo˜ao Carreira, Abhishek Kar, Shubham Tulsiani and Jitendra Malik University of California, Berkeley CVPR2015 Virtual View Networks for Object Reconstruction.
CALTECH 256 Greg Griffin, Alex Holub and Pietro Perona.
Object detection with deformable part-based models
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Finding Clusters within a Class to Improve Classification Accuracy
Speaker: Lingxi Xie Authors: Lingxi Xie, Qi Tian, Bo Zhang
Recognizing and Learning Object categories
Bringing Salient Object Detection to the Foreground
“The Truth About Cats And Dogs”
Brief Review of Recognition + Context
The topic discovery models
Faster R-CNN By Anthony Martinez.
Outline Background Motivation Proposed Model Experimental Results
Semantic Segmentation
Presentation transcript:

Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based image retrieval Datasets & Conclusions

Databases Caltech 101 Caltech 256 Pascal Visual Object Classes (VOC) LabelMe Slides from Andrew Zisserman

Caltech 101 Pictures of objects belonging to 101 categories. About 40 to 800 images per category. Most categories have about 50 images. The size of each image is roughly 300 x 200 pixels. Collected in September 2003 by Fei-Fei Li, Marco Andreetto, and Marc 'Aurelio Ranzato. Train on 5, 10, 15, 20 or 30 images Test on rest – report results per class

Caltech 101 images

Smallest category size is 31 images: Too easy? –left-right aligned –Rotation artifacts –Soon will saturate performance Caltech-101: Drawbacks

Caltech-256 Smallest category size now 80 images About 30K images Harder –Not left-right aligned –No artifacts –Performance is halved –More categories New and larger clutter category

Caltech 256 images baseball-bat basketball-hoop dog kayac traffic light

The PASCAL Visual Object Classes (VOC) Dataset and Challenge Mark Everingham Luc Van Gool Chris Williams John Winn Andrew Zisserman

The PASCAL VOC Challenge Challenge in visual object recognition funded by PASCAL network of excellence Publicly available dataset of annotated images. Development kit available. Main competitions in classification (is there an X in this image) and detection (where are the X’s) “Taster competitions” in segmentation and 2-D human “pose estimation” (2007-present)

Dataset Content 20 classes: aeroplane, bicycle, boat, bottle, bus, car, cat, chair, cow, dining table, dog, horse, motorbike, person, potted plant, sheep, train, TV Real images downloaded from flickr, not filtered for “quality” Complex scenes, scale, pose, lighting, occlusion,...

Annotation Complete annotation of all objects Annotated in one session with written guidelines Truncated Object extends beyond BB Occluded Object is significantly occluded within BB Pose Facing left Difficult Not scored in evaluation

Examples Aeroplane Bus BicycleBirdBoatBottle CarCatChairCow

History New dataset annotated annually –Annotation of test set is withheld until after challenge ImagesObjectsClassesEntries 20052,2322, Collection of existing and some new data ,3049, Completely new dataset from flickr (+MSRC) 20079,96324, Increased classes to 20. Introduced tasters ,77620,73920 Added “occlusion” flag. Reuse of taster data. Release detailed results to support “meta-analysis”

Main Challenge Tasks Classification –Is there a dog in this image? –Evaluation by precision/recall Detection –Localize all the people (if any) in this image –Evaluation by precision/recall based on bounding box overlap

Person detection Example Precision/Recall: 2007

Russell, Torralba, Freman, 2005 LabelMe

CMU/MIT frontal facesvasc.ri.cmu.edu/idb/html/face/frontal_images cbcl.mit.edu/software-datasets/FaceData2.html PatchesFrontal faces Graz-02 Databasewww.emt.tugraz.at/~pinz/data/GRAZ_02/Segmentation masksBikes, cars, people UIUC Image Databasel2r.cs.uiuc.edu/~cogcomp/Data/Car/Bounding boxesCars TU Darmstadt Databasewww.vision.ethz.ch/leibe/data/Segmentation masksMotorbikes, cars, cows LabelMe datasetpeople.csail.mit.edu/brussell/research/LabelMe/intro.htmlPolygonal boundary>500 Categories Caltech 101www.vision.caltech.edu/Image_Datasets/Caltech101/Caltech101.htmlSegmentation masks101 categories Caltech 256 COIL www1.cs.columbia.edu/CAVE/research/softlib/coil-100.html Bounding Box Patches 256 Categories 100 instances NORBwww.cs.nyu.edu/~ylclab/data/norb-v1.0/Bounding box50 toys Databases for object localization Databases for object recognition On-line annotation tools ESP gamewww.espgame.orgGlobal image descriptionsWeb images LabelMepeople.csail.mit.edu/brussell/research/LabelMe/intro.htmlPolygonal boundaryHigh resolution images The next tables summarize some of the available datasets for training and testing object detection and recognition algorithms. These lists are far from exhaustive. Links to datasets Collections PASCALhttp:// boxesvarious

Topics not covered Context –Scene –Inter-object relations Video –Tracking & detection Multiple viewpoints

Summary Methods reviewed here –Bag of words –Bag of words with location –Parts and structure –Discriminative methods –Combined Segmentation and recognition –Recognition for retrieval Resources online: –Slides –Code –Links to datasets