Object Detection Sliding Window Based Approach Context Helps

Presentation transcript:

Detection Evolution with Multi-Order Contextual Co-Occurrence Guang Chen (Missouri) Yuanyuan Ding (Epson) Jing Xiao (Epson) Tony Han (Missouri)

Object Detection: Sliding Window Based Approach. Classifiers and features are typically computed inside the window. Context Helps: context outside the sliding window can be used to achieve better performance.
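As a point of reference for the slides that follow, here is a minimal sketch of how a sliding-window scan can produce a response map. It is not the authors' detector: `mean_intensity` is a hypothetical stand-in for a real classifier score, and the image, window size, and stride are toy values chosen for illustration.

```python
import numpy as np

def sliding_window_responses(image, window, stride, score_fn):
    """Score every window position and return a 2-D response map."""
    height, width = image.shape
    win_h, win_w = window
    rows = (height - win_h) // stride + 1
    cols = (width - win_w) // stride + 1
    responses = np.zeros((rows, cols))
    for r in range(rows):
        for c in range(cols):
            y, x = r * stride, c * stride
            patch = image[y:y + win_h, x:x + win_w]
            responses[r, c] = score_fn(patch)
    return responses

def mean_intensity(patch):
    # Hypothetical stand-in for a real classifier score.
    return float(patch.mean())

# Toy image: a bright 4x4 square centered in an 8x8 frame.
image = np.zeros((8, 8))
image[2:6, 2:6] = 1.0
response_map = sliding_window_responses(image, (4, 4), 2, mean_intensity)
```

Each entry of `response_map` depends only on pixels inside its own window, which is the limitation the context features in this talk are meant to address.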

Context in Computer Vision. High Level Context: Semantic Context, Geometric Context. Low Level Context: Pixel Context, Shape Context. Related work: Murphy et al, 2003; Hoiem et al, 2006; Avidan, 2006; Shotton et al, 2006; Rabinovich et al, 2007; Oliva & Torralba, 2007; Heitz & Koller, 2008; Desai et al, 2009; Divvala et al, 2009; Li, Socher & Fei-Fei, 2009; Marszalek et al, 2009; Bao & Savarese, 2010; Yao & Fei-Fei, 2010; Tu & Bai, 2010; Li, Parikh & Chen, 2011; Wolf & Bileschi, 2006; Belongie et al, 2000. [Rabinovich et al, 2007] [Yao & Fei-Fei, 2010] [Hoiem et al, 2006]

Classification Context for Segmentation: SpatialBoost [Avidan, 2006] and Auto-context [Tu & Bai, 2010] integrate classifier responses from nearby individual pixels for pixel-level segmentation or labeling.

Classification Context for Object Detection: Contextual Boost [Ding & Xiao, 2012] directly uses the detector responses. Diagram labels: Image Context (multi-scale HOG-LDP for each scan window); AdaBoost classification based on image context; classification responses at scale and spatial neighborhood; AdaBoost classification based on augmented context (image context + classification responses); Contextual Boost.

Co-Occurrence Context Can we further exploit co-occurrence information given only detectors for a single object type?

Co-Occurrence Context Co-Occurrence from Detector Response Map.

Our Contribution An Effective and Efficient Multi-Order Co-Occurrence Context Representation Using a Single Object Detector.

Multi-Order Contextual Co-Occurrence (MOCO). 0th order: Classification Context. 1st order: Randomized Binary Comparison. High order: Co-Occurrence Descriptor.

Constructing MOCO

0th Order Context: Directly Using Classifier Responses. Classifier response maps are shown for window widths of 25, 50, and 100 pixels.

0th Order Context: Define Scale and Space Neighborhood. The neighborhood of a point P spans spatial position (x, y) and scale (l).
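The scale and space neighborhood can be sketched as a small 3-D block cut from a stack of response maps. This is only an illustration under stated assumptions: the response maps are assumed to have been resampled to a common grid, the function name and the `radius_xy` / `radius_l` parameters are hypothetical, and zero padding at the borders is a choice made here, not something the slides specify.

```python
import numpy as np

def zeroth_order_context(responses, x, y, l, radius_xy=1, radius_l=1):
    """Collect classifier responses around P = (x, y, l).

    responses: array of shape (num_scales, height, width), assumed to be
    resampled to a common grid. Borders are zero padded (an assumption).
    """
    padded = np.pad(
        responses,
        ((radius_l, radius_l), (radius_xy, radius_xy), (radius_xy, radius_xy)),
        mode="constant",
    )
    # After padding, original index i sits at i + radius, so the slice
    # [i, i + 2 * radius] is centered on the original position.
    block = padded[
        l:l + 2 * radius_l + 1,
        y:y + 2 * radius_xy + 1,
        x:x + 2 * radius_xy + 1,
    ]
    return block.ravel()

# Toy stack: 3 scales of 3x3 response maps with distinct nonzero values.
responses = np.arange(1, 28, dtype=float).reshape(3, 3, 3)
center_context = zeroth_order_context(responses, x=1, y=1, l=1)
corner_context = zeroth_order_context(responses, x=0, y=0, l=0)
```

The flattened vector always has the same length, so it can be appended to the per-window features regardless of where P lies in the pyramid.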

1st Order Context: Comparison of Response Values around P

1st Order Context Randomized Arrangement
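One way to read "randomized binary comparison" is as a fixed set of randomly chosen position pairs inside the neighborhood, each contributing one bit. The sketch below follows that reading. The function names, the number of pairs, and the uniform sampling scheme are assumptions for illustration and may differ from the arrangements evaluated in the talk.

```python
import numpy as np

def sample_comparison_pairs(neighborhood_size, num_pairs, seed=0):
    """Draw index pairs once, so every window uses the same arrangement."""
    rng = np.random.default_rng(seed)
    return rng.integers(0, neighborhood_size, size=(num_pairs, 2))

def first_order_context(neighborhood, pairs):
    """One bit per pair: 1 if the first response exceeds the second."""
    values = np.asarray(neighborhood, dtype=float)
    return (values[pairs[:, 0]] > values[pairs[:, 1]]).astype(np.uint8)

# Random arrangement over a 27-element neighborhood (toy size).
pairs = sample_comparison_pairs(neighborhood_size=27, num_pairs=16)
bits = first_order_context(np.linspace(0.0, 1.0, 27), pairs)

# Hand-picked pairs to make the comparison rule easy to check.
manual_pairs = np.array([[0, 1], [1, 0], [2, 2]])
manual_bits = first_order_context([0.9, 0.1, 0.5], manual_pairs)
```

Because only the ordering of responses matters, the resulting bits are unaffected by any monotonic rescaling of the detector scores.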

High Order Context 1. Closeness Vector 2. Histogram

High Order Context 3. High Order Representation Tensor Product of Normalized Histogram
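The three steps named on these slides (closeness vector, histogram, tensor product of the normalized histogram) can be sketched as follows. The slides do not define the closeness measure, so `1 - |r_i - r_P|` on responses clipped to [0, 1] is purely an assumption, as are the bin count and the choice to take the tensor product of the histogram with itself.

```python
import numpy as np
from functools import reduce

def closeness_vector(neighborhood, center_value):
    """Assumed closeness: 1 - |r_i - r_P| for responses clipped to [0, 1]."""
    values = np.clip(np.asarray(neighborhood, dtype=float), 0.0, 1.0)
    center = float(np.clip(center_value, 0.0, 1.0))
    return 1.0 - np.abs(values - center)

def normalized_histogram(values, num_bins=4):
    """Histogram over [0, 1], normalized to sum to 1 when non-empty."""
    counts, _ = np.histogram(values, bins=num_bins, range=(0.0, 1.0))
    total = counts.sum()
    return counts / total if total > 0 else counts.astype(float)

def high_order_descriptor(hist, order=2):
    """Tensor (outer) product of the histogram with itself, `order` times."""
    return reduce(np.multiply.outer, [hist] * order)

closeness = closeness_vector([0.5, 0.5, 0.1, 0.9], center_value=0.5)
hist = normalized_histogram(closeness, num_bins=4)
descriptor = high_order_descriptor(hist, order=2)
```

With `num_bins` bins and order `k` the descriptor has `num_bins ** k` entries, which is why the talk treats the high-order context dimension as a parameter to study.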

Detection Evolution: Bootstrap training samples using detector responses from the previous iteration, and add MOCO context from the previous iteration as additional features.
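The iterative structure described here can be sketched as a loop in which each round retrains on the base features plus context computed from the previous round's responses. This is a toy under heavy assumptions: a least-squares linear scorer stands in for the real detector, windows are laid out in a 1-D row, the context is just the left and right neighbor responses rather than MOCO, and the bootstrapping of training samples is omitted.

```python
import numpy as np

def fit_linear(features, labels):
    """Least-squares linear scorer with bias (stand-in for the detector)."""
    design = np.hstack([features, np.ones((len(features), 1))])
    weights, *_ = np.linalg.lstsq(design, labels, rcond=None)
    return weights

def predict(features, weights):
    design = np.hstack([features, np.ones((len(features), 1))])
    return design @ weights

def neighbor_context(responses):
    """Toy context: responses of the left and right neighboring windows."""
    padded = np.pad(responses, 1, mode="edge")
    return np.stack([padded[:-2], padded[2:]], axis=1)

def detection_evolution(base_features, labels, num_iterations=3):
    """Retrain each round with context from the previous round's responses."""
    features = base_features
    history = []
    for _ in range(num_iterations):
        weights = fit_linear(features, labels)
        responses = predict(features, weights)
        history.append(responses)
        # Next round sees the base features plus context features.
        features = np.hstack([base_features, neighbor_context(responses)])
    return history

rng = np.random.default_rng(0)
base_features = rng.normal(size=(20, 3))
labels = (np.arange(20) >= 10).astype(float)  # positives cluster spatially
history = detection_evolution(base_features, labels, num_iterations=3)
errors = [float(np.sum((r - labels) ** 2)) for r in history]
```

In this toy the second round can never fit the training labels worse than the first, because its feature set contains the first round's features; whether that carries over to held-out data is exactly what the experiments on iterations examine.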

Baseline Detector Any Object Detection Algorithm Can be Used as Baseline Detector.

Baseline used here: Deformable-Parts-Model [Felzenszwalb et al, 2010]. Inner Context: the parts model encodes the relationship between parts. Outer Context: MOCO deals with co-occurrence among scanning windows.

Experiments. Datasets: PASCAL VOC 2007 (20 object categories); Caltech Pedestrian. Deformable-Parts-Model: default setting (3 components, each with 1 root and 8 part filters).

Experiment – 1st Order 1st Order & Context Neighbor Size

Experiment – 1st Order Pairwise Comparison: Arrangements

Experiment – High Order High Order Context Dimension

Experiment – Combinations Iterations

Comparison on Caltech Dataset

Comparison on PASCAL’07 Mean AP on 20 Categories

Conclusion: An efficient context representation that relies only on detectors for a single object type, combined with the Deformable Parts Model to model both inner and outer context around the detection window. Future Work: Exploit context with detectors of multiple object types?

Thanks for your attention! Questions?