Presentation transcript:

Integrating Context and Occlusion for Car Detection by Hierarchical And-Or Model
Bo Li (1,2), Tianfu Wu (2), Song-Chun Zhu (2)
(1) Beijing Institute of Technology, (2) University of California, Los Angeles (UCLA)
Good morning, everyone, thank you for coming to my presentation.

Car Detection
E.g., in PASCAL VOC: not much occlusion (D. Hoiem et al., ECCV 2012)
Occlusion and Car-to-Car Context in Car Detection
  KITTI (A. Geiger et al., CVPR 2012)
  Street Parking (B. Li et al., ICCV 2013)
  Parking Lot (self-collected in this paper)

Literature Review
Single Object Model & Occlusion Modeling
  Felzenszwalb et al., TPAMI (2010)
  Girshick et al., In: NIPS (2011)
  Zhu et al., In: CVPR (2010)
  Pepik et al., In: CVPR (2012) & CVPR (2013)
  Li et al., In: ICCV (2013)
  Song et al., In: CVPR (2013)
Object-Pair & Visual Phrase Models
  Li et al., PR 47, 3254-3264 (2014)
  Tang et al., In: BMVC (2012)
  Ouyang, W. & Wang, X., In: CVPR (2013)
  Pepik et al., In: CVPR (2013)
  Sadeghi, M. & Farhadi, A., In: CVPR (2011)
Context Model
  Hoiem et al., IJCV (2008)
  Desai et al., IJCV (2011)
  Tu, Z. & Bai, X., TPAMI (2010)
  Chen et al., In: CVPR (2013)
  Yang et al., In: CVPR (2012)

Regularity and Reconfigurability from the object detection grammar perspective
1. Occlusions have different regularities (e.g., consistently visible parts for a given viewpoint).
2. Car-to-car layouts (i.e., N cars together) with reconfigurable single cars and car parts can address the intra-class variations caused by occlusions.

Modeling Occlusion by Discriminative And-Or Structures
B. Li et al., ICCV 2013
S.C. Zhu and D. Mumford, FTCGV 2006

Results on Street-Parking-Car Dataset

Representation: Hierarchical and Reconfigurable And-Or Models
Integrating Occlusion and Context
Car-to-Car Context: …

Representation: Hierarchical and Reconfigurable And-Or Models

Integrating Occlusion and Context
(1) Learning the And-Or structure by mining car-to-car context patterns and viewpoint-occlusion patterns
(2) Learning the parameters for appearance, deformation and bias

Learning: Mining Car-to-Car Context Patterns
Context Feature Vector
Clustering: each cluster corresponds to an And-node at Layer 1
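
The clustering step can be illustrated with a minimal sketch. The slide does not preserve the definition of the context feature vector, so the `context_feature` function below is a hypothetical stand-in for a 2-car example (relative center offset and log size ratio of the second box with respect to the first). Plain k-means is also an assumption; the clustering algorithm actually used is not stated here.

```python
import numpy as np

def context_feature(box_a, box_b):
    """Hypothetical 2-car context feature; boxes are (x1, y1, x2, y2)."""
    ax, ay = (box_a[0] + box_a[2]) / 2.0, (box_a[1] + box_a[3]) / 2.0
    bx, by = (box_b[0] + box_b[2]) / 2.0, (box_b[1] + box_b[3]) / 2.0
    aw, ah = box_a[2] - box_a[0], box_a[3] - box_a[1]
    bw, bh = box_b[2] - box_b[0], box_b[3] - box_b[1]
    # Offset of car B relative to car A, normalized by A's size, plus log size ratios.
    return np.array([(bx - ax) / aw, (by - ay) / ah,
                     np.log(bw / aw), np.log(bh / ah)])

def assign(features, centers):
    """Index of the nearest center for every feature vector."""
    dists = np.linalg.norm(features[:, None, :] - centers[None, :, :], axis=2)
    return dists.argmin(axis=1)

def kmeans(features, k, iters=50, seed=0):
    """Plain k-means (assumed here only for illustration)."""
    rng = np.random.default_rng(seed)
    centers = features[rng.choice(len(features), size=k, replace=False)]
    for _ in range(iters):
        labels = assign(features, centers)
        new_centers = np.array([
            features[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return assign(features, centers), centers

# Toy 2-car layouts: two side-by-side pairs and two front-to-back pairs.
pairs = [
    ((0, 0, 100, 50), (110, 0, 210, 50)),
    ((0, 0, 100, 50), (105, 2, 205, 52)),
    ((0, 0, 100, 50), (20, -30, 100, 10)),
    ((0, 0, 100, 50), (22, -28, 102, 12)),
]
features = np.stack([context_feature(a, b) for a, b in pairs])
labels, centers = kmeans(features, k=2)
# Each resulting cluster would play the role of one And-node at Layer 1.
print(labels)
```

In this toy setup the two layout types end up in different clusters; with real data the number of clusters would be a design choice that the transcript does not specify.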

Learning: Mining Viewpoint-Occlusion Patterns
For a single car example, we use the maximum overlap ratio between it and its neighboring cars as its occlusion ratio.
Two methods to model occlusion patterns:
  Greedy part selection (Felzenszwalb et al., PAMI 2010)
  And-Or Structure Learning
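
A minimal sketch of the occlusion ratio described above. The exact formula for the overlap ratio is not preserved in the transcript; the sketch assumes it is the intersection area divided by the area of the car's own bounding box, and the function names are hypothetical.

```python
def intersection_area(a, b):
    """Area of overlap between two boxes given as (x1, y1, x2, y2)."""
    w = min(a[2], b[2]) - max(a[0], b[0])
    h = min(a[3], b[3]) - max(a[1], b[1])
    return max(w, 0.0) * max(h, 0.0)

def occlusion_ratio(car, neighbors):
    """Maximum overlap ratio between `car` and any neighboring car.

    Assumption: overlap ratio = intersection area / area of `car`.
    """
    area = (car[2] - car[0]) * (car[3] - car[1])
    if area <= 0 or not neighbors:
        return 0.0
    return max(intersection_area(car, n) for n in neighbors) / area

car = (0, 0, 100, 50)
neighbors = [(80, 0, 180, 50), (300, 0, 400, 50)]
ratio = occlusion_ratio(car, neighbors)
print(ratio)  # the first neighbor covers a 20 x 50 strip of a 100 x 50 box
```

Taking the maximum rather than the sum means a car is characterized by its single most occluding neighbor, which matches the wording on the slide.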

And-Or Structure Learning
Generating CAD Examples & Building Data Matrix
Information Projection & Graph Compression
Z. Si & S.C. Zhu, TPAMI (2013)

Inference: the DP algorithm
Bottom-up pass (depth-first search): compute appearance & deformation score maps for And-Or nodes
Top-down pass (breadth-first search): retrieve parse trees with scores above the threshold
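
The two passes can be sketched on a toy And-Or graph. This is an illustration under simplifying assumptions, not the authors' implementation: each node carries a single scalar score instead of a score map over image positions, deformation is ignored, And-nodes sum their children, Or-nodes take the best child, and the graph, node names and scores are all hypothetical.

```python
from collections import deque

# Hypothetical toy graph: name -> (node type, children or terminal score).
GRAPH = {
    "root": ("or", ["single_car", "two_cars"]),
    "single_car": ("and", ["front", "rear"]),
    "two_cars": ("and", ["visible_car", "occluded_car"]),
    "visible_car": ("or", ["front_view", "side_view"]),
    "front": ("terminal", 0.4),
    "rear": ("terminal", 0.3),
    "front_view": ("terminal", 0.2),
    "side_view": ("terminal", 0.9),
    "occluded_car": ("terminal", 0.5),
}

def bottom_up(graph, node, scores):
    """Depth-first pass: fill `scores` for `node` and everything below it."""
    if node in scores:
        return scores[node]
    kind, payload = graph[node]
    if kind == "terminal":
        score = payload
    elif kind == "and":
        score = sum(bottom_up(graph, child, scores) for child in payload)
    else:  # Or-node: keep the best alternative
        score = max(bottom_up(graph, child, scores) for child in payload)
    scores[node] = score
    return score

def top_down(graph, root, scores):
    """Breadth-first pass: follow all And-children and the best Or-child."""
    parse, queue = [], deque([root])
    while queue:
        node = queue.popleft()
        parse.append(node)
        kind, payload = graph[node]
        if kind == "and":
            queue.extend(payload)
        elif kind == "or":
            queue.append(max(payload, key=scores.__getitem__))
    return parse

def retrieve_parse(graph, root, threshold):
    """Return (score, parse tree nodes), or None if the score is too low."""
    scores = {}
    best = bottom_up(graph, root, scores)
    if best <= threshold:
        return None
    return best, top_down(graph, root, scores)

result = retrieve_parse(GRAPH, "root", threshold=1.0)
print(result)
```

In the full model the same recursion would run over score maps, so the top-down pass would retrieve one parse tree per image location whose root score exceeds the threshold.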

Parameter Learning
Weak-Label Structural SVM (WLSSVM), Girshick et al., In: NIPS (2011)
1) During learning, we run the DP inference to find the optimal parse tree.
2) For an N-car example, the loss function is enforced to consider the N car bounding boxes together.

Experimental Results
Public datasets: KITTI, Street Parking, Pascal VOC 2007
Self-collected Parking Lot dataset:
  More cars per image
  More car-to-car context
  More occlusion
  More diverse viewpoints

Results: KITTI & Pascal VOC 2007
Settings: training and testing by splitting the trainset; training on half of the trainset and testing on the benchmark
Methods               Easy     Moderate  Hard
mBoW                  36.02%   23.76%    18.44%
LSVM-MDPM-us          66.53%   55.42%    41.04%
LSVM-MDPM-sv          68.02%   56.48%    44.18%
MDPM-un-BB            71.19%   62.16%    48.43%
OC-DPM                74.94%   65.95%    53.86%
DPM (trained by us)   77.24%   56.02%    43.14%
AOG                   80.26%   67.03%    55.60%
In the first experiment, we test on Pascal VOC 2007.

Results: KITTI
Legend: True Positive, False Alarm, Missing Detection

Results: Street Parking
Quantitative Results
Method   DPM-V5   And-Or Structure   Our AOG1   Our AOG2
AP       52.0%    57.8%              62.1%      65.3%
Qualitative Results: Part Layouts

Results: Parking Lot
Evaluation Settings
  Overlap threshold: 0.60
  Minimum area: 1000
  Minimum height: 24
Qualitative Results
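
A rough sketch of how the evaluation settings above might be applied. Several details are assumptions because the slide does not state them: the overlap is taken to be intersection-over-union, area and height are taken to be in pixels, and ground-truth boxes below the minimum size are simply skipped. The helper names are hypothetical.

```python
OVERLAP_THRESHOLD = 0.60  # from the slide
MIN_AREA = 1000           # from the slide; pixel units are an assumption
MIN_HEIGHT = 24           # from the slide; pixel units are an assumption

def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    w = min(a[2], b[2]) - max(a[0], b[0])
    h = min(a[3], b[3]) - max(a[1], b[1])
    inter = max(w, 0.0) * max(h, 0.0)
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0

def is_evaluated(box):
    """Keep only ground-truth boxes that meet the minimum size settings."""
    width, height = box[2] - box[0], box[3] - box[1]
    return width * height >= MIN_AREA and height >= MIN_HEIGHT

def is_true_positive(detection, ground_truths):
    """A detection counts if it overlaps some evaluated ground-truth box enough."""
    return any(iou(detection, gt) >= OVERLAP_THRESHOLD
               for gt in ground_truths if is_evaluated(gt))

ground_truths = [(0, 0, 100, 50), (200, 0, 230, 20)]  # the second box is too small
print(is_true_positive((10, 0, 110, 50), ground_truths))   # IoU = 4500 / 5500
print(is_true_positive((40, 0, 140, 50), ground_truths))   # IoU = 3000 / 7000
print(is_true_positive((200, 0, 230, 20), ground_truths))  # matches only a skipped box
```

A full evaluation would also enforce one-to-one matching between detections and ground truth before computing AP; that bookkeeping is omitted here.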

Conclusion
  A flexible & reconfigurable And-Or model
  Mines context and occlusion patterns
  A new parking lot dataset
Future Work
  Apply to other categories
  New ways to mine context & occlusion patterns
  Integrating other context cues

Thank you! Questions?
Poster: P4B-46
The code and data will be available on the following project page: http://www.stat.ucla.edu/~tfwu/project/OcclusionModeling.htm