Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman Department of Computer Science, University of Texas at Austin.

Slides:

Advertisements

Similar presentations

Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)

Advertisements

Combining Detectors for Human Hand Detection Antonio Hernández, Petia Radeva and Sergio Escalera Computer Vision Center, Universitat Autònoma de Barcelona,

Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features.

Three things everyone should know to improve object retrieval

Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.

Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.

Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.

Lecture 31: Modern object recognition

Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.

AdaBoost & Its Applications

Boundary Preserving Dense Local Regions

Face detection Many slides adapted from P. Viola.

EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.

Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.

Groups of Adjacent Contour Segments for Object Detection Vittorio Ferrari Loic Fevrier Frederic Jurie Cordelia Schmid.

Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA

Detecting Pedestrians by Learning Shapelet Features

More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.

Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.

Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson

Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.

Segmentation Divide the image into segments. Each segment:

On the Object Proposal Presented by Yao Lu

© 2013 IBM Corporation Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments Presented by Shashank Mujumdar IBM Research,

Face Detection CSE 576. Face detection State-of-the-art face detection demo (Courtesy Boris Babenko)Boris Babenko.

Global and Efficient Self-Similarity for Object Classification and Detection CVPR 2010 Thomas Deselaers and Vittorio Ferrari.

Generic object detection with deformable part-based models

Jifeng Dai 2011/09/27.  Introduction  Structural SVM  Kernel Design  Segmentation and parameter learning  Object Feature Descriptors  Experimental.

“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)

Detecting Curved Symmetric Parts using a Deformable Disc Model Tom Sie Ho Lee, University of Toronto Sanja Fidler, TTI Chicago Sven Dickinson, University.

Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.

Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.

Face detection Slides adapted Grauman & Liebe’s tutorial

Visual Object Recognition

Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.

Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman University of Texas at Austin Jingnan.

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Pedestrian Detection and Localization

Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.

BING: Binarized Normed Gradients for Objectness Estimation at 300fps

Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.

Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.

Chao-Yeh Chen and Kristen Grauman University of Texas at Austin Efficient Activity Detection with Max- Subgraph Search.

Project 3 Results.

Object detection, deep learning, and R-CNNs

CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.

Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.

Recognition Using Visual Phrases

FACE DETECTION : AMIT BHAMARE. WHAT IS FACE DETECTION ? Face detection is computer based technology which detect the face in digital image. Trivial task.

Fast Query-Optimized Kernel Machine Classification Via Incremental Approximate Nearest Support Vectors by Dennis DeCoste and Dominic Mazzoni International.

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

BEYOND SLIDING WINDOW: Object Localization by Efficient Subwindow Search Christoph H. Lampert, Matthew B. Blaschko, and Thomas Hofmann.

More sliding window detection: Discriminative part-based models

Spatial Localization and Detection

Scene Parsing with Object Instances and Occlusion Ordering JOSEPH TIGHE, MARC NIETHAMMER, SVETLANA LAZEBNIK 2014 IEEE CONFERENCE ON COMPUTER VISION AND.

1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.

Recent developments in object detection

Object detection with deformable part-based models

Boosted Augmented Naive Bayes. Efficient discriminative learning of

Lit part of blue dress and shadowed part of white dress are the same color

Nonparametric Semantic Segmentation

Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.

Object detection as supervised classification

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science

A Tutorial on HOG Human Detection

Object-Graphs for Context-Aware Category Discovery

“The Truth About Cats And Dogs”

Outline Background Motivation Proposed Model Experimental Results

RCNN, Fast-RCNN, Faster-RCNN

Presentation transcript:

Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman Department of Computer Science, University of Texas at Austin Motivation Main Idea Efficient Region Search (ERS) 1. A rectangle is imprecise Results (code Background: Linear SVM with BoW 2. Extra features in a window can mislead the detector Goal: Identify the best-scoring region--- the subset of spatially contiguous subregions whose features will maximize a classifier’s score. Naïve approach would require exponential time. Our optimal solution leads to significantly more accurate results on this challenging dataset. ERS search times similar to ESS, and orders of magnitude faster than sliding windows. Unlike ESS, ERS permits pixel-level detections of any shape. Detection overlap accuracy on PASCAL 2008 compared to the global connectivity CRF [Nowozin et al. CVPR 2009] Contour strengths Given a test image, we construct a region-graph on an oversegmentation: Maximum-Weight Connected Subgraph (MWCS) Problem Region-graph Prize-collecting Steiner tree (PCST) problem: connected subgraph that maximizes sum of vertex weights minus (positive) edge costs Convert MWCS  PCST: subtract the smallest vertex weight from all vertex and edge weights. Point feature words: SURF within the superpixel Shape feature words: HoG on whole superpixel Branch-and-Cut Solution Branch-and-cut algorithm for PCST [Ljubic et al. ‘06] to obtain best scoring region: Optimal solutions Efficient in practice (100s of nodes) Efficient Region Search with Contours (ERS-C) A variant of ERS to help exclude background regions Training: Learning the Weights Vertex weights are obtained from SVM weights for: Our goal is to determine the arbitrarily shaped region within a novel image that maximizes the score: Region-graph Oversegmentation MWCS instance Point descriptors Shape descriptors Bag of features SVM PCST instance PCST instance Branch-and-cut solution Best-scoring region negative features, - positive features Our Approach 4 Main contribution: We show how to obtain the best-scoring region efficiently with a branch-and-cut solution. Applicable to classifiers whose total score is sum of localized feature scores (e.g., linear SVM, Naïve Bayes NN, boosting). Visual word histogram weights – linear SVM on segmented examples Bag-of-contours histogram weights – structured SVM Datasets Baselines Efficient Subwindow Search (ESS) [Lampert et al. 2008] Global connectivity CRF [Nowozin et al. 2009] Evaluation metrics Pixel-level AP, PASCAL bounding box metric, overlap scores ETHZ Shapes: 5 classes PASCAL 2008 seg: 20 classes PASCAL 2007: cat, dog PASCAL 2008 seg While windows over/underestimate object, ERS allows precise arbitrarily-shaped detections. Pixel-level precision recall curves on PASCAL 2007 (cat, dog) and ETHZ for our approach and ESS ERS more accurate than ESS, even under bounding box metric (19-70% better). Shape features excel on ETHZ; region detection crucial for “non-boxy” objects. Comparison with ESS Comparison with CRF Computation Time An efficient branch-and-cut method for region-based detection Demonstrated its advantages over both window-based detection and a CRF model In future work, we will examine the alternate classifiers accepted by our model. Conclusions Object detection via exhaustive search is too expensive. Branch-and-bound schemes can limit the search (Lampert et al. ’08, Lehmann et al. ’09, Yeh et al. ’09), but existing methods are restricted to rectangular or simple polygonal candidate windows. Problem: Divide image into superpixels and construct region-graph Weight each superpixel vertex by classifier output on its features Branch-and-cut to find best connected subgraph Maximum-weight connected subgraph → Prize- collecting Steiner tree problem As noted by Lampert et al. ‘08, for a linear SVM and bag-of- words, the classifier response for a region R can be written as sum of its N features’ word weights: Num occurrences of j-th word SVM weight for j-th word SVM weight for i-th feature’s word Identify the connected subgraph R* whose summed vertex weights are maximal. Edges set by adjacency, and to impose spatial layout. Class-specific edge weights via bag-of-contour strengths Example Detections PASCAL 2007 ETHZ (point/shape features) - neg features - pos features