More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.

Slides:



Advertisements
Similar presentations
Latent SVMs for Human Detection with a Locally Affine Deformation Field Ľubor Ladický 1 Phil Torr 2 Andrew Zisserman 1 1 University of Oxford 2 Oxford.
Advertisements

Presenter: Duan Tran (Part of slides are from Pedro’s)
Human Detection Phanindra Varma. Detection -- Overview  Human detection in static images is based on the HOG (Histogram of Oriented Gradients) encoding.
Combining Detectors for Human Hand Detection Antonio Hernández, Petia Radeva and Sergio Escalera Computer Vision Center, Universitat Autònoma de Barcelona,
Jan-Michael Frahm, Enrique Dunn Spring 2013
Ľubor Ladický1 Phil Torr2 Andrew Zisserman1
Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights.
Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.
Lecture 31: Modern object recognition
Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.
Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs CVPR 2005 Another Descriptor.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Steerable Part Models Hamed Pirsiavash and Deva Ramanan
Intro to DPM By Zhangliliang. Outline Intuition Introduction to DPM Model Inference(matching) Training latent SVM Training Procedure Initialization Post-processing.
Advisers: Prof. C.V. Jawahar Prof. A. P.Zisserman 3rd August 2011
Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/10/12.
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
Viola/Jones: features “Rectangle filters” Differences between sums of pixels in adjacent rectangles { y t (x) = +1 if h t (x) >  t -1 otherwise Unique.
Detecting Pedestrians by Learning Shapelet Features
Student: Yao-Sheng Wang Advisor: Prof. Sheng-Jyh Wang ARTICULATED HUMAN DETECTION 1 Department of Electronics Engineering National Chiao Tung University.
DISCRIMINATIVE DECORELATION FOR CLUSTERING AND CLASSIFICATION ECCV 12 Bharath Hariharan, Jitandra Malik, and Deva Ramanan.
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Object Recognizing We will discuss: Features Classifiers Example ‘winning’ system.
Object Detection using Histograms of Oriented Gradients
Lecture 29: Recent work in recognition CS4670: Computer Vision Noah Snavely.
Global and Efficient Self-Similarity for Object Classification and Detection CVPR 2010 Thomas Deselaers and Vittorio Ferrari.
Generic object detection with deformable part-based models
Object Recognizing. Object Classes Individual Recognition.
Object Recognizing. Recognition -- topics Features Classifiers Example ‘winning’ system.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Marco Pedersoli, Jordi Gonzàlez, Xu Hu, and Xavier Roca
Visual Object Recognition
Object Detection with Discriminatively Trained Part Based Models
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Pedestrian Detection and Localization
O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.
Recognition II Ali Farhadi. We have talked about Nearest Neighbor Naïve Bayes Logistic Regression Boosting.
Training and Evaluating of Object Bank Models Presenter : Changyu Liu Advisor : Prof. Alex Interest : Multimedia Analysis May 16 th, 2013.
Object detection, deep learning, and R-CNNs
Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs CVPR 2005 Another Descriptor.
CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.
Object Detection Overview Viola-Jones Dalal-Triggs Deformable models Deep learning.
Pictorial Structures and Distance Transforms Computer Vision CS 543 / ECE 549 University of Illinois Ian Endres 03/31/11.
Week 10 Emily Hand UNR.
Object Recognizing. Object Classes Individual Recognition.
CS 2750: Machine Learning Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh February 17, 2016.
More sliding window detection: Discriminative part-based models
Object Recognizing. Object Classes Individual Recognition.
A Discriminatively Trained, Multiscale, Deformable Part Model Yeong-Jun Cho Computer Vision and Pattern Recognition,2008.
Fine-grained Fine-grained Recognition( 细粒度分类 ) 沈志强.
Strong Supervision From Weak Annotation Interactive Training of Deformable Part Models ICCV /05/23.
Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Cascade for Fast Detection
Object detection with deformable part-based models
Data Driven Attributes for Action Detection
Lit part of blue dress and shadowed part of white dress are the same color
Object detection, deep learning, and R-CNNs
Recap: Advanced Feature Encoding
Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.
Object detection as supervised classification
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
A Tutorial on HOG Human Detection
HOGgles Visualizing Object Detection Features
An HOG-LBP Human Detector with Partial Occlusion Handling
“The Truth About Cats And Dogs”
Presentation transcript:

More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb

Challenge: Generic object detection

Pedestrian detection Features: Histograms of oriented gradients (HOG) Partition image into 8x8 pixel blocks and compute histogram of gradient orientations in each block Learn a pedestrian template using a linear support vector machine At test time, convolve feature map with template N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, CVPR 2005Histograms of Oriented Gradients for Human Detection Template HOG feature mapDetector response map

Discriminative part-based models P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, Object Detection with Discriminatively Trained Part Based Models, PAMI 32(9), 2010Object Detection with Discriminatively Trained Part Based Models Root filter Part filters Deformation weights

Object hypothesis Multiscale model: the resolution of part filters is twice the resolution of the root

Scoring an object hypothesis The score of a hypothesis is the sum of filter scores minus the sum of deformation costs Filters Subwindow features Deformation weights Displacements

Scoring an object hypothesis The score of a hypothesis is the sum of filter scores minus the sum of deformation costs Recall: pictorial structures Matching cost Deformation cost Filters Subwindow features Deformation weights Displacements

Scoring an object hypothesis The score of a hypothesis is the sum of filter scores minus the sum of deformation costs Concatenation of filter and deformation weights Concatenation of subwindow features and displacements Filters Subwindow features Deformation weights Displacements

Detection Define the score of each root filter location as the score given the best part placements:

Detection Define the score of each root filter location as the score given the best part placements: Efficient computation: generalized distance transforms For each “default” part location, find the best- scoring displacement Head filter Head filter responsesDistance transform

Detection

Matching result

Training Training data consists of images with labeled bounding boxes Need to learn the filters and deformation parameters

Training Our classifier has the form w are model parameters, z are latent hypotheses Latent SVM training: Initialize w and iterate: Fix w and find the best z for each training example (detection) Fix z and solve for w (standard SVM training) Issue: too many negative examples Do “data mining” to find “hard” negatives

Car model Component 1 Component 2

Car detections

Person model

Person detections

Cat model

Cat detections

Bottle model

More detections

Quantitative results (PASCAL 2008) 7 systems competed in the 2008 challenge Out of 20 classes, first place in 7 classes and second place in 8 classes BicyclesPersonBird Proposed approach