O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang.

Slides:

Advertisements

Similar presentations

Rich feature Hierarchies for Accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitandra Malik (UC Berkeley)

Advertisements

Presenter: Duan Tran (Part of slides are from Pedro’s)

Human Detection Phanindra Varma. Detection -- Overview  Human detection in static images is based on the HOG (Histogram of Oriented Gradients) encoding.

Combining Detectors for Human Hand Detection Antonio Hernández, Petia Radeva and Sergio Escalera Computer Vision Center, Universitat Autònoma de Barcelona,

Jan-Michael Frahm, Enrique Dunn Spring 2013

Histograms of Oriented Gradients for Human Detection

Ľubor Ladický1 Phil Torr2 Andrew Zisserman1

Efficient Large-Scale Structured Learning

Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights.

Lecture 31: Modern object recognition

Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.

Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,

Intro to DPM By Zhangliliang. Outline Intuition Introduction to DPM Model Inference(matching) Training latent SVM Training Procedure Initialization Post-processing.

DeepID-Net: deformable deep convolutional neural network for generic object detection Wanli Ouyang, Ping Luo, Xingyu Zeng, Shi Qiu, Yonglong Tian, Hongsheng.

Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.

Large-Scale Object Recognition with Weak Supervision

More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.

DISCRIMINATIVE DECORELATION FOR CLUSTERING AND CLASSIFICATION ECCV 12 Bharath Hariharan, Jitandra Malik, and Deva Ramanan.

Good morning, everyone, thank you for coming to my presentation.

Object Recognizing We will discuss: Features Classifiers Example ‘winning’ system.

Learning to Segment from Diverse Data M. Pawan Kumar Daphne KollerHaithem TurkiDan Preston.

On the Object Proposal Presented by Yao Lu

Lecture 29: Recent work in recognition CS4670: Computer Vision Noah Snavely.

Generic object detection with deformable part-based models

Object Recognizing. Object Classes Individual Recognition.

Object Recognizing. Recognition -- topics Features Classifiers Example ‘winning’ system.

“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)

Marco Pedersoli, Jordi Gonzàlez, Xu Hu, and Xavier Roca

Face detection Slides adapted Grauman & Liebe’s tutorial

Visual Object Recognition

Object Detection with Discriminatively Trained Part Based Models

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Pedestrian Detection and Localization

Latent SVM 1 st Frame: manually select target Find 6 highest weighted areas in template Area of 16 blocks Train 6 SVMs on those areas Train 1 SVM on entire.

BING: Binarized Normed Gradients for Objectness Estimation at 300fps

Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.

Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.

Recognition II Ali Farhadi. We have talked about Nearest Neighbor Naïve Bayes Logistic Regression Boosting.

Project 3 Results.

Object detection, deep learning, and R-CNNs

CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.

Object Detection Overview Viola-Jones Dalal-Triggs Deformable models Deep learning.

Recognition Using Visual Phrases

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

Object Recognizing. Object Classes Individual Recognition.

CS 2750: Machine Learning Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh February 17, 2016.

More sliding window detection: Discriminative part-based models

A Discriminatively Trained, Multiscale, Deformable Part Model Yeong-Jun Cho Computer Vision and Pattern Recognition,2008.

Fine-grained Fine-grained Recognition( 细粒度分类 ) 沈志强.

Strong Supervision From Weak Annotation Interactive Training of Deformable Part Models ICCV /05/23.

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition arXiv: v4 [cs.CV(CVPR)] 23 Apr 2015 Kaiming He, Xiangyu Zhang, Shaoqing.

Recent developments in object detection

Cascade for Fast Detection

Object detection with deformable part-based models

Data Driven Attributes for Action Detection

Performance of Computer Vision

Lit part of blue dress and shadowed part of white dress are the same color

Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Group Norm for Learning Latent Structural SVMs

A Tutorial on HOG Human Detection

HOGgles Visualizing Object Detection Features

Image Classification.

Object Detection + Deep Learning

Progress report 2019/1/14 PHHung.

Outline Background Motivation Proposed Model Experimental Results

RCNN, Fast-RCNN, Faster-RCNN

Presentation transcript:

O BJECT D ETECTION WITH D ISCRIMINATIVELY T RAINED P ART B ASED M ODELS PRESENTED BY Xiaolong Wang

D ETECTION

C HALLENGE Deformation Part of the Slides From Ross Girshick

C HALLENGE Viewpoint

C HALLENGE Variable structure

C HALLENGE Images from Chaitanya Desai

2-layer Model Deformable D EFORMABLE P ART M ODELS Leo Zhu, CVPR 2010

HOG P YRAMID Root Filter Part Filters

F ORMULATION One root (i=0) + n parts. Model Parameters for HOG HOG Features Model Parameters for Deformation

I NFERENCE

M ULTI - VIEWS

L ATENT O RIENTATION No orientation in PAMI paper (DPM v3) Use latent orientation (DPM v4) Guess what is it? right-facing horse

U NSUPERVISED ORIENTATION CLUSTERING

L ATENT O RIENTATION Inference: Choose the best view and best orientation. Learning: Train the parameters for 3 views, and flip the weights to get 3*2 views.

H OW IMPORTANT IT IS One view:42.1% 3-view: 47.3% 3*2-view: 56.8% For horse:

H OW IMPORTANT IT IS For all classes (DPM v4):

L EARNING Linear Formulation  Putting all features in one vector  Latent variable z represents part locations (and component index for multi-views)

L ATENT SVM

Detection on Positive Samples  Sliding window  Overlap with root-node window > 0.7

L ATENT SVM Hard Negative Mining Carl Vondrick HOGgles, ICCV 2013

L ATENT SVM Hard Negative Mining  Small or no overlap  High detection score Maintaining Sample Cache  Select no more than 500 negative samples per image;  Cache size = 20000

L ATENT SVM Dual Method  Not scalable. Stochastic gradient descent(DPM v4)  Important: Shuffle everytime! LBFGS(DPM v5)  Second-order Newton Method  Faster & better performance

3- STEP I NITIALIZATION Step-1: Only Train Root Filter  positive data (highest overlap)  No hard negative mining Car

3- STEP I NITIALIZATION Step-2: Merg Components  Setting root selection as latent variable

3- STEP I NITIALIZATION Step-3: Initialize Part Filters  Fix part number as 8 (DPM v4/5)  Sliding window, calculate L1/L2 norm of the positive weights.

P OST P ROCESSING Bounding Box Regression  Linear regression for (x1,y1,x2,y2) Non-Maximum Suppression  Pick up high score boxes Context

C ONTEXT Marr Prize 2009 Context SVM,CVPR2010 segDPM,CVPR2013

N UMBERS VOC 2010: 29.6 and 32.2 VOC 2007: 33.7 and 35.4 VOC 2010: segDPM(with tons of things) 40.4

L ARGE - SCALE D ATASET ImageNet 2013 DPM v4 in cpp

S UMMARY Although DPMs is loosing to CNNs, the techniques and small tricks we learned from DPMs help solving many other vision problems.

Q UESTIONS