Poselets Michael Krainin CSE 590V Oct 18, 2011. Person Detection Dalal and Triggs ‘05 – Learn to classify pedestrians vs. background – HOG + linear SVM.

Slides:

Advertisements

Similar presentations

Max-Margin Additive Classifiers for Detection

Advertisements

Poselets: Body Part Detectors trained Using 3D Human Pose Annotations Lubomir Bourdev & Jitendra Malik ICCV 2009.

Contributions A people dataset of 8035 images. Three layer attribute classification framework using poselets. 1 2.

Thomas Berg and Peter Belhumeur

Learning Semantics with Less Supervision

Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.

Jan-Michael Frahm, Enrique Dunn Spring 2013

Limin Wang, Yu Qiao, and Xiaoou Tang

Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.

Ľubor Ladický1 Phil Torr2 Andrew Zisserman1

Classification using intersection kernel SVMs is efficient Joint work with Subhransu Maji and Alex Berg Jitendra Malik UC Berkeley.

Face Alignment with Part-Based Modeling

Lecture 31: Modern object recognition

Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.

Yuanlu Xu Human Re-identification: A Survey.

Viewpoint Invariant Human Re-identification in Camera Networks Using Pose Priors and Subject-Discriminative Features Ziyan Wu, Student Member, IEEE, Yang.

Structural Human Action Recognition from Still Images Moin Nabi Computer Vision Lab. ©IPM - Oct

Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,

Robust Object Tracking via Sparsity-based Collaborative Model

Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.

Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.

Detecting Pedestrians by Learning Shapelet Features

More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.

Student: Yao-Sheng Wang Advisor: Prof. Sheng-Jyh Wang ARTICULATED HUMAN DETECTION 1 Department of Electronics Engineering National Chiao Tung University.

DISCRIMINATIVE DECORELATION FOR CLUSTERING AND CLASSIFICATION ECCV 12 Bharath Hariharan, Jitandra Malik, and Deva Ramanan.

Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson

Object Recognizing We will discuss: Features Classifiers Example ‘winning’ system.

PANDA: Pose Aligned Networks for Deep Attribute Modeling Ning Zhang1;2, Manohar Paluri1, Marc’Aurelio Ranzato1, Trevor Darrell2, Lubomir Bourdev1 1: Facebook.

Generic object detection with deformable part-based models

Describing People: A Poselet-Based Approach to Attribute Classification Lubomir Bourdev 1,2 Subhransu Maji 1 Jitendra Malik 1 1 EECS U.C. Berkeley 2 Adobe.

School of Electronic Information Engineering, Tianjin University Human Action Recognition by Learning Bases of Action Attributes and Parts Jia pingping.

The Three R’s of Vision Jitendra Malik.

“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)

Handwritten digit recognition Jitendra Malik. Handwritten digit recognition (MNIST,USPS) LeCun’s Convolutional Neural Networks variations (0.8%, 0.6%

Face detection Slides adapted Grauman & Liebe’s tutorial

Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Pedestrian Detection and Localization

Object Recognition in Images Slides originally created by Bernd Heisele.

Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.

Object detection, deep learning, and R-CNNs

Histograms of Oriented Gradients for Human Detection(HOG)

Methods for classification and image representation

Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions Bangpeng Yao and Li Fei-Fei Computer Science Department, Stanford.

Human Detection Method Combining HOG and Cumulative Sum based Binary Pattern Jong Gook Ko', Jin Woo Choi', So Hee Park', Jang Hee You', ' Electronics and.

CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.

Face detection Behold a state-of-the-art face detector! (Courtesy Boris Babenko)Boris Babenko slides adapted from Svetlana Lazebnik.

Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations ZUO ZHEN 27 SEP 2011.

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.

Describing People: A Poselet-Based Approach to Attribute Classification.

Object Recognizing. Object Classes Individual Recognition.

9.913 Pattern Recognition for Vision Class9 - Object Detection and Recognition Bernd Heisele.

Face detection Many slides adapted from P. Viola.

Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.

PANDA: Pose Aligned Networks for Deep Attribute Modeling Ning Zhang 1,2 Manohar Paluri 1 Marć Aurelio Ranzato 1 Trevor Darrell 2 Lumbomir Boudev 1 1 Facebook.

Presenter: Jae Sung Park

1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.

Bangpeng Yao1, Xiaoye Jiang2, Aditya Khosla1,

Computer vision: models, learning and inference

Object detection with deformable part-based models

Data Driven Attributes for Action Detection

Paper Presentation: Shape and Matching

Recognizing Humans: Action Recognition

Digit Recognition using SVMS

Object detection as supervised classification

A Tutorial on HOG Human Detection

AHED Automatic Human Emotion Detection

Outline Background Motivation Proposed Model Experimental Results

Presentation transcript:

Poselets Michael Krainin CSE 590V Oct 18, 2011

Person Detection Dalal and Triggs ‘05 – Learn to classify pedestrians vs. background – HOG + linear SVM – Doesn’t account for variations in body pose and viewpoint

Poselets Poselets capture part of the pose from a given viewpoint [Bourdev & Malik, ICCV09]

Poselets But how are we going to create training examples of poselets? [Bourdev & Malik, ICCV09]

How do we train a poselet for a given pose configuration? [Bourdev & Malik, ICCV09]

Finding correspondences at training time Given part of a human pose How do we find a similar pose configuration in the training set? [Bourdev & Malik, ICCV09]

We use keypoints to annotate the joints, eyes, nose, etc. of people Left Hip Left Shoulder Finding correspondences at training time [Bourdev & Malik, ICCV09]

Residual Error Finding correspondences at training time [Bourdev & Malik, ICCV09]

Training poselet classifiers Residual Error: Given a seed patch 2. Find the closest patch for every other person 3. Sort them by residual error 4. Threshold them [Bourdev & Malik, ICCV09]

Training poselet classifiers 1. Given a seed patch 2. Find the closest patch for every other person 3. Sort them by residual error 4. Threshold them 5. Use them as positive training examples to train a linear SVM with HOG features [Bourdev & Malik, ICCV09]

Which poselets should we train? Choose thousands of random windows, generate poselet candidates, train linear SVMs Select a small set of poselets that are: – Individually effective – Complementary [Bourdev & Malik, ICCV09]

Person Detection Using Poselets [12] Felzenszwalb et al. Object Detection with Discriminatively Trained Part Based Models. PAMI 2010.

Other Uses of Poselets Object Segmentation (CVPR 2011) – Predict area using “soft masks” – Deformation to match image edges – Extends poselets to other objects Activity Recognition (CVPR 2011) – Recognition from a single image – Use which poselets fired and to what extent to predict the activity

Describing People: A Poselet-Based Approach to Attribute Classification L. Bourdev, S. Maji, and J. Malik ICCV 2011

Random 1¼% of the test set [Bourdev et al., ICCV11]

How poselets help in high-level vision The image is a complex function of the viewpoint, pose, appearance, etc. Poselets decouple pose and camera view from appearance [Bourdev et al., ICCV11]

Gender recognition with poselets [Bourdev et al., ICCV11]

Attribute Classification Overview Poselet Activations Given a test image [Bourdev et al., ICCV11]

Features Pyramid HOG LAB histogram Skin features Hands-skin Legs-skin Poselet patch B.* C Skin mask Arms mask Features Poselet Activations [Bourdev et al., ICCV11]

Poselet Activations Features Poselet-level Classifiers Poselet Level Classification Linear SVM + Sigmoid Some attributes are associated with a particular body part [Bourdev et al., ICCV11]

Poselet Activations Features Poselet-level Classifiers Person-level Classifiers Person-level ClassificationPerson-level Classification Another Linear SVM + Sigmoid [Bourdev et al., ICCV11]

Poselet Activations Features Poselet-level Classifiers Person-level Classifiers Context-level Classifiers Context-Level Classification [Bourdev et al., ICCV11]

Results

Random 1¼% of the test set [Bourdev et al., ICCV11]

Label Frequency SPMPoselets-No-context Poselets-Full Precision/recall on our test set [Bourdev et al., ICCV11]

Gender Recognition vs. Cognitec Cognitec is one of the leading face recognition companies (according to latest NIST test) CognitecPoselets ScopeFrontal faces onlyAny view FeaturesProprietary biometric Standard HOG Min resolution60pixel face widthAny resolution Performance75.0% AP82.4% AP Upper bound for any gender recognizer based on frontal faces: Max AP=80.5% vs. 82.4% [Bourdev et al., ICCV11]

Highest-scoring “is male” Lowest-scoring “is male”Lowest-scoring “is male” [Bourdev et al., ICCV11]

Highest-scoring “has long hair” Lowest-scoring “has long hair”Lowest-scoring “has long hair” [Bourdev et al., ICCV11]

Highest-scoring “wears a hat” Lowest-scoring “wears a hat”Lowest-scoring “wears a hat” [Bourdev et al., ICCV11]

Highest-scoring “wears glasses” Lowest-scoring “wears glasses”Lowest-scoring “wears glasses” [Bourdev et al., ICCV11]

Highest-scoring “long sleeves” Lowest-scoring “long sleeves”Lowest-scoring “long sleeves” [Bourdev et al., ICCV11]

“100 Special Moments” by Jason Salavon

Highly-Weighted Poselets is-male has-long-hair has-glasses

Which poselets are discriminative for gender? Preferred by human subjects Preferred by our system best worst [Bourdev et al., ICCV11]

“A person with long pants” “A man with short hair and long sleeves” “A woman with long hair, glasses and long pants”(??) Describing people [Bourdev et al., ICCV11]

Poselets website The set of published poselet papers H3D data set + Matlab tools Java3D annotation tool + video tutorial Matlab code to detect people using poselets Latest trained poselets [Bourdev et al., ICCV11]