Contributions A people dataset of 8035 images. Three layer attribute classification framework using poselets. 1 2.

Slides:

Advertisements

Similar presentations

Classification using intersection kernel SVMs is efficient

Advertisements

Poselets: Body Part Detectors trained Using 3D Human Pose Annotations Lubomir Bourdev & Jitendra Malik ICCV 2009.

Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)

Rich feature Hierarchies for Accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitandra Malik (UC Berkeley)

Thomas Berg and Peter Belhumeur

Learning Semantics with Less Supervision

Adding Unlabeled Samples to Categories by Learned Attributes Jonghyun Choi Mohammad Rastegari Ali Farhadi Larry S. Davis PPT Modified By Elliot Crowley.

Combining Detectors for Human Hand Detection Antonio Hernández, Petia Radeva and Sergio Escalera Computer Vision Center, Universitat Autònoma de Barcelona,

Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.

Zhimin CaoThe Chinese University of Hong Kong Qi YinITCS, Tsinghua University Xiaoou TangShenzhen Institutes of Advanced Technology Chinese Academy of.

Classification using intersection kernel SVMs is efficient Joint work with Subhransu Maji and Alex Berg Jitendra Malik UC Berkeley.

Human Action Recognition by Learning Bases of Action Attributes and Parts Bangpeng Yao, Xiaoye Jiang, Aditya Khosla, Andy Lai Lin, Leonidas Guibas, and.

Semantic Texton Forests for Image Categorization and Segmentation We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד.

Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification Computer Vision, ICCV IEEE 11th International.

Lecture 31: Modern object recognition

Structural Human Action Recognition from Still Images Moin Nabi Computer Vision Lab. ©IPM - Oct

Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,

Human Action Recognition by Learning Bases of Action Attributes and Parts.

Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.

Knowing a Good HOG Filter When You See It: Efficient Selection of Filters for Detection Ejaz Ahmed 1, Gregory Shakhnarovich 2, and Subhransu Maji 3 1 University.

Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.

Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection CVPR2013 POSTER.

Fast intersection kernel SVMs for Realtime Object Detection

Student: Yao-Sheng Wang Advisor: Prof. Sheng-Jyh Wang ARTICULATED HUMAN DETECTION 1 Department of Electronics Engineering National Chiao Tung University.

DISCRIMINATIVE DECORELATION FOR CLUSTERING AND CLASSIFICATION ECCV 12 Bharath Hariharan, Jitandra Malik, and Deva Ramanan.

Recognition: A machine learning approach

Poselets Michael Krainin CSE 590V Oct 18, Person Detection Dalal and Triggs ‘05 – Learn to classify pedestrians vs. background – HOG + linear SVM.

On the Relationship between Visual Attributes and Convolutional Networks Paper ID - 52.

Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson

3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views

PANDA: Pose Aligned Networks for Deep Attribute Modeling Ning Zhang1;2, Manohar Paluri1, Marc’Aurelio Ranzato1, Trevor Darrell2, Lubomir Bourdev1 1: Facebook.

R-CNN By Zhang Liliang.

Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.

Describing People: A Poselet-Based Approach to Attribute Classification Lubomir Bourdev 1,2 Subhransu Maji 1 Jitendra Malik 1 1 EECS U.C. Berkeley 2 Adobe.

School of Electronic Information Engineering, Tianjin University Human Action Recognition by Learning Bases of Action Attributes and Parts Jia pingping.

Bag-of-Words based Image Classification Joost van de Weijer.

The Three R’s of Vision Jitendra Malik.

CSE 185 Introduction to Computer Vision Pattern Recognition.

Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.

Texture We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.

Beauty is Here! Evaluating Aesthetics in Videos Using Multimodal Features and Free Training Data Yanran Wang, Qi Dai, Rui Feng, Yu-Gang Jiang School of.

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Object Recognition in Images Slides originally created by Bernd Heisele.

Human pose recognition from depth image MS Research Cambridge.

Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.

Categorization by Learning and Combing Object Parts B. Heisele, T. Serre, M. Pontil, T. Vetter, T. Poggio. Presented by Manish Jethwa.

Experimental Results Abstract Fingerspelling is widely used for education and communication among signers. We propose a new static fingerspelling recognition.

CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.

Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations ZUO ZHEN 27 SEP 2011.

Unsupervised Salience Learning for Person Re-identification

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

Describing People: A Poselet-Based Approach to Attribute Classification.

9.913 Pattern Recognition for Vision Class9 - Object Detection and Recognition Bernd Heisele.

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

PANDA: Pose Aligned Networks for Deep Attribute Modeling Ning Zhang 1,2 Manohar Paluri 1 Marć Aurelio Ranzato 1 Trevor Darrell 2 Lumbomir Boudev 1 1 Facebook.

Presenter: Jae Sung Park

Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.

Evaluation of Gender Classification Methods with Automatically Detected and Aligned Faces Speaker: Po-Kai Shen Advisor: Tsai-Rong Chang Date: 2010/6/14.

What is an Object? —— an experimental evaluation Presented by: Yao Pan.

CNN-RNN: A Uniﬁed Framework for Multi-label Image Classiﬁcation

Bangpeng Yao1, Xiaoye Jiang2, Aditya Khosla1,

Nonparametric Semantic Segmentation

Using Transductive SVMs for Object Classification in Images

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Image Classification.

Categorization by Learning and Combing Object Parts

Adaboost for faces. Material

Jie Chen, Shiguang Shan, Shengye Yan, Xilin Chen, Wen Gao

Sign Language Recognition With Unsupervised Feature Learning

Presentation transcript:

Contributions A people dataset of 8035 images. Three layer attribute classification framework using poselets. 1 2

People Dataset H3D (Humans in 3D)Pascal VOC 2010 (trn+val) 8035 images TRN: 2003VAL: 2011TEST: different attributes in total. At least 2 attributes per image Agreement of 4 out of 5 1

Attribute classification using poselets 2

What is a Poselet ? Poselets capture part of the pose from a given viewpoint [Bourdev & Malik, ICCV09]

Poselets Examples may differ visually but have common semantics [Bourdev & Malik, ICCV09]

Poselets But how are we going to create training examples of poselets?

How do we train a poselet for a given pose configuration?

Finding correspondences at training time Given part of a human pose How do we find a similar pose configuration in the training set?

We use keypoints to annotate the joints, eyes, nose, etc. of people Left Hip Left Shoulder Finding correspondences at training time

Residual Error Finding correspondences at training time

Training poselet classifiers Residual Error: Given a seed patch 2. Find the closest patch for every other person 3. Sort them by residual error 4. Threshold them

Training poselet classifiers 1. Given a seed patch 2. Find the closest patch for every other person 3. Sort them by residual error 4. Threshold them 5. Use them as positive training examples to train a linear SVM with HOG features

Which poselets should we train? Choose thousands of random windows, generate poselet candidates, train linear SVMs Select a small set of poselets that are: –Individually effective –Complementary

Selecting a small set of complementary poselets

Some Poselets

Attribute classification using poselets 2

Features HOGs at two levels (~2K-4K features) –16 x 16 –32 x 32 Color Histograms in H,S,B (30 features) –10 bins for H, S and B Skin classifier output (3 features) –GMM with 5 components –Fraction of skin pixels –hands-skin, legs-skin, neck skin

Poselet-level Attribute Classifiers

Person-level Attribute Classifiers

Context-level Attribute Classifiers

Results

Gender Classification Results

Thanks