1 Action Classification: An Integration of Randomization and Discrimination in A Dense Feature Representation Computer Science Department, Stanford University.

Slides:

Advertisements

Similar presentations

Human Action Recognition by Learning Bases of Action Attributes and Parts Bangpeng Yao, Xiaoye Jiang, Aditya Khosla, Andy Lai Lin, Leonidas Guibas, and.

Advertisements

Random Forest Predrag Radenković 3237/10

Zhimin CaoThe Chinese University of Hong Kong Qi YinITCS, Tsinghua University Xiaoou TangShenzhen Institutes of Advanced Technology Chinese Academy of.

Three things everyone should know to improve object retrieval

Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.

Human Action Recognition by Learning Bases of Action Attributes and Parts Bangpeng Yao, Xiaoye Jiang, Aditya Khosla, Andy Lai Lin, Leonidas Guibas, and.

Multi-layer Orthogonal Codebook for Image Classification Presented by Xia Li.

Learning Visual Similarity Measures for Comparing Never Seen Objects Eric Nowak, Frédéric Jurie CVPR 2007.

1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.

Bangpeng Yao and Li Fei-Fei

Bangpeng Yao Li Fei-Fei Computer Science Department, Stanford University, USA.

Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,

Human Action Recognition by Learning Bases of Action Attributes and Parts.

Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.

Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.

Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.

Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.

Desheng Liu, Maggi Kelly and Peng Gong Dept. of Environmental Science, Policy & Management University of California, Berkeley May 18, 2005 Classifying.

Discriminative and generative methods for bags of features

Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.

Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson

TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A.

Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.

Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.

5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.

Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.

A String Matching Approach for Visual Retrieval and Classification Mei-Chen Yeh* and Kwang-Ting Cheng Learning-Based Multimedia Lab Department of Electrical.

Ensemble Learning (2), Tree and Forest

Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.

Ranking with High-Order and Missing Information M. Pawan Kumar Ecole Centrale Paris Aseem BehlPuneet DokaniaPritish MohapatraC. V. Jawahar.

School of Electronic Information Engineering, Tianjin University Human Action Recognition by Learning Bases of Action Attributes and Parts Jia pingping.

Bag-of-Words based Image Classification Joost van de Weijer.

Bag of Video-Words Video Representation

Learning Visual Similarity Measures for Comparing Never Seen Objects By: Eric Nowark, Frederic Juric Presented by: Khoa Tran.

Loss-based Learning with Latent Variables M. Pawan Kumar École Centrale Paris École des Ponts ParisTech INRIA Saclay, Île-de-France Joint work with Ben.

Object Detection Sliding Window Based Approach Context Helps

Why Categorize in Computer Vision ?. Why Use Categories? People love categories!

Ranking with High-Order and Missing Information M. Pawan Kumar Ecole Centrale Paris Aseem BehlPuneet KumarPritish MohapatraC. V. Jawahar.

Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,

Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.

Svetlana Lazebnik, Cordelia Schmid, Jean Ponce

Yao, B., and Fei-fei, L. IEEE Transactions on PAMI(2012)

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Representations for object class recognition David Lowe Department of Computer Science University of British Columbia Vancouver, Canada Sept. 21, 2006.

A Codebook-Free and Annotation-free Approach for Fine-Grained Image Categorization Authors Bangpeng Yao et al. Presenter Hyung-seok Lee ( 이형석 ) CVPR 2012.

Combining multiple learners Usman Roshan. Bagging Randomly sample training data Determine classifier C i on sampled data Goto step 1 and repeat m times.

ECE738 Advanced Image Processing Face Detection IEEE Trans. PAMI, July 1997.

BAGGING ALGORITHM, ONLINE BOOSTING AND VISION Se – Hoon Park.

Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.

Levels of Image Data Representation 4.2. Traditional Image Data Structures 4.3. Hierarchical Data Structures Chapter 4 – Data structures for.

Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions Bangpeng Yao and Li Fei-Fei Computer Science Department, Stanford.

Guest lecture: Feature Selection Alan Qi Dec 2, 2004.

Recognition Using Visual Phrases

Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.

Locally Linear Support Vector Machines Ľubor Ladický Philip H.S. Torr.

Max-Margin Training of Upstream Scene Understanding Models Jun Zhu Carnegie Mellon University Joint work with Li-Jia Li *, Li Fei-Fei *, and Eric P. Xing.

Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.

Presenter: Jae Sung Park

Discriminative Machine Learning Topic 4: Weak Supervision M. Pawan Kumar Slides available online

NICTA SML Seminar, May 26, 2011 Modeling spatial layout for image classification Jakob Verbeek 1 Joint work with Josip Krapac 1 & Frédéric Jurie 2 1: LEAR.

Bangpeng Yao1, Xiaoye Jiang2, Aditya Khosla1,

Guillaume-Alexandre Bilodeau

Lecture 21: GIS Analytical Functionality (V)

Learning Mid-Level Features For Recognition

Paper Presentation: Shape and Matching

Multiple Feature Learning for Action Classification

A Graph-Matching Kernel for Object Categorization

Advisor: Dr.vahidipour Zahra salimian Shaghayegh jalali Dec 2017

Presentation transcript:

1 Action Classification: An Integration of Randomization and Discrimination in A Dense Feature Representation Computer Science Department, Stanford University Bangpeng Yao, Aditya Khosla, and Li Fei-Fei

2 Action Classification & Intuition Our Method Our Results Conclusion Outline

Action Classification & Intuition Our Method Our Results Conclusion Outline 3

Action Classification 4 Object classification: Presence of parts and their spatial configurations. [Lazebnik et al, 2006] [Fergus et al, 2003] … PhoningRidingBike Running

Action Classification 5 All images contain humans; Object classification: Presence of parts and their spatial configurations. [Lazebnik et al, 2006] [Fergus et al, 2003] …

Action Classification 6 All images contain humans; Objects small or absent; Object classification: Presence of parts and their spatial configurations. [Lazebnik et al, 2006] [Fergus et al, 2003] …

Action Classification 7 All images contain humans; Objects small or absent; Large pose variation & occlusion; Background clutter; Challenging… Object classification: Presence of parts and their spatial configurations. [Lazebnik et al, 2006] [Fergus et al, 2003] …

Our Intuition 8 Focus on image regions that contain the most discriminative information.

Our Intuition 9 Focus on image regions that contain the most discriminative information. How to represent the features? Dense feature space Randomization & Discrimination How to explore this feature space?

Outline 10 Action Classification & Intuition Our Method Our Results Conclusion

11... Region Height Region Width Dense Feature Space Normalized Image Size of image region Center of image region

12... Region Height Region Width Normalized Image Size of image region Center of image region Dense Feature Space

13... Region Height Region Width Normalized Image Size of image region Center of image region Dense Feature Space

14... Region Height Region Width How can we identify the discriminative regions efficiently and effectively? Normalized Image Size of image region Center of image region Dense Feature Space Image size: N×N Image regions: O(N 6 )

15... Region Height Region Width Normalized Image Size of image region Center of image region Apply randomization to sample a subset of image patches Dense Feature Space Random Forests (RF)

16... Region Height Region Width This classOther classes Random Forests (RF) Normalized Image Size of image region Center of image region Dense Feature Space

17... Region Height Region Width RF with discriminative classifiers Normalized Image Size of image region Center of image region This classOther classes Dense Feature Space

18 Generalization Ability of RF Generalization error of a RF: : correlation between decision trees : strength of the decision trees Discriminative classifiers Better generalization Dense feature space decreases increases

19 …… …… RF with Discriminative Classifiers

20 …… …… Train a binary SVM RF with Discriminative Classifiers BoW or SPM of SIFT-LLC features

21 …… …… Train a binary SVM RF with Discriminative Classifiers Biggest information gain

22 …… …… Train a binary SVM RF with Discriminative Classifiers

23 …… …… RF with Discriminative Classifiers We stop growing the tree if: - The maximum depth is reached; - There is only one class at the node; - The entropy of the training data at the node is low

Classification With RF 24 …… …… Number of trees Class Label

Action Classification & Intuition Our Method Our Results Conclusion Outline 25

Results on VOC 2011 Actions 26 Action Others’ Best Our Method Jumping Phoning Playing instrument Reading Riding bike Riding horse Running Taking photo Using computer Walking Our method ranks the first in six out of ten classes.

Results on VOC 2011 Actions 27 Action Others’ Best Our Method Jumping Phoning Playing instrument Reading Riding bike Riding horse Running Taking photo Using computer Walking

Results on VOC 2011 Actions 28 Action Others’ Best Our Method Jumping Phoning Playing instrument Reading Riding bike Riding horse Running Taking photo Using computer Walking

29 Generalization Ability of RF Discriminative classifiers Better generalization Dense feature space Tree correlation decreases Tree strength increases dense feature(spatial pyramid) SPM feature Vs. strong classifierweak classifierVs. Train discriminative SVM classifiers Generate feature weights randomly (Results on PASCAL VOC 2010)

Action Classification & Intuition Our Method Our Results Conclusion Outline 30

Conclusion 31 Exploring dense image features can benefit action classification; Combining randomization and discrimination is an effective way to explore the dense image representation; Achieves very good performance based on only one type of image descriptor; Code will be available soon.

32 …… …… Train a binary SVM Acknowledgement Bangpeng Yao, Aditya Khosla, and Li Fei-Fei. “Combining Randomization and Discrimination for Fine-Grained Image Categorization.” CVPR Thanks to Su Hao, Olga Russakovsky, and Carsten Rother. Reference: