Exemplar-SVM for Action Recognition

Slides:



Advertisements
Similar presentations
Max-Margin Additive Classifiers for Detection
Advertisements

Spatio-Temporal Relationship Match: Video Structure Comparison for Recognition of Complex Human Activities M. S. Ryoo and J. K. Aggarwal ICCV2009.
A Discriminative Key Pose Sequence Model for Recognizing Human Interactions Arash Vahdat, Bo Gao, Mani Ranjbar, and Greg Mori ICCV2011.
Recognizing Human Actions by Attributes CVPR2011 Jingen Liu, Benjamin Kuipers, Silvio Savarese Dept. of Electrical Engineering and Computer Science University.
Learning an Attribute Dictionary for Human Action Classification
Object recognition and scene “understanding”
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Limin Wang, Yu Qiao, and Xiaoou Tang
Human Action Recognition across Datasets by Foreground-weighted Histogram Decomposition Waqas Sultani, Imran Saleemi CVPR 2014.
Human Action Recognition by Learning Bases of Action Attributes and Parts Bangpeng Yao, Xiaoye Jiang, Aditya Khosla, Andy Lai Lin, Leonidas Guibas, and.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Juergen Gall Action Recognition.
Some Recent Works of Human Activity Recognition 吴心筱
Space-time interest points Computational Vision and Active Perception Laboratory (CVAP) Dept of Numerical Analysis and Computer Science KTH (Royal Institute.
Structural Human Action Recognition from Still Images Moin Nabi Computer Vision Lab. ©IPM - Oct
LOCUS (Learning Object Classes with Unsupervised Segmentation) A variational approach to learning model- based segmentation. John Winn Microsoft Research.
Transferable Dictionary Pair based Cross-view Action Recognition Lin Hong.
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
Discriminative and generative methods for bags of features
Local Descriptors for Spatio-Temporal Recognition
Good morning, everyone, thank you for coming to my presentation.
CVR05 University of California Berkeley 1 Familiar Configuration Enables Figure/Ground Assignment in Natural Scenes Xiaofeng Ren, Charless Fowlkes, Jitendra.
LOCUS Demo Stefan Zickler. Two “different” classes Class “Car Side Views” Class “Car Rears”
Large Scale Recognition and Retrieval. What does the world look like? High level image statistics Object Recognition for large-scale search Focus on scaling.
School of Electronic Information Engineering, Tianjin University Human Action Recognition by Learning Bases of Action Attributes and Parts Jia pingping.
Real-time Action Recognition by Spatiotemporal Semantic and Structural Forest Tsz-Ho Yu, Tae-Kyun Kim and Roberto Cipolla Machine Intelligence Laboratory,
Bag of Video-Words Video Representation
Watch, Listen & Learn: Co-training on Captioned Images and Videos Sonal Gupta, Joohyun Kim, Kristen Grauman, Raymond Mooney The University of Texas at.
Flow Based Action Recognition Papers to discuss: The Representation and Recognition of Action Using Temporal Templates (Bobbick & Davis 2001) Recognizing.
Taylor Rassmann.  Look at a confusion matrix of the UCF50 dataset  Dollar Features  Find the two most confused classes  Train an SVM specifically.
Action recognition with improved trajectories
IRISA / INRIA Rennes Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology)
Player Action Recognition in Broadcast Tennis Video with Applications to Semantic Analysis of Sport Game Guangyu Zhu, Changsheng Xu Qingming Huang, Wen.
Watch, Listen and Learn Sonal Gupta, Joohyun Kim, Kristen Grauman and Raymond Mooney -Pratiksha Shah.
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
Periodic Motion Detection via Approximate Sequence Alignment Ivan Laptev*, Serge Belongie**, Patrick Perez* *IRISA/INRIA, Rennes, France **Univ. of California,
Multi-task Low-rank Affinity Pursuit for Image Segmentation Bin Cheng, Guangcan Liu, Jingdong Wang, Zhongyang Huang, Shuicheng Yan (ICCV’ 2011) Presented.
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
1 Action Classification: An Integration of Randomization and Discrimination in A Dense Feature Representation Computer Science Department, Stanford University.
Video Tracking Using Learned Hierarchical Features
Spatio-temporal constraints for recognizing 3D objects in videos Nicoletta Noceti Università degli Studi di Genova.
Week 9 Presented by Christina Peterson. Recognition Accuracies on UCF Sports data set Method Accuracy (%)DivingGolfingKickingLiftingRidingRunningSkating.
Semantic Embedding Space for Zero ­ Shot Action Recognition Xun XuTimothy HospedalesShaogang GongAuthors: Computer Vision Group Queen Mary University of.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION Yan Song, Sheng Tang, Yan-Tao Zheng, Tat-Seng Chua, Yongdong Zhang, Shouxun Lin.
Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions Bangpeng Yao and Li Fei-Fei Computer Science Department, Stanford.
E XEMPLAR -SVM FOR A CTION R ECOGNITION Week 11 Presented by Christina Peterson.
First-Person Activity Recognition: What Are They Doing to Me? M. S. Ryoo and Larry Matthies Jet Propulsion Laboratory, California Institute of Technology,
Zuxuan Wu, Xi Wang, Yu-Gang Jiang, Hao Ye, Xiangyang Xue
Max-Margin Training of Upstream Scene Understanding Models Jun Zhu Carnegie Mellon University Joint work with Li-Jia Li *, Li Fei-Fei *, and Eric P. Xing.
Multi-view Synchronization of Human Actions and Dynamic Scenes Emilie Dexter, Patrick Pérez, Ivan Laptev INRIA Rennes - Bretagne Atlantique
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
Hierarchical Motion Evolution for Action Recognition Authors: Hongsong Wang, Wei Wang, Liang Wang Center for Research on Intelligent Perception and Computing,
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Visual Event Recognition in Videos by Learning from Web Data
Human Action Recognition Week 10
Data Driven Attributes for Action Detection
Action Recognition ECE6504 Xiao Lin.
Paper Presentation: Shape and Matching
Action Recognition in Temporally Untrimmed Videos
Bilinear Classifiers for Visual Recognition
Human Activity Analysis
Data Driven Attributes for Action Detection
Week 6 Fatemeh Yazdiananari.
Human Action Recognition Week 8
Anomaly Detection in Crowded Scenes
Weakly Supervised Action Recognition
Exemplar-SVM for Action Recognition
University of Central Florida
Presentation transcript:

Exemplar-SVM for Action Recognition Week 12 Presented by Christina Peterson

Recognition Accuracies on UCF Sports data set Method Accuracy (%) Diving Golfing Kicking Lifting Riding Running Skating Swing-bench High-swing Walking Rodriguez et al. [1] 69.2 68 61 66 75 74 73 - Yeffet and Wolf [2] 79.3 100 65 67 69 92 86 Le et al. [4] 86.5 77.8 80 66.7 83.3 90.9 Wu et al. [6] 91.3 88 93 84 95 91 Action Bank [7] 95.0 83 89 Standard Multiclass-SVMs 90.2 50 71.4 Combined Exemplar-SVMs 94.4 96.6 87.5 91.7 98.1 97.9 82.5

Confusion Matrix: Combined Exemplar-SVM Di Go Ki Li Ho Ru Sk Sb Ss Wa Diving 96.6 1.9 1.5 87.5 8.3 4.2 91.7 100.0 83.3 16.7 98.1 97.9 2.1 2.8 94.4 3.3 10.9 82.5 Golf Kick Lift Horse-Ride Run Skateboard Swing-bench Swing-side Walk

Modifications Ran STIP for kicking action class Lowered the threshold for weak interest points to obtain more interest points The number of interest points collected for each video should be approximately equal to each other

Constraints The Exemplar Set The Validation Set/Test Set Each exemplar should be a good representation of the action class Needs a good variety The Validation Set/Test Set The validation set and test set should be similar to each other Motion Color Confusing Videos were omitted from all sets For example: Accelerating a skateboard by foot closely resembles walking

References [1] M. D. Rodriguez, J. Ahmed, and M. Shah. Action mach: A spatio- temporal maximum average correlation height filter for action recognition. In CVPR, 2008. [2] Yeffet and L. Wolf. Local trinary patterns for human action recognition. In ICCV, 2009. [3] H. Wang, M. Ullah, A. Klaser, I. Laptev, and C. Schmid. Evaluation of local spatio-temporal features for action recognition. In BMVC, 2009. [4] Q. Le, W. Zou, S. Yeung, and A. Ng. Learning hierarchical invariant spatiotemporal features for action recognition with independent subspace analysis. In CVPR, 2011. [5] A. Kovashka and K. Grauman. Learning a hierarchy of discriminative spacetime neighborhood features for human action recognition. InCVPR, 2010. [6] X. Wu, D. Xu, L. Duan, and J. Luo. Action recognition using context and appearance distribution features. InCVPR, 2011. [7] S. Sadanand and J. J. Corso. Action bank: A high-level representation of activity in video. CVPR, 2012.