H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and.

Slides:



Advertisements
Similar presentations
Coherent Laplacian 3D protrusion segmentation Oxford Brookes Vision Group Queen Mary, University of London, 11/12/2009 Fabio Cuzzolin.
Advertisements

Robust spectral 3D-bodypart segmentation along time Fabio Cuzzolin, Diana Mateus, Edmond Boyer, Radu Horaud Perception project meeting 24/4/2007 Submitted.
Simultaneous surveillance camera calibration and foot-head homology estimation from human detection 1 Author : Micusic & Pajdla Presenter : Shiu, Jia-Hau.
W HAT A RE W E T RACKING : A U NIFIED A PPROACH OF T RACKING AND R ECOGNITION Jialue Fan, Xiaohui Shen, Student Member, IEEE, and Ying Wu, Senior Member,
Improved Seam Carving for Video Retargeting Authors: Michael Rubinstein, Ariel Shamir and Shai Avidan Source: ACM Transactions on Graphics (TOG), Volume.
A CTION R ECOGNITION FROM V IDEO U SING F EATURE C OVARIANCE M ATRICES Kai Guo, Prakash Ishwar, Senior Member, IEEE, and Janusz Konrad, Fellow, IEEE.
A Geometric Perspective on Machine Learning 何晓飞 浙江大学计算机学院 1.
Byung Cheol Song Shin-Cheol Jeong Yanglim Choi Video Super-Resolution Algorithm Using Bi-directional Overlapped Block Motion Compensation IEEE TRANSACTIONS.
Proposed concepts illustrated well on sets of face images extracted from video: Face texture and surface are smooth, constraining them to a manifold Recognition.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Automatic Feature Extraction for Multi-view 3D Face Recognition
Presented by: Mingyuan Zhou Duke University, ECE April 3, 2009
2.Our Framework 2.1. Enforcing Temporal Consistency by Post Processing  Human Detection from Yang and Ramanan [1] Articulated Pose Estimation using Flexible.
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
A Robust Pedestrian Detection Approach Based on Shapelet Feature and Haar Detector Ensembles Wentao Yao, Zhidong Deng TSINGHUA SCIENCE AND TECHNOLOGY ISSNl.
Haojie Li Jinhui Tang Si Wu Yongdong Zhang Shouxun Lin Automatic Detection and Analysis of Player Action in Moving Background Sports Video Sequences IEEE.
{ Fast Disparity Estimation Using Spatio- temporal Correlation of Disparity Field for Multiview Video Coding Wei Zhu, Xiang Tian, Fan Zhou and Yaowu Chen.
A Novel Method for Generation of Motion Saliency Yang Xia, Ruimin Hu, Zhenkun Huang, and Yin Su ICIP 2010.
A New Block Based Motion Estimation with True Region Motion Field Jozef Huska & Peter Kulla EUROCON 2007 The International Conference on “Computer as a.
Natan Jacobson, Yen-Lin Lee, Vijay Mahadevan, Nuno Vasconcelos, Truong Q. Nguyen IEEE, ICME 2010.
Ai-Mei Huang and Truong Nguyen Video Processing LabECE Dept, UCSD, La Jolla, CA This paper appears in: Image Processing, ICIP IEEE International.
Dimensionality Reduction Chapter 3 (Duda et al.) – Section 3.8
MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
CS 790Q Biometrics Face Recognition Using Dimensionality Reduction PCA and LDA M. Turk, A. Pentland, "Eigenfaces for Recognition", Journal of Cognitive.
Face Recognition using PCA (Eigenfaces) and LDA (Fisherfaces)
Face Recognition Based on 3D Shape Estimation
MVL (Machine Vision Lab) UIC HUMAN MOTION VIDEO DATABASE Jezekiel Ben-Arie ECE Department University Of Illinois at Chicago Scripts, Queries, Recognition.
A Global Geometric Framework for Nonlinear Dimensionality Reduction Joshua B. Tenenbaum, Vin de Silva, John C. Langford Presented by Napat Triroj.
Atul Singh Junior Undergraduate CSE, IIT Kanpur.  Dimension reduction is a technique which is used to represent a high dimensional data in a more compact.
Face Recognition: An Introduction
Lightseminar: Learned Representation in AI An Introduction to Locally Linear Embedding Lawrence K. Saul Sam T. Roweis presented by Chan-Su Lee.
Manifold learning: Locally Linear Embedding Jieping Ye Department of Computer Science and Engineering Arizona State University
3D Fingertip and Palm Tracking in Depth Image Sequences
Graph Embedding: A General Framework for Dimensionality Reduction Dong XU School of Computer Engineering Nanyang Technological University
A Tensorial Approach to Access Cognitive Workload related to Mental Arithmetic from EEG Functional Connectivity Estimates S.I. Dimitriadis, Yu Sun, K.
A Two-level Pose Estimation Framework Using Majority Voting of Gabor Wavelets and Bunch Graph Analysis J. Wu, J. M. Pedersen, D. Putthividhya, D. Norgaard,
Face Recognition: An Introduction
Local Fisher Discriminant Analysis for Supervised Dimensionality Reduction Presented by Xianwang Wang Masashi Sugiyama.
Chin-Hsien Fang( 方競賢 ), Ju-Chin Chen( 陳洳瑾 ), Chien-Chung Tseng( 曾建中 ),and Jenn-Jier James Lien( 連震杰 ) Department of Computer Science and Information Engineering,
ISOMAP TRACKING WITH PARTICLE FILTER Presented by Nikhil Rane.
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
Nonlinear Dimensionality Reduction Approach (ISOMAP)
Multi-Eigenspace Learning for Video-Base Face Recognition Liang Liu 1, Yunhong Wang 2, Tieniu Tan 1 1 National Laboratory of Pattern Recognition, Institute.
A split-and-merge framework for 2D shape summarization D. Gerogiannis, C. Nikou and A. Likas Department of Computer Science, University of Ioannina, Greece.
Design of PCA and SVM based face recognition system for intelligent robots Department of Electrical Engineering, Southern Taiwan University, Tainan County,
Unsupervised Feature Selection for Multi-Cluster Data Deng Cai, Chiyuan Zhang, Xiaofei He Zhejiang University.
Feature extraction using fuzzy complete linear discriminant analysis The reporter : Cui Yan
Journal of Visual Communication and Image Representation
June 25-29, 2006ICML2006, Pittsburgh, USA Local Fisher Discriminant Analysis for Supervised Dimensionality Reduction Masashi Sugiyama Tokyo Institute of.
Jenn-Jier James Lien (連震杰)
Irfan Ullah Department of Information and Communication Engineering Myongji university, Yongin, South Korea Copyright © solarlits.com.
Value Function Approximation with Diffusion Wavelets and Laplacian Eigenfunctions by S. Mahadevan & M. Maggioni Discussion led by Qi An ECE, Duke University.
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
Zhiming Liu and Chengjun Liu, IEEE. Introduction Algorithms and procedures Experiments Conclusion.
Hierarchical Motion Evolution for Action Recognition Authors: Hongsong Wang, Wei Wang, Liang Wang Center for Research on Intelligent Perception and Computing,
Local Stereo Matching Using Motion Cue and Modified Census in Video Disparity Estimation Zucheul Lee, Ramsin Khoshabeh, Jason Juang and Truong Q. Nguyen.
Spectral Methods for Dimensionality
A. M. R. R. Bandara & L. Ranathunga
Gait Recognition Gökhan ŞENGÜL.
Temporal Order-Preserving Dynamic Quantization for Human Action Recognition from Multimodal Sensor Streams Jun Ye Kai Li Guo-Jun Qi Kien.
Unsupervised Riemannian Clustering of Probability Density Functions
Outline Nonlinear Dimension Reduction Brief introduction Isomap LLE
Lecture 21 SVD and Latent Semantic Indexing and Dimensional Reduction
2 variants: Global fusion & Local perturbation
Outline H. Murase, and S. K. Nayar, “Visual learning and recognition of 3-D objects from appearance,” International Journal of Computer Vision, vol. 14,
An Introduction to Computer Vision& Pattern Recognition Group
FACE RECOGNITION USING LAPLACIANFACES
Evaluating Reliability of Motion Features in Surveillance Videos
Problem Image and Volume Segmentation:
Presentation transcript:

H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and Pattern Recognition Instructor: Jenn-Jier Lien Reporter: Mei-Hsuan Chao

O UTLINE Introduction Related work Local spatio-temporal discriminant embedding Experiments Conclusion 2

I NTRODUCTION Recognizing human activities in videos has many important computer vision applications. A human silhouette contains both instant spatial information about the body pose and dynamic temporal motion information of the global body and local body parts. Human silhouettes can be considered as data points on nonlinear dynamic shape manifolds. The aim in this paper is to find a manifold embedding method which can optimally make use of the discriminative temporal shape variation information between different types of actions. 3

R ELATED WORK LPP(Locality preserving projections) LPP constructs a nearest neighbor graph. By using the Laplacian of the graph, LPP can find a mapping which optimally preserves the local neighborhood information. LSDA(locality sensitive discriminant analysis) LSDA first constructs one nearest neighbor graph, and then splits it into the within-class graph and the between-class graph. 4

L OCAL SPATIO - TEMPORAL DISCRIMINANT EMBEDDING Neighbor graph G Short video segment Si 5

L OCAL SPATIO - TEMPORAL DISCRIMINANT EMBEDDING Objective functions 6

L OCAL SPATIO - TEMPORAL DISCRIMINANT EMBEDDING Principal angles between Si and Sj 7

L OCAL SPATIO - TEMPORAL DISCRIMINANT EMBEDDING Optimal embedding The columns of an optimal A can be obtained as the generalized eigenvectors corresponding to the l largest eigenvalues. 8

L OCAL SPATIO - TEMPORAL DISCRIMINANT EMBEDDING Iterative learning 9

E XPERIMENTS Data setting Design of two-stage recognition scheme 10 Frame by frame basis Short segment basis Test silhouette frame Recognition result

E XPERIMENTS 11

C ONCLUSION Propose a novel local spatio-temporal discriminant embedding (LSTDE) method. Perform recognition on a frame-by-frame or short video segment basis. Experimental results demonstrate that the proposed method can accurately recognize human actions, and outperforms some representative manifold embedding methods. 12