Shape and Dynamics in Human Movement Analysis Ashok Veeraraghavan.

Slides:



Advertisements
Similar presentations
Gestures Recognition. Image acquisition Image acquisition at BBC R&D studios in London using eight different viewpoints. Sequence frame-by-frame segmentation.
Advertisements

Bilinear models for action and identity recognition Oxford Brookes Vision Group 26/01/2009 Fabio Cuzzolin.
FEATURE PERFORMANCE COMPARISON FEATURE PERFORMANCE COMPARISON y SC is a training set of k-dimensional observations with labels S and C b C is a parameter.
Active Appearance Models
Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Robust Speech recognition V. Barreaud LORIA. Mismatch Between Training and Testing n mismatch influences scores n causes of mismatch u Speech Variation.
Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
Caroline Rougier, Jean Meunier, Alain St-Arnaud, and Jacqueline Rousseau IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 21, NO. 5,
Automatic determination of skeletal age from hand radiographs of children Image Science Institute Utrecht University C.A.Maas.
Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction.
Computer vision: models, learning and inference Chapter 18 Models for style and identity.
Face Recognition and Biometric Systems
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Face Recognition & Biometric Systems, 2005/2006 Face recognition process.
Hidden Markov Models Theory By Johan Walters (SR 2003)
Hidden Markov Model based 2D Shape Classification Ninad Thakoor 1 and Jean Gao 2 1 Electrical Engineering, University of Texas at Arlington, TX-76013,
Shape and Dynamics in Human Movement Analysis Ashok Veeraraghavan.
Region labelling Giving a region a name. Image Processing and Computer Vision: 62 Introduction Region detection isolated regions Region description properties.
Face Recognition Under Varying Illumination Erald VUÇINI Vienna University of Technology Muhittin GÖKMEN Istanbul Technical University Eduard GRÖLLER Vienna.
MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
RBF Neural Networks x x1 Examples inside circles 1 and 2 are of class +, examples outside both circles are of class – What NN does.
A Data-Driven Approach to Quantifying Natural Human Motion SIGGRAPH ’ 05 Liu Ren, Alton Patrick, Alexei A. Efros, Jassica K. Hodgins, and James M. Rehg.
Probabilistic video stabilization using Kalman filtering and mosaicking.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman ICCV 2003 Presented by: Indriyati Atmosukarto.
Recognition of Human Gait From Video Rong Zhang, C. Vogler, and D. Metaxas Computational Biomedicine Imaging and Modeling Center Rutgers University.
Face Recognition Based on 3D Shape Estimation
Announcements Take home quiz given out Thursday 10/23 –Due 10/30.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Learning the space of time warping functions for Activity Recognition Function-Space of an Activity Ashok Veeraraghavan Rama Chellappa Amit K. Roy-Chowdhury.
Gait Recognition Simon Smith Jamie Hutton Thomas Moore David Newman.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Dynamic Time Warping Applications and Derivation
Hand Signals Recognition from Video Using 3D Motion Capture Archive Tai-Peng Tian Stan Sclaroff Computer Science Department B OSTON U NIVERSITY I. Introduction.
Statistical Shape Models Eigenpatches model regions –Assume shape is fixed –What if it isn’t? Faces with expression changes, organs in medical images etc.
An Illumination Invariant Face Recognition System for Access Control using Video Ognjen Arandjelović Roberto Cipolla Funded by Toshiba Corp. and Trinity.
Computer Vision - A Modern Approach Set: Segmentation Slides by D.A. Forsyth Segmentation and Grouping Motivation: not information is evidence Obtain a.
Manifold learning: Locally Linear Embedding Jieping Ye Department of Computer Science and Engineering Arizona State University
Isolated-Word Speech Recognition Using Hidden Markov Models
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
Multimodal Interaction Dr. Mike Spann
2D Shape Matching (and Object Recognition)
Rongxiang Hu, Wei Jia, Haibin ling, and Deshuang Huang Multiscale Distance Matrix for Fast Plant Leaf Recognition.
General Tensor Discriminant Analysis and Gabor Features for Gait Recognition by D. Tao, X. Li, and J. Maybank, TPAMI 2007 Presented by Iulian Pruteanu.
Incorporating Dynamic Time Warping (DTW) in the SeqRec.m File Presented by: Clay McCreary, MSEE.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Definitions Random Signal Analysis (Review) Discrete Random Signals Random.
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
Extracting features from spatio-temporal volumes (STVs) for activity recognition Dheeraj Singaraju Reading group: 06/29/06.
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
CS654: Digital Image Analysis Lecture 36: Feature Extraction and Analysis.
University of South Florida, Tampa1 Gait Recognition and Inverse Biometrics Sudeep Sarkar (Zongyi Liu, Pranab Mohanty) Computer Science and Engineering.
Introduction to Scale Space and Deep Structure. Importance of Scale Painting by Dali Objects exist at certain ranges of scale. It is not known a priory.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.
Motion Segmentation with Missing Data using PowerFactorization & GPCA
Supervised Time Series Pattern Discovery through Local Importance
René Vidal and Xiaodong Fan Center for Imaging Science
PRESENTED BY Yang Jiao Timo Ahonen, Matti Pietikainen
Particle Filtering for Geometric Active Contours
A Unified Algebraic Approach to 2D and 3D Motion Segmentation
Video Google: Text Retrieval Approach to Object Matching in Videos
Generalized Principal Component Analysis CVPR 2008
Dynamical Statistical Shape Priors for Level Set Based Tracking
Simon Smith Jamie Hutton Thomas Moore David Newman
The Functional Space of an Activity Ashok Veeraraghavan , Rama Chellappa, Amit Roy-Chowdhury Avinash Ravichandran.
Hu Li Moments for Low Resolution Thermal Face Recognition
Scale-Space Representation for Matching of 3D Models
Video Google: Text Retrieval Approach to Object Matching in Videos
Recognition and Matching based on local invariant features
Presentation transcript:

Shape and Dynamics in Human Movement Analysis Ashok Veeraraghavan

Outline Motivation What do we want to do? Shape Shape based methods for recognition Dynamics based methods for recognition Results

Motivation Human Perception Shape or Dynamics (or is it Both??)

Laurel and Hardy

Laurel ??? Hardy ???

Who is this ? ? ?

Introduction Psychophysics work indicates that dynamics is important for recognition in humans. Johansson: Light Display Moving dots Murray(1964) : 24 gait components Cutting:Familiarity;Static Vs Dynamic Kozlowski: dynamics speed, bounciness, rhythm. Cutting :Dynamic Invariant Gender Discrimination

Prior Work Image Correlation. Silhoutte Based Nearest Neighbour. Dynamic Time Warping Hidden Markov Model Model parts of human body and extract gait signature.(eg., Thigh)

Most gait recognition algorithms are shape based ! Relative importance of shape and dynamics

Definition of Shape “Shape is all the geometric information that remains when location, scale and rotational effects are filtered out from the object”. Kendall’s Statistical Shape Theory used for the characterization of shape. Pre-shape accounts for location and scale invariance alone.

Pre-Shape k landmark points (complex vector) Translational Invariance: Subtract mean Scale Invariance: Normalize the scale

Feature Extraction Silhoutte Landmarks Centered Landmarks Pre-shape vector

Distance between shapes Shape lies on a spherical manifold. Shape distance must incorporate the non-Euclidean nature of the shape space. 1)Full Procrustes distance. 2)Partial Procrustes distance. 3)Procrustes distance.

Full Procrustes Distance Procrustes Fit Full Procrustes Distance=Minimum Procrustes Fit.

Other shape distances Partial procrustes distance Procrustes distance (ρ): distance on the Great circle.

Tangent Space Linearization of spherical shape space around a particular pole. The Procrustes mean shape is usually chosen as the pole. If the shapes in the data are very close to each other then Euclidean distance in tangent space approximates shape distances.

Shape based methods for Recognition Stance Correlation. Dynamic time warping in shape space. Hidden Markov Model in shape space.

Stance Correlation Exemplars for 6 stances for each individual. The correlation between exemplars is used as the matching criterion. Performance comparable to Baseline.

Dynamic time warping in shape space. Enforce end-point constraint. Obtain best warping path. Cumulative error is computed using the shape distances described. Performance is better than baseline.

Hidden Markov Model in shape-space Exemplars are regarded as states. HMM built for each person in the gallery. Identity established by maximizing the probability that the observation came from the model in the gallery. Performance is better than baseline and comparable to DTW.

Dynamical Models Stance based AR model. Linear Dynamical System

Stance based AR model Video sequence is clustered into 3 distinct stances. Each frame is identified as belonging to one of these three stances. Parameters of an AR model learnt for each stance. Model parameters used for recognition. Performance is below baseline.

Linear Dynamical System(ARMA) Parameters (A,C) of a dynamical system learnt for each individual. Distance between models used as score for recognition.

Learning the model

Distance between models Subspace angles (θ i : i=1,2….n). Martin,Gap and Frobenius Distance.

Results on USF database Gallery 71 people. Probe varies from Gallery in view, shoe and surface. CMS curves shown.

Sample Sequences

Stance Correlation.

Dynamic time warping

Comparison of DTW with Baseline

Stance based AR model

Linear Dynamical system

Comparison of various methods on the USF database.

Results on the CMU database Gallery consists of 25 people. 3 different activities studied: Slow walk, Fast walk and walk with ball. Recognition performed within and across activities.

Percentage of Recognition using Stance correlation. Slow Walk Fast Walk Ball Slow Walk Fast Walk Ball684892

Similarity Matrix using Linear Dynamical system(ARMA)

Percentage of Recognition using Linear Dynamical system Slow WalkFast WalkBall Slow Walk Fast Walk Ball

Mocap Data

Mocap (Activity Recognition)

Mocap (Activity using ARMA)

Conclusions Shape is more important for recognition than dynamics. Shape also provides for speed change invariance. Dynamics can help to improve performance of shape based methods. Activity Recognition: Dynamics plays a important role. Dynamical models like ARMA can perform recognition across activities.

Thank You.