Learning Human Pose and Motion Models for Animation Aaron Hertzmann University of Toronto
Animation is maturing … but it's still hard to create
Keyframe animation
Keyframe poses q1, q2, q3 are interpolated into a continuous motion q(t)
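The keyframing idea above can be sketched in a few lines: a smooth curve q(t) is fit through sparse key poses. This is a minimal illustrative example for a single joint angle (the keyframe times and values are made up, and a cubic spline is just one common choice of interpolant):

```python
import numpy as np
from scipy.interpolate import CubicSpline

# Hypothetical keyframes: times (seconds) and one joint angle (radians).
key_times = np.array([0.0, 0.5, 1.0])    # q1, q2, q3
key_angles = np.array([0.2, 1.1, 0.4])

# Interpolate a continuous motion q(t) through the keyframes.
q = CubicSpline(key_times, key_angles)

# Sample the curve at 60 fps; it passes exactly through each keyframe.
t = np.linspace(0.0, 1.0, 61)
curve = q(t)
```

In practice every one of the character's controls gets such a curve, which is why hand-keyframing complex characters is so labor-intensive.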
Characters are very complex. Woody: many facial controls, and many more controls in his body
Motion capture [Images from NYU and UW]
Motion capture
Mocap is not a panacea
Problem: animation is very time-consuming. Fine for big studios; a problem for everyone else
Goal: model human motion. Which motions are likely? Applications: computer animation, computer vision
Related work: physical models. Accurate, in principle; too complex to work with (but see [Liu, Hertzmann, Popović 2005]); computationally expensive
Related work: motion graphs. Input: raw motion capture → a “motion graph” (slide from J. Lee)
Approach: statistical models of motion. Learn a PDF over motions, and synthesize from this PDF [Brand and Hertzmann 1999]. What PDF do we use?
Style-Based Inverse Kinematics with: Keith Grochow, Steve Martin, Zoran Popović
Motivation
Body parameterization. Pose at time t: q_t = root position/orientation (6 DOFs) + joint angles (29 DOFs). Motion: X = [q_1, …, q_T]
Forward kinematics (FK): maps a pose q_t to 3D joint positions [x_i, y_i, z_i]_t
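The pose-to-positions mapping can be sketched for a toy planar chain; this is an illustrative two-link example, not the 35-DOF skeleton used in the talk:

```python
import numpy as np

def fk_planar(q, lengths):
    """Forward kinematics for a planar chain: joint angles -> joint positions.

    q: joint angles (radians), one per link.
    lengths: link lengths. Returns an (n+1, 2) array of joint positions.
    """
    positions = [np.zeros(2)]
    total_angle = 0.0
    for angle, length in zip(q, lengths):
        total_angle += angle  # angles accumulate down the chain
        step = length * np.array([np.cos(total_angle), np.sin(total_angle)])
        positions.append(positions[-1] + step)
    return np.array(positions)

# Two-link arm with both joints bent 90 degrees.
pts = fk_planar([np.pi / 2, np.pi / 2], [1.0, 1.0])
```

The real character adds a 6-DOF root transform and a 3D joint hierarchy, but the structure is the same: positions are a deterministic, differentiable function of q, which is what makes constraint-based pose synthesis possible later.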
Problem statement: generate a character pose in a chosen style, subject to constraints on the degrees of freedom (DOFs) q
Approach: off-line learning (motion data → style model) + real-time pose synthesis (constraints + style → pose)
Style representation. Objective function: given a pose, evaluate how well it matches a style, while allowing any pose. A probability distribution function (PDF) is a principled way of automatically learning the style
Features: y(q) = [q, orientation(q), velocity(q)] = [q_0, q_1, q_2, …, r_0, r_1, r_2, …, v_0, v_1, v_2, …]
Goals for the PDF: learn from any data; smooth and descriptive; minimal parameter tuning; real-time synthesis
Mixtures-of-Gaussians
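One candidate PDF is a mixture of Gaussians fit by EM. A minimal 1-D numpy sketch (the data and initialization are toy assumptions; real pose features are high-dimensional, which is where mixtures start to struggle):

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical 1-D "pose feature" data drawn from two clusters.
data = np.concatenate([rng.normal(-2.0, 0.5, 200), rng.normal(2.0, 0.5, 200)])

# Initialize a two-component mixture.
means = np.array([-1.0, 1.0])
stds = np.array([1.0, 1.0])
weights = np.array([0.5, 0.5])

for _ in range(50):  # EM iterations
    # E-step: responsibility of each component for each point.
    comp = weights * np.exp(-0.5 * ((data[:, None] - means) / stds) ** 2) \
        / (stds * np.sqrt(2 * np.pi))
    resp = comp / comp.sum(axis=1, keepdims=True)
    # M-step: re-estimate parameters from responsibility-weighted data.
    n_k = resp.sum(axis=0)
    means = (resp * data[:, None]).sum(axis=0) / n_k
    stds = np.sqrt((resp * (data[:, None] - means) ** 2).sum(axis=0) / n_k)
    weights = n_k / len(data)
```

Mixtures need many components (and much tuning) to cover a smooth pose manifold, which motivates the GPLVM alternative on the next slide.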
GPLVM: Gaussian Process Latent Variable Model [Lawrence 2004]. A GP maps a low-dimensional latent space x = (x_1, x_2) to the feature space y = (y_1, y_2, y_3, …); GP^-1 maps back. Model: x ~ N(0, I), y ~ GP(x; β). Learning: arg max p(X, β | Y) = arg max p(Y | X, β) p(X)
Scaled outputs: different DOFs have different “importances.” Solution: scale the RBF kernel function k(x, x') per output dimension, k_i(x, x') = k(x, x') / w_i^2. Equivalently: learn x → Wy, where W = diag(w_1, w_2, …, w_D)
Style learning: fit the latent coordinates x for each training feature vector y
Precision in latent space: σ²(x)
Pose synthesis: arg min_{x,q} −ln p(y(q), x | X, Y, β) s.t. C(q) = 0, where C(q) are the constraints on the character's degrees of freedom (DOFs) q
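The structure of this constrained synthesis can be sketched with a toy stand-in: a simple Gaussian style prior over a 2-DOF planar arm in place of the SGPLVM, and a single end-effector constraint, solved with scipy's SLSQP. Everything here (the prior, the arm, the target) is an illustrative assumption:

```python
import numpy as np
from scipy.optimize import minimize

# Hypothetical Gaussian "style" prior over a 2-DOF arm pose q
# (a stand-in for the learned SGPLVM objective).
style_mean = np.array([0.6, 0.4])

def fingertip(q, lengths=(1.0, 1.0)):
    # Planar two-link forward kinematics: fingertip position.
    a, b = q[0], q[0] + q[1]
    return np.array([lengths[0] * np.cos(a) + lengths[1] * np.cos(b),
                     lengths[0] * np.sin(a) + lengths[1] * np.sin(b)])

def objective(q):
    # Negative log-likelihood under the style prior: prefer "stylish" poses.
    return 0.5 * ((q - style_mean) ** 2).sum()

# Constraint C(q) = 0: the fingertip must sit at height 1.2.
cons = {"type": "eq", "fun": lambda q: fingertip(q)[1] - 1.2}
result = minimize(objective, x0=style_mean, constraints=[cons], method="SLSQP")
pose = result.x
```

The constraint pins down part of the pose; the style objective resolves the remaining ambiguity, which is exactly the division of labor in style-based IK.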
SGPLVM objective function (defined jointly over the latent space x and feature space y)
Baseball Pitch
Track Start
Jump Shot
The active set: a subset of the training data, selected from all of the training data, used for efficient evaluation
Annealing: original style → high variance → medium variance → original style
Style interpolation: given two styles 1 and 2, can we “interpolate” them? Approach: interpolate in the log-domain, i.e., blend (1−s) ln p_1 + s ln p_2, equivalent to p_s(y) ∝ p_1(y)^(1−s) p_2(y)^s
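For Gaussian densities the log-domain blend has a closed form, which makes the idea concrete: precisions interpolate linearly and the mean is a precision-weighted blend. A minimal 1-D sketch (toy means and variances):

```python
import numpy as np

def interpolate_gaussians(mu1, var1, mu2, var2, s):
    """Blend two Gaussian styles in the log domain:
    p_s(y) ∝ p1(y)^(1-s) * p2(y)^s, which is again Gaussian."""
    lam1, lam2 = 1.0 / var1, 1.0 / var2      # precisions
    lam = (1 - s) * lam1 + s * lam2          # precisions blend linearly
    mu = ((1 - s) * lam1 * mu1 + s * lam2 * mu2) / lam
    return mu, 1.0 / lam

# Halfway between two equal-variance styles: the mean is the midpoint.
mu, var = interpolate_gaussians(0.0, 1.0, 4.0, 1.0, 0.5)
```

A naive blend of the densities themselves, (1−s) p_1 + s p_2, would instead produce a bimodal mixture, i.e., poses from one style or the other rather than an in-between style; the log-domain blend avoids this.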
Applications
Interactive Posing
Multiple motion styles
Real-time Motion Capture
Style Interpolation
Trajectory Keyframing
Posing from an Image
Modeling motion: the GPLVM doesn't model motions, and velocity features are a hack. How do we model and learn dynamics?
Gaussian Process Dynamical Models with: David Fleet, Jack Wang
Dynamical models: predict x_{t+1} from x_t
Dynamical models: Hidden Markov Models (HMM); linear dynamical systems (LDS) [van Overschee et al. ’94; Doretto et al. ’01]; switching LDS [Ghahramani and Hinton ’98; Pavlovic et al. ’00; Li et al. ’02]; nonlinear dynamical systems [e.g., Ghahramani and Roweis ’00]
Gaussian Process Dynamical Model (GPDM). Latent dynamical model: x_t = f(x_{t−1}; A) + noise (latent dynamics), y_t = g(x_t; B) + noise (pose reconstruction). Assume IID Gaussian noise, with Gaussian priors on the mappings A and B. Marginalize out A and B, then optimize the latent positions to simultaneously minimize pose reconstruction error and (dynamic) prediction error on the training data
Reconstruction: the data likelihood for the reconstruction mapping, given centered inputs Y, has the form p(Y | X, β) = |W|^N / sqrt((2π)^{ND} |K_Y|^D) · exp(−½ tr(K_Y^{−1} Y W² Yᵀ)), where Y contains the training poses (one dimension per column), K_Y is a kernel matrix with entries k_Y(x_i, x_j) for kernel function k_Y (with hyperparameters β), and W = diag(w_1, …, w_D) scales the different pose dimensions
Dynamics: the latent dynamic process on X has a similar form: p(X | α) = p(x_1) / sqrt((2π)^{(N−1)d} |K_X|^d) · exp(−½ tr(K_X^{−1} X_out X_outᵀ)), where X_out = [x_2, …, x_N]ᵀ and K_X is a kernel matrix defined over [x_1, …, x_{N−1}] by kernel function k_X with hyperparameters α
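The two kernel matrices above can be sketched in numpy; the latent trajectory, kernel width, and noise level here are illustrative assumptions, not learned values:

```python
import numpy as np

def rbf_kernel(A, B, width=1.0, noise=1e-4):
    """RBF kernel matrix, with a small noise term on the diagonal when A == B."""
    sq = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    K = np.exp(-0.5 * sq / width)
    if A.shape == B.shape and np.allclose(A, B):
        K = K + noise * np.eye(len(A))
    return K

# Hypothetical latent trajectory X (N frames x d latent dims).
rng = np.random.default_rng(2)
X = rng.normal(size=(10, 2))

# Reconstruction kernel K_Y over all N latent points.
K_Y = rbf_kernel(X, X)

# Dynamics kernel K_X over the first N-1 points; targets are x_2..x_N.
X_in, X_out = X[:-1], X[1:]
K_X = rbf_kernel(X_in, X_in)

# Dynamics log-likelihood term (up to constants):
# -d/2 log|K_X| - 1/2 tr(K_X^-1 X_out X_out^T)
_, logdet = np.linalg.slogdet(K_X)
dyn_ll = -0.5 * X.shape[1] * logdet \
    - 0.5 * np.trace(np.linalg.solve(K_X, X_out @ X_out.T))
```

Note the off-by-one structure: the dynamics kernel is built on the first N−1 latent points, because each point predicts its successor.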
Subspace dynamical model: Markov property. Remark: conditioned on the dynamics mapping A, the dynamical model is 1st-order Markov, but marginalizing out A introduces longer-range temporal dependence
Learning: to estimate the latent trajectories X and kernel hyperparameters α, β for the training motions Y, minimize the negative log of the GPDM posterior p(X, α, β | Y) ∝ p(Y | X, β) p(X | α) p(α) p(β) with respect to X, α, and β (reconstruction likelihood × dynamics likelihood × priors)
Motion capture data: ~2.5 gait cycles (157 frames) from the CMU motion capture database; 56 joint angles + 3 global translational velocities + 3 global orientation DOFs. Learned latent coordinates (1st-order prediction, RBF kernel)
3D GPLVM latent coordinates: large “jumps” in latent space
Reconstruction variance: volume visualization of σ²(x) (1st-order prediction, RBF kernel)
Motion simulation: animation of the mean motion (200-step sequence) from an initial state; random trajectories from MCMC (~1 gait cycle, 60 steps)
Simulation: 1st-order mean prediction. Red: 200 steps of mean prediction; green: 60-step MCMC mean (animation)
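Mean prediction rolls the learned dynamics forward by repeatedly applying the GP posterior mean, x_{t+1} = k(x_t, X_in)ᵀ K_X^{−1} X_out. A minimal numpy sketch on a hypothetical latent gait cycle (the circle data, kernel width, and step count are illustrative assumptions, not the talk's learned model):

```python
import numpy as np

rng = np.random.default_rng(3)
# Hypothetical learned latent trajectory: a noisy circle (stand-in for a gait cycle).
theta = np.linspace(0, 2 * np.pi, 40, endpoint=False)
X = np.column_stack([np.cos(theta), np.sin(theta)]) + 0.01 * rng.normal(size=(40, 2))
X_in, X_out = X[:-1], X[1:]

def rbf(A, B):
    sq = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-8.0 * sq)

K = rbf(X_in, X_in) + 1e-6 * np.eye(len(X_in))
alpha = np.linalg.solve(K, X_out)  # precomputed weights for the GP mean

def mean_step(x):
    # GP mean prediction of the next latent state: k(x, X_in)^T K^-1 X_out.
    return rbf(x[None, :], X_in)[0] @ alpha

# Roll out 200 steps of mean prediction from the first training state.
x = X[0].copy()
trajectory = [x]
for _ in range(200):
    x = mean_step(x)
    trajectory.append(x)
trajectory = np.array(trajectory)
```

Because each step feeds the previous prediction back in, small errors can accumulate, which is why the slides compare mean prediction against MCMC samples from the model.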
Linear kernel dynamics: 200 steps of mean prediction (animation)
Missing data: 50 of 147 frames dropped (almost a full gait cycle); baseline: spline interpolation
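The spline baseline is easy to reproduce on synthetic data; this sketch drops a 50-frame block from a hypothetical 1-D joint-angle track (the sinusoidal signal and gap location are illustrative assumptions):

```python
import numpy as np
from scipy.interpolate import CubicSpline

# Hypothetical 1-D joint-angle track over 147 frames; frames 50..99 are dropped.
frames = np.arange(147)
signal = np.sin(2 * np.pi * frames / 60.0)   # ~2.5 "gait cycles"
kept = np.concatenate([frames[:50], frames[100:]])

# Baseline fill-in: cubic spline through the surviving frames.
spline = CubicSpline(kept, signal[kept])
filled = spline(frames)
```

With a gap spanning almost a full cycle, the spline smoothly bridges the endpoints and misses the oscillation inside the gap entirely; a learned dynamical prior like the GPDM can fill in plausible motion instead.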
Missing Data: RBF Dynamics
Missing Data: Linear Dynamics
Determining hyperparameters: GPDM vs. Neil's parameters vs. MCEM. Data: six distinct walkers
Where do we go from here? Let's look at some limitations of the model (60 Hz vs. 120 Hz data)
What do we want? Phase and variation along a walk cycle in the latent space (x_1, x_2)
Branching motions: walk vs. run
Stylistic variation
Current work: manifold GPs, mapping a latent space (x) to the data space (y)
Summary: the GPLVM and GPDM provide motion priors from small data sets. Limitations: dependence on initialization, hyperpriors, and latent dimensionality. Open problems: modeling data topology and stylistic variation