Segmentation and tracking of the upper body model from range data with applications in hand gesture recognition Navin Goel Intel Corporation Department.

Slides:

Advertisements

Similar presentations

Gestures Recognition. Image acquisition Image acquisition at BBC R&D studios in London using eight different viewpoints. Sequence frame-by-frame segmentation.

Advertisements

We consider situations in which the object is unknown the only way of doing pose estimation is then building a map between image measurements (features)

Evidential modeling for pose estimation Fabio Cuzzolin, Ruggero Frezza Computer Science Department UCLA.

Probabilistic Tracking and Recognition of Non-rigid Hand Motion

Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.

Robust Speech recognition V. Barreaud LORIA. Mismatch Between Training and Testing n mismatch influences scores n causes of mismatch u Speech Variation.

Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.

Developable Surface Fitting to Point Clouds Martin Peternell Computer Aided Geometric Design 21(2004) Reporter: Xingwang Zhang June 19, 2005.

Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.

Wen-Hung Liao Department of Computer Science National Chengchi University November 27, 2008 Estimation of Skin Color Range Using Achromatic Features.

Hilal Tayara ADVANCED INTELLIGENT ROBOTICS 1 Depth Camera Based Indoor Mobile Robot Localization and Navigation.

3D Graphics Rendering and Terrain Modeling

Computer Graphics Visible Surface Determination. Goal of Visible Surface Determination To draw only the surfaces (triangles) that are visible, given a.

Learning to estimate human pose with data driven belief propagation Gang Hua, Ming-Hsuan Yang, Ying Wu CVPR 05.

Automatic Feature Extraction for Multi-view 3D Face Recognition

3/5/2002Phillip Saltzman Video Motion Capture Christoph Bregler Jitendra Malik UC Berkley 1997.

Real Time Motion Capture Using a Single Time-Of-Flight Camera

Segmentation and Fitting Using Probabilistic Methods

AlgirdasBeinaravičius Gediminas Mazrimas Salman Mosslem.

1 Formation et Analyse d’Images Session 3 Daniela Hall 14 October 2004.

Motion Tracking. Image Processing and Computer Vision: 82 Introduction Finding how objects have moved in an image sequence Movement in space Movement.

Exchanging Faces in Images SIGGRAPH ’04 Blanz V., Scherbaum K., Vetter T., Seidel HP. Speaker: Alvin Date: 21 July 2004.

Segmentation and Tracking of Multiple Humans in Crowded Environments Tao Zhao, Ram Nevatia, Bo Wu IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,

A Bayesian Formulation For 3d Articulated Upper Body Segmentation And Tracking From Dense Disparity Maps Navin Goel Dr Ara V Nefian Dr George Bebis.

A Bayesian algorithm for tracking multiple moving objects in outdoor surveillance video Department of Electrical Engineering and Computer Science The University.

Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.

Stereo and Multiview Sequence Processing. Outline Stereopsis Stereo Imaging Principle Disparity Estimation Intermediate View Synthesis Stereo Sequence.

Stereo Computation using Iterative Graph-Cuts

A Probabilistic Framework for Video Representation Arnaldo Mayer, Hayit Greenspan Dept. of Biomedical Engineering Faculty of Engineering Tel-Aviv University,

© 2004 by Davi GeigerComputer Vision March 2004 L1.1 Binocular Stereo Left Image Right Image.

A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.

Speech Technology Lab Ƅ ɜ: m ɪ ŋ ǝ m EEM4R Spoken Language Processing - Introduction Training HMMs Version 4: February 2005.

Real-Time Face Detection and Tracking Using Multiple Cameras RIT Computer Engineering Senior Design Project John RuppertJustin HnatowJared Holsopple This.

Learning and Recognizing Activities in Streams of Video Dinesh Govindaraju.

Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.

I mage and M edia U nderstanding L aboratory for Performance Evaluation of Vision-based Real-time Motion Capture Naoto Date, Hiromasa Yoshimoto, Daisaku.

Optical flow (motion vector) computation Course: Computer Graphics and Image Processing Semester:Fall 2002 Presenter:Nilesh Ghubade

Technology and Historical Overview. Introduction to 3d Computer Graphics  3D computer graphics is the science, study, and method of projecting a mathematical.

3D Fingertip and Palm Tracking in Depth Image Sequences

Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.

Associative Pattern Memory (APM) Larry Werth July 14, 2007

Multimodal Interaction Dr. Mike Spann

Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)

What we didn’t have time for CS664 Lecture 26 Thursday 12/02/04 Some slides c/o Dan Huttenlocher, Stefano Soatto, Sebastian Thrun.

Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.

Page 1 | Microsoft Work With Skeleton Data Kinect for Windows Video Courses Jan 2013.

EE 492 ENGINEERING PROJECT LIP TRACKING Yusuf Ziya Işık & Ashat Turlibayev Yusuf Ziya Işık & Ashat Turlibayev Advisor: Prof. Dr. Bülent Sankur Advisor:

Enforcing Constraints for Human Body Tracking David Demirdjian Artificial Intelligence Laboratory, MIT.

資訊工程系智慧型系統實驗室 iLab 南台科技大學 1 A Static Hand Gesture Recognition Algorithm Using K- Mean Based Radial Basis Function Neural Network 作者 :Dipak Kumar Ghosh,

A Region Based Stereo Matching Algorithm Using Cooperative Optimization Zeng-Fu Wang, Zhi-Gang Zheng University of Science and Technology of China Computer.

Expectation-Maximization (EM) Case Studies

CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.

3D Face Recognition Using Range Images

1 Formation et Analyse d’Images Session 4 Daniela Hall 10 October 2005.

Course14 Dynamic Vision. Biological vision can cope with changing world Moving and changing objects Change illumination Change View-point.

Visual Tracking by Cluster Analysis Arthur Pece Department of Computer Science University of Copenhagen

Stereo Vision Local Map Alignment for Robot Environment Mapping Computer Vision Center Dept. Ciències de la Computació UAB Ricardo Toledo Morales (CVC)

Image-Based Rendering Geometry and light interaction may be difficult and expensive to model –Think of how hard radiosity is –Imagine the complexity of.

11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.

Toward humanoid manipulation in human-centered environments T. Asfour, P. Azad, N. Vahrenkamp, K. Regenstein, A. Bierbaum, K. Welke, J. Schroder, R. Dillmann.

SPACE MOUSE. INTRODUCTION  It is a human computer interaction technology  Helps in movement of manipulator in 6 degree of freedom * 3 translation degree.

Student Gesture Recognition System in Classroom 2.0 Chiung-Yao Fang, Min-Han Kuo, Greg-C Lee, and Sei-Wang Chen Department of Computer Science and Information.

A Plane-Based Approach to Mondrian Stereo Matching

A segmentation and tracking algorithm

3D Rendering Pipeline Hidden Surface Removal 3D Primitives

Parallel Integration of Video Modules

“grabcut”- Interactive Foreground Extraction using Iterated Graph Cuts

Chapter 4 . Trajectory planning and Inverse kinematics

A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.

A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.

Presentation transcript:

Segmentation and tracking of the upper body model from range data with applications in hand gesture recognition Navin Goel Intel Corporation Department of Computer Science, University of Nevada, Reno

Overview n Introduction n Overall System n Upper Body Model n Segmentation Problem n Tracking n Color Based Segmentation n Results n Conclusion and Future Work

Introduction n Applications 3D editing system/ HCI systems, American Sign Language Recognition, Entertainment, Industrial Control, Video coding, teleconferencing n Requirements Background and illumination independent, Occlusions and self occlusions of the body components, Robust hand free initialization, Robust tracking.

Overall System Initial Segmentation Tracking Stereo (RGB+Z) video sequence Valid Track Invalid Track Color-based segmentation Hue Moments Calculation Train Reco Upper Body Model Color video sequence

Upper Body Model Ha l J C O O ij L FlFl UlUl HeT UrUr FrFr Ha r L Ha L He LTLT LULU WlWl ElEl SlSl NSrSr ErEr WrWr L Ha LFLF LULU LFLF

Head — Normal component model Upper Body Model Size Head Neck Planar component model Neck Width Torso Linear component models Elbow Wrist

Upper Body Model Linear PDF Parameters: Where, are the spherical coordinates of J c with the origin in J p The conditional probability of a joint Jc given its parent joint Jp and the anthropological measure L is given by: Where, K J c is a normalization constant, represent the minimum and maximum values of parameters

state assignments and joint for the arm and body (head &torso) regions. Stage IStage II Looking for all possible joint configuration is computationally impractical. Therefore, segmentation takes place in two stages. The Segmentation Problem Simplifying assumptions Notations Only one user is visible and his/hers torso is the largest body component, The torso plane is perpendicular to the camera and, Head is in vertical position.

Step 3 Compute Step 4 Estimate the joints: Step 1 Estimate the torso plane parameters from all data using EM. Estimate the torso and head bounding box, and the plane that includes N. Step 2 Estimate the head blob parameters from all data using EM. Step 5 Repeat steps 3-4 until convergence of The Upper Body Segmentation. Stage I

Step 1. For each possible arm parameters estimate the mean of the linear pdfs corresponding to the upper and fore arms, and the mean of the normal pdf for the hands, Step 2. For each joint configuration J A : a) compute the best state assignment of the observation vectors given the joint configuration, b) compute the observation likelihood given the joint configuration. Step 3. Find the max likelihood over all joint configuration and determine the “best” set of joints and the corresponding best state assignment. Given the fix positions of S l and S r, we sub sample the joint space to get N E =18 possible positions for each of the joints E l and E r. Given each position of the elbow joints we search for N W = 16 possible positions for each of the joints W l, W r. The Upper Body Segmentation. Stage II

Arm Tracking for each joint J p we build a set of [J c 1, J c 2, J c 3, J c 4, J c 5 ] five possible child joint positions such that each of them lies on the surface of the sphere with parent joint as the center. Z Y X Φ θ J c 1 = (r,Φ,θ) joint center from last frame J c3 = (r,Φ,θ+Δθ) J c 5 = (r,Φ+ΔΦ,θ) J c 4 = (r,Φ,θ-Δθ) Step 2 for each joint configuration we determine the best state assignment of the observations J c 2 = (r,Φ-ΔΦ,θ) Jc1Jc1 Jc2Jc2 Jc3Jc3 Jc5Jc5 Jc4Jc4 Step 3 the max log likelihood determines the best joint configuration. Step 1 estimate the mean of the linear pdfs corresponding to the upper and fore arms, and the mean of the normal pdf for the hands

Color Based Segmentation Pixels with no depth information cannot be assigned to body components by the previous segmentation algorithm. Need to estimate the depth of all pixels and perform global segmentation. Depth Segmentation

Color Based Segmentation In practice Suppose, k = “left forearm”, then l = “all the body components except left forearm”, and if Z k = “a” then Z l = “[z min … z max ] > a’’. Color Segmentation

Upper Body Segmentation and Tracking. Results

Contributions n Articulated upper body model from dense disparity maps, n Linear pdf for the fore arms and upper arms, n Hand free initialization of the system from the optimal joint configuration, n Upper body tracking, seen as a particular case of the initialization. Future work n Improvements to the background segmentation, n Learn the anthropological measures, n Integration with other HCI systems (gesture reco, face reco, speech reco, speaker identification etc.) Conclusion and Future Work