3D Fingertip and Palm Tracking in Depth Image Sequences

Slides:



Advertisements
Similar presentations
Simultaneous surveillance camera calibration and foot-head homology estimation from human detection 1 Author : Micusic & Pajdla Presenter : Shiu, Jia-Hau.
Advertisements

Víctor Ponce Miguel Reyes Xavier Baró Mario Gorga Sergio Escalera Two-level GMM Clustering of Human Poses for Automatic Human Behavior Analysis Departament.
Robust Part-Based Hand Gesture Recognition Using Kinect Sensor
Wrist Recognition and the Center of the Palm Estimation Based on Depth Camera Zhengwei Yao ; Zhigeng Pan ; Shuchang Xu Virtual Reality and Visualization.
Xin Zhang, Zhichao Ye, Lianwen Jin, Ziyong Feng, and Shaojie Xu
Multi-scenario Gesture Recognition Using Kinect Student : Sin- Jhu YE Student Id : MA Computer Engineering & Computer Science University of Louisville.
Real-Time Accurate Stereo Matching using Modified Two-Pass Aggregation and Winner- Take-All Guided Dynamic Programming Xuefeng Chang, Zhong Zhou, Yingjie.
Kiyoshi Irie, Tomoaki Yoshida, and Masahiro Tomono 2011 IEEE International Conference on Robotics and Automation Shanghai International Conference Center.
Cambridge, Massachusetts Pose Estimation in Heavy Clutter using a Multi-Flash Camera Ming-Yu Liu, Oncel Tuzel, Ashok Veeraraghavan, Rama Chellappa, Amit.
A Robust Method of Detecting Hand Gestures Using Depth Sensors Yan Wen, Chuanyan Hu, Guanghui Yu, Changbo Wang Haptic Audio Visual Environments and Games.
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
Adviser : Ming-Yuan Shieh Student ID : M Student : Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.
Proceedings of the British Machine Vision Conference (BMVC), 2010 Qi Wang, Xilin Chen, Wen Gao Skin Color Weighted Disparity Competition for Hand Segmentation.
1 Robust Video Stabilization Based on Particle Filter Tracking of Projected Camera Motion (IEEE 2009) Junlan Yang University of Illinois,Chicago.
A Bayesian Formulation For 3d Articulated Upper Body Segmentation And Tracking From Dense Disparity Maps Navin Goel Dr Ara V Nefian Dr George Bebis.
Rodent Behavior Analysis Tom Henderson Vision Based Behavior Analysis Universitaet Karlsruhe (TH) 12 November /9.
3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views
Real-time Hand Pose Recognition Using Low- Resolution Depth Images
Extension of M-VOTE: Improving Feature Detection
Presented by Pat Chan Pik Wah 28/04/2005 Qualifying Examination
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Segmentation and tracking of the upper body model from range data with applications in hand gesture recognition Navin Goel Intel Corporation Department.
Jason Li Jeremy Fowers Ground Target Following for Unmanned Aerial Vehicles.
Performance Evaluation of Grouping Algorithms Vida Movahedi Elder Lab - Centre for Vision Research York University Spring 2009.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’04) /04 $20.00 c 2004 IEEE 1 Li Hong.
A Fast and Robust Fingertips Tracking Algorithm for Vision-Based Multi-touch Interaction Qunqun Xie, Guoyuan Liang, Cheng Tang, and Xinyu Wu th.
Fingertip Tracking Based Active Contour for General HCI Application Proceedings of the First International Conference on Advanced Data and Information.
Robust Hand Tracking with Refined CAMShift Based on Combination of Depth and Image Features Wenhuan Cui, Wenmin Wang, and Hong Liu International Conference.
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
Multimodal Interaction Dr. Mike Spann
Zhengyou Zhang Microsoft Research Digital Object Identifier: /MMUL Publication Year: 2012, Page(s): Professor: Yih-Ran Sheu Student.
Human-Computer Interaction Human-Computer Interaction Tracking Hanyang University Jong-Il Park.
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
A Method for Hand Gesture Recognition Jaya Shukla Department of Computer Science Shiv Nadar University Gautam Budh Nagar, India Ashutosh Dwivedi.
3D SLAM for Omni-directional Camera
EE 492 ENGINEERING PROJECT LIP TRACKING Yusuf Ziya Işık & Ashat Turlibayev Yusuf Ziya Işık & Ashat Turlibayev Advisor: Prof. Dr. Bülent Sankur Advisor:
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 34, NO. 2, FEBRUARY Leonardo De-Maeztu, Arantxa Villanueva, Member, IEEE, and.
出處: Signal Processing and Communications Applications, 2006 IEEE 作者: Asanterabi Malima, Erol Ozgur, and Miijdat Cetin 2015/10/251 指導教授:張財榮 學生:陳建宏 學號: M97G0209.
A New Fingertip Detection and Tracking Algorithm and Its Application on Writing-in-the-air System The th International Congress on Image and Signal.
Online Kinect Handwritten Digit Recognition Based on Dynamic Time Warping and Support Vector Machine Journal of Information & Computational Science, 2015.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
Action as Space-Time Shapes
A survey of different shape analysis techniques 1 A Survey of Different Shape Analysis Techniques -- Huang Nan.
Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.
Chapter 5 Multi-Cue 3D Model- Based Object Tracking Geoffrey Taylor Lindsay Kleeman Intelligent Robotics Research Centre (IRRC) Department of Electrical.
Interactive Sand Art Drawing Using RGB-D Sensor
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
Fingertip Detection with Morphology and Geometric Calculation Dung Duc Nguyen ; Thien Cong Pham ; Jae Wook Jeon Intelligent Robots and Systems, IEEE/RSJ.
Jiu XU, Axel BEAUGENDRE and Satoshi GOTO Computer Sciences and Convergence Information Technology (ICCIT), th International Conference on 1 Real-time.
Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team
Supervisor: Nakhmani Arie Semester: Winter 2007 Target Recognition Harmatz Isca.
Su-ting, Chuang 1. Outline Introduction Related work Hardware configuration Detection system Optimal parameter estimation framework Conclusion 2.
Robust and Accurate Surface Measurement Using Structured Light IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, VOL. 57, NO. 6, JUNE 2008 Rongqian.
By Pushpita Biswas Under the guidance of Prof. S.Mukhopadhyay and Prof. P.K.Biswas.
A New Method for Crater Detection Heather Dunlop November 2, 2006.
Journal of Visual Communication and Image Representation
IEEE International Conference on Multimedia and Expo.
Line Matching Jonghee Park GIST CV-Lab..  Lines –Fundamental feature in many computer vision fields 3D reconstruction, SLAM, motion estimation –Useful.
Su-ting, Chuang 1. Outline Introduction Related work Hardware configuration Finger Detection system Optimal parameter estimation framework Conclusion.
Fast Human Detection in Crowded Scenes by Contour Integration and Local Shape Estimation Csaba Beleznai, Horst Bischof Computer Vision and Pattern Recognition,
11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.
Video object segmentation and its salient motion detection using adaptive background generation Kim, T.K.; Im, J.H.; Paik, J.K.;  Electronics Letters 
Recognition with Expression Variations
Approximate Models for Fast and Accurate Epipolar Geometry Estimation
Vehicle Segmentation and Tracking in the Presence of Occlusions
Presented by :- Vishal Vijayshankar Mishra
EE 492 ENGINEERING PROJECT
Presentation transcript:

3D Fingertip and Palm Tracking in Depth Image Sequences Hui Liang, Junsong Yuan and Daniel Thalmann Proceedings of the 20th ACM international conference on Multimedia 2012

Outline Introduction Related Work Proposed Method Experimental Results Conclusion

Introduction

Introduction Human hand is an essential body part for human-computer interaction. The positions of tracked fingertips: gesture estimation Difficulty in fingertip tracking: Anaglyph 紅藍眼鏡 Polarization 偏光眼鏡 – 在一般LCD TV前面貼上一層微相位差膜(Micro-retarder),利用光的偏振方向 來將左眼與右眼的影像分離 Shutter 快門眼鏡 - 更新頻率120Hz以上播放左、右眼視角畫面,藉由快速切換左右眼資訊,使得左右眼分別看到正確的左眼與右眼畫面,經過視覺暫留與大腦融合後,即可呈現出具 深度感的立體影像。 此技術所顯示的3D畫面解析度不會下降,且立體效果非常的好。 然而由於要提升3D影像品質,因此左眼與右眼觀看螢幕的時間非常短,使得整體亮度會下降許多,亦是目前研發上需克服的重點。 Side-by-side Bending Nearby

Introduction Many previous methods: In this paper: Only focus on extracting 2D fingertips Cannot track fingertips robustly for a freely moving hand In this paper: Present a robust fingertip and palm tracking scheme With the input of depth images (KINECT) Track the 3D fingertip positions quite accurately Anaglyph 紅藍眼鏡 Polarization 偏光眼鏡 – 在一般LCD TV前面貼上一層微相位差膜(Micro-retarder),利用光的偏振方向 來將左眼與右眼的影像分離 Shutter 快門眼鏡 - 更新頻率120Hz以上播放左、右眼視角畫面,藉由快速切換左右眼資訊,使得左右眼分別看到正確的左眼與右眼畫面,經過視覺暫留與大腦融合後,即可呈現出具 深度感的立體影像。 此技術所顯示的3D畫面解析度不會下降,且立體效果非常的好。 然而由於要提升3D影像品質,因此左眼與右眼觀看螢幕的時間非常短,使得整體亮度會下降許多,亦是目前研發上需克服的重點。

Related Work

Related work Focus only on 2D fingertips:[4][5][6][9] Based on contour analysis of the extracted hand region:[2][4][5][6] Usually can track the fingertips for only stretched fingers.

Related work In [6], Fingertips are tracked for infrared image sequences. It utilizes a template matching strategy Fingertip tracking : Kalman filter In [2], Stereoscopic vision is adopted Maximize the distance center of gravity of the hand & the boundary

Related work In [9] (Kinect), Minimum depth Depth < Threshold [9] J. L. Raheja, A. Chaudhary, and K. Singal. Tracking of fingertips and centres of palm using kinect, 2011. Proc. Of the 3rd Int’l Conf. on Computational Intelligence, Modelling and Simulation. Related work In [9] (Kinect), Minimum depth Depth < Threshold Circular filter

Related work [2] S. Consei1, S. Bourennane, and L. Martin. Three dimensional fingertip tracking in stereovision, 2005. Proc. of the 7th Int’l Conf. on Advanced Concepts for Intelligent Vision Systems. [4] K. Hsiao, T. Chen, and S. Chien. Fast fingertip positioning by combining particle filtering with particle random diffusion, 2008. Proc. IEEE Int’l Conf. on Multimedia and Expo. [5] I. Katz, K. Gabayan, and H. Aghajan. A multi-touch surface using multiple cameras, 2007. Proc. of the 9th Int’l Conf. on Advanced concepts for intelligent vision systems. [6] K. Oka, Y. Sato, and H. Koike. Real-time tracking of multiple fingertips and gesture recognition for augmented desk interface systems, 2002. Proc. IEEE Int’l Conf. on Automatic Face and Gesture Recognition. [9] J. L. Raheja, A. Chaudhary, and K. Singal. Tracking of fingertips and centres of palm using kinect, 2011. Proc. Of the 3rd Int’l Conf. on Computational Intelligence, Modelling and Simulation.

Proposed Method

Foreground Segmentation Overview Stereo images (Depth map) Foreground Segmentation Palm Localization Hand Segmentation Fingertip Detection Fingertip Tracking Fingertips

Hand and Palm Detection 1) Assume the hand is the nearest object 2) Constrain global hand rotation by: : global rotation angle of the hand Foreground Segmentation Palm Localization Hand

Hand and Palm Detection Foreground Segmentation  Palm Localization Threshold the depth frame to obtain the foreground F : p : a pixel coordinate z(p) : depth value (of point p ) z0 : the minimum depth value zD : depth threshold Hand Segmentation 0.2m foreground F

Hand and Palm Detection Foreground Segmentation Palm Localization The palm region is approximated with a circle: pp : the palm center (of point p ) rp : the radius Assume that hand palm forms a globally largest blob Cp equals to the largest inscribed circle of the contour of F . 2D Kalman filter : reduce computational complexity Hand Segmentation

Hand and Palm Detection Foreground Segmentation Palm Localization Separate hand and forearm by a line: 1) Tangent to Cp 2) Perpendicular to the orientation of the forearm Orientation of the forearm : The Eigenvector that corresponds to the largest Eigenvalue of the covariance matrix of the contour pixel coordinates of F Hand Segmentation Hand region : FV(2D) → FD(3D)

Fingertip Detection and Tracking Constraints on possible fingertip locations: 1) Only in depth discontinuous region ( in contour Fv) 2) | Depth(one point) – Depth(neighborhoods) | are important. 3) Utilize the 3D geodesic shortest path (GSP) Fingertip vs. Non-fingertip Nearby Fingertips Fingertip Detection Tracking

Fingertip Detection Fingertip detection Fingertip tracking Goal: detect all five fingertips in the depth image Based on three depth-based features Build a graph G : Vh : contains of all points within FV (hand contour) Eh : for each pair of vertices(p,q), 1) they are in the 8-neighborhood of each other 2) their 3D distance 𝑑 𝑝,𝑞 = 𝑝−𝑞 2 is within threshold τ Edge weight : 3D Euclidean distance

Fingertip Detection Calculate Geodesic distance dg(p): Fingertip tracking Calculate Geodesic distance dg(p): From palm center pp for each vertex ∈ Vh Dijkstra graph search on Gh GSP point set Ug(p): The set of vertices on the shortest path from pp to p Rectangle local feature RL(p): Describe the neighborhood of a point p in FV 𝜂(𝑝) : ratio of 1s 1 X 1cm

Fingertip Detection Calculate Geodesic distance dg(p): 𝜂(𝑝) dg(p) Fingertip tracking Calculate Geodesic distance dg(p): 𝜂(𝑝) 0.4 dg(p) fingertips

Fingertip Detection Fingertip labeling: Fingertip tracking Fingertip labeling: Nmax=6 j : label 𝚪 1 2 3 4 5 𝜞(𝒊,𝒋) :estimate the probability that 𝑝 𝑓 𝑖 has the label lj 𝑵 : number of GSP points 𝒌 : kth GSP point i : fingertip

Fingertip Detection Fingertip labeling: Fingertip detection Fingertip tracking Fingertip labeling:

Fingertip Tracking Build a particle filter for each fingertip Fingertip detection Fingertip tracking Build a particle filter for each fingertip (x, ω) denote a particle x : 2D position in FV ω : the particle weight Constrain the positions of each particle to the border point set UB to reduce the search space

Fingertip Tracking Likelihood function : / / pre now 𝑹𝑳(𝒑) Metric Fingertip detection Fingertip tracking Likelihood function : pre now 𝑹𝑳(𝒑) Metric parameters difference Hausdorff difference feature difference Geodesic distance / dg(p) GSP points Neighbor depth /

Fingertip Tracking Likelihood function : Fingertip detection

Experimental Results

Experimental Results Quantitative results on synthetic sequences: ‧Error : Euclidean distance ‧Ground truth : phalanx end point Quantitative results on synthetic sequences: Seq. No. Motion Seq. 1 grasping Seq. 4 flexion Seq. 2 adduction/abduction Seq. 5 global rotation Seq. 3 successive single finger Seq. 6 combination of grasping and global rotation

Experimental Results

Experimental Results Virtual object grasping:

Conclusion

Conclusion Using multiple depth-based features for accurate fingertip localization Adopting a particle filter to track the fingertips over successive frames Track the 3D positions of fingertips robustly Great potential for extension to other HCI applications