Presentation is loading. Please wait.

Presentation is loading. Please wait.

Robust Part-Based Hand Gesture Recognition Using Kinect Sensor

Similar presentations


Presentation on theme: "Robust Part-Based Hand Gesture Recognition Using Kinect Sensor"β€” Presentation transcript:

1 Robust Part-Based Hand Gesture Recognition Using Kinect Sensor
IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 15, NO. 5, AUGUST 2013 Zhou Ren, Junsong Yuan, Member, IEEE, Jingjing Meng, Member, IEEE, and Zhengyou Zhang, Fellow, IEEE

2 Outline Introduction Part-Based Hand Gesture Recognition Evaluation
A. Hand Detection B. Hand Gesture Recognition C. Finger Detection Evaluation Application Conclusions

3 Outline Introduction Part-Based Hand Gesture Recognition Evaluation
A. Hand Detection B. Hand Gesture Recognition C. Finger Detection Evaluation Application Conclusions

4 Introduction Hand gesture recognition is of great importance for human-computer interaction (HCI). Traditional vision-based hand gesture recognition methods are still far from satisfactory for real-life applications. sensitive to lighting conditions cluttered backgrounds Optical sensor based methods are usually unable to detect and track the hands robustly.

5 Introduction(Cont.) To enable a more robust hand gesture recognition through the data glove It not affected by lighting conditions or cluttered backgrounds However, as it requires the user to wear a data glove and sometimes requires calibration. It is inconvenient to use and may hinder the natural articulation of hand gesture.

6 Introduction(Cont.) Using the Kinect sensor can also detect and segment the hands robustly, thus it provides a valid base for gesture recognition. But it is difficult to detect and segment a small object from an image with this resolution(640X480) e.g., a human hand which occupies a very small portion of the image with more complex articulations.

7 Introduction(Cont.) 1.Gesture recognition using Kinect sensor
122 3 1.Gesture recognition using Kinect sensor 2.Skeleton representations 3.Part-based representations

8 Introduction(Cont.) Using the proposed distance metric, Finger-Earth Mover’s Distance, We can classify the first two hands as the same gesture and handle the noisy hand shapes obtained by Kinect sensor.

9 Introduction(Cont.) The framework of our part-based hand gesture recognition system

10 Introduction(Cont.) The main contributions of this paper
Propose a part-based hand gesture recognition system, based on a novel distance metric Finger Earth Mover Distance (FEMD). Demonstrate our hand gesture recognition algorithm in two HCI applications.

11 Outline Introduction Part-Based Hand Gesture Recognition Evaluation
A. Hand Detection B. Hand Gesture Recognition C. Finger Detection Evaluation Application Conclusions

12 Part-Based Hand Gesture Recognition (Hand Detection)
(a) The rough hand segmented by depth thresholding; (b) A more accurate hand detected with black belt (the green line), the initial point (the red point) and the center point (the cyan point); (c) Its time-series curve representation.

13 Part-Based Hand Gesture Recognition (Hand Detection Cont. )
The horizontal axis denotes the angle between each contour vertex and the initial point relative to the center point, normalized by 360 . The vertical axis denotes the Euclidean distance between the contour vertices and the center point, normalized by the radius of the maximal inscribed circle.

14 Outline Introduction Part-Based Hand Gesture Recognition Evaluation
A. Hand Detection B. Hand Gesture Recognition C. Finger Detection Evaluation Application Conclusions

15 Part-Based Hand Gesture Recognition (Hand Gesture Recognition)
Template Matching Use template matching for recognition, i.e., the input hand is recognized as the class with which it has the minimum dissimilarity distance: 𝑐= arg min 𝑐 𝐹𝐸𝑀𝐷(𝐻, 𝑇 𝑐 ) 𝑯 is the input hand; 𝑻 𝒄 is the template of class 𝑐; 𝑭𝑬𝑴𝑫(𝑯, 𝑻 𝒄 ) denotes the proposed Finger-Earth Mover’s Distance between the input hand and each template;

16 Part-Based Hand Gesture Recognition (Hand Gesture Recognition Cont.)
Introduce Finger-Earth Mover’s Distance

17 Part-Based Hand Gesture Recognition (Hand Gesture Recognition Cont.)
let 𝑅= π‘Ÿ 1 , πœ” π‘Ÿ 1 ,…, π‘Ÿ π‘š , πœ” π‘Ÿ π‘š be the first hand signature with π‘š clusters, where π‘Ÿ 𝑖 is the cluster representative and πœ” π‘Ÿ 𝑖 is the weight of the cluster; 𝑇= 𝑑 1 , πœ” 𝑑 1 ,…, π‘Ÿ 𝑛 , πœ” 𝑑 𝑛 is the second hand signature with 𝑛 clusters. π‘Ÿ 𝑖 is the angle interval between the endpoints of each segment. πœ” π‘Ÿ 𝑖 is the normalized area within the finger segment

18 Part-Based Hand Gesture Recognition (Hand Gesture Recognition Cont.)
Define Ground Distance 𝑑 𝑖𝑗 = 0, π‘Ÿ 𝑖 π‘‘π‘œπ‘‘π‘Žπ‘™π‘™π‘¦ π‘œπ‘£π‘’π‘Ÿπ‘™π‘Žπ‘ π‘€π‘–π‘‘β„Ž 𝑑 𝑗 , π‘šπ‘–π‘› π‘Ÿ π‘–π‘Ž βˆ’ 𝑑 π‘—π‘Ž , π‘Ÿ 𝑖𝑏 βˆ’ 𝑑 𝑗𝑏 , π‘œπ‘‘β„Žπ‘’π‘Ÿπ‘€π‘–π‘ π‘’. 𝐷=[ 𝑑 𝑖𝑗 ] 𝑖𝑠 π‘‘β„Žπ‘’ π‘”π‘Ÿπ‘œπ‘’π‘›π‘‘ π‘‘π‘–π‘ π‘‘π‘Žπ‘›π‘π‘’ π‘šπ‘Žπ‘‘π‘Ÿπ‘–π‘₯ π‘œπ‘“ π‘ π‘–π‘”π‘›π‘Žπ‘‘π‘’π‘Ÿπ‘’ 𝑅 π‘Žπ‘›π‘‘ 𝑇 . 𝑑 𝑖𝑗 is the ground distance from cluster π‘Ÿ 𝑖 π‘‘π‘œ 𝑑 𝑗 and defined as the minimum moving distance for interval [ π‘Ÿ π‘–π‘Ž, π‘Ÿ 𝑖𝑏 ] to totally overlap with [ 𝑑 π‘—π‘Ž, 𝑑 𝑗𝑏 ]

19 Part-Based Hand Gesture Recognition (Hand Gesture Recognition Cont.)
Define FEMD Distance Between Signatures R and T 𝐹𝐸𝐷𝑀 𝑅,𝑇 =𝛽 𝐸 π‘šπ‘œπ‘£π‘’ + 1βˆ’π›½ 𝐸 π‘’π‘šπ‘π‘‘π‘¦, = 𝛽 𝑖=1 π‘š 𝑗=1 𝑛 𝑑 𝑖𝑗 𝑓 𝑖𝑗 +(1βˆ’π›½) 𝑖=1 π‘š πœ” π‘Ÿ 𝑖 βˆ’ 𝑗=1 𝑛 πœ” 𝑑 𝑗 𝑖=1 π‘š 𝑗=1 𝑛 𝑓 𝑖𝑗 𝑖=1 π‘š 𝑗=1 𝑛 𝑓 𝑖𝑗 is the normalization factor. 𝑓 𝑖𝑗 𝑖𝑠 the flow from cluster π‘Ÿ 𝑖 π‘‘π‘œ π‘π‘™π‘’π‘ π‘‘π‘’π‘Ÿ 𝑑 𝑗 .

20 Outline Introduction Part-Based Hand Gesture Recognition Evaluation
A. Hand Detection B. Hand Gesture Recognition C. Finger Detection Evaluation Application Conclusions

21 Part-Based Hand Gesture Recognition (Finger Detection)
Near-Convex Decomposition Minimum Near-Convex Decomposition (MNCD) It is generally complexly formulated and cannot be solved in real time min 𝛼 π‘₯ βˆ’π›Ό 𝑀 ⊺ π‘₯, 𝑠.𝑑. 𝐴π‘₯β‰₯1, 𝐡π‘₯≀1, π‘₯∈ 0,1 𝑛 . Thresholding Decomposition Define a threshold of height to segment.

22 Part-Based Hand Gesture Recognition (Finger Detection Cont.)
Near-Convex Decomposition Thresholding Decomposition

23 Outline Introduction Part-Based Hand Gesture Recognition Evaluation
A. Hand Detection B. Hand Gesture Recognition C. Finger Detection Evaluation Application Conclusions

24 Evaluation Robustness to Cluttered Backgrounds
(a) The hand that is cluttered by background can be detected accurately. (b) The hand that is cluttered by face can be detected accurately.

25 Evaluation(Cont.) Robustness to Distortions and Hand Variations in Orientation, Scale. (a) The hands with orientation changes, and their time-series curves. (b) The hands with scale changes, and their time-series curves.

26 Evaluation(Cont.) The system is insensitive to the distortions and articulation 𝐹𝐸𝐷𝑀 𝑅,𝑇 =𝛽 𝐸 π‘šπ‘œπ‘£π‘’ + 1βˆ’π›½ 𝐸 π‘’π‘šπ‘π‘‘π‘¦, = 𝛽 𝑖=1 π‘š 𝑗=1 𝑛 𝑑 𝑖𝑗 𝑓 𝑖𝑗 +(1βˆ’π›½) 𝑖=1 π‘š πœ” π‘Ÿ 𝑖 βˆ’ 𝑗=1 𝑛 πœ” 𝑑 𝑗 𝑖=1 π‘š 𝑗=1 𝑛 𝑓 𝑖𝑗 𝐹𝑖𝑔. π‘Ž π‘Žπ‘›π‘‘ 𝑏 β„Žπ‘Žπ‘£π‘’ 2 π‘π‘™π‘’π‘ π‘‘π‘’π‘Ÿπ‘ : π‘Ÿ 1 , πœ” π‘Ÿ 1 , π‘Ÿ 2 , πœ” π‘Ÿ 2 𝐹𝑖𝑔. 𝑐 π‘Žπ‘›π‘‘ 𝑑 π‘œπ‘›π‘™π‘¦ β„Žπ‘Žπ‘£π‘’ 1 π‘π‘™π‘’π‘ π‘‘π‘’π‘Ÿ :{( 𝑑 1 , πœ” 𝑑 1 } β†’ ( πœ” π‘Ÿ 1 + πœ” π‘Ÿ 2 )β‰ˆ πœ” 𝑑 1 ground distance 𝑑 11 , 𝑑 21 β‰ˆ0

27 Evaluation(Cont.) Accuracy and Efficiency
Experiment I: Thresholding Decomposition + FEMD β„Ž 𝑓 =1.6 and the FEMD parameter 𝛽=0.5 The mean accuracy is 93.2%. The mean running time is s. Two pairs of confusing gestures in Experiment I. (a) Gesture 4 and 5. (b) Gesture 1 and 8. The confusion matrix of Experiment I.

28 Evaluation(Cont.) Experiment II: Near-Convex Decomposition + FEMD
𝛼=0.5 and the FEMD parameter 𝛽=0.5 . The mean accuracy is 93.9%. The mean running time is s. Finger Detection results of Experiment II using near-convex decomposition algorithm. The confusion matrix of Experiment II.

29 Outline Introduction Part-Based Hand Gesture Recognition Evaluation
A. Hand Detection B. Hand Gesture Recognition C. Finger Detection Evaluation Application Conclusions

30 Application Arithmetic Computation
The 14 gesture commands in our arithmetic computation system. Arithmetic computation.

31 Application(Cont.) Rock-Paper-Scissors Game Rock-paper-scissors game.

32 Outline Introduction Part-Based Hand Gesture Recognition Evaluation
A. Hand Detection B. Hand Gesture Recognition C. Finger Detection Evaluation Application Conclusions

33 Conclusions A novel distance metric, Finger-Earth Mover’s Distance (FEMD), is used for dissimilarity measure, FEMD based hand gesture recognition system achieves 93.2% mean accuracy and runs in s per frame The real-life HCI applications we built on top of our hand gesture recognition system. Future research will focus on the efficiency drawback of near-convex decomposition based finger detection method


Download ppt "Robust Part-Based Hand Gesture Recognition Using Kinect Sensor"

Similar presentations


Ads by Google