A Modified EM Algorithm for Hand Gesture Segmentation in RGB-D Data 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) July 6-11, 2014, Beijing,

Slides:

Advertisements

Similar presentations

Bayesian Belief Propagation

Advertisements

Miroslav Hlaváč Martin Kozák Fish position determination in 3D space by stereo vision.

Víctor Ponce Miguel Reyes Xavier Baró Mario Gorga Sergio Escalera Two-level GMM Clustering of Human Poses for Automatic Human Behavior Analysis Departament.

Wrist Recognition and the Center of the Palm Estimation Based on Depth Camera Zhengwei Yao ; Zhigeng Pan ; Shuchang Xu Virtual Reality and Visualization.

Real-Time Hand Gesture Recognition with Kinect for Playing Racing Video Games 2014 International Joint Conference on Neural Networks (IJCNN) July 6-11,

Xin Zhang, Zhichao Ye, Lianwen Jin, Ziyong Feng, and Shaojie Xu

Automatic measurement of pores and porosity in pork ham and their correlations with processing time, water content and texture JAVIER MERÁS FERNÁNDEZ MSc.

Foreground Background detection from video Foreground Background detection from video מאת : אבישג אנגרמן.

Automatic Identification of Bacterial Types using Statistical Image Modeling Sigal Trattner, Dr. Hayit Greenspan, Prof. Shimon Abboud Department of Biomedical.

Adviser ： Ming-Yuan Shieh Student ID ： M Student ： Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.

A Versatile Depalletizer of Boxes Based on Range Imagery Dimitrios Katsoulas*, Lothar Bergen*, Lambis Tassakos** *University of Freiburg **Inos Automation-software.

EE-148 Expectation Maximization Markus Weber 5/11/99.

Broadcast Court-Net Sports Video Analysis Using Fast 3-D Camera Modeling Jungong Han Dirk Farin Peter H. N. IEEE CSVT 2008.

Active Calibration of Cameras: Theory and Implementation Anup Basu Sung Huh CPSC 643 Individual Presentation II March 4 th,

Boundary matting for view synthesis Samuel W. Hasinoff Sing Bing Kang Richard Szeliski Computer Vision and Image Understanding 103 (2006) 22–32.

First introduced in 1977 Lots of mathematical derivation Problem : given a set of data (data is incomplete or having missing values). Goal : assume the.

Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.

A Bayesian Formulation For 3d Articulated Upper Body Segmentation And Tracking From Dense Disparity Maps Navin Goel Dr Ara V Nefian Dr George Bebis.

Expectation Maximization Method Effective Image Retrieval Based on Hidden Concept Discovery in Image Database By Sanket Korgaonkar Masters Computer Science.

Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.

Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.

Learning Multiple Evolutionary Pathways from Cross-sectional Data Niko Beerenwinkel, Jorg Rahnenfuhrer, Martin Daumer, Daniel Hoffmann,Rolf Kaiser, Joachim.

1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.

ECEn 670 Mini-Conference29-Nov.-2011Everett Bryan, Bryce Pincock Velocity Estimation using the Kinect Sensor Everett Bryan Bryce Pincock 29-Nov

Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques

My Research Experience Cheng Qian. Outline 3D Reconstruction Based on Range Images Color Engineering Thermal Image Restoration.

A Brief Overview of Computer Vision Jinxiang Chai.

Presented by: Kamakhaya Argulewar Guided by: Prof. Shweta V. Jain

Introduction Kinect for Xbox 360, referred to as Kinect, is developed by Microsoft, used in Xbox 360 video game console and Windows PCs peripheral equipment.

3D Fingertip and Palm Tracking in Depth Image Sequences

ICPR/WDIA-2012 High Quality Novel View Synthesis Based on Low Resolution Depth Image and High Resolution Color Image Jui-Chiu Chiang, Zheng-Feng Liu, and.

Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)

What we didn’t have time for CS664 Lecture 26 Thursday 12/02/04 Some slides c/o Dan Huttenlocher, Stefano Soatto, Sebastian Thrun.

KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.

REU Project RGBD gesture recognition with the Microsoft Kinect Steven Hickson.

Miguel Reyes 1,2, Gabriel Dominguez 2, Sergio Escalera 1,2 Computer Vision Center (CVC) 1, University of Barcelona (UB) 2

1 Physical Fluctuomatics 5th and 6th Probabilistic information processing by Gaussian graphical model Kazuyuki Tanaka Graduate School of Information Sciences,

A Method for Hand Gesture Recognition Jaya Shukla Department of Computer Science Shiv Nadar University Gautam Budh Nagar, India Ashutosh Dwivedi.

Takuya Matsuo, Norishige Fukushima and Yutaka Ishibashi

Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.

A Region Based Stereo Matching Algorithm Using Cooperative Optimization Zeng-Fu Wang, Zhi-Gang Zheng University of Science and Technology of China Computer.

14 October, 2010LRI Seminar 2010 (Univ. Paris-Sud)1 Statistical performance analysis by loopy belief propagation in probabilistic image processing Kazuyuki.

Bo QIN, Zongshun MA, Zhenghua FANG, Shengke WANG Computer-Aided Design and Computer Graphics, th IEEE International Conference on, p Presenter.

Human pose recognition from depth image MS Research Cambridge.

A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.

Expectation-Maximization (EM) Case Studies

Interactive Sand Art Drawing Using RGB-D Sensor

A Flexible New Technique for Camera Calibration Zhengyou Zhang Sung Huh CSPS 643 Individual Presentation 1 February 25,

Approximate Inference: Decomposition Methods with Applications to Computer Vision Kyomin Jung ( KAIST ) Joint work with Pushmeet Kohli (Microsoft Research)

Inferring High-Level Behavior from Low-Level Sensors Donald J. Patterson, Lin Liao, Dieter Fox, and Henry Kautz.

Final Review Course web page: vision.cis.udel.edu/~cv May 21, 2003  Lecture 37.

Skeleton Based Action Recognition with Convolutional Neural Network

Chapter 8. Learning of Gestures by Imitation in a Humanoid Robot in Imitation and Social Learning in Robots, Calinon and Billard. Course: Robots Learning.

REU Project RGBD gesture recognition with the Microsoft Kinect.

Journal of Visual Communication and Image Representation

Multi Scale CRF Based RGB-D Image Segmentation Using Inter Frames Potentials Taha Hamedani Robot Perception Lab Ferdowsi University of Mashhad The 2 nd.

Representing Moving Images with Layers J. Y. Wang and E. H. Adelson MIT Media Lab.

Marco Maisto, Massimo Panella, Luca Liparulo, and Andrea Proietti

Presenter: Jae Sung Park

Che-An Wu Background substitution. Background Substitution AlphaMa p Trimap Depth Map Extract the foreground object and put into another background Objective.

Microsoft Kinect How does a machine infer body position?

Video object segmentation and its salient motion detection using adaptive background generation Kim, T.K.; Im, J.H.; Paik, J.K.; Electronics Letters

Processing visual information for Computer Vision

Image Segmentation Techniques

RGB-D Image for Scene Recognition by Jiaqi Guo

PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD

Video Compass Jana Kosecka and Wei Zhang George Mason University

RGBD gesture recognition with the Microsoft Kinect

Sign Language Recognition With Unsupervised Feature Learning

Optimization under Uncertainty

Presentation transcript:

A Modified EM Algorithm for Hand Gesture Segmentation in RGB-D Data 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) July 6-11, 2014, Beijing, China Zhaojie Ju, Yuehui Wang, Wei Zeng, Haibin Cai and Honghai Liu

outline Introduction Problem of hand gesture segmentation via Kinect Hand gesture segmentation using a modified EM algorithm Experiment results Concluding remarks

Introduction the problem of acquisition and recognition of human hand gesture from RGB-Depth (RGB-D) sensors, such as Microsofts Kinect, is an important subject in the area of the computer vision there are many problems such as distortion and disaccord of depth and RGB images in corresponding pixels

Introduction In computer vision, camera calibration is a necessary step in scene reconstruction in order to extract metric information from images Colour camera calibration has been studied extensively and different calibration techniques have been developed for depth sensors depending on the circumstances the calibration of RGB image and depth map is much essential for their consistency and synchronization

outline Introduction Problem of hand gesture segmentation via Kinect Hand gesture segmentation using a modified EM algorithm Experiment results Concluding remarks

Problem of hand gesture segmentation via Kinect due to the limitations of the systematic design and the random errors, the alignment of the depth and RGB images highly relies on the identification of the mathematical model of the measurement and the calibration parameters involved A proprietary algorithm is used to calibrate Kinect devices when manufacturing, and these calibrated parameters stored in the devices’ internal memory are used to perform the image construction

Problem of hand gesture segmentation via Kinect

The official calibration is adequate for human body motion analysis or casual use, but it lacks accuracy in hand gesture segmentation and recognition due to the noises and holes in the RGB-D data, the colour map of the human hand can not be effectively segmented using only the depth information an Expectation-Maximisation (EM) algorithm is proposed to further adjust the segmentation edge based on the depth map, RGB image and locations of the pixels.

outline Introduction Problem of hand gesture segmentation via Kinect Hand gesture segmentation using a modified EM algorithm Experiment results Concluding remarks

Bayesian networks

Hand gesture segmentation using a modified EM algorithm the probability of this pixel belonging to the hand gesture can be expressed by p(H = 1|RGB,D,L) or p(H|RGB,D,L) D is Depth L is Location H is a binary variable

Hand gesture segmentation using a modified EM algorithm The events of RGB, Depth and Location can be reasonably assumed to be independent, and according to the Bayesian Network p(RGB|H) is the probability of the RGB value given that this pixel is part of a hand and it assumes to be a Gaussian distribution with a mean of μ RGBH and a covariance of Σ RGBH p(D|H) is the probability of the depth value given this pixel is part of a hand and it assumes to be Gaussian distributed with a mean of μ D and a covariance of Σ D p(L|H) is the probability of the pixel location given this pixel belongs to a hand

Hand gesture segmentation using a modified EM algorithm the function dist(L) is to get the minimum distance between the pixel and the hand edge dist(L) is negative when the pixel is inside of the edge and positive when outside of the edge is the probability of this pixel not belonging to a hand, and = 1-p(H) is the probability of the RGB value given that this pixel is part of the background and it assumes to be a Gaussian distribution with a mean of μ RGBB and a covariance of Σ RGBB

Hand gesture segmentation using a modified EM algorithm Θ = Eq.4 is called the likelihood of the parameters given the data, or the likelihood function

Hand gesture segmentation using a modified EM algorithm the objective is to estimate the parameters set Θ that maximizes

Hand gesture segmentation using a modified EM algorithm

outline Introduction Problem of hand gesture segmentation via Kinect Hand gesture segmentation using a modified EM algorithm Experiment results Concluding remarks

Experiment results

outline Introduction Problem of hand gesture segmentation via Kinect Hand gesture segmentation using a modified EM algorithm Experiment results Concluding remarks

The proposed alignment method employs genetic algorithms to estimate the key points from both depth and RGB images due to the noise and holes in the depth map, the segmented result using the depth information has lots of mismatched pixels, which need further adjustment the proposed methods precisely segment the hand gestures and are much better than the official calibrated images and the results with only the proposed alignment method Our future research will be on the efficiency improvement of the proposed methods to adapt to the real-time applications