Potential Projects RGBD gesture recognition with the Microsoft Kinect Person recognition by parts.

Slides:



Advertisements
Similar presentations
1 Gesture recognition Using HMMs and size functions.
Advertisements

Gesture recognition using salience detection and concatenated HMMs Ying Yin Randall Davis Massachusetts Institute.
Combining Detectors for Human Hand Detection Antonio Hernández, Petia Radeva and Sergio Escalera Computer Vision Center, Universitat Autònoma de Barcelona,
Weiwei Zhang, Jian Sun, and Xiaoou Tang, Fellow, IEEE.
Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights.
Lecture 31: Modern object recognition
Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.
Steerable Part Models Hamed Pirsiavash and Deva Ramanan
Real Time Motion Capture Using a Single Time-Of-Flight Camera
Probability-based Dynamic Time Warping for Gesture Recognition on RGB-D data All rights reserved HuBPA© Human Pose Recovery and Behavior Analysis Group.
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
SOMM: Self Organizing Markov Map for Gesture Recognition Pattern Recognition 2010 Spring Seung-Hyun Lee G. Caridakis et al., Pattern Recognition, Vol.
Move With Me S.W Graduation Project An Najah National University Engineering Faculty Computer Engineering Department Supervisor : Dr. Raed Al-Qadi Ghada.
More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.
Student: Yao-Sheng Wang Advisor: Prof. Sheng-Jyh Wang ARTICULATED HUMAN DETECTION 1 Department of Electronics Engineering National Chiao Tung University.
Recent Developments in Human Motion Analysis
Recognition of Human Gait From Video Rong Zhang, C. Vogler, and D. Metaxas Computational Biomedicine Imaging and Modeling Center Rutgers University.
3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views
Object Recognition Using Distinctive Image Feature From Scale-Invariant Key point D. Lowe, IJCV 2004 Presenting – Anat Kaspi.
CSCE 641 Computer Graphics: Image-based Rendering (cont.) Jinxiang Chai.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
What, Where & How Many? Combining Object Detectors and CRFs
FACE DETECTION AND RECOGNITION By: Paranjith Singh Lohiya Ravi Babu Lavu.
Generic object detection with deformable part-based models
Human tracking and counting using the KINECT range sensor based on Adaboost and Kalman Filter ISVC 2013.
A Tutorial on Object Detection Using OpenCV
Multimedia Specification Design and Production 2013 / Semester 2 / week 8 Lecturer: Dr. Nikos Gazepidis
M4 – Video Processing, Brno University of Technology1 M4 – Video Processing Igor Potůček, Michal Španěl, Ibrahim Abu Kteish, Olivier Lai Kan Thon, Pavel.
Introduction to Computer Vision Olac Fuentes Computer Science Department University of Texas at El Paso El Paso, TX, U.S.A.
Abstract Developing sign language applications for deaf people is extremely important, since it is difficult to communicate with people that are unfamiliar.
REU Project RGBD gesture recognition with the Microsoft Kinect Steven Hickson.
Hurieh Khalajzadeh Mohammad Mansouri Mohammad Teshnehlab
Miguel Reyes 1,2, Gabriel Dominguez 2, Sergio Escalera 1,2 Computer Vision Center (CVC) 1, University of Barcelona (UB) 2
A General Framework for Tracking Multiple People from a Moving Camera
Object Detection with Discriminatively Trained Part Based Models
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
M Institute for Human-Machine Communication Munich University of Technology Sascha Schreiber Face Tracking and Person Action.
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
ECE 8443 – Pattern Recognition EE 3512 – Signals: Continuous and Discrete Objectives: Spectrograms Revisited Feature Extraction Filter Bank Analysis EEG.
Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.
Head Tracking in Meeting Scenarios Sascha Schreiber.
A Face processing system Based on Committee Machine: The Approach and Experimental Results Presented by: Harvest Jang 29 Jan 2003.
Human pose recognition from depth image MS Research Cambridge.
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
Training and Evaluating of Object Bank Models Presenter : Changyu Liu Advisor : Prof. Alex Interest : Multimedia Analysis May 16 th, 2013.
Individual Recognition Using Gait Energy Image Ju Han and Bir Bhanu IEEE TPAMI, VOL. 28, NO. 2, 2006.
Recognition Using Visual Phrases
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
REU Project RGBD gesture recognition with the Microsoft Kinect.
Object Recognizing. Object Classes Individual Recognition.
REU Project 4D Efficient Real-Time Graph Based Temporal Point Cloud Segmentation Steven Hickson.
More sliding window detection: Discriminative part-based models
Object Recognizing. Object Classes Individual Recognition.
Deep Belief Network Training Same greedy layer-wise approach First train lowest RBM (h 0 – h 1 ) using RBM update algorithm (note h 0 is x) Freeze weights.
Hierarchical Motion Evolution for Action Recognition Authors: Hongsong Wang, Wei Wang, Liang Wang Center for Research on Intelligent Perception and Computing,
Date of download: 7/8/2016 Copyright © 2016 SPIE. All rights reserved. A scalable platform for learning and evaluating a real-time vehicle detection system.
Introduction to Skin and Face Detection
Object detection with deformable part-based models
Calculate HOC on Depth and HOG on RGB and concatenate them
Iterative Optimization
“The Truth About Cats And Dogs”
Human Detection using depth
Introduction to Object Tracking
RGBD gesture recognition with the Microsoft Kinect
Triplet Person Detection and Behavior Estimation
Problem Image and Volume Segmentation:
Sign Language Recognition With Unsupervised Feature Learning
Presentation transcript:

Potential Projects RGBD gesture recognition with the Microsoft Kinect Person recognition by parts

RGDB Gesture Recognition Read through multiple papers and it seems like modifying existing algorithms based on Hidden Markov Models is the best way to go. Most of the top performers of the ChaLearn Gesture challenge only used depth; however, improvements could be made by using depth and color (also upping the resolution to 640x480)

Getting Data from the Kinect Although the ChaLearn database gives out some examples, they give an incorrect way to sync the depth and color image. Also they take images 320x240 when the Kinect can supply a higher resolution. An alternative is to test on both the ChaLearn database and real samples

Constructing Point Clouds Reconstructing Point clouds is just a matter of understanding multiple view geometry.

HMM Advantages A Hidden Markov Model is a statistical system model that is a Markov process with hidden states. HMM are often used in temporal pattern recognition and are useful for one-shot learning applications The highest ranking winners of the first ChaLearn competition used HMM or CRF

Approach Based off two papers: – A Hidden Markov Model-Based Continuous Gesture Recognition System for Hand Motion Trajectory – A Hidden Markov Model-Based Isolated and Meaningful Hand Gesture Recognition To make the approach more novel, add and use identification of the parts of the body referencing the paper: – Real-time Identification and Localization of Body Parts from Depth Images

Person Detection by Parts Based off the paper: – P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan. Object Detection with Discriminatively Trained Part Based Models. TPAMI, Vol. 32, No. 9, September – This paper performed very well and could be drastically improved by specifying it to work with only people

Method Features are chosen with HOG using an enriched Dalal-TriggsDa detector based on star roots. Training is done with a method called LSVM.

Databases to Test The Pascal competition has test sets we can use Improve the algorithm by training with only people and changing the root points to be specific to humans, giving higher weight to the head since it is rarely occluded.

Novel Improvements Implement a new algorithm based off Felzenszwalb’s using time (video) or depth to improve the results Add rotation as a parameter of the model Model human activities based off the rotation and translation of the parts (this would only work on largely differing activities) Change the detector to improve it. Options are including a weight for human skin color which is in a small region on the chromatic scale.