3d Pose Detection Used by Kinect

Slides:



Advertisements
Similar presentations
Lindsey Bleimes Charlie Garrod Adam Meyerson
Advertisements

The fundamental matrix F
For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
KinectFusion: Real-Time Dense Surface Mapping and Tracking
Hilal Tayara ADVANCED INTELLIGENT ROBOTICS 1 Depth Camera Based Indoor Mobile Robot Localization and Navigation.
1 Video Processing Lecture on the image part (8+9) Automatic Perception Volker Krüger Aalborg Media Lab Aalborg University Copenhagen
F ACE TRACKING EE 7700 Name: Jing Chen Shaoming Chen.
Human Pose detection Abhinav Golas S. Arun Nair. Overview Problem Previous solutions Solution, details.
Announcements Final Exam May 13th, 8 am (not my idea).
Parallel Tracking and Mapping for Small AR Workspaces Vision Seminar
Computer Vision Detecting the existence, pose and position of known objects within an image Michael Horne, Philip Sterne (Supervisor)
Real-Time Human Pose Recognition in Parts from Single Depth Images Presented by: Mohammad A. Gowayyed.
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
GrabCut Interactive Image (and Stereo) Segmentation Joon Jae Lee Keimyung University Welcome. I will present Grabcut – an Interactive tool for foreground.
Segmentation-Free, Area-Based Articulated Object Tracking Daniel Mohr, Gabriel Zachmann Clausthal University, Germany ISVC.
Formation et Analyse d’Images Session 8
Shape from Contours and Multiple Stereo A Hierarchical, Mesh-Based Approach Hendrik Kück, Wolfgang Heidrich, Christian Vogelgsang.
Rasterization and Ray Tracing in Real-Time Applications (Games) Andrew Graff.
Group S3. Lab Session 5 Following on from our previous lab session decided to find the relationship between Disparity vs Camera Separation. Measured Disparity.
Tracking Migratory Birds Around Large Structures Presented by: Arik Brooks and Nicholas Patrick Advisors: Dr. Huggins, Dr. Schertz, and Dr. Stewart Senior.
A Closed Form Solution to Natural Image Matting
Real-time Hand Pose Recognition Using Low- Resolution Depth Images
Synchronization and Calibration of Camera Networks from Silhouettes Sudipta N. Sinha Marc Pollefeys University of North Carolina at Chapel Hill, USA.
Fitting a Model to Data Reading: 15.1,
C&A 10April06 1 Point Source Detection and Localization Using the UW HealPixel database Toby Burnett University of Washington.
Cliff Rhyne and Jerry Fu June 5, 2007 Parallel Image Segmenter CSE 262 Spring 2007 Project Final Presentation.
Hand Movement Recognition By: Tokman Niv Levenbroun Guy Instructor: Todtfeld Ari.
David Luebke Modeling and Rendering Architecture from Photographs A hybrid geometry- and image-based approach Debevec, Taylor, and Malik SIGGRAPH.
ICBV Course Final Project Arik Krol Aviad Pinkovezky.
כמה מהתעשייה? מבנה הקורס השתנה Computer vision.
Fractal Image Compression By Cabel Sholdt and Paul Zeman.
Path-Based Constraints for Accurate Scene Reconstruction from Aerial Video Mauricio Hess-Flores 1, Mark A. Duchaineau 2, Kenneth I. Joy 3 Abstract - This.
BraMBLe: The Bayesian Multiple-BLob Tracker By Michael Isard and John MacCormick Presented by Kristin Branson CSE 252C, Fall 2003.
Shape-Based Human Detection and Segmentation via Hierarchical Part- Template Matching Zhe Lin, Member, IEEE Larry S. Davis, Fellow, IEEE IEEE TRANSACTIONS.
Optical Tracking for VR Bertus Labuschagne Christopher Parker Russell Joffe.
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
3D SLAM for Omni-directional Camera
EE369C Final Project: Accelerated Flip Angle Sequences Jan 9, 2012 Jason Su.
#MOTION ESTIMATION AND OCCLUSION DETECTION #BLURRED VIDEO WITH LAYERS
Video Segmentation Prepared By M. Alburbar Supervised By: Mr. Nael Abu Ras University of Palestine Interactive Multimedia Application Development.
ESR 2 / ER 2 Testing Campaign Review A. CrivellaroY. Verdie.
Kevin Cherry Robert Firth Manohar Karki. Accurate detection of moving objects within scenes with dynamic background, in scenarios where the camera is.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
Robot Vision: Multi-sensor Reconnaissance. Overview An individual robot can develop an interpretation about its environment. Groups of robots can combine.
Human pose recognition from depth image MS Research Cambridge.
Computer Vision: Eye Tracking By: Geraud Campion Michael O’Connor.
Expectation-Maximization (EM) Case Studies
Figure ground segregation in video via averaging and color distribution Introduction to Computational and Biological Vision 2013 Dror Zenati.
High Quality Silhouette Illustration for Texture Based Volume Rendering, Nagy and Klein.
Sean M. Ficht.  Problem Definition  Previous Work  Methods & Theory  Results.
Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
Molecular Classification of Cancer Class Discovery and Class Prediction by Gene Expression Monitoring.
By: David Gelbendorf, Hila Ben-Moshe Supervisor : Alon Zvirin
Fast Semi-Direct Monocular Visual Odometry
3D head pose estimation from multiple distant views X. Zabulis, T. Sarmis, A. A. Argyros Institute of Computer Science, Foundation for Research and Technology.
EE368: Digital Image Processing Bernd Girod Leahy, p.1/15 Face Detection on Similar Color Images Scott Leahy EE368, Stanford University May 30, 2003.
G2 Presentation Week 12 Rehearsal. Outline Demo User Testing Internal External Improvements Project Management.
Presenter: Jae Sung Park
Zhaoxia Fu, Yan Han Measurement Volume 45, Issue 4, May 2012, Pages 650–655 Reporter: Jing-Siang, Chen.
Science, Measurement, Uncertainty and Error1 Science, Measurements, Uncertainty and Error.
Over the recent years, computer vision has started to play a significant role in the Human Computer Interaction (HCI). With efficient object tracking.
University of Zagreb, Faculty of Electrical Engineering and Computing
3D Puppetry: A Kinect-based Interface for 3D Animation
Approximate Models for Fast and Accurate Epipolar Geometry Estimation
CDA 6938 Final Project Triangulation from Point Cloud
Mauricio Hess-Flores1, Mark A. Duchaineau2, Kenneth I. Joy3
A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.
Identifying Human-Object Interaction in Range and Video Data
Fusion, Face, HD Face Matthew Simari | Program Manager, Kinect Team
Presentation transcript:

3d Pose Detection Used by Kinect Accurate when the pose closely matches a stored pose Inaccurate when novel poses are made Can often produce shaky movement due to pose snapping

3d Pose Tracking Calculate poses based on previous poses and current data No datasets required Has issues with local minima

Hybrid Store a collection of preprocessed poses Start session with a detected pose Track until a failure is hit Detect that frame, and continue tracking from there

Pose Tracking in Detail Given the current set of pixels, and the calculated poses of the past few frames, calculate the most probable current pose Repeat the following until error is reduced Calculate a hypothesized 3d representation (rendered image) using the 3d camera image (observed data) Calculate quite a lot about the rendered image: Edepth, Eextra, Esilhouette, Eprior Adjust the hypothesized pose accordingly If that fails significantly, “detect” the current pose and try again

Optimizing the Position Solve energy minimization problems: Edepth: Depth data, excluding any rendered foreground pixels that match background observed pixels Eextra: The above depth term, but with added calculations where rendered pixels don’t match with observed pixels Esilhouette: The difference in silhouettes (pixel is in foreground of one image but the background of the other) Eprior: Evaluation of the rendered pose based on the previous rendered poses

More Pose Detecting Grab a random subset of pixels and match them to known poses Throw out uncertain data 20k poses total

Setup User must stand in a specific pose for calibration Select random subset of pixels to determine widths of limbs

Results

Results Roughly 1 in 200 frames reinitialized by the detector Very fast movement often causes local minima On average, 1 failure every 33 seconds High performance running in parallel on the GPU

http://www.youtube.com/watch?v=ICFKEOk3SyA&feature=youtu.be

Further Work Use skinned mesh models Increase pose set Use color data in the algorithm as well