M4 22-23.01.2004 Institute for Human-Machine Communication Munich University of Technology Sascha Schreiber Face Tracking and Person Action.

Slides:



Advertisements
Similar presentations
Université du Québec École de technologie supérieure Face Recognition in Video Using What- and-Where Fusion Neural Network Mamoudou Barry and Eric Granger.
Advertisements

Angelo Dalli Department of Intelligent Computing Systems
M. Emre Sargın, Ferda Ofli, Yelena Yasinnik, Oya Aran, Alexey Karpov, Stephen Wilson,Engin Erzin, Yücel Yemez, A. Murat Tekalp Combined Gesture- Speech.
Markov Localization & Bayes Filtering 1 with Kalman Filters Discrete Filters Particle Filters Slides adapted from Thrun et al., Probabilistic Robotics.
Chapter 15 Probabilistic Reasoning over Time. Chapter 15, Sections 1-5 Outline Time and uncertainty Inference: ltering, prediction, smoothing Hidden Markov.
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
SOMM: Self Organizing Markov Map for Gesture Recognition Pattern Recognition 2010 Spring Seung-Hyun Lee G. Caridakis et al., Pattern Recognition, Vol.
Potential Projects RGBD gesture recognition with the Microsoft Kinect Person recognition by parts.
TOWARD DYNAMIC GRASP ACQUISITION: THE G-SLAM PROBLEM Li (Emma) Zhang and Jeff Trinkle Department of Computer Science, Rensselaer Polytechnic Institute.
Tracking Objects with Dynamics Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/21/15 some slides from Amin Sadeghi, Lana Lazebnik,
1 Robust Video Stabilization Based on Particle Filter Tracking of Projected Camera Motion (IEEE 2009) Junlan Yang University of Illinois,Chicago.
Stanford CS223B Computer Vision, Winter 2007 Lecture 12 Tracking Motion Professors Sebastian Thrun and Jana Košecká CAs: Vaibhav Vaish and David Stavens.
Recent Developments in Human Motion Analysis
Target Tracking a Non-Linear Target Path Using Kalman Predictive Algorithm and Maximum Likelihood Estimation by James Dennis Musick.
Stanford CS223B Computer Vision, Winter 2007 Lecture 12 Tracking Motion Professors Sebastian Thrun and Jana Košecká CAs: Vaibhav Vaish and David Stavens.
MVL (Machine Vision Lab) UIC HUMAN MOTION VIDEO DATABASE Jezekiel Ben-Arie ECE Department University Of Illinois at Chicago Scripts, Queries, Recognition.
Multi-camera Video Surveillance: Detection, Occlusion Handling, Tracking and Event Recognition Oytun Akman.
Particle Filtering for Non- Linear/Non-Gaussian System Bohyung Han
© 2003 by Davi GeigerComputer Vision November 2003 L1.1 Tracking We are given a contour   with coordinates   ={x 1, x 2, …, x N } at the initial frame.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Real-Time Decentralized Articulated Motion Analysis and Object Tracking From Videos Wei Qu, Member, IEEE, and Dan Schonfeld, Senior Member, IEEE.
A Vision-Based System that Detects the Act of Smoking a Cigarette Xiaoran Zheng, University of Nevada-Reno, Dept. of Computer Science Dr. Mubarak Shah,
Learning and Recognizing Activities in Streams of Video Dinesh Govindaraju.
HAND GESTURE BASED HUMAN COMPUTER INTERACTION. Hand Gesture Based Applications –Computer Interface A 2D/3D input device (Hand Tracking) Translation of.
Isolated-Word Speech Recognition Using Hidden Markov Models
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
Driver’s View and Vehicle Surround Estimation using Omnidirectional Video Stream Abstract Our research is focused on the development of novel machine vision.
Markov Localization & Bayes Filtering
Exploiting video information for Meeting Structuring ….
The Language Archive – Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands Application of Audio and Video Processing Methods for Language.
Brian Renzenbrink Jeff Robble Object Tracking Using the Extended Kalman Particle Filter.
Computer vision: models, learning and inference Chapter 19 Temporal models.
Hand Gesture Recognition System for HCI and Sign Language Interfaces Cem Keskin Ayşe Naz Erkan Furkan Kıraç Özge Güler Lale Akarun.
Abstract Developing sign language applications for deaf people is extremely important, since it is difficult to communicate with people that are unfamiliar.
Learning and Recognizing Human Dynamics in Video Sequences Christoph Bregler Alvina Goh Reading group: 07/06/06.
A General Framework for Tracking Multiple People from a Moving Camera
Multimodal Integration for Meeting Group Action Segmentation and Recognition M. Al-Hames, A. Dielmann, D. Gatica-Perez, S. Reiter, S. Renals, G. Rigoll,
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Jamal Saboune - CRV10 Tutorial Day 1 Bayesian state estimation and application to tracking Jamal Saboune VIVA Lab - SITE - University.
Cognitive Computer Vision Kingsley Sage and Hilary Buxton Prepared under ECVision Specific Action 8-3
Recognition, Analysis and Synthesis of Gesture Expressivity George Caridakis IVML-ICCS.
Head Tracking in Meeting Scenarios Sascha Schreiber.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.
Chapter 5 Multi-Cue 3D Model- Based Object Tracking Geoffrey Taylor Lindsay Kleeman Intelligent Robotics Research Centre (IRRC) Department of Electrical.
CS Statistical Machine learning Lecture 24
1/20 System Overview Cyclic mo-cap data (walking, running..) Cyclic mo-cap data (walking, running..) Music / Sound (audio) Music / Sound (audio) Resulting.
Ch 5b: Discriminative Training (temporal model) Ilkka Aho.
Experimental Results Abstract Fingerspelling is widely used for education and communication among signers. We propose a new static fingerspelling recognition.
Context-based vision system for place and object recognition Antonio Torralba Kevin Murphy Bill Freeman Mark Rubin Presented by David Lee Some slides borrowed.
1 Chapter 15 Probabilistic Reasoning over Time. 2 Outline Time and UncertaintyTime and Uncertainty Inference: Filtering, Prediction, SmoothingInference:
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Lane Detection in OpenCL
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Elements of a Discrete Model Evaluation.
Chapter 8. Learning of Gestures by Imitation in a Humanoid Robot in Imitation and Social Learning in Robots, Calinon and Billard. Course: Robots Learning.
Week 10 Emily Hand UNR.
OBJECT TRACKING USING PARTICLE FILTERS. Table of Contents Tracking Tracking Tracking as a probabilistic inference problem Tracking as a probabilistic.
Belief Networks in Computer Vision Applications Alex Yakushev CMPS 290C final project Winter 2006.
CS Statistical Machine learning Lecture 25 Yuan (Alan) Qi Purdue CS Nov
CPSC 7373: Artificial Intelligence Lecture 12: Hidden Markov Models and Filters Jiang Bian, Fall 2012 University of Arkansas at Little Rock.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
Segmentation and Recognition of Meeting Events M4 – Meeting Munich 23. September 2004 Stephan Reiter.
Fast Human Detection in Crowded Scenes by Contour Integration and Local Shape Estimation Csaba Beleznai, Horst Bischof Computer Vision and Pattern Recognition,
Tracking We are given a contour G1 with coordinates G1={x1 , x2 , … , xN} at the initial frame t=1, were the image is It=1 . We are interested in tracking.
Tracking Objects with Dynamics
Particle Filtering for Geometric Active Contours
Probabilistic Reasoning Over Time
The Open World of Micro-Videos
Chapter14-cont..
Presentation transcript:

M Institute for Human-Machine Communication Munich University of Technology Sascha Schreiber Face Tracking and Person Action Recognition - Update

Sascha Schreiber 2/14 Institute for Human-Machine Communication Munich University of Technology M Recapitulation of methodology for action recognition Face tracking with I-Condensation Recognition performance comparison on actions from the m4 dataset Kalman filtering of occluded gestures Outlook Overview

Sascha Schreiber 3/14 Institute for Human-Machine Communication Munich University of Technology M Person Action Recognition Extraction of person locations Temporal segmentation Feature calculation Classification of segments Face detection/Blob tracking Global Motion Features Bayesian Information Criterion Hidden Markov Models Actions, timestamps

Sascha Schreiber 4/14 Institute for Human-Machine Communication Munich University of Technology M Person Action Recognition Extraction of person locations Temporal segmentation Feature calculation Classification of segments Face detection/Blob tracking Global Motion Features Bayesian Information Criterion Hidden Markov Models Actions, timestamps

Sascha Schreiber 5/14 Institute for Human-Machine Communication Munich University of Technology M Face Tracking Particle filtering with ICondensation N weighted particles Updating using their likelihood Sampling from prediction density - Standard Condensation sampling - Sampling from importance function for reinitialisation - Importance sampling with weighting correction factor  Introduction of importance function: skin color distribution Automatic initialization by pyramid sampling and MLP classification

Sascha Schreiber 6/14 Institute for Human-Machine Communication Munich University of Technology M Performance of Face Tracking Standard CondensationICondensation Demonstration of difference between:

Sascha Schreiber 7/14 Institute for Human-Machine Communication Munich University of Technology M Person Action Recognition Extraction of person locations Temporal segmentation Feature calculation Classification of segments Face detection/Blob tracking Global Motion Features Bayesian Information Criterion Hidden Markov Models Actions, timestamps

Sascha Schreiber 8/14 Institute for Human-Machine Communication Munich University of Technology M IDIAP training data (TRN 01-30), IDIAP test data (TST 01-30) Continuous HMMs (6 states, 3 mixtures) Sit downStand upNodding Shaking head WritingPointing Score Sit down % Stand up % Nodding % Shaking head % Writing % Pointing % Overall80% Recognition Performance m4

Sascha Schreiber 9/14 Institute for Human-Machine Communication Munich University of Technology M IDIAP training data (TRN 01-30), IDIAP test data (TST 01-30) Discrete HMMs (6 states, codebook 1500) Sit downStand upNodding Shaking head WritingPointing Score Sit down % Stand up % Nodding % Shaking head % Writing % Pointing % Overall86% Recognition Performance m4

Sascha Schreiber 10/14 Institute for Human-Machine Communication Munich University of Technology M Occluded Gestures Classification result Classification Compensation of occlusion Stream- segmentation Feature- extraction Smoothed featurestream Featurestream Segmented Featurestream Video- stream Occlusion Scenario: Person walking on front of a tracked object

Sascha Schreiber 11/14 Institute for Human-Machine Communication Munich University of Technology M Occluded Gestures Application for Kalman filtering: Calculation of an estimate Time update equationMeasurement update equation Discrete-time process:

Sascha Schreiber 12/14 Institute for Human-Machine Communication Munich University of Technology M Occluded Gestures Kalman- filter Kalman- filter Kalman- filter N action-specialized Kalman-Filters, each trained for a special gesture to be recognized by the HMM Improving featurestream by smoothing with : Kalman- filter One general Kalman-Filter for the disturbed featurestream

Sascha Schreiber 13/14 Institute for Human-Machine Communication Munich University of Technology M Performance of Kalman filtering Score Featurestream unoccluded & unfiltered79,86% Featurestream occluded & unfiltered56,75% Featurestream occluded & filtered (general)57,98% Featurestream occluded & filtered (specialized)60,12% IDIAP training data (TRN 01-30), IDAP test data (TST 01-30) Continuous HMMs (6 states, 3 mixtures)

Sascha Schreiber 14/14 Institute for Human-Machine Communication Munich University of Technology M Implementation of extended Kalman filter Head orientation tracking Integration of face recognition into particle filter Further improvement of action detection on m4 data Connection to Meeting Segmentation / Multimodal Recognizer Outlook

M Institute for Human-Machine Communication Munich University of Technology Face Tracking and Person Action Recognition - Update Sascha Schreiber