Person detection, tracking and human body analysis in multi-camera scenarios Montse Pardàs (UPC) ACV, Bilkent University, MTA-SZTAKI, Technion-ML, University.

Slides:



Advertisements
Similar presentations
ARTIFICIAL PASSENGER.
Advertisements

Applications of one-class classification
IEEE CDC Nassau, Bahamas, December Integration of shape constraints in data association filters Integration of shape constraints in data.
Robust spectral 3D-bodypart segmentation along time Fabio Cuzzolin, Diana Mateus, Edmond Boyer, Radu Horaud Perception project meeting 24/4/2007 Submitted.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
CSCE643: Computer Vision Bayesian Tracking & Particle Filtering Jinxiang Chai Some slides from Stephen Roth.
Change Detection C. Stauffer and W.E.L. Grimson, “Learning patterns of activity using real time tracking,” IEEE Trans. On PAMI, 22(8): , Aug 2000.
Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.
Face Alignment with Part-Based Modeling
Learning to estimate human pose with data driven belief propagation Gang Hua, Ming-Hsuan Yang, Ying Wu CVPR 05.
Vision Based Control Motion Matt Baker Kevin VanDyke.
Kiyoshi Irie, Tomoaki Yoshida, and Masahiro Tomono 2011 IEEE International Conference on Robotics and Automation Shanghai International Conference Center.
Real-Time Human Pose Recognition in Parts from Single Depth Images Presented by: Mohammad A. Gowayyed.
Foreground Background detection from video Foreground Background detection from video מאת : אבישג אנגרמן.
Tracking Multiple Occluding People by Localizing on Multiple Scene Planes Professor :王聖智 教授 Student :周節.
Robust Object Tracking via Sparsity-based Collaborative Model
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Motion Detection And Analysis Michael Knowles Tuesday 13 th January 2004.
Segmentation and Tracking of Multiple Humans in Crowded Environments Tao Zhao, Ram Nevatia, Bo Wu IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,
Adaptive Rao-Blackwellized Particle Filter and It’s Evaluation for Tracking in Surveillance Xinyu Xu and Baoxin Li, Senior Member, IEEE.
Ensemble Tracking Shai Avidan IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE February 2007.
Probabilistic Robotics
Object Detection and Tracking Mike Knowles 11 th January 2005
1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.
Recognizing and Tracking Human Action Josephine Sullivan and Stefan Carlsson.
Multi-camera Video Surveillance: Detection, Occlusion Handling, Tracking and Event Recognition Oytun Akman.
California Car License Plate Recognition System ZhengHui Hu Advisor: Dr. Kang.
A REAL-TIME VIDEO OBJECT SEGMENTATION ALGORITHM BASED ON CHANGE DETECTION AND BACKGROUND UPDATING 楊靜杰 95/5/18.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Real-Time Decentralized Articulated Motion Analysis and Object Tracking From Videos Wei Qu, Member, IEEE, and Dan Schonfeld, Senior Member, IEEE.
Image Subtraction for Real Time Moving Object Extraction Shahbe Mat Desa, Qussay A. Salih, CGIV’04.
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
BraMBLe: The Bayesian Multiple-BLob Tracker By Michael Isard and John MacCormick Presented by Kristin Branson CSE 252C, Fall 2003.
1. Introduction Motion Segmentation The Affine Motion Model Contour Extraction & Shape Estimation Recursive Shape Estimation & Motion Estimation Occlusion.
A General Framework for Tracking Multiple People from a Moving Camera
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Kaihua Zhang Lei Zhang (PolyU, Hong Kong) Ming-Hsuan Yang (UC Merced, California, U.S.A. ) Real-Time Compressive Tracking.
Scientific Writing Abstract Writing. Why ? Most important part of the paper Number of Readers ! Make people read your work. Sell your work. Make your.
Forward-Scan Sonar Tomographic Reconstruction PHD Filter Multiple Target Tracking Bayesian Multiple Target Tracking in Forward Scan Sonar.
MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.
Computer Vision Michael Isard and Dimitris Metaxas.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
Learning the Appearance and Motion of People in Video Hedvig Sidenbladh, KTH Michael Black, Brown University.
ISOMAP TRACKING WITH PARTICLE FILTER Presented by Nikhil Rane.
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
Expectation-Maximization (EM) Case Studies
Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.
Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team
Sequential Monte-Carlo Method -Introduction, implementation and application Fan, Xin
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
Visual Tracking by Cluster Analysis Arthur Pece Department of Computer Science University of Copenhagen
Face Detection and Head Tracking Ying Wu Electrical Engineering & Computer Science Northwestern University, Evanston, IL
REAL-TIME DETECTOR FOR UNUSUAL BEHAVIOR
Traffic Sign Recognition Using Discriminative Local Features Andrzej Ruta, Yongmin Li, Xiaohui Liu School of Information Systems, Computing and Mathematics.
C. Canton1, J.R. Casas1, A.M.Tekalp2, M.Pardàs1
A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology
Tracking Objects with Dynamics
Motion Detection And Analysis
Probabilistic Robotics
Introduction to particle filter
Eric Grimson, Chris Stauffer,
David Harwin Adviser: Petros Faloutsos
Combining Geometric- and View-Based Approaches for Articulated Pose Estimation David Demirdjian MIT Computer Science and Artificial Intelligence Laboratory.
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Introduction to particle filter
Online Graph-Based Tracking
Presentation transcript:

Person detection, tracking and human body analysis in multi-camera scenarios Montse Pardàs (UPC) ACV, Bilkent University, MTA-SZTAKI, Technion-ML, University of Surrey, University of Amsterdam, UPC

Summary Multi-camara scenarios definition and applications. Multi-level foreground segmentation and tracking 3D person tracking with particle filters Human motion capture Marker-based Hierarchical annealing Latent-space based Examples of other works in the e-team: Moving object detection and classification Eye tracking

Input Data Generation Original images F/B Segmentation 3D (SfS) Real World 3D coloring

Applications 3D (SfS) 3D coloring Multi-person tracking Head orientation Human motion capture Activity recognition Focus of attention Gesture/Gaitrecognition High semantinc level Raw features

Foreground detection: multi-level fg segmentation Based on a statistical modeling of the pixels’ value Xt in the (i,j) coordinates,using Gaussian models. A pixel will be classified as belonging to a Gaussian model if Background model Moving foreground model: A pixel not classified as bg or static fg is classified as moving fg. Static foreground model: When a moving foreground model has been observed during a certaint time T, the model is transferred to the static foreground. An incoming pixel can: Match the current foreground model: it is updated and its counter is increased Match the background or static foreground models: it is updated and the counter of the foreground model is decreased Not match any existing model: The foreground model and its counter are re-initialized.

Multi-level foreground segmentation

Mean-shift tracking with multi-level foreground Double register, for static and moving objects: centroid position, size, color histogram and counter of appearance Foreground objects detection Filtering of the multi-level foreground detection to generate Connected Components if minsize. Temporal association of static objects Check if there is a static CC in the position of a static object. Update the corresponding static object register Otherwise: The object has re-started its motion if in the area of the static object there is at least 70% background. The static object is transferred to the moving objects register. The static object has been occluded: a moving CC is detected in the area of the static object. No action is taken.

Mean-shift tracking with multi-level foreground Mean-shift tracking of moving foreground objects The estimation of the position of the objects in the moving object register is performed using mean-shift. The foreground mask is applied to the original image before performing the mean shift All the CC in an area of the size of the object around the mean shift estimation are assigned to the object. The object features are updated. If two or more moving objects share the same CC, we enter an occlusion situation and the update is done only on the centroid value using the mean shift estimation. Detection of new Static and Moving objects The CC which have not been associated to any object are introduced as new objects.

Mean shift tracking with multi level foreground

Person tracking in multi camera scenarios  Perform multi-person tracking in an indoor scenario employing 3D information (SfS+color)  Particle Filters based solution  Evaluate the performance of this method by using a standardized database (CLEAR 2007)  Particle filters: Key idea: represent the posterior p(xk|z1:k) by a set of random samples. Four steps are followed: resampling, propagation, evaluation and estimation When implementing a PF, two issues are to be taken into account: Likelihood evaluation: how to assign weights to our particles Propagation model: how to “move” our particles to efficiently sample the state space

Particle Filter implementation  Likelihood evaluation:  Every particle defines a possible location of the person represented by an ellipsoid  For every person a reference color histogram is built up  Weight assigned to a particle is a function of its overlap with the binary 3D reconstruction and its color similarity with the reference histogram (Battacharyia distance)

 Multiple targets may be tracked using a PF with a state containing the 3D position of each of them. However, complexity makes it unfeasible  A PF is assigned to every target and an interaction model is defined Multi-Person Issue

Results

Results Accuracy Precision Best performing tracking algorithm at CLEAR 2007 Evaluation Campaing

Multimodal integration  Audio information was added to produce a more robust tracker  However, only information fusion at feature level has been conducted so far, giving slightly better results (5%)  Information fusion at data level is under study

Human Motion Capture  Two approaches are widely employed: marker and makerless  It provides information related with the pose of the person. A human body model is assumed  It poses a technology challenge due to the high dimensionality of the state space (~22 DOF). Moreover, this space is highly non- convex

Marker based capture  Intrusive but provides very precise results  Widely used by the cinema industry. Quite expensive equipment (>10K$) and requires dedicated hardware  An annealing PF together with off-the-shelf hardware solution has been developed

 Non intrusive but more challenge since there are no artificial aid (markers)  Still an open problem for unconstrained motion  PF definition: every particle represents a possible pose of the person  Likelihood evaluation: defining a fitness function matching the input data against the particle pose taking into account 3D information and color Markerless based capture (I)

Markerless based capture (II) Original images Voxel reconstruction Particle x t m – β(x t m ) Particle x t n – β(x t n ) β(x t m ) U z t = 0.84 β(x t n ) U z t = 0.62

Markerless based capture (III)  Structural annealing PF technique that performs a layered fitting of a hierarchical model to the input data Input data is very corrupted: a simple body model is used Data is more complete: the full articulated model is used Legs are not visible, hence they are removed from the analysis model

Markerless based capture (IV)

Technion-Israel Institute of Technology 22 Learning The drawback in the annealing particle filter tracker is that a high dimensionality of the state space causes an exponential increase in the number of particles that is needed to be generated in order to preserve the same density of particles. We use a set of poses in order to create a latent space with a low dimensionality. The poses are taken from different sequences: Walking Running Punching Kicking We perform non linear dimensionality reduction using Gaussian Process Dynamic Model (GPDM) and construct a latent space.

Technion-Israel Institute of Technology 23 The Tracking Scheme Model to Camera Projection Latent to Model Mapping Feedback

Technion-Israel Institute of Technology 24 Multi-Action Learning Multi-Action Learning 2D latent space from 3 different motions: lifting an object (red), kicking with the left (green) and the right (magenta) legs.

Results

Other works: Moving object detection and classification

Other works: Eye tracking