Segmentation and Tracking of Multiple Humans in Crowded Environments Tao Zhao, Ram Nevatia, Bo Wu IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,

Presentation transcript:

Segmentation and Tracking of Multiple Humans in Crowded Environments Tao Zhao, Ram Nevatia, Bo Wu IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 30, NO. 7, JULY 2008

Outline
– Introduction
– Overview
– Probabilistic modeling
– Computing MAP by efficient MCMC
– Experimental results
– Conclusion

Introduction Segmentation and tracking of multiple humans in crowded situations is made difficult by interobject occlusion.

Introduction
The method is designed for crowded scenes:
– Occlusion may be persistent and temporarily heavy.
– Humans need not be isolated when they first enter the scene.
– More complex shape models are needed.
– Joint reasoning about the collection of objects is needed.

Introduction
Main features of this work:
– A three-dimensional part-based human body model, which enables the segmentation and tracking of humans in 3D and makes the inference of interobject occlusion natural.
– A Bayesian framework that integrates segmentation and tracking based on a joint likelihood for the appearance of multiple objects.

Introduction
– The design of efficient Markov chain dynamics, directed by proposal probabilities based on image cues.
– The incorporation of a color-based background model in a mean-shift tracking step.

Overview
The prior models:
– Background model: based on a background model, the foreground blobs are extracted as the basic observation.
– 3D human shape model: since the hypotheses are in 3D, occlusion reasoning is straightforward.
– Camera model and ground plane: multiple 3D human hypotheses are projected onto the image plane and matched with the foreground blobs.

Overview
Segmentation and tracking are integrated in a unified framework and interoperate over time:
– Segment the foreground blobs into multiple humans and associate the segmented humans with the existing trajectories.
– The tracks are used to propose human hypotheses in the next frame.

Overview
We formulate the problem as one of Bayesian inference: find the best interpretation given the image observations, the prior model, and the estimates from the analysis of the previous frame. That is, we compute the maximum a posteriori (MAP) estimate.

Overview
The state to be estimated at each frame:
– The number of objects.
– Their correspondences to the objects in the previous frame (if any).
– Their parameters (for example, position).
– The uncertainty of the parameters.
– …

Probabilistic modeling
Our goal is to estimate the state at time t, $\theta^{(t)}$, given the image observations $I^{(1)}, \ldots, I^{(t)}$.
– $\theta$: the state of the objects.
– $\Theta$: the solution space.

Probabilistic modeling
A state containing n objects can be written as $\theta = \{(k_1, m_1), \ldots, (k_n, m_n)\} \in \Theta_n$, where $k_i$ is the unique identity of the ith object, whose parameters are $m_i$, and $\Theta_n$ is the solution space of exactly n objects. The entire solution space is $\Theta = \bigcup_n \Theta_n$.

3D human shape model
The parameters of an individual human, m, are defined based on a 3D human shape model.
– The model does not attempt to capture the detailed shape and articulation parameters of the human body.
– The body is modeled as head, torso, and legs with a fixed spatial relationship.

3D human shape model
The parameters ($m_i$) that describe a 3D human hypothesis:
– size ($h_i$): the 3D height of the model, which also controls the overall scaling of the object in the three directions.
– thickness ($f_i$): captures extra scaling in the horizontal directions.
– position ($u_i$ or $(x_i, y_i)$): the image position of the head.

3D human shape model
– orientation ($o_i$): the 3D orientation of the body. Orientations of the models are quantized into a few levels for computational efficiency.
– inclination ($i_i$): the 2D inclination of the body. The body may be inclined slightly.
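As a rough illustration of the state described above, the following Python sketch stores one human hypothesis as a record of the five parameters and a state as a list of (identity, parameters) pairs. The class name, field names, and the choice of radians for the inclination are assumptions made for this sketch; the slides do not specify a data layout.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class HumanParams:
    """Parameters m of one 3D human hypothesis (field names are hypothetical)."""
    height: float                  # h: 3D height, also the overall scale
    thickness: float               # f: extra scaling in the horizontal directions
    head_xy: Tuple[float, float]   # u = (x, y): image position of the head
    orientation: str               # o: quantized level, e.g. "frontal" or "profile"
    inclination: float             # i: 2D inclination (radians assumed here)


# A state theta is a list of (identity k, parameters m) pairs.
State = List[Tuple[int, HumanParams]]


def add_hypothesis(state: State, identity: int, params: HumanParams) -> State:
    """Return a new state with one more object; identities must stay unique."""
    if any(k == identity for k, _ in state):
        raise ValueError(f"identity {identity} already present")
    return state + [(identity, params)]
```

Returning a new list instead of mutating the old one keeps the previous state available, which is convenient when a candidate state may later be rejected.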

Object appearance model
– We use a color histogram of the object, defined within the object shape.
– It helps establish correspondence in tracking because it is insensitive to the nonrigidity of human motion.
– Efficient algorithms exist, for example, the mean-shift technique, to optimize a histogram-based objective function.

Background appearance model The probability of pixel j being from the background is

The prior distribution
The first term:
– is independent of time and is defined by
– $S_i$ is the projected image of the ith object and $|S_i|$ is its area.
–

The prior distribution
– $P(o_{frontal}) = P(o_{profile}) = 1/2$.
– $P(x_i, y_i)$ is a uniform distribution over the region where a human head is plausible.
– $P(h_i)$ is a Gaussian distribution $N(\mu_h, \sigma_h^2)$ truncated to the range $[h_{min}, h_{max}]$.
– $P(f_i)$ is a Gaussian distribution $N(\mu_f, \sigma_f^2)$ truncated to the range $[f_{min}, f_{max}]$.
– $P(i_i)$ is a Gaussian distribution $N(\mu_i, \sigma_i^2)$.
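The truncated Gaussian priors on height and thickness can be evaluated with the standard truncated-normal density. The sketch below is one minimal way to do so using only the Python standard library; the function names and the numeric values in the usage note are hypothetical and are not taken from the slides.

```python
import math


def _std_normal_pdf(z: float) -> float:
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def _std_normal_cdf(z: float) -> float:
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def truncated_gaussian_pdf(x: float, mu: float, sigma: float,
                           lo: float, hi: float) -> float:
    """Density of N(mu, sigma^2) restricted to [lo, hi] and renormalized."""
    if x < lo or x > hi:
        return 0.0
    # Probability mass of the untruncated Gaussian that falls inside [lo, hi].
    mass = _std_normal_cdf((hi - mu) / sigma) - _std_normal_cdf((lo - mu) / sigma)
    return _std_normal_pdf((x - mu) / sigma) / (sigma * mass)
```

For example, `truncated_gaussian_pdf(1.75, mu=1.7, sigma=0.1, lo=1.5, hi=1.9)` would give the prior density of a 1.75 m hypothesis under these illustrative settings, while any height outside the range gets density zero.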

The prior distribution
The second term:
– We approximate it by
– We rearrange $\theta^{(t)}$ and $\theta^{(t-1)}$ as such that one of is true.

The prior distribution
–
– $P_{assoc}$: we assume that the position and the inclination of an object follow constant velocity models with Gaussian noise.

The prior distribution
The height and thickness follow a Gaussian distribution. We use Kalman filters for temporal estimation.
– $P_{new}$ and $P_{dead}$: the likelihood of the initialization of a new track and the likelihood of the termination of an existing track, respectively. They are set empirically according to the distance of the object to the entrances/exits.
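A constant velocity model with Gaussian noise, estimated by a Kalman filter, can be sketched for a single coordinate as follows. This is a textbook one-dimensional filter written with plain tuples, not the authors' implementation; the noise levels `q` and `r`, the diagonal process noise, and the function names are assumptions made for illustration.

```python
def predict(x, P, dt=1.0, q=1e-2):
    """Constant-velocity prediction. x = (position, velocity), P = 2x2 covariance."""
    p, v = x
    (a, b), (c, d) = P
    # P' = F P F^T + Q with F = [[1, dt], [0, 1]] and Q = q * I (assumed).
    P_new = ((a + dt * (b + c) + dt * dt * d + q, b + dt * d),
             (c + dt * d, d + q))
    return (p + dt * v, v), P_new


def update(x, P, z, r=1.0):
    """Correct the prediction with a position measurement z of variance r."""
    p, v = x
    (a, b), (c, d) = P
    s = a + r                  # innovation variance (H = [1, 0])
    k0, k1 = a / s, c / s      # Kalman gain
    y = z - p                  # innovation
    x_new = (p + k0 * y, v + k1 * y)
    # P' = (I - K H) P
    P_new = (((1.0 - k0) * a, (1.0 - k0) * b),
             (c - k1 * a, d - k1 * b))
    return x_new, P_new
```

Running `predict` then `update` once per frame on the head position would yield both a smoothed estimate and a covariance, the latter being one way to express the uncertainty of the parameters.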

Joint image likelihood for multiple objects and the background
– The visible part of an object ( ): determined by the depth order of all of the objects, which can be inferred from their 3D positions and the camera model.
– The non-object region ( ).
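The depth-order reasoning above can be sketched by visiting objects from nearest to farthest and letting each one keep only the pixels that no nearer object has claimed. The pixel-set representation, the `depths` dictionary, and the function names are assumptions made for this sketch.

```python
def visible_masks(projected, depths):
    """Split projected silhouettes into visible parts using depth order.

    projected: dict mapping object id -> set of (x, y) pixels it projects onto.
    depths:    dict mapping object id -> distance from the camera.
    Returns (visible, claimed): per-object visible pixel sets and their union.
    """
    claimed = set()
    visible = {}
    # Nearest objects first: they occlude everything behind them.
    for obj_id in sorted(projected, key=lambda k: depths[k]):
        visible[obj_id] = projected[obj_id] - claimed
        claimed |= projected[obj_id]
    return visible, claimed


def non_object_region(all_pixels, claimed):
    """Pixels not covered by any object hypothesis."""
    return set(all_pixels) - claimed
```

With two overlapping silhouettes, the nearer object keeps its full projection and the farther one loses the overlap, so the visible parts and the non-object region partition the image.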

Joint image likelihood for multiple objects and the background
The joint likelihood $P(I \mid \theta)$ consists of two terms. The first term:
– Background exclusion: the likelihood favors an object hypothesis that differs from the background.
– Object attraction: the likelihood favors an object hypothesis that is similar to its corresponding object in the previous frame.

Joint image likelihood for multiple objects and the background
– $d_i$ is the color histogram of the background image within the visibility mask of object i.
– $p_i$ is the color histogram of the object.
– The similarity of the two histograms is measured by the Bhattacharyya coefficient.
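For reference, the Bhattacharyya coefficient of two normalized histograms is the sum over bins of the square root of the product of the bin values. A minimal Python sketch follows; the function names are hypothetical and the histograms are plain lists of bin counts.

```python
import math


def normalize(hist):
    """Scale non-negative bin counts so that they sum to one."""
    total = float(sum(hist))
    if total <= 0.0:
        raise ValueError("histogram must have positive total mass")
    return [h / total for h in hist]


def bhattacharyya(p, q):
    """Bhattacharyya coefficient of two normalized histograms of equal length.

    The result lies in [0, 1]: 1 for identical histograms, 0 for histograms
    with no overlapping bins.
    """
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    return sum(math.sqrt(a * b) for a, b in zip(p, q))
```

A high coefficient between an object histogram and the background histogram under its mask would indicate a hypothesis that is hard to tell apart from the background, while a high coefficient against the object's previous-frame histogram would support the association.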

Joint image likelihood for multiple objects and the background
The second term is:
– $e_j = \log(P_b(I_j))$ is the log probability of pixel j belonging to the background model.
– The likelihood penalizes the difference from the background model.

Computing MAP by efficient MCMC
Computing the MAP is an optimization problem, and the optimization is challenging:
– With an unknown number of objects, the solution space contains subspaces of varying dimension.
– It includes both discrete variables and continuous variables.
We adapt a data-driven Markov chain Monte Carlo (MCMC) approach to explore this complex solution space.

Computing MAP by efficient MCMC
We use an MCMC method with jump/diffusion dynamics to sample the posterior probability.
– Jumps: cause the Markov chain to move between subspaces of different dimensions and traverse the discrete variables.
– Diffusions: make the Markov chain sample the continuous variables.
In the process of sampling, the best solution is recorded and the uncertainty associated with the solution is also obtained.

Computing MAP by efficient MCMC
MCMC method:
– We want to design a Markov chain with stationary distribution $P(\theta)$.
– At the gth iteration, we sample a candidate state $\theta'$ from a proposal distribution $q(\theta_g \mid \theta_{g-1})$.
– If the candidate state $\theta'$ is accepted, $\theta_g = \theta'$.
– Otherwise, $\theta_g = \theta_{g-1}$.

Computing MAP by efficient MCMC
– A Markov chain constructed in this way has its stationary distribution equal to $P(\theta)$, independent of the choice of the proposal probability $q(\cdot)$ and the initial state $\theta_0$.
– The choice of the proposal probability $q(\cdot)$ can affect the efficiency of MCMC significantly.
– Using more informed proposal probabilities, for example, as in data-driven MCMC, makes the Markov chain traverse the solution space more efficiently. Therefore, the proposal distribution is written as $q(\theta_g \mid \theta_{g-1}, I)$.
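The slides do not show the acceptance rule, so the sketch below assumes the standard Metropolis-Hastings acceptance probability $\min\left(1, \frac{P(\theta')\, q(\theta_{g-1} \mid \theta')}{P(\theta_{g-1})\, q(\theta' \mid \theta_{g-1})}\right)$ and records the best state seen so far as the MAP estimate. All function and argument names are hypothetical, and the toy target in the usage note is a one-dimensional stand-in for the posterior.

```python
import math
import random


def metropolis_hastings(log_p, propose, log_q, theta0, iters, rng):
    """Generic Metropolis-Hastings loop that also tracks the best state.

    log_p(theta):     unnormalized log posterior.
    propose(theta):   draws a candidate given the current state.
    log_q(a, b):      log proposal density of moving to a from b.
    Returns (best_state, samples).
    """
    theta = theta0
    best, best_lp = theta0, log_p(theta0)
    samples = []
    for _ in range(iters):
        cand = propose(theta, rng)
        lp_cand, lp_cur = log_p(cand), log_p(theta)
        # Log of the standard acceptance ratio (assumed; not shown in the slides).
        log_alpha = (lp_cand + log_q(theta, cand)) - (lp_cur + log_q(cand, theta))
        if rng.random() < math.exp(min(0.0, log_alpha)):
            theta = cand
            if lp_cand > best_lp:
                best, best_lp = cand, lp_cand
        samples.append(theta)
    return best, samples
```

As a usage example, with `log_p = lambda x: -0.5 * (x - 3.0) ** 2`, a symmetric random-walk proposal `lambda x, rng: x + rng.gauss(0.0, 1.0)`, and `log_q = lambda a, b: 0.0`, the recorded best state should end up close to the mode at 3. Because the proposal is symmetric in this toy case, the two `log_q` terms cancel; they matter only for asymmetric, data-driven proposals.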

Markov chain dynamics
– The dynamics correspond to the proposal distribution with a mixture density, where A is the set of all dynamics: A = {add, remove, establish, break, exchange, diff}.
– We assume that we have the sample $\theta_{g-1}$ from the (g-1)th iteration, and now propose a candidate $\theta'$ for the gth iteration.

Markov chain dynamics
Dynamics:
– Object hypothesis addition: sample the parameters of a new human hypothesis $(k_{n+1}, m_{n+1})$ and add it to $\theta_{g-1}$.
– Object hypothesis removal.
– Establish correspondence.

Markov chain dynamics
– Break correspondence.
– Exchange identity.
– Parameter update.
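The mixture over dynamics can be read as a two-step proposal: first pick a dynamic type from A, then apply it to the current state. The Python sketch below illustrates this for the add, remove, and diff moves only; the mixture weights, the uniform sampling of a new head position (the slides describe proposals driven by image cues instead), and the dictionary state layout are all assumptions made for illustration.

```python
import random

DYNAMICS = ["add", "remove", "establish", "break", "exchange", "diff"]


def choose_dynamic(weights, rng):
    """Draw one dynamic type according to the (hypothetical) mixture weights."""
    return rng.choices(DYNAMICS, weights=[weights[a] for a in DYNAMICS], k=1)[0]


def propose(state, dynamic, rng, next_id):
    """Return a candidate state; state maps object id -> (x, y) head position.

    Only the add, remove, and diff moves are sketched. The correspondence
    moves (establish, break, exchange) are left as no-ops here.
    """
    cand = dict(state)
    if dynamic == "add":
        # Placeholder: uniform position in a 100x100 image, not data-driven.
        cand[next_id] = (rng.uniform(0.0, 100.0), rng.uniform(0.0, 100.0))
    elif dynamic == "remove" and cand:
        del cand[rng.choice(sorted(cand))]
    elif dynamic == "diff" and cand:
        k = rng.choice(sorted(cand))
        x, y = cand[k]
        cand[k] = (x + rng.gauss(0.0, 1.0), y + rng.gauss(0.0, 1.0))
    return cand
```

The add and remove moves change the number of objects and so act as jumps between subspaces, while the diff move perturbs continuous parameters within one subspace.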

Experimental results Evaluation on an outdoor scene

Experimental results
– There are 20 occlusion events overall, nine of which are heavy occlusions.
– We use 500 iterations per frame.
– Trajectory-based errors: the trajectories of three objects are broken once (ID 28 -> ID 35, ID 31 -> ID 32, ID 30 -> ID 41).
– Trajectory initialization: some trajectories start when the objects are only partially inside the scene. Only the initializations of three objects (objects 31, 50, and 52) are noticeably delayed. Partial occlusion and/or the lack of contrast with the background are the causes of the delays.
– The detection rate and the false-alarm rate are and 0.27 percent.

Conclusion
– A principled approach to simultaneously detect and track humans in a crowded scene.
– We formulate the problem as a Bayesian MAP estimation problem.
– The inference is performed by an MCMC-based approach that explores the joint solution space.
– The success lies in the integration of the top-down Bayesian formulation, which follows the image formation process, with the bottom-up features that are directly extracted from images.