Tracking Multiple Occluding People by Localizing on Multiple Scene Planes Professor :王聖智 教授 Student :周節.

Slides:



Advertisements
Similar presentations
Reconstructing Non-stationary Articulated Objects in Monocular Video using Silhouette Information Saad M. Khan and Mubarak Shah University of Central Florida,
Advertisements

People Counting and Human Detection in a Challenging Situation Ya-Li Hou and Grantham K. H. Pang IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART.
Simultaneous surveillance camera calibration and foot-head homology estimation from human detection 1 Author : Micusic & Pajdla Presenter : Shiu, Jia-Hau.
Department of Electrical and Electronic Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong Three-dimensional curve reconstruction from.
Object Inter-Camera Tracking with non- overlapping views: A new dynamic approach Trevor Montcalm Bubaker Boufama.
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
Adviser : Ming-Yuan Shieh Student ID : M Student : Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.
Tracking Multiple Occluding People by Localizing on Multiple Scene Planes Saad M. Khan and Mubarak Shah, PAMI, VOL. 31, NO. 3, MARCH 2009, Donguk Seo
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
Formation et Analyse d’Images Session 8
Broadcast Court-Net Sports Video Analysis Using Fast 3-D Camera Modeling Jungong Han Dirk Farin Peter H. N. IEEE CSVT 2008.
Boundary matting for view synthesis Samuel W. Hasinoff Sing Bing Kang Richard Szeliski Computer Vision and Image Understanding 103 (2006) 22–32.
Computational Photography: Image-based Modeling Jinxiang Chai.
Segmentation and Tracking of Multiple Humans in Crowded Environments Tao Zhao, Ram Nevatia, Bo Wu IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,
Ensemble Tracking Shai Avidan IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE February 2007.
Region-Level Motion- Based Background Modeling and Subtraction Using MRFs Shih-Shinh Huang Li-Chen Fu Pei-Yung Hsiao 2007 IEEE.
Multiview stereo. Volumetric stereo Scene Volume V Input Images (Calibrated) Goal: Determine occupancy, “color” of points in V.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
Multi-view stereo Many slides adapted from S. Seitz.
Fast Illumination-invariant Background Subtraction using Two Views: Error Analysis, Sensor Placement and Applications Ser-Nam Lim, Anurag Mittal, Larry.
Domenico Bloisi, Luca Iocchi, Dorothy Monekosso, Paolo Remagnino
Effective Gaussian mixture learning for video background subtraction Dar-Shyang Lee, Member, IEEE.
CS664 Lecture #19: Layers, RANSAC, panoramas, epipolar geometry Some material taken from:  David Lowe, UBC  Jiri Matas, CMP Prague
Tracking Video Objects in Cluttered Background
MULTIPLE MOVING OBJECTS TRACKING FOR VIDEO SURVEILLANCE SYSTEMS.
Multi-camera Video Surveillance: Detection, Occlusion Handling, Tracking and Event Recognition Oytun Akman.
Introduction to Object Tracking Presented by Youyou Wang CS643 Texas A&M University.
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
1 Real Time, Online Detection of Abandoned Objects in Public Areas Proceedings of the 2006 IEEE International Conference on Robotics and Automation Authors.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
1 Video Surveillance systems for Traffic Monitoring Simeon Indupalli.
EE392J Final Project, March 20, Multiple Camera Object Tracking Helmy Eltoukhy and Khaled Salama.
1 Activity and Motion Detection in Videos Longin Jan Latecki and Roland Miezianko, Temple University Dragoljub Pokrajac, Delaware State University Dover,
Computer vision: models, learning and inference
Learning to classify the visual dynamics of a scene Nicoletta Noceti Università degli Studi di Genova Corso di Dottorato.
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
What Does the Scene Look Like From a Scene Point? Donald Tanguay August 7, 2002 M. Irani, T. Hassner, and P. Anandan ECCV 2002.
Tracking Pedestrians Using Local Spatio- Temporal Motion Patterns in Extremely Crowded Scenes Louis Kratz and Ko Nishino IEEE TRANSACTIONS ON PATTERN ANALYSIS.
Action recognition with improved trajectories
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Olga Zoidi, Anastasios Tefas, Member, IEEE Ioannis Pitas, Fellow, IEEE
Shape-Based Human Detection and Segmentation via Hierarchical Part- Template Matching Zhe Lin, Member, IEEE Larry S. Davis, Fellow, IEEE IEEE TRANSACTIONS.
1. Introduction Motion Segmentation The Affine Motion Model Contour Extraction & Shape Estimation Recursive Shape Estimation & Motion Estimation Occlusion.
Vision Surveillance Paul Scovanner.
A General Framework for Tracking Multiple People from a Moving Camera
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Person detection, tracking and human body analysis in multi-camera scenarios Montse Pardàs (UPC) ACV, Bilkent University, MTA-SZTAKI, Technion-ML, University.
Spatio-temporal constraints for recognizing 3D objects in videos Nicoletta Noceti Università degli Studi di Genova.
Visual SLAM Visual SLAM SPL Seminar (Fri) Young Ki Baik Computer Vision Lab.
Expectation-Maximization (EM) Case Studies
Interactive Sand Art Drawing Using RGB-D Sensor
Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team
By Naveen kumar Badam. Contents INTRODUCTION ARCHITECTURE OF THE PROPOSED MODEL MODULES INVOLVED IN THE MODEL FUTURE WORKS CONCLUSION.
Journal of Visual Communication and Image Representation
Presented by: Idan Aharoni
Suspicious Behavior in Outdoor Video Analysis - Challenges & Complexities Air Force Institute of Technology/ROME Air Force Research Lab Unclassified IED.
Matching of Objects Moving Across Disjoint Cameras Eric D. Cheng and Massimo Piccardi IEEE International Conference on Image Processing
1 Shape Descriptors for Maximally Stable Extremal Regions Per-Erik Forss´en and David G. Lowe Department of Computer Science University of British Columbia.
Fast Human Detection in Crowded Scenes by Contour Integration and Local Shape Estimation Csaba Beleznai, Horst Bischof Computer Vision and Pattern Recognition,
Motion tracking TEAM D, Project 11: Laura Gui - Timisoara Calin Garboni - Timisoara Peter Horvath - Szeged Peter Kovacs - Debrecen.
Motion Estimation of Moving Foreground Objects Pierre Ponce ee392j Winter March 10, 2004.
Video object segmentation and its salient motion detection using adaptive background generation Kim, T.K.; Im, J.H.; Paik, J.K.;  Electronics Letters 
Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
C. Canton1, J.R. Casas1, A.M.Tekalp2, M.Pardàs1
Common Classification Tasks
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Occlusion and smoothness probabilities in 3D cluttered scenes
Presentation transcript:

Tracking Multiple Occluding People by Localizing on Multiple Scene Planes Professor :王聖智 教授 Student :周節

Outline Introduction Related work Localization Algorithm Tracking Algorithm Result Conclusion Reference

Outline Introduction Related work Localization Algorithm Tracking Algorithm Result Conclusion Reference

Introduction Surveillance Detection Tracking

Introduction If the object is isolated… Physical properties are useful. Such as color, shape… It is much simpler.

Introduction ? But if the objects are not isolated… Occlusion Lack of visibility In crowded and cluttered scenes

Introduction Difficulty It’s difficult to track individual people when occlusion occurred.

Outline Introduction Related work Localization Algorithm Tracking Algorithm Result Conclusion Reference

Related work Monocular Approach Multi-camera Approach

Monocular Approach Handling occlusions (1) Color and shape (2) Objects contours and appearances (3) Blob split and merge analysis (4) ……

Monocular Approach Drawbacks (1) Can’t deal with full occlusion. (2) Can’t deal with long periods. Therefore (1) Single view is limited. (2) Multi-camera is preferred.

Multi-camera Approach Methods (1) Construct 3D model using voxel (2) Calibrated camera to obtain 3D locations (3) Switching camera (4) Stereo camera (5) ……

Multi-camera Approach Problems (1) Features might be corrupted by occlusions. (2) Very similar color and shape. (3) Full occlusion. Therefore This paper…

Outline Introduction Related work Localization Algorithm Tracking Algorithm Result Conclusion Reference

Localization Algorithm 1.Obtain the foreground likelihood maps 2.Obtain reference plane homographies 3.Compute and find the localization

Localization Algorithm 1.Obtain the foreground likelihood maps –Model Background using a Mixture of Gaussians. –Perform Background Subtraction to obtain foreground likelihood information.

Localization Algorithm 2.Obtain reference plane homographies –Homography Constraint

Homography Constraint (R, T)

Homography Constraint

Localization Algorithm x 1 ~x n are the observations in images. X is the event that the pixel is inside a foreground object. L(x i ) is the likelihood of observation x i belonging to the foreground.

Example

Localization Algorithm For robustness Modeling Clutter and FOV Constraints

Example

Localization Algorithm For robustness –Localization at Multiple Planes

Outline Introduction Related work Localization Algorithm Tracking Algorithm Result Conclusion Reference

Tracking Algorithm Object trajectories are spatially and temporally coherent. T=0 T=10 T=20 T=30

Tracking Algorithm Spatially and temporally coherent

Tracking Algorithm

DEMO

Outline Introduction Related work Localization Algorithm Tracking Algorithm Result Conclusion Reference

Result Case 1 –Parking lot

Result Multi-camera improves performance.

Result Case 2 –Indoor

Result Multi-plane improves performance.

Result Case 3 –Basketball View limited –Error –Miss

Result Case 4 –Soccer Background limited –Error –Miss

Outline Introduction Related work Localization Algorithm Tracking Algorithm Result Conclusion Reference

Conclusion To resolve occlusions and localize people: Planar homography constraints are used to fuse foreground likelihood information. Detection and tracking are performed simultaneously in the space-time occupancy likelihood data.

Conclusion Advantage –Very good performance when occlusions occur. –No calibration is needed. –Only 2D constructs, purely image-based.

Conclusion Disadvantage –If a person’s appearance is very similar to the background or –If a person is occluded by some portion of the background itself. –If a part of the scene is occluded in all views by the foreground objects. Solutions –Add other models –like color model,human motion model…

Reference Saad M. Khan and Mubarak Shah, “Tracking Multiple Occluding People by Localizing on Multiple Scene Planes”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume: 31, Issue: 3, March 2009 S.M. Khan and M. Shah, “A Multi-View Approach to Tracking People in Crowded Scenes Using a Planar Homography Constraint,” Proc. Ninth European Conf. Computer Vision, 2006.

Property 1.Do not use color models or shape cues of individual people 2.Our method of detection and occlusion resolution is based on geometrical constructs and requires only the distinction of foreground from back-ground, which is obtained using standard background modeling techniques. 3.the core of our method is a planar homographic occupancy constraint that combines foreground likelihood information from different views to resolve occlusions and determine regions on scene planes that are occupied by people. 4.consistently warp (under homographies of the reference plane) to foreground regions in every view. 5.The reason we use foreground likelihood maps instead of binary foreground maps is to delay the thresholding step to the last possible stage. 6.multiple planes parallel to the reference plane to robustly localize scene objects. 7.To track,we obtain object scene occupancies for awindow of time and stack them together, creating a space-time volume. 8.designing an energy functional that combines scene occupancy information and spatio-temporal proximity. 9.Homographies induced by the reference plane between views are computed using SIFT feature matches and employing the RANSAC algorithm. 10.The result is that our approach is purely image based and performs fusion in the image plane without requiring to go in 3D space, and thus eliminating the need for fully calibrated cameras.

Monocular Approach Methods (1) Blob tracking (2) Shape and Color (3) Adaboost and Particle filter (4) ……