Kapitel 7 “Tracking” – p. 1 Tracking  Fundamentals  Object representation  Object detection  Object tracking A. Yilmaz, O. Javed, and M. Shah Object.

Slides:

Advertisements

Similar presentations

Kapitel 11 Tracking Fundamentals Object representation

Advertisements

Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.

CSCE643: Computer Vision Bayesian Tracking & Particle Filtering Jinxiang Chai Some slides from Stephen Roth.

Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.

Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.

Introduction To Tracking

1 Video Processing Lecture on the image part (8+9) Automatic Perception Volker Krüger Aalborg Media Lab Aalborg University Copenhagen

Learning to estimate human pose with data driven belief propagation Gang Hua, Ming-Hsuan Yang, Ying Wu CVPR 05.

Forward-Backward Correlation for Template-Based Tracking Xiao Wang ECE Dept. Clemson University.

Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.

Computer Vision REU Week 2 Adam Kavanaugh. Video Canny Put canny into a loop in order to process multiple frames of a video sequence Put canny into a.

Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,

Tracking Objects with Dynamics Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/21/15 some slides from Amin Sadeghi, Lana Lazebnik,

Region labelling Giving a region a name. Image Processing and Computer Vision: 62 Introduction Region detection isolated regions Region description properties.

Motion Detection And Analysis Michael Knowles Tuesday 13 th January 2004.

SWE 423: Multimedia Systems Chapter 4: Graphics and Images (4)

A Study of Approaches for Object Recognition

Processing Digital Images. Filtering Analysis –Recognition Transmission.

Face Detection: a Survey Speaker: Mine-Quan Jing National Chiao Tung University.

Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2005 with a lot of slides stolen from Steve Seitz and.

Object Detection and Tracking Mike Knowles 11 th January 2005

Recognizing and Tracking Human Action Josephine Sullivan and Stefan Carlsson.

Tracking Video Objects in Cluttered Background

Introduction to Object Tracking Presented by Youyou Wang CS643 Texas A&M University.

Dorin Comaniciu Visvanathan Ramesh (Imaging & Visualization Dept., Siemens Corp. Res. Inc.) Peter Meer (Rutgers University) Real-Time Tracking of Non-Rigid.

Shadow Detection In Video Submitted by: Hisham Abu saleh.

Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques

A Vision-Based System that Detects the Act of Smoking a Cigarette Xiaoran Zheng, University of Nevada-Reno, Dept. of Computer Science Dr. Mubarak Shah,

Face Recognition and Retrieval in Video Basic concept of Face Recog. & retrieval And their basic methods. C.S.E. Kwon Min Hyuk.

MASKS © 2004 Invitation to 3D vision Lecture 3 Image Primitives andCorrespondence.

Topic regards: ◆ Browsing of Search Results ◆ Video Retrieval using Spatio-Temporal ◆ Object Tracking ◆ Face tracking Yuan-Hao Lai.

Multimodal Interaction Dr. Mike Spann

Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)

Olga Zoidi, Anastasios Tefas, Member, IEEE Ioannis Pitas, Fellow, IEEE

Mean-shift and its application for object tracking

Shape-Based Human Detection and Segmentation via Hierarchical Part- Template Matching Zhe Lin, Member, IEEE Larry S. Davis, Fellow, IEEE IEEE TRANSACTIONS.

1 Mean shift and feature selection ECE 738 course project Zhaozheng Yin Spring 2005 Note: Figures and ideas are copyrighted by original authors.

Human-Computer Interaction Human-Computer Interaction Tracking Hanyang University Jong-Il Park.

1. Introduction Motion Segmentation The Affine Motion Model Contour Extraction & Shape Estimation Recursive Shape Estimation & Motion Estimation Occlusion.

Video-Vigilance and Biometrics

Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.

報告人 : 林福城指導老師 : 陳定宏 1 From Res. Center of Intell. Transp. Syst., Beijing Univ. of Technol., Beijing, China By Zhe Liu ; Yangzhou Chen ; Zhenlong Li Appears.

1 Webcam Mouse Using Face and Eye Tracking in Various Illumination Environments Yuan-Pin Lin et al. Proceedings of the 2005 IEEE Y.S. Lee.

EECS 274 Computer Vision Segmentation by Clustering II.

Recognizing Action at a Distance Alexei A. Efros, Alexander C. Berg, Greg Mori, Jitendra Malik Computer Science Division, UC Berkeley Presented by Pundik.

Tracking People by Learning Their Appearance Deva Ramanan David A. Forsuth Andrew Zisserman.

Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)

Motion Analysis using Optical flow CIS750 Presentation Student: Wan Wang Prof: Longin Jan Latecki Spring 2003 CIS Dept of Temple.

Lecture 7: Features Part 2 CS4670/5670: Computer Vision Noah Snavely.

1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.

Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.

CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.

Rick Parent - CIS681 Motion Analysis – Human Figure Processing video to extract information of objects Motion tracking Pose reconstruction Motion and subject.

Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team

Real-Time Tracking with Mean Shift Presented by: Qiuhua Liu May 6, 2005.

Mean Shift ; Theory and Applications Presented by: Reza Hemati دی 89 December گروه بینایی ماشین و پردازش تصویر Machine Vision and Image Processing.

Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.

Image features and properties. Image content representation The simplest representation of an image pattern is to list image pixels, one after the other.

Instructor: Mircea Nicolescu Lecture 10 CS 485 / 685 Computer Vision.

MASKS © 2004 Invitation to 3D vision Lecture 3 Image Primitives andCorrespondence.

Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons.

COMP 9517 Computer Vision Motion and Tracking 6/11/2018

COMP 9517 Computer Vision Motion 7/21/2018 COMP 9517 S2, 2012.

Tracking Objects with Dynamics

Image Primitives and Correspondence

Dynamical Statistical Shape Priors for Level Set Based Tracking

PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD

Brief Review of Recognition + Context

Presentation transcript:

Kapitel 7 “Tracking” – p. 1 Tracking  Fundamentals  Object representation  Object detection  Object tracking A. Yilmaz, O. Javed, and M. Shah Object tracking: A survey ACM Computing Surveys, Vol. 38, No. 4, 1-45, 2006 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA Kapitel 7

Kapitel 7 “Tracking” – p. 2 Fundamentals (1)

Kapitel 7 “Tracking” – p. 3 Fundamentals (2) Applications of object tracking:  motion-based recognition: human identification based on gait, automatic object detection, etc.  automated surveillance: monitoring a scene to detect suspicious activities or unlikely events  video indexing: automatic annotation and retrieval of the videos in multimedia databases  human-computer interaction: gesture recognition, eye gaze tracking for data input to computers, etc.  traffic monitoring: real-time gathering of traffic statistics to direct traffic flow  vehicle navigation: video-based path planning and obstacle avoidance capabilities

Kapitel 7 “Tracking” – p. 4 Fundamentals (3) Tracking task:  In the simplest form, tracking can be defined as the problem of estimating the trajectory of an object in the image plane as it moves around a scene. In other words, a tracker assigns consistent labels to the tracked objects in different frames of a video. Additionally, depending on the tracking domain, a tracker can also provide object- centric information, such as orientation, area, or shape of an object.  Two subtasks: Build some model of what you want to track Use what you know about where the object was in the previous frame(s) to make predictions about the current frame and restrict the search Repeat the two subtasks, possibly updating the model

Kapitel 7 “Tracking” – p. 5 Fundamentals (4) Tracking objects can be complex due to:  loss of information caused by projection of 3D world on 2D image  noise in images  complex object shapes / motion  nonrigid or articulated nature of objects  partial and full object occlusions  scene illumination changes  real-time processing requirements Simplify tracking by imposing constraints:  Almost all tracking algorithms assume that the object motion is smooth with no abrupt changes  The object motion is assumed to be of constant velocity  Prior knowledge about the number and the size of objects, or the object appearance and shape

Kapitel 7 “Tracking” – p. 6 Object Represention (1) Object representation = Shape + Appearance Shape representations:  Points. The object is represented by a point, that is, the centroid or by a set of points; suitable for tracking objects that occupy small regions in an image  Primitive geometric shapes. Object shape is represented by a rectangle, ellipse, etc. Object motion for such representations is usually modeled by translation, affine, or projective transformation. Though primitive geometric shapes are more suitable for representing simple rigid objects, they are also used for tracking nonrigid objects.

Kapitel 7 “Tracking” – p. 7 Object Represention (2)  Object silhouette and contour. Contour = boundary of an object. Region inside the contour = silhouette. Silhouette and contour representations are suitable for tracking complex nonrigid shapes.  Articulated shape models. Articulated objects are composed of body parts (modelled by cylinders or ellipses) that are held together with joints. Example: human body = articulated object with torso, legs, hands, head, and feet connected by joints. The relationship between the parts are governed by kinematic motion models, e.g. joint angle, etc.  Skeletal models. Object skeleton can be extracted by applying medial axis transform to the object silhouette. Skeleton representation can be used to model both articulated and rigid objects.

Kapitel 7 “Tracking” – p. 8 Object Represention (3) Object representations. (a) Centroid, (b) multiple points, (c) rectangular patch, (d) elliptical patch, (e) part-based multiple patches, (f) object skeleton, (g) control points on object contour, (h) complete object contour, (i) object silhouette

Kapitel 7 “Tracking” – p. 9 Object Represention (4) Appearance representations:  Templates. Formed using simple geometric shapes or silhouettes. Suitable for tracking objects whose poses do not vary considerably during the course of tracking. Self-adapation of templates durch the tracking is possibe.

Kapitel 7 “Tracking” – p. 10  Probability densities of object appearance, can either be parametric (Gaussian and mixture of Gaussians) or nonparametric (histograms) Characterize an image region by its statistics. If the statistics differ from background, they should enable tracking. nonparametric: histogram (grayscale or color) Object Represention (5)

Kapitel 7 “Tracking” – p. 11 Object Represention (6) parametric: 1D Gaussian distribution

Kapitel 7 “Tracking” – p. 12 Object Represention (7) parametric: n-D Gaussian distribution Centered at (1,3) with a standard deviation of 3 in roughly the (0.878, 0.478) direction and of 1 in the orthogonal direction

Kapitel 7 “Tracking” – p. 13 Object Represention (8) parametric: Gaussian Mixture Models (GMM)

Kapitel 7 “Tracking” – p. 14 Object Represention (9) Beispiel: Mixture of three Gaussians in 2D space. (a) Contours of constant density for each mixture component. (b) Contours of constant density of mixture distribution p(x). (c) Surface plot of p(x).

Kapitel 7 “Tracking” – p. 15 Object Represention (10) Object representations are chosen according to the application  Point representations appropriate for tracking objects, which appear very small in an image (e.g. track distant birds)  For the objects whose shapes can be approximated by rectangles or ellipses, primitive geometric shape representations are more appropriate (e.g. face)  For tracking objects with complex shapes, for example, humans, a contour or a silhouette-based representation is appropriate (surveillance applications)

Kapitel 7 “Tracking” – p. 16 Object Represention (11) Feature selection for tracking:  Color: RGB, L ∗ u ∗ v ∗, L ∗ a ∗ b ∗, HSV, etc. There is no last word on which color space is more effective; a variety of color spaces have been used  Edges: less sensitive to illumination changes compared to color features. Algorithms that track the object boundary usually use edges as features. Because of its simplicity and accuracy, the most popular edge detection approach is the Canny Edge detector  Texture: measure of the intensity variation of a surface which quantifies properties such as smoothness and regularity In general, the most desirable property of a visual feature is its uniqueness so that the objects can be easily distinguished in the feature space

Kapitel 7 “Tracking” – p. 17 Object Detection (1) Object detection mechanism: required by every tracking method either at the beginning or when an object first appears in the video  Point detectors: find interest points in images which have an expressive texture in their respective localities  Segmentation: partition the image into perceptually similar regions

Kapitel 7 “Tracking” – p. 18 Object Detection (2)  Background subtraction: Object detection can be achieved by building a representation of the scene called the background model and then finding deviations from the model for each incoming frame. Any significant change in an image region from the background model signifies a moving object. The pixels constituting the regions undergoing change are marked for further processing. Usually, a connected component algorithm is applied to obtain connected regions corresponding to the objects.

Kapitel 7 “Tracking” – p. 19 Object Detection (3) Frame differencing of temporally adjacent frames:

Kapitel 7 “Tracking” – p. 20 Object Detection (4) Bildsequenz: ≈ 5 Bilder/s

Kapitel 7 “Tracking” – p. 21 Object Detection (5) Bildsubtraktion: Variante 1 Schwäche: Doppelbild eines Fahrzeugs (aus dem letzten und aktuellen Bild); Aufteilung einer konstanten Fläche

Kapitel 7 “Tracking” – p. 22 Object Detection (6) Bildsubtraktion: Variante 2 Referenzbild f r (r, c): Mittelung einer langen Sequenz von Bildern

Kapitel 7 “Tracking” – p. 23 Object Detection (7)

Kapitel 7 “Tracking” – p. 24 Object Detection (8) Statistical modeling of background: Learn gradual changes in time by Gaussian, I (x, y) ∼ N(μ(x, y), (x, y)), from the color observations in several consecutive frames. Once the background model is derived for every pixel (x, y) in the input frame, the likelihood of its color coming from N(μ(x, y), (x, y)) is computed.

Kapitel 7 “Tracking” – p. 25 Object Tracking (1)  (a) Point Tracking. Objects detected in consecutive frames are represented by points, and a point matching is done. This approach requires an external mechanism to detect the objects in every frame.  (b) Kernel Tracking. Kernel = object shape and appearance. E.g. kernel = a rectangular template or an elliptical shape with an associated histogram. Objects are tracked by computing the motion (parametric transformation such as translation, rotation, and affine) of the kernel in consecutive frames.  (c)+(d) Silhouette Tracking. Such methods use the information encoded inside the object region (appearance density and shape models). Given the object models, silhouettes are tracked by either shape matching (c) or contour evolution (d). The latter one can be considered as object segmentation applied in the temporal domain using the priors generated from the previous frames.

Kapitel 7 “Tracking” – p. 26 Object Tracking (2) Template Matching: brute force method for tracking single objects  Define a search area  Place the template defined from the previous frame at each position of the search area and compute a similarity measure between the template and the candidate  Select the best candidate with the maximal similarity measure The similarity measure can be a direct template comparison or statistical measures between two probability densities Limitation of template matching: high computation cost due to the brute force search  limit the object search to the vicinity of its previous position; position prediction

Kapitel 7 “Tracking” – p. 27 Object Tracking (3) Direct comparison: between template t(i,j) and candidate g(i,j) Bhattacharyya coefficient between two distributions:

Kapitel 7 “Tracking” – p. 28 Object Tracking (4) Example: Eye tracking (direct grayvalue comparison)

Kapitel 7 “Tracking” – p. 29 Object Tracking (5) Example: Elliptical head tracking using intensity gradients and color histograms

Kapitel 7 “Tracking” – p. 30 Object Tracking (6) Mean-shift tracking (instead of brute force search). (a) estimated object location at time t − 1, (b) frame at time t with initial location estimate using the previous object position, (c), (d), (e) location update using mean-shift iterations, (f) final object position at time t. D. Comaniciu, V. Ramesh, and P. Meer, Kernel-based object tracking. IEEE Trans. Patt. Analy. Mach. Intell. 25, 564–575, 2003