W HAT A RE W E T RACKING : A U NIFIED A PPROACH OF T RACKING AND R ECOGNITION Jialue Fan, Xiaohui Shen, Student Member, IEEE, and Ying Wu, Senior Member,

Slides:

Advertisements

Similar presentations

Neural Networks and Kernel Methods

Advertisements

CVPR2013 Poster Modeling Actions through State Changes.

David Rosen Goals  Overview of some of the big ideas in autonomous systems  Theme: Dynamical and stochastic systems lie at the intersection of mathematics.

DONG XU, MEMBER, IEEE, AND SHIH-FU CHANG, FELLOW, IEEE Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment.

Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.

A CTION R ECOGNITION FROM V IDEO U SING F EATURE C OVARIANCE M ATRICES Kai Guo, Prakash Ishwar, Senior Member, IEEE, and Janusz Konrad, Fellow, IEEE.

Tracking Learning Detection

Robust Object Tracking via Sparsity-based Collaborative Model

Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)

Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.

1 A scheme for racquet sports video analysis with the combination of audio-visual information Visual Communication and Image Processing 2005 Liyuan Xing,

Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.

A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.

ICME 2008 Huiying Liu, Shuqiang Jiang, Qingming Huang, Changsheng Xu.

Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.

Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.

A Bayesian algorithm for tracking multiple moving objects in outdoor surveillance video Department of Electrical Engineering and Computer Science The University.

Dynamic Face Recognition Committee Machine Presented by Sunny Tang.

Self-Supervised Segmentation of River Scenes Supreeth Achar *, Bharath Sankaran ‡, Stephen Nuske *, Sebastian Scherer *, Sanjiv Singh * * ‡

Classification with reject option in gene expression data Blaise Hanczar and Edward R Dougherty BIOINFORMATICS Vol. 24 no , pages

1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.

Linear Solution to Scale and Rotation Invariant Object Matching Professor: 王聖智教授 Student : 周節.

1 Probabilistic Formulation for Skin Detection Sanun Srisuk Seminar I.

Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques

Hand Signals Recognition from Video Using 3D Motion Capture Archive Tai-Peng Tian Stan Sclaroff Computer Science Department B OSTON U NIVERSITY I. Introduction.

Oral Defense by Sunny Tang 15 Aug 2003

Discovering Outlier Filtering Rules from Unlabeled Data Author: Kenji Yamanishi & Jun-ichi Takeuchi Advisor: Dr. Hsu Graduate: Chia- Hsien Wu.

Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)

Shape-Based Human Detection and Segmentation via Hierarchical Part- Template Matching Zhe Lin, Member, IEEE Larry S. Davis, Fellow, IEEE IEEE TRANSACTIONS.

1 A Bayesian Method for Guessing the Extreme Values in a Data Set Mingxi Wu, Chris Jermaine University of Florida September 2007.

A General Framework for Tracking Multiple People from a Moving Camera

Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.

Neural Networks Chapter 6 Joost N. Kok Universiteit Leiden.

 Tsung-Sheng Fu, Hua-Tsung Chen, Chien-Li Chou, Wen-Jiin Tsai, and Suh-Yin Lee Visual Communications and Image Processing (VCIP), 2011 IEEE, 6-9 Nov.

Video Tracking Using Learned Hierarchical Features

Computer Vision Lab Seoul National University Keyframe-Based Real-Time Camera Tracking Young Ki BAIK Vision seminar : Mar Computer Vision Lab.

Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.

Towards Semantic Embedding in Visual Vocabulary Towards Semantic Embedding in Visual Vocabulary The Twenty-Third IEEE Conference on Computer Vision and.

Sparse Bayesian Learning for Efficient Visual Tracking O. Williams, A. Blake & R. Cipolloa PAMI, Aug Presented by Yuting Qi Machine Learning Reading.

Case Study 1 Semantic Analysis of Soccer Video Using Dynamic Bayesian Network C.-L Huang, et al. IEEE Transactions on Multimedia, vol. 8, no. 4, 2006 Fuzzy.

Robust Object Tracking with Online Multiple Instance Learning

H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and.

Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.

Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.

Guest lecture: Feature Selection Alan Qi Dec 2, 2004.

 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.

Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.

A REAL-TIME DEFORMABLE DETECTOR 謝汝欣 OUTLINE  Introduction  Related Work  Proposed Method  Experiments 2.

Max-Confidence Boosting With Uncertainty for Visual tracking WEN GUO, LIANGLIANG CAO, TONY X. HAN, SHUICHENG YAN AND CHANGSHENG XU IEEE TRANSACTIONS ON.

Fast Human Detection in Crowded Scenes by Contour Integration and Local Shape Estimation Csaba Beleznai, Horst Bischof Computer Vision and Pattern Recognition,

Zhaoxia Fu, Yan Han Measurement Volume 45, Issue 4, May 2012, Pages 650–655 Reporter: Jing-Siang, Chen.

Detecting Occlusion from Color Information to Improve Visual Tracking

Week 3 Emily Hand UNR. Online Multiple Instance Learning The goal of MIL is to classify unseen bags, instances, by using the labeled bags as training.

A. M. R. R. Bandara & L. Ranathunga

Visual Saliency Update

Traffic Sign Recognition Using Discriminative Local Features Andrzej Ruta, Yongmin Li, Xiaohui Liu School of Information Systems, Computing and Mathematics.

Tracking Objects with Dynamics

Tracking parameter optimization

Incremental Boosting Incremental Learning of Boosted Face Detector ICCV 2007 Unsupervised Incremental Learning for Improved Object Detection in a Video.

The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression By: Patrick Lucey, Jeffrey F. Cohn, Takeo.

A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.

PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD

Online Graph-Based Tracking

Adaptive object recognition in RGBz images

Fine-Grained Visual Categorization

Sensorimotor Learning and the Development of Position Invariance

What is a WebQuest? Guided search for information

Week 3 Presentation Ngoc Ta Aidean Sharghi.

Point Set Representation for Object Detection and Beyond

SPECIAL ISSUE on Document Analysis, 5(2):1-15, 2005.

Presentation transcript:

W HAT A RE W E T RACKING : A U NIFIED A PPROACH OF T RACKING AND R ECOGNITION Jialue Fan, Xiaohui Shen, Student Member, IEEE, and Ying Wu, Senior Member, IEEE

O UTLINE Introduction Method a. overview description b. tracking procedure c. video-based object recognition Experiment result Conclusion

I NTRODUCTION One notable shortcoming of online models is that they are constructed and updated based on the previous appearance of the target without much semantic understanding.

I NTRODUCTION Once an object is discovered and tracked, the tracking results are continuously fed forward to the upper-level video-based recognition scheme. Based on the feedback from the recognition results, different off-line models dedicated to specific categories are adaptively selected, and the location of the tracked object in the next frame is determined by integrated optimization of these selected detectors and the tracking evidence.

M ETHOD Overview Description The target state : Xt. The target category : Ct. The input image at time t : It. The target measurement at time t : Zt = It(Xt). The online target model :. The object category : N different classes. The is the other class. Each object class is associated with a specific offline model

M ETHOD They employ a two-step EM-like method here: at time t, we first estimate x t (i.e., “tracking”), and then estimate ct based on the new tracking result z t = I t ( x t ) (i.e., “recognition”). Tracking procedure Based on online target model,the offine model selected by the previous recognition result,and the current input image It.

M ETHOD In a Bayesian perpective, they have By defining the energy term E = - ln p,

M ETHOD is the energy term related to tracking. is the energy term related to detection. Tracking term :

M ETHOD Detection term :

M ETHOD Video-based object recognition They instead find the optimal sequence { c 1,..., ct } given the measurement z 1,..., z t, which is indeed the video-based object recognition. Denote by ct = { c 1,..., ct }, z t = { z 1,..., z t }, they have

E XPERIMENTS σs is the (time varying) confidence score corresponding to the energy term ws, which can be obtained by: Linear SVM classifier : and is the SVM score.

E XPERIMENTS

C ONCLUSION The limitations : (1) The ambiguity of the tracking problem increases, as the number of object categories increases. (2) The wrong recognition result probably leads to error propagation. (3) The current design may not be appropriate for some tracking dataset, due to data type inconsistency.

C ONCLUSION As a mid-level task, visual tracking plays an important role for high-level semantic understanding or video analysis. Meanwhile the high-level understanding (e.g., object recognition) should feed back some guidance for low-level tracking. Motivated by this, we propose a unified approach to object tracking and recognition.

T HE E ND