A Nonparametric Treatment for Location/Segmentation Based Visual Tracking Le Lu Integrated Data Systems Dept. Siemens Corporate Research, Inc. Greg Hager.

Slides:

Advertisements

Similar presentations

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Advertisements

Face Alignment by Explicit Shape Regression

Three things everyone should know to improve object retrieval

Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.

Patch to the Future: Unsupervised Visual Prediction

1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.

Online Multiple Classifier Boosting for Object Tracking Tae-Kyun Kim 1 Thomas Woodley 1 Björn Stenger 2 Roberto Cipolla 1 1 Dept. of Engineering, University.

Vision Based Control Motion Matt Baker Kevin VanDyke.

Structural Human Action Recognition from Still Images Moin Nabi Computer Vision Lab. ©IPM - Oct

Face Alignment at 3000 FPS via Regressing Local Binary Features

Ziming Zhang *, Ze-Nian Li, Mark Drew School of Computing Science, Simon Fraser University, Vancouver, B.C., Canada {zza27, li, Learning.

Forward-Backward Correlation for Template-Based Tracking Xiao Wang ECE Dept. Clemson University.

Robust Object Tracking via Sparsity-based Collaborative Model

Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.

Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.

Special Topic on Image Retrieval Local Feature Matching Verification.

A Robust Pedestrian Detection Approach Based on Shapelet Feature and Haar Detector Ensembles Wentao Yao, Zhidong Deng TSINGHUA SCIENCE AND TECHNOLOGY ISSNl.

Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh.

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

Learning from Observations Chapter 18 Section 1 – 4.

Graz University of Technology, AUSTRIA Institute for Computer Graphics and Vision Fast Visual Object Identification and Categorization Michael Grabner,

Tracking with Online Appearance Model Bohyung Han

2D1431 Machine Learning Boosting.

Ensemble Tracking Shai Avidan IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE February 2007.

1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.

Multi-camera Video Surveillance: Detection, Occlusion Handling, Tracking and Event Recognition Oytun Akman.

Dorin Comaniciu Visvanathan Ramesh (Imaging & Visualization Dept., Siemens Corp. Res. Inc.) Peter Meer (Rutgers University) Real-Time Tracking of Non-Rigid.

REALTIME OBJECT-OF-INTEREST TRACKING BY LEARNING COMPOSITE PATCH-BASED TEMPLATES Yuanlu Xu, Hongfei Zhou, Qing Wang*, Liang Lin Sun Yat-sen University,

1 Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data Presented by: Tun-Hsiang Yang.

Computer Vision James Hays, Brown

Action recognition with improved trajectories

Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)

Mean-shift and its application for object tracking

BraMBLe: The Bayesian Multiple-BLob Tracker By Michael Isard and John MacCormick Presented by Kristin Branson CSE 252C, Fall 2003.

Visual Tracking with Online Multiple Instance Learning

A General Framework for Tracking Multiple People from a Moving Camera

1 Action Classification: An Integration of Randomization and Discrimination in A Dense Feature Representation Computer Science Department, Stanford University.

Kaihua Zhang Lei Zhang (PolyU, Hong Kong) Ming-Hsuan Yang (UC Merced, California, U.S.A. ) Real-Time Compressive Tracking.

CSE 185 Introduction to Computer Vision Pattern Recognition 2.

Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.

Boris 2 Boris Babenko 1 Ming-Hsuan Yang 2 Serge Belongie 1 (University of California, Merced, USA) 2 (University of California, San Diego, USA) Visual.

Discriminative Local Binary Patterns for Human Detection in Personal Album.

A Codebook-Free and Annotation-free Approach for Fine-Grained Image Categorization Authors Bangpeng Yao et al. Presenter Hyung-seok Lee ( 이형석 ) CVPR 2012.

BAGGING ALGORITHM, ONLINE BOOSTING AND VISION Se – Hoon Park.

In Defense of Nearest-Neighbor Based Image Classification Oren Boiman The Weizmann Institute of Science Rehovot, ISRAEL Eli Shechtman Adobe Systems Inc.

Expectation-Maximization (EM) Case Studies

Sparse Bayesian Learning for Efficient Visual Tracking O. Williams, A. Blake & R. Cipolloa PAMI, Aug Presented by Yuting Qi Machine Learning Reading.

CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.

Associative Hierarchical CRFs for Object Class Image Segmentation

Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.

Real-Time Tracking with Mean Shift Presented by: Qiuhua Liu May 6, 2005.

Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:

Demosaicking for Multispectral Filter Array (MSFA)

Stochastic Grammars: Overview Representation: Stochastic grammar Representation: Stochastic grammar Terminals: object interactions Terminals: object interactions.

Semantic Alignment Spring 2009 Ben-Gurion University of the Negev.

Max-Confidence Boosting With Uncertainty for Visual tracking WEN GUO, LIANGLIANG CAO, TONY X. HAN, SHUICHENG YAN AND CHANGSHENG XU IEEE TRANSACTIONS ON.

Week 3 Emily Hand UNR. Online Multiple Instance Learning The goal of MIL is to classify unseen bags, instances, by using the labeled bags as training.

Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.

Robust and Fast Collaborative Tracking with Two Stage Sparse Optimization Authors: Baiyang Liu, Lin Yang, Junzhou Huang, Peter Meer, Leiguang Gong and.

2. Skin - color filtering.

Krishna Kumar Singh, Yong Jae Lee University of California, Davis

Video Google: Text Retrieval Approach to Object Matching in Videos

COMP61011 : Machine Learning Ensemble Models

A segmentation and tracking algorithm

A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.

PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD

Part-based visual tracking with online latent structural learning -Rui Yao et al. ICCV 2013 Cvlab Jung ilchae.

Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu

Video Google: Text Retrieval Approach to Object Matching in Videos

Deep Object Co-Segmentation

Presentation transcript:

A Nonparametric Treatment for Location/Segmentation Based Visual Tracking Le Lu Integrated Data Systems Dept. Siemens Corporate Research, Inc. Greg Hager Computer Science Dept. Johns Hopkins University CVPR 2007, Minneapolis, MN

Examples (I) Data from [Avidan 2005], [Avidan 2007]

Examples (II) Data from [Chuang et al. 2002]

Roadmap Representation: Tracking as a binary classification/matching problem through bags of patches (both model and observation) Algorithm: Online robust appearance model updating in a nonparametric manner Extensions for segmentation based tracking Results Conclusion & discussion

Representation Nonparametric bags of patches appearance model  Image patches are represented as HOG+Color [Avidan 2005] Frame (t)

Representation Binary (Foreground/Background) classification of distributions of image patches:  KNN distance matching  PCA/LDA/NDA + KDE matching  SVM matching

Representation From the normalized positive-class (ie. Foreground/object) Confidence Map, use Mean-Shift algorithm [Comaniciu et al. 2003] to locate the new object position as the highest sum of confidences within the located foreground rectangle (red). Frame (t+1)

Algorithm Maintain appearance model over time via nonparametric bidirectional consistency check and resampling:  test new image patches against bags of patches appearance models (M B |M F )  test appearance models against new observations of bags of patches (O or O B |O F ) Simple computations:  a sample-to-distribution distance metric using KNN distance  mean, variance/std over distributions of distances

Algorithm (1) Pre-filtering: reject ambiguous image patches at (t+1) where, comparing, against each other

Algorithm (2) Model Rigidity: reject redundant, outlier image patches while keeping Thus we have from ie. comparing against ; against where

Algorithm (3) Integrating from last step, we have intermediate appearance models (4) Probability of Survival:  For an image patch in above foreground appearance model we compute its distance convert it as a “probability of survival” for resampling to keep the fixed size appearance model  Similar process to obtain from against

Extension for segmentation tracking Use “superpixels” to sample image spatially adaptively. Remove pre-filtering Run a partitioning algorithm in, and resample with respective to partitions. Apply a weak shape model in the form of KDE Use “superpixels” as basic elements for {F|B} labeling by aggregating patch distances inside image segment.

Extension for segmentation tracking  The differences are that  location tracking is considered as a discriminative task;  while segmentation tracking is targeted to keep a more complete profile of {F|B} appearance over time.  HOG+Color for location tracking;  PCA+KDE for segmentation tracking  For different feature representations/matching criteria evaluation, see [Lu & Hager, 2006]

Results Data from [Jepson, Fleet, El-Maraghi 2003], [Avidan 2005]

Results Data from [Chuang et al. 2002]

Results

Discussion (other information)

Discussion (multi-target)

Discussion (full occlusion)

Discussion (differences with Ensemble Tracking [Avidan 2005, 2007] ) Appearance model encoded in sampled and resampled image patches directly; appearance model encoded in weak classifiers  Flexibility on Long-term interaction modeling  Flexibility on choosing over different classification methods besides boosting Feature (dense/sparse) based approach which is robust to partial occlusion without explicit occlusion handling Discriminative approach for location tracking, exemplar based approach for segmentation tracking; discriminative approach for ensemble tracking

Discussion (Response maps from Appearance only)

Acknowledgement Dr. Shai Avidan (Merl) for valuable discussion and providing data for testing Prof. Y.-Y Chuang (NUT) for providing data Dr. Faith Porikli (Merl) for providing data Anonymous reviewers for useful feedbacks