Kan Liu, Bingpeng Ma, Wei Zhang, Rui Huang

Slides:



Advertisements
Similar presentations
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Advertisements

Loris Bazzani*, Marco Cristani*†, Alessandro Perina*, Michela Farenzena*, Vittorio Murino*† *Computer Science Department, University of Verona, Italy †Istituto.
Zhimin CaoThe Chinese University of Hong Kong Qi YinITCS, Tsinghua University Xiaoou TangShenzhen Institutes of Advanced Technology Chinese Academy of.
Three things everyone should know to improve object retrieval
Limin Wang, Yu Qiao, and Xiaoou Tang
Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.
- Recovering Human Body Configurations: Combining Segmentation and Recognition (CVPR’04) Greg Mori, Xiaofeng Ren, Alexei A. Efros and Jitendra Malik -
Patch to the Future: Unsupervised Visual Prediction
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Yuanlu Xu Human Re-identification: A Survey.
Space-time interest points Computational Vision and Active Perception Laboratory (CVAP) Dept of Numerical Analysis and Computer Science KTH (Royal Institute.
Mixture of trees model: Face Detection, Pose Estimation and Landmark Localization Presenter: Zhang Li.
Computer Vision Laboratory 1 Unrestricted Recognition of 3-D Objects Using Multi-Level Triplet Invariants Gösta Granlund and Anders Moe Computer Vision.
Semi-Supervised Hierarchical Models for 3D Human Pose Reconstruction Atul Kanaujia, CBIM, Rutgers Cristian Sminchisescu, TTI-C Dimitris Metaxas,CBIM, Rutgers.
Yuanlu Xu Advisor: Prof. Liang Lin Person Re-identification by Matching Compositional Template with Cluster Sampling.
Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,
Robust Object Tracking via Sparsity-based Collaborative Model
Computer and Robot Vision I
Local Descriptors for Spatio-Temporal Recognition
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
Recognition of Human Gait From Video Rong Zhang, C. Vogler, and D. Metaxas Computational Biomedicine Imaging and Modeling Center Rutgers University.
MVL (Machine Vision Lab) UIC HUMAN MOTION VIDEO DATABASE Jezekiel Ben-Arie ECE Department University Of Illinois at Chicago Scripts, Queries, Recognition.
Human Identification using Silhouette Gait Data Rutgers University Chan-Su Lee.
Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.
Learning to classify the visual dynamics of a scene Nicoletta Noceti Università degli Studi di Genova Corso di Dottorato.
Bag-of-Words based Image Classification Joost van de Weijer.
SUPERVISED NEURAL NETWORKS FOR PROTEIN SEQUENCE ANALYSIS Lecture 11 Dr Lee Nung Kion Faculty of Cognitive Sciences and Human Development UNIMAS,
Flow Based Action Recognition Papers to discuss: The Representation and Recognition of Action Using Temporal Templates (Bobbick & Davis 2001) Recognizing.
COMPUTER VISION: SOME CLASSICAL PROBLEMS ADWAY MITRA MACHINE LEARNING LABORATORY COMPUTER SCIENCE AND AUTOMATION INDIAN INSTITUTE OF SCIENCE June 24, 2013.
Multimodal Information Analysis for Emotion Recognition
Spatio-temporal constraints for recognizing 3D objects in videos Nicoletta Noceti Università degli Studi di Genova.
Object Recognition in Images Slides originally created by Bernd Heisele.
Tracking People by Learning Their Appearance Deva Ramanan David A. Forsuth Andrew Zisserman.
Face Recognition: An Introduction
Discovering Deformable Motifs in Time Series Data Jin Chen CSE Fall 1.
MUSTAFA OZAN ÖZEN PINAR SAĞLAM LEVENT ÜNVER MEHMET YILMAZ.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
Human Re-identification by Matching Compositional Template with Cluster Sampling Yuanlu Xu 1, Liang Lin 1, Wei-Shi Zheng 1, Xiaobai Liu 2 Abstract This.
模式识别国家重点实验室 中国科学院自动化研究所 National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences Matching Tracking Sequences Across.
Human Activity Recognition at Mid and Near Range Ram Nevatia University of Southern California Based on work of several collaborators: F. Lv, P. Natarajan,
AAM based Face Tracking with Temporal Matching and Face Segmentation Mingcai Zhou 1 、 Lin Liang 2 、 Jian Sun 2 、 Yangsheng Wang 1 1 Institute of Automation.
Unsupervised Salience Learning for Person Re-identification
Deformation Modeling for Robust 3D Face Matching Xioguang Lu and Anil K. Jain Dept. of Computer Science & Engineering Michigan State University.
A Hierarchical Deep Temporal Model for Group Activity Recognition
Naifan Zhuang, Jun Ye, Kien A. Hua
Motion and optical flow
Unsupervised Learning of Video Representations using LSTMs
Robust and Fast Collaborative Tracking with Two Stage Sparse Optimization Authors: Baiyang Liu, Lin Yang, Junzhou Huang, Peter Meer, Leiguang Gong and.
ALADDIN A Locality Aligned Deep Model for Instance Search
Guillaume-Alexandre Bilodeau
Gait Analysis for Human Identification (GAHI)
Video Google: Text Retrieval Approach to Object Matching in Videos
Globally Optimal Generalized Maximum Multi Clique Problem (GMMCP) using Python code for Pedestrian Object Tracking By Beni Mulyana.
Two-Stream Convolutional Networks for Action Recognition in Videos
Video understanding using part based object detection models
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Human Activity Analysis
Part-based visual tracking with online latent structural learning -Rui Yao et al. ICCV 2013 Cvlab Jung ilchae.
CNN-based Action Recognition Using Adaptive Multiscale Depth Motion Maps And Stable Joint Distance Maps Junyou He, Hailun Xia, Chunyan Feng, Yunfei Chu.
Video Google: Text Retrieval Approach to Object Matching in Videos
Related Work in Camera Network Tracking
Recorder Visual Identification Test
Range Likelihood Tree: A Compact and Effective Representation for Visual Exploration of Uncertain Data Sets Ohio State University (Shen) Problem: Uncertainty.
Spatially Supervised Recurrent Neural Networks for Visual Object Tracking Authors: Guanghan Ning, Zhi Zhang, Chen Huang, Xiaobo Ren, Haohong Wang, Canhui.
Deep Object Co-Segmentation
Self-Supervised Cross-View Action Synthesis
Week 7 Presentation Ngoc Ta Aidean Sharghi
What and How Well You Performed
Presentation transcript:

Kan Liu, Bingpeng Ma, Wei Zhang, Rui Huang A Spatio-Temporal Appearance Representation for Video-based Pedestrian Re-Identification Kan Liu, Bingpeng Ma, Wei Zhang, Rui Huang ICCV 2015 Submission ID: 1276 This paper is about a spatio-temporal appearance representation for video-based pedestrian re-identification.

Pedestrian re-identification tasks Given a video sequence of a person, the goal of the pedestrian re-identification is to match a query against the gallery set.

Pedestrian re-identification datasets Pedestrian re-id is a difficult problem due to the large variations in a person’s appearance caused by different poses and viewpoints, illumination changes, and occlusions. The spatial and temporal alignment plays an important role in this case. Occlusions Poses and Viewpoints Illumination

Spatio-temporal Representation We takes the video of a walking person as input and builds a spatio-temporal appearance representation for pedestrian re-identification. …

Walking cycles extraction Given a video sequence of a walking person, we first extract the individual walking cycles.

Walking cycles extraction For each walking cycle, we divide the chunk of video data both spatially and temporally.

Temporal Segmentation Spatial Segmentation We obtain multiple video blobs based on the spatial and temporal segmentation, and each video blob is a small chunk of data corresponding to a certain action primitive of a certain body part. We call it body-action unit. Body-action Unit

Fisher Vectors Concatenate … … … Based on the spatio-temporally meaningful body-action units, we train visual vocabularies and extract Fisher vectors. We then concatenate the Fisher vectors extracted from all the body-action units to form a fixed-length feature vector to represent the appearance of a walking person. Concatenate …

Pedestrian re-identification tasks … Finally we compare the appearance representations extracted from each video and match the query sequence to the right one. … …