Loris Bazzani*, Marco Cristani*†, Alessandro Perina*, Michela Farenzena*, Vittorio Murino*† *Computer Science Department, University of Verona, Italy †Istituto.

Slides:



Advertisements
Similar presentations
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Advertisements

Zhimin CaoThe Chinese University of Hong Kong Qi YinITCS, Tsinghua University Xiaoou TangShenzhen Institutes of Advanced Technology Chinese Academy of.
RGB-D object recognition and localization with clutter and occlusions Federico Tombari, Samuele Salti, Luigi Di Stefano Computer Vision Lab – University.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.
A Novel Approach for Recognizing Auditory Events & Scenes Ashish Kapoor.
Patch to the Future: Unsupervised Visual Prediction
Yuanlu Xu Human Re-identification: A Survey.
IIIT Hyderabad Pose Invariant Palmprint Recognition Chhaya Methani and Anoop Namboodiri Centre for Visual Information Technology IIIT, Hyderabad, INDIA.
Yuanlu Xu Advisor: Prof. Liang Lin Person Re-identification by Matching Compositional Template with Cluster Sampling.
Robust Object Tracking via Sparsity-based Collaborative Model
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
Adviser : Ming-Yuan Shieh Student ID : M Student : Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
São Paulo Advanced School of Computing (SP-ASC’10). São Paulo, Brazil, July 12-17, 2010 Looking at People Using Partial Least Squares William Robson Schwartz.
Lecture 5 Template matching
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
A Study of Approaches for Object Recognition
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
A Bayesian algorithm for tracking multiple moving objects in outdoor surveillance video Department of Electrical Engineering and Computer Science The University.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Student: Kylie Gorman Mentor: Yang Zhang COLOR-ATTRIBUTES- RELATED IMAGE RETRIEVAL.
REALTIME OBJECT-OF-INTEREST TRACKING BY LEARNING COMPOSITE PATCH-BASED TEMPLATES Yuanlu Xu, Hongfei Zhou, Qing Wang*, Liang Lin Sun Yat-sen University,
Person-Specific Domain Adaptation with Applications to Heterogeneous Face Recognition (HFR) Presenter: Yao-Hung Tsai Dept. of Electrical Engineering, NTU.
Autonomous Learning of Object Models on Mobile Robots Xiang Li Ph.D. student supervised by Dr. Mohan Sridharan Stochastic Estimation and Autonomous Robotics.
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
Olga Zoidi, Anastasios Tefas, Member, IEEE Ioannis Pitas, Fellow, IEEE
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Shape-Based Human Detection and Segmentation via Hierarchical Part- Template Matching Zhe Lin, Member, IEEE Larry S. Davis, Fellow, IEEE IEEE TRANSACTIONS.
Loris Bazzani Marco Cristani
Characterizing activity in video shots based on salient points Nicolas Moënne-Loccoz Viper group Computer vision & multimedia laboratory University of.
A General Framework for Tracking Multiple People from a Moving Camera
Local invariant features Cordelia Schmid INRIA, Grenoble.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Loris Bazzani*, Marco Cristani*†, Vittorio Murino*† Speaker: Diego Tosato* *Computer Science Department, University of Verona, Italy †Istituto Italiano.
Marco Pedersoli, Jordi Gonzàlez, Xu Hu, and Xavier Roca
Object Detection with Discriminatively Trained Part Based Models
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
Expectation-Maximization (EM) Case Studies
Histograms of Oriented Gradients for Human Detection(HOG)
Human Re-identification by Matching Compositional Template with Cluster Sampling Yuanlu Xu 1, Liang Lin 1, Wei-Shi Zheng 1, Xiaobai Liu 2 Abstract This.
模式识别国家重点实验室 中国科学院自动化研究所 National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences Matching Tracking Sequences Across.
Human Detection Method Combining HOG and Cumulative Sum based Binary Pattern Jong Gook Ko', Jin Woo Choi', So Hee Park', Jang Hee You', ' Electronics and.
Learning Jigsaws for clustering appearance and shape John Winn, Anitha Kannan and Carsten Rother NIPS 2006.
Unsupervised Salience Learning for Person Re-identification
Visual Tracking by Cluster Analysis Arthur Pece Department of Computer Science University of Copenhagen
Matching of Objects Moving Across Disjoint Cameras Eric D. Cheng and Massimo Piccardi IEEE International Conference on Image Processing
Image features and properties. Image content representation The simplest representation of an image pattern is to list image pixels, one after the other.
Cell Segmentation in Microscopy Imagery Using a Bag of Local Bayesian Classifiers Zhaozheng Yin RI/CMU, Fall 2009.
Multi-View Discriminant Analysis 多视判别分析
Video object segmentation and its salient motion detection using adaptive background generation Kim, T.K.; Im, J.H.; Paik, J.K.;  Electronics Letters 
A. M. R. R. Bandara & L. Ranathunga
Guillaume-Alexandre Bilodeau
Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
ROBUST SUBSPACE LEARNING FOR VISION AND GRAPHICS
Gait Recognition Gökhan ŞENGÜL.
Nearest-neighbor matching to feature database
PRESENTED BY Yang Jiao Timo Ahonen, Matti Pietikainen
Tremor Detection Using Motion Filtering and SVM Bilge Soran, Jenq-Neng Hwang, Linda Shapiro, ICPR, /16/2018.
Nearest-neighbor matching to feature database
Image Segmentation Techniques
Outline S. C. Zhu, X. Liu, and Y. Wu, “Exploring Texture Ensembles by Efficient Markov Chain Monte Carlo”, IEEE Transactions On Pattern Analysis And Machine.
Outline H. Murase, and S. K. Nayar, “Visual learning and recognition of 3-D objects from appearance,” International Journal of Computer Vision, vol. 14,
Patch-Based Image Classification Using Image Epitomes
Related Work in Camera Network Tracking
Presentation transcript:

Loris Bazzani*, Marco Cristani*†, Alessandro Perina*, Michela Farenzena*, Vittorio Murino*† *Computer Science Department, University of Verona, Italy †Istituto Italiano di Tecnologia (IIT), Genova, Italy Multiple-shot Person Re- identification by HPE signature This research is founded by the EU-Project FP7 SAMURAI,grant FP7-SEC No

Analysis of the problem (1) Person Re-identification: Recognizing an individual in diverse locations over different (non-)overlapping camera views T = 222 T = 145 T = 1 T = 23 Different cameras Same camera 2

Analysis of the problem (2) We focus on the problem with non-overlapping cameras Problems in real scenarios: – Very low resolution – Severe Occlusions – Illumination variations – Pedestrians with very similar clothes – Pose and view-point changes – No geometry of the environment Solution: - Histogram Plus Epitome (HPE) descriptor, and - Multiple-shot approach 3

Outline 4 Overview of the proposed method Pre-processing: Background Subtraction “Images selection” for Multiple-shot HPE descriptor - Global descriptor - Local descriptors HPEs’ Matching Results Conclusions

Overview of the proposed method 5 Employing global and local appearance-based features Exploiting the temporal consistency to make robust the descriptor

Background Subtraction 6 We employ a novel generative model: STEL [Jojic el al. 2009] Capture the structure of an image class as a mixture of component segmentations Isolate meaningful parts that exhibit tight feature distributions Learned Mixture Components

“Images selection” for Multiple-shot 7 Objective: discard redundant information and images with occlusions Gaussian Mixture Models Clustering [Figueiredo and Jain 2002] of HSV histograms Automatic model selection employing the Bayesian Information Criterion [Figueiredo and Jain 2002] Discard the clusters with low number of instances Keep a random instance for each cluster Examples of ruled-out examples:

HPE descriptor: Global feature 8 36-dimensional HSV histogram (H=16, S=16, V=4) Average the histograms of the multiple instances Robust to illumination and pose variations, keeping the predominant chromatic information only Capture chromatic global information Caused by illumination changes

HPE descriptor: Local feature (1) 9 Epitome [Jojic el al. 2003]: generative model that analyzes the presence of recurrent, structured local patterns Generic Epitome Local Epitome Local Epitome

HPE descriptor: Local feature (2) 10 Generic Epitome : 36-dimensional HSV histogram of the Epitome Local Epitome : Keep the patches with high : probability that a patch in the epitome having (i, j) as left-upper corner represents several ingredient patches Discard the patches with low entropy Extract a 36-dimensional HSV histogram of the “survived” patches

HPEs’ Matching 11 Re-identification: associating each element in the probe set B to the corresponding element in the gallery set A Minimize the following distance where is the Bhattacharyya distance and

Results (1) 12 iLIDS dataset: - Multiple images of 119 pedestrians 128x64 pixels - Comparison with Context-based method [Zheng et al. 2009] - Cross-validation: SvsS 10 trials, MvsS/MvsM 100 trials

Results (2) 13 ETHZ dataset: - Three datasets of 83, 35 and 28 pedestrians of 64x32 pixels - Comparison with Partial Least Square (PLS) method [Schwartz and Davis 2009] - Cross-validation: Settings as for iLIDS

Results (3) 14 How many images do we need to perform a “good” person re-identification? N = Number of images for the multi-shot approach N = 5 seems to be the best trade-off

Conclusions 15 We proposed a novel descriptor for the person re- identification problem, i.e., HPE descriptor The descriptor is robust to low resolution, occlusions, illumination variations, pedestrians with very similar clothes, pose changes It is based on the accumulation of images to gain robustness Person re-identification problem is still far from being solved The results suggest that further improvements can be reached

References [Jojic el al. 2009] N. Jojic, A. Perina, M. Cristani, V. Murino, and B. Frey, “Stel component analysis: Modeling spatial correlations in image class structure,” IEEE Conference on Computer Vision and Pattern Recognition, pp. 2044– 2051, [Figueiredo and Jain 2002] M. Figueiredo and A. Jain, “Unsupervised learning of finite mixture models,” IEEE Trans. PAMI, vol. 24, no. 3, pp. 381–396, [Jojic el al. 2003] N. Jojic, B. J. Frey, and A. Kannan, “Epitomic analysis of appearance and shape,” in IEEE International Conference on Computer Vision. Washington, DC, USA: IEEE Computer Society, 2003, p. 34. [Schwartz and Davis 2009] W. Schwartz and L. Davis, “Learning discriminative appearance-based models using partial least squares,” in XXIISIBGRAPI, [Zheng et al. 2009] W. Zheng, S. Gong, and T. Xiang, “Associating groups of people,” in BMVC,