Reconstruction-free Inference on Compressive Measurements Suhas Lohit, Kuldeep Kulkarni, Pavan Turaga, Jian Wang, Aswin Sankaranarayanan Arizona State.

Slides:



Advertisements
Similar presentations
Automatic Image Annotation Using Group Sparsity
Advertisements

Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Evaluating Color Descriptors for Object and Scene Recognition Koen E.A. van de Sande, Student Member, IEEE, Theo Gevers, Member, IEEE, and Cees G.M. Snoek,
CS-MUVI Video compressive sensing for spatial multiplexing cameras Aswin Sankaranarayanan, Christoph Studer, Richard G. Baraniuk Rice University.
SVM—Support Vector Machines
Machine learning continued Image source:
Face Alignment with Part-Based Modeling
Lecture 31: Modern object recognition
LPP-HOG: A New Local Image Descriptor for Fast Human Detection Andy Qing Jun Wang and Ru Bo Zhang IEEE International Symposium.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Mean transform, a tutorial KH Wong mean transform v.5a1.
Compressed sensing Carlos Becker, Guillaume Lemaître & Peter Rennert
São Paulo Advanced School of Computing (SP-ASC’10). São Paulo, Brazil, July 12-17, 2010 Looking at People Using Partial Least Squares William Robson Schwartz.
Discriminative and generative methods for bags of features
A Study of Approaches for Object Recognition
Pattern Recognition Topic 1: Principle Component Analysis Shapiro chap
Scale Invariant Feature Transform (SIFT)
5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Generic object detection with deformable part-based models
An Introduction to Support Vector Machines Martin Law.
Compressive Sampling: A Brief Overview
Dimensionality Reduction: Principal Components Analysis Optional Reading: Smith, A Tutorial on Principal Components Analysis (linked to class webpage)
Person-Specific Domain Adaptation with Applications to Heterogeneous Face Recognition (HFR) Presenter: Yao-Hung Tsai Dept. of Electrical Engineering, NTU.
Richard Baraniuk Chinmay Hegde Marco Duarte Mark Davenport Rice University Michael Wakin University of Michigan Compressive Learning and Inference.
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Zhengyou Zhang Microsoft Research Digital Object Identifier: /MMUL Publication Year: 2012, Page(s): Professor: Yih-Ran Sheu Student.
Introduction to Computer Vision Olac Fuentes Computer Science Department University of Texas at El Paso El Paso, TX, U.S.A.
Object Bank Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 4 th, 2013.
 Karthik Gurumoorthy  Ajit Rajwade  Arunava Banerjee  Anand Rangarajan Department of CISE University of Florida 1.
An Information Fusion Approach for Multiview Feature Tracking Esra Ataer-Cansizoglu and Margrit Betke ) Image and.
Object Detection with Discriminatively Trained Part Based Models
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Compressive Sensing for Multimedia Communications in Wireless Sensor Networks By: Wael BarakatRabih Saliba EE381K-14 MDDSP Literary Survey Presentation.
Pedestrian Detection and Localization
Object Recognition in Images Slides originally created by Bernd Heisele.
An Introduction to Support Vector Machines (M. Law)
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
EE381K-14 MDDSP Literary Survey Presentation March 4th, 2008
Face Detection Using Large Margin Classifiers Ming-Hsuan Yang Dan Roth Narendra Ahuja Presented by Kiang “Sean” Zhou Beckman Institute University of Illinois.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Design of PCA and SVM based face recognition system for intelligent robots Department of Electrical Engineering, Southern Taiwan University, Tainan County,
A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER
Human Detection Method Combining HOG and Cumulative Sum based Binary Pattern Jong Gook Ko', Jin Woo Choi', So Hee Park', Jang Hee You', ' Electronics and.
Face Detection Using Neural Network By Kamaljeet Verma ( ) Akshay Ukey ( )
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
2D-LDA: A statistical linear discriminant analysis for image matrix
More sliding window detection: Discriminative part-based models
3D Face Recognition Using Range Images Literature Survey Joonsoo Lee 3/10/05.
Next, this study employed SVM to classify the emotion label for each EEG segment. The basic idea is to project input data onto a higher dimensional feature.
WLD: A Robust Local Image Descriptor Jie Chen, Shiguang Shan, Chu He, Guoying Zhao, Matti Pietikäinen, Xilin Chen, Wen Gao 报告人:蒲薇榄.
Evaluation of Gender Classification Methods with Automatically Detected and Aligned Faces Speaker: Po-Kai Shen Advisor: Tsai-Rong Chang Date: 2010/6/14.
Face Detection 蔡宇軒.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Deeply learned face representations are sparse, selective, and robust
Radon Transform Imaging
Guillaume-Alexandre Bilodeau
Bag-of-Visual-Words Based Feature Extraction
Performance of Computer Vision
PRESENTED BY Yang Jiao Timo Ahonen, Matti Pietikainen
Face Recognition and Feature Subspaces
Object detection as supervised classification
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Mean transform , a tutorial
Random feature for sparse signal classification
AHED Automatic Human Emotion Detection
AHED Automatic Human Emotion Detection
Presentation transcript:

Reconstruction-free Inference on Compressive Measurements Suhas Lohit, Kuldeep Kulkarni, Pavan Turaga, Jian Wang, Aswin Sankaranarayanan Arizona State University Carnegie Mellon University

Inference in the Traditional Sensing Framework Feature Extraction Inference Traditional computer vision pipeline 2 Camera with a CCD/CMOS array * N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, 2005 HOG descriptor*

Compressive Imaging – Single Pixel Camera * dsp.rice.edu/cscamera 3 *

Inference on Compressive Measurements Compressive Measurements from SPC Reconstruction Feature Extraction Inference Reconstruct-then-infer paradigm Major bottleneck 4

Reconstruction algorithms suffer from drawbacks Computationally expensive Do not produce good results at high compression ratios. Various parameters such as sparsity level and sparsifying basis need to be known a priori.

Is Reconstruction Necessary for Inference? We posit that one can build effective inference algorithms directly on the compressed bits. Compressive measurements Reconstruction Feature Extraction Inference Bypass reconstruction? Reconstruction-free compressive inference 6

Basis for direct feature extraction [1] W.B. Johnson and J. Lindenstrauss. "Extensions of Lipschitz mappings into a Hilbert space." Contemporary mathematics (1984): 1. [2] J. Romberg, M. Wakin, "Compressed Sensing: A Tutorial", IEEE Statistical Signal Processing Workshop, [2] 7

Basis for direct feature extraction This fact is employed in the design of the smashed filter for compressive classification by Davenport et al. (2007) [3]. We extend this idea to construct smashed correlation filters which are more robust to input variations. As a consequence of the JL lemma, inner products and hence, correlations between vectors are also preserved after mapping to a lower dimension. 8 [3] M. A. Davenport, M. F. Duarte, M. B. Wakin, J. N. Laska, D. Takhar, K. F. Kelly,and R. G. Baraniuk. “The smashed filter for compressive classification and target recognition.” In Electronic Imaging, pages 64980H–64980H. International Society for Optics and Photonics, 2007.

Correlation Filters for Visual Recognition 9

10

Recognition with Correlation Filters – Oracle Sensing SVM Class label Extracted Feature 11

Smashed Correlation Filters for Compressive Recognition SVM Class label Extracted Feature 12 Using a corollary of the JL lemma, we can write

Experiments Controlled experiments 13 1.AMP Database [4] 13 subjects with 75 images per subject. 25 for training, 50 for testing. 64 x 64 images. [4]

Results on AMP Database (64 x 64 images) At low noise + high compression ratios; accuracy is comparable to oracle sensing. Hadamard measurements are more robust to noise. 14

Experiments Controlled experiments 15 2.NIR Database [5] Near infrared images. 197 subjects with 20 images per subject. 10 for training, 10 for testing. 640 x 480 images resized to 256 x 256. [5] S. Z. Li, R. Chu, S. Liao, and L. Zhang. Illumination invariant face recognition using near-infrared images. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 29(4):627–639, 2007.

Results on NIR Database (256 x 256 images) At low noise + high compression ratios; accuracy is comparable to oracle sensing. Hadamard measurements are more robust to noise at high compression ratio. 16

Experiments on Single Pixel Camera New dataset using the SPC consists of CS measurements of 120 facial images of size 128 x 128. Images belong to 30 subjects with 4 images per subject. Images are captured using a DMD with resolution of 1024 x 768 and operating speed of 22,700 measurements per second. Hadamard matrix was used for sensing. 17 DMD photodetector objective lens relay lens

Experiments on Single Pixel Camera Compression Ratio No. of measurements Recognition Accuracy (%) 1 (Oracle) Face Recognition Results Four training-testing splits were created using the database. Each split containing 3 training images and 1 testing image. Face recognition accuracy was computed as the average over all the splits. 18

Benefits of compressive acquisition in IR IR imaging is provides illumination invariance. Non-visible wavelength sensors are expensive. Compressive imaging -- e.g. SPC -- provides a cost-effective alternative. FLIR T620 IR camera 640 x 480 pixels Price : $ 21,000 19

Conclusions and Future Work We have shown that compressive sensing technology can be employed in a practical inference application by extracting features from compressive measurements directly, using smashed correlation filters, thus avoiding reconstruction. Points to new avenues of research for understanding how to solve high-level computer vision problems from computational imagers in general. 20

References 1.W.B. Johnson, and J. Lindenstrauss. "Extensions of Lipschitz mappings into a Hilbert space." Contemporary mathematics (1984): 1. 2.J. Romberg, M. Wakin, "Compressed Sensing: A Tutorial", IEEE Statistical Signal Processing Workshop, M. A. Davenport, M. F. Duarte, M. B. Wakin, J. N. Laska, D. Takhar, K. F. Kelly,and R. G. Baraniuk. “The smashed filter for compressive classification and target recognition.” Electronic Imaging, pages 64980H–64980H. International Society for Optics and Photonics, AMP Database, 5.S. Z. Li, R. Chu, S. Liao, and L. Zhang. “Illumination invariant face recognition using near-infrared images.” Pattern Analysis and Machine Intelligence, IEEE Transactions on, 29(4):627–639, Thank you Supported by ONR and NSF

Preserving Inner Products – Proof Sketch Proving the other direction along the same lines gives us the required result. ∀ u, v ∈ Q and ∥ u ∥ = 1, ∥ v ∥ = 1, Using JL lemma, =

Compressive Sensing for Inference Benefits of IR imaging for inference – illumination invariance Color camera Near-infrared imager S. Z. Li, R. Chu, S. Liao, and L. Zhang. “Illumination invariant face recognition using near-infrared images.” Pattern Analysis and Machine Intelligence, IEEE Transactions on, 29(4):627–639,

Compressive Sensing for Inference Benefits of IR imaging for inference – illumination invariance Color camera Long-wave IR imager Reviews, Refinements and New Ideas in Face Recognition, July

Acquisition Time Can Be Significantly Reduced Our SPC senses 22.7 measurements/second For a 128 x 128 image, sensing time is Sensing Mechanism% Measurements RequiredSensing Time Sensing all measurements100% (16384)0.72 s Compressive sensing with recovery20%~ 0.1 s Compressive sensing without reconstruction1%~ s

Contributions We show that simple linear features using smashed correlation filters can be extracted without reconstruction and can be used reliably for high level inference. It is shown through experiments in face recognition that it is indeed possible to perform compressive inference with these features with only a marginal loss in accuracy compared to traditional (oracle) sensing. 27

Maximum Margin Correlation Filters Developed by Rodriguez et al. (2013) Combines the strengths of correlation filters and SVM. Optimize for High peak at target location. Maximum margin (SVM objective) Solve optimization problem of the form: Can be reduced to a single optimization problem that can be solved on any standard SVM solver. A. Rodriguez, V. N. Boddeti, B. V. Kumar, and A. Mahalanobis. Maximum margin correlation filter: A new approach for localization and classification. IEEE Transactions on Image Processing, 22(2):631–643,

Feature extraction from correlation planes Each correlation plane is divided into non-overlapping blocks and the PSR and peak values of each block is extracted. The peak and PSR for the entire correlation plane are also extracted. All these values are concatenated to form the feature vector that is input to the SVMs.