Performance of Computer Vision

Slides:



Advertisements
Similar presentations
Combining Detectors for Human Hand Detection Antonio Hernández, Petia Radeva and Sergio Escalera Computer Vision Center, Universitat Autònoma de Barcelona,
Advertisements

Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Evaluating Color Descriptors for Object and Scene Recognition Koen E.A. van de Sande, Student Member, IEEE, Theo Gevers, Member, IEEE, and Cees G.M. Snoek,
Classification using intersection kernel SVMs is efficient Joint work with Subhransu Maji and Alex Berg Jitendra Malik UC Berkeley.
Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.
Lecture 31: Modern object recognition
LPP-HOG: A New Local Image Descriptor for Fast Human Detection Andy Qing Jun Wang and Ru Bo Zhang IEEE International Symposium.
Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Presenter: Hoang, Van Dung
Face detection Many slides adapted from P. Viola.
Fast intersection kernel SVMs for Realtime Object Detection
More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.
São Paulo Advanced School of Computing (SP-ASC’10). São Paulo, Brazil, July 12-17, 2010 Looking at People Using Partial Least Squares William Robson Schwartz.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Viola and Jones Object Detector Ruxandra Paun EE/CS/CNS Presentation
Student: Kylie Gorman Mentor: Yang Zhang COLOR-ATTRIBUTES- RELATED IMAGE RETRIEVAL.
Predicting Matchability - CVPR 2014 Paper -
Viewpoint Tracking for 3D Display Systems A look at the system proposed by Yusuf Bediz, Gözde Bozdağı Akar.
Fast Human Detection Using a Novel Boosted Cascading Structure With Meta Stages Yu-Ting Chen and Chu-Song Chen, Member, IEEE.
Generic object detection with deformable part-based models
Object Bank Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 4 th, 2013.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Marco Pedersoli, Jordi Gonzàlez, Xu Hu, and Xavier Roca
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
Mentor: Salman Khokhar Action Recognition in Crowds Week 7.
Pedestrian Detection and Localization
Object Recognition in Images Slides originally created by Bernd Heisele.
Online Kinect Handwritten Digit Recognition Based on Dynamic Time Warping and Support Vector Machine Journal of Information & Computational Science, 2015.
Recognition II Ali Farhadi. We have talked about Nearest Neighbor Naïve Bayes Logistic Regression Boosting.
Stable Multi-Target Tracking in Real-Time Surveillance Video
Histograms of Oriented Gradients for Human Detection(HOG)
Kylie Gorman WEEK 1-2 REVIEW. CONVERTING AN IMAGE FROM RGB TO HSV AND DISPLAY CHANNELS.
The Viola/Jones Face Detector A “paradigmatic” method for real-time object detection Training is slow, but detection is very fast Key ideas Integral images.
Human Detection Method Combining HOG and Cumulative Sum based Binary Pattern Jong Gook Ko', Jin Woo Choi', So Hee Park', Jang Hee You', ' Electronics and.
Face Alignment at 3000fps via Regressing Local Binary Features CVPR14 Shaoqing Ren, Xudong Cao, Yichen Wei, Jian Sun Presented by Sung Sil Kim.
Machine Learning for Pedestrian Detection. How does a Smart Assistance System detects Pedestrian?
Week 10 Emily Hand UNR.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #23.
9.913 Pattern Recognition for Vision Class9 - Object Detection and Recognition Bernd Heisele.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Non-separable SVM's, and non-linear classification using kernels Jakob Verbeek December 16, 2011 Course website:
Face recognition using Histograms of Oriented Gradients
Abdul Jabbar Siddiqui, Abdelhamid Mammeri, and Azzedine Boukerche
Cascade for Fast Detection
Object detection with deformable part-based models
Data Driven Attributes for Action Detection
Week III: Deep Tracking
Learning Mid-Level Features For Recognition
Lit part of blue dress and shadowed part of white dress are the same color
Introductory Seminar on Research: Fall 2017
Detecting Room Occupancy with Pi Camera
Object detection as supervised classification
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Introduction of Pedestrian Detection
A Tutorial on HOG Human Detection
An HOG-LBP Human Detector with Partial Occlusion Handling
Bilinear Classifiers for Visual Recognition
Walter J. Scheirer, Samuel E. Anthony, Ken Nakayama & David D. Cox
A Fast and Scalable Nearest Neighbor Based Classification
AHED Automatic Human Emotion Detection
Progress report 2019/1/14 PHHung.
AHED Automatic Human Emotion Detection
AHED Automatic Human Emotion Detection
Image Processing and Multi-domain Translation
Presentation transcript:

Performance of Computer Vision Amos Wang

Computer Vision with Real-Time Real-Time standard in computer vision is 30 frames per second However, there is no guarantee that the system always reach 30 frames per second.

A common computer vision pipeline for image classification K-means Clustering Feature Extraction dictionary Coding Feature Extraction Pooling Classifier Slide credit: http://ufldl.stanford.edu/eccv10-tutorial/

Flowchart Training Testing Extract Training Sample Multiple Scales Feature Extraction Sliding Window Classifier Selection Feature Extraction Validation Classify

Extract Training Sample Feature Extraction Classifier Selection Validation Extract Training Sample Dataset : Daimler-DB Positive Negative Test source : Monocular Pedestrian Detection Survey and Experiments, PAMI 2009

Feature Extract HoG ( Histogram of Orientated Gradient) Extract Training Sample Feature Extract Feature Extraction HoG ( Histogram of Orientated Gradient) Classifier Selection Validation

Classifier Selection Linear SVM Extract Training Sample Classifier Selection Feature Extraction Linear SVM Classifier Selection Validation source :Pedestrian Detection: An Evaluation of the State of the Art, PAMI 2012

Paper I Real-time pedestrian detection and tracking at nighttime for driver-assistance systems, Ge, J., Yupin L., and Gyomei T., Intelligent Transportation Systems, IEEE Transactions on 10.2 (2009). produce a detection rate of more than 90% as fast as 30 FPS on a Pentium IV 3.0-GHz

Overview of the system modules

Pedestrian classification Challenge: high performance and real-time constraint vs. the large intra class variability in the pedestrian class Solution: a tree-structured two-stage detector based on Haar-like and HOG features

Paper II Hardware architecture for high-accuracy real-time pedestrian detection with CoHOG features , Hiromoto, Masayuki, and Ryusuke Miyamoto, Computer Vision Workshops, 2009 IEEE 12th International Conference on. IEEE, 2009. the proposed architecture is implemented on a Xilinx Virtex-5 FPGA achieves real-time pedestrian detection on 38 fps 320x240 video

CoHOG features A high-dimensional feature descriptor that uses pairs of gradient orientations Each co-occurrence matrix C with an offset (x, y) can be expressed as a 64-dimensional vector fk,x,y.

Parallel calculation of histograms Ex: K × 31 co-occurrence matrices for all combinations of block k and offset (x, y) are calculated K × 31 parallelism at its maximum with this parallelization approach

Paper III Pedestrian detection at 100 frames per second, Benenson, Rodrigo, et al., Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, 2012. on a single CPU+GPU desktop machine reach 135 fps

The process of Detection Multiple Scales Generate multiple images on different scales Sliding Window Feature Extraction Classify Figure from : Pedestrian detection at 100 frames per second

The process of Detection Multiple Scales use sliding window on all locations Sliding Window Feature Extraction Classify Figure from : Pedestrian detection at 100 frames per second

The process of Detection Multiple Scales Extract HoG from current window Sliding Window Feature Extraction Classify Figure from : Pedestrian detection at 100 frames per second

The process of Detection Multiple Scales Use LinearSVM to classify Accurate Criteria Normally if >= 0.5, we consider it is correct. Sliding Window Feature Extraction Classify Figure from : Pedestrian detection at 100 frames per second

N Models, 1 image scale scales N is usually in the order of ~50 scales Need a lot of data for this method! overfitting

1 model, N image scales training a canonical scale is delicate how to decide the size? high resolution scales and blurry low scales Recompute features multiple times in runtime

1 model, N/K image scales rescale at feature space directly. HoG cannot

N/K model, 1 image scales Get approximated models quickly for example, N=50, K=10 5 models *K = 50

Let’s go back to real-time issue In order to implement a real-time, there are some possible ways as following Use real-time programming language: Ada 95, Real- Time Java and Real-Time POSIX Get help from RTOS: LynxOS, QNX, RTLinux, and LitmusRT