Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/18/10.

Slides:



Advertisements
Similar presentations
Statistical Learning of Multi-View Face Detection
Advertisements

Detecting Faces in Images: A Survey
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Face detection Behold a state-of-the-art face detector! (Courtesy Boris Babenko)Boris Babenko.
Recap: Advanced Feature Encoding Bag of Visual Words is only about counting the number of local descriptors assigned to each Voronoi region (0 th order.
Crash Course on Machine Learning Part IV Several slides from Derek Hoiem, and Ben Taskar.
Lecture 31: Modern object recognition
AdaBoost & Its Applications
Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/10/12.
Face detection Many slides adapted from P. Viola.
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei Li,
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
The Viola/Jones Face Detector Prepared with figures taken from “Robust real-time object detection” CRL 2001/01, February 2001.
The Viola/Jones Face Detector (2001)
Object Category Detection: Statistical Templates Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/14/15.
HCI Final Project Robust Real Time Face Detection Paul Viola, Michael Jones, Robust Real-Time Face Detetion, International Journal of Computer Vision,
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
Face detection and recognition Many slides adapted from K. Grauman and D. Lowe.
A Robust Real Time Face Detection. Outline  AdaBoost – Learning Algorithm  Face Detection in real life  Using AdaBoost for Face Detection  Improvements.
A Robust Real Time Face Detection. Outline  AdaBoost – Learning Algorithm  Face Detection in real life  Using AdaBoost for Face Detection  Improvements.
Robust Real-Time Object Detection Paul Viola & Michael Jones.
Object Detection Using the Statistics of Parts Henry Schneiderman Takeo Kanade Presented by : Sameer Shirdhonkar December 11, 2003.
Face Detection and Recognition
Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.
Face Detection CSE 576. Face detection State-of-the-art face detection demo (Courtesy Boris Babenko)Boris Babenko.
FACE DETECTION AND RECOGNITION By: Paranjith Singh Lohiya Ravi Babu Lavu.
Kullback-Leibler Boosting Ce Liu, Hueng-Yeung Shum Microsoft Research Asia CVPR 2003 Presented by Derek Hoiem.
Face Detection using the Viola-Jones Method
Recognition using Boosting Modified from various sources including
Object Detection Using the Statistics of Parts Presented by Nicholas Chan – Advanced Perception Robust Real-time Object Detection Henry Schneiderman.
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Sign Classification Boosted Cascade of Classifiers using University of Southern California Thang Dinh Eunyoung Kim
Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.
Face detection Slides adapted Grauman & Liebe’s tutorial
Visual Object Recognition
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Robust real-time face detection Paul A. Viola and Michael J. Jones Intl. J. Computer Vision 57(2), 137–154, 2004 (originally in CVPR’2001) (slides adapted.
Recognition II Ali Farhadi. We have talked about Nearest Neighbor Naïve Bayes Logistic Regression Boosting.
A Statistical Method for 3D Object Detection Applied to Face and Cars CVPR 2000 Henry Schneiderman and Takeo Kanade Robotics Institute, Carnegie Mellon.
Tony Jebara, Columbia University Advanced Machine Learning & Perception Instructor: Tony Jebara.
Face Detection Ying Wu Electrical and Computer Engineering Northwestern University, Evanston, IL
Project 3 Results.
Robust Real Time Face Detection
Adaboost and Object Detection Xu and Arun. Principle of Adaboost Three cobblers with their wits combined equal Zhuge Liang the master mind. Failure is.
HCI/ComS 575X: Computational Perception Instructor: Alexander Stoytchev
Lecture 09 03/01/2012 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
The Viola/Jones Face Detector A “paradigmatic” method for real-time object detection Training is slow, but detection is very fast Key ideas Integral images.
Bibek Jang Karki. Outline Integral Image Representation of image in summation format AdaBoost Ranking of features Combining best features to form strong.
CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.
Object Detection Overview Viola-Jones Dalal-Triggs Deformable models Deep learning.
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
FACE DETECTION : AMIT BHAMARE. WHAT IS FACE DETECTION ? Face detection is computer based technology which detect the face in digital image. Trivial task.
CS-498 Computer Vision Week 9, Class 2 and Week 10, Class 1
Notes on HW 1 grading I gave full credit as long as you gave a description, confusion matrix, and working code Many people’s descriptions were quite short.
A Brief Introduction on Face Detection Mei-Chen Yeh 04/06/2010 P. Viola and M. J. Jones, Robust Real-Time Face Detection, IJCV 2004.
Project Overview CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.
Statistical Template-Based Object Detection A Statistical Method for 3D Object Detection Applied to Faces and Cars Henry Schneiderman and Takeo Kanade.
Face detection Many slides adapted from P. Viola.
Reading: R. Schapire, A brief introduction to boosting
Cascade for Fast Detection
Session 7: Face Detection (cont.)
Presented by Minh Hoai Nguyen Date: 28 March 2007
Lit part of blue dress and shadowed part of white dress are the same color
Recap: Advanced Feature Encoding
Object detection as supervised classification
Learning to Detect Faces Rapidly and Robustly
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei.
Lecture 29: Face Detection Revisited
Presentation transcript:

Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/18/10

Goal: Detect all instances of objects

Influential Works in Detection Sung-Poggio (1994, 1998) : ~1450 citations – Basic idea of statistical template detection (I think), bootstrapping to get “face-like” negative examples, multiple whole-face prototypes (in 1994) Rowley-Baluja-Kanade ( ) : ~2900 – “Parts” at fixed position, non-maxima suppression, simple cascade, rotation, pretty good accuracy, fast Schneiderman-Kanade ( ,2004) : ~1250 – Careful feature engineering, excellent results, cascade Viola-Jones (2001, 2004) : ~6500 – Haar-like features, Adaboost as feature selection, hyper-cascade, very fast, easy to implement Dalal-Triggs (2005) : 1025 – Careful feature engineering, excellent results, HOG feature, online code Felzenszwalb-McAllester-Ramanan (2008)? 105 citations – Excellent template/parts-based blend

Sliding window detection … …

What the Detector Sees

Statistical Template Object model = log linear model of parts at fixed positions = = 10.5 > 7.5 ? ? Non-object Object

Design challenges Part design – How to model appearance – Which “parts” to include – How to set part likelihoods How to make it fast How to deal with different viewpoints Implementation details – Window size – Aspect ratio – Translation/scale step size – Non-maxima suppression

Schneiderman and Kanade Schneiderman and Kanade. A Statistical Method for 3D Object Detection. (2000)A Statistical Method for 3D Object Detection

Schneiderman and Kanade Decision function:

Parts model Part = group of wavelet coefficients that are statistically dependent

Parts: groups of wavelet coefficients Fixed parts within/across subbands 17 types of “parts” that can appear at each position Discretize wavelet coefficient to 3 values E.g., part with 8 coefficients has 3^8 = 6561 values

Part Likelihood Class-conditional likelihood ratio Estimate P(part|object) and P(part | non- object) by counting over examples Adaboost tunes weights discriminatively

Training 1)Create training data a)Get positive and negative patches b)Pre-process (optional), compute wavelet coefficients, discretize c)Compute parts values 2)Learn statistics a)Compute ratios of histograms by counting for positive and negative examples b)Reweight examples using Adaboost, recount, etc. 3) Get more negative examples (bootstrapping)

Training multiple viewpoints Train new detector for each viewpoint.

Testing 1)Processing: a)Lighting correction (optional) b)Compute wavelet coefficients, quantize 2)Slide window over each position/scale (2 pixels, 2 1/4 scale) a)Compute part values b)Lookup likelihood ratios c)Sum over parts d)Threshold 3)Use faster classifier to prune patches (cascade…more on this later) 4)Non-maximum suppression

Results: faces 208 images with 441 faces, 347 in profile

Results: cars

Results: faces today

Viola and Jones Fast detection through two mechanisms Viola and Jones. Rapid Object Detection using a Boosted Cascade of Simple Features (2001).Rapid Object Detection using a Boosted Cascade of Simple Features

Integral Images “Haar-like features” – Differences of sums of intensity – Thousands, computed at various positions and scales within detection window Two-rectangle featuresThree-rectangle featuresEtc. +1

Integral Images ii = cumsum(cumsum(Im, 1), 2) x, y ii(x,y) = Sum of the values in the grey region How to compute A+D-B-C? How to compute B-A?

Adaboost as feature selection Create a large pool of parts (180K) “Weak learner” = feature + threshold + parity Choose weak learner that minimizes error on the weighted training set Reweight

Adaboost

Sidebar: Adaboost

Adaboost “RealBoost” Figure from Friedman et al Important special case: h t partitions input space: alpha t

Adaboost: Immune to Overfitting? Test error Train error

Interpretations of Adaboost Additive logistic regression (Friedman et al. 2000) – LogitBoost from Collins et al does this more explicitly Margin maximization (Schapire et al. 1998) – Ratch and Warmuth 2002 do this more explicitly

Adaboost: Margin Maximizer margin Test error Train error

Interpretations of Adaboost Rosset Zhu Hastie 2004 – Early stopping is form of L1-regularization – In many cases, converges to “L1-optimal” separating hyperplane – “An interesting fundamental similarity between boosting and kernel support vector machines emerges, as both can be described as methods for regularized optimization in high-dimensional predictor space, utilizing a computational trick to make the calculation practical, and converging to margin-maximizing solutions.”

Back to recognition

Cascade for Fast Detection Examples Stage 1 H 1 (x) > t 1 ? Reject No Yes Stage 2 H 2 (x) > t 2 ? Stage N H N (x) > t N ? Yes … Pass Reject No Reject No Choose threshold for low false negative rate Fast classifiers early in cascade Slow classifiers later, but most examples don’t get there

Viola-Jones details 38 stages with 1, 10, 25, 50 … features – 6061 total used out of 180K candidates – 10 features evaluated on average Examples – 4916 positive examples – negative examples collected after each stage Scanning – Scale detector rather than image – Scale steps = 1.25, Translation 1.0*s to 1.5*s Non-max suppression: average coordinates of overlapping boxes Train 3 classifiers and take vote

Viola Jones Results MIT + CMU face dataset

Schneiderman later results Viola-Jones 2001 Roth et al Schneiderman- Kanade 2000 Schneiderman 2004

Speed: frontal face detector Schneiderman-Kanade (2000): 5 seconds Viola-Jones (2001): 15 fps

Important Ideas and Tricks Excellent results require careful feature engineering Speed = fast features (integral image) + cascade Adaboost for feature selection Bootstrapping to deal with many, many negative examples

Occlusions? A problem Objects occluded by > 50% considered “don’t care” PASCAL VOC changed this

Strengths and Weaknesses of Statistical Template Approach Strengths Works very well for non-deformable objects: faces, cars, upright pedestrians Fast detection Weaknesses Not so well for highly deformable objects Not robust to occlusion Requires lots of training data

SK vs. VJ Schneiderman-Kanade Wavelet features Log linear model via boosted histogram ratios Bootstrap training Two-stage cascade NMS: Remove overlapping weak boxes Slow but very accurate Viola-Jones Similar to Haar wavelets Log linear model via boosted stubs Bootstrap training Multistage cascade, integrated into training NMS: average coordinates of overlapping boxes Less accurate but very fast

Things to remember Excellent results require careful feature engineering Sliding window for search Features based on differences of intensity (gradient, wavelet, etc.) Boosting for feature selection (also L1-logistic regression) Integral images, cascade for speed Bootstrapping to deal with many, many negative examples Examples Stage 1 H 1 (x) > t 1 ? Reject No Yes Stage 2 H 2 (x) > t 2 ? Stage N H N (x) > t N ? Yes … Pass Reject No Reject No