Object detection as supervised classification. CS 376 - lecture 23, Tues April 17. Kristen Grauman, UT Austin
Announcements: A4 due today. A5 out, due May 2. Exam May 10, 2-5 pm.
Last time: introduction to object categorization; window-based object detection; boosting classifiers; face detection as a case study. Today: recap of boosting + face detection; pros/cons of window-based detectors; mosaic examples; support vector machines.
See slides / handout from lecture 22
Boosting: pros and cons. Advantages of boosting: integrates classification with feature selection; complexity of training is linear in the number of training examples; flexibility in the choice of weak learners and boosting scheme; testing is fast; easy to implement. Disadvantages: needs many training examples; other discriminative models may outperform in practice (SVMs, CNNs, …), especially for many-class problems. Slide credit: Lana Lazebnik
Window-based detection: strengths. Sliding window detection and global appearance descriptors: simple detection protocol to implement; good feature choices are critical; past successes for certain classes. Slide: Kristen Grauman
Window-based detection: limitations. High computational complexity. For example: 250,000 locations x 30 orientations x 4 scales = 30,000,000 evaluations! If binary detectors are trained independently, cost also increases linearly with the number of classes. With so many windows, the false positive rate must be very low. Slide: Kristen Grauman
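To make the arithmetic above concrete, here is a minimal sketch (plain Python; the image size, window size, and stride are hypothetical) of how a sliding-window search enumerates candidate windows at one scale. The classifier call itself is omitted; the point is only how quickly the evaluation count multiplies.

```python
def sliding_windows(img_h, img_w, win_h, win_w, stride):
    """Enumerate top-left corners of all windows at a single scale."""
    for y in range(0, img_h - win_h + 1, stride):
        for x in range(0, img_w - win_w + 1, stride):
            yield (y, x)

# Windows at one scale for a hypothetical 480x640 image,
# 64x32 window, stride 8:
n_per_scale = sum(1 for _ in sliding_windows(480, 640, 64, 32, 8))

# Total evaluations multiply across locations, orientations, and scales;
# this is the slide's 250,000 x 30 x 4 = 30,000,000 blow-up.
total = 250_000 * 30 * 4
```

Each extra search dimension (orientation, scale, aspect ratio) multiplies, rather than adds to, the number of classifier evaluations, which is why per-window cost must be tiny.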
Limitations (continued). Not all objects are “box” shaped. Slide: Kristen Grauman
Limitations (continued). Non-rigid, deformable objects are not captured well by representations that assume a fixed 2D structure, or else a fixed viewpoint must be assumed. Objects with less-regular textures are not captured well by holistic appearance-based descriptions. Slide: Kristen Grauman
Limitations (continued). If windows are considered in isolation, context is lost. [Figure: full scene vs. the detector’s view of a single window.] Figure credit: Derek Hoiem. Slide: Kristen Grauman
Limitations (continued). In practice, this often entails a large, cropped training set (expensive to collect). Requiring a good match to a global appearance description can lead to sensitivity to partial occlusions. Image credit: Adam, Rivlin, & Shimshoni. Slide: Kristen Grauman
Summary so far. Basic pipeline for window-based detection: model/representation/classifier choice; sliding window and classifier scoring. Boosting classifiers: general idea. Viola-Jones face detector: exemplar of the basic paradigm, plus key ideas: rectangular features, AdaBoost for feature selection, cascade. Pros and cons of window-based detection.
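The “general idea” of boosting recapped above can be sketched in a few lines. This is a generic AdaBoost loop with decision stumps on 1-D toy features, not the Viola-Jones detector itself (no rectangular features, no cascade); the data and round count are illustrative only.

```python
import math

def stump_predict(thresh, polarity, x):
    """Weak learner: a 1-D decision stump."""
    return polarity if x >= thresh else -polarity

def train_stump(X, y, w):
    """Pick the stump with the lowest weighted error."""
    best = None
    for thresh in sorted(set(X)):
        for polarity in (1, -1):
            err = sum(wi for xi, yi, wi in zip(X, y, w)
                      if stump_predict(thresh, polarity, xi) != yi)
            if best is None or err < best[0]:
                best = (err, thresh, polarity)
    return best

def adaboost(X, y, n_rounds=5):
    n = len(X)
    w = [1.0 / n] * n                      # uniform example weights
    ensemble = []
    for _ in range(n_rounds):
        err, thresh, polarity = train_stump(X, y, w)
        err = max(err, 1e-10)              # avoid log(0) on perfect stumps
        alpha = 0.5 * math.log((1 - err) / err)
        # Upweight the examples this stump got wrong:
        w = [wi * math.exp(-alpha * yi * stump_predict(thresh, polarity, xi))
             for xi, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
        ensemble.append((alpha, thresh, polarity))
    return ensemble

def predict(ensemble, x):
    """Weighted vote of the weak learners."""
    score = sum(a * stump_predict(t, p, x) for a, t, p in ensemble)
    return 1 if score >= 0 else -1
```

In Viola-Jones, each “stump” is instead a threshold on one rectangular filter response, so each boosting round doubles as feature selection.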
Object proposals. Main idea: learn to generate category-independent regions/boxes that have object-like properties, and let the object detector search over “proposals” rather than exhaustive sliding windows. Alexe et al., Measuring the objectness of image windows, PAMI 2012
Object proposals: multi-scale saliency and color contrast cues. Alexe et al., Measuring the objectness of image windows, PAMI 2012
Object proposals: edge density and superpixel straddling cues. Alexe et al., Measuring the objectness of image windows, PAMI 2012
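As a rough illustration of the edge-density cue, the sketch below scores a candidate box by the fraction of edge pixels it contains. This is a simplification for intuition only: Alexe et al.'s actual cue measures edge density near the box's inner border, and the binary edge map here is a made-up toy.

```python
def edge_density(edge_map, box):
    """Fraction of edge pixels inside box = (y0, x0, y1, x1),
    exclusive at the bottom-right corner."""
    y0, x0, y1, x1 = box
    area = (y1 - y0) * (x1 - x0)
    n_edges = sum(edge_map[y][x]
                  for y in range(y0, y1) for x in range(x0, x1))
    return n_edges / area

# Toy 10x10 binary edge map with a dense 4x4 blob of edge pixels:
edge_map = [[1 if 3 <= y < 7 and 3 <= x < 7 else 0 for x in range(10)]
            for y in range(10)]

tight = edge_density(edge_map, (3, 3, 7, 7))    # box hugging the blob
loose = edge_density(edge_map, (0, 0, 10, 10))  # whole image
```

A box that tightly fits the object scores higher than a loose one, which is the behavior an objectness cue should reward.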
Object proposals: more proposal examples. Alexe et al., Measuring the objectness of image windows, PAMI 2012
Region-based object proposals. J. Carreira and C. Sminchisescu, CPMC: Automatic object segmentation using constrained parametric min-cuts, PAMI 2012.
Mosaic examples
Window-based models: three case studies. SVM + person detection (e.g., Dalal & Triggs). Boosting + face detection (Viola & Jones). NN + scene Gist classification (e.g., Hays & Efros).
Linear classifiers
Linear classifiers. Find a linear function to separate the positive and negative examples. Which line is best?
Support Vector Machines (SVMs). Discriminative classifier based on the optimal separating line (in the 2D case). Maximize the margin between the positive and negative training examples.
Support vector machines. Want the line that maximizes the margin. Decision boundary: w·x + b = 0; margin boundaries: w·x + b = 1 and w·x + b = -1. For support vectors, w·x + b = ±1. C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery, 1998
Support vector machines. Want the line that maximizes the margin. For support vectors, w·x + b = ±1. Distance between a point x and the line: |w·x + b| / ||w||. For support vectors, this distance is 1 / ||w||.
Support vector machines. Want the line that maximizes the margin. Distance between a point x and the line: |w·x + b| / ||w||. Each support vector is at distance 1 / ||w|| from the boundary, so the margin M is 2 / ||w||.
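A quick numeric check of the distance and margin formulas, with a hypothetical weight vector chosen so the numbers come out clean:

```python
import math

w = [3.0, 4.0]   # ||w|| = 5
b = -5.0

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def norm(u):
    return math.sqrt(dot(u, u))

def signed_distance(x):
    """Signed distance from point x to the line w·x + b = 0."""
    return (dot(w, x) + b) / norm(w)

# A support vector lies on w·x + b = 1; e.g. x = (2, 0): 3*2 + 4*0 - 5 = 1.
x_sv = [2.0, 0.0]

# Its distance to the boundary is 1/||w|| = 0.2, so:
margin = 2 / norm(w)   # 0.4
```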
Finding the maximum margin line. Maximize the margin 2/||w|| while correctly classifying all training data points: yi(w·xi + b) ≥ 1. Equivalent quadratic optimization problem: minimize (1/2)||w||² subject to yi(w·xi + b) ≥ 1. C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery, 1998
Finding the maximum margin line. Solution: w = Σi αi yi xi, where the learned weights αi are nonzero only for the support vectors. C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery, 1998
Finding the maximum margin line. Solution: b = yi - w·xi (for any support vector). Classification function: f(x) = sign(w·x + b) = sign(Σi αi yi xi·x + b). If f(x) < 0, classify as negative; if f(x) > 0, classify as positive. C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery, 1998
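Putting the solution formulas together on a toy 1-D problem. The support vectors and α values below are hypothetical, chosen by hand so that the constraints yi(w·xi + b) = 1 hold exactly:

```python
# Toy 1-D example: support vectors at x = 1 (y = -1) and x = 3 (y = +1),
# each with alpha = 0.5 (hand-picked illustrative values).
support = [(1.0, -1, 0.5), (3.0, +1, 0.5)]   # (x_i, y_i, alpha_i)

# w = sum_i alpha_i * y_i * x_i
w = sum(a * y * x for x, y, a in support)    # = 1.0

# b = y_i - w * x_i, using any support vector:
b = support[0][1] - w * support[0][0]        # = -2.0

def f(x):
    """Classification function f(x) = sign(w*x + b)."""
    score = w * x + b
    return 1 if score > 0 else -1
```

The resulting boundary w·x + b = 0 sits at x = 2, midway between the two support vectors, as the maximum-margin criterion requires.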
Person detection with HoGs & linear SVMs. Histogram of oriented gradients (HoG): map each grid cell in the input window to a histogram counting the gradients per orientation. Train a linear SVM using a training set of pedestrian vs. non-pedestrian windows. Dalal & Triggs, CVPR 2005
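The per-cell histogram step can be sketched as follows. This is a deliberately simplified version of the HoG idea, not Dalal & Triggs' full descriptor: it omits bilinear vote interpolation and block normalization, and the toy patch is made up.

```python
import math

def cell_hog(patch, n_bins=9):
    """Orientation histogram of gradient magnitudes for one cell.

    patch: 2-D list of pixel intensities. Gradients via central
    differences; unsigned orientations binned over [0, 180) degrees.
    Simplified sketch: no vote interpolation, no block normalization.
    """
    h, w = len(patch), len(patch[0])
    hist = [0.0] * n_bins
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = patch[y][x + 1] - patch[y][x - 1]
            gy = patch[y + 1][x] - patch[y - 1][x]
            mag = math.hypot(gx, gy)
            ang = math.degrees(math.atan2(gy, gx)) % 180.0
            hist[int(ang / 180.0 * n_bins) % n_bins] += mag
    return hist

# A vertical step edge: every interior gradient is horizontal (gx only),
# so all the vote mass should land in the 0-degree bin.
patch = [[0, 0, 1, 1]] * 4
hist = cell_hog(patch)
```

Concatenating such histograms over all cells of the window gives the feature vector that the linear SVM scores.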
Person detection with HoGs & linear SVMs. Histograms of Oriented Gradients for Human Detection, Navneet Dalal and Bill Triggs, International Conference on Computer Vision & Pattern Recognition, June 2005. http://lear.inrialpes.fr/pubs/2005/DT05/
Summary. Object recognition as a classification task. Boosting (face detection example). Support vector machines and HOG (person detection example). Sliding window search paradigm: pros and cons; speed-up with the attentional cascade. Object proposals / proposal regions as an alternative.
Next time What if the data are not linearly separable? What about the multi-class case? Nearest neighbors Convolutional neural networks