Ivan Laptev IRISA/INRIA, Rennes, France September 07, 2006 Boosted Histograms for Improved Object Detection.


Histograms for object recognition
Remarkable success of recognition methods using histograms of local image measurements:
  [Swain & Ballard 1991] - Color histograms
  [Schiele & Crowley 1996] - Receptive field histograms
  [Lowe 1999] - Localized orientation histograms (SIFT)
  [Schneiderman & Kanade 2000] - Localized histograms of wavelet coefficients
  [Leung & Malik 2001] - Texton histograms
  [Belongie et al. 2002] - Shape context
  [Dalal & Triggs 2005] - Dense orientation histograms
Likely explanation: histograms are robust to image variations such as limited geometric transformations and object class variability.

Histograms: What vs. Where
What to measure?
  Color [SB91]
  Gaussian derivatives [SC96]
  Wavelet coefficients [SK00]
  Textons [LM01]
  Gradient orientation [L99, DT05]
Where to measure?
  Whole image [SB91, SC96]
  Pre-defined grid [SK00, BMP02, DT05]
  Key points [L99]
No guarantee for optimal recognition
Different regions (A, B, C, D) may have different discriminative power

Idea
AdaBoost:
  Efficient discriminative classifier [Freund & Schapire '97]
  Good performance for face detection [Viola & Jones '01]
Diagram labels: boosting, selected features, weak classifier; Haar features, histogram features
SVM, Neural Networks: too heavy
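
The selection step named on this slide can be sketched as discrete AdaBoost over a fixed pool of candidate weak classifiers. This is a minimal illustration and not the presenter's implementation: the function names and the representation of candidates as precomputed +1/-1 outputs are assumptions made for the example.

```python
import math

def adaboost_select(predictions, labels, rounds):
    """Discrete AdaBoost over a fixed pool of candidate weak classifiers.

    predictions[j][i] is the +1/-1 output of candidate j on sample i;
    labels[i] is the +1/-1 ground truth. Returns a list of (index, alpha).
    """
    n = len(labels)
    weights = [1.0 / n] * n          # start from uniform sample weights
    selected = []
    for _ in range(rounds):
        # Weighted error of every candidate under the current weights.
        errors = [sum(w for w, p, y in zip(weights, preds, labels) if p != y)
                  for preds in predictions]
        best = min(range(len(errors)), key=errors.__getitem__)
        err = min(max(errors[best], 1e-10), 1 - 1e-10)   # avoid log(0)
        alpha = 0.5 * math.log((1 - err) / err)
        selected.append((best, alpha))
        # Raise the weight of misclassified samples, then renormalise.
        weights = [w * math.exp(-alpha * y * p)
                   for w, p, y in zip(weights, predictions[best], labels)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return selected

def strong_classify(selected, predictions, i):
    """Sign of the alpha-weighted vote of the selected weak classifiers."""
    score = sum(alpha * predictions[j][i] for j, alpha in selected)
    return 1 if score >= 0 else -1
```

Each round picks the candidate with the lowest weighted error, so the returned indices play the role of the "selected features" on the slide.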

Weak learner
Possible approach: 1-dimensional projections onto predefined vectors (Example 1, Example 2)
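
A weak learner of this kind can be sketched as a projection followed by a threshold chosen to minimise the weighted error. The names `project` and `fit_threshold` are hypothetical, and using a unit vector that picks out a single bin is only our reading of the "1-bin" learner mentioned on a later slide.

```python
def project(histogram, direction):
    """1-D projection of a histogram onto a predefined vector."""
    return sum(h * d for h, d in zip(histogram, direction))

def fit_threshold(values, labels, weights):
    """Pick the threshold and polarity with minimum weighted error.

    The classifier predicts `polarity` if value > threshold, else -polarity.
    Returns (weighted_error, threshold, polarity).
    """
    order = sorted(set(values))
    # Candidates: one threshold below all values, then midpoints of neighbours.
    candidates = [order[0] - 1.0] + [(a + b) / 2 for a, b in zip(order, order[1:])]
    best = None
    for t in candidates:
        for polarity in (1, -1):
            err = sum(w for v, y, w in zip(values, labels, weights)
                      if (polarity if v > t else -polarity) != y)
            if best is None or err < best[0]:
                best = (err, t, polarity)
    return best

# Usage: a unit vector selects the first bin of each 4-bin histogram.
histograms = [[0.7, 0.1, 0.1, 0.1], [0.6, 0.2, 0.1, 0.1],
              [0.1, 0.3, 0.3, 0.3], [0.2, 0.2, 0.3, 0.3]]
values = [project(h, [1, 0, 0, 0]) for h in histograms]
error, threshold, polarity = fit_threshold(values, [1, 1, -1, -1], [0.25] * 4)
```

Because the error is computed from per-sample weights, the same routine can be reused in every boosting round with the updated weights.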

Fisher weak learner
Alternative approach:
  Assume a Normal distribution of features (hopefully valid for at least some of the ~10^5 features!)
  Compute the projection direction by FLD, from the feature mean and feature covariance
  Can be modified to minimize the error of weighted samples (required for boosting)
Evidence from real image training data: Fisher learner vs. "1-bin" learner
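
For reference, the textbook Fisher linear discriminant with per-sample weights can be written as follows. This is the standard form and only an assumption about what the slide showed. The symbols d_i (boosting weight of sample i), x_i (histogram feature), y_i (label) and θ (threshold) are introduced here for illustration.

```latex
% Weighted class statistics for sample weights d_i, features x_i, labels y_i \in \{+1,-1\}
\mu_c = \frac{\sum_{i:\,y_i=c} d_i\, x_i}{\sum_{i:\,y_i=c} d_i}, \qquad
\Sigma_c = \frac{\sum_{i:\,y_i=c} d_i\,(x_i-\mu_c)(x_i-\mu_c)^{\top}}{\sum_{i:\,y_i=c} d_i}

% Fisher direction and the resulting 1-D weak classifier with threshold \theta
w = (\Sigma_{+1} + \Sigma_{-1})^{-1}(\mu_{+1} - \mu_{-1}), \qquad
h(x) = \operatorname{sign}(w^{\top}x - \theta)
```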

Histogram features
~10^5 rectangle features
Histograms over 4 gradient orientations, 4 subdivisions for each rectangle
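
One such feature could be computed roughly as below. This is a sketch under several assumptions not stated on the slide: the 4 subdivisions are taken to be a 2x2 split of the rectangle, orientations are unsigned, votes are weighted by gradient magnitude, and each histogram is normalised to sum to 1.

```python
import math

def orientation_histogram(image, x0, y0, x1, y1, bins=4):
    """Magnitude-weighted histogram of gradient orientations in a rectangle.

    image is a list of rows of grey values; the rectangle spans columns
    x0..x1-1 and rows y0..y1-1. Image border pixels are skipped.
    """
    hist = [0.0] * bins
    height, width = len(image), len(image[0])
    for y in range(max(y0, 1), min(y1, height - 1)):
        for x in range(max(x0, 1), min(x1, width - 1)):
            gx = image[y][x + 1] - image[y][x - 1]   # central differences
            gy = image[y + 1][x] - image[y - 1][x]
            magnitude = math.hypot(gx, gy)
            if magnitude == 0:
                continue
            angle = math.atan2(gy, gx) % math.pi     # fold to unsigned orientation
            hist[min(int(angle / math.pi * bins), bins - 1)] += magnitude
    total = sum(hist)
    return [h / total for h in hist] if total else hist

def rectangle_feature(image, x0, y0, x1, y1):
    """Concatenate the histograms of the 4 quadrants of a rectangle (assumed 2x2)."""
    xm, ym = (x0 + x1) // 2, (y0 + y1) // 2
    quadrants = [(x0, y0, xm, ym), (xm, y0, x1, ym),
                 (x0, ym, xm, y1), (xm, ym, x1, y1)]
    feature = []
    for q in quadrants:
        feature.extend(orientation_histogram(image, *q))
    return feature
```

With these choices each rectangle yields a 16-dimensional vector (4 subdivisions times 4 orientation bins).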

Training data
Crop and resize
Perturb annotation
Increase training set x10
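
The annotation perturbation step might look like the following sketch. The 5% shift and scale ranges, the (x, y, w, h) box format and the function names are all assumptions for illustration. The slide only states that annotations are perturbed and that the training set grows tenfold.

```python
import random

def perturb_box(box, rng, shift=0.05, scale=0.05):
    """Return a randomly shifted and rescaled copy of an (x, y, w, h) box."""
    x, y, w, h = box
    s = 1.0 + rng.uniform(-scale, scale)        # relative size change
    dx = rng.uniform(-shift, shift) * w         # shift relative to box size
    dy = rng.uniform(-shift, shift) * h
    new_w, new_h = w * s, h * s
    # Keep the perturbed box centred on the shifted original centre.
    cx, cy = x + w / 2 + dx, y + h / 2 + dy
    return (cx - new_w / 2, cy - new_h / 2, new_w, new_h)

def augment(boxes, copies=10, seed=0):
    """Each annotation plus perturbed copies, `copies` boxes per input in total."""
    rng = random.Random(seed)
    out = []
    for box in boxes:
        out.append(box)
        out.extend(perturb_box(box, rng) for _ in range(copies - 1))
    return out
```

Each returned box would then be cropped from the image and resized to the common training window size.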

Training: Selected Features
376 of ~10^5 features selected
correct classification
10^-5 false positives

Object detection
Scan and classify image windows at different positions and scales
Cluster detections in the space-scale space
Assign cluster size to the detection confidence (example: Conf.=5)
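
The three steps above can be sketched as follows. The window size, scale step and stride are illustrative values, and the greedy overlap-based grouping is a simple stand-in for the space-scale clustering named on the slide, not the presenter's method.

```python
def sliding_windows(width, height, base=(32, 32), scale_step=1.25, stride=0.25):
    """Yield (x, y, w, h) windows over all positions and scales."""
    w, h = base
    while w <= width and h <= height:
        step = max(1, int(stride * w))
        for y in range(0, height - int(h) + 1, step):
            for x in range(0, width - int(w) + 1, step):
                yield (x, y, int(w), int(h))
        w, h = w * scale_step, h * scale_step

def overlap(a, b):
    """Intersection-over-union of two (x, y, w, h) boxes."""
    ix = max(0, min(a[0] + a[2], b[0] + b[2]) - max(a[0], b[0]))
    iy = max(0, min(a[1] + a[3], b[1] + b[3]) - max(a[1], b[1]))
    inter = ix * iy
    return inter / (a[2] * a[3] + b[2] * b[3] - inter)

def cluster_detections(boxes, min_overlap=0.5):
    """Greedy grouping; returns (mean_box, confidence), confidence = cluster size."""
    clusters = []
    for box in boxes:
        for members in clusters:
            # Compare against the first member, which seeds the cluster.
            if overlap(box, members[0]) >= min_overlap:
                members.append(box)
                break
        else:
            clusters.append([box])
    return [(tuple(sum(c) / len(m) for c in zip(*m)), len(m)) for m in clusters]
```

In a full detector, `boxes` would be the windows from `sliding_windows` that the boosted classifier accepts. Five accepted windows around one object then give a single detection with confidence 5, as in the slide's example.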

PASCAL Visual Object Classes Challenge 2005 (VOC'05)
motorbikes: #217 / #220
bicycles: #123 / #123
people: #152 / #149
cars: #320 / #341

Evaluation criteria
Ground truth annotation
Detection results:
  >50 % overlap of bounding box with GT
  one bounding box for each object
  confidence value for each detection
Precision-Recall (PR) curve
Average Precision (AP) value
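
These criteria can be illustrated for a single image as below. Two points are assumptions rather than statements from the slide: overlap is measured as intersection over union, and AP uses 11-point interpolation, which is how we recall the VOC challenges of that period defining them. Boxes are (x0, y0, x1, y1) and all function names are hypothetical.

```python
def iou(a, b):
    """Intersection-over-union of two (x0, y0, x1, y1) boxes."""
    ix = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union

def precision_recall(detections, ground_truth, min_overlap=0.5):
    """PR points for detections given as (confidence, box), best first.

    Each ground-truth box can be matched once; a further detection of the
    same object counts as a false positive.
    """
    matched = set()
    tp = fp = 0
    curve = []
    for _, box in sorted(detections, key=lambda d: -d[0]):
        best, best_iou = None, min_overlap
        for k, gt in enumerate(ground_truth):
            o = iou(box, gt)
            if k not in matched and o > best_iou:
                best, best_iou = k, o
        if best is None:
            fp += 1
        else:
            matched.add(best)
            tp += 1
        curve.append((tp / (tp + fp), tp / len(ground_truth)))
    return curve

def average_precision(curve):
    """11-point interpolated AP over recall levels 0, 0.1, ..., 1."""
    total = 0.0
    for k in range(11):
        level = k / 10
        total += max((p for p, r in curve if r >= level), default=0.0)
    return total / 11
```

A full evaluation would pool detections from all test images before sorting by confidence.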

Evaluation of detection
PR-curves for the "Motorbike" validation dataset:
  FLD learner + 1-bin classifier
  [Levi and Weiss, CVPR 2004] "Learning object detection from a small number of examples: The importance of good features"

Results for VOC'05 Challenge
Panels: Motorbikes test1, Bicycles test1, People test1, Cars test1

Results for VOC'05 Challenge
Average Precision values

PASCAL Visual Object Classes Challenge 2006 (VOC’06)

Results for VOC'06 Challenge, competition "comp3" (train on VOC data), class "bicycle": examples

Results for VOC'06 Challenge, competition "comp3" (train on VOC data), class "cow": examples

Results for VOC'06 Challenge, competition "comp3" (train on VOC data), class "horse": examples

Results for VOC'06 Challenge, competition "comp3" (train on VOC data), class "motorbike"

Results for VOC'06 Challenge, competition "comp3" (train on VOC data), class "person"

Results for VOC'06 Challenge
Average Precision values
Classes (columns): bicycle, bus, car, cat, cow, dog, horse, motorbike, person, sheep
Methods (rows): Cambridge, ENSMP, INRIA_Douze, INRIA_Laptev, TUD, TKK

Final Notes
All results are obtained with a single set of parameters
Small number of training samples is sufficient
Efficient detection: 10 fps on 320x280 images
Extension to texton/color histogram features is straightforward
Open questions:
  Other free-shape regions better? How to find them?
  Better weak learner that takes advantage of histogram properties
  View transformations
Detection tasks in VOC05 and VOC06 are far from being solved; it is a challenge!