Published by Christian Boone. Modified over 9 years ago.
1
Boosted Histograms for Improved Object Detection
Ivan Laptev
IRISA/INRIA, Rennes, France
September 07, 2006
2
Histograms for object recognition

Remarkable success of recognition methods using histograms of local image measurements:
[Swain & Ballard 1991] - Color histograms
[Schiele & Crowley 1996] - Receptive field histograms
[Lowe 1999] - Localized orientation histograms (SIFT)
[Schneiderman & Kanade 2000] - Localized histograms of wavelet coefficients
[Leung & Malik 2001] - Texton histograms
[Belongie et al. 2002] - Shape context
[Dalal & Triggs 2005] - Dense orientation histograms

Likely explanation: histograms are robust to image variations such as limited geometric transformations and object class variability.
3
Histograms: What vs. Where

What to measure?
Color [SB91]
Gaussian derivatives [SC96]
Wavelet coefficients [SK00]
Textons [LM01]
Gradient orientation [L99, DT05]

Where to measure?
Whole image [SB91, SC96]
Pre-defined grid [SK00, BMP02, DT05]
Key points [L99]
[Figure: image regions labeled A, B, C, D]

No guarantee for optimal recognition
Different regions may have different discriminative power
4
AdaBoost
Efficient discriminative classifier [Freund & Schapire ’97]
Good performance for face detection [Viola & Jones ’01]

Idea: boosting with histogram features in place of Haar features
[Diagram labels: boosting, selected features, weak classifier]
Weak classifier: SVM and neural networks are too heavy
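As an illustration of the boosting loop this slide refers to, here is a minimal sketch of discrete AdaBoost with threshold stumps on single feature dimensions. The stump learner, the function names (`train_stump`, `adaboost`, `predict`) and the toy data layout are assumptions made for this example; they are not taken from the presentation, which goes on to use histogram-based weak learners.

```python
import math

def stump_predict(x, j, thr, pol):
    # A stump votes +pol when feature j reaches the threshold, -pol otherwise.
    return pol if x[j] >= thr else -pol

def train_stump(X, y, w):
    """Exhaustively pick the (feature, threshold, polarity) with lowest weighted error."""
    best = None
    for j in range(len(X[0])):
        for thr in sorted({x[j] for x in X}):
            for pol in (1, -1):
                err = sum(wi for wi, x, yi in zip(w, X, y)
                          if stump_predict(x, j, thr, pol) != yi)
                if best is None or err < best[0]:
                    best = (err, j, thr, pol)
    return best

def adaboost(X, y, rounds):
    """X: list of feature vectors, y: labels in {-1, +1}. Returns weighted stumps."""
    n = len(X)
    w = [1.0 / n] * n                      # start from uniform sample weights
    model = []
    for _ in range(rounds):
        err, j, thr, pol = train_stump(X, y, w)
        err = min(max(err, 1e-10), 1 - 1e-10)   # avoid log(0) on separable data
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, j, thr, pol))
        # Misclassified samples gain weight, correctly classified ones lose it.
        w = [wi * math.exp(-alpha * yi * stump_predict(x, j, thr, pol))
             for wi, x, yi in zip(w, X, y)]
        total = sum(w)
        w = [wi / total for wi in w]
    return model

def predict(model, x):
    score = sum(a * stump_predict(x, j, thr, pol) for a, j, thr, pol in model)
    return 1 if score >= 0 else -1
```

In this sketch each boosting round selects exactly one feature dimension, which is how boosting doubles as feature selection over a large feature pool.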
5
Weak learner
Possible approach: 1-dim. projections onto predefined vectors
Example 1
6
Weak learner
Possible approach: 1-dim. projections onto predefined vectors
Example 2
7
Fisher weak learner
Alternative approach:
Assume Normal distribution of features (hopefully valid at least for some of the ~10^5 features!)
Compute projection direction by FLD (Fisher linear discriminant) from the feature mean and feature covariance
Can be modified to minimize the error of weighted samples (required for boosting)
Evidence from real image training data: [Figure: Fisher learner vs. “1-bin” learner]
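The slide's own FLD formula did not survive extraction. The sketch below uses the textbook form, v = (S_pos + S_neg)^-1 (mu_pos - mu_neg), and folds the boosting weights into the class means and covariances. That weighting scheme, the small `ridge` term and all function names are assumptions for illustration, not necessarily the author's exact modification.

```python
def weighted_mean(X, w):
    s = sum(w)
    return [sum(wi * x[k] for wi, x in zip(w, X)) / s for k in range(len(X[0]))]

def weighted_cov(X, w, mu):
    s, d = sum(w), len(mu)
    return [[sum(wi * (x[a] - mu[a]) * (x[b] - mu[b]) for wi, x in zip(w, X)) / s
             for b in range(d)] for a in range(d)]

def solve(A, b):
    """Solve A z = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    z = [0.0] * n
    for r in range(n - 1, -1, -1):
        z[r] = (M[r][n] - sum(M[r][k] * z[k] for k in range(r + 1, n))) / M[r][r]
    return z

def fld_direction(X_pos, w_pos, X_neg, w_neg, ridge=1e-6):
    """Projection direction from weighted class means and covariances."""
    mu_p, mu_n = weighted_mean(X_pos, w_pos), weighted_mean(X_neg, w_neg)
    S_p, S_n = weighted_cov(X_pos, w_pos, mu_p), weighted_cov(X_neg, w_neg, mu_n)
    d = len(mu_p)
    # The ridge term is an assumption: it keeps the system solvable when
    # some histogram bins are constant over the training set.
    S = [[S_p[a][b] + S_n[a][b] + (ridge if a == b else 0.0) for b in range(d)]
         for a in range(d)]
    return solve(S, [p - q for p, q in zip(mu_p, mu_n)])

def project(v, x):
    return sum(vi * xi for vi, xi in zip(v, x))
```

A weak classifier would then threshold `project(v, x)`; choosing that threshold to minimize the weighted error is left out of this sketch.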
8
Histogram features
~10^5 rectangle features
Histograms over 4 gradient orientations, 4 subdivisions for each rectangle
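To make the feature concrete, the following sketch computes a 4-bin histogram of gradient orientations inside a rectangle and concatenates the histograms of a grid subdivision. Central-difference gradients, unsigned orientations, magnitude weighting and the 2x2 split are all assumptions chosen for the example; the slide does not specify them.

```python
import math

def orientation_histogram(img, x0, y0, x1, y1, bins=4):
    """Normalized, magnitude-weighted histogram of unsigned gradient
    orientations over the pixels in [x0, x1) x [y0, y1).
    img is a list of rows of grey values."""
    hist = [0.0] * bins
    h, w = len(img), len(img[0])
    # Skip the image border so central differences stay in bounds.
    for y in range(max(y0, 1), min(y1, h - 1)):
        for x in range(max(x0, 1), min(x1, w - 1)):
            gx = img[y][x + 1] - img[y][x - 1]
            gy = img[y + 1][x] - img[y - 1][x]
            mag = math.hypot(gx, gy)
            if mag == 0.0:
                continue
            theta = math.atan2(gy, gx) % math.pi      # orientation in [0, pi)
            b = min(int(theta / math.pi * bins), bins - 1)
            hist[b] += mag
    total = sum(hist)
    return [v / total for v in hist] if total > 0 else hist

def region_feature(img, x0, y0, x1, y1, rows=2, cols=2, bins=4):
    """Concatenate per-cell histograms for a rows x cols split of the rectangle."""
    feat = []
    for r in range(rows):
        for c in range(cols):
            cx0 = x0 + (x1 - x0) * c // cols
            cx1 = x0 + (x1 - x0) * (c + 1) // cols
            cy0 = y0 + (y1 - y0) * r // rows
            cy1 = y0 + (y1 - y0) * (r + 1) // rows
            feat.extend(orientation_histogram(img, cx0, cy0, cx1, cy1, bins))
    return feat
```

Enumerating many rectangle positions and sizes inside the detection window with such a function is one way to arrive at a feature pool on the order of 10^5.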
9
Training data
Crop and resize
Perturb annotation
Increase training set ×10
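One way to read "perturb annotation" is to jitter each annotated bounding box slightly in position and size, then crop and resize every jittered copy. The sketch below generates ten perturbed boxes per annotation. The 5 % shift and scale ranges, the function names and the (x0, y0, x1, y1) box layout are assumptions, since the slide gives no perturbation magnitudes.

```python
import random

def perturb_box(box, rng, max_shift=0.05, max_scale=0.05):
    """Return a copy of box = (x0, y0, x1, y1) with a small random shift
    (as a fraction of the box size) and a small random isotropic rescaling."""
    x0, y0, x1, y1 = box
    w, h = x1 - x0, y1 - y0
    s = 1.0 + rng.uniform(-max_scale, max_scale)
    cx = (x0 + x1) / 2 + rng.uniform(-max_shift, max_shift) * w
    cy = (y0 + y1) / 2 + rng.uniform(-max_shift, max_shift) * h
    return (cx - s * w / 2, cy - s * h / 2, cx + s * w / 2, cy + s * h / 2)

def augment(boxes, copies=10, seed=0):
    """Produce `copies` perturbed boxes for every annotated box."""
    rng = random.Random(seed)   # fixed seed keeps the training set reproducible
    return [perturb_box(b, rng) for b in boxes for _ in range(copies)]
```

Each returned box would then be cropped from the image and resized to the fixed training window size.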
10
Training: Selected Features
376 of ~10^5 features selected
0.999 correct classification
10^-5 false positives
11
Object detection
Scan and classify image windows at different positions and scales
Cluster detections in the position-scale space
Use the cluster size as the detection confidence (example: Conf. = 5)
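The detection steps on this slide can be sketched as a window generator plus a simple grouping of positive windows. The greedy, order-dependent clustering rule, the tolerances, the scale step and the function names below are assumptions for illustration; the presentation does not say which clustering method was used.

```python
import math

def sliding_windows(img_w, img_h, win_w, win_h, scale_step=1.25, stride_frac=0.1):
    """Yield (x, y, scale) for windows at all positions and scales that fit the image."""
    scale = 1.0
    while win_w * scale <= img_w and win_h * scale <= img_h:
        w, h = win_w * scale, win_h * scale
        step = max(1, int(stride_frac * w))   # stride grows with window size
        y = 0
        while y + h <= img_h:
            x = 0
            while x + w <= img_w:
                yield (x, y, scale)
                x += step
            y += step
        scale *= scale_step

def cluster_detections(dets, win_size, pos_tol=0.25, scale_tol=0.3):
    """Greedily group detections (cx, cy, scale) that lie close to a cluster's
    first member in position and log-scale. Returns one tuple
    (mean cx, mean cy, mean scale, confidence) per cluster, where the
    confidence is the number of detections in the cluster."""
    clusters = []
    for cx, cy, s in dets:
        for members in clusters:
            sx, sy, ss = members[0]
            if (abs(cx - sx) <= pos_tol * win_size * ss
                    and abs(cy - sy) <= pos_tol * win_size * ss
                    and abs(math.log(s / ss)) <= scale_tol):
                members.append((cx, cy, s))
                break
        else:
            clusters.append([(cx, cy, s)])
    return [(sum(m[0] for m in ms) / len(ms),
             sum(m[1] for m in ms) / len(ms),
             sum(m[2] for m in ms) / len(ms),
             len(ms)) for ms in clusters]
```

In this sketch, five overlapping positive windows around one object collapse into a single detection with confidence 5, matching the "Conf. = 5" example on the slide.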
12
PASCAL Visual Object Classes Challenge 2005 (VOC’05)
motorbikes: #217 / #220
bicycles: #123 / #123
people: #152 / #149
cars: #320 / #341
13
Evaluation criteria
Ground truth annotation
Detection results:
>50 % overlap of bounding box with GT
one bounding box for each object
confidence value for each detection
Precision-Recall (PR) curve
Average Precision (AP) value
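The criteria above can be sketched in a few functions. Two assumptions are made here: overlap is measured as intersection over union, as in the PASCAL VOC protocol, and AP is the 11-point interpolated average of precision. The slide's own AP formula was not preserved, and for brevity the example handles the boxes of a single image only.

```python
def iou(a, b):
    """Intersection over union of two boxes (x0, y0, x1, y1)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0

def precision_recall(dets, gts, min_overlap=0.5):
    """dets: list of (confidence, box); gts: list of ground-truth boxes.
    Returns (recall, precision) points, one per detection, in order of
    decreasing confidence."""
    used = [False] * len(gts)
    tp = fp = 0
    curve = []
    for conf, box in sorted(dets, key=lambda d: -d[0]):
        best, best_j = 0.0, -1
        for j, g in enumerate(gts):
            o = iou(box, g)
            if o > best:
                best, best_j = o, j
        if best > min_overlap and not used[best_j]:
            used[best_j] = True          # each object may be matched only once
            tp += 1
        else:
            fp += 1                      # poor overlap or duplicate detection
        curve.append((tp / len(gts), tp / (tp + fp)))
    return curve

def average_precision(curve):
    """11-point interpolated AP: mean over recall levels 0, 0.1, ..., 1 of the
    best precision reached at that recall or higher."""
    total = 0.0
    for k in range(11):
        r = k / 10
        total += max((p for rec, p in curve if rec >= r), default=0.0)
    return total / 11
```

Because a second detection on an already matched object counts as a false positive, the "one bounding box for each object" rule is enforced by the `used` flags.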
14
Evaluation of detection
PR-curves for the “Motorbike” validation dataset:
FLD learner + 1-bin classifier
[Levi and Weiss, CVPR 2004] “Learning object detection from a small number of examples: The importance of good features”
15
Results for VOC’05 Challenge
[Plots: Bicycles test1, People test1, Cars test1, Motorbikes test1]
16
Average Precision values: Results for VOC’05 Challenge
19
PASCAL Visual Object Classes Challenge 2006 (VOC’06)
20
Results for VOC’06 Challenge
Competition "comp3" (train on VOC data)
Class "bicycle": examples
21
Results for VOC’06 Challenge
Competition "comp3" (train on VOC data)
Class "cow": examples
22
Results for VOC’06 Challenge
Competition "comp3" (train on VOC data)
Class "horse": examples
23
Results for VOC’06 Challenge
Competition "comp3" (train on VOC data)
Class "motorbike"
24
Results for VOC’06 Challenge
Competition "comp3" (train on VOC data)
Class "person"
25
Results for VOC’06 Challenge
Average Precision values:

Method        bicycle  bus    car    cat    cow    dog    horse  motorbike  person  sheep
Cambridge     0.249    0.138  0.254  0.151  0.149  0.118  0.091  0.178      0.030   0.131
ENSMP         -        -      0.398  -      0.159  -      -      -          -       -
INRIA_Douze   0.414    0.117  0.444  -      0.212  -      -      0.390      0.164   0.251
INRIA_Laptev  0.440    -      -      -      0.224  -      0.140  0.318      0.114   -
TUD           -        -      -      -      -      -      -      0.153      0.074   -
TKK           0.303    0.169  0.222  0.160  0.252  0.113  0.137  0.265      0.039   0.227
26
Final Notes
All results are obtained with a single set of parameters
Small number of training samples is sufficient
Efficient detection: 10 fps on 320x280 images
Extension to texton/color histogram features is straightforward

Open questions:
Other free-shape regions better? How to find them?
Better weak learner that takes advantage of histogram properties
View transformations
30
Final Notes
Detection tasks in VOC’05 and VOC’06 are far from being solved: it is a challenge!