Multi-view Traffic Sign Detection, Recognition and 3D Localisation Radu Timofte, Karel Zimmermann, and Luc Van Gool.

Slides:

Advertisements

Similar presentations

CVPR2013 Poster Modeling Actions through State Changes.

Advertisements

Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.

Ignas Budvytis*, Tae-Kyun Kim*, Roberto Cipolla * - indicates equal contribution Making a Shallow Network Deep: Growing a Tree from Decision Regions of.

Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.

Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.

Learning Techniques for Video Shot Detection Under the guidance of Prof. Sharat Chandran by M. Nithya.

- Recovering Human Body Configurations: Combining Segmentation and Recognition (CVPR’04) Greg Mori, Xiaofeng Ren, Alexei A. Efros and Jitendra Malik -

Face detection Many slides adapted from P. Viola.

Student: Yao-Sheng Wang Advisor: Prof. Sheng-Jyh Wang ARTICULATED HUMAN DETECTION 1 Department of Electronics Engineering National Chiao Tung University.

Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.

HCI Final Project Robust Real Time Face Detection Paul Viola, Michael Jones, Robust Real-Time Face Detetion, International Journal of Computer Vision,

Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.

Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson

Face Recognition with Harr Transforms and SVMs EE645 Final Project May 11, 2005 J Stautzenberger.

1 How to be a Bayesian without believing Yoav Freund Joint work with Rob Schapire and Yishay Mansour.

Introduction to Boosting Aristotelis Tsirigos SCLT seminar - NYU Computer Science.

Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.

Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.

Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science Technische Universität München Adaptive.

EE392J Final Project, March 20, Multiple Camera Object Tracking Helmy Eltoukhy and Khaled Salama.

Face Alignment Using Cascaded Boosted Regression Active Shape Models

Final Exam Review CS485/685 Computer Vision Prof. Bebis.

Learning Based Hierarchical Vessel Segmentation

Artificial Neural Networks

3D Fingertip and Palm Tracking in Depth Image Sequences

SPIE'01CIRL-JHU1 Dynamic Composition of Tracking Primitives for Interactive Vision-Guided Navigation D. Burschka and G. Hager Computational Interaction.

CS 8751 ML & KDDSupport Vector Machines1 Support Vector Machines (SVMs) Learning mechanism based on linear programming Chooses a separating plane based.

Professor: S. J. Wang Student : Y. S. Wang

Jifeng Dai 2011/09/27.  Introduction  Structural SVM  Kernel Design  Segmentation and parameter learning  Object Feature Descriptors  Experimental.

Energy-Aware Scheduling with Quality of Surveillance Guarantee in Wireless Sensor Networks Jaehoon Jeong, Sarah Sharafkandi and David H.C. Du Dept. of.

Implementing Codesign in Xilinx Virtex II Pro Betim Çiço, Hergys Rexha Department of Informatics Engineering Faculty of Information Technologies Polytechnic.

Classification and Ranking Approaches to Discriminative Language Modeling for ASR Erinç Dikici, Murat Semerci, Murat Saraçlar, Ethem Alpaydın 報告者：郝柏翰 2013/01/28.

Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.

Face detection Slides adapted Grauman & Liebe’s tutorial

Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.

Robust Real-time Face Detection by Paul Viola and Michael Jones, 2002 Presentation by Kostantina Palla & Alfredo Kalaitzis School of Informatics University.

Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.

ECE738 Advanced Image Processing Face Detection IEEE Trans. PAMI, July 1997.

A Face processing system Based on Committee Machine: The Approach and Experimental Results Presented by: Harvest Jang 29 Jan 2003.

CS654: Digital Image Analysis

Human pose recognition from depth image MS Research Cambridge.

Face Detection Ying Wu Electrical and Computer Engineering Northwestern University, Evanston, IL

Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.

Automated Solar Cavity Detection

The Viola/Jones Face Detector A “paradigmatic” method for real-time object detection Training is slow, but detection is very fast Key ideas Integral images.

Hand Gesture Recognition Using Haar-Like Features and a Stochastic Context-Free Grammar IEEE 高裕凱陳思安.

Fast Query-Optimized Kernel Machine Classification Via Incremental Approximate Nearest Support Vectors by Dennis DeCoste and Dominic Mazzoni International.

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.

Notes on HW 1 grading I gave full credit as long as you gave a description, confusion matrix, and working code Many people’s descriptions were quite short.

Object Recognition by Discriminative Combinations of Line Segments and Ellipses Alex Chia ^˚ Susanto Rahardja ^ Deepu Rajan ˚ Maylor Leung ˚ ^ Institute.

A Discriminatively Trained, Multiscale, Deformable Part Model Yeong-Jun Cho Computer Vision and Pattern Recognition,2008.

Fast Human Detection in Crowded Scenes by Contour Integration and Local Shape Estimation Csaba Beleznai, Horst Bischof Computer Vision and Pattern Recognition,

Introduction to Skin and Face Detection

Author : Sang Hwa Lee, Junyeong Choi, and Jong-Il Park

Guillaume-Alexandre Bilodeau

Face Detection EE368 Final Project Group 14 Ping Hsin Lee

Object detection as supervised classification

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science

Tremor Detection Using Motion Filtering and SVM Bilge Soran, Jenq-Neng Hwang, Linda Shapiro, ICPR, /16/2018.

Level Set Tree Feature Detection

An Introduction to Supervised Learning

Brief Review of Recognition + Context

RCNN, Fast-RCNN, Faster-RCNN

Introduction to Object Tracking

Concave Minimization for Support Vector Machine Classifiers

Jie Chen, Shiguang Shan, Shengye Yan, Xilin Chen, Wen Gao

A Novel Smoke Detection Method Using Support Vector Machine

Presentation transcript:

Multi-view Traffic Sign Detection, Recognition and 3D Localisation Radu Timofte, Karel Zimmermann, and Luc Van Gool

Problem definition  Input: Large set of views and corresponding camera locations  Output: List of traffic signs

Outline Single view Segmentation – very fast bounding box selection process with FN -> 0. –Traffic signs are designed to be well distinguishable from background => have distinctive colors and shapes. Detection – Adaboost classifiers of bounding boxes. Recognition – based on SVM classifiers. Multi-view Global optimization – over single-view detections constrained by 3D geometry

Color-based segmentation (thresholding) Estimation of connected components of a thresholded image (T = [0.5,0.2,-0.4,1.0] T )

Shape-based segmentation Searching for specific shapes (rectangles, circles, triangles). –Not all traffic signs are locally threshold separable. –More time consuming, many responses for small shapes (every small region is approximately some basic shape).

Learning segmentation There are thousands of possible settings of such methods, e.g. different projections from color space. Learning is searching for a reasonable subset of these methods/settings. Optimal trade-off among FN, FP and the number of methods: T* = argmin (FP(T) + K 1 FN(T)+K 2 card(T)) Boolean Linear Programming selects ≈ 50 methods out of in 2 hours. Segmentation results for example: FN BB = 1.5%, FP = 3443/ 2Mpxl image, (FN TS = 0.5%)

How does the output of segmentation looks like?

Detection Detection: suppression of bounding boxes which does not like a traffic sign. –Haar features computed on each channel of HSI space. –Separated shape-specific cascades of Adaboost classifiers. Detection (+segmentation) results:

How does the output of detection looks like?

3D optimization - introduction Single view detection and recognition is just preprocessing, the final decision is subject of the global optimization over multiple views. The idea is based on Minimum Description Length, i.e. explaining detected bounding boxes by the lowest number of real world traffic signs. If detections satisfy some visual and geometrical constrains, then all of these detections are explainable by one real world traffic sign.

Minimum Description Length in 3D

Problem formulation max X T X x ∊{0,1}

Example with 16 views

Results The summary of 3D results: The average accuracy of 3D localisation is of cm.

Visualisation of 3D results in one camera

3D visualisation

Conclusions Traffic Sign Detection, Recognition and 3D Localisation is a challenging problem. We propose a multi-view scheme, which combines 2D and 3D analysis. The main contributions are: –Boolean Linear Programming formulation for fast candidate extraction in 2D –Minimum Description Length formulation for best 3D hypothesis selection Work in progress…