Object Recognition by Discriminative Combinations of Line Segments and Ellipses

Presentation transcript:

Object Recognition by Discriminative Combinations of Line Segments and Ellipses
Alex Chia ^˚, Susanto Rahardja ^, Deepu Rajan ˚, Maylor Leung ˚
^ Institute for Infocomm Research (I²R), Singapore
˚ Nanyang Technological University, Singapore

Goals
Image classification
– Separate images containing an object category from other images
[Figure: example category Horse-side]

Goals – cont.
Category-level object detection
– Localize all instances of an object category in an image
[Figure: example categories People, Face, Cow-side]

Existing Approaches
Region based approach
– Exploits image pixel brightness or color values
– Not suitable for complex classes characterized by thin skeletal structures (e.g. bicycle)
– Other classes (e.g. horse) are more defined by their shape

Existing Approaches – cont.
Contour based approach
– Exploits spatial configuration or statistics of edge pixels
– Edge based rich local descriptors
– Contour fragments
– Shape primitives
Shape primitives
I. Support abstract reasoning (unlike edge based local descriptors)
II. Efficient storage demands (unlike contour fragments)
III. Efficient comparison across single and multiple scales (unlike contour fragments)

Our contour based approach – outline
Dataset: training images (learning phase) and testing images (evaluation phase)
Steps shown in the outline
– Extract line segments and ellipses
– Construct shape tokens
– Learn category-specific codebook of shape tokens
– Boost discriminative codeword combinations
– Detect object instances and classify images
– Evaluate performance

Constructing shape tokens
Pair a reference primitive to its connected neighbor
– Tokens: Ellipse-line, Line-line, Ellipse-ellipse
Geometrical and spatial properties
– Length, orientation, distance between midpoints, relative primitive positions
[Figure: example tokens annotated with θr, θn, lr, ln, wr and h]
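As a rough illustration of the properties listed on this slide, the Python sketch below pairs a reference line segment with a neighbor and records lengths, orientations, the distance between midpoints, and the relative position. The LineSegment class, the endpoint representation, and the dictionary keys are assumptions made for illustration; the slides do not specify a data layout, and ellipses are left out to keep the sketch short.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class LineSegment:
    """A line segment given by its two endpoints (hypothetical representation)."""
    x1: float
    y1: float
    x2: float
    y2: float

    @property
    def length(self) -> float:
        return math.hypot(self.x2 - self.x1, self.y2 - self.y1)

    @property
    def orientation(self) -> float:
        # Undirected orientation folded into [0, pi).
        return math.atan2(self.y2 - self.y1, self.x2 - self.x1) % math.pi

    @property
    def midpoint(self) -> tuple[float, float]:
        return ((self.x1 + self.x2) / 2.0, (self.y1 + self.y2) / 2.0)


def line_line_token(ref: LineSegment, nbr: LineSegment) -> dict[str, float]:
    """Describe a reference/neighbor pair by simple geometric attributes."""
    (rx, ry), (nx, ny) = ref.midpoint, nbr.midpoint
    dx, dy = nx - rx, ny - ry
    return {
        "length_ref": ref.length,
        "length_nbr": nbr.length,
        "orientation_ref": ref.orientation,
        "orientation_nbr": nbr.orientation,
        "midpoint_distance": math.hypot(dx, dy),
        # Relative position of the neighbor with respect to the reference.
        "rel_dx": dx,
        "rel_dy": dy,
    }


# Two connected segments forming an "L" shape.
token = line_line_token(LineSegment(0, 0, 4, 0), LineSegment(4, 0, 4, 3))
```

A scale normalized variant would divide the lengths and distances by an object scale; that step is omitted here.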

Comparing shape tokens
A token is compared only to similar typed tokens
Differences in their attributes
– Difference in widths
– Difference in lengths
– Difference in orientation
– Difference in spatial separation of primitives
– Difference in relative primitive positions
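One way to read this slide is as a distance that sums per-attribute differences and is only defined between tokens of the same type. The Python sketch below follows that reading. The dictionary layout, the unweighted sum, and the use of infinity for mismatched types are all assumptions; the slides do not give the actual weighting or normalization.

```python
import math


def orientation_difference(a: float, b: float) -> float:
    """Smallest angle between two undirected orientations, in [0, pi/2]."""
    d = abs(a - b) % math.pi
    return min(d, math.pi - d)


def token_distance(t1: dict, t2: dict) -> float:
    """Unweighted sum of attribute differences (an assumed combination rule)."""
    if t1["type"] != t2["type"]:
        return math.inf  # tokens of different types are never compared
    d_len = sum(abs(a - b) for a, b in zip(t1["lengths"], t2["lengths"]))
    d_wid = sum(abs(a - b) for a, b in zip(t1["widths"], t2["widths"]))
    d_ori = sum(
        orientation_difference(a, b)
        for a, b in zip(t1["orientations"], t2["orientations"])
    )
    d_sep = abs(t1["separation"] - t2["separation"])
    d_pos = math.dist(t1["rel_pos"], t2["rel_pos"])
    return d_len + d_wid + d_ori + d_sep + d_pos


token_a = {
    "type": "line-line",
    "lengths": (4.0, 3.0),
    "widths": (0.0, 0.0),  # line segments are given zero width here
    "orientations": (0.0, math.pi / 2),
    "separation": 2.5,
    "rel_pos": (2.0, 1.5),
}
# Same token with a longer reference line and a slightly rotated one.
token_b = dict(token_a, lengths=(5.0, 3.0), orientations=(math.pi - 0.1, math.pi / 2))
token_c = dict(token_a, type="ellipse-line")

d_ab = token_distance(token_a, token_b)
d_ac = token_distance(token_a, token_c)
```

The orientation term wraps around at pi, so orientations of 0 and pi - 0.1 differ by 0.1 rather than by almost pi.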

Learning category-specific codebook
Extracting tokens from within the bounding boxes of training objects
– Normalized appearance descriptor
– Normalized translational vector
Clustering tokens by their scale normalized appearance descriptors
– Adapted bisecting 2-medoid clustering
Clustering tokens by their relative positions
– Mean-shift clustering

Learning category-specific codebook – cont.
Medoid of each mean-shift sub-cluster as candidate codeword
Appearance distance allowance
– Indicates the range of appearances the candidate represents
– = Mean appearance distance + Std. dev.
Scale normalized circular window
– Indicates where the candidate is found relative to the object centroid
– Described by the center and radius of the window
[Figure: a mean-shift sub-cluster in feature space]
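The allowance rule on this slide (mean appearance distance plus standard deviation) can be sketched in a few lines of Python. The sketch assumes the distances are measured from the medoid to the other members of its sub-cluster and that the population standard deviation is used; neither detail is stated on the slide. The function names and the toy 1-D descriptors are hypothetical.

```python
import statistics
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def medoid_index(items: Sequence[T], dist: Callable[[T, T], float]) -> int:
    """Index of the member with the smallest total distance to all members."""
    return min(
        range(len(items)),
        key=lambda i: sum(dist(items[i], other) for other in items),
    )


def appearance_allowance(items: Sequence[T], dist: Callable[[T, T], float]) -> tuple[T, float]:
    """Return (medoid, allowance) with allowance = mean distance + std. dev."""
    m = medoid_index(items, dist)
    # Distances from the medoid to every other member (self-distance excluded).
    distances = [dist(items[m], items[j]) for j in range(len(items)) if j != m]
    allowance = statistics.mean(distances) + statistics.pstdev(distances)
    return items[m], allowance


# Toy sub-cluster of 1-D appearance descriptors.
descriptors = [1.0, 2.0, 3.0, 4.0, 10.0]
candidate, allowance = appearance_allowance(descriptors, lambda a, b: abs(a - b))
```

Here the medoid is 3.0, the distances to the other members are 2, 1, 1 and 7, and the allowance is their mean (2.75) plus their population standard deviation.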

Learning category-specific codebook – cont.
Score each candidate by appearance + geometric qualities
[Figure: candidates from all sub-clusters and from the 350 most populated sub-clusters; labels: number of unique training objects, appearance qualities, geometric quality]

Learning category-specific codebook – cont.
Radial ranking method to select candidates into the codebook

Learning category-specific codebook – cont.
[Figure: candidates from all sub-clusters, from the 350 most populated sub-clusters, and from the 350 selected sub-clusters, shown for Face, Bike-front, Bottle, Horse-side and Cow-side]

Learning discriminative codeword combinations
Each codeword parameterized by
– Appearance distance allowance
– Scale normalized circular window with radius and center
Matching codeword combination
– Every codeword in the combination finds image tokens within its appearance distance allowance (appearance constraint)
– Centroid predictions by all codewords in the combination concur (geometric constraint)

Learning discriminative codeword combinations – cont.
Basic idea for finding matched codeword combinations
– Given codeword i and codeword j, for a scale ‘s’ and location ‘x’ in an image
– If all codewords find matching tokens within their estimated windows for scale ‘s’ and location ‘x’, they predict centroid locations which concur
[Figure: estimated windows of two codewords placed relative to location x]
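The matching idea above can be sketched as follows: for a hypothesized centroid location and scale, each codeword's scale normalized window is mapped into the image, and the combination matches only if every codeword finds a token inside its window whose appearance distance is within the allowance. The class names, the 1-D toy descriptors, and the way the window is scaled are assumptions made for illustration, not the authors' implementation.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Codeword:
    descriptor: float                    # toy 1-D appearance descriptor
    allowance: float                     # appearance distance allowance
    window_center: tuple[float, float]   # relative to centroid, scale normalized
    window_radius: float                 # scale normalized


@dataclass(frozen=True)
class ImageToken:
    descriptor: float
    position: tuple[float, float]


def best_token(cw: Codeword, tokens: Sequence[ImageToken],
               centroid: tuple[float, float], scale: float) -> Optional[ImageToken]:
    """Closest-in-appearance token inside the estimated window, if within allowance."""
    cx = centroid[0] + scale * cw.window_center[0]
    cy = centroid[1] + scale * cw.window_center[1]
    radius = scale * cw.window_radius
    in_window = [t for t in tokens
                 if math.hypot(t.position[0] - cx, t.position[1] - cy) <= radius]
    if not in_window:
        return None
    best = min(in_window, key=lambda t: abs(t.descriptor - cw.descriptor))
    return best if abs(best.descriptor - cw.descriptor) <= cw.allowance else None


def combination_matches(codewords: Sequence[Codeword], tokens: Sequence[ImageToken],
                        centroid: tuple[float, float], scale: float) -> bool:
    """True when every codeword in the combination finds a matching token."""
    return all(best_token(cw, tokens, centroid, scale) is not None for cw in codewords)


codeword_i = Codeword(descriptor=1.0, allowance=0.5, window_center=(-1.0, 0.0), window_radius=0.5)
codeword_j = Codeword(descriptor=5.0, allowance=0.5, window_center=(1.0, 0.0), window_radius=0.5)
image_tokens = [ImageToken(1.2, (8.0, 10.0)), ImageToken(5.1, (12.0, 10.0))]

hit = combination_matches([codeword_i, codeword_j], image_tokens, centroid=(10.0, 10.0), scale=2.0)
miss = combination_matches([codeword_i, codeword_j], image_tokens, centroid=(20.0, 20.0), scale=2.0)
```

Because every window is placed relative to the same hypothesized centroid, a joint match implies that the codewords agree on where the object centroid lies.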

Learning discriminative codeword combinations – cont.
Response of codeword i at scale ‘s’ and location ‘x’ of image
– Find the token t* within the estimated window that has the least appearance distance to the codeword
– The response lies in [0, 2] if a matching token is found within the window, and takes a separate value otherwise

Learning discriminative codeword combinations – cont.
Simple example (2 codewords)
– Matching of codewords ‘i’ and ‘j’ at scale s and location x
– Generalized form
Binary decision tree
– Tests codeword responses pi, pj, ... and outputs a predicted label
– Each test involves a quantity in [0, 2] and a direction of inequality in {-1, +1}
Constraints of object class: Appearance + Geometric + Structural
– Appearance: visual aspects of tokens
– Geometric: spatial layout of tokens
– Structural: relationships of tokens
[Figure: binary decision trees over pi; over pi and pj; and over pi, pj and pk, each ending in a predicted label]
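One possible reading of the decision-tree slide is a weak classifier that tests each codeword response against a threshold, with a per-codeword direction of inequality, and predicts the positive label only when every test passes. The Python sketch below implements that reading. The function name, the interpretation of the [0, 2] quantity as a threshold, and the -1/+1 output labels are assumptions, since the formula itself is not preserved in the transcript.

```python
from typing import Sequence


def combination_label(responses: Sequence[float],
                      thresholds: Sequence[float],
                      directions: Sequence[int]) -> int:
    """Return +1 if every test a*p <= a*theta passes, else -1.

    a = +1 requires the response to be at most the threshold;
    a = -1 flips the inequality and requires it to be at least the threshold.
    """
    if not (len(responses) == len(thresholds) == len(directions)):
        raise ValueError("responses, thresholds and directions must align")
    for p, theta, a in zip(responses, thresholds, directions):
        if a not in (-1, 1):
            raise ValueError("direction of inequality must be -1 or +1")
        if a * p > a * theta:
            return -1  # one failed test rejects the whole combination
    return 1


# Two codewords: p_i must be <= 0.5 and p_j must be >= 1.0.
label_pass = combination_label([0.3, 1.5], [0.5, 1.0], [1, -1])
label_fail = combination_label([0.3, 0.8], [0.5, 1.0], [1, -1])
```

Adding a third codeword only extends the three lists, which is how a combination with a variable number of codewords could be represented under this reading.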

Learning discriminative codeword combinations – cont.
Boosting
– Input: matrix of values, vector of z labels
– Output: weight vector
– Detection confidence computed from the boosting output
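To make the input and output of the boosting step concrete, the sketch below runs standard discrete AdaBoost over a matrix of weak-classifier outputs and a label vector, and returns one weight per weak classifier. Standard AdaBoost is swapped in here as an assumption; the slides do not say which boosting variant is used, and the toy matrix, the function names, and the weighted-sum confidence are illustrative only.

```python
import math
from typing import Sequence


def adaboost(outputs: Sequence[Sequence[int]], labels: Sequence[int], rounds: int) -> list[float]:
    """Discrete AdaBoost. outputs[n][k] is weak classifier k on sample n, in {-1, +1}."""
    n_samples, n_weak = len(outputs), len(outputs[0])
    sample_w = [1.0 / n_samples] * n_samples
    alphas = [0.0] * n_weak
    for _ in range(rounds):
        # Weighted error of every weak classifier under the current sample weights.
        errors = [
            sum(w for w, row, z in zip(sample_w, outputs, labels) if row[k] != z)
            for k in range(n_weak)
        ]
        k = min(range(n_weak), key=errors.__getitem__)
        if errors[k] >= 0.5:
            break  # no weak classifier is better than chance
        err = max(errors[k], 1e-10)  # avoid division by zero for a perfect classifier
        alpha = 0.5 * math.log((1.0 - err) / err)
        alphas[k] += alpha
        # Increase the weight of misclassified samples, then renormalize.
        sample_w = [w * math.exp(-alpha * z * row[k])
                    for w, row, z in zip(sample_w, outputs, labels)]
        total = sum(sample_w)
        sample_w = [w / total for w in sample_w]
    return alphas


def confidence(alphas: Sequence[float], row: Sequence[int]) -> float:
    """Weighted vote of the weak classifiers for one sample."""
    return sum(a * h for a, h in zip(alphas, row))


# Rows are samples, columns are three candidate weak classifiers.
weak_outputs = [[1, 1, -1], [1, -1, 1], [-1, -1, 1], [1, -1, -1]]
z_labels = [1, 1, -1, -1]
weights = adaboost(weak_outputs, z_labels, rounds=3)
scores = [confidence(weights, row) for row in weak_outputs]
```

In this toy example no single weak classifier separates the four samples, but after three rounds the sign of the weighted vote agrees with every label.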

Experimental Results – Weizmann horse
[Figure: Detection (RP-AUC), recall against false positives per image, comparing Shotton et al. I, Shotton et al. II (retrained test), Bai et al. and our method]
[Figure: Classification (ROC-AUC), true positive rate against false positive rate, comparing Shotton et al. I, Shotton et al. II (retrained test) and our method]
J. Shotton et al., TPAMI; X. Bai et al., ICCV, 2009.

Experimental Results – Graz-17
[Table: values not preserved in the transcript]
Table columns
– Object category
– Number of object images: Training, Testing
– Image classification results (ROC-AUC): Our method, Shotton et al.
– Object detection results (RP-AUC): Our method, Shotton et al.
Table rows
– Plane, Motorbike, Face, Car-rear, Car-2/3-rear, Car-front, Bike-rear, Bike-front, Bike-side, Bottle, Cow-front, Cow-side, Horse-front, Horse-side, Person, Mug, Cup
– Average across categories
J. Shotton et al., TPAMI
Additional comparisons with other methods provided in paper

Summary
– Presented a contour based recognition approach which exploits simple and generic shape primitives
– Proposed a method to learn discriminative primitive combinations which have a variable number of primitives
– Demonstrated the effectiveness of our approach with extensive experiments across 17 categories

Thank you