Unsupervised Category Modeling, Recognition and Segmentation Sinisa Todorovic and Narendra Ahuja.

Slides:

Advertisements

Similar presentations

Shape Matching and Object Recognition using Low Distortion Correspondence Alexander C. Berg, Tamara L. Berg, Jitendra Malik U.C. Berkeley.

Advertisements

Applications of one-class classification

Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.

Top-Down & Bottom-Up Segmentation

FACIAL EMOTION RECOGNITION BY ADAPTIVE PROCESSING OF TREE STRUCTURES Jia-Jun Wong and Siu-Yeung Cho Forensic and Security Lab School of Computer Engineering.

Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.

Face Alignment with Part-Based Modeling

Vision Based Control Motion Matt Baker Kevin VanDyke.

Qualifying Exam: Contour Grouping Vida Movahedi Supervisor: James Elder Supervisory Committee: Minas Spetsakis, Jeff Edmonds York University Summer 2009.

Unsupervised Learning of Visual Object Categories Michael Pfeiffer

Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.

Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA

Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.

Object Detection by Matching Longin Jan Latecki. Contour-based object detection Database shapes: …..

Fast intersection kernel SVMs for Realtime Object Detection

Computer and Robot Vision I

Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.

Bundling Features for Large Scale Partial-Duplicate Web Image Search Zhong Wu ∗, Qifa Ke, Michael Isard, and Jian Sun CVPR 2009.

A new face detection method based on shape information Pattern Recognition Letters, 21 (2000) Speaker: M.Q. Jing.

1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

Generic Object Recognition -- by Yatharth Saraf A Project on.

Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson

Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.

Human Action Recognition

5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.

Object Class Recognition by Unsupervised Scale-Invariant Learning R. Fergus, P. Perona, and A. Zisserman Presented By Jeff.

Wayne State University, 1/31/ Multiple-Instance Learning via Embedded Instance Selection Yixin Chen Department of Computer Science University of.

Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.

Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.

SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.

Recognition using Regions (Demo) Sudheendra V. Outline Generating multiple segmentations –Normalized cuts [Ren & Malik (2003)] Uniform regions –Watershed.

Multiscale Symmetric Part Detection and Grouping Alex Levinshtein, Sven Dickinson, University of Toronto and Cristian Sminchisescu, University of Bonn.

A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The.

Generalized Hough Transform

Representations for object class recognition David Lowe Department of Computer Science University of British Columbia Vancouver, Canada Sept. 21, 2006.

AUTOMATIC TARGET RECOGNITION OF CIVILIAN TARGETS September 28 th, 2004 Bala Lakshminarayanan.

Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.

MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.

Class-Specific Hough Forests for Object Detection Zhen Yuan Hsu Advisor：S.J.Wang Gall, J., Lempitsky, V.: Class-specic hough forests for object detection.

Stable Multi-Target Tracking in Real-Time Surveillance Video

Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.

Interactive Learning of the Acoustic Properties of Objects by a Robot

Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.

Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.

Data Mining and Decision Support

Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.

Object Recognition by Discriminative Combinations of Line Segments and Ellipses Alex Chia ^˚ Susanto Rahardja ^ Deepu Rajan ˚ Maylor Leung ˚ ^ Institute.

Image features and properties. Image content representation The simplest representation of an image pattern is to list image pixels, one after the other.

SUMMERY 1. VOLUMETRIC FEATURES FOR EVENT DETECTION IN VIDEO correlate spatio-temporal shapes to video clips that have been automatically segmented we.

Meeting 8: Features for Object Classification Ullman et al.

Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.

Date of download: 7/8/2016 Copyright © 2016 SPIE. All rights reserved. A scalable platform for learning and evaluating a real-time vehicle detection system.

Object Recognition by Parts

Guillaume-Alexandre Bilodeau

Learning Mid-Level Features For Recognition

Supervised Time Series Pattern Discovery through Local Importance

Nonparametric Semantic Segmentation

Paper Presentation: Shape and Matching

Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT

Object Recognition by Parts

Object Recognition by Parts

CellNetQL Image Segmentation without Feature Definition

Object Recognition by Parts

Brief Review of Recognition + Context

Saliency Optimization from Robust Background Detection

Object Recognition by Parts

Human-object interaction

Object Recognition with Interest Operators

Presentation transcript:

Unsupervised Category Modeling, Recognition and Segmentation Sinisa Todorovic and Narendra Ahuja

What is Common in a Set of Images? Images possibly contain an object of interest Which objects appear frequently in the set? What properties are shared by similar objects in the set? Where are the objects?

Objective: Car Example occlusionno carmultiple cars learn car model unseen image segment all cars RESULT occlusion

Training Problem Definition GIVEN Images possibly containing frequent occurrences of similar objects DETERMINE If similar objects are present AND IF YES LEARN The model of similar objects GIVEN An unseen image DETECT, RECOGNIZE AND SEGMENT All occurrences of the learned category Testing

Prior Work Dominated By: Statistical modeling of local features: patches or curve fragments Trend: Object detection = Image classification Trend: Object segmentation = Object localization Trend: Object segmentation = Binary thresholding of a probabilistic map Hypothesize the number of objects and their parts Hypothesize the topology of object parts Each training image must contain a category of interest Modeling background Require typically hundreds of training images

Explicit modeling of recursive embedding of object subparts Regions vs. local features open questions: More informative? More stable and robust to noise? Regions allow: simultaneous object detection and segmentation explicit representation of the recursive embedding property Category Modeling is Very Difficult

Our Approach SIMILAR OBJECTS PRESENT IN THE SET MANY SUBIMAGES WITH SIMILAR REGION PROPERTIES ABUNDANT DATA ROBUST LEARNING IS FEASIBLE find ? do ? image matching structural learning

Region Properties Geometric Region area Boundary shape Photometric Gray-level contrast with the surround Topology Recursive containment of regions Layout - relative region locations

Feature Extraction = Image Segmentation Image Homogeneous regions at ALL contrasts and sizes segmentation [N. Ahuja TPAMI ‘96, Tabb & Ahuja TIP ‘97, Arora & Ahuja ICPR ‘06] Example segmentations for several contrasts

Multiscale Segmentation to Segmentation Tree Sample cutsets Segmentation tree Contrast level ≠ Tree level Example segmentations

Image = Tree and Object = Subtree

Outline of Our Approach Images = Trees Category present = Many similar subtrees Extracting similar subtrees = Tree matching Category model = Union of similar subtrees Simultaneous detection, recognition and segmentation of ALL category instances by Matching the model with an image

Tree Matching: Structural Noise Edit-distance tree matching [Pelillo et a. PAMI‘99, Torsello&Hancock ECCV’02, PRL’03]

Matching Algorithm Input treesMatched subtrees

which MAXIMIZES their similarity measure Matching Algorithm FIND bijection while PRESERVING ancestor-descendant relationships GIVEN two trees: node saliency cost of node matching

Matching Algorithm: Recursive Solution Maximum clique over all descendant pairs descendants SOLUTION Select all pairs with > threshold.

Outline LEARNING

Category Model = Tree Union Learning algorithm estimates: 1) Model structure 2) Model parameters Tree intersection: Tree union:

Simultaneous Detection and Segmentation MATCHING

Performance Evaluation Criteria Ground Truth (GT) Matched Subtrees (MST) DETECTION ERROR False positive: “intersection of MST and GT” < 0.5 “union of MST and GT” SEGMENTATION ERROR Matched Subtrees (MST) Ground Truth (GT) “XOR of MST and GT”

10 positive out of 20 training images 5 positive out of 10 training images Results on test images: Results: UIUC Cars Side View

6 positive out of 12 training images 3 positive out of 6 training images Results on test images: Results: Faces -- Caltech 101 Database

Results: Caltech Cars Rear View 10 positive out of 20 training images

Recall-Precision Training from a small-size dataset Varying tradeoff recall vs. precision

Complexity and Runtime on 2.4GHZ 2GB RAM PC Training on 20 images of UIUC CARS: < 2 hours # of tree nodes Extracting similar subtrees:per image pair # of subtree nodes Learning on 32 subtrees extracted for UIUC CARS: < 1 hour Learning: # of model nodes Processing time for UIUC CARS: < 10 sec, regardless of the total number of target objects Detection, recognition and segmentation:

Summary and Conclusion Unsupervised learning of an unknown category frequently occurring in a given set of images Region-based, structural approach Simultaneous detection, recognition, and segmentation of all category instances in unseen images NO multiple detections on the same object NO hypotheses on the number of objects and their parts NO hypotheses on the topology of object parts Small number of training images Complexity comparable with standard methods

Acknowledgment THANK YOU! {sintod,