1 High-Level Computer Vision Detection of classes of objects (faces, motorbikes, trees, cheetahs) in images Detection of classes of objects (faces, motorbikes,

Slides:



Advertisements
Similar presentations
Distinctive Image Features from Scale-Invariant Keypoints
Advertisements

Distinctive Image Features from Scale-Invariant Keypoints David Lowe.
Matching in 2D Readings: Ch. 11: Sec alignment point representations and 2D transformations solving for 2D transformations from point correspondences.
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
RGB-D object recognition and localization with clutter and occlusions Federico Tombari, Samuele Salti, Luigi Di Stefano Computer Vision Lab – University.
Extended Gaussian Images
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
Fitting: The Hough transform
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
1 Matching in 2D engine model image containing an instance of the model Is there an engine in the image? If so, where is it located?
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Generic Object Recognition -- by Yatharth Saraf A Project on.
3D Models and Matching representations for 3D object models
A Study of Approaches for Object Recognition
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
3-D Object Recognition From Shape Salvador Ruiz Correa Department of Electrical Engineering.
Distinctive Image Feature from Scale-Invariant KeyPoints
1 3D Models and Matching representations for 3D object models particular matching techniques alignment-based systems appearance-based systems GC model.
Object Recognition Using Geometric Hashing
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
Distinctive Image Features from Scale-Invariant Keypoints David G. Lowe – IJCV 2004 Brien Flewelling CPSC 643 Presentation 1.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
00/4/103DVIP-011 Part Three: Descriptions of 3-D Objects and Scenes.
Content-based Image Retrieval (CBIR)
Machine learning & category recognition Cordelia Schmid Jakob Verbeek.
3D Models and Matching representations for 3D object models
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
Recap Low Level Vision –Input: pixel values from the imaging device –Data structure: 2D array, homogeneous –Processing: 2D neighborhood operations Histogram.
Hubert CARDOTJY- RAMELRashid-Jalal QURESHI Université François Rabelais de Tours, Laboratoire d'Informatique 64, Avenue Jean Portalis, TOURS – France.
Computer vision.
Recognizing Deformable Shapes Salvador Ruiz Correa Ph.D. Thesis, Electrical Engineering.
ENT 273 Object Recognition and Feature Detection Hema C.R.
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
Generalized Hough Transform
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
Recognizing Deformable Shapes Salvador Ruiz Correa UW Ph.D. Graduate Researcher at Children’s Hospital.
Recognizing Deformable Shapes Salvador Ruiz Correa (CSE/EE576 Computer Vision I)
Roee Litman, Alexander Bronstein, Michael Bronstein
1 Introduction Computer vision is the analysis of digital images by a computer for such applications as: Industrial: part localization and inspection,
Recognizing Deformable Shapes Salvador Ruiz Correa (CSE/EE576 Computer Vision I)
Object Recognition by Discriminative Combinations of Line Segments and Ellipses Alex Chia ^˚ Susanto Rahardja ^ Deepu Rajan ˚ Maylor Leung ˚ ^ Institute.
776 Computer Vision Jan-Michael Frahm Spring 2012.
Matching Geometric Models via Alignment Alignment is the most common paradigm for matching 3D models to either 2D or 3D data. The steps are: 1. hypothesize.
1 Review and Summary We have covered a LOT of material, spending more time and more detail on 2D image segmentation and analysis, but hopefully giving.
Real-Time Hierarchical Scene Segmentation and Classification Andre Uckermann, Christof Elbrechter, Robert Haschke and Helge Ritter John Grossmann.
SIFT.
Object Recognition I Linda Shapiro EE/CSE 576.
SIFT Scale-Invariant Feature Transform David Lowe
CS262: Computer Vision Lect 09: SIFT Descriptors
Recognizing Deformable Shapes
Matching in 2D Find an instance of an object class or a specific
Paper Presentation: Shape and Matching
CSE373: Data Structures & Algorithms Lecture 23: Applications
Object Recognition in the Dynamic Link Architecture
3D Models and Matching particular matching techniques
Matching in 2D engine model image containing an instance of the model
3D Models and Matching representations for 3D object models
Matching in 2D Readings: Ch. 11: Sec
Application: Geometric Hashing
RIO: Relational Indexing for Object Recognition
Brief Review of Recognition + Context
COMPUTER VISION Introduction
Descriptions of 3-D Objects and Scenes
Object Recognition I Linda Shapiro ECE/CSE 576.
Recognizing Deformable Shapes
Presented by Xu Miao April 20, 2005
Human-object interaction
Presentation transcript:

1 High-Level Computer Vision Detection of classes of objects (faces, motorbikes, trees, cheetahs) in images Detection of classes of objects (faces, motorbikes, trees, cheetahs) in images Recognition of specific objects such as George Bush or machine part #45732 Recognition of specific objects such as George Bush or machine part #45732 Classification of images or parts of images for medical or scientific applications Classification of images or parts of images for medical or scientific applications Recognition of events in surveillance videos Recognition of events in surveillance videos Measurement of distances for robotics Measurement of distances for robotics

2 High-level vision uses techniques from AI. Graph-Matching: A*, Constraint Satisfaction, Branch and Bound Search, Simulated Annealing Graph-Matching: A*, Constraint Satisfaction, Branch and Bound Search, Simulated Annealing Learning Methodologies: Decision Trees, Neural Nets, SVMs, EM Classifier Learning Methodologies: Decision Trees, Neural Nets, SVMs, EM Classifier Probabilistic Reasoning, Belief Propagation, Graphical Models Probabilistic Reasoning, Belief Propagation, Graphical Models

3 Graph Matching for Object Recognition For each specific object, we have a geometric model. For each specific object, we have a geometric model. The geometric model leads to a symbolic model in terms of image features and their spatial relationships. The geometric model leads to a symbolic model in terms of image features and their spatial relationships. An image is represented by all of its features and their spatial relationships. An image is represented by all of its features and their spatial relationships. This leads to a graph matching problem. This leads to a graph matching problem.

4 Model-based Recognition as Graph Matching Let U = the set of model features. Let U = the set of model features. Let R be a relation expressing their spatial relationships. Let R be a relation expressing their spatial relationships. Let L = the set of image features. Let L = the set of image features. Let S be a relation expressing their spatial relationships. Let S be a relation expressing their spatial relationships. The ideal solution would be a subgraph isomorphism f: U-> L satisfying The ideal solution would be a subgraph isomorphism f: U-> L satisfying if (u 1, u 2,..., u n )  R, then (f(u 1 ),f(u 2 ),...,f(u n ))  S if (u 1, u 2,..., u n )  R, then (f(u 1 ),f(u 2 ),...,f(u n ))  S

5 House Example f(S1)=Sj f(S2)=Sa f(S3)=Sb f(S4)=Sn f(S5)=Si f(S6)=Sk f(S7)=Sg f(S8) = Sl f(S9)=Sd f(S10)=Sf f(S11)=Sh P L RP and RL are connection relations. 2D model 2D image

6 But this is too simplistic The model specifies all the features of the object that may appear in the image. The model specifies all the features of the object that may appear in the image. Some of them don’t appear at all, due to occlusion or failures at low or mid level. Some of them don’t appear at all, due to occlusion or failures at low or mid level. Some of them are broken and not recognized. Some of them are broken and not recognized. Some of them are distorted. Some of them are distorted. Relationships don’t all hold. Relationships don’t all hold.

7 TRIBORS: view class matching of polyhedral objects A view-class is a typical 2D view of a 3D object. Each object had 4-5 view classes (hand selected). The representation of a view class for matching included: - triplets of line segments visible in that class - the probability of detectability of each triplet The first version of this program used depth-limited A* search. edges from image model overlayed improved location

8 RIO: Relational Indexing for Object Recognition RIO worked with more complex parts that could have - planar surfaces - cylindrical surfaces - threads

9 Object Representation in RIO 3D objects are represented by a 3D mesh and set of 2D view classes. Each view class is represented by an attributed graph whose nodes are features and whose attributed edges are relationships. For purposes of indexing, attributed graphs are stored as sets of 2-graphs, graphs with 2 nodes and 2 relationships. ellipse coaxial arc cluster share an arc

10 RIO Features ellipses coaxials coaxials-multi parallel lines junctions triples close and far L V Y Z U

11 RIO Relationships share one arc share one line share two lines coaxial close at extremal points bounding box encloses / enclosed by

12 Hexnut Object How are 1, 2, and 3 related? What other features and relationships can you find?

13 Graph and 2-Graph Representations 1 coaxials- multi 3 parallel lines 2 ellipse encloses coaxial e e e c

14 Relational Indexing for Recognition Preprocessing (off-line) Phase for each model view Mi in the database encode each 2-graph of Mi to produce an index store Mi and associated information in the indexed bin of a hash table H

15 Matching (on-line) phase 1.Construct a relational (2-graph) description D for the scene 2.For each 2-graph G of D 3.Select the Mis with high votes as possible hypotheses 4.Verify or disprove via alignment, using the 3D meshes encode it, producing an index to access the hash table H cast a vote for each Mi in the associated bin

16 The Voting Process

17 RIO Verifications 1. The matched features of the hypothesized object are used to determine its pose. 2. The 3D mesh of the object is used to project all its features onto the image. 3. A verification procedure checks how well the object features line up with edges on the image. incorrect hypothesis

18 Use of classifiers is big in computer vision today. 2 Examples: 2 Examples:  Rowley’s Face Detection using neural nets  Our 3D object classification using SVMs

19 Object Detection: Rowley’s Face Finder 1. convert to gray scale 2. normalize for lighting 3. histogram equalization 4. apply neural net(s) trained on 16K images What data is fed to the classifier? 32 x 32 windows in a pyramid structure

20 3D-3D Alignment of Mesh Models to Mesh Data Older Work: match 3D features such as 3D edges and junctions or surface patches More Recent Work: match surface signatures - curvature at a point - curvature histogram in the neighborhood of a point - Medioni’s splashes - Johnson and Hebert’s spin images *

21 The Spin Image Signature P X n   P is the selected vertex. X is a contributing point of the mesh.  is the perpendicular distance from X to P’s surface normal.  is the signed perpendicular distance from X to P’s tangent plane. tangent plane at P

22 Spin Image Construction A spin image is constructed - about a specified oriented point o of the object surface - with respect to a set of contributing points C, which is controlled by maximum distance and angle from o. It is stored as an array of accumulators S( ,  ) computed via: For each point c in C(o) 1. compute  and  for c. 2. increment S ( ,  ) o

23 Spin Image Matching ala Sal Ruiz

24 Sal Ruiz’s Classifier Approach Architecture of Classifiers Numeric Signatures Components Symbolic Signatures Recognition And Classification Of Deformable Shapes

25 Numeric Signatures: Spin Images Rich set of surface shape descriptors. Rich set of surface shape descriptors. Their spatial scale can be modified to include local and non-local surface features. Their spatial scale can be modified to include local and non-local surface features. Representation is robust to scene clutter and occlusions. Representation is robust to scene clutter and occlusions. P Spin images for point P 3-D faces

26 How To Extract Shape Class Components?…… ComponentDetector ComputeNumericSignatures Training Set SelectSeedPoints RegionGrowingAlgorithm Grown components around seeds

27 Labeled Surface Mesh Selected 8 seed points by hand Component Extraction Example RegionGrowing Grow one region at the time (get one detector per component) Detected components on a training sample

28 How To Combine Component Information? … Extracted components on test samples Note: Numeric signatures are invariant to mirror symmetry; our approach preserves such an invariance.

29 Symbolic Signature Symbolic Signature at P Labeled Surface Mesh Matrix storing component labels EncodeGeometricConfiguration Critical Point P

30 Symbolic Signatures Are Robust To Deformations P Relative position of components is stable across deformations: experimental evidence

31 Proposed Architecture (Classification Example)Input LabeledMesh ClassLabel(Abnormal) Verify spatial configuration of the components IdentifySymbolicSignatures IdentifyComponents Two classification stages SurfaceMesh

32 At Classification Time (1) Bank of ComponentDetectors Surface Mesh AssignsComponentLabels Labeled Surface Mesh Multi-wayclassifier Identify Components

33 Labeled Surface Mesh Bank of SymbolicSignaturesDetectors Symbolic pattern for components 1,2,4 At Classification Time (2) Symbolic pattern for components 5,6,8 AssignsSymbolicLabels Two detectors

34 Architecture Implementation ALL our classifiers are (off-the-shelf) ν - Support Vector Machines ( ν - SVMs) (Schölkopf et al., 2000 and 2001). ALL our classifiers are (off-the-shelf) ν - Support Vector Machines ( ν - SVMs) (Schölkopf et al., 2000 and 2001). Component (and symbolic signature) detectors are one-class classifiers. Component (and symbolic signature) detectors are one-class classifiers. Component label assignment: performed with a multi-way classifier that uses pairwise classification scheme. Component label assignment: performed with a multi-way classifier that uses pairwise classification scheme. Gaussian kernel. Gaussian kernel.

35 Shape Classes

36 Task 1: Recognizing Single Objects (2) Snowman: 93%. Snowman: 93%. Rabbit: 92%. Rabbit: 92%. Dog: 89%. Dog: 89%. Cat: 85.5%. Cat: 85.5%. Cow: 92%. Cow: 92%. Bear: 94%. Bear: 94%. Horse: 92.7%. Horse: 92.7%. Human head: 97.7%. Human face: 76%. Recognition rates (true positives) (No clutter, no occlusion, complete models)

37 Task 2-3: Recognition in Complex Scenes (2) ShapeClassTruePositivesFalsePositivesTruePositivesFalsePositives Snowmen91%31%87.5%28% Rabbit90.2%27.6%84.3%24% Dog89.6%34.6%88.12%22.1% Task 2 Task 3

38 Task 2-3: Recognition in Complex Scenes (3)