Kapitel 14 Recognition – p. 1 Recognition Scene understanding / visual object categorization Pose clustering Object recognition by local features Image.

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

Chapter 4 Sampling Distributions and Data Descriptions.
Variations of the Turing Machine
AP STUDY SESSION 2.
1
Kapitel 14 Recognition Scene understanding / visual object categorization Pose clustering Object recognition by local features Image categorization Bag-of-features.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
Myra Shields Training Manager Introduction to OvidSP.
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
Objectives: Generate and describe sequences. Vocabulary:
David Burdett May 11, 2004 Package Binding for WS CDL.
1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.
Local Customization Chapter 2. Local Customization 2-2 Objectives Customization Considerations Types of Data Elements Location for Locally Defined Data.
Process a Customer Chapter 2. Process a Customer 2-2 Objectives Understand what defines a Customer Learn how to check for an existing Customer Learn how.
Custom Statutory Programs Chapter 3. Customary Statutory Programs and Titles 3-2 Objectives Add Local Statutory Programs Create Customer Application For.
CALENDAR.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt BlendsDigraphsShort.
1 Hierarchical Part-Based Human Body Pose Estimation * Ramanan Navaratnam * Arasanathan Thayananthan Prof. Phil Torr * Prof. Roberto Cipolla * University.
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
Break Time Remaining 10:00.
Turing Machines.
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
Bright Futures Guidelines Priorities and Screening Tables
EIS Bridge Tool and Staging Tables September 1, 2009 Instructor: Way Poteat Slide: 1.
Bellwork Do the following problem on a ½ sheet of paper and turn in.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
Computer vision: models, learning and inference
Distinctive Image Features from Scale-Invariant Keypoints David Lowe.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
Adding Up In Chunks.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Artificial Intelligence
Skills for Success with Microsoft® Office 2010
Subtraction: Adding UP
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Analyzing Genes and Genomes
Essential Cell Biology
Converting a Fraction to %
Clock will move after 1 minute
PSSA Preparation.
Essential Cell Biology
Immunobiology: The Immune System in Health & Disease Sixth Edition
Chapter 13 Web Page Design Studio
Physics for Scientists & Engineers, 3rd Edition
Energy Generation in Mitochondria and Chlorplasts
Select a time to count down from the clock above
CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Lecture 28: Bag-of-words models
Bag-of-features models
Keypoint-based Recognition and Object Search
10/31/13 Object Recognition and Augmented Reality Computational Photography Derek Hoiem, University of Illinois Dali, Swans Reflecting Elephants.
Object Recognition and Augmented Reality
Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Keypoint-based Recognition Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/04/10.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
10/31/13 Object Recognition and Augmented Reality Computational Photography Derek Hoiem, University of Illinois Dali, Swans Reflecting Elephants.
776 Computer Vision Jan-Michael Frahm Spring 2012.
By Suren Manvelyan, Crocodile (nile crocodile?) By Suren Manvelyan,
Presentation transcript:

Kapitel 14 Recognition – p. 1 Recognition Scene understanding / visual object categorization Pose clustering Object recognition by local features Image categorization Bag-of-features models Large-scale image search TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA Kapitel 14

Kapitel 14 Recognition – p. 2 Scene understanding Scene categorization outdoor/indoor city/forest/factory/etc.

Kapitel 14 Recognition – p. 3 Scene understanding Object detection find pedestrians

Kapitel 14 Recognition – p. 4 Scene understanding mountain building tree banner market people street lamp sky building

Kapitel 14 Recognition – p. 5 Visual object categorization

Kapitel 14 Recognition – p. 6 Visual object categorization

Kapitel 14 Recognition – p. 7 Visual object categorization

Kapitel 14 Recognition – p. 8 Visual object categorization Recognition is all about modeling variability camera position illumination shape variations within-class variations

Kapitel 14 Recognition – p. 9 Visual object categorization Within-class variations (why are they chairs?)

Kapitel 14 Recognition – p. 10 Pose clustering Working in transformation space - main ideas: Generate many hypotheses of transformation image vs. model, each built by a tuple of image and model features Correct transformation hypotheses appear many times Main steps: Quantize the space of possible transformations For each tuple of image and model features, solve for the optimal transformation that aligns the matched features Record a vote in the corresponding transformation space bin Find "peak" in transformation space

Kapitel 14 Recognition – p. 11 Pose clustering Example: Rotation only. A pair of one scene segment and one model segment suffices to generate a transformation hypothesis.

Kapitel 14 Recognition – p. 12 Pose clustering 2-tuples of image and model corner points are used to generate hypotheses. (a) The corners found in an image. (b) The four best hypotheses found with the edges drawn in. The nose of the plane and the head of the person do not appear because they were not in the models. C.F. Olson: Efficient pose clustering using a randomized algorithm. IJCV, 23(2): , 1997.

Kapitel 14 Recognition – p. 13 Object recognition by local features D.G. Lowe: Distinctive image features from scale-invariant keypoints. IJCV, 60(2): , 2004 The SIFT features of training images are extracted and stored For a query image Extract SIFT features Efficient nearest neighbor indexing Pose clustering by Hough transform For clusters with >2 keypoints (object hypotheses): determine the optimal affine transformation parameters by least squares method; geometry verification Input Image Stored Image

Kapitel 14 Recognition – p. 14 Object recognition by local features Robust feature-based alignment (panorama)

Kapitel 14 Recognition – p. 15 Object recognition by local features Extract features

Kapitel 14 Recognition – p. 16 Object recognition by local features Extract features Compute putative matches

Kapitel 14 Recognition – p. 17 Object recognition by local features Extract features Compute putative matches Loop: Hypothesize transformation T (small group of putative matches that are related by T)

Kapitel 14 Recognition – p. 18 Object recognition by local features Extract features Compute putative matches Loop: Hypothesize transformation T (small group of putative matches that are related by T) Verify transformation (search for other matches consistent with T)

Kapitel 14 Recognition – p. 19 Object recognition by local features Extract features Compute putative matches Loop: Hypothesize transformation T (small group of putative matches that are related by T) Verify transformation (search for other matches consistent with T)

Kapitel 14 Recognition – p. 20 Object recognition by local features Pose clustering by Hough transform (Hypothesize transformation T): Each of the SIFT keypoints of input image specifies 4 parameters: 2D location, scale, and orientation. Each matched SIFT keypoint in the database has a record of the keypoints parameters relative to the training image in which it was found. We can create a Hough transform entry predicting the model location, orientation, and scale from the match hypothesis.

Kapitel 14 Recognition – p. 21 Object recognition by local features Recognition: Cluster of 3 corresponding feature pairs

Kapitel 14 Recognition – p. 22 Object recognition by local features

Kapitel 14 Recognition – p. 23 Object recognition by local features

Kapitel 14 Recognition – p. 24 Image categorization Training Labels Training Images Classifier Training Training Image Features Testing Test Image Trained Classifier Outdoor Prediction

Kapitel 14 Recognition – p. 25 Image categorization Global histogram (distribution): color, texture, motion, … histogram matching distance

Kapitel 14 Recognition – p. 26 Image categorization Cars found by color histogram matching See Chapter Inhaltsbasierte Suche in Bilddatenbanken

Kapitel 14 Recognition – p. 27 Bag-of-features models Object Bag of words

Kapitel 14 Recognition – p. 28 Bag-of-features models Origin 1: Text recognition by orderless document representation (frequencies of words from a dictionary), Salton & McGill (1983) US Presidential Speeches Tag Cloud

Kapitel 14 Recognition – p. 29 Bag-of-features models Origin 2: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures, it is the identity of the textons, not their spatial arrangement, that matters

Kapitel 14 Recognition – p. 30 Bag-of-features models Universal texton dictionary histogram

Kapitel 14 Recognition – p. 31 Bag-of-features models G. Cruska et al.: Visual categorization with bags of keypoints. Proc. of ECCV, We need to build a visual dictionary!

Kapitel 14 Recognition – p. 32 Bag-of-features models Main step 1: Extract features (e.g. SIFT) …

Kapitel 14 Recognition – p. 33 Bag-of-features models Main step 2: Learn visual vocabulary (e.g. using clustering) clustering

Kapitel 14 Recognition – p. 34 Bag-of-features models visual vocabulary (cluster centers)

Kapitel 14 Recognition – p. 35 Bag-of-features models … Source: B. Leibe Appearance codebook Example: learned codebook

Kapitel 14 Recognition – p. 36 Bag-of-features models Main step 3: Quantize features using visual vocabulary

Kapitel 14 Recognition – p. 37 Bag-of-features models Main step 4: Represent images by frequencies of visual words (i.e., bags of features)

Kapitel 14 Recognition – p. 38 Bag-of-features models Example: representation based on learned codebook

Kapitel 14 Recognition – p. 39 Bag-of-features models Main step 5: Apply any classifier to the histogram feature vectors

Kapitel 14 Recognition – p. 40 Bag-of-features models Example:

Kapitel 14 Recognition – p. 41 Bag-of-features models Caltech6 dataset Dictionary quality and size are very important parameters!

Kapitel 14 Recognition – p. 42 Bag-of-features models Caltech6 dataset J.C. Niebles, H. Wang, L. Fei-Fei: Unsupervised learning of human action categories using spatial-tempotal words. IJCV, 79(3): , Action recognition

Kapitel 14 Recognition – p. 43 Large-scale image search Query Results from 5k Flickr images J. Philbin, et al.: Object retrieval with large vocabularies and fast spatial matching. Proc. of CVPR, 2007.

Kapitel 14 Recognition – p. 44 Large-scale image search Mobile tourist guide self-localization object/building recognition photo/video augmentation Aachen Cathedral [Quack, Leibe, Van Gool, CIVR08]

Kapitel 14 Recognition – p. 45 Large-scale image search

Kapitel 14 Recognition – p. 46 Large-scale image search Application: Image auto-annotation Left: Wikipedia image Right: closest match from Flickr Moulin Rouge Tour Montparnasse Colosseum Viktualienmarkt Maypole Old Town Square (Prague)

Kapitel 14 Recognition – p. 47 Sources K. Grauman, B. Leibe: Visual Object Recognition. Morgen & Claypool Publishers, R. Szeliski: Computer Vision: Algorithms and Applications. Springer, (Chapter 14 Recognition) Course materials from others (G. Bebis, J. Hays, S. Lazebnik, …)