The Beauty of Local Invariant Features

Slides:



Advertisements
Similar presentations
CSE 473/573 Computer Vision and Image Processing (CVIP)
Advertisements

TP14 - Local features: detection and description Computer Vision, FCUP, 2014 Miguel Coimbra Slides by Prof. Kristen Grauman.
MIT CSAIL Vision interfaces Approximate Correspondences in High Dimensions Kristen Grauman* Trevor Darrell MIT CSAIL (*) UT Austin…
Instructor: Mircea Nicolescu Lecture 15 CS 485 / 685 Computer Vision.
CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Matching with Invariant Features
Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Computational Photography
Locating and Describing Interest Points
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
1 Interest Operators Find “interesting” pieces of the image –e.g. corners, salient regions –Focus attention of algorithms –Speed up computation Many possible.
Lecture 28: Bag-of-words models
1 Interest Operator Lectures lecture topics –Interest points 1 (Linda) interest points, descriptors, Harris corners, correlation matching –Interest points.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce.
Distinctive Image Feature from Scale-Invariant KeyPoints
Feature extraction: Corners and blobs
Interest Points and Corners Computer Vision CS 143, Brown James Hays Slides from Rick Szeliski, Svetlana Lazebnik, Derek Hoiem and Grauman&Leibe 2008 AAAI.
Bag-of-features models
Automatic Matching of Multi-View Images
Visual Object Recognition Rob Fergus Courant Institute, New York University
Evaluation of features detectors and descriptors based on 3D objects P. Moreels - P. Perona California Institute of Technology.
Image Features: Descriptors and matching
By Suren Manvelyan,
Distinctive Image Features from Scale-Invariant Keypoints David G. Lowe – IJCV 2004 Brien Flewelling CPSC 643 Presentation 1.
Overview Introduction to local features
Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Computer vision.
Final Exam Review CS485/685 Computer Vision Prof. Bebis.
Locating and Describing Interest Points Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/02/10 Acknowledgment: Many keypoint slides.
Recognition and Matching based on local invariant features Cordelia Schmid INRIA, Grenoble David Lowe Univ. of British Columbia.
Overview Harris interest points Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points Evaluation and comparison of different.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Computational Photography Tamara Berg Features. Representing Images Keep all the pixels! Pros? Cons?
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
CSE 185 Introduction to Computer Vision Local Invariant Features.
Evaluation of interest points and descriptors. Introduction Quantitative evaluation of interest point detectors –points / regions at the same relative.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
A Sparse Texture Representation Using Affine-Invariant Regions Svetlana Lazebnik, Jean Ponce Svetlana Lazebnik, Jean Ponce Beckman Institute University.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Harris Corner Detector & Scale Invariant Feature Transform (SIFT)
Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison.
Feature extraction: Corners and blobs. Why extract features? Motivation: panorama stitching We have two images – how do we combine them?
Features, Feature descriptors, Matching Jana Kosecka George Mason University.
CSE 185 Introduction to Computer Vision Feature Matching.
Project 3 questions? Interest Points and Instance Recognition Computer Vision CS 143, Brown James Hays 10/21/11 Many slides from Kristen Grauman and.
Local features: detection and description
CS654: Digital Image Analysis
CS654: Digital Image Analysis
CSE 185 Introduction to Computer Vision Local Invariant Features.
Representing, Learning, and Recognizing Non-Rigid Textures and Texture Categories Svetlana LazebnikCordelia SchmidJean Ponce Beckman InstituteGravir LaboratoryBeckman.
Corners and interest points
TP12 - Local features: detection and description
Source: D. Lowe, L. Fei-Fei Recap: edge detection Source: D. Lowe, L. Fei-Fei.
Local features: detection and description May 11th, 2017
Paper Presentation: Shape and Matching
CS 1674: Intro to Computer Vision Scene Recognition
CSE 455 – Guest Lectures 3 lectures Contact Interest points 1
Brief Review of Recognition + Context
SIFT keypoint detection
Lecture VI: Corner and Blob Detection
Recognition and Matching based on local invariant features
Some slides: courtesy of David Jacobs
Presentation transcript:

The Beauty of Local Invariant Features Svetlana Lazebnik Beckman Institute, University of Illinois at Urbana-Champaign IMA Recognition Workshop University of Minnesota May 22, 2006

What are Local Invariant Features? Descriptors of image patches that are invariant to certain classes of geometric and photometric transformations Lowe (2004)

A Historical Perspective ACRONYM: Brooks and Binford (1981) Alignment: Huttenlocher & Ullman (1987) Invariants: Rothwell et al. (1992) Model-based methods: local shape, no appearance information Appearance-based methods: global appearance, no local shape Eigenfaces: Turk & Pentland (1991) Appearance manifolds: Murase & Nayar (1995) Color histograms: Swain & Ballard (1990) Local invariant features: local shape + appearance pattern +

Feature Detection and Description invariant description 3. Compute appearance descriptors SIFT: Lowe (2004) 1. Detect regions 2. Normalize regions covariant detection

Advantages Locality Repeatability Distinctiveness Invariance Robustness to clutter and occlusion Repeatability The same feature occurs in multiple images of the same scene or class Distinctiveness Salient appearance pattern that provides strong matching constraints Invariance Allow matching despite scale changes, rotations, viewpoint changes Sparseness Relatively few features per image, compact and efficient representation Flexibility Many existing types of detectors, descriptors

Scale-Covariant Detectors Laplacian, Hessian, Difference-of-Gaussian (blobs) Lindeberg (1998), Lowe (1999, 2004) Harris-Laplace (corners) Mikolajczyk & Schmid (2001)

Scale-Covariant Detectors Salient (high entropy) regions Kadir & Brady (2001) Circular edge-based regions Jurie & Schmid (2003)

Affine-Covariant Detectors Laplacian, Hessian-Affine (blobs) Gårding & Lindeberg (1996), Mikolajczyk et al. (2004) Harris-Affine (corners) Mikolajczyk & Schmid (2002)

Affine-Covariant Detectors Edge- and intensity-based regions Tuytelaars & Van Gool (2004) Maximally stable extremal regions (MSER) Matas et al. (2002)

Types of Descriptors Differential invariants Koenderink & Van Doorn (1987), Florack et al. (1991) Filter banks: complex, Gabor, steerable, … Multidimensional histograms PCA-SIFT: Ke & Sukthankar (2004) GLOH: Mikolajczyk & Schmid (2004) Johnson & Hebert (1999) Lazebnik, Schmid & Ponce (2003) Lowe (1999, 2004) Belongie, Malik & Puzicha (2002)

Applications (1) Wide-baseline matching and recognition of specific objects Tuytelaars & Van Gool (2004) Ferrari, Tuytelaars & Van Gool (2005) Lowe (2004) Rothganger, Lazebnik, Schmid & Ponce (2005)

Lazebnik, Schmid & Ponce (2004) Applications (2) Category-level recognition based on geometric correspondence Lazebnik, Schmid & Ponce (2004) Berg, Berg & Malik (2005)

Applications (3) Learning parts and visual vocabularies Constellation model Csurka, Dance, Fan, Willamowski & Bray (2004) Dorko & Schmid (2005) Sivic, Russell, Efros, Zisserman & Freeman (2005) Sivic & Zisserman (2003) Bag of features Fergus, Perona & Zisserman (2003) Weber, Welling & Perona (2000)

Lazebnik, Schmid & Ponce (2005) Applications (4) Building global image models invariant to a wide range of deformations Lazebnik, Schmid & Ponce (2005)

Comparative Evaluations Flat scenes Mikolajczyk & Schmid (2004), Mikolajczyk et al. (2004) MSER and Hessian regions have the highest repeatability Harris and Hessian regions provide the most correspondences SIFT (GLOH, PCA-SIFT) descriptors have the highest performance 3D objects Moreels & Perona (2006) Features on 3D objects are much more unstable than on planar objects All detectors and descriptors perform poorly for viewpoint changes > 30° Hessian with SIFT or shape context perform best

Comparative Evaluations Object classes Mikolajczyk, Liebe & Schiele (2005) Hessian regions with GLOH perform best Salient regions work well for object classes Texture and object classes Zhang, Marszalek, Lazebnik & Schmid (2005) Laplacian regions with SIFT perform best Combining multiple detectors and descriptors improves performance Scale+rotation invariance is sufficient for most datasets

Sparse vs. Dense Features: UIUC texture dataset 25 classes, 40 samples each Lazebnik, Schmid & Ponce (2005)

Sparse vs. Dense Features: UIUC texture dataset Multi-class classification accuracy vs. training set size Invariant local features SVM Non-invariant dense patches NN Baseline (global features) SVM NN A system with intrinsically invariant features can learn from fewer training examples Zhang, Marszalek, Lazebnik & Schmid (2005)

Sparse vs. Dense Features: CUReT dataset Dana, van Ginneken, Nayar, and Koenderink (1999) 61 classes, 92 samples each, 43 training Non-invariant features (SVM) Non-invariant features (NN) Invariant local features (SVM) Baseline – global features Invariant local features (NN) Relative Strengths Sparse locally invariant features: Dense non-invariant features: High-resolution images Low-resolution images Non-homogeneous patterns Homogeneous, high-frequency patterns Viewpoint changes Lighting changes

Anticipating Criticism Existing local features are not ideal for category-level recognition and scene understanding Designed for wide-baseline matching and specific object recognition Describe texture and albedo pattern, not shape Do not explain the whole image A little invariance goes a long way It is best to use features with the lowest level of invariance required by a given task Scale+rotation is sufficient for most datasets Zhang, Marszalek, Lazebnik & Schmid (2005) Denser sets of local features are more effective Hessian detector produces the most regions and performs best in several evaluations Regular grid of fixed-size patches is best for scene category recognition Fei-Fei & Perona (2005)

Future Work Systematic evaluation of sparse vs. dense features Combining sparse and dense representations, e.g., keypoints and segments Russell, Efros, Sivic, Freeman & Zisserman (2006) Learning detectors and descriptors automatically Developing shape-based features