1
CS654: Digital Image Analysis
Lecture 37: Feature Representation and Matching
Slides adapted from: Kristen Grauman, Derek Hoiem, A. Efros
2
Feature Vectors
A feature vector is a way to represent an image or a part of an image. It is an n-dimensional vector containing a set of values, where each value represents a certain feature. This vector can be used to classify an object or to provide condensed, higher-level information about the image.
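For illustration, a minimal NumPy sketch of one simple feature vector, a normalized intensity histogram of an 8-bit grayscale image; the function name and bin count are illustrative, not from the lecture:

```python
import numpy as np

def histogram_feature_vector(image, n_bins=16):
    """Represent an 8-bit grayscale image (or image patch) by a normalized
    intensity histogram: an n_bins-dimensional feature vector."""
    hist, _ = np.histogram(image, bins=n_bins, range=(0, 256))
    return hist / hist.sum()  # normalize so the vector sums to 1

# Two images give two 16-D feature vectors that can then be compared
img_a = np.random.randint(0, 256, (64, 64))
img_b = np.random.randint(0, 256, (64, 64))
fa, fb = histogram_feature_vector(img_a), histogram_feature_vector(img_b)
```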
3
Distance & Similarity Measures
The feature vector represents the object and is used to classify it. To perform classification, we need to compare two feature vectors. There are two primary approaches: measuring the difference (distance) between the two vectors, or measuring their similarity. Two vectors that are closely related have a small difference and a large similarity.
4
Distance Measures
Difference can be measured by a distance metric in the n-dimensional feature space: the bigger the distance, the greater the difference. Several metrics are in common use:
- Euclidean distance
- Range-normalized Euclidean distance
- City block or absolute value metric
- Maximum value metric
5
Distance Measures
The Euclidean distance is the most common metric for measuring the distance between two vectors. Given two vectors A and B, where
A = [a_1, a_2, ..., a_n] and B = [b_1, b_2, ..., b_n].
6
Distance Measures
The Euclidean distance is given by:
d_E(A, B) = sqrt( (a_1 - b_1)^2 + (a_2 - b_2)^2 + ... + (a_n - b_n)^2 ) = sqrt( Σ_{i=1..n} (a_i - b_i)^2 )
This measure may be biased as a result of the varying ranges of the different components of the vector: one component may range from 1 to 5, another may range from 1 to 5000.
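A minimal NumPy sketch of this formula (the function name and example vectors are illustrative):

```python
import numpy as np

def euclidean_distance(a, b):
    """Euclidean distance between two n-dimensional feature vectors."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return np.sqrt(np.sum((a - b) ** 2))

print(euclidean_distance([1, 2, 3], [4, 6, 3]))  # sqrt(9 + 16 + 0) = 5.0
```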
7
Distance Measures
A difference of 5 is significant on the first component but insignificant on the second. This problem can be rectified by using the range-normalized Euclidean distance:
d(A, B) = sqrt( Σ_{i=1..n} (a_i - b_i)^2 / R_i^2 )
where R_i is the range of the ith component.
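A minimal sketch of the range-normalized version, assuming the per-component ranges R_i are supplied as an array (names and values are illustrative):

```python
import numpy as np

def range_normalized_euclidean(a, b, r):
    """Euclidean distance with each component difference divided by the
    range r[i] of that component (r must be non-zero)."""
    a, b, r = (np.asarray(x, dtype=float) for x in (a, b, r))
    return np.sqrt(np.sum(((a - b) / r) ** 2))

# Component ranges: the first component spans 1..5, the second 1..5000
r = [5 - 1, 5000 - 1]
print(range_normalized_euclidean([1, 10], [5, 10], r))  # 1.0: the difference of 4
                                                        # spans the whole first range
```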
8
Distance Measures
Another distance measure, called the city block or absolute value metric, is defined as follows:
d(A, B) = Σ_{i=1..n} |a_i - b_i|
This metric is computationally faster than the Euclidean distance but gives similar results.
9
Distance Measures
The city block distance can also be range-normalized, which gives the range-normalized city block distance metric, with R_i defined as before:
d(A, B) = Σ_{i=1..n} |a_i - b_i| / R_i
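A minimal sketch covering both the plain and the range-normalized city block distance (names and example values are illustrative):

```python
import numpy as np

def city_block(a, b, r=None):
    """City block (absolute value) distance; pass per-component ranges r
    to get the range-normalized version."""
    d = np.abs(np.asarray(a, dtype=float) - np.asarray(b, dtype=float))
    if r is not None:
        d = d / np.asarray(r, dtype=float)
    return d.sum()

print(city_block([1, 2, 3], [4, 6, 3]))               # 3 + 4 + 0 = 7
print(city_block([1, 2, 3], [4, 6, 3], r=[4, 8, 2]))  # 0.75 + 0.5 + 0 = 1.25
```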
10
Distance Measures
The final distance metric considered here is the maximum value metric, defined by:
d(A, B) = max_{i=1..n} |a_i - b_i|
The normalized version:
d(A, B) = max_{i=1..n} |a_i - b_i| / R_i
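And a matching sketch for the maximum value metric, with the same optional range normalization (names are illustrative):

```python
import numpy as np

def max_value_distance(a, b, r=None):
    """Maximum value metric; pass per-component ranges r for the
    normalized version."""
    d = np.abs(np.asarray(a, dtype=float) - np.asarray(b, dtype=float))
    if r is not None:
        d = d / np.asarray(r, dtype=float)
    return d.max()

print(max_value_distance([1, 2, 3], [4, 6, 3]))  # max(3, 4, 0) = 4
```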
11
Why extract features? Motivation: panorama stitching
We have two images – how do we combine them?
12
Local features: main components
1. Detection: identify the interest points.
2. Description: extract a vector feature descriptor around each interest point.
3. Matching: determine correspondence between descriptors in two views.
13
Characteristics of good features
- Repeatability: the same feature can be found in several images despite geometric and photometric transformations.
- Saliency: each feature is distinctive.
- Compactness and efficiency: many fewer features than image pixels.
- Locality: a feature occupies a relatively small area of the image.
14
Goal: interest operator repeatability
We want to detect (at least some of) the same points in both images, yet we have to be able to run the detection procedure independently in each image. If the detector is not repeatable, there is no chance to find true matches.
15
Goal: descriptor distinctiveness
We want to be able to reliably determine which point goes with which. The descriptor must provide some invariance to geometric and photometric differences between the two views.
16
Applications
Feature points are used for:
- Image alignment
- 3D reconstruction
- Motion tracking
- Robot navigation
- Indexing and database retrieval
- Object recognition
17
Local features: main components
1. Detection: identify the interest points.
2. Description: extract a vector feature descriptor around each interest point.
3. Matching: determine correspondence between descriptors in two views.
18
Many Existing Detectors Available
- Hessian & Harris [Beaudet '78], [Harris '88]
- Laplacian, DoG [Lindeberg '98], [Lowe 1999]
- Harris-/Hessian-Laplace [Mikolajczyk & Schmid '01]
- Harris-/Hessian-Affine [Mikolajczyk & Schmid '04]
- EBR and IBR [Tuytelaars & Van Gool '04]
- MSER [Matas '02]
- Salient Regions [Kadir & Brady '01]
- Others...
19
What points would you choose?
20
Corner Detection: Basic Idea
We should easily recognize the point by looking through a small window. Shifting the window in any direction should give a large change in intensity.
- "Flat" region: no change in any direction.
- "Edge": no change along the edge direction.
- "Corner": significant change in all directions.
21
Corner Detection: Mathematics
Change in appearance of the window w(x, y) for the shift [u, v]. (Figure: image I(x, y), window w(x, y), and the error E(u, v) illustrated at E(3, 2).)
22
Corner Detection: Mathematics
Change in appearance of the window w(x, y) for the shift [u, v]. (Figure: image I(x, y), window w(x, y), and the error E(u, v) illustrated at E(0, 0).)
23
Corner Detection: Mathematics
Change in appearance of the window w(x, y) for the shift [u, v]:
E(u, v) = Σ_{x,y} w(x, y) [ I(x + u, y + v) - I(x, y) ]^2
where w(x, y) is the window function, I(x + u, y + v) is the shifted intensity, and I(x, y) is the intensity. The window function w(x, y) is either a Gaussian or a binary mask (1 in the window, 0 outside).
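A minimal NumPy sketch of this sum for one shift (u, v), assuming a binary window centered at (x0, y0); the window size and names are illustrative:

```python
import numpy as np

def window_error(I, x0, y0, u, v, half=8):
    """E(u, v) for a binary window (1 inside, 0 outside) of size
    (2*half+1)^2 centered at (x0, y0): the sum of squared differences
    between the shifted and the original intensities."""
    I = I.astype(float)
    win = I[y0 - half:y0 + half + 1, x0 - half:x0 + half + 1]
    shifted = I[y0 + v - half:y0 + v + half + 1, x0 + u - half:x0 + u + half + 1]
    return np.sum((shifted - win) ** 2)

# A corner gives a large E(u, v) for shifts in every direction;
# a flat region gives a small E(u, v) for all shifts.
```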
24
Corner Detection: Mathematics
Change in appearance of the window w(x, y) for the shift [u, v]: we want to find out how the function E(u, v) behaves for small shifts.
25
Corner Detection: Mathematics
The quadratic approximation for small shifts simplifies to
E(u, v) ≈ [u v] M [u v]^T
where M is a second moment matrix computed from image derivatives:
M = Σ_{x,y} w(x, y) [ I_x^2    I_x I_y
                      I_x I_y  I_y^2 ]
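A minimal sketch of computing the per-pixel entries of M with NumPy/SciPy; the choice of Sobel derivatives and a Gaussian window of scale sigma is an assumption for illustration:

```python
import numpy as np
from scipy import ndimage

def second_moment_matrix(I, sigma=1.5):
    """Per-pixel entries of the 2x2 second moment matrix, using image
    derivatives smoothed by a Gaussian window w(x, y)."""
    I = I.astype(float)
    Ix = ndimage.sobel(I, axis=1)   # derivative along x (columns)
    Iy = ndimage.sobel(I, axis=0)   # derivative along y (rows)
    Sxx = ndimage.gaussian_filter(Ix * Ix, sigma)
    Syy = ndimage.gaussian_filter(Iy * Iy, sigma)
    Sxy = ndimage.gaussian_filter(Ix * Iy, sigma)
    return Sxx, Syy, Sxy   # M at pixel (y, x) is [[Sxx, Sxy], [Sxy, Syy]]
```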
26
Corners as distinctive interest points
A 2 x 2 matrix of image derivatives (averaged in a neighborhood of a point).
Notation: I_x = ∂I/∂x, I_y = ∂I/∂y, I_x I_y = (∂I/∂x)(∂I/∂y).
27
Interpreting the second moment matrix
Consider a horizontal "slice" of E(u, v):
[u v] M [u v]^T = const
This is the equation of an ellipse.
Diagonalization of M:
M = R^{-1} [ λ_1  0
             0    λ_2 ] R
The axis lengths of the ellipse are determined by the eigenvalues and its orientation is determined by R: the axis along the direction of slowest change has half-length (λ_max)^(-1/2), and the axis along the direction of fastest change has half-length (λ_min)^(-1/2).
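A small sketch of this interpretation for a single 2 x 2 matrix M, using NumPy's eigen-decomposition (the example matrix is illustrative):

```python
import numpy as np

def ellipse_axes(M):
    """Eigen-decomposition of a symmetric 2x2 second moment matrix M.
    For the slice E(u, v) = 1, the ellipse axis half-lengths are
    lambda**(-1/2): the long axis (slowest change) goes with lambda_min,
    the short axis (fastest change) with lambda_max."""
    eigvals, eigvecs = np.linalg.eigh(M)   # ascending: lambda_min, lambda_max
    half_lengths = 1.0 / np.sqrt(eigvals)
    return eigvals, half_lengths, eigvecs

M = np.array([[4.0, 1.0],
              [1.0, 2.0]])                 # illustrative values
print(ellipse_axes(M))
```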
28
Interpreting the eigenvalues
Classification of image points using the eigenvalues of M:
- "Corner": λ_1 and λ_2 are large, λ_1 ~ λ_2; E increases in all directions.
- "Edge": λ_1 >> λ_2 (or λ_2 >> λ_1).
- "Flat" region: λ_1 and λ_2 are small; E is almost constant in all directions.
29
Corner response function
R = det(M) - α (trace M)^2 = λ_1 λ_2 - α (λ_1 + λ_2)^2
where α is a constant (0.04 to 0.06).
- "Corner": R > 0
- "Edge": R < 0
- "Flat" region: |R| is small
(A per-pixel computation is sketched below.)
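Continuing the per-pixel Sxx, Syy, Sxy arrays from the earlier second-moment sketch, the response can be computed as follows (the alpha value is illustrative):

```python
import numpy as np

def harris_response(Sxx, Syy, Sxy, alpha=0.05):
    """Corner response R = det(M) - alpha * trace(M)^2, computed per pixel
    from the smoothed derivative products."""
    det_M = Sxx * Syy - Sxy ** 2
    trace_M = Sxx + Syy
    return det_M - alpha * trace_M ** 2

# R > 0 and large  -> corner
# R < 0            -> edge
# |R| small        -> flat region
```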
30
Harris corner detector
1. Compute the M matrix for each image window to get its cornerness score.
2. Find points whose surrounding window gave a large corner response (R > threshold).
3. Take the points of local maxima, i.e., perform non-maximum suppression.
C. Harris and M. Stephens. "A Combined Corner and Edge Detector." Proceedings of the 4th Alvey Vision Conference, pages 147-151, 1988.
31
Harris Detector
Second moment matrix (autocorrelation matrix):
M = g(σ_I) * [ I_x^2    I_x I_y
               I_x I_y  I_y^2 ]
1. Image derivatives: I_x, I_y
2. Square of derivatives: I_x^2, I_y^2, I_x I_y
3. Gaussian filter g(σ_I): g(I_x^2), g(I_y^2), g(I_x I_y)
4. Cornerness function (both eigenvalues are strong):
   har = det[M] - α [trace(M)]^2 = g(I_x^2) g(I_y^2) - [g(I_x I_y)]^2 - α [g(I_x^2) + g(I_y^2)]^2
5. Non-maxima suppression (the full pipeline is sketched below)
Slide: Derek Hoiem
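A compact sketch tying these five steps together; sigma, alpha, the threshold, and the suppression window size are illustrative assumptions, not values from the slides:

```python
import numpy as np
from scipy import ndimage

def harris_corners(I, sigma=1.5, alpha=0.05, thresh=0.01, nms_size=7):
    """Derivatives -> squared derivatives -> Gaussian smoothing ->
    cornerness -> threshold + non-maximum suppression."""
    I = I.astype(float)
    Ix = ndimage.sobel(I, axis=1)
    Iy = ndimage.sobel(I, axis=0)
    Sxx = ndimage.gaussian_filter(Ix * Ix, sigma)
    Syy = ndimage.gaussian_filter(Iy * Iy, sigma)
    Sxy = ndimage.gaussian_filter(Ix * Iy, sigma)
    R = Sxx * Syy - Sxy ** 2 - alpha * (Sxx + Syy) ** 2   # cornerness
    local_max = (R == ndimage.maximum_filter(R, size=nms_size))
    corners = np.argwhere(local_max & (R > thresh * R.max()))
    return corners   # (row, col) coordinates of detected corners
```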
32
Harris Detector: Steps
33
Harris Detector: Steps
Compute corner response R
34
Harris Detector: Steps
Find points with large corner response: R > threshold.
35
Harris Detector: Steps
Take only the points of local maxima of R.
36
Harris Detector: Steps
37
Invariance and covariance
We want corner locations to be invariant to photometric transformations and covariant to geometric transformations.
- Invariance: the image is transformed and corner locations do not change.
- Covariance: if we have two transformed versions of the same image, features should be detected in corresponding locations.
38
Affine intensity change
I → a I + b
- Only derivatives are used, so the detector is invariant to the intensity shift I → I + b.
- Intensity scaling I → a I scales the derivatives and hence the response R, so a fixed threshold on R is not preserved. (Figure: R plotted against image coordinate x with a threshold line.)
Overall: partially invariant to affine intensity change. A small numeric check is sketched below.
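A small numeric check of this behavior with NumPy (array size and constants are arbitrary):

```python
import numpy as np

# Derivatives are unchanged by an intensity shift I -> I + b,
# but are scaled by an intensity scaling I -> a * I.
I = np.random.rand(5, 5)
a, b = 2.0, 30.0

d_orig = np.gradient(I)
d_shift = np.gradient(I + b)
d_scale = np.gradient(a * I)

print(all(np.allclose(o, s) for o, s in zip(d_orig, d_shift)))      # True: invariant to I + b
print(all(np.allclose(a * o, s) for o, s in zip(d_orig, d_scale)))  # True: scales with a,
                                                                    # so a fixed threshold on R is not invariant
```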
39
Corner location is covariant w.r.t. translation
Image translation: derivatives and the window function are shift-invariant, so corner location is covariant w.r.t. translation.
40
Corner location is covariant w.r.t. rotation
Image rotation: the second moment ellipse rotates but its shape (i.e., its eigenvalues) remains the same, so corner location is covariant w.r.t. rotation.
41
Corner location is not covariant to scaling!
When the image is scaled up but the window size stays fixed, all points along the corner are classified as edges: corner location is not covariant to scaling!
42
Local features: main components
1. Detection: identify the interest points.
2. Description: extract a vector feature descriptor around each interest point.
3. Matching: determine correspondence between descriptors in two views.
43
Geometric transformations
e.g. scale, translation, rotation
44
Photometric transformations
45
Raw patches as local descriptors
The simplest way to describe the neighborhood around an interest point is to write down the list of intensities to form a feature vector. But this is very sensitive to even small shifts and rotations.
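A minimal sketch of such a raw-patch descriptor and an SSD comparison between two of them (window size and names are illustrative):

```python
import numpy as np

def patch_descriptor(I, y, x, half=8):
    """Raw-patch descriptor: flatten the (2*half+1)^2 window of
    intensities around the interest point (y, x) into a feature vector."""
    patch = I[y - half:y + half + 1, x - half:x + half + 1].astype(float)
    return patch.ravel()

def match_ssd(desc_a, desc_b):
    """Compare two descriptors by sum of squared differences:
    smaller means more similar."""
    return np.sum((desc_a - desc_b) ** 2)
```

Matching then amounts to comparing a descriptor from one view against all descriptors from the other view and keeping the smallest SSD, which is exactly why the sensitivity to small shifts and rotations is a problem.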
46
Thank you
Next lecture: Feature descriptors and matching