CSCE 641 Computer Graphics: Image Registration Jinxiang Chai
Review Image warping Image morphing
Image Warping Warping function - similarity, affine, projective etc Image warping - forward warping and two-pass 1D warping - backward warping Resampling filter - point sampling - bilinear filter - anisotropic filter x y u v S(x,y) T(u,v) forward Inverse
Image Morphing Point based image morphing Vector based image morphing
Image Registration Image warping: given h and f, compute g g(x) = f(h(x)) h f g? Image registration: given f and g, compute h h? f g
Why Image Registration? Lots of uses –Correct for camera jitter (stabilization) –Align images (mosaics) –View morphing –Special effects –Image based modeling/rendering –Etc. [Seitz 96]
Image Registration How do we align two images automatically? Two broad approaches: –Feature-based alignment Find a few matching features in both images compute alignment –Direct (pixel-based) alignment Search for alignment where most pixels agree
Outline Image registration - feature-based approach - pixel-based approach
Readings Bergen et al. Hierarchical model-based motion estimation. ECCV’92, pp. 237–252. Shi, J. and Tomasi, C. (1994). Good features to track. In CVPR’94, pp. 593–600. Baker, S. and Matthews, I. (2004). Lucas-kanade 20 years on: A unifying framework. IJCV, 56(3), 221–255.
Outline Image registration - feature-based approach - pixel-based approach
Feature-based Alignment 1.Find a few important features (aka Interest Points) 2.Match them across two images 3.Compute image transformation function h
Feature-based Alignment 1.Find a few important features (aka Interest Points) 2.Match them across two images 3.Compute image transformation function h How to choose features –Choose only the points (“features”) that are salient, i.e. likely to be there in the other image –How to find these features?
Feature-based Alignment 1.Find a few important features (aka Interest Points) 2.Match them across two images 3.Compute image transformation function h How to choose features –Choose only the points (“features”) that are salient, i.e. likely to be there in the other image –How to find these features? windows where has two large eigenvalues Harris Corner detector
Feature Detection -Two images taken at the same place with different angles - Projective transformation H 3X3
Feature Matching ? -Two images taken at the same place with different angles - Projective transformation H 3X3
Feature Matching ? -Two images taken at the same place with different angles - Projective transformation H 3X3 How do we match features across images? Any criterion?
Feature Matching ? -Two images taken at the same place with different angles - Projective transformation H 3X3 How do we match features across images? Any criterion?
Feature Matching Intensity/Color similarity –The intensity of pixels around the corresponding features should have similar intensity
Feature Matching Intensity/Color similarity –The intensity of pixels around the corresponding features should have similar intensity –Cross-correlation, SSDCross-correlationSSD
Feature Matching Intensity/Color similarity –The intensity of pixels around the corresponding features should have similar intensity –Cross-correlation, SSDCross-correlationSSD Distance constraint –The displacement of features should be smaller than a given threshold
Feature-space Outlier Rejection bad Good
Feature-space Outlier Rejection Can we now compute H 3X3 from the blue points?
Feature-space Outlier Rejection Can we now compute H 3X3 from the blue points? –No! Still too many outliers…
Feature-space Outlier Rejection Can we now compute H 3X3 from the blue points? –No! Still too many outliers… –What can we do?
Feature-space Outlier Rejection Can we now compute H 3X3 from the blue points? –No! Still too many outliers… –What can we do? Robust estimation!
Robust EstimationRobust Estimation: A Toy Example How to fit a line based on a set of 2D points?
Robust EstimationRobust Estimation: A Toy Example How to fit a line based on a set of 2D points? RANSACRANSAC: an iterative method to estimate parameters of a mathematical model from a set of observed data which contains outliersoutliers
RANSACRANSAC: RANdom SAmple Consensus Objective Robust fit of model to data set S which contains outliers Algorithm (i)Randomly select a sample of s data points from S and instantiate the model from this subset. (ii)Determine the set of data points S i which are within a distance threshold t of the model. The set S i is the consensus set of samples and defines the inliers of S. (iii)If the subset of S i is greater than some threshold T, re- estimate the model using all the points in S i and terminate (iv)If the size of S i is less than T, select a new subset and repeat the above. (v)After N trials the largest consensus set S i is selected, and the model is re-estimated using all the points in the subset S i
RANSAC Repeat M times: –Sample minimal number of matches to estimate two view relation (affine, perspective, etc). –Calculate number of inliers or posterior likelihood for relation. –Choose relation to maximize number of inliers.
RANSAC Line Fitting Example Task: Estimate best line
RANSAC Line Fitting Example Sample two points
RANSAC Line Fitting Example Fit Line
RANSAC Line Fitting Example Total number of points within a threshold of line.
RANSAC Line Fitting Example Repeat, until get a good result
RANSAC Line Fitting Example Repeat, until get a good result
RANSAC Line Fitting Example Repeat, until get a good result
How Many Samples? Choose N so that, with probability p, at least one random sample is free from outliers. e.g. p=0.99 proportion of outliers e s5%10%20%25%30%40%50%
How Many Samples? Choose N so that, with probability p, at least one random sample is free from outliers. e.g. p=0.99 proportion of outliers e s5%10%20%25%30%40%50% Affine transform
How Many Samples? Choose N so that, with probability p, at least one random sample is free from outliers. e.g. p=0.99 proportion of outliers e s5%10%20%25%30%40%50% Projective transform
RANSAC for Estimating Projective Transformation RANSAC loop: 1.Select four feature pairs (at random) 2.Compute the transformation matrix H (exact) 3.Compute inliers where SSD(p i ’, H p i ) < ε 4.Keep largest set of inliers 5.Re-compute least-squares H estimate on all of the inliers
RANSAC
Feature-based Registration Works for small or large motion Model the motion within a patch or whole image using a parametric transformation model
Feature-based Registration Works for small or large motion Model the motion within a patch or whole image using a parametric transformation model How to deal with motions that cannot be described by a small number of parameters?
Outline Image registration - feature-based approach - pixel-based approach
Direct (pixel-based) Alignment : Optical flow Will start by estimating motion of each pixel separately Then will consider motion of entire image
Problem Definition: Optical FlowOptical Flow How to estimate pixel motion from image H to image I?
Problem Definition: Optical FlowOptical Flow How to estimate pixel motion from image H to image I? –Solve pixel correspondence problem given a pixel in H, look for nearby pixels of the same color in I
Problem Definition: Optical FlowOptical Flow How to estimate pixel motion from image H to image I? –Solve pixel correspondence problem given a pixel in H, look for nearby pixels of the same color in I Key assumptions –color constancy: a point in H looks the same in I For grayscale images, this is brightness constancy –small motion: points do not move very far This is called the optical flow problem
Optical Flow Constraints Let’s look at these constraints more closely –brightness constancy: Q: what’s the equation?
Optical Flow Constraints Let’s look at these constraints more closely –brightness constancy: Q: what’s the equation? H(x,y) - I(x+u,v+y) = 0
Optical Flow Constraints Let’s look at these constraints more closely –brightness constancy: Q: what’s the equation? –small motion: (u and v are less than 1 pixel) suppose we take the Taylor series expansion of I: H(x,y) - I(x+u,v+y) = 0
Optical Flow Constraints Let’s look at these constraints more closely –brightness constancy: Q: what’s the equation? –small motion: (u and v are less than 1 pixel) suppose we take the Taylor series expansion of I: H(x,y) - I(x+u,v+y) = 0
Optical Flow Equation Combining these two equations
Optical Flow Equation Combining these two equations
Optical Flow Equation Combining these two equations
Optical Flow Equation Combining these two equations
Optical Flow Equation Combining these two equations In the limit as u and v go to zero, this becomes exact
Optical Flow Equation How many unknowns and equations per pixel?
Optical Flow Equation How many unknowns and equations per pixel? Intuitively, what does this constraint mean?
Optical Flow Equation How many unknowns and equations per pixel? Intuitively, what does this constraint mean? –The component of the flow in the gradient direction is determined –The component of the flow parallel to an edge is unknown
Optical Flow Equation How many unknowns and equations per pixel? Intuitively, what does this constraint mean? –The component of the flow in the gradient direction is determined –The component of the flow parallel to an edge is unknown
Aperture Problem
Stripes moved upwards 6 pixels Stripes moved left 5 pixels
Aperture Problem Stripes moved upwards 6 pixels Stripes moved left 5 pixels How to address this problem?
Solving the Aperture Problem How to get more equations for a pixel? –Basic idea: impose additional constraints most common is to assume that the flow field is smooth locally one method: pretend the pixel’s neighbors have the same (u,v) –If we use a 5x5 window, that gives us 25 equations per pixel!
RGB Version How to get more equations for a pixel? –Basic idea: impose additional constraints most common is to assume that the flow field is smooth locally one method: pretend the pixel’s neighbors have the same (u,v) –If we use a 5x5 window, that gives us 25 equations per pixel!
Lukas-Kanade Flow Prob: we have more equations than unknowns
Lukas-Kanade Flow Prob: we have more equations than unknowns Solution: solve least squares problem
Lukas-Kanade Flow Prob: we have more equations than unknowns Solution: solve least squares problem –minimum least squares solution given by solution (in d) of:
Lukas-Kanade Flow –The summations are over all pixels in the K x K window –This technique was first proposed by Lukas & Kanade (1981)
Lukas-Kanade Flow When is this Solvable? A T A should be invertible A T A should not be too small due to noise –eigenvalues 1 and 2 of A T A should not be too small A T A should be well-conditioned – 1 / 2 should not be too large ( 1 = larger eigenvalue)
Edge Bad for motion estimation - large 1, small 2
Low Texture Region Bad for motion estimation: - gradients have small magnitude - small 1, small 2
High Textured Region Good for motion estimation: - gradients are different, large magnitudes - large 1, large 2
Observation This is a two image problem BUT –Can measure sensitivity by just looking at one of the images! –This tells us which pixels are easy to track, which are hard very useful later on when we do feature tracking...
Errors in Lucas-Kanade What are the potential causes of errors in this procedure? –Suppose A T A is easily invertible –Suppose there is not much noise in the image
Errors in Lucas-Kanade What are the potential causes of errors in this procedure? –Suppose A T A is easily invertible –Suppose there is not much noise in the image When our assumptions are violated –Brightness constancy is not satisfied –The motion is not small –A point does not move like its neighbors window size is too large what is the ideal window size?
Iterative Refinement Iterative Lukas-Kanade Algorithm 1.Estimate velocity at each pixel by solving Lucas- Kanade equations 2.Warp H towards I using the estimated flow field - use image warping techniques 3.Repeat until convergence
Revisiting the Small Motion Assumption Is this motion small enough? –Probably not—it’s much larger than one pixel (2 nd order terms dominate) –How might we solve this problem?
Reduce the Resolution!
image I image H Gaussian pyramid of image HGaussian pyramid of image I image I image H u=10 pixels u=5 pixels u=2.5 pixels u=1.25 pixels Coarse-to-fine Optical Flow Estimation
image I image J Gaussian pyramid of image HGaussian pyramid of image I image I image H Coarse-to-fine Optical Flow Estimation run iterative L-K Upsample & warp......
Beyond Translation So far, our patch can only translate in (u,v) What about other motion models? –rotation, affine, perspective
Warping Function w(x;p) describes the geometric relationship between two images: Template Image H(x)Input Image I(x) x
Warping Functions Translation: Affine: Perspective:
Image Registration Find the warping parameter p that minimizes the intensity difference between template image and the warped input image
Image Registration Find the warping parameter p that minimizes the intensity difference between template image and the warped input image Mathematically, we can formulate this as an optimization problem:
Image Registration Find the warping parameter p that minimizes the intensity difference between template image and the warped input image Mathematically, we can formulate this as an optimization problem: The above problem can be solved by many optimization algorithm: - Steepest descentSteepest descent - Gauss-newtonGauss-newton - Levenberg-marquardt, etc. Levenberg-marquardt
Image Registration Find the warping parameter p that minimizes the intensity difference between template image and the warped input image Mathematically, we can formulate this as an optimization problem: The above problem can be solved by many optimization algorithm: - Steepest descent - Gauss-newton - Levenberg-marquardt, etc
Image Registration Mathematically, we can formulate this as an optimization problem:
Image Registration Mathematically, we can formulate this as an optimization problem: Taylor series expansion Similar to optical flow:
Image Registration Mathematically, we can formulate this as an optimization problem: Taylor series expansion Similar to optical flow:
Image Registration Mathematically, we can formulate this as an optimization problem: Taylor series expansion Image gradient Similar to optical flow:
Image Registration Mathematically, we can formulate this as an optimization problem: Taylor series expansion Image gradient translation affine …… Similar to optical flow: Jacobian matrix
Gauss-newton Optimization Rearrange
Gauss-newton Optimization Rearrange
Gauss-newton Optimization Rearrange Ab
Gauss-newton Optimization Rearrange A ATbATb b (A T A) -1
Lucas-Kanade Registration Initialize p=p 0 : Iterate: 1. Warp I with w(x;p) to compute I(w(x;p))
Lucas-Kanade Registration Initialize p=p 0 : Iterate: 1. Warp I with w(x;p) to compute I(w(x;p)) 2. Compute the error image
Lucas-Kanade Registration Initialize p=p 0 : Iterate: 1. Warp I with w(x;p) to compute I(w(x;p)) 2. Compute the error image 3. Warp the gradient with w(x;p)
Lucas-Kanade Registration Initialize p=p 0 : Iterate: 1. Warp I with w(x;p) to compute I(w(x;p)) 2. Compute the error image 3. Warp the gradient with w(x;p) 4. Evaluate the Jacobian at (x;p)
Lucas-Kanade Registration Initialize p=p 0 : Iterate: 1. Warp I with w(x;p) to compute I(w(x;p)) 2. Compute the error image 3. Warp the gradient with w(x;p) 4. Evaluate the Jacobian at (x;p) 5. Compute the
Lucas-Kanade Registration Initialize p=p 0 : Iterate: 1. Warp I with w(x;p) to compute I(w(x;p)) 2. Compute the error image 3. Warp the gradient with w(x;p) 4. Evaluate the Jacobian at (x;p) 5. Compute the 6. Update the parameters
Lucas-Kanade Algorithm Iteration 1: H(x ) I(w(x;p)) H(x)-I(w(x;p))
Lucas-Kanade Algorithm Iteration 2: H(x ) I(w(x;p)) H(x)-I(w(x;p))
Lucas-Kanade Algorithm Iteration 3: H(x ) I(w(x;p)) H(x)-I(w(x;p))
Lucas-Kanade Algorithm Iteration 4: H(x ) I(w(x;p)) H(x)-I(w(x;p))
Lucas-Kanade Algorithm Iteration 5: H(x ) I(w(x;p)) H(x)-I(w(x;p))
Lucas-Kanade Algorithm Iteration 6: H(x ) I(w(x;p)) H(x)-I(w(x;p))
Lucas-Kanade Algorithm Iteration 7: H(x ) I(w(x;p)) H(x)-I(w(x;p))
Lucas-Kanade Algorithm Iteration 8: H(x ) I(w(x;p)) H(x)-I(w(x;p))
Lucas-Kanade Algorithm Iteration 9: H(x ) I(w(x;p)) H(x)-I(w(x;p))
Lucas-Kanade Algorithm Final result: H(x ) I(w(x;p)) H(x)-I(w(x;p))
How to Break Assumptions Small motion Constant optical flow in the window Color constancy
Break the color constancy How to deal with illumination change? - linear models - can model gain and bias (H 1 =H 0, H 2 = const., other zeros)
Linear Model Can also model the appearance of a face under different illumination using a linear combination of base images (PCA): mean face lighting variation
Image Registration Similarly, we can formulate this as an optimization problem:
Image Registration Similarly, we can formulate this as an optimization problem: For iterative registration, we have
Image Registration Similarly, we can formulate this as an optimization problem: Taylor series expansion For iterative registration, we have
Gauss-newton optimization
letsimilarly
Gauss-newton optimization letsimilarly
Gauss-newton optimization letsimilarly Jacobian matrixError image
Gauss-newton optimization letsimilarly Jacobian matrixError image Update equation:
Results with Illumination Changes [Hagar and Belhumeur 98]
Image Alignment Image mosaicing
Lucas-Kanade for Image Alignment Pros: –All pixels get used in matching –Can get sub-pixel accuracy (important for good mosaicing!) –Relatively fast and simple Cons: –Prone to local minima –Relative small movement
Beyond 2D Registration So far, we focus on registration between 2D images The same idea can be used in registration between 3D and 2D, and between 3D and 3D
3D-to-2D Registration The transformation between 3D object and 2D images
3D-to-2D Registration From world coordinate to image coordinate u0u0 v0v s y 0 sxsx 0u v 1 Perspective projection View transformation Viewport projection
3D-to-2D Registration From world coordinate to image coordinate u0u0 v0v s y 0 sxsx 0u v 1 Perspective projection View transformation Viewport projection Camera parameters3D points 2D projections
3D-to-2D Registration Similarly, we can formulate this as an optimization problem:
3D-to-2D Registration Similarly, we can formulate this as an optimization problem: 2D imageProjected image
Applications Tracking 3D head motion in real time Video-based motion capture
3D-to-2D Registration Similarly, we can formulate this as an optimization problem: Iteration equation: 2D imageProjected image
3D-to-2D Registration Similarly, we can formulate this as an optimization problem: Iteration equation: 2D imageProjected image What’s ?