
1 Statistical Models of Appearance for Computer Vision Speaker: 虞台文

2 Content Overview, Statistical Shape Models, Statistical Appearance Models, Active Shape Models, Active Appearance Models, Comparison: ASM vs. AAM, Conclusion

3 Statistical Models of Appearance for Computer Vision Overview

4 Computer Vision Goal – Image understanding Challenge – Deformability of objects Statistical model-based approach – Shape model – Appearance model – Model matching Image interpretation

5 Deformable Models General – capable of generating any plausible example of the class they represent Specific – only capable of generating legal examples

6 Statistical Models of Appearance for Computer Vision Statistical Shape Models

7 Shapes The shape of an object is represented by a set of n points in any dimension Invariance under some transformations – in 2 or 3 dimensions: translation, rotation, scaling – together called a similarity transformation

8 Hand-Annotated Training Set The training set typically comes from hand annotation of a set of training images

9 Suitable Landmarks Points on the object boundary, points of high curvature, T-junctions, and equally spaced intermediate points

10 Suitable Landmarks

11 Shape Vector A d-dim shape with n landmarks is represented by a vector with nd elements. In a 2D image with n landmark points, a shape vector x is then x = (x_1, …, x_n, y_1, …, y_n)^T

12 Training Set of Shape Vectors A d-dim shape with n landmarks is represented by a vector with nd elements. In a 2D image with n landmark points, a shape vector x is then x = (x_1, …, x_n, y_1, …, y_n)^T. The shape vectors in the training set should be in the same coordinate frame – alignment of the training set is required

13 Aligning the Training Set

14 Procrustes Analysis Minimize D = Σ_i |x_i − x̄|², the sum of squared distances of each aligned shape x_i to the mean shape x̄

15 Alignment : Iterative Approach

16

17

18 Possible Alignment Approaches: Center -> scale ( |x| =1) -> rotation Center -> (scale + rotation) Center -> (scale + rotation) -> projection onto tangent space of the mean

19 Alignment : Iterative Approach Possible Alignment Approaches: Center -> scale ( |x| =1) -> rotation Center -> (scale + rotation) Center -> (scale + rotation) -> projection onto tangent space of the mean

20 Alignment : Iterative Approach Possible Alignment Approaches: Center -> scale ( |x| = 1) -> rotation Center -> (scale + rotation) Center -> (scale + rotation) -> projection onto tangent space of the mean Tangent space to x_t – all vectors x s.t. (x_t − x) · x_t = 0, or x · x_t = 1 if x_t · x_t = 1 Obtained by aligning the shapes with the mean, allowing scaling and rotation, then projecting into the tangent space by scaling x by 1/(x · x̄)
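A minimal sketch of the first variant (center, scale to |x| = 1, then rotate to the current mean), assuming 2D shapes stored as n×2 NumPy arrays; the tangent-space projection described above is omitted here:

import numpy as np

def align_to(x, ref):
    # Centre the shape, scale to unit norm, then find the rotation that best maps it onto ref
    x = x - x.mean(axis=0)
    x = x / np.linalg.norm(x)
    u, _, vt = np.linalg.svd(x.T @ ref)       # orthogonal Procrustes solution
    return x @ (u @ vt)                       # (ignores the rare reflection case)

def align_training_set(shapes, n_iter=10):
    # Iteratively re-align all shapes to the current estimate of the mean shape
    shapes = [s - s.mean(axis=0) for s in shapes]
    mean = shapes[0] / np.linalg.norm(shapes[0])
    for _ in range(n_iter):
        shapes = [align_to(s, mean) for s in shapes]
        mean = np.mean(shapes, axis=0)
        mean = mean / np.linalg.norm(mean)    # re-normalise so the mean keeps unit scale
    return shapes, mean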

21 Alignment : Iterative Approach

22

23 Modeling Shape Variation Parametric Shape Model Estimate the distribution of b – Generating new shapes – Examining new shapes (plausibility)

24 PCA 1. Compute the mean x̄ = (1/N) Σ_i x_i 2. Compute the covariance matrix S = (1/(N−1)) Σ_i (x_i − x̄)(x_i − x̄)^T 3. Compute the eigenvectors φ_i and corresponding eigenvalues λ_i of S, sorted so that λ_1 ≥ λ_2 ≥ …

25 Shape Approximation A plausible shape can be approximated by x ≈ x̄ + Φb, where x̄ is the mean shape, Φ = (φ_1 | φ_2 | … | φ_t) contains the first t eigenvectors, and b is the parameter vector of the deformable model

26 Shape Approximation The parameter vector of the deformable model is b = Φ^T (x − x̄), with each b_i constrained (e.g., |b_i| ≤ 3√λ_i). The generated shape is then similar to those in the training set (plausible).

27 Choice of Number of Modes ( t = ? ) Let f_v be the proportion of the total variation one wishes to explain (e.g., 98%). The total variance V_T = Σ_i λ_i is the sum of all eigenvalues, assuming λ_1 ≥ λ_2 ≥ … ≥ λ_T. Then, we choose the smallest t s.t. Σ_{i=1}^{t} λ_i ≥ f_v V_T
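A minimal sketch of the PCA step (slide 24) together with this choice of t, assuming the aligned shape vectors are stacked as the rows of a NumPy array:

import numpy as np

def build_shape_model(X, f_v=0.98):
    # X: (N, nd) matrix of aligned shape vectors, one training example per row
    x_mean = X.mean(axis=0)
    S = np.cov(X, rowvar=False)                        # (nd, nd) covariance matrix
    eigvals, eigvecs = np.linalg.eigh(S)               # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1]                  # sort so lambda_1 >= lambda_2 >= ...
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # smallest t whose cumulative variance reaches f_v of the total
    t = int(np.searchsorted(np.cumsum(eigvals) / eigvals.sum(), f_v)) + 1
    return x_mean, eigvecs[:, :t], eigvals[:t]         # x_mean, Phi, retained eigenvalues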

28 Examples of Shape Models Some members in the training set (18 hands) Each is represented by 72 landmark points

29 Examples of Shape Models

30 Some members in the training set (300 faces) Each is represented by 133 landmark points

31 Examples of Shape Models

32 Generating Plausible Shapes x = x̄ + Φb, where b is the parameter vector of the deformable model Assumption : the b_i ’s are independent and Gaussian Options Hard limits on the independent b_i ’s (e.g., |b_i| ≤ 3√λ_i) Constrain b to lie in a hyperellipsoid (Σ_i b_i²/λ_i ≤ M_t)
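Both options take only a few lines; a sketch, assuming phi (the matrix of retained eigenvectors) and eigvals come from a model such as the one above, with illustrative limits k = 3 and m_max = 3:

import numpy as np

def constrain_hard(b, eigvals, k=3.0):
    # Clip each b_i independently to +/- k * sqrt(lambda_i)
    limit = k * np.sqrt(eigvals)
    return np.clip(b, -limit, limit)

def constrain_hyperellipsoid(b, eigvals, m_max=3.0):
    # Rescale b so that sum(b_i^2 / lambda_i) <= m_max^2
    m = np.sqrt(np.sum(b ** 2 / eigvals))
    return b if m <= m_max else b * (m_max / m)

def generate_shape(x_mean, phi, b):
    # x = x_mean + Phi b: a plausible shape whenever b is constrained as above
    return x_mean + phi @ b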

33 Generating Plausible Shapes Variation is modeled as a linear combination of eigenvectors How about for nonlinear cases?

34 Nonlinear Shape Variation

35 b 1 and b 2 are not independent.

36 Nonlinear Shape Variation

37 Nonlinear shape variations are often caused by – Rotating parts of objects – Viewpoint changes – Other special cases, e.g., only 2 valid positions (a linear x = f(b) fails) Plausible shapes cannot be generated using the aforementioned (linear) method.

38 Non-Linear Models for PDF Polar coordinates (Heap and Hogg) Modeling p(b) by a mixture of Gaussians Drawbacks Figuring out the number of Gaussians to be used Finding the nearest plausible shape

39 Similarity Transformation A shape x of the model is mapped to a shape X in the image by a similarity transformation with pose parameters (X_t, Y_t, s, θ): translation by (X_t, Y_t), scaling by s, and rotation by θ, i.e., X = T_{X_t,Y_t,s,θ}(x), where a single point (x, y) maps to (X_t + s(x cos θ − y sin θ), Y_t + s(x sin θ + y cos θ))

40 Fitting a Model to New Points in an Image Given a set of new image points Y, find b and the pose parameters (X_t, Y_t, s, θ) so as to minimize |Y − T_{X_t,Y_t,s,θ}(x̄ + Φb)|²

41 Fitting a Model to New Points in an Image Minimize |Y − T_{X_t,Y_t,s,θ}(x̄ + Φb)|², typically by alternating between estimating the pose and estimating the shape parameters (as sketched below)
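A rough sketch of that alternating fit, assuming 2D landmarks stored interleaved as (x_1, y_1, x_2, y_2, …) and a simple least-squares similarity fit; this is an illustrative simplification (it omits the tangent-space projection mentioned earlier), not the exact procedure from the slides:

import numpy as np

def similarity_fit(src, dst):
    # Least-squares similarity transform (scaled rotation R, translation t) mapping src onto dst;
    # src and dst are (n, 2) arrays of corresponding points
    mu_s, mu_d = src.mean(axis=0), dst.mean(axis=0)
    A, B = src - mu_s, dst - mu_d
    norm = (A ** 2).sum()
    a = (A * B).sum() / norm                                   # a = s*cos(theta)
    b = (A[:, 0] * B[:, 1] - A[:, 1] * B[:, 0]).sum() / norm   # b = s*sin(theta)
    R = np.array([[a, -b], [b, a]])
    t = mu_d - mu_s @ R.T
    return R, t

def fit_to_points(Y, x_mean, phi, eigvals, n_iter=20):
    # Alternate pose and shape-parameter updates so that T(x_mean + phi b) approximates Y (n x 2)
    n = Y.shape[0]
    b = np.zeros(phi.shape[1])
    for _ in range(n_iter):
        x = (x_mean + phi @ b).reshape(n, 2)          # current model instance (model frame)
        R, t = similarity_fit(x, Y)                   # pose update
        y = (Y - t) @ np.linalg.inv(R).T              # project the image points into the model frame
        b = phi.T @ (y.reshape(-1) - x_mean)          # shape-parameter update
        b = np.clip(b, -3 * np.sqrt(eigvals), 3 * np.sqrt(eigvals))   # keep the shape plausible
    return (R, t), b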

42 Statistical Models of Appearance for Computer Vision Statistical Models of Appearance

43 Appearance Shape Texture – Pattern of intensities

44 Appearance Model An appearance model captures both shape variation and texture variation; the texture is modeled after shape normalization

45 Shape Normalization Remove spurious texture variations due to shape differences Warp each image so its control points match those of the mean shape – e.g., using a triangulation algorithm

46 Intensity Normalization The gray-level vector of each shape-normalized image is scaled and offset to match the mean of the normalized data, which is itself scaled and offset s.t. the sum of its elements is 0 and the variance is 1. The mean is obtained using a recursive process.
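A small sketch of that recursive normalization, assuming the shape-normalized grey-level vectors are stacked as the rows of a NumPy array; the scale/offset formulas mirror the zero-sum, unit-variance convention stated above:

import numpy as np

def normalise_texture(g, g_mean):
    # Offset by beta (mean of g) and scale by alpha = g . g_mean so g best matches the current mean
    beta = g.mean()
    alpha = (g - beta) @ g_mean
    return (g - beta) / alpha

def build_texture_mean(G, n_iter=5):
    # G: (N, m) matrix of shape-normalised grey-level vectors, one per training image
    g_mean = G[0].astype(float).copy()
    for _ in range(n_iter):
        g_mean = (g_mean - g_mean.mean()) / g_mean.std()   # zero sum, unit variance
        G_norm = np.array([normalise_texture(g, g_mean) for g in G])
        g_mean = G_norm.mean(axis=0)                       # re-estimate the mean and iterate
    return g_mean, G_norm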

47 PCA Applying PCA to the normalized textures gives g = ḡ + P_g b_g, where ḡ is the mean normalized grey-level vector, P_g is a set of orthogonal modes of variation, and b_g is a set of grey-level parameters

48 Texture Reconstruction

49 Combined Appearance Model For each training example, compute the shape parameters b_s = P_s^T (x − x̄) and the appearance (grey-level) parameters b_g = P_g^T (g − ḡ)

50 Combined Appearance Models Concatenate b = (W_s b_s ; b_g), where W_s is a diagonal matrix of weights to accommodate the difference in units between the shape and grey models

51 PCA for Combined Vectors Applying a further PCA to the combined vectors gives b = Qc, where Q contains the eigenvectors from applying PCA on the b ’s and c is a vector of appearance parameters controlling both the shape and grey-levels of the model
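A sketch of this combined PCA, assuming the per-example b_s and b_g vectors are stacked as rows of NumPy arrays and Ws is the diagonal weight matrix from the previous slide:

import numpy as np

def build_combined_model(Bs, Bg, Ws):
    # Bs: (N, ts) shape parameters, Bg: (N, tg) grey-level parameters, Ws: (ts, ts) diagonal weights
    B = np.hstack([Bs @ Ws.T, Bg])                        # one combined vector b per example
    eigvals, Q = np.linalg.eigh(np.cov(B, rowvar=False))  # PCA on the combined vectors
    order = np.argsort(eigvals)[::-1]
    return Q[:, order], eigvals[order]                    # columns of Q are the combined modes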

52 Shape & Grey-Level Reconstruction Writing Q = (Q_s ; Q_g), the shape and grey-levels are reconstructed from c as x = x̄ + P_s W_s^{-1} Q_s c and g = ḡ + P_g Q_g c

53 Appearance Reconstruction Given the appearance parameters c: generate the shape-free gray-level image g, then warp g to the shape described by x

54 Review: Combined Appearance Models b = (W_s b_s ; b_g), where W_s is a diagonal matrix of weights to accommodate the difference in units between the shape and grey models. How is W_s obtained?

55 Choice of Shape Parameter Weights Method 1 – Displace each element of b_s from its optimum value on each training example and observe the change in g; the RMS change gives the corresponding element of W_s Method 2 – W_s = rI, where r² is the ratio of the total intensity variation to the total shape variation

56 Choice of Shape Parameter Weights Method 1 – Displace each element of b_s from its optimum value on each training example and observe the change in g; the RMS change gives the corresponding element of W_s Method 2 – W_s = rI, where r² is the ratio of the total intensity variation to the total shape variation The results are relatively insensitive to the choice of W_s
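Method 2 is essentially a one-liner; a minimal sketch, assuming the eigenvalue arrays come from the shape and grey-level PCAs above:

import numpy as np

def shape_weight_matrix(shape_eigvals, grey_eigvals):
    # Method 2: Ws = r * I, with r^2 the ratio of total intensity variation to total shape variation
    r = np.sqrt(grey_eigvals.sum() / shape_eigvals.sum())
    return r * np.eye(len(shape_eigvals))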

57 Example: Facial Appearance Model First two modes of shape variation (±3 s.d.) First two modes of grey-level variation (±3 s.d.)

58 Example: Facial Appearance Model First four modes of appearance variation (±3 s.d.)

59 Approximating a New Example Given a new image, labelled with a set of landmarks, generate an approximation with the model

60 Approximating a New Example Given a new image, labelled with a set of landmarks, generate an approximation with the model (see the sketch below): obtain b_s and b_g, obtain b, obtain c, apply the reconstruction x = x̄ + P_s W_s^{-1} Q_s c and g = ḡ + P_g Q_g c, invert the gray-level normalization, apply the pose to the points, and project the gray-level vector into the image
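A sketch of the linear-algebra part of these steps (the final pose and warping stages are omitted); the argument names simply mirror the symbols used on the previous slides:

import numpy as np

def approximate_example(x, g, x_mean, g_mean, Ps, Pg, Ws, Qs, Qg):
    # Project a labelled example (landmarks x, shape-normalised texture g) through the combined
    # model and back, returning the model's approximation of its shape and texture
    bs = Ps.T @ (x - x_mean)                           # shape parameters
    bg = Pg.T @ (g - g_mean)                           # grey-level parameters
    b = np.concatenate([Ws @ bs, bg])                  # combined vector b = (Ws bs ; bg)
    c = np.concatenate([Qs, Qg]).T @ b                 # appearance parameters c = Q^T b
    x_rec = x_mean + Ps @ np.linalg.solve(Ws, Qs @ c)  # x = x_mean + Ps Ws^-1 Qs c
    g_rec = g_mean + Pg @ (Qg @ c)                     # g = g_mean + Pg Qg c
    return x_rec, g_rec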

61 Statistical Models of Appearance for Computer Vision Active Shape Models

62 Goal Given a rough starting approximation, to fit an instance of a model to the image

63 Iterative Approach Iteratively improving the fit of the instance, X, to an image proceeds as follows: 1. Examine a region of the image around each point X_i to find the best nearby match for the point 2. Update the parameters (X_t, Y_t, s, θ, b) to best fit the newly found points X 3. Repeat until convergence

64 Iterative Approach Iteratively improving the fit of the instance, X, to an image proceeds as follows: 1. Examine a region of the image around each point X_i to find the best nearby match for the point 2. Update the parameters (X_t, Y_t, s, θ, b) to best fit the newly found points X 3. Repeat until convergence The above method is applicable if the model points lie on edges. The best approach is to examine the local structure around the model points (to be discussed).

65 Iterative Approach Iteratively improving the fit of the instance, X, to an image proceeds as follows: 1. Examine a region of the image around each point X_i to find the best nearby match for the point 2. Update the parameters (X_t, Y_t, s, θ, b) to best fit the newly found points X 3. Repeat until convergence

66 Modeling Local Structure Sample the derivative along a profile, k pixels on either side of a model point, to get a vector g_i of 2k+1 samples

67 Modeling Local Structure Sample the derivative along a profile, k pixels on either side of a model point, to get a vector g_i of 2k+1 samples Normalize g_i by dividing by the sum of the absolute values of its elements Repeat for each training image for the same model point to get {g_i} Estimate the mean ḡ and covariance S_g

68 Fitness Sample the derivative along a profile, k pixels on either side of a model point, to get a vector g_i of 2k+1 samples Normalize g_i by dividing by the sum of the absolute values of its elements Repeat for each training image for the same model point to get {g_i} Estimate the mean ḡ and covariance S_g The fit of a new sample g_s is measured by the Mahalanobis distance f(g_s) = (g_s − ḡ)^T S_g^{-1} (g_s − ḡ); lower is better

69 Using Local Structure Model Sample a profile m pixels on either side of the current point ( m > k ) Test the quality of fit at 2(m − k)+1 positions Choose the one which gives the best match (see the sketch below)
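A sketch of the profile model and the search along it, assuming the normalised derivative profiles for one landmark are stacked as rows of a NumPy array; the pseudo-inverse is a small robustness tweak of mine, not something stated on the slides:

import numpy as np

def profile_model(profiles):
    # profiles: (N, 2k+1) normalised derivative profiles for one landmark over the training set
    g_mean = profiles.mean(axis=0)
    S_inv = np.linalg.pinv(np.cov(profiles, rowvar=False))   # pinv guards against a singular S
    return g_mean, S_inv

def mahalanobis_fit(g, g_mean, S_inv):
    # f(g) = (g - g_mean)^T S^-1 (g - g_mean): smaller means a better match
    d = g - g_mean
    return d @ S_inv @ d

def best_profile_offset(long_profile, g_mean, S_inv, k, m):
    # Slide a (2k+1)-sample window over a (2m+1)-sample search profile and return the offset,
    # in samples relative to the current point, with the lowest Mahalanobis distance
    fits = []
    for j in range(2 * (m - k) + 1):
        window = long_profile[j:j + 2 * k + 1]
        window = window / np.abs(window).sum()               # same normalisation as in training
        fits.append(mahalanobis_fit(window, g_mean, S_inv))
    return int(np.argmin(fits)) - (m - k)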

70 Statistical Models of Grey-Level Profiles The same number of pixels is used at each level

71 MRASM Algorithm

72 Starting from the coarsest level

73 MRASM Algorithm Set up the initial position before search

74 MRASM Algorithm Search the best match position for each model point

75 MRASM Algorithm Search the best match position for each model point Ensure plausibility

76 MRASM Algorithm Repeat until convergence at the current level Then change to a finer level

77 MRASM Algorithm Report the result
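A purely structural sketch of the multi-resolution loop described on slides 71 to 77; image_pyramid, search_profiles, and update_pose_and_shape are hypothetical placeholders for the Gaussian pyramid, the per-point profile search, and the pose/shape update, and the convergence test (a fraction of points already at their best match) is one common choice rather than the slides' exact criterion:

import numpy as np

def mrasm_search(image, model, n_levels=3, max_iter=25, converged_frac=0.9):
    # image_pyramid, search_profiles, and update_pose_and_shape are hypothetical placeholders
    pyramid = image_pyramid(image, n_levels)
    pose, b = model.initial_pose(), np.zeros(model.n_modes)
    for level in reversed(range(n_levels)):               # start from the coarsest level
        for _ in range(max_iter):
            X = model.instance(pose, b, level)            # current model points in the image frame
            X_new, n_close = search_profiles(pyramid[level], X, model, level)
            pose, b = update_pose_and_shape(X_new, model) # re-fit pose and shape to the new points
            b = np.clip(b, -3 * np.sqrt(model.eigvals), 3 * np.sqrt(model.eigvals))  # plausibility
            if n_close / len(X) >= converged_frac:        # most points already at their best match
                break                                     # converged at this level; go finer
    return pose, b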

78 Summary of Parameters

79 Examples of Search

80 Poor Starting Point

81 Examples of Search Search using ASM of cartilage on an MR image of the knee

82 Statistical Models of Appearance for Computer Vision Active Appearance Models

83 Disadvantages of ASM Only uses shape constraints (together with some information about the image structure near the landmarks) for search Incapable of generating a synthetic image of the object

84 Goal of AAM Given a rough starting approximation of an appearance model, to fit it within an image

85 Review: Appearance Model x = x̄ + P_s W_s^{-1} Q_s c, g = ḡ + P_g Q_g c, where c is the model (appearance) parameter vector

86 Overview of AAM Search The difference vector is δg = g_i − g_m, where g_i is the grey-level vector sampled from the image and g_m is the grey-level vector synthesized by the model

87 Overview of AAM Search AAM Search – Minimize Δ = |δg|² by varying c; matching effectively becomes an optimization problem

88 Overview of AAM Search AAM Search – Minimize Δ = |δg|² by varying c; matching effectively becomes an optimization problem Given δg, how can δc be obtained deterministically?

89 Learning to Correct Model Parameters Given δg, how can δc be obtained deterministically? Linear Model: δc = A δg, with A obtained by multivariate regression on a sample of known model displacements δc and the corresponding δg

90 Extra Parameters for Pose Linear Model: δc = A δg, with A obtained by multivariate regression on a sample of known model displacements δc and the corresponding δg; the displacement vector also includes the pose parameters (s_x, s_y, t_x, t_y)

91 Training c_0 – a known appearance model fit in the current image Perturb by δc to get new parameters, i.e., c = c_0 + δc – including small displacements in position, scale, and orientation Compute δg = g_i − g_m – using the shape-normalized representation (warping) Record enough perturbations and image differences for the regression

92 Training c_0 – a known appearance model fit in the current image Perturb by δc to get new parameters, i.e., c = c_0 + δc – including small displacements in position, scale, and orientation Compute δg = g_i − g_m – using the shape-normalized representation (warping) Record enough perturbations and image differences for the regression Ideally, we want a model that holds over a large error range δg, and so over a large parameter range δc Experimentally, the optimal perturbation is around 0.5 standard deviations for each parameter
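A sketch of how such training data and the regression matrix could be assembled; sample_residual is a hypothetical helper standing in for the warp-sample-normalise step described above, and the perturbation scheme (independent Gaussian displacements of about 0.5 s.d. per mode) is an illustrative simplification:

import numpy as np

def learn_update_matrix(dC, dG):
    # Multivariate linear regression delta_c = A * delta_g, solved in the least-squares sense
    # dC: (N, n_params) parameter displacements; dG: (N, n_pixels) resulting texture residuals
    A_T, *_ = np.linalg.lstsq(dG, dC, rcond=None)
    return A_T.T                                          # A has shape (n_params, n_pixels)

def training_samples(model, fitted_examples, n_perturb=20, scale=0.5):
    # sample_residual is a hypothetical helper that warps, samples, and normalises the texture,
    # returning delta_g = g_image - g_model for the perturbed parameters
    dC, dG = [], []
    for image, c0 in fitted_examples:                     # (image, known appearance parameters)
        for _ in range(n_perturb):
            dc = scale * np.sqrt(model.c_eigvals) * np.random.randn(len(c0))  # ~0.5 s.d. per mode
            dG.append(sample_residual(image, model, c0 + dc))
            dC.append(dc)
    return np.array(dC), np.array(dG)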

93 Results For The Face Model

94 The weight attached to different areas of the sampled patch when estimating the displacement

95 Weights for Pose Parameters (weight images for s_x, s_y, t_x, t_y)

96 Pose-Parameter Displacement Weights (weight images for s_x, s_y, t_x, t_y)

97 First Mode and Displacement Weights

98 Second Mode and Displacement Weights

99 Performance of the Prediction The linear relation holds within about 4 pixels of displacement As long as the prediction has the same sign as the actual error, and does not over-predict too much, the search converges The range can be extended by building a multi-resolution model

100 Performance of the Prediction Multi-Resolution Model L0: 10000 pixels, L1: 2500 pixels, L2: 600 pixels.

101 AAM Search: Iterative Model Refinement 1. Evaluate the error vector δg_0 = g_s − g_m 2. Evaluate the current error E_0 = |δg_0|² 3. Compute the predicted displacement δc = A δg_0 4. Set k = 1 5. Let c_1 = c_0 − k δc 6. Sample the image at this new prediction, and calculate a new error vector δg_1 7. If |δg_1|² < E_0, then accept the new estimate c_1 8. Otherwise try at k = 1.5, k = 0.5, k = 0.25, etc.

102 AAM Search: Iterative Model Refinement
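The loop on slide 101 maps almost line-for-line onto code; in this sketch, residual(c) is a hypothetical callback that warps the model to the image, samples it, and returns δg for the given appearance parameters c:

import numpy as np

def aam_refine(c0, A, residual, max_iter=30, steps=(1.0, 1.5, 0.5, 0.25)):
    # residual(c) is a hypothetical callback returning delta_g for appearance parameters c
    c = np.asarray(c0, dtype=float).copy()
    dg = residual(c)
    E = dg @ dg                                           # current error |delta_g|^2
    for _ in range(max_iter):
        dc = A @ dg                                       # predicted parameter displacement
        for k in steps:                                   # try k = 1, then 1.5, 0.5, 0.25
            c_new = c - k * dc
            dg_new = residual(c_new)
            E_new = dg_new @ dg_new
            if E_new < E:                                 # accept the first step that improves the fit
                c, dg, E = c_new, dg_new, E_new
                break
        else:
            return c                                      # no step improved the fit: converged
    return c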

103 Examples of AAM Search Reconstruction (left) and original (right) given original landmark points

104 Examples of AAM Search Multi-Resolution search from displaced position

105 Examples of AAM Search First two modes of appearance variation of knee model Best fit of knee model to new image given landmarks

106 Examples of AAM Search Multi-Resolution search from displaced position

107 Statistical Models of Appearance for Computer Vision Comparison : ASM v/s AAM

108 Key Differences ASM only uses models of the image texture in small regions around each landmark point ASM searches around the current position ASM seeks to minimize the distance between model points and the corresponding image points AAM uses a model of the appearance of the whole region AAM only samples the image under the current position AAM seeks to minimize the difference between the synthesized image and the target image

109 Experiment Data Two data sets: – 400 face images, 133 landmarks each – 72 brain slices, 133 landmark points each Training data set – Faces: trained on 200, tested on the remaining 200 – Brain: 400, leave-one-brain-out experiments

110 Capture Range

111

112 Point Location Accuracy

113 ASM runs significantly faster for both models, and locates the points more accurately

114 Texture Matching

115 Statistical Models of Appearance for Computer Vision Conclusion

116 ASM searches around the current location, along profiles, so one would expect it to have a larger capture range ASM takes only the shape into account and is thus less reliable AAM can work well with a much smaller number of landmarks compared to ASM

