Statistical Models of Appearance for Computer Vision

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Face Recognition Sumitha Balasuriya.
Active Appearance Models
Scale & Affine Invariant Interest Point Detectors Mikolajczyk & Schmid presented by Dustin Lennon.
EigenFaces and EigenPatches Useful model of variation in a region –Region must be fixed shape (eg rectangle) Developed for face recognition Generalised.
Active Shape Models Suppose we have a statistical shape model –Trained from sets of examples How do we use it to interpret new images? Use an “Active Shape.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Sami Romdhani Volker Blanz Thomas Vetter University of Freiburg
Dimension reduction (1)
Face Alignment with Part-Based Modeling
Wangfei Ningbo University A Brief Introduction to Active Appearance Models.
AAM based Face Tracking with Temporal Matching and Face Segmentation Dalong Du.
, Tim Landgraf Active Appearance Models AG KI, Journal Club 03 Nov 2008.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #20.
A 4-WEEK PROJECT IN Active Shape and Appearance Models
Principal Component Analysis CMPUT 466/551 Nilanjan Ray.
Principal Component Analysis
Active Appearance Models master thesis presentation Mikkel B. Stegmann IMM – June 20th 2000.
Dimensional reduction, PCA
Rodent Behavior Analysis Tom Henderson Vision Based Behavior Analysis Universitaet Karlsruhe (TH) 12 November /9.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Real-time Combined 2D+3D Active Appearance Models Jing Xiao, Simon Baker,Iain Matthew, and Takeo Kanade CVPR 2004 Presented by Pat Chan 23/11/2004.
Project 4 out today –help session today –photo session today Project 2 winners Announcements.
Face Recognition Jeremy Wyatt.
Active Appearance Models Computer examples A. Torralba T. F. Cootes, C.J. Taylor, G. J. Edwards M. B. Stegmann.
Active Appearance Models Suppose we have a statistical appearance model –Trained from sets of examples How do we use it to interpret new images? Use an.
Computer Vision I Instructor: Prof. Ko Nishino. Today How do we recognize objects in images?
Face Collections : Rendering and Image Processing Alexei Efros.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Camera parameters Extrinisic parameters define location and orientation of camera reference frame with respect to world frame Intrinsic parameters define.
Active Appearance Models based on the article: T.F.Cootes, G.J. Edwards and C.J.Taylor. "Active Appearance Models", presented by Denis Simakov.
Statistical Shape Models Eigenpatches model regions –Assume shape is fixed –What if it isn’t? Faces with expression changes, organs in medical images etc.
CSci 6971: Image Registration Lecture 16: View-Based Registration March 16, 2004 Prof. Chuck Stewart, RPI Dr. Luis Ibanez, Kitware Prof. Chuck Stewart,
PhD Thesis. Biometrics Science studying measurements and statistics of biological data Most relevant application: id. recognition 2.
CS 485/685 Computer Vision Face Recognition Using Principal Components Analysis (PCA) M. Turk, A. Pentland, "Eigenfaces for Recognition", Journal of Cognitive.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Active Appearance Models for Face Detection
Face Alignment Using Cascaded Boosted Regression Active Shape Models
Dimensionality Reduction: Principal Components Analysis Optional Reading: Smith, A Tutorial on Principal Components Analysis (linked to class webpage)
Active Shape Models: Their Training and Applications Cootes, Taylor, et al. Robert Tamburo July 6, 2000 Prelim Presentation.
Multimodal Interaction Dr. Mike Spann
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
CSE554AlignmentSlide 1 CSE 554 Lecture 5: Alignment Fall 2011.
1 ECE 738 Paper presentation Paper: Active Appearance Models Author: T.F.Cootes, G.J. Edwards and C.J.Taylor Student: Zhaozheng Yin Instructor: Dr. Yuhen.
1 Recognition by Appearance Appearance-based recognition is a competing paradigm to features and alignment. No features are extracted! Images are represented.
ECE 8443 – Pattern Recognition LECTURE 08: DIMENSIONALITY, PRINCIPAL COMPONENTS ANALYSIS Objectives: Data Considerations Computational Complexity Overfitting.
CSE 185 Introduction to Computer Vision Face Recognition.
Computer Vision Hough Transform, PDM, ASM and AAM David Pycock
Project 11: Determining the Intrinsic Dimensionality of a Distribution Okke Formsma, Nicolas Roussis and Per Løwenborg.
Project 11: Determining the Intrinsic Dimensionality of a Distribution Okke Formsma, Nicolas Roussis and Per Løwenborg.
Last update Heejune Ahn, SeoulTech
Multimodal Interaction Dr. Mike Spann
Point Distribution Models Active Appearance Models Compilation based on: Dhruv Batra ECE CMU Tim Cootes Machester.
Statistical Models of Appearance for Computer Vision 主講人:虞台文.
Advanced Artificial Intelligence Lecture 8: Advance machine learning.
Face detection and recognition Many slides adapted from K. Grauman and D. Lowe.
Part 3: Estimation of Parameters. Estimation of Parameters Most of the time, we have random samples but not the densities given. If the parametric form.
CSE 554 Lecture 8: Alignment
LECTURE 10: DISCRIMINANT ANALYSIS
Recognition: Face Recognition
Clustering (3) Center-based algorithms Fuzzy k-means
Dynamical Statistical Shape Priors for Level Set Based Tracking
Principal Component Analysis
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Feature space tansformation methods
Generally Discriminant Analysis
LECTURE 09: DISCRIMINANT ANALYSIS
Announcements Project 4 out today Project 2 winners help session today
Active Appearance Models theory, extensions & cases
The “Margaret Thatcher Illusion”, by Peter Thompson
Presentation transcript:

Statistical Models of Appearance for Computer Vision T.F. Cootes and C. J. Taylor July 10, 2000

Computer Vision Aim Challenge Image understanding Deformable objects Models Challenge Deformable objects

Deformable Models Characteristics General Specific

Modeling Approaches Card Board Model Stick Figure Model Surface Based Volumetric Superquadrics Statistical Approach

Why Statistical Approach ? Widely applicable Expert knowledge captured in the system in the annotation of training examples Compact representation n-D space modeling Few prior assumptions

Topics Statistical models of shape Statistical models of appearance

Subsections Building statistical model Using these models to interpret new images

Statistical Shape Models

Shape Invariance under certain transforms eg: in 2-3 dimension – translation, rotation, scaling Represented by a set of n points, in d dimensions by a nd element vector s training examples, s such vectors

Suitable Landmarks Easy to detect Consistent over images 2-D - corners on the boundary Consistent over images Points b/w well defined landmarks

Aligning the Training Set Procrustes Analysis D = |xi – X|2 is minimized Constraints on mean Center Scale Orientation

Alignment : Iterative Approach Translate training set to origin Let x0 be the initial estimate of mean “Align” all shapes with mean Re-estimate mean to be X “Align” new mean w.r.t. previous mean and scale s.t. |X| = 1 REPEAT starting from 3

What is “Align” Operations allowed Center -> scale (|x| =1) -> rotation Center -> (scale + rotation) Center -> (scale + rotation) -> projection onto tangent space of the mean

Tangent Space All vectors x s.t. (xt –x).xt = 0 => x.xt = 1 Method : Scale x by 1/(x.X)

Modelling Shape Variation Advantages Generate new examples Examine new shapes (plausibility) Form x = M(b), b is vector of model parameters

S = ((xi – X)(xi – X)T)/(s-1) PCA Compute the mean of the data X = (xi)/s Compute the covariance of the data, S = ((xi – X)(xi – X)T)/(s-1) Compute the eigenvectors, i and corresponding eigen values i of S

Approximation using PCA If  contains t eigenvectors corresponding to the largest eigenvalue, x X + b where  = (1| 2|..| t) and b is t dimensional vector given by b = T(x-X)

Choice of Number of Modes t Proportion of variance exhibited i=1ti / i > th Accuracy to approximate training examples Miss-one-out manner

Uses of PCA Principal Components Analysis (PCA) exploits the redundancy in multivariate data, enabling us to: Pick out patterns (relationships) in the variables Reduce the dimensionality of our data set without a significant loss of information

Generating Plausible Shapes Assumption : bi are independent and gaussian Options Hard limits on independent b Constrain b in a hyperellipsoid

Drawbacks Inadequate for non-linear shape variations Rotating parts of objects View point change Other special cases Eg : Only 2 valid positions (x = f(b) fails) Only variations observed in the training set are represented

Non-Linear Models of PDF Polar co-ordinates (Heap and Hogg) Mixture of gaussians Drawbacks : Figuring out no. of gaussians to be used Finding nearest plausible shape

Fitting a Model to New Points x = TXt,Yt,s,(X+b) Aim : Minimize |Y-x|2 Initialize shape parameter, b, to 0 Generate model instance x = X + b Find the pose parameters Xt,Yt,s, which best map x to Y

Invert the pose parameters and use to project Y to the model co-ordinate frame : y = T-1 Xt,Yt,s,(Y) Project y into the tangent plane to X by scaling by 1/(y.X) Update the model parameter to match y b = T(y-X) REPEAT

Estimating p(shape) dx = x – X Best approximation of dx be b Residual error r = dx - b p(x) = p(r).p(b) logp(r) = -0.5|r|2/σr2 + const logp(b) = -0.5bi2/i + const

Relaxing Shape Model Artificially add extra variations Finite Element Method (M & K) Perturbing the covariance matrix Combining statistical and FEM modes Decrease the allowed vibration modes as the number of examples increases

Statistical Appearance Models

Appearance Shape Texture Pattern of intensities

Shape Normalization Warp each image to match control points with the mean image (triangulation algorithm) Advantages Remove spurious texture variations due to shape differences

Intensity Normatization g = (gim - 1)/ where  = gim.G  = (gim.1)/n

PCA Model : g = G + Pgbg G = mean of the normalized data Pg = set of the orthogonal modes of variation bg = set of gray level paramemters gim = Tu(G + Pgbg)

Combined Appearance Model Shape bs Texture bg Correlation b/w the two b = (Wsbs bg)T = (WsPsT(x-X) PgT(g-G))T

Applying PCA to b b = Qc x = X + PsWs-1Qsc, g = G + PgQgc where Q = (Qs Qg)T

Choice of Ws Displace each element of bs from its optimum value and observe change in g Ws = rI where r2 is the ratio of the total intensity variation to the total shape variation Insensitivity to Ws

Example : Facial AM

Approximating a New Image Obtain bs and bg Obtain b Obtain c Apply x = X + PsWs-1Qsc, g = G + PgQgc Inverting gray level normalization Applying pose to the points Projecting the gray level vector to the image

Fitting a Model to New Points x = TXt,Yt,s,(X+b) Aim : Minimize |Y-x|2 Initialize shape parameter, b, to 0 Generate model instance x = X + b Find the pose parameters Xt,Yt,s, which best map x to Y

Invert the pose parameters and use to project Y to the model co-ordinate frame : y = T-1 Xt,Yt,s,(Y) Project y into the tangent plane to X by scaling by 1/(y.X) Update the model parameter to match y b = T(y-X) REPEAT

Example

Active Shape Models

Problem statement Given a rough starting approximation, how do we fit an instance of a model to the image

Iterative Approach Examine a region of the image around each point Xi to find the best nearby match for the point Xi’ Update the parameters (Xt, Yt, s, , b) to best fit the new found points X REPEAT

In Practice

Modeling Local Structure Sample the derivative along a profile, k pixels on either side of a model point, to get a vector gi of the 2k+1 points Normalize Repeat for each training image for same model point to get {gi} Estimate mean G and covariance Sg f(gs) = (gs-G)TSg-1(gs-G)

Using Local Structure Model Sample a profile m pixels either side of the current point (m>k) Test quality of fit at 2(m-k)+1 positions Chose the one which gives the best match

Multi-Resolution ASM

Advantages Speed Less likely to get stuck on the wrong image structure

Complete Algorithm Set L = Lmax For L = Lmax:0 Compute model point position in the image at level L Evaluate fit at ns points along the profile Update pose and shape parameters to fit the model to new points Return unless more than pclose points satisfy the required criterion

Paramemters Model Parameters Search Parameters n t k Lmax ns Nmax pclose

Examples of Search

Example (failure)

Active Appearance Models

Background Bajcsy and Kovacic : Volume model that deforms elastically Christensen et al : Viscous flow model Turk and Pentland : ‘eigenfaces’ Poggio : New views from a set of example views, fitting by stochastic optimization procedure

Overview of AAM Search I = Ii – Im Minimize  = | I|2 by varying c Note : I encodes information about c

Learning to correct c Model : c = A I Multivariate regression on a sample of known model displacements, c, and the corresponding I c = Rc I

In reality Linear relation holds within 4 pixels As long as prediction has the same sign as actual error, and not much over-prediction, it converges Extend range by building multi-resolution model

Iterative Model Refinement g = gs – gm E = | g|2 c = A g Set k = 1 Let c’ = c - k c Calculate g’ If | g’| < E, the REPEAT with c’ O/W try at k = 1.5, 0.5, 0.25

Experimental Results

Comparison : ASM v/s AAM

Key Differences ASM only uses models of the image texture in the small regions around each landmark point ASM searches around current position ASM seeks to minimize the distance b/w model points and corresponding image points AAM uses a model of appearance of the whole region AAM only samples the image under current position AAM seeks to minimize the difference of the synthesized image and target image

Experiment Data Two data sets : Training data set 400 face images, 133 landmarks 72 brain slices, 133 landmark points Training data set Faces : 200, tested on remaining 200 Brain : 400, leave-one-brain-experiments

Capture Range

Point Location Accuracy

Point Location Accuracy ASM runs significantly faster for both models, and locates the points more accurately

Texture Matching

Conclusion ASM searches around the current location, along profiles, so one would expect them to have larger capture range ASM takes only the shape into account thus are less reliable AAM can work well with a much smaller number of landmarks as compared to ASM