Technische Universität München Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Institute for Informatics Technische.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science TU München, Germany A Person and Context.
Active Appearance Models
12 th International Fall Workshop VISION, MODELING, AND VISUALIZATION 2007 November 7-9, 2007 Saarbrücken, Germany Estimating Natural Activity by Fitting.
Model-based Image Interpretation with Application to Facial Expression Recognition Matthias Wimmer
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Change Detection C. Stauffer and W.E.L. Grimson, “Learning patterns of activity using real time tracking,” IEEE Trans. On PAMI, 22(8): , Aug 2000.
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Matthias Wimmer, Ursula Zucker and Bernd Radig Chair for Image Understanding Computer Science Technische Universität München { wimmerm, zucker, radig
Automatic determination of skeletal age from hand radiographs of children Image Science Institute Utrecht University C.A.Maas.
Face Alignment with Part-Based Modeling
Internet Vision - Lecture 3 Tamara Berg Sept 10. New Lecture Time Mondays 10:00am-12:30pm in 2311 Monday (9/15) we will have a general Computer Vision.
Segmentation-Free, Area-Based Articulated Object Tracking Daniel Mohr, Gabriel Zachmann Clausthal University, Germany ISVC.
AAM based Face Tracking with Temporal Matching and Face Segmentation Dalong Du.
Robust Moving Object Detection & Categorization using self- improving classifiers Omar Javed, Saad Ali & Mubarak Shah.
CS292 Computational Vision and Language Pattern Recognition and Classification.
A Study of Approaches for Object Recognition
Pattern Recognition Topic 1: Principle Component Analysis Shapiro chap
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
Face detection and recognition Many slides adapted from K. Grauman and D. Lowe.
A Robust Real Time Face Detection. Outline  AdaBoost – Learning Algorithm  Face Detection in real life  Using AdaBoost for Face Detection  Improvements.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
A Robust Real Time Face Detection. Outline  AdaBoost – Learning Algorithm  Face Detection in real life  Using AdaBoost for Face Detection  Improvements.
Presented by Zeehasham Rasheed
Augmented Reality: Object Tracking and Active Appearance Model
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science Technische Universität München Adaptive.
Matthias Wimmer, Sylvia Pietzsch, Freek Stulp and Bernd Radig Chair for Image Understanding Institute for Computer Science Technische Universität München.
Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.
Facial Feature Detection
Face Detection using the Viola-Jones Method
A Tutorial on Object Detection Using OpenCV
Recognition Part II Ali Farhadi CSE 455.
Principles of Pattern Recognition
Multimodal Interaction Dr. Mike Spann
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Face Model Fitting with Generic, Group-specific, and Person- specific Objective Functions Chair for Image Understanding and Knowledge-based Systems Institute.
October 14, 2014Computer Vision Lecture 11: Image Segmentation I 1Contours How should we represent contours? A good contour representation should meet.
EE 492 ENGINEERING PROJECT LIP TRACKING Yusuf Ziya Işık & Ashat Turlibayev Yusuf Ziya Işık & Ashat Turlibayev Advisor: Prof. Dr. Bülent Sankur Advisor:
Detecting Pedestrians Using Patterns of Motion and Appearance Paul Viola Microsoft Research Irfan Ullah Dept. of Info. and Comm. Engr. Myongji University.
2 2  Background  Vision in Human Brain  Efficient Coding Theory  Motivation  Natural Pictures  Methodology  Statistical Characteristics  Models.
Automatic Image Anonymizer Alex Brettingen James Esposito.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Ground Truth Free Evaluation of Segment Based Maps Rolf Lakaemper Temple University, Philadelphia,PA,USA.
Non-Photorealistic Rendering and Content- Based Image Retrieval Yuan-Hao Lai Pacific Graphics (2003)
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
Probability and Statistics in Vision. Probability Objects not all the sameObjects not all the same – Many possible shapes for people, cars, … – Skin has.
Sylvia Pietzsch Chair for Image Understanding Computer Science Technische Universität München Learning Generic and Person Specific Objective.
October 1, 2013Computer Vision Lecture 9: From Edges to Contours 1 Canny Edge Detector However, usually there will still be noise in the array E[i, j],
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
An Improved Algorithm for Decision-Tree-Based SVM Sindhu Kuchipudi INSTRUCTOR Dr.DONGCHUL KIM.
EE368: Digital Image Processing Bernd Girod Leahy, p.1/15 Face Detection on Similar Color Images Scott Leahy EE368, Stanford University May 30, 2003.
Stephan Tschechne Chair for Image Understanding Computer Science Technische Universität München Designing vs. Learning the Objective.
May 2003 SUT Color image segmentation – an innovative approach Amin Fazel May 2003 Sharif University of Technology Course Presentation base on a paper.
Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.
Learning and Removing Cast Shadows through a Multidistribution Approach Nicolas Martel-Brisson, Andre Zaccarin IEEE TRANSACTIONS ON PATTERN ANALYSIS AND.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
Supervised Time Series Pattern Discovery through Local Importance
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
EE 492 ENGINEERING PROJECT
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
EM Algorithm and its Applications
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Presentation transcript:

Technische Universität München Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Institute for Informatics Technische Universität München Germany Matthias Wimmer Christoph Mayer Freek Stulp Bernd Radig

2/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Outline of this Presentation Infer semantic information Model fitting with learned objective functions Compute multi-band image representation image facial expression, gaze, identity, gender, age,… Part 2: Part 1: facial component correctly fitted face model

3/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Part 1: Compute Multi-band Image Representing Facial Components

4/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Motivation and Related Work  Descriptive feature representation of input image  Input for the subsequent process of model fitting  Quick computation  Similar approaches:  Stegman et al. (IVC2003) Stegman et al. Image and Vision Computing (2003)

5/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Our Idea  Multi-band image contains the location of facial components  Use classifier for this task.  Provide a multitude of features  Classifier decides which ones are relevant (→ quick)  Consider pixel features only (→ quick)  Pre-compute image characteristics and adjust pixel features (→ accurate) skin lips teeth brows pupils

6/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Prerequisites  Image data base (from Web; 500 images)  Face Locator: e.g. Viola and Jones  Computes rectangular regions around human faces  Manual annotations

7/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Probability Matrices  Indicate the probability of the pixels to denote a certain facial component.  Relative to the face rectangle  Learned offline skin brows pupils lower lip teeth

8/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Image Characteristics  Describe characteristics of the entire image  Computed by applying probability mask to face rectangle color  Distribution of color of all facial components  Gaussian distribution of color (Mean, covariance matrix) space  Distribution of pixel locations of all facial components  Gaussian distribution of locations (Mean, covariance matrix) example imagespatial distribution of skin colorskin color distribution

9/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Multitude of Pixel Features color Static pixel features  Color (RGB, NRGB, HSV, YCbCr) 16 features Adjusted pixel features  Color relative to mean color (Euclidean, Mahalanobis) ~ 90 features space  Coordinates (Cartesian, Polar)  Coordinates relative to mean location of facial components (Euclidean, Mahalanobis)

10/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Evaluation: Train various Classifiers  4 Classifiers for each facial component  C1: static feature only  C2: adjusted color features only  C3: adjusted location features only  C4: all features

11/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Classifiers for Lips and Teeth C1C2C3C4 77.3%94.3%90.4%97.7% C1C2C3C4 74.3%66.2%87.9%95.0%

12/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Part 2: Model Fitting with Learned Objective Functions

13/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Model-based Image Interpretation  The model The model contains a parameter vector that represents the model’s configuration.  The objective function Calculates a value that indicates how accurately a parameterized model matches an image.  The fitting algorithm Searches for the model parameters that describe the image best, i.e. it minimizes the objective function.

14/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Local Objective Functions

15/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Ideal Objective Functions P1:Correctness property: Global minimum corresponds to the best fit. P2:Uni-modality property: The objective function has no local extrema. ¬ P1 P1 ¬P2 P2  Don’t exist for real-world images  Only for annotated images: f n ( I, x ) = | c n – x |

16/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Learning the Objective Function x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x  Ideal objective function generates training data  Machine Learning technique generates calculation rules ideal objective function training data learned objective function

17/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Benefits of the Machine Learning Approach  Accurate and robust calculation rules  Locally customized calculation rules  Generalization from many images  Simple job for the designer  Critical decisions are automated  No domain-dependent knowledge required  No loops

18/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Evaluation 1: Displacing the Correct Model statistics-based objective function ideal objective function learned objective function

19/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Evaluation 2: Selected Features contour point 116

20/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Conclusion  Crucial decisions within Computer Vision algorithms  Don’t solve by trial and error → Learn from training data  Example 1: Learned classifiers for facial components  Example 2: Learned objective functions

21/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Outlook  More features for learning objective functions  Higher number of features  Other kinds of features: SIFT, LBP, …  Learn with better classifiers  Relevance Vector Machines  Boosted regressors  Training images: render faces with AAM  Exact ground truth (no manual work required)  Many images  Learn global objective function  Learn rules to directly update model parameter

22/22 Face Model Fitting based on Machine Learning from Multi-band Images of Facial Components Technische Universität München – Matthias Wimmer Thank you! Online-Demonstration: