An Introduction to Functional Data Analysis Jim Ramsay McGill University.

Slides:



Advertisements
Similar presentations
Independent Component Analysis
Advertisements

Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.
STAT 497 APPLIED TIME SERIES ANALYSIS
Gaussian process emulation of multiple outputs Tony O’Hagan, MUCM, Sheffield.
Handwriting: Registration and Differential Equations.
An Introduction to Functional Data Analysis Jim Ramsay McGill University.
Psychometrics, Dynamics, and Functional Data Analysis Jim Ramsay Jim Ramsay McGill University “The views “The views.
Jim Ramsay McGill University Basis Basics. Overview  What are basis functions?  What properties should they have?  How are they usually constructed?
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Mouse Livers: Derivatives and Functional Linear Models.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Lecture Notes for CMPUT 466/551 Nilanjan Ray
From Data to Differential Equations Jim Ramsay McGill University With inspirations from Paul Speckman and Chong Gu.
Discrim Continued Psy 524 Andrew Ainsworth. Types of Discriminant Function Analysis They are the same as the types of multiple regression Direct Discrim.
Functional Data Analysis T Chapters 10,11,12 Markus Kuusisto.
Human Growth: From data to functions. Challenges to measuring growth We need repeated and regular access to subjects for up to 20 years. We need repeated.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
CHAPTER 4: Parametric Methods. Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Parametric Estimation X = {
INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Phase and Amplitude Variation in Montreal Weather Jim Ramsay McGill University.
Correlation and Regression Analysis
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
From Data to Differential Equations Jim Ramsay McGill University.
Hydrologic Statistics
Human Growth: From data to functions. Challenges to measuring growth We need repeated and regular access to subjects for up to 20 years. We need repeated.
1 Linear Methods for Classification Lecture Notes for CMPUT 466/551 Nilanjan Ray.
Inference for regression - Simple linear regression
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
Presented By Wanchen Lu 2/25/2013
Gaussian process modelling
0 Pattern Classification, Chapter 3 0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda,
Functional linear models. Three types of linear model to consider: 1. Response is a function; covariates are multivariate. 2. Response is scalar or multivariate;
INTRODUCTION TO MACHINE LEARNING 3RD EDITION ETHEM ALPAYDIN © The MIT Press, Lecture.
Mouse Livers: Derivatives and Functional Linear Models.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
And, now take you into a WORLD of……………...
Geographic Information Science
Generalizing Linear Discriminant Analysis. Linear Discriminant Analysis Objective -Project a feature space (a dataset n-dimensional samples) onto a smaller.
Learning Theory Reza Shadmehr LMS with Newton-Raphson, weighted least squares, choice of loss function.
INTRODUCTION TO MACHINE LEARNING 3RD EDITION ETHEM ALPAYDIN Modified by Prof. Carolina Ruiz © The MIT Press, 2014 for CS539 Machine Learning at WPI
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Multivariate Data Analysis Chapter 1 - Introduction.
Stats 845 Applied Statistics. This Course will cover: 1.Regression –Non Linear Regression –Multiple Regression 2.Analysis of Variance and Experimental.
Data Science and Big Data Analytics Chap 3: Data Analytics Using R
Chapter1: Introduction Chapter2: Overview of Supervised Learning
ANOVA, Regression and Multiple Regression March
Introduction to Machine Learning Multivariate Methods 姓名 : 李政軒.
Introduction to statistics I Sophia King Rm. P24 HWB
Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Lendasse A., Corona F., Liitiäinen E.1 Functional Data Analysis CORONA FRANCESCO, Lendasse Amaury, Liitiäinen Elia.
Designs for Experiments with More Than One Factor When the experimenter is interested in the effect of multiple factors on a response a factorial design.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
State Space Representation
CH 5: Multivariate Methods
12 Inferential Analysis.
Bradley W. Vines McGill University
Human Growth: From data to functions
Descriptive Statistics vs. Factor Analysis
State Space Analysis UNIT-V.
Mouse Livers: Derivatives and Functional Linear Models
12 Inferential Analysis.
Learning Theory Reza Shadmehr
Multivariate Methods Berlin Chen
Multivariate Methods Berlin Chen, 2005 References:
Signals and Systems Revision Lecture 1
Mathematical Foundations of BME
Geology 491 Spectral Analysis
Regression and Correlation of Data
Presentation transcript:

An Introduction to Functional Data Analysis Jim Ramsay McGill University

What is functional data analysis? A data analysis is functional if either: The data to be analyzed are assumed to come from a smooth function (eg: time, space, space/time, ability, depression, frequency, molecular weight, etc.) The data to be analyzed are assumed to come from a smooth function (eg: time, space, space/time, ability, depression, frequency, molecular weight, etc.) The data are to be modeled by a smooth function (eg: probability density, dose response function, intensity function, etc.) The data are to be modeled by a smooth function (eg: probability density, dose response function, intensity function, etc.)

So what’s new about that? Two ideas are central: The functions are smooth, usually meaning that one or more derivatives can be estimated and are useful. The functions are smooth, usually meaning that one or more derivatives can be estimated and are useful. No assumptions, such as stationarity, low dimensionality, equally spaced sampling points, etc, are made about the functions or the data. No assumptions, such as stationarity, low dimensionality, equally spaced sampling points, etc, are made about the functions or the data.

Heights of 10 girls

Important features of growth data Times of measurements are not equally spaced. Times of measurements are not equally spaced. Growth is smooth; we want to study the second derivative of these curves. Growth is smooth; we want to study the second derivative of these curves. No assumptions are made initially about the shapes of these curves or their derivatives. No assumptions are made initially about the shapes of these curves or their derivatives. A girl’s entire height record is viewed as a single unitary observation. A girl’s entire height record is viewed as a single unitary observation.

What new challenges are there? Have a look at the estimated acceleration curves for these ten girls. Growth curves should be monotonic; how can we achieve this? Growth curves should be monotonic; how can we achieve this? How can we get a good estimate of acceleration? How can we get a good estimate of acceleration? What’s wrong with the mean acceleration curve? What’s wrong with the mean acceleration curve?

Acceleration curves for 10 girls

Phase and amplitude variation We see that acceleration curves vary in terms of: The intensity of the pubertal growth spurt. The intensity of the pubertal growth spurt. and its timing. and its timing. There is both amplitude and phase variation here. There is both amplitude and phase variation here. Unless we can remove the phase variation, the cross-sectional mean is worthless. Unless we can remove the phase variation, the cross-sectional mean is worthless.

Do standard data analyses have functional counterparts? They surely do. There are functional versions of: Analysis of variance Analysis of variance Multiple regression analysis Multiple regression analysis Principal components analysis Principal components analysis Canonical correlation analysis Canonical correlation analysis Cluster and classification analysis Cluster and classification analysis

Is there anything new in FDA? Because the functions we estimate are assumed smooth, we can model the dynamic behavior of the data. Because the functions we estimate are assumed smooth, we can model the dynamic behavior of the data. This means using differential equations to model how the output of an input/output system changes in response to changes in the input. This means using differential equations to model how the output of an input/output system changes in response to changes in the input.

Data from an oil refinery

The data are from a tray in a distillation column. The data are from a tray in a distillation column. The output is the top plot; the input is the bottom plot. The output is the top plot; the input is the bottom plot. The solid line is a model using a simple first order differential equation: The solid line is a model using a simple first order differential equation: Dx(t) = -βx(t) + αu(t) Dx(t) = -βx(t) + αu(t) where x(t) is the output function and u(t) is the input function. where x(t) is the output function and u(t) is the input function. How can we estimate such models from noisy observed data? How can we estimate such models from noisy observed data?

Where do we start? The first task is to learn methods for estimating smooth functions from discrete noisy data. The first task is to learn methods for estimating smooth functions from discrete noisy data. We use basis function expansions to model functions. We use basis function expansions to model functions. We impose smoothness using roughness penalties. We impose smoothness using roughness penalties.

And what’s next? Because most functional data show variation in both phase and amplitude, the next step is to learn how to separate phase from amplitude variation. Because most functional data show variation in both phase and amplitude, the next step is to learn how to separate phase from amplitude variation. This process is called curve registration. This process is called curve registration. After that, we can use functional versions of standard multivariate data analyses. After that, we can use functional versions of standard multivariate data analyses.

What about functional exploratory data analysis? As always, graphical display methods are indispensable. As always, graphical display methods are indispensable. We will focus on the phase/plane plot as a way of exploring the interplay between derivatives. We will focus on the phase/plane plot as a way of exploring the interplay between derivatives. Principal components and cluster analyses are also useful. Principal components and cluster analyses are also useful.

How do we use covariate information? Covariates or independent variables can be (a) multivariate and (b) functional. Covariates or independent variables can be (a) multivariate and (b) functional. Regression analysis with a functional response and multivariate covariates is fairly straightforward. Regression analysis with a functional response and multivariate covariates is fairly straightforward. Regressing on functional covariates leads to new challenges, however. Regressing on functional covariates leads to new challenges, however.

Can derivatives be used, too? Every function, whether directly fit to data, or estimated from non-functional data, is assumed to have one or more derivatives available for an analysis. Every function, whether directly fit to data, or estimated from non-functional data, is assumed to have one or more derivatives available for an analysis. A differential equation is a model that contains one or more derivatives as a part of the model. A differential equation is a model that contains one or more derivatives as a part of the model.

Unique Aspects of Functional Data Analysis The data are from a smooth process, so we can use derivatives in various ways. The data are from a smooth process, so we can use derivatives in various ways. Time itself may be an elastic medium, and vary over functional observations. Time itself may be an elastic medium, and vary over functional observations. Differential equations can play a big role in a functional data analysis. Differential equations can play a big role in a functional data analysis.

Finding out More Ramsay, J. O. and Silverman, B. W. (1997, 2004) Functional Data Analysis. Springer. Ramsay, J. O. and Silverman, B. W. (1997, 2004) Functional Data Analysis. Springer. Ramsay, J. O. and Silverman, B. W. Ramsay, J. O. and Silverman, B. W. (2002) Applied Functional Data Analysis. Springer (2002) Applied Functional Data Analysis. Springer Visit the FDA website: Visit the FDA website: Software in Matlab, R and S-PLUS available at ego.psych.mcgill.ca/pub/ramsay Software in Matlab, R and S-PLUS available at ego.psych.mcgill.ca/pub/ramsay