Extensions of PCA and Related Tools

Presentation transcript:

PCA: Lecture 3

Extensions of PCA and Related Tools
- Extended EOF (EEOF), Singular Spectrum Analysis (SSA), M-SSA
- Canonical Correlation Analysis (CCA)
- Others: Complex EOFs, Maximum Covariance Analysis (MCA), Principal Oscillation Patterns (POP), Independent Component Analysis (ICA)

Singular Spectrum Analysis (SSA) or Extended EOF (EEOF)

PCA makes use of correlation in SPACE
- Weather and climate data (and other geoscience data) usually have high correlation in space.
- PCA is a useful tool for learning about the large-scale patterns that explain most of the variability.
- Because PCs are the combinations of variables that explain most of the variability, they implicitly exploit this high spatial correlation.

But geoscience data are often correlated in TIME
- PCA does not take this into account.
- Auto- and cross-correlation in time can be very useful for prediction and for building probabilistic time series models.

SSA/EEOFs are used to handle temporal correlation
- EEOFs extend the traditional EOF technique to deal with temporal as well as spatial correlations observed in (weather/climate) data.
- The method is based on the auto-covariance matrix (instead of the usual spatial covariance matrix from PCA).
- It is normally used to find propagating or periodic signals in the data.

Extended EOF (EEOF)

Implementation for the univariate case
- Consider a single time series x_t, t = 1, …, n.
- As in PCA, eigenvectors and eigenvalues are extracted from the covariance matrix.
- The covariance matrix is calculated using a delay window, i.e. by imposing an embedding dimension of length M on the time series.
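The univariate procedure can be sketched in a few lines of NumPy. This is a minimal illustration, not the lecture's own code: the function name `ssa`, the trajectory-matrix estimate of the lag-covariance matrix, and the synthetic test series are all assumptions made for the example.

```python
import numpy as np

def ssa(x, M):
    """Univariate SSA/EEOF sketch: PCA of the lag-covariance matrix."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()                      # work with anomalies
    K = x.size - M + 1                    # number of overlapping windows
    # Trajectory matrix: row k holds the window x[k], ..., x[k + M - 1]
    X = np.stack([x[k:k + M] for k in range(K)])
    C = X.T @ X / K                       # M x M lag-covariance matrix
    eigvals, eigvecs = np.linalg.eigh(C)  # eigh returns ascending order
    order = np.argsort(eigvals)[::-1]     # re-sort by descending variance
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    pcs = X @ eigvecs                     # time PCs (T-PCs), shape (K, M)
    return eigvals, eigvecs, pcs

# Hypothetical usage: a 20-step oscillation plus weak noise
rng = np.random.default_rng(0)
t = np.arange(400)
series = np.sin(2 * np.pi * t / 20) + 0.1 * rng.standard_normal(t.size)
eigvals, eigvecs, pcs = ssa(series, M=40)
explained = eigvals / eigvals.sum()       # fraction of variance per T-EOF
```

The window length M is the main tuning choice: it has to be long enough to contain the oscillation of interest, but each extra lag reduces the number of windows available to estimate the covariance matrix.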

Singular Spectrum Analysis (SSA)

Terminology
- SSA is the application of PCA to time series.
- It is also known as EEOFs and Time PCs (T-PCs or T-EOFs).
- When applied to multivariate data (many time series) it is known as multi-channel singular spectrum analysis (M-SSA).

Summary of what it does
- It applies PCA to a time series that has been structured into overlapping moving windows of data.
- The data vectors are fragments of time series rather than spatial distributions of values at a single time.
- The eigenvectors therefore represent characteristic time patterns rather than characteristic spatial patterns.
- It is used mainly to identify oscillatory features in the time series.

Singular Spectrum Analysis (SSA)

Example application: searching for sub-seasonal oscillations in the tropical Pacific using Outgoing Longwave Radiation (OLR).
From Hannachi et al., Int. J. Clim., 2007

Singular Spectrum Analysis (SSA)

Applying PCA and then SSA gives the seasonal cycle as the first PC/EOF.
From Hannachi et al., Int. J. Clim., 2007

Singular Spectrum Analysis (SSA)

EEOF/SSA can detect oscillatory or quasi-oscillatory features in the time series
- as a pair of (degenerate) T-PCs
- with the same shape but offset by ¼ cycle
- compare with Fourier analysis and its pairs of sine and cosine functions

EPCs 4 and 5: semi-annual variation in OLR.
EPCs 8 and 9: Madden-Julian Oscillation (MJO), an eastward-propagating wave of tropical convective anomalies (the dominant mode of intra-seasonal tropical variability).

From Hannachi et al., Int. J. Clim., 2007
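The paired, quarter-cycle-offset behaviour is easy to reproduce on synthetic data. The sketch below assumes a noise-free sine wave with a 20-step period and a 40-step window (both values are illustrative choices, not taken from the OLR study). It checks that the two leading eigenvalues are nearly equal and that the two leading eigenvectors match once one of them is shifted by a quarter period.

```python
import numpy as np

period, M = 20, 40                      # illustrative period and window length
t = np.arange(200)
x = np.sin(2 * np.pi * t / period)
x = x - x.mean()

# Lag-covariance matrix from overlapping windows of length M
K = x.size - M + 1
X = np.stack([x[k:k + M] for k in range(K)])
eigvals, eigvecs = np.linalg.eigh(X.T @ X / K)
eigvals, eigvecs = eigvals[::-1], eigvecs[:, ::-1]   # descending order

# A pure oscillation shows up as a pair of nearly equal eigenvalues ...
pair_ratio = eigvals[1] / eigvals[0]

# ... whose eigenvectors have the same shape, offset by a quarter cycle
shift = period // 4
e1, e2 = eigvecs[:, 0], eigvecs[:, 1]
quadrature = abs(np.corrcoef(e1[shift:], e2[:-shift])[0, 1])
```

Here `pair_ratio` close to 1 flags the degenerate pair, and `quadrature` close to 1 confirms the sine/cosine-like phase offset. With real data both values are only approximate, so the pair has to be judged against sampling uncertainty.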

Canonical Correlation Analysis (CCA)

Definition of CCA
- CCA identifies a sequence of pairs of patterns in two multivariate data sets, and constructs sets of transformed variables by projecting the original data onto these patterns.

Difference between PCA and CCA
- PCA looks for patterns within a single multivariate data set that represent maximum amounts of the variation in the data.
- In CCA, the patterns are chosen such that the data projected onto them exhibit maximum correlation, while being uncorrelated with the projections onto any other pattern.
- In other words, CCA identifies new variables that maximize the inter-relationships between two data sets, in contrast to the PCA patterns, which describe the internal variability within a single data set.

Link to multiple regression
- CCA can be thought of as an extension of multiple regression: instead of predicting a scalar y, we are predicting a vector y.

Canonical Correlation Analysis (CCA)

Applications
- In the atmospheric sciences, CCA has been used in diagnostic climatological studies, in forecasting El Niño, and in long-range forecasts of temperature and precipitation.

Example for a geophysical field
- Vector x contains observations of one variable at a set of locations.
- Vector y contains observations of a different variable at a set of locations that may be the same as or different from those in x.
- Typically the data are time series of observations of the two fields.
- x and y could be observed at the same time (coupled variability).
- x and y could be lagged in time (statistical prediction).

Canonical Correlation Analysis (CCA)

How to do it
- CCA extracts relationships between pairs of data vectors x and y from their joint covariance matrix.
- Remember: PCA is applied to the covariance matrix of x only.
- Concatenate x and y into a single vector, c^T = [x^T, y^T].
- Partition the covariance matrix of c, S_c, into four blocks.
- Transform the data, x and y, into sets of new variables (canonical variates), v and w:
  v = a^T x
  w = b^T y
  where a and b are linear weights (like eigenvectors) called canonical vectors.
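The partition equation from the slide did not survive in this transcript. In the standard formulation, which is assumed here, the four blocks are the two within-field covariance matrices and the two cross-covariance matrices. The block symbols S_xx, S_xy, S_yx, S_yy and the pair index m are notation introduced for this sketch.

```latex
\[
\mathbf{c} = \begin{bmatrix} \mathbf{x} \\ \mathbf{y} \end{bmatrix},
\qquad
\mathbf{S}_c =
\begin{bmatrix}
\mathbf{S}_{xx} & \mathbf{S}_{xy} \\
\mathbf{S}_{yx} & \mathbf{S}_{yy}
\end{bmatrix},
\qquad
\mathbf{S}_{yx} = \mathbf{S}_{xy}^{T}
\]
\[
v_m = \mathbf{a}_m^{T}\mathbf{x},
\qquad
w_m = \mathbf{b}_m^{T}\mathbf{y}
\]
```

PCA of x alone would use only the upper-left block S_xx; the off-diagonal blocks carry the information about how the two fields co-vary.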

Canonical Correlation Analysis (CCA)

Some things to note
- The number of pairs of canonical variates is min(dim(x), dim(y)).
- a and b are chosen such that corr[v_1, w_1] ≥ corr[v_2, w_2] ≥ … ≥ corr[v_M, w_M] ≥ 0 (each of the M pairs of canonical variates exhibits no greater correlation than the previous pair).
- corr[v_k, w_m] = r_C(m) for k = m and corr[v_k, w_m] = 0 for k ≠ m, where r_C are the canonical correlations (each canonical variate is uncorrelated with all other variates except its twin in the m-th pair).

Calculation of canonical vectors and variates
- An eigen decomposition gives two sets of eigenvectors, e_m and f_m, and shared eigenvalues λ; r_C = sqrt(λ).
- The calculation can also be done using SVD.

Combining CCA and PCA
- Sometimes it is worth performing PCA on the two fields x and y and then CCA on the leading PCs u_x and u_y.
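One way to carry out the SVD route is sketched below, assuming both covariance matrices are full rank. The function names `inv_sqrt` and `cca` and the synthetic data are hypothetical. The singular values are the canonical correlations directly, so their squares correspond to the shared eigenvalues λ of the eigen-decomposition route.

```python
import numpy as np

def inv_sqrt(S):
    """Inverse symmetric square root of a positive-definite matrix."""
    vals, vecs = np.linalg.eigh(S)
    return vecs @ np.diag(1.0 / np.sqrt(vals)) @ vecs.T

def cca(X, Y):
    """CCA sketch. X is (n, p), Y is (n, q); rows are observation times."""
    Xc, Yc = X - X.mean(axis=0), Y - Y.mean(axis=0)
    n = X.shape[0]
    Sxx, Syy = Xc.T @ Xc / (n - 1), Yc.T @ Yc / (n - 1)
    Sxy = Xc.T @ Yc / (n - 1)
    Wx, Wy = inv_sqrt(Sxx), inv_sqrt(Syy)
    # SVD of the whitened cross-covariance; min(p, q) singular values
    U, r, Vt = np.linalg.svd(Wx @ Sxy @ Wy, full_matrices=False)
    A = Wx @ U        # columns are canonical vectors a_m
    B = Wy @ Vt.T     # columns are canonical vectors b_m
    return r, A, B

# Hypothetical data: the first y component is a noisy copy of x_1
rng = np.random.default_rng(1)
n = 1000
X = rng.standard_normal((n, 3))
Y = np.column_stack([X[:, 0] + 0.5 * rng.standard_normal(n),
                     rng.standard_normal(n)])
r, A, B = cca(X, Y)
V = (X - X.mean(axis=0)) @ A   # canonical variates v_m
W = (Y - Y.mean(axis=0)) @ B   # canonical variates w_m
```

When the fields have many more grid points than time samples, the covariance matrices become singular and this direct approach fails, which is one practical reason for applying PCA first and running CCA on the leading PCs.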

Canonical Correlation Analysis (CCA)

A simple example
- Consider two normally distributed 2-D variables x and y with unit variance.
- Let y_1 + y_2 = x_1 + x_2.
- The correlation between x and y (matrix shown on the original slide) is relatively weak despite the perfect linear relationship between x and y.

If we apply CCA
- The largest and only canonical correlation is 1, and it lies along the direction of the linear relationship.
- If we project the data onto the canonical vectors, we obtain the correlation matrix of the canonical variates (matrix shown on the original slide).
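The example can be checked numerically. The slide does not say how x and y were generated, so the construction below is only one assumed way to satisfy its conditions (unit variances and y_1 + y_2 = x_1 + x_2); the pairwise correlations it produces need not match the values on the original slide.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5000
x = rng.standard_normal((n, 2))                  # x1, x2: unit variance
z = np.sqrt(0.5) * rng.standard_normal(n)        # independent of x
half_sum = 0.5 * x.sum(axis=1)
y = np.column_stack([half_sum + z, half_sum - z])  # y1 + y2 = x1 + x2

# Ordinary correlations between individual components of x and y
# (about 0.5 each for this particular construction)
R = np.corrcoef(np.hstack([x, y]), rowvar=False)
R_xy = R[:2, 2:]

def inv_sqrt(S):
    """Inverse symmetric square root of a positive-definite matrix."""
    vals, vecs = np.linalg.eigh(S)
    return vecs @ np.diag(1.0 / np.sqrt(vals)) @ vecs.T

# Canonical correlations from the whitened cross-covariance matrix
xc, yc = x - x.mean(axis=0), y - y.mean(axis=0)
Sxx, Syy = xc.T @ xc / (n - 1), yc.T @ yc / (n - 1)
Sxy = xc.T @ yc / (n - 1)
r = np.linalg.svd(inv_sqrt(Sxx) @ Sxy @ inv_sqrt(Syy), compute_uv=False)
```

The leading canonical correlation `r[0]` comes out at 1 to within rounding, because the sums x_1 + x_2 and y_1 + y_2 are identical, while the second one is small and reflects only sampling noise.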

Canonical Correlation Analysis (CCA)

Example application: prediction of wildfire in the western U.S.
- Seasonal wildfire forecasts are based on spring PDSI.
- CCA is used to form linear relationships between PCs of seasonal acres burned (field 1) and PDSI (field 2).
- This finds optimally correlated patterns in the area burned and the preceding soil moisture.
- A linear forecast model was constructed using the first three canonical correlation pairs (CCs) calculated for the six area-burned PCs and the six PDSI PCs.

BUT longer lead-time forecasts are needed
- Previously, forecasts were based on March/April PDSI data, but policy decisions must be made many months before the fire season.
- So CCA is used to form relationships between the previous year's Pacific SSTs and January PDSI.

Prediction of area burned for the 2003 fire season.
From Westerling et al., 2003, "Statistical Forecasts of the 2003 Western Wildfire Season Using Canonical Correlation Analysis"

Other Extensions and Some Relatives

Complex EOF
- To extend EOF analysis to the study of spatial structures that can propagate in time, one can perform a complex principal component analysis in the frequency domain.

Maximum Covariance Analysis (MCA)
- Finds linear combinations of two sets of vector data, x and y, that maximize their covariance (CCA maximizes their correlation).

Independent Component Analysis (ICA)
- ICA seeks directions that are most statistically independent, i.e. that minimize the mutual information between the data.

Principal Oscillation Patterns (POP)
- POPs are used to examine the oscillation properties and spatial structure of dynamical processes in the atmosphere.
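Of these relatives, MCA is the simplest to sketch: it is commonly computed as an SVD of the cross-covariance matrix between the two fields, with no whitening step. The function name `mca`, the returned squared covariance fraction, and the synthetic fields below are assumptions made for illustration.

```python
import numpy as np

def mca(X, Y):
    """MCA sketch: SVD of the cross-covariance matrix of X (n, p) and Y (n, q)."""
    Xc, Yc = X - X.mean(axis=0), Y - Y.mean(axis=0)
    Sxy = Xc.T @ Yc / (X.shape[0] - 1)
    U, s, Vt = np.linalg.svd(Sxy, full_matrices=False)
    scf = s**2 / np.sum(s**2)   # squared covariance fraction per mode
    return U, Vt.T, s, scf

# Hypothetical fields that share one common time signal
rng = np.random.default_rng(2)
n = 500
signal = rng.standard_normal(n)
X = np.outer(signal, [1.0, 0.5, -0.5]) + 0.3 * rng.standard_normal((n, 3))
Y = np.outer(signal, [0.8, -0.2, 0.4, 0.1]) + 0.3 * rng.standard_normal((n, 4))

U, V, s, scf = mca(X, Y)
a = (X - X.mean(axis=0)) @ U[:, 0]   # expansion coefficients, field x
b = (Y - Y.mean(axis=0)) @ V[:, 0]   # expansion coefficients, field y
```

The covariance between the leading expansion coefficients `a` and `b` equals the leading singular value `s[0]`. Unlike CCA, the patterns in `U` and `V` are orthonormal and the method does not require inverting either field's covariance matrix.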