Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP

Slides:



Advertisements
Similar presentations
Beyond Streams and Graphs: Dynamic Tensor Analysis
Advertisements

Air Force Technical Applications Center 1 Subspace Based Three- Component Array Processing Gregory Wagner Nuclear Treaty Monitoring Geophysics Division.
Regression analysis Relating two data matrices/tables to each other Purpose: prediction and interpretation Y-data X-data.
pH Emission Spectrum Emission(3 λ) λ1 λ2 λ3 A λ λ1λ2λ3λ1λ2λ3 A Ex 1 Emission(3 λ) λ1λ2λ3λ1λ2λ3 A Ex 2 Emission(3 λ) λ1λ2λ3λ1λ2λ3 A Ex 3 λ1λ2λ3λ1λ2λ3.
بنام خدا 1. An Introduction to multi-way analysis Mohsen Kompany-Zareh IASBS, Nov 1-3, Session one.
Tensors and Component Analysis Musawir Ali. Tensor: Generalization of an n-dimensional array Vector: order-1 tensor Matrix: order-2 tensor Order-3 tensor.
Exploratory Factor Analysis
Fitting the PARAFAC model Giorgio Tomasi Chemometrics group, LMT,MLI, KVL Frederiksberg. Denmark
1 Maarten De Vos SISTA – SCD - BIOMED K.U.Leuven On the combination of ICA and CPA Maarten De Vos Dimitri Nion Sabine Van Huffel Lieven De Lathauwer.
1 Bootstrap Confidence Intervals for Three-way Component Methods Henk A.L. Kiers University of Groningen The Netherlands.
PARAFAC and Fluorescence Åsmund Rinnan Royal Veterinary and Agricultural University.
In the Name of God. Morteza Bahram Department of Chemistry, Faculty of Science, Urmia University, Urmia, Iran
By: S.M. Sajjadi Islamic Azad University, Parsian Branch, Parsian,Iran.
Principal Components An Introduction Exploratory factoring Meaning & application of “principal components” Basic steps in a PC analysis PC extraction process.
Factor Analysis Research Methods and Statistics. Learning Outcomes At the end of this lecture and with additional reading you will be able to Describe.
Removal of the 1st order Rayleigh scatter effect Åsmund Rinnan.
Dimensional reduction, PCA
CALIBRATION Prof.Dr.Cevdet Demir
Two-way Analysis of Three-way Data. Two-way Analysis of Two-way Data = X D Y D = X Y 23.
1 MCR-ALS analysis using initial estimate of concentration profile by EFA.
Microarray analysis Algorithms in Computational Biology Spring 2006 Written by Itai Sharon.
Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.
Principal Components Analysis (PCA) 273A Intro Machine Learning.
Exploring Microarray data Javier Cabrera. Outline 1.Exploratory Analysis Steps. 2.Microarray Data as Multivariate Data. 3.Dimension Reduction 4.Correlation.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Prof.Dr.Cevdet Demir
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.
1 5. Multiway calibration Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP.
1 2. The PARAFAC model Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP.
6-5 The Quadratic Formula and the Discriminant
Principal Components An Introduction
Principal Components Analysis BMTRY 726 3/27/14. Uses Goal: Explain the variability of a set of variables using a “small” set of linear combinations of.
1 6. Other issues Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP.
1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 6 Solving Normal Equations and Estimating Estimable Model Parameters.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 13: Between-Subjects Factorial Designs 1.
ASCA: analysis of multivariate data from an experimental design, Biosystems Data Analysis group Universiteit van Amsterdam.
Chapter 9 Factor Analysis
Informatics and Mathematical Modelling / Intelligent Signal Processing 1 Sparse’09 8 April 2009 Sparse Coding and Automatic Relevance Determination for.
Estimation of Number of PARAFAC Components
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 8 Analysis of Variance.
GRASP Learning a Kernel Matrix for Nonlinear Dimensionality Reduction Kilian Q. Weinberger, Fei Sha and Lawrence K. Saul ICML’04 Department of Computer.
MOMENT OF A FORCE SCALAR FORMULATION, CROSS PRODUCT, MOMENT OF A FORCE VECTOR FORMULATION, & PRINCIPLE OF MOMENTS Today’s Objectives : Students will be.
LAHW#13 Due December 20, Eigenvalues and Eigenvectors 5. –The matrix represents a rotations of 90° counterclockwise. Obviously, Ax is never.
THREE-WAY COMPONENT MODELS pages By: Maryam Khoshkam 1.
Reduces time complexity: Less computation Reduces space complexity: Less parameters Simpler models are more robust on small datasets More interpretable;
A Convergent Solution to Tensor Subspace Learning.
Equilibrium systems Chromatography systems Number of PCs original Mean centered Number of PCs original Mean centered
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Richard Brereton
MOMENT OF A FORCE (Section 4.1) Today’s Objectives : Students will be able to: a) understand and define moment, and, b) determine moments of a force in.
1 4. Model constraints Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP.
Multiway Data Analysis
Advanced Statistics Factor Analysis, I. Introduction Factor analysis is a statistical technique about the relation between: (a)observed variables (X i.
1 Robustness of Multiway Methods in Relation to Homoscedastic and Hetroscedastic Noise T. Khayamian Department of Chemistry, Isfahan University of Technology,
Principal Component Analysis
3 “Products” of Principle Component Analysis
Tensor Algorithms & Visualization By Arun Rao. Presentation Outline Introduction to Tensors Boussinesq Problem Visualizations Recent Uses.
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L11.1 Lecture 11: Canonical correlation analysis (CANCOR)
Central limit theorem revisited Throw a dice twelve times- the distribution of values is not Gaussian Dice Value Number Of Occurrences.
Unsupervised Learning II Feature Extraction
Sect. 4.5: Cayley-Klein Parameters 3 independent quantities are needed to specify a rigid body orientation. Most often, we choose them to be the Euler.
Estimation Techniques for High Resolution and Multi-Dimensional Array Signal Processing EMS Group – Fh IIS and TU IL Electronic Measurements and Signal.
Refitting PCA/MPCA and CLS/PARAFAC Models to Incomplete Data Records
HOMOGENEOUS REPRESENTATION
Descriptive Statistics vs. Factor Analysis
Feature space tansformation methods
Factor Analysis (Principal Components) Output
Lecture 8: Factor analysis (FA)
AD5933 報告人:葉榮陞 指導教授:沈毓泰.
Exploratory Factor Analysis. Factor Analysis: The Measurement Model D1D1 D8D8 D7D7 D6D6 D5D5 D4D4 D3D3 D2D2 F1F1 F2F2.
Presentation transcript:

Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP 3. The Tucker3 model Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP

The PARAFAC model has a strict trilinear structure: PARAFAC & Tucker3 The PARAFAC model has a strict trilinear structure: xijk = airbjrckr + eijk Another generalization of PCA for multiway data is able to use a different number of components for each mode: the Tucker3 model. = + + etc.

= + The Tucker3 model CT G BT X E A C (K  4) B (J  2) core array I A J X (I  J  K) E (I  J  K) A (I  3) e.g. Mode I has chemical rank 3 Mode J has chemical rank 2 Mode K has chemical rank 4

X = AG (CB)T + E The Tucker3 formula Loadings Core array A (I  R1) describes variation in the first mode B (J  R2) describes variation in the second mode C (K  R3) describes variation in the third mode Core array G (R1  R2  R3) is matricized into GR1R2R3 (R1  R2R3)

What does the core array mean? The core array describes the significance of the interactions between the different loadings, e.g. the Tucker3 (2,2,2) model can be written as X = g111a1(c1b1)T + g112a1(c2b1)T + g121a1(c1b2)T + g122a1(c2b2)T + g211a2(c1b1)T + g212a2(c2b1)T + g221a2(c1b2)T + g222a2(c2b2)T + E g111=97, this triad is important 97 2.1 26 27 3 41 6 R1 R2 R3 g211=0, this interaction does not exist

How many components to use in each mode How many components to use in each mode? Unfold along each mode and look at the eigenvalues XIJK First mode XJKI Select 2 PC’s for second mode 1 2 3 4 5 6 7 8 9 0.5 1.5 2.5 3.5 4.5 Eigenvalue vs. PC Number PC Number Eigenvalue Second mode XKIJ Select 3 PC’s for third mode 1 2 3 4 5 6 7 8 9 0.5 1.5 2.5 3.5 4.5 Eigenvalue vs. PC Number PC Number Eigenvalue Third mode ‘Knee’ here - select 4 PC’s for first mode Try Tucker3(4,2,3)

When to use PARAFAC or Tucker3? Fluorescence data: Try Tucker3 (4,2,3) X 5 samples rank = 3 61 excitation ’s 201 emission ’s rank = 3 X rank = 3 35 batches Try 3-component PARAFAC or Tucker3 (3,3,3) rank = 4 24 hours rank = 3 12 sensors rank = 2 Process data:

PARAFAC as a restricted Tucker model C (K  R) CT B (J  R) = + I BT X E core array I (R  R  R) K I A J X (I  J  K) E (I  J  K) A (I  R) PARAFAC is a type of Tucker model for which the core array is a superidentity, I. 1 1 R3 R1 R2

Can have different number of components in each mode. Core array, G. PARAFAC vs Tucker3 PARAFAC Tucker3 Same number of components in each mode. Core array is superidentity, I. Can have different number of components in each mode. Core array, G. Solution is unique. Easy to interpret. Rotational freedom. More difficult to interpret. Strict, trilinear model. Good for some types of data. Multiway subspace model. Good for exploratory analysis. Algorithm sometimes slow and problematic. Algorithm fast and robust.

The Tucker3 model is good for Conclusions The Tucker3 model is good for general exploratory analysis multiway data which have modes of different rank Like PARAFAC, the Tucker3 model is estimated using ALS, with an extra step for the estimation of G. Like PCA, Tucker loadings have rotational freedom, making model interpretation more difficult than for PARAFAC. The use of constraints can help. Restricted Tucker3 models have been used for chemical calibration (more about this later...).