1 4. Model constraints Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP.

Slides:



Advertisements
Similar presentations
Richard Young Optronic Laboratories Kathleen Muray INPHORA
Advertisements

Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
ADHD Reaction Times: Densities, Mixed Effects, and PCA.
Active Shape Models Suppose we have a statistical shape model –Trained from sets of examples How do we use it to interpret new images? Use an “Active Shape.
Calibration methods Chemistry 243.
Analysis of the Visible Absorption Spectrum of I 2 in Inert Solvents Using a Physical Model Joel Tellinghuisen Department of Chemistry Vanderbilt University.
Combined evaluation of PFNS for 235 U(n th,f), 239 Pu(n th,f), 233 U(n th,f) and 252 Cf(sf) (in progress) V.G. Pronyaev Institute of Physics.
Regression analysis Relating two data matrices/tables to each other Purpose: prediction and interpretation Y-data X-data.
pH Emission Spectrum Emission(3 λ) λ1 λ2 λ3 A λ λ1λ2λ3λ1λ2λ3 A Ex 1 Emission(3 λ) λ1λ2λ3λ1λ2λ3 A Ex 2 Emission(3 λ) λ1λ2λ3λ1λ2λ3 A Ex 3 λ1λ2λ3λ1λ2λ3.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
L.M. McMillin NOAA/NESDIS/ORA Regression Retrieval Overview Larry McMillin Climate Research and Applications Division National Environmental Satellite,
Nonlinear Regression Ecole Nationale Vétérinaire de Toulouse Didier Concordet ECVPT Workshop April 2011 Can be downloaded at
Chem. 31 – 4/15 Lecture. Announcements I Exam 2 – Hope to have graded by next Monday Lab Reports –AA report now due Monday, 4/20 (best to try to complete.
No Data Left Behind Modeling Colorful Compounds in Chemical Equilibria Mike DeVries D. Kwabena Bediako Prof. Douglas A. Vander Griend.
11.1 Introduction to Response Surface Methodology
Response Surface Method Principle Component Analysis
By: S.M. Sajjadi Islamic Azad University, Parsian Branch, Parsian,Iran.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
Experiment 6 Amount of Dye in a Sports Drink. Goal To make a Beer’s Law standard curve To use the standard curve (and spectrophotometry) to determine.
Removal of the 1st order Rayleigh scatter effect Åsmund Rinnan.
The rank of a product of two matrices X and Y is equal to the smallest of the rank of X and Y: Rank (X Y) =min (rank (X), rank (Y)) A = C S.
CALIBRATION Prof.Dr.Cevdet Demir
Chromatographic detectors for Liquid Chromatography.
Initial estimates for MCR-ALS method: EFA and SIMPLISMA
1 MCR-ALS analysis using initial estimate of concentration profile by EFA.
Ordinary least squares regression (OLS)
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Prof.Dr.Cevdet Demir
Linear and generalised linear models
1 5. Multiway calibration Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP.
1 2. The PARAFAC model Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP.
Measurement of the Branching fraction B( B  D* l ) C. Borean, G. Della Ricca G. De Nardo, D. Monorchio M. Rotondo Riunione Gruppo I – Napoli 19 Dicembre.
Introduction to API Process Simulation
CALIBRATION METHODS.
1 SVY207: Lecture 18 Network Solutions Given many GPS solutions for vectors between pairs of observed stations Compute a unique network solution (for many.
Spectroscopy Chromatography PhysChem Naming Drawing and Databasing Enterprise Solutions Software for Interactive Curve Resolution using SIMPLISMA Andrey.
1 6. Other issues Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP.
1 A Fast-Nonegativity-Constrained Least Squares Algorithm R. Bro, S. D. Jong J. Chemometrics,11, , 1997 By : Maryam Khoshkam.
Peak-purity by LC-MS and LC-DAD Knut Dyrstad Erlend Hvattum Sharon Jara Arnvid Lie.
Threeway analysis Batch organic synthesis. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.
CSC321: Neural Networks Lecture 2: Learning with linear neurons Geoffrey Hinton.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Factor Analysis Psy 524 Ainsworth. Assumptions Assumes reliable correlations Highly affected by missing data, outlying cases and truncated data Data screening.
CALIBRATION METHODS. For many analytical techniques, we need to evaluate the response of the unknown sample against the responses of a set of standards.
1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 8 Analysis of Variance.
Principal Component Analysis (PCA). Data Reduction summarization of data with many (p) variables by a smaller set of (k) derived (synthetic, composite)
© Huw Owens - University of Manchester : Interactions of Light and Matter Refraction Reflection Absorption Dr Huw Owens.
Solutions. 1.The tensile strength of concrete produced by 4 mixer levels is being studied with 4 replications. The data are: Compute the MS due to mixers.
Principal Components Analysis. Principal Components Analysis (PCA) A multivariate technique with the central aim of reducing the dimensionality of a multivariate.
THREE-WAY COMPONENT MODELS pages By: Maryam Khoshkam 1.
Grey modeling approaches to investigate chemical processes Romà Tauler 1 and Anna de Juan 2 IIQAB-CSIC 1, UB 2 Spain
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Richard Brereton
Multiway Data Analysis
Principal Component Analysis (PCA)
Principal Component Analysis (PCA)
Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP
ISBE-AstraZeneca Strategic Alliance Project # 42 Modelling DCE-MRI Arterial Input Functions in rats Deirdre McGrath, Strategic Alliance meeting 1 st July.
Laser light stimulates vibration, causing some energy to be absorbed and the rest to be scattered as a slightly red-shifted photon. Raman Microspectroscopic.
1 Robustness of Multiway Methods in Relation to Homoscedastic and Hetroscedastic Noise T. Khayamian Department of Chemistry, Isfahan University of Technology,
3 “Products” of Principle Component Analysis
Central limit theorem revisited Throw a dice twelve times- the distribution of values is not Gaussian Dice Value Number Of Occurrences.
Central limit theorem - go to web applet. Correlation maps vs. regression maps PNA is a time series of fluctuations in 500 mb heights PNA = 0.25 *
1 C.A.L. Bailer-Jones. Machine Learning. Data exploration and dimensionality reduction Machine learning, pattern recognition and statistical data modelling.
Curve fitting methods for complex and rank deficient chemical data
Rotational Ambiguity in Hard-Soft Modeling Method
X.1 Principal component analysis
5.4 General Linear Least-Squares
5.1 Introduction to Curve Fitting why do we fit data to a function?
X.6 Non-Negative Matrix Factorization
Cu2+ + 4NH3 → Cu(NH3)42+ (deep blue)
Samuel T. Hess, Watt W. Webb  Biophysical Journal 
Presentation transcript:

1 4. Model constraints Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP

2 Principal component analysis (PCA) In Hotelling’s (1933) approach, components have maximum variance. –X = TP T + E –Components are calculated successively. –Components are orthogonal: T T T = Diagonal; P T P = I In Pearson’s (1901) and Eckart & Young’s (1936) approach, components explain maximum amount of variance in the variables. –X = AB T + E –Components are calculated simultaneously. –Components have no orthogonality or unit-length constraints.

3 Constrained least squares Solve under the constraint that A is non-negative, unimodal, smooth etc. Some constraints are inactive, e.g. PCA under orthogonality. If constraints are active, A is no longer the least- squares solution.

4 Why use constraints? Obtain solutions that correspond to known chemistry, making the model more interpretable. –Concentrations can not be negative. Obtain models that are uniquely identified. –Remove rotational ambiguity. Avoid numerical problems such as local minima and swamps. –Constraints can help ALS find the correct solution

5 Example: curve resolution of HPLC data (1) HPLC analysis of three coeluting organophosphorus pesticides. Diode-array detector gives a spectrum at each time point: X (time  wavelength). Beer-Lambert law says X = CS T + E. Initial analysis shows that three analytes are present. Data is from Roma Tauler’s web-site Download it and try for yourself!

6 Example: curve resolution of HPLC data (2) Unconstrained solution % of X explained Calculation time: 0.43 seconds CS

7 Example: curve resolution of HPLC data (2) Non-negativity constraints % of X explained Calculation time: 16 seconds CS

8 Example: curve resolution of HPLC data (3) Unimodality & non-negativity constraints % of X explained Calculation time: 16 minutes CS

9 Comments Active constraints always reduce % fit, but can give a more interpretable model. E3E3 X3X3 C3C3 E2E2 X2X2 C2C2 It is possible to ‘stack’ two–way data from different experiments, e.g. =+ E1E1 X1X1 C1C1 S

10 What sort of constraints might be useful? Hard target: known spectrum, a r = s Non-negativity: concentrations, absorbances Monotonicity: kinetic profiles Unimodality: elution profiles, fluorescence excitations Other curve shapes: Gaussian peaks, symmetry Selectivity: pure variables Functional constraints: first-principle models Closure: [A] t + [B] t + [C] t = y Orthogonality: useful for separation of variances...plus many more...

11 Conclusions (1) It is possible to mix constraints within the same mode, i.e. loadings 1 and 3 are non-negative, loadings 2 are unimodel. Chemical knowledge can be included in your model by using constraints. Constraints can improve the model making it –closer to reality –easier to understand –more robust to extrapolation

12 Conclusions (2) Mixed constraints can be applied using column-wise estimation: 1. Subtract contribution from other components 2. Estimate component under desired constraint –Bro & Sidiropoulos (1998) have shown that this is equivalent to solving where  is the unconstrained solution,

13 ALS for Tucker3 Step 0: Initialise B, C & G Step 1: Estimate A: Step 4: Estimate G: Step 5: Check for convergence. If not, go to Step 1. Step 3: Estimate C in same way: Step 2: Estimate B in same way:

14 Example: UV-Vis monitoring of a chemical reaction (1) Two-step conversion reaction under pseudo-first-order kinetics: A + B  C  D + E UV-Vis spectrum ( nm) measured every 10 seconds for 45 minutes 30 normal batches measured: X (30  201  271) 9 disturbed batches: pH changes made during the reaction

15 Example: UV-Vis monitoring of a chemical reaction (2) 3-component PARAFAC model has problems! spectra are difficult to interpret  highly correlated 

16 Example: UV-Vis monitoring of a chemical reaction (3) External process information A D No compound interactions allowed: Lambert-Beer law First-order reaction kinetics are known: Pure spectra of reactant and product known:

17 Example: UV-Vis monitoring of a chemical reaction (4) Constrained Tucker3 (1,3,3) model B C G = wavelength time batch A + E REACTION KINETICS KNOWN SPECTRA LAMBERT- BEER LAW X X = AG (C  B) T + E

18 Example: UV-Vis monitoring of a chemical reaction (5) Constrained Tucker3 (1,3,3) model Core array: G = [g | 0 g | 0 0 g 133 ] * * * * * fixed to known spectrum fixed to 1 st -order kinetics Rate constants are found! k 1 = 0.27, k 2 = Spectrum of intermediate is found!

19 Conclusions (3) It is possible to build ‘hybrid’ or ‘grey’ models where some loadings are constrained and others are left free – see the extra material which follows! If you already have some information about your chemical process, then include it in your model Using constraints can really help to uncover new information about your data (e.g. find spectra, estimate rate constants, test models).

20 Extra material: Black vs white models ‘Black-box’ or ‘soft’ models are empirical models which aim to fit the data as well as possible e.g. PCA, neural networks ‘White’ or ‘hard’ models use known external knowledge of the process e.g. physicochemical model, mass-energy balances Difficult to interpret Good fit Easy to interpret Not always available Good fit ‘Grey’ or ‘hybrid’ models combine the two. +

21 Extra material: Grey models mix black and white models ++ Total variation Systematic variation due to known causes Systematic variation due to unknown causes Unsystem- atic variation RESIDUALSMODEL REACTION KINETICS MECHANISTIC MODEL KNOWN CONCENTRATIONS

22 Extra material: Grey model B C G + = wavelength time batch A B C G + E REACTION KINETICS KNOWN SPECTRA LAMBERT- BEER LAW X A

23 Extra material: Grey model parameter estimation

24 Extra material: Grey model parameters White components Black components describe known effects can be interpreted 99.8% fit (corresponds well with estimated level of spectral noise of  0.13%) * * * * *

25 Extra material: Grey model residuals

26 Extra material: Off-line monitoring D-statistic Q-statistic (within model variation) (residual variation)

27 Extra material: On-line monitoring of disturbed batch