Initial estimates for MCR-ALS method: EFA and SIMPLISMA

Slides:



Advertisements
Similar presentations
By: Bahram Hemmateenejad. Complexity in Chemical Systems Unknown Components Unknown Numbers Unknown Amounts.
Advertisements

Visualizing the Microscopic Structure of Bilinear Data: Two components chemical systems.
Evolving Factor Analysis The evolution of a chemical system is gradually known by recording a new response vector at each stage of the process under study.
pH Emission Spectrum Emission(3 λ) λ1 λ2 λ3 A λ λ1λ2λ3λ1λ2λ3 A Ex 1 Emission(3 λ) λ1λ2λ3λ1λ2λ3 A Ex 2 Emission(3 λ) λ1λ2λ3λ1λ2λ3 A Ex 3 λ1λ2λ3λ1λ2λ3.
The General Linear Model Or, What the Hell’s Going on During Estimation?
Ch. 14: Chemical Equilibrium Dr. Namphol Sinkaset Chem 201: General Chemistry II.
No Data Left Behind Modeling Colorful Compounds in Chemical Equilibria Mike DeVries D. Kwabena Bediako Prof. Douglas A. Vander Griend.
Spectrophotometry Chapter 17, Harris
Advanced process modelling with multivariate curve resolution Anna de Juan 1,(*) and Romà Tauler Chemometrics group. Universitat de Barcelona. Diagonal,
Lecture 17 Factor Analysis. Syllabus Lecture 01Describing Inverse Problems Lecture 02Probability and Measurement Error, Part 1 Lecture 03Probability and.
A B C k1k1 k2k2 Consecutive Reaction d[A] dt = -k 1 [A] d[B] dt = k 1 [A] - k 2 [B] d[C] dt = k 2 [B] [A] = [A] 0 exp (-k 1 t) [B] = [A] 0 (k 1 /(k 2 -k.
Multivariate Resolution in Chemistry
HPLC-DAD. HPLC-DAD data t w t 2 Cw 2 S = Suppose in a chromatogram obtained with a HPLC-DAD there is a peak which an impurity is co-eluted with analyte.
Experiment 6 Amount of Dye in a Sports Drink. Goal To make a Beer’s Law standard curve To use the standard curve (and spectrophotometry) to determine.
The rank of a product of two matrices X and Y is equal to the smallest of the rank of X and Y: Rank (X Y) =min (rank (X), rank (Y)) A = C S.
4 Th Iranian chemometrics Workshop (ICW) Zanjan-2004.
The Estimation of Protein Structure in Solution Using Circular Dichroism Norma J. Greenfield
CALIBRATION Prof.Dr.Cevdet Demir
Net Analyte Signal Based Multivariate Calibration Methods By: Bahram Hemmateenejad Medicinal & Natural Products Chemistry Research Center, Shiraz University.
Including trilinear and restricted Tucker3 models as a constraint in Multivariate Curve Resolution Alternating Least Squares Romà Tauler Department of.
Lecture 3 Calibration and Standards. The points (1,2) and (6,5) do not fall exactly on the solid line, but they are too close to the line to show their.
1 MCR-ALS analysis using initial estimate of concentration profile by EFA.
Lecture 11 Vector Spaces and Singular Value Decomposition.
Study of the interaction between the porphyrin TmPyP4 and the Thrombin Binding Aptamer (TBA) G-quadruplex Miquel del Toro 1, Ramon Eritja 2, Raimundo Gargallo.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Prof.Dr.Cevdet Demir
1 2. The PARAFAC model Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP.
Multivariate Resolution in Chemistry
UV/Visible Spectroscopy
TITRATION This involves removing small samples from the reaction mixture at different times and then titrating the sample to determine the concentration.
To determine the rate constants for the second order consecutive reactions, a number of chemometrics and hard kinetic based methods are described. The.
Spectroscopy Chromatography PhysChem Naming Drawing and Databasing Enterprise Solutions Software for Interactive Curve Resolution using SIMPLISMA Andrey.
Interactive Series Baseline Correction Algorithm
1 Spectroscopic ANALYSIS Part 5 – Spectroscopic Analysis using UV-Visible Absorption Chulalongkorn University, Bangkok, Thailand January 2012 Dr Ron Beckett.
Chemistry Chemistry – the study of the composition of substances and the changes that substances undergo.
Determining the Concentration of a Solution: Beer’s Law
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Bashkir State Univerity The Chair of Mathematical Modeling , Ufa, Zaki Validi str. 32 Phone: ,
Synchronous map Asynchronous map Order of spectral changes: 1)1654 cm-1 (  -helix) 2) 1641, 1594 cm -1 (disordered structures, COO - ) 3) 1675, 1616 cm-1.
Measurements in Fluid Mechanics 058:180:001 (ME:5180:0001) Time & Location: 2:30P - 3:20P MWF 218 MLH Office Hours: 4:00P – 5:00P MWF 223B-5 HL Instructor:
  Examining how much light is absorbed by a compound’s sample at various wavelengths  Spectrum peaks—  Indicates the wavelengths associated with electrons’
1.Each element has a different symbol 2.The formula for a compound shows the elements in the compound 3.It also shows the ratio of the atoms of different.
Grey modeling approaches to investigate chemical processes Romà Tauler 1 and Anna de Juan 2 IIQAB-CSIC 1, UB 2 Spain
Equilibrium systems Chromatography systems Number of PCs original Mean centered Number of PCs original Mean centered
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Richard Brereton
1 4. Model constraints Quimiometria Teórica e Aplicada Instituto de Química - UNICAMP.
Rotational Ambiguity in Soft- Modeling Methods. D = USV = u 1 s 11 v 1 + … + u r s rr v r Singular Value Decomposition Row vectors: d 1,: d 2,: d p,:
Principal Component Analysis (PCA)
? Investigate the effects of extent of spectral overlapping on the results of RAFA.
Self-Modeling Curve Resolution and Constraints Hamid Abdollahi Department of Chemistry, Institute for Advanced Studies in Basic Sciences (IASBS), Zanjan,
Downloaded from کروماتوگرافی CHROMATOGRAPHY Downloaded from
Rank Annihilation Factor Analysis for Kinetics Spectrophotometric Mixture Analysis.
Chemical Data Fitting as a Powerful Chemometric Method.
An Introduction to Model-Free Chemical Analysis Hamid Abdollahi IASBS, Zanjan Lecture 3.
Absorption Spectrums 9/18/15.
UV/Visible Spectroscopy
Application of data fitting in chemical equilibria studies
Curve fitting methods for complex and rank deficient chemical data
Journal of Food and Drug Analysis
Rotational Ambiguity in Hard-Soft Modeling Method
Principal Nested Spheres Analysis
Law Conservation of Mass and Balanced and Unbalanced Equations
Volume 113, Issue 12, Pages (December 2017)
Addressing THE Problem of NIR
5.4 General Linear Least-Squares
X.6 Non-Negative Matrix Factorization
6.1 Introduction to Chi-Square Space
Foldability of a Natural De Novo Evolved Protein
Application of Singular Value Decomposition to the Analysis of Time-Resolved Macromolecular X-Ray Data  Marius Schmidt, Sudarshan Rajagopal, Zhong Ren,
Chapter 2 Properties of Matter
Foldability of a Natural De Novo Evolved Protein
Presentation transcript:

Initial estimates for MCR-ALS method: EFA and SIMPLISMA 7th Iranian Workshop on Chemometrics 3-5 February 2008                                         Initial estimates for MCR-ALS method: EFA and SIMPLISMA Bahram Hemmateenejad Chemistry Department, Shiraz University, Shiraz, Iran E-mail: hemmatb@sums.ac.ir

Fitting data to model (Hard model) Fitting model to data (soft model) Chemical modeling Fitting data to model (Hard model) Fitting model to data (soft model)

Multicomponent Curve Resolution Goal: Given an I x J matrix, D, of N species, determine N and the pure spectra of each specie. Model: DIxJ = CIxN SNxJ Common assumptions: Non-negative spectra and concentrations Unimodal concentrations Kinetic profiles 20 40 60 10 30 0.5 1 sensors samples S N x J C I = D

Basic Principles of MCR methods PCA: D=TP Beer-Lambert: D=CS In MCR we want to reach from PCA to Beer-Lambert D= TP = TRR-1P, R: rotation matrix D = (TR)(R-1P) C=TR, S=R-1P The critical step is calculation of R

Multivariate Curve Resolution-Alternative Least Squares (MCR-ALS) Developed by R. Tauler and A. de Juan Fully soft modeling method Chemical and physical constraints Data augmentation Combined hard model Tauler R, Kowalski B, Fleming S, ANALYTICAL CHEMISTRY 65 (15): 2040-2047, 1993. de Juan A, Tauler R, CRITICAL REVIEWS IN ANALYTICAL CHEMISTRY 36 (3-4): 163-176 2006

MCR-ALS Theory Widely Applied to spectroscopic methods UV/Vis. Absorbance spectra UV-Vis. Luminescence spectra Vibration Spectra NMR spectra Circular Dichroism … Electrochemical data are also analyzed

MCR-ALS Theory In the case of spectroscopic data Beer-Lambert Law for a mixture D(mn) absorbance data of k absorbing species D = CS C(mk) concentration profile S(kn) pure spectra

MCR-ALS Theory Initial estimate of C or S Evolving Factor Analysis (EFA) C Simple-to-use Interactive Self-Modeling Mixture Analysis (SIMPLISMA) S

MCR-ALS Theory Consider we have initial estimate of C (Cint) Determination of the chemical rank Least square solution for S: S=Cint+ D Least square solution for C: C=DS+ Reproducing of Dc: Dc=CS Calculating lack of fit error (LOF) Go to step 2

Constraints in MCR-ALS Non-negativity (non-zero concentrations and absorbencies) Unimodality (unimodal concentration profiles). Its rarely applied to pure spectra Closure (the law of mass conservation or mass balance equation for a closed system) Selectivity in concentration profiles (if some selective zooms are available) Selectivity in pure spectra (if the pure spectra of a chemical species, i.e. reactant or product, are known)

Constraints in MCR-ALS Peak shape constraint Hard model constraint (combined hard model MCR-ALS)

Rotational Ambiguity Rank Deficiency

Evolving Factor Analysis (EFA) Gives a raw estimate of concentration profiles Repeated Factor analysis on evolving submatrices Gampp H, Maeder M, Meyer CI, Zuberbuhler AD, CHIMIA 39 (10): 315-317 1985 Maeder M, Zuberbuhler AD, ANALYTICA CHIMICA ACTA 181: 287-291, 1986 Gampp H, Maeder M, Meyer CJ, Zuberbuhler AD, TALANTA 33 (12): 943-951, 1986

Basic EFA Example Calculate Forward Singular Values 1 1 ___ 1st Singular Value 0.9 ----- 2nd Singular Value SVD 0.8 ...… 3rd Singular Value S 0.7 i R i 0.6 0.5 0.4 0.3 0.2 0.1 I 5 10 15 20 25 I samples

Basic EFA Example Calculate Backward Singular Values 1 1 0.9 ___ 1st Singular Value ----- 2nd Singular Value 0.8 ...… 3rd Singular Value 0.7 R 0.6 0.5 i 0.4 SVD 0.3 S 0.2 i 0.1 I 5 10 15 20 25 I samples

Basic EFA Use ‘forward’ and ‘backward’ singular values to estimate initial concentration profiles Area under both nth forward and (K-n+1)th backward singular values is estimate for initial concentration of nth component. 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 5 10 15 20 25 I samples

Basic EFA First estimated spectra Area under 1st forward 5 10 15 20 25 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 I samples First estimated spectra Area under 1st forward and 3rd backward singular value plot. (Blue) Compare to true component (Black)

Basic EFA First estimated spectra Area under 2nd forward 5 10 15 20 25 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 I samples First estimated spectra Area under 2nd forward and 2nd backward singular value plot. (Red) Compare to true component (Black)

Basic EFA First estimated spectra Area under 3rd forward 5 10 15 20 25 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 I samples First estimated spectra Area under 3rd forward and 1st backward singular value plot. (Green) Compare to true component (Black)

Example data Spectrophotometric monitoring of the kinetic of a consecutive first order reaction of the form of A B C k1 k2

Pseudo first-order reaction with respect to A A + R B C [R]1 k1=0.20 k2=0.02 [R]2 k1=0.30 k2=0.08 [R]3 k1=0.45 k2=0.32 k1 k2

K1=0.2 K2=0.02 K1=0.3 K2=0.08 K1=0.45 K2=0.32

K1=0.30 K2=0.08 K1=0.20 K2=0.02 K1=0.45 K2=0.32

Noisy data

EFA Analysis The m.file is downloadable from the MCR-ALS home page: http://www.ub.edu/mcr/welcome.html

Simple-to-use Interactive Self-Modeling Mixture Analysis (SIMPLISMA) W. Windigm J. Guilment, Anal. Chem. 1991, 63, 1425-1432. F.C. Sanchez, D.L. Massart, Anal. Chim. Acta 1994, 298, 331-339.

SIMPLISMA is based on the selection of what are called pure variables or pure objects. Data matrix Variable (i.e. wavelength) Object (i.e. time or sample) A pure variable is a wavelength at which only one of the compounds in the system is absorbing. A pure object is an analysis time at which only one compound is eluting.

Absorbance spectra Chromatographic profile Pure object Pure variable

1 2

35 20

Standard deviation vector Mean vector Standard deviation vector  t = 0 µ0 µm . 0 m . t = m

Standard deviation vector t = m  . µ0 µm Mean vector . Standard deviation vector 0 m

chromatogram Pure spectra

Pure spectra Mean Standard deviation

chromatogram Mean Standard deviation

λ1 λ2 µi vi xi

SIMPLISMA steps 1) The ratio between the standard deviation, σi, and the mean, μi, of each spectrum is determined

To avoid attributing a high purity value to spectra with low mean absorbances, i.e., to noise spectra, an offset is included in the denominator 0<offset<3

2) Normalisation of the data matrix: Each spectrum xi is normalised by dividing each element of a row xij by the length of the row ||xi||: When an offset is added, the same offset is also included in the normalisation of the spectra.

3) Determination of the weight of each spectrum, wi 3) Determination of the weight of each spectrum, wi. The weight is defined as the determinant of the dispersion matrix of Yi, which contains the normalised spectra that have already been selected and each individual normalised spectrum zi of the complete data matrix. Yi = [Zi H] Initially, when no spectrum has been selected, each Yi contains only one column, zi (H=1), and the weight of each spectrum is equal to the square of the length of the normalized spectrum

When the first spectrum has been selected, p1, each matrix Yi consists of two columns: p1 and each individual spectrum zi, and the weight is equal to Yi = [Zi p1] When two spectra have been selected, pl and p2, each Yi consists of those two selected spectra and each individual zi, and so on. Yi = [Zi [p1 p2]]

i=1 H=I i=2 H=p1 i=3 H=[p1 p2] i=4 H=[p1 p2 p3] …

Offset=0

Offset=1

* *

* *

* *

Example data HPLC-DAD data of a binary mixture

chromatogram

Pure spectra