Presentation is loading. Please wait.

Presentation is loading. Please wait.

Grey modeling approaches to investigate chemical processes Romà Tauler 1 and Anna de Juan 2 IIQAB-CSIC 1, UB 2 Spain

Similar presentations


Presentation on theme: "Grey modeling approaches to investigate chemical processes Romà Tauler 1 and Anna de Juan 2 IIQAB-CSIC 1, UB 2 Spain"— Presentation transcript:

1 Grey modeling approaches to investigate chemical processes Romà Tauler 1 and Anna de Juan 2 IIQAB-CSIC 1, UB 2 Spain E-mail: rtaqam@iiqab.csic.es

2 Grey modeling approaches to investigate chemical processes Introduction to chemical modeling: white (hard), black (soft) and grey modeling in chemistry Multivariate Curve Resolution as a grey modeling method Grey modeling applications using MCR- ALS

3 Modeling approaches Hard Modeling  White Modeling Models based on Physical/Chemical Laws Soft Modeling  Black Modeling Empirical Models with no knowledge/assumptions about the Physical/chemical laws of the system (usually non-linear) Models with no assumptions about the physical/chemical model but with assumptions about the measurement model (usually multivariate and linear) Soft+Hard Modeling?  Grey Modeling? Mixed Models partially using information about physical/ /chemical laws

4 Chemical model (variation of compound contribution) MIXTURE Non-existent PROCESS Known Too complex Unknown Chemical multicomponent systems. Structure Measurement model (variation of the instrumental signal) Simple additive linear model (Factor Analysis tools) DD1D1 D2D2 DnDn =++... + D =+ s 1 + c 2 s 2 c n s n c 1 D = C S s n s 1 c n c 1

5 Hard (White) Modeling Data modeling and data fitting in chemical sciences has been traditionally done by hard modeling techniques. They are based on physical/chemical models which are already known (or assumed, proposed,...) The parameters of these model are not known and they are estimated by least squares curve fitting This approach may be also called white modeling and it is valid for well known phenomena and laboratory data, where the variables of the model are under control during the experiments and only the phenomena under study affect the data.

6 Hard (White) Modeling Find the optimal parameters of the Model, 

7 Hard (White) Modeling Case 1 Kinetic Systems: Y ij = A ij measured absorbances of sample/solution i wavelength j Measurement model assumptions: Chemical Model assumptions: Defining the residuals: Finding the best model and its parameters

8

9 Hard (White) Modeling Case 2 Solution Equilibria: Y ij = A ij measured absorbances of sample/solution i wavelength j. Measurement model assumptions: Chemical Model assumptions: Defining the residuals: Finding the best model and its parameters

10 mp=0 guess parameters, k 0 calculate residuals, r(k 0 ) and the sum of squares, ssq calculate Jacobian J calculate shift vector  k, and k 0 = k 0 +  k end; display results ssq old ssq mp=0 mp / 3 mp  5 <  > yes no The Newton-Gauss- Levenberg/Marquardt (NGL/M) algorithm Hard (White) Modeling

11 In soft (black) modeling no physical model is assumed. In some cases a linear measurement model is assumed (factor analysis methods) In other cases dependencies among variables and sources of variation are considered to be non linear (neural networks, genetic algorithms, …) The goal of these methods is the explanation of data variance using the minimal or softer assumptions about data Soft (Black) Modeling

12 Example of Soft (Black) Modeling Factor Analysis/Principal Component Analysis Bilinear Model D = U V T + E Unique solutions but without physical meaning Constraints: U orthogonal, V T orthonormal V T in the direction of maximum variance N  D U VTVT E + I JJJ I I N N << I or J

13 Hard (white)- vs. Soft (black)-modelling Pros HM Well defined behaviour model (useful chemical information). Unique solutions. Reduced number of parameters to be optimised (e.g., K, k,..) Pros SM  No explicit model is required.  Information on the process or signal may be used (constraints).  May help to set or to validate a physicochemical model.

14 Cons HM The underlying model should be correct and completely known. No variations other than those related to the model should be present in the data set. Cons SM  Ambiguous solutions.  Does not provide directly physicochemical (kinetic or thermodynamic,...) information. Hard (white)- vs. Soft (black)-modelling

15 Use HM The variation of the system is completely described by a reliable physicochemical model. Clean reaction systems (kinetic or thermodynamic processes) Use SM  The model describing the variation of the data is too complex, unknown or non- existent. Images. Chromatographic data. Macromolecular processes.

16 Mixed systems with hard-modelable and soft- modelable parts are proposed –Hard-model: kinetic process, equilibrium reaction..... –Soft-model: interferent, background, drift, unknown.... Introducing a hard-model part decreases the ambiguity related to pure soft-modeling methods and gives additional information (parameters). Introducing a soft-model part, may help to clarify the nature of the physicochemical model and give more reliable results. Grey (hard+soft) modeling

17 Grey modeling approaches to investigate chemical processes Introduction to chemical modeling: white (hard), black (soft) and grey modeling in chemistry Multivariate Curve Resolution as a grey modeling method Grey modeling applications using MCR- ALS

18 Multivariate Curve Resolution (MCR) Goal Knowing the identity and contribution of each pure compound (entity) in the process or in the mixture. PROCESS The composition changes in a continuous evolutionary manner. E.g. chemical reactions, processes, HPLC-DAD. MIXTURE The composition changes with a random pattern variation. E.g. Series of independent samples. The composition changes with a non-random pattern variation. E.g. environmental data, spectroscopic images. A tool to analyse (resolve) changes in composition and response in multicomponent systems.

19 Multivariate Curve Resolution Pure component information  C STST snsn s1s1 c n c 1 Wavelengths Retention times Pure concentration profiles Chemical model Process evolution Compound contribution Pure signals Compound identity D Mixed information tRtR

20 Multivariate Curve Resolution methods D = CS T + E Investigation of chemical reactions (kinetics, equilibria, …) using multivarite measurements (spectrometric,...) Industrial processes (blending, syntheses,…). Macromolecular processes. Biochemical processes (protein folding). Spectroscopic images. Mixture Analysis (in general) Hyphenated separation techniques (HPLC-DAD, GC-MS, CE-DAD,...). Environmental data (model of pollution sources) ……………..

21 Multivariate Curve Resolution Bilinear Model: Factor Analysis Model D = C S T + E  D C STST E + I JJJ I I K N << I or J N Non-unique solutions but with physical meaning (rotational/ intensity ambiguities are present) Constraints: C and S T non-negative C or S T scaled (normalization, closure) Other constraints (unimodality, local rank, selectivity, previous knowledge... )

22 D1D1 D2D2 D3D3 STST C1C1 C2C2 C3C3 Z DC Multivariate Curve resolution Alternating Least Squares MCR-ALS Extension to multiple data matrices quantitative information row-, concentration profiles column-, spectra profiles column-wise augmented data matrix NR 1 NR 2 NR 3 NC NM = 3

23 Advantages of matrix augmetation (multiway data) Resolution local rank conditions are achieved in many situations for well designed experiments (unique solutions!) Rank deficiency problems can be more easily solved Unique decompositions are easily achieved for trilinear data (trilinear constraints) Constraints (local rank/selectivity and natural constraints) can be applied independently to each component and to each individual data matrix. J,of Chemometrics 1995, 9, 31-58 J.of Chemometrics and Intell. Lab. Systems, 1995, 30, 133

24 Multivariate Curve Resolution – Alternating Least Squares (MCR-ALS) Determination of the number of components (i.e. by SVD) Building of initial estimates (C or S T ) Iterative optimisation of C and/or S T by Alternating Least Squares (ALS) subject to constraints. Check for satisfactory CS T data reproduction. Data exploration Input of external information as CONSTRAINTS The aim is the optimal description of the experimental data using chemically meaningful pure profiles. Fit and validation

25 Optional constraints (local rank, non-negativity, unimodality,closure,…) are applied at each iteration Initial estimates of C or S are obtained from EFA or from pure variable detection methods. C and S T are obtained by solving iteratively the two LS equations: An algorithm for Bilinear Multivariate Curve Resolution Models : Alternating Least Squares (MCR-ALS)

26 Constraints Definition Any chemical or mathematical feature obeyed by the profiles of the pure compounds in our data set. C and S T can be constrained differently. The profiles within C and S can be constrained differently. Constraints transform resolution algorithms into problem-oriented data analysis tools

27 Soft constraints Non-negativity C* 01020304050 -0.1 -0.05 0 0.05 0.1 0.15 0.2 0.25 0.3 Retention times CcCc 01020304050 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 Retention times Concentration profiles spectra

28 Unimodality C* 05101520253035404550 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 Retention times CcCc 05101520253035404550 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 Retention times Reaction profiles Chromatographic peaks Voltammograms Soft constraints

29 Selectivity/local rank Concentration selectivity/local rank constraint C* 05101520253035404550 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 Retention times C c < threshold values 05101520253035404550 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 Retention times We know that this region is not rank 3, but rank 2!

30 D Select Updated STST ALS c ALS Local model b, b 0 C Concentration correlation constraint (multivariate calibration)

31 STST C = D D1D1 D2D2 D3D3 Trilinearity Constraint (flexible to every species) Extension of MCR-ALS to multilinear systems 1st score loadings PCA, SVD Folding species profile 1st score gives the common shape Loadings give the relative amounts! Trilinearity Constraint Unfolding species profile Unique Solutions ! Substitution of species profile C Selection of species profile R.Tauler, I.Marqués and E.Casassas. Journal of Chemometrics, 1998; 12, 55-75

32 Hard modeling: Mass balance or Closure constraint C* 23456789 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 pH c total 23456789 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 pH CcCc  = c total c total Mass balance Closed reaction systems Hard modeling constraints

33 Hard modeling: Mass action law and rate laws Hard modeling constraints C 23456789 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 pH C cons 23456789 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 pH Physicochemical model Kinetic processes Equilibrium processes

34 The hard model is introduced as a new and essential constraint in the soft-modelling resolution process. It is applied in a flexible manner, as the soft- modelling constraints. –To some or to all process profiles. –To some or to all matrices in a three-way data set. –Different hard models can be applied to different matrices in a three-way data set. Grey modeling using MCR-ALS soft + hard modeling constraints

35 C 23456789 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 pH C cons 23456789 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 pH physicochemical model (mass action law, rate law) Kinetic processes Equilibrium processes C SM C HM Grey modeling using MCR-ALS soft model (non-negativity ) HM SM

36 1.Select the soft-modelled profiles to be constrained (C SM ). 2.Non-linear fit of the selected profiles according to the hard model selected. 3.Update the soft-modelled profiles C SM.by the fitted C HM. min(ssq(C SM -C HM )) ssq=f(C SM, model, parameters) Grey modeling using MCR-ALS

37 Grey modeling approaches to investigate chemical processes Introduction to chemical modeling: white (hard), black (soft) and grey modeling in chemistry Multivariate Curve Resolution as a grey modeling method Grey modeling applications using MCR- ALS

38 Grey modeling approaches to investigate chemical processes Examples: 1.Getting kinetic and analytical information from mixed systems (drift and interferents) 2.Using a physicochemical model to decrease resolution ambiguity and getting analytical information 3.pH induced transitions in hemoglobin

39 0510 0 0.5 1 Time Concentration 050100 0 1 2 3 4 x 10 4 Wavelengths Absorbance A B C C B A i D a d 0510 0 0.5 1 Concentration drift D d Time Kinetic process + drift 0510 0 0.5 1 Time Concentration interf. D i Kinetic process + interferent k 1 = k 2 = 1 Model Grey modeling applications using MCR-ALS consecutive irreversible Example 1 Getting kinetic information from mixed systems (drift and interferents) Anna de Juan, Marcel Maeder, Manuel MartÍnez, Romà Tauler Analytica Chimica Acta 442 (2001) 337–350;

40 Kinetic model C HM = f(k 1, k 2 ) Kinetic process + drift/interferent A, B, C HM Drift, inter SM

41 0246810 0 0.2 0.4 0.6 0.8 1 Time Concentration (a.u.) 020406080100 0 1 2 3 x 10 4 Wavelength channel Absorptivities (a.u.) 0246810 0 0.2 0.4 0.6 0.8 1 Time Concentration (a.u.) 020406080100 0 1 2 3 4 x 10 4 a) b) Kinetic process + drift 0246810 0 0.2 0.4 0.6 0.8 1 Time Concentration 020406080100 0 1 2 3 x 10 4 Wavelength channel Absorbance 0246810 0 0.2 0.4 0.6 0.8 1 Time Concentration 020406080100 0 1 2 3 x 10 4 Wavelength channel Absorbance a) b) Kinetic process + interferent HM HSM Grey modeling applications using MCR-ALS Example 1 Getting kinetic information from mixed systems (drift and interferents) SystemAlgorithmk 1 = 1k 2 = 1 A,B,C (drift)HM1.400.83 HSM1.160.90 A,B,C (interferent)HM1.160.89 HSM0.951.05

42 Anna de Juan, Marcel Maeder, Manuel MartÍnez, Romà Tauler Chemometrics and Intelligent Laboratory Systems 54 2000 123–141

43

44

45 Example 2. Using a physicochemical model to decrease resolution ambiguity. Getting analytical information. Chemical problem: multiequilibria systems Quantitation of an analyte (H 2 A) in the presence of an interferent (H 2 B). Measurements FT-IR monitored pH titrations H 2 A (malic acid) H 2 B (tartaric acid) Grey modeling applications using MCR-ALS Highly overlapped concentration profiles

46 Example 2. Using a physicochemical model to decrease resolution ambiguity. Getting analytical information. Too correlated concentration profiles Too overlapped spectra Too ambiguous SM solutions Quantitation fails Data set Standard H 2 A Sample H 2 A/H 2 B pH Grey modeling applications using MCR-ALS

47

48

49 Time effect on pH transitions (UV) 2345678910 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 pH 345678910 0 0.2 0.4 0.6 0.8 1 pH Grey modeling applications using MCR-ALS Example 3 Time effect on pH induced transitions in hemoglobin SM 1,2 Heme group unbound 3 Native 4 Heme bound (change in coordination)

50

51

52 Time-dependent acidic conformations evolve very similarly with pH (rank- deficiency). The kinetic matrix helps in the resolution of the acidic conformations in the pH- dependent process. Hard-modelling constraint applied to the kinetic process helps to a less ambiguous recovery of the acidic conformations in the pH- dependent process. time pH D C STST = Global description of the process After 48 hours Grey modeling applications using MCR-ALS Example 3 Time effect on pH induced transitions in hemoglobin SM HM

53 All the pH-dependent conformations can be resolved, even those time- dependent. Additional kinetic information is obtained. k 1 = 1.424e -5 + 4 e -8 Complete description SM + HM 350400450500550600650 700 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 Wavelengths (nm) Grey modeling applications using MCR-ALS Example 3 Time effect on pH induced transitions in hemoglobin HM SM

54 Some References Soft+Hard (Grey) Modelling A. de Juan, M. Maeder, M. Martínez, R. Tauler. Chemom. Intell. Lab. Sys. 54 (2000) 123. A. de Juan, M. Maeder, M. Martínez, R. Tauler. Anal. Chim. Acta, 442 (2001) 337 J.Diework, A. de Juan, R.Tauler and B.Lendl. Applied Spectroscopy, 2002, 56, 40-50 J. Diewok; A. de Juan; M. Marcel; R. Tauler; B. Lendl. Analytical Chemistry, 2003, 76, 641-7

55 Acknowledgements Chemometrics Group (UB and IIQAB-CSIC) –Staff: Romà Tauler, Javier Saurina, Anna de Juan, Raimundo Gargallo –Post-doc: Montse Vives, Mónica Felipe –PhD : Susana Navea, Joaquim Jaumot, Emma Peré-Trepat, Elisabeth Teixido –Master: Silvia Termes, Silvia Mas, Gloria Muñoz, Marta Terrado, Xavier Puig. Manel Martínez, University of Barcelona (Spain) Marcel Maeder (University of Newcastle, Australia) Josef Diewok (University of Viena, Austria)


Download ppt "Grey modeling approaches to investigate chemical processes Romà Tauler 1 and Anna de Juan 2 IIQAB-CSIC 1, UB 2 Spain"

Similar presentations


Ads by Google