Presentation is loading. Please wait.

Presentation is loading. Please wait.

Advanced process modelling with multivariate curve resolution Anna de Juan 1,(*) and Romà Tauler 2. 1. Chemometrics group. Universitat de Barcelona. Diagonal,

Similar presentations


Presentation on theme: "Advanced process modelling with multivariate curve resolution Anna de Juan 1,(*) and Romà Tauler 2. 1. Chemometrics group. Universitat de Barcelona. Diagonal,"— Presentation transcript:

1 Advanced process modelling with multivariate curve resolution Anna de Juan 1,(*) and Romà Tauler 2. 1. Chemometrics group. Universitat de Barcelona. Diagonal, 647. 08028 Barcelona. anna.dejuan@ub.eduanna.dejuan@ub.edu 2. Dept. of Environmental Chemistry. IIQAB-CSIC. Barcelona.

2 Process. Definition and underlying model. Evolving chemical system monitored by a multivariate signal. Reaction system with a known mechanism (kinetic process) Evolving system with inexistent mechanism (chromatographic elution)

3 DDADA DBDB =+ DADA DBDB D =+ s A c B s B c A A cBcB sAsA c sBsB Process. Definition and underlying model. D = C STST s B s A c B c A C STST D = CS T + E Bilinear model

4 Known mechanism Hard-modeling (HM) No mechanism Soft-modeling (SM) Process. Definition and underlying model. = D Time A B C C STST ABCABC Ordered evolving concentration pattern

5 Process soft-modeling (Multivariate Curve Resolution, MCR)

6 MCR in process analysis D Time Process raw data = A B C C STST ABCABC 012345678910 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Time Concentration Wavelength Absortivities D = CS T Process description MCR Evolution of process contributions (model) Structural information of compounds (identification)

7 Multivariate Curve Resolution – Alternating Least Squares (MCR-ALS) Determination of the number of components (PCA). Building of initial estimates (C or S T ) (EFA, SIMPLISMA, prior knowledge...) Iterative least squares calculation of C and S T subject to constraints. Check for satisfactory CS T data reproduction. Data exploration Input of external information Optimal and chemically meaningful process description D = CS T + E R. Tauler. Chemom. Intell. Lab. Sys. 30 (1995) 133. A. de Juan and R. Tauler. Anal. Chim. Acta 500 (2003) 195. J. Jaumot et al. Chemom. Intell. Lab. Sys. 76 (2005) 101.

8 Constraints Definition Any property systematically present in the profiles of the compounds in our data set. Chemical origin Mathematical properties. Application C and S can be constrained differently. The profiles within C and S T can be constrained differently. Reflect the inherent order in a process

9 Process constraints Non-negativity (C, S) Unimodality (C) Processes evolving in emergence- decay profiles Closure (C) Mass balance Selectivity!!

10 MCR in process modelling Advantages (low requirements) Bilinear data structure No process model required. No previous identification of process compounds needed. Limitations We model what we measure (non-absorbing species) Each compound should have a distinct concentration profile and spectrum (rank-deficiency).

11 MCR in process modelling Limitations We model what we measure (non-absorbing species) Each compound should have a distinct concentration profile and spectrum (rank-deficiency). Multiset process analysis Incorporation of hard-modelling information

12 Advanced process modeling Multiset analysis

13 Processes and multiset models The same process monitored with different techniques Several processes/batches monitored with the same technique Several processes monitored with several techniques

14 Multiset arrangements. Advantages. The chemometric reasons Rotational ambiguity decreases/is suppressed. Rank-deficiency problems are solved. Noise effect is minimized The chemical reasons More information introduced in the process modelling. More robustness in the process description. Better characterization of process compounds (multitechnique analysis). More global description of process evolution and of effect of inducing agents. (multiexperiment analysis).

15 Rank-deficient systems (the concept) Detectable rank < nr. of process contributions = D C STST Rank(D) = min(rank C, rank S T ) Equally shaped concentration profiles A + B  C [A] = [B] Rank 2 Equally shaped spectra D  L (enantiomers) Spectra D = Spectra L Rank 1 Rank-deficiency can be linked to C or to S T

16 Rank-deficient systems (the concept) Equally shaped concentration profiles A + B  C Rank 2 = D C STST cBcB cAcA [A] o = 1 [B] o = 3 3c A = c B (rank 2) D1D1 = D C STST [A] o = 2 [B] o = 1 cBcB cAcA c A = 2c B (rank 2) D2D2

17 Rank-deficient systems (the concept) [A] o = 1 [B] o = 3 [A] o = 2 [B] o = 1 3c A = c B = STST cBcB cAcA D1D1 D C cBcB cAcA c A = 2c B D2D2 = STST D1D1 D C cBcB cAcA c A  kc B (rank 3) D2D2

18 Breaking rank-deficiency (multiset data) = C S UV T s A = ks B sBsB sAsA S CD T s A  ks B sBsB sAsA D UV D D CD = C STST sBsB sAsA D UV D D CD s A  ks B (rank 2)

19 Multitechnique process analysis

20 Multitechnique data analysis Only the concentration direction is shared by all experiments. Completely different techniques can be treated together Higher spectral discrimination power among compounds. The augmented response contains complementary information of all techniques (‘superspectrum’). The single matrix of process profiles provides cleaner process profiles and a more robust description of the process. Process profiles are not affected by specific noise patterns of particular techniques. Process description should be valid for all measurements collected. Multiset  multi-way

21 O N Fe O N pH-induced transitions in hemoglobin Spectroscopic monitoring between pH 1.5 and 10.5 Changes in secondary structure UV (350-650 nm), far-UV CD (200-250 nm) Changes in tertiary structure UV, near-UV CD (250-350 nm), fluorescence (300-450 nm) Binding of heme group UV, Soret CD (380-430 nm) Evolution of protein conformations Global process: many events at different structural levels. No mechanism defined. Muñoz, G.; de Juan, A. Anal. Chim. Acta 2007, 595, 198.

22 pH-induced transitions in hemoglobin (single technique resolution) D1D1 pH D2D2 D3D3 D4D4 D5D5 UVFar-UV CD 200210220230240250 -15 -10 -5 0 5 10 15 20 D1D1 Fluorescence 250275300325350 -4 -2 0 2 4 6 8 10 D3D3 Near-UV CD Wavelengths (nm) Soret CD 380390400410420 430 -10 -5 0 5 10 15 20 D2D2 Wavelengths (nm) pH 1.510.5 2 ary structure 3 ary structure Heme bindingGlobal

23 pH-induced transitions in hemoglobin (single technique resolution) TechniqueChemical event Nr. of process contributions pH transition values Explained variance (%) Far-UV CDChanges 2 ary structure24.099.75 Near-UV CDChanges 3 ary structure24.593.83 FluorescenceChanges 3 ary structure3 4.2 / 8.799.96 Soret CDHeme binding2 7.899.77 UV-visibleGlobal process42.8 / 3.9 / 8.599.75 Some chemical events are simpler than the global process. Non absorbing species are not modelled. Too similar spectral contributions may not be distinguished. Multitechnique analysis is needed to complete the puzzle.

24 pH-induced transitions in hemoglobin Global process resolution (multitechnique analysis) D1D1 D2D2 D3D3 D4D4 D5D5 pH UVFar-UV CD 200210220230240250 -15 -10 -5 0 5 10 15 20 D1D1 300350400450 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 D4D4 350450550650 0 0.2 0.4 0.6 0.8 1 D5D5 Fluorescence Near-UV CD 250275300325350 -4 -2 0 2 4 6 8 10 D3D3 Wavelengths (nm) Soret CD 380390400410420 430 -10 -5 0 5 10 15 20 D2D2 Wavelengths (nm)

25 pH-induced transitions in hemoglobin Global process resolution 300350400450 0 5 10 15 20 Fluorescence Wavelengths (nm) 195205215225235245 -20 -10 0 10 20 Far-UV CD Wavelengths (nm) 350400450500550600650 0 5 10 15 20 UV Wavelengths (nm) 250270290310330350 -5 0 5 10 Near-UV CD Wavelengths (nm) 380390400410420430 -10 0 10 20 Soret CD Wavelengths (nm) pH 0 0.2 0.4 0.6 0.8 1 1.2 246810 Non-absorbing species are modelled (Soret CD). Similar spectral contributions are distinguished (near-UV CD). C S 1 T (2) * S 3 T (2) S 2 T (2) S 4 T (3) S 5 T (4) * Figures in parentheses are number of resolved species in single technique analysis. Native HbD1D1 OxyHb D2D2

26 Multiexperiment process analysis

27 Multiexperiment data analysis Only the spectral direction is shared by all experiments. No batch synchronisation is needed. Process induced by different agents and performed in different conditions can be treated together The single matrix S T provides cleaner pure spectra and a more robust structural characterisation of process compounds. Easier modelling of minor process contributions by using experiments with complementary information. Good experimental design may provide experiments with presence/absence of different species. Multiset  multi-way

28 Protein-drug interaction Dominant at low [ligand:protein] ratio and low [ligand]. Protein + TSPP [Protein-TSPP] complex TSPP aggregate Dominant at high [ligand:protein] ratio and high [ligand]. Multiexperiment analysis of experiments enhancing low and high [protein:ligand] ratios help in the definition of all species involved.

29 Protein-drug interaction D 1 : protein-ligand complex dominates. D 2 : aggregate dominates 400500600 700 0 0.1 0.2 0.3 0.4 0.5 Absobance (a.u.) 0  M 7.5 MM Protein concentration D1D1 400500600700 0 0.5 1 1.5 2 2.5 3 3.5 4 Wavelength (nm) Absorbance (a.u.) 0  M 40  M TSPP concentration D2D2

30 Protein-drug interaction 350400450500550600650700750 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 Wavelength (nm) Absorbance (a.u.) STST The aggregate could not be recovered using only D 1 TSPP and the complex are very minor to be correctly recovered only from D 2 The different presence/absence of species in D 1 and D 2 and the decorrelated information in terms of [TSPP:complex:aggregate] helps to a better definition of the pure spectra.

31 Advanced process modeling (Incorporating hard models)

32 Process modelling Hard-modeling. The variation of a process is fully described by fitting a specific mathematical model (physicochemical or empirical) to the experimental measurements. Soft-modeling. The variation of a process is described by the bilinear model of the measurements, optimised under chemical and/or mathematical constraints. No explicit mathematical model is used.

33 Process hard-modeling 0102030405060708090100 0 0.5 1 1.5 2 2.5 3 x 10 4 Wavelengths Absortivities LS (D, C) (S T ) STST Output: C, S and model parameters. Unique solutions The model must describe all the experimental variation. 0102030405060708090100 0 0.5 1 1.5 2 2.5 Wavelength Absorbance 012345678910 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Time Concentration D C Non-linear model Fitting min(D(I-CC + ) C = f(k 1, k 2 ) D = CS T ; D = CC + D

34 Process Hard modeling (multibatch/multiexperiment) Need of one global model or Knowledge of the link expression among different batch models Batch/ exp. 1 DC STST = Batch/ exp. 2 Batch/ exp. 3 Batch/ exp. n Link among batches  model

35 Soft- modeling (one experiment) 0102030405060708090100 0 0.5 1 1.5 2 2.5 3 x 10 4 Wavelengths Absortivities STST D C Constrained ALS optimisation LS (D,C)  S* LS (D,S*)  C* min (D –C*S*), Output: C and S. Solutions might be ambiguous. All absorbing contributions in and out of the process are modelled. 012345678910 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Time Concentration

36 Soft-modeling (multibatch/multiexperiment) Batch/ exp. 1 DC STST = Batch/ exp. 2 Batch/ exp. 3 Batch/ exp. n Different experiments can be analysed together Experimental conditions, link among batches may be unknown. Link among batches  pure spectra

37 Incorporating hard-modeling in MCR All or some of the concentration profiles can be constrained. All or some of the batches can be constrained.

38 Hybrid hard- and soft-modeling MCR (HS-MCR) Output: C, S and model parameters. Hard models and soft-modeling constraints act simultaneously. Off-process contributions can be modelled separately. Process model can be recovered in the presence of absorbing interferences. 0102030405060708090100 0 0.5 1 1.5 2 2.5 3 x 10 4 Wavelengths Absortivities STST 0102030405060708090100 0 0.5 1 1.5 2 2.5 Wavelength Absorbance D C 012345678910 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Time Concentration

39 HS-MCR (multibatch/multiexperiment) Batch/ exp. 1 DC STST = Batch/ exp. 2 Batch/ exp. 3 Batch/ exp. n Link among batches (pure spectra) Global or individual models can be used. Link among different models can be unknown or inexistent. Model-free and model-based experiments can be analysed together.

40 Myoglobin denaturation Mechanism Steady-state process Native (N)  Intermediate (I s )  Denatured (D) Kinetic transient (I t ) Kinetic process Steady-state process UV spectra, pH range 7.0-2.0 N  I s ?  D Unknown model Kinetic process UV spectra, pH-jump stopped-flow First-order consecutive reactions P. Culberg, P.J. Gemperline, A. de Juan. (submitted)

41 Hard-modelling (kinetic unfolding, 1 st order reactions) Soft-modelling constraints Myoglobin denaturation = Steady- state unfolding Kinetic unfolding pH time C STST. C pH CtCt D pH time Model-free and model-based experiments can be analyzed together.

42 Myoglobin denaturation Formation of a kinetic transient was detected and hard-modelled. k 1 = 4.05 s.1 k 2 = 0.62 s -1 Steady-state unfolding was modelled with soft constraints. Steady-state process Native (N)  Denatured (D) Kinetic transient (I t ) Kinetic process 1 0 pH time Wavelengths

43 BDE-209 (flame retardant) Photodegradation of decabromodiphenil ether O Br UV kinetic monitoring in several THF/ water mixtures (10% water, 20% water, 30% water, 40% water) Three replicates per solvent composition. Wavelength (nm) S. Mas, A. de Juan, S. Lacorte, R. Tauler (submitted)

44 Data arrangement Global model 1 Global model 2 One global kinetic model per solvent composition k3k3 C k2k2 B k1k1 A D Off-process contribution (spectral solvent effects)

45 Photodegradation of BDE-209 40% water C 10% water20% water30% water 1 2 3 1 2 3 1 2 1 2 3 STST Compositionk 1 (x 10 -4 ) * k 2 (x 10 -4 ) * k 3 (x 10 -4 ) * 90:10 THF-water 2.76 (1)2.60 (2)1.38 (6) 80:20 THF-water 2.448 (8)1.613 (5)1.362 (4) 70:30 THF-water 2.41 (1)0.99 (4)0.77 (4) 60:40 THF-water 1.933 (6)1.092 (3)0.68 (2) k3k3 C k2k2 B k1k1 A D Off-process contribution Rate constants

46 MCR in process modelling. Conclusions Low requirements Bilinear data structure No process model required. No previous identification of process compounds needed. High flexibility In data arrangements Multitechnique analysis Multiexperiment analysis. Multitechnique and multiexperiment analysis. In input information Soft-modeling constraints. Hard models. Adaptable to individual compounds and/or experiments.

47 Acknowledgements Glòria Muñoz (pH-dependent hemoglobin example) Susana Navea (Protein-drug interaction). Sílvia Mas (UB and IIQAB-CSIC) (BDE-209 example) Pat Culberg, East Carolina University (myoglobin example). Lionel Blanchet, UB and Université des Sciences et Technologies de Lille (photochemical example) Financial support by Spanish Government Group Web page: www.ub.es/gesq/mcr/mcr.htmwww.ub.es/gesq/mcr/mcr.htm

48 Process. Definition and underlying model. Evolving chemical system monitored by a multivariate signal. Reaction system with a known mechanism (kinetic process) Evolving system with inexistent mechanism (chromatographic elution) Process variable Measurement channel

49 Protein photochemical reaction Photochemical kinetic process Protein conformational change Light on Light off time Measurement: IR rapid-scan spectroscopy (difference spectra) (1200-1800 cm -1 ) Fe P B A H A Q A Q B H B B B Q i 2QH 2 QH 2 Q Q 40 Å Cytochrome complex Reaction center CYTOPLASM 2 H + + 4 H+ h e - Q1Q1 Q2Q2 Photosynthetic reaction center Rhodobacter Spheroides Blanchet, L.; Ruckebusch, C.; Huvenne, J. P.; de Juan, A. Chemom. Intell. Lab. Sys. 2007, 89, 26.

50 Protein photochemical reaction time Light on Light off Kinetics of ubiquinol are modelled in the presence of an interference (protein absorption). time Q2Q2 P2P2 C STST Hard-modeling (ubiquinol formation and decay contribution) Soft-modeling constraints = D

51 Protein photochemical reaction Kinetics of ubiquinol formation and decay are modelled (hard- modeling constraint). k 1 = 7 10 -4 s -1 k -1 = 10 -4 s -1 Photoinduced protein conformational change (model-free) is modelled. -2 0 1 2 Off On Time (s) Wavenumber (cm -1 ) 1200 1800 60 Amide II Amide I -Q1-Q1 +Q2+Q2

52 Rotational ambiguity and noise minimization Single set of process profiles for all techniques C,S T possible combinations with optimal fit are less (rotational ambiguity decreases) Noise is technique- and data set-dependent. C encloses common information for all techniques (noise effect is minimized)

53 Breaking rank-deficiency (multiset data) = D C S CD T s A  ks B (rank 2) sBsB sAsA D CD = D C S UV T s A = ks B (rank 1) sBsB sAsA D UV Equally shaped spectra D  L (enantiomers) Spectra D = Spectra L Rank 1

54 D = CS T D = CT inv(T)S T


Download ppt "Advanced process modelling with multivariate curve resolution Anna de Juan 1,(*) and Romà Tauler 2. 1. Chemometrics group. Universitat de Barcelona. Diagonal,"

Similar presentations


Ads by Google