Yuri Kalambet, Ampersand Ltd., Moscow, Russia

Slides:



Advertisements
Similar presentations
Quantification – Concepts and Integration
Advertisements

Chapter 7 Statistical Data Treatment and Evaluation
Pattern Recognition and Machine Learning
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Mathematics and Graphing in Chemistry Lab 1. Outline Mathematics in Chemistry Units Rounding Digits of Precision (Addition and Subtraction) Significant.
The loss function, the normal equation,
Visual Recognition Tutorial
Chapter 13 Multiple Regression
x – independent variable (input)
Curve-Fitting Regression
CALIBRATION Prof.Dr.Cevdet Demir
Evaluating Hypotheses
Development of Empirical Models From Process Data
Governing equation for concentration as a function of x and t for a pulse input for an “open” reactor.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Prof.Dr.Cevdet Demir
Estimation and the Kalman Filter David Johnson. The Mean of a Discrete Distribution “I have more legs than average”
Linear and generalised linear models
Inferences About Process Quality
Tracking with Linear Dynamic Models. Introduction Tracking is the problem of generating an inference about the motion of an object given a sequence of.
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Linear and generalised linear models
Maximum likelihood (ML)
Radial Basis Function Networks
CALIBRATION METHODS.
Calibration & Curve Fitting
Principles of the Global Positioning System Lecture 11 Prof. Thomas Herring Room A;
Objectives of Multiple Regression
LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 5 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
Chapter 1: Introduction to Statistics
Handling Data and Figures of Merit Data comes in different formats time Histograms Lists But…. Can contain the same information about quality What is meant.
Qualitative Data Analysis. 2 In This Section, We Will Discuss:  How to load data files.  How to use Signal Options for data display.  How to apply.
Regression analysis Control of built engineering objects, comparing to the plan Surveying observations – position of points Linear regression Regression.
Part 1: Basic Principle of Measurements
Basic Probability (Chapter 2, W.J.Decoursey, 2003) Objectives: -Define probability and its relationship to relative frequency of an event. -Learn the basic.
Quality Control Lecture 5
Make observations to state the problem *a statement that defines the topic of the experiments and identifies the relationship between the two variables.
Chapter 21 R(x) Algorithm a) Anomaly Detection b) Matched Filter.
Curve-Fitting Regression
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Chem. 231 – 2/18 Lecture. Announcements Set 2 Homework – Due Wednesday Quiz 2 – Next Monday Set 1 Labs –should be switching instruments today (or after.
Quality Assurance How do you know your results are correct? How confident are you?
CALIBRATION METHODS. For many analytical techniques, we need to evaluate the response of the unknown sample against the responses of a set of standards.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
The Best Method of Noise Filtering Yuri Kalambet, Sergey Maltsev, Ampersand Ltd., Moscow, Russia; Yuri Kozmin, Shemyakin Institute of Bioorganic Chemistry,
Data Analysis and Presentation Chapter 5- Calibration Methods and Quality Assurance EXCEL – How To Do 1- least squares and linear calibration curve/function.
Module 1: Measurements & Error Analysis Measurement usually takes one of the following forms especially in industries: Physical dimension of an object.
Radiation Detection and Measurement, JU, 1st Semester, (Saed Dababneh). 1 Radioactive decay is a random process. Fluctuations. Characterization.
PATTERN RECOGNITION : PRINCIPAL COMPONENTS ANALYSIS Richard Brereton
BME 353 – BIOMEDICAL MEASUREMENTS AND INSTRUMENTATION MEASUREMENT PRINCIPLES.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
CHAPTER 2.3 PROBABILITY DISTRIBUTIONS. 2.3 GAUSSIAN OR NORMAL ERROR DISTRIBUTION  The Gaussian distribution is an approximation to the binomial distribution.
Machine Vision Edge Detection Techniques ENT 273 Lecture 6 Hema C.R.
The Normal Approximation for Data. History The normal curve was discovered by Abraham de Moivre around Around 1870, the Belgian mathematician Adolph.
Lecture 8 Peak Parameters and Quantitative chromatography
Chapter 4 Variability PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh Edition by Frederick J Gravetter and Larry.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Stats Methods at IC Lecture 3: Regression.
SUR-2250 Error Theory.
Chapter 4 Basic Estimation Techniques
Chapter 7. Classification and Prediction
CJT 765: Structural Equation Modeling
Statistical Methods For Engineers
Introduction to Instrumentation Engineering
The loss function, the normal equation,
Mathematical Foundations of BME Reza Shadmehr
Once again, how many points per peak?
CISE-301: Numerical Methods Topic 1: Introduction to Numerical Methods and Taylor Series Lectures 1-4: KFUPM CISE301_Topic1.
Presentation transcript:

Yuri Kalambet, Ampersand Ltd., Moscow, Russia www.ampersand.ru Implementation of chemometric techniques in chromatographic data station software Yuri Kalambet, Ampersand Ltd., Moscow, Russia www.ampersand.ru

Data processing Noise level estimation Noise filtration and/or data bunching Peak search Peak quantification (area, height, asymmetry, etc.) Table of components (qualitative analysis) Calibration graph (quantitative analysis) Peak shape analysis and deconvolution Periodic noise filtration Multichannel (spectral) data analysis

Noise High-frequency noise estimation using the whole time series data set Noise filtration could cause questions from GLP rules Bunching never causes questions from GLP, but can significantly reduce observed noise Noise is taken into account while: peak detection; approximation quality evaluation multichannel/spectral analysis

Peak Quantification Baseline Y position using weighted Savitsky-Goley approximation with evaluation of approximation quality Primary peak parameters using adaptive polynomial approximation, where parameters of polynomial depend on target peak width and asymmetry Criterion for algorithm selection: Peak parameters should be resistant to bunching operation

Noise definition 1st derivative point DOES NOT belong to “noise pool of points” if both of the neighboring derivative points have the same sign 1st derivative point DOES NOT belong to “noise pool of points” if its absolute value is outside 5 sigma of absolute values distribution (iterative procedure)

Calibration Predictive relationship between input and detector response Input: calibration samples – concentrations of components Output: peak area or height Prediction: Predict unknown input looking at response

ESTD Calibration Response (Area or Height) versus Quantity Quantity is provided without error - false Response is measured with random normally distributed error – sometimes true

Calibration Curve Axes: Q – Quantity (NOT Concentration), R – Response (Area or Height) Independent variable: Typically Q, sometimes R Calibration curve: polynomial interpolation Prediction: either solution of polynomial equation (independent Q) or value of polynomial (independent R) – we denote either of them W(R)

Quantification Absolute Concentration Quantity of injected substance Qx = W(Rx) Concentration of initial sample Cx = Qx/V = W(Rx)/(Vinj*A/D) Vinj – injection volume A – amount D – dilution

ISTD Targets Reason Axis Sample-size variations Q Effect of sample preparations Q Instrument drift R All reasons are acting always together

ISTD tricks Add component with known concentration to sample Add component with known concentration to calibration samples Usually it’s done together, but who told that you can’t do it separately?

ISTD with ESTD calibration = Relative Concentration Accounts for systematic error due to sample-size error and sample losses while preparation. Assumption: volume is unknown and is calculated using known concentration of the internal standard V = Qxistd/Cxistd = Wistd(Rxistd)/Cxistd, Qxistd is calculated using calibration curve of Internal standard from Rxistd, Cxistd – declared concentration of standard in sample Relative Concentration C = Qx/V = Cxistd×Wx(Rx)/ Wistd(Rxistd)

“Universal” ISTD Calibration If calibration is nonlinear, we must measure this nonlinearity, i.e. we MUST know ESTD calibration curve of the Internal Standard component In the case we learned this curve somehow, we can use this curve to change positions of calibration points of other components using the same trick as was used while calculating Relative Concentration: Qn = Cn×V = Cn×Wistd(Ristd)/Cistd So, for Internal Standard predefined curve is in use, all other components get curves constructed conditionally, condition being known calibration curve of the Internal Standard component

Pro et Contra Advantages: Only one type of axes Calibration curves are suitable for calculation of both Absolute and Relative concentrations Disadvantages: ESTD Calibration curve of Internal Standard is required

Qn = Cn×V = Cn×Wistd(Ristd)/Cistd Multiplication of Q axis of Internal Standard to any number will multiply Q coordinates of all corrected points of all components to the same number, hence causing “affinity” change of all calibration curves. Calibration curve will change, absolute concentration also, but not Relative Concentration C = Cxistd×Wx(Rx)/ Wistd(Rxistd) If all calibration dependencies are linear through origin Q = K×R, it is possible to select multiplication factor so, that Kistd = 1 and we will get relative response factors for all other components.

“Classic” ISTD Calibration dependence in coordinates Response Ratio vs. Concentration ratio Calibration curve: polynomial, typically straight line through origin Prediction: from Response Ratio predict Concentration Ratio Peculiarities: no calibration curve for Internal Standard component

Example of “Classic” ISTD failure Response A Response B (Standard) Concentration Area Ratio Concentration Ratio 18.1269247 9.516258 1 1.904837 32.9679954 18.12692 2 1.818731 55.0671036 32.968 4 1.67032 79.8103482 55.0671 8 1.449329 95.9237796 79.81035 16 1.201897

ESTD/”Universal” ISTD Calibration

“Classic” ISTD calibration curve

Device Drift Drift model: R = K×F(Q) Drift can be compensated completely, if exists such k, that K×F(Q)=F(k×Q) Particular case: linear through origin calibration; K = k

Example

Conclusions ISTD is split in two parts: ISTD quantification (Relative Concentration) and ISTD Calibration. Parts can be applied separately Relative Concentration is applicable for both ESTD and “Universal” ISTD calibrations Full “Universal” ISTD calibration can be used for both ESTD and ISTD calculations Simplified “Universal” ISTD calibration can be used in the case of linear through origin calibrations instead of “Classic” ISTD calibration including imitation of user interface Full “Universal” ISTD solution still works where “Classic” already fails, i.e. it allows to get into account wide concentration range of Internal Standard in the case of nonlinear calibrations

Difficulties User mind. The more users are used to “Classic” ISTD method, the more difficult it is to change their minds. Typical argument: -“Old approach works. We typically use linear through origin calibrations. Why should we change the way how we work?”.

Peak deconvolution by shape Peak shape Gaussian Exponentially modified Gaussian Corresponds to reference peak Tuning, specific to chromatography Peak width individual for every peak; the same for all peaks proportional to retention time asymmetry of the same sign (2) Optimization criterion quadratic (sum of squares of residuals) weighted quadratic (weight = response)

Gaussian peak shape analysis

Gaussian peak shape analysis

Gaussian peak shape analysis

EMG peak shape analysis Problems: product of zero to infinity for some range of parameters Solution: or “standard” Gaussian function, depending on parameter values

Fourier-domain periodic noise filtration Median filter in Fourier space

Periodic noise filtration – mains frequency

Periodic noise filtration – pump pulsations

Multichannel chromatogram questions Which channel is better? How to detect peaks? How to identify peaks? How to access substance purity?

Multichannel chromatogram answers Calculated channels Using weighted PCR: Noise-normalized detector response space for spectra comparison Measure of difference for spectra: angle “Local” spectrum quality: angle with neighbors Factor analysis with missing (overloaded) points allowed Qualitative and quantitative spectra comparison

Weighted PCA Channel weighting Point weighting

Calculated channels Summary. Types of Calculated channels: Total Average Difference Integral Derivative - up to 3-rd order Angle «Response/Time» channel Spectral ratio channel

Spectral chromatogram with angle profile.

Best purity spectrum advantage

Spectrum identification Step 1: Spectrum calculation.

Step 2: Performing recognition.

Step 3: Conflicts resolving.

Step 4: View and print report.

Calculated channels in “Chrom&Spec”. Summary. Types of Calculated channels: Total Average Difference Integral Derivative - up to 3-rd order Angle «Response/Time» channel Spectral ratio channel

Main spectral window.

3D multi-channel chromatogram view.

2D multi-channel chromatogram view.

Factor analysis example. Picture shows the “sum of s/n” channel of multi-channel chromatogram. Usual peak-detection procedure failed to resolve peaks. What about factor analysis?

Factor analysis. Step 1. Software makes PCR of the region selected and suggests number of components in the analyzed region (Rank). Overlapping only 2 peaks means that software does not look for overlapping of 1st and 3rd components, only 1st and 2nd and then 2nd and 3rd.

Factor analysis. Step 2 On the left panel we can choose any channel to view peak deconvolution results. Rotation is selected using “local purity” attribute, resulting in “Pure component” approach

Factor analysis. Step 3 The resolved peaks are displayed together with the original channel. The chromatogram with additional newly generated channels can be saved into separate file. Each generated channel represents the detector response, corresponded to particular component. When user finishes the wizard, the original peak is split in the proportions, corresponding to component’s fractions.