Download presentation
Presentation is loading. Please wait.
Published byMuriel Webb Modified over 9 years ago
1
M.Sc (Chemistry) CY10AC1: ANALYTICAL CHEMISTRY – I Unit - I www.kanchiuniv.ac.in/DrKSivakumar_chemistry.html Dr. K. SIVAKUMAR Department of Chemistry SCSVMV University chemshiva@gmail.com Chemical Data Analysis
2
2 Measurement Errors: Measurement errors: are numerous and their magnitudes are variable leads to uncertainties in reported results can be minimized and some types eliminated altogether by careful experimental design effects can be assessed by the application of statistical data analysis and chemometrics can be avoided by proper equipment maintenance and appropriate training of personnel Measurement errors: are numerous and their magnitudes are variable leads to uncertainties in reported results can be minimized and some types eliminated altogether by careful experimental design effects can be assessed by the application of statistical data analysis and chemometrics can be avoided by proper equipment maintenance and appropriate training of personnel
3
3 Absolute & Relative errors: Absolute Error: is the numerical difference between measured value and the accepted (True) value Simply subtract the measured value from the accepted answer E A – Measurement error x M – Measured value x T - True or accepted value Absolute Error: is the numerical difference between measured value and the accepted (True) value Simply subtract the measured value from the accepted answer E A – Measurement error x M – Measured value x T - True or accepted value Relative Error (also, percent error): is the ratio between absolute error and true value E R – Relative error x M – Measured value x T - True or accepted value Relative Error (also, percent error): is the ratio between absolute error and true value E R – Relative error x M – Measured value x T - True or accepted value
4
4 Absolute & Relative errors: Example Example: 200 mg aspirin standard has been analyzed a number of times. absolute errors range from -4 mg to +10 mg. absolute error may be used to express the inaccuracy in a measurement. Example: 200 mg aspirin standard has been analyzed a number of times. absolute errors range from -4 mg to +10 mg. absolute error may be used to express the inaccuracy in a measurement. the relative error ranges from -2% to +5%. relative errors are particularly useful for comparing results of differing magnitude. the relative error ranges from -2% to +5%. relative errors are particularly useful for comparing results of differing magnitude.
5
5 Determinate Errors: The basis of determinate or systematic errors arises from three sources: ● the analyst or operator; ● the equipment (apparatus and instrumentation) and the laboratory environment; ● the method or procedure. It can be rectified by careful observation and record keeping equipment maintenance training of laboratory personnel The basis of determinate or systematic errors arises from three sources: ● the analyst or operator; ● the equipment (apparatus and instrumentation) and the laboratory environment; ● the method or procedure. It can be rectified by careful observation and record keeping equipment maintenance training of laboratory personnel
6
6 Determinate Errors: Reasons… Operator errors: can arise through Carelessness insufficient training disability Equipment errors: are due to substandard volumetric glassware faulty or worn mechanical components incorrect electrical signals poor or insufficiently controlled laboratory environment Method or procedural errors: are caused by inadequate method validation Improper methods and concentrations that affect measurements. Operator errors: can arise through Carelessness insufficient training disability Equipment errors: are due to substandard volumetric glassware faulty or worn mechanical components incorrect electrical signals poor or insufficiently controlled laboratory environment Method or procedural errors: are caused by inadequate method validation Improper methods and concentrations that affect measurements. Determinate errors that lead to a HIGHER value than a true value are said to show a positive bias ; Determinate errors that lead to a lower value than a true value are said to negative bias. Determinate errors can be proportional to the size of sample taken for analysis. Determinate errors that lead to a HIGHER value than a true value are said to show a positive bias ; Determinate errors that lead to a lower value than a true value are said to negative bias. Determinate errors can be proportional to the size of sample taken for analysis.
7
7 Determinate Errors: Example Cu 2+ can be determined by titration after reaction with potassium iodide to release iodine, i.e., 2Cu 2+ + 4I - 2CuI + I 2 However, the reaction is not specific to Cu 2+, and any Fe 3+ present in the sample will react in the same way. Results for the determination of copper in an alloy containing 20%, but which also contained 0.2% of iron are shown below, for a range of sample sizes. The same absolute error of +0.2% or relative error of 1% (i.e. a positive bias) occurs regardless of sample size, due to the presence of the iron. This type of error may go undetected unless the constituents of the sample and the chemistry of the method are known. The same absolute error of +0.2% or relative error of 1% (i.e. a positive bias) occurs regardless of sample size, due to the presence of the iron. This type of error may go undetected unless the constituents of the sample and the chemistry of the method are known.
8
8 Indeterminate Errors: Reasons This type of error arises random fluctuations in measured quantities which always occur even under closely controlled conditions. It has an EQUAL chance of being POSITIVE or NEGATIVE It can be REDUCED by careful experimental design and control but never ELIMINATED. Environmental factors such as temperature, pressure, & humidity Electrical properties such as current, voltage, & resistance are some of the factors producing random variations described as NOISE. These contribute to the overall indeterminate error in any physical or physico-chemical measurement, but no one specific source can be identified. This type of error arises random fluctuations in measured quantities which always occur even under closely controlled conditions. It has an EQUAL chance of being POSITIVE or NEGATIVE It can be REDUCED by careful experimental design and control but never ELIMINATED. Environmental factors such as temperature, pressure, & humidity Electrical properties such as current, voltage, & resistance are some of the factors producing random variations described as NOISE. These contribute to the overall indeterminate error in any physical or physico-chemical measurement, but no one specific source can be identified.
9
9 Indeterminate Errors:… A series of measurements made under the same prescribed conditions and represented graphically is known as a frequency distribution. The frequency of occurrence of each experimental value is plotted as a function of the magnitude of the error or deviation from the average or mean value. For analytical data, the values are often distributed symmetrically about the mean value, the most common being the normal error or Gaussian distribution curve. A series of measurements made under the same prescribed conditions and represented graphically is known as a frequency distribution. The frequency of occurrence of each experimental value is plotted as a function of the magnitude of the error or deviation from the average or mean value. For analytical data, the values are often distributed symmetrically about the mean value, the most common being the normal error or Gaussian distribution curve.
10
10 Indeterminate Errors:… The curve shows that small errors are more probable than large ones, positive and negative errors are equally probable, and the maximum of the curve corresponds to the mean value The curve shows that small errors are more probable than large ones, positive and negative errors are equally probable, and the maximum of the curve corresponds to the mean value
11
11 Accumulated Errors: (Also called as propagation errors) Errors are will be aggregated in the final calculated result. The accumulation is treated similarly for both determinate (systematic) and indeterminate (random) errors. Determinate (systematic) errors can be either positive or negative Cancellation of errors is likely in computing an overall determinate error In some instances this may be zero. Errors are will be aggregated in the final calculated result. The accumulation is treated similarly for both determinate (systematic) and indeterminate (random) errors. Determinate (systematic) errors can be either positive or negative Cancellation of errors is likely in computing an overall determinate error In some instances this may be zero. The overall error is calculated using one of two alternative expressions, that is E 1 & E 2 etc., being the ABSOLUTE determinate errors in the individual measurements Overall ABSOLUTE determinate error E T E 1R & E 2R etc., being the RELATIVE determinate errors in the individual measurements Overall RELATIVE determinate error, E TR
12
12 Accuracy & Precision Degree of agreement between measured value and accepted true value. ACCURACYPRECISION Degree of agreement between replicate measurements of same quantity. i.e., Being near to true value i.e., Repeatability of a result Useful for measuring deviation from mean Where, d i - deviation x i - any individual value in the population - Experimental mean Where, d i - deviation x i - any individual value in the population - Experimental mean Measures of accuracy is, Absolute Error: E A – Measurement error x M – Measured value x T - True or accepted value Absolute Error: E A – Measurement error x M – Measured value x T - True or accepted value Relative Error: E R – Relative error x M – Measured value x T - True or accepted value Relative Error: E R – Relative error x M – Measured value x T - True or accepted value & Relative error is more useful in practice
13
13 Accuracy & Precision Degree of agreement between measured value and accepted true value. ACCURACYPRECISION Degree of agreement between replicate measurements of same quantity. i.e., Being near to true value i.e., Repeatability of a result Measurements can be Accurate & Precise Precise but Inaccurate neither accurate nor precise
14
Accuracy & Precision: Example o Four analysts have each performed a set of five titrations for which the correct titer is known to be 20.00 cm3. The titers have been plotted on a linear scale, and inspection reveals the following: 14
15
Standard deviation: Measure of Precision - Formula 15 Standard Deviation: - A measure of the width of the distribution - Small standard deviation gives narrow distribution curve Where, s - Standard deviation x i - any individual value in the population - Experimental mean N – total No. of values N-1 – No. of degrees of freedom is defined as the number of independent deviations (x i −x) used to calculate s - Sum for i = 1 to i = N Where, s - Standard deviation x i - any individual value in the population - Experimental mean N – total No. of values N-1 – No. of degrees of freedom is defined as the number of independent deviations (x i −x) used to calculate s - Sum for i = 1 to i = N
16
Standard deviation: Measure of Precision – Gaussian Curve Standard Deviation: The standard deviation of a set of values is a statistic based on the normal error (Gaussian) curve and used as a measure of precision. 16 THE GAUSSIAN DISTRIBUTION μ-σ-σσ-2σ-3σ2σ2σ3σ3σ Deviation from Mean Frequency of occurrence of each deviation
17
Standard deviation: Measure of Precision: Gaussian curve 17 Figure shows two curves for the frequency distribution of two theoretical sets of data, each having an infinite number of values and known as a statistical population.
18
18 Johann Carl Friedrich Gauss (30 April 1777 – 23 February 1855) Statue of Gauss at his birthplace; Germany Stamps Currency
19
Standard deviation: Measure of Precision: Calculation 19 Calculation of an estimated standard deviation, s, involves the following steps: ● calculation of an experimental mean; ● calculation of the deviations of individual x i values from the mean; ● squaring the deviations and summing them; ● dividing by the number of degrees of freedom, N - 1, and ● taking the square root of the result. Calculation of an estimated standard deviation, s, involves the following steps: ● calculation of an experimental mean; ● calculation of the deviations of individual x i values from the mean; ● squaring the deviations and summing them; ● dividing by the number of degrees of freedom, N - 1, and ● taking the square root of the result.
20
Relative Standard deviation (RSD) or Coefficient of Variation (CV): 20 Measure of relative precision and is normally expressed as a percentage of the mean value
21
Pooled Standard deviation: 21 Where replicate samples are analyzed on a number of occasions under the same conditions, an improved estimate of the standard deviation can be obtained by pooling the data from the individual sets. Where, N 1, N 2, N 3 ………N k are the numbers of results in each of the k sets are the means for each of the k sets.
22
Variance: 22 Variance - Is the square of the standard deviation Variance = s 2
23
Overall Precision: 23 Random errors accumulated within an analytical procedure contribute to the overall precision. Where the calculated result is derived by the addition or subtraction of the individual values, the overall precision can be found by summing the variances of all the measurements so as to provide an estimate of the overall standard deviation, i.e. Random errors accumulated within an analytical procedure contribute to the overall precision. Where the calculated result is derived by the addition or subtraction of the individual values, the overall precision can be found by summing the variances of all the measurements so as to provide an estimate of the overall standard deviation, i.e.
24
Median, Range & Probability: 24 Median: The middle number in a series of measurements arranged in increasing order. The average of the two middle numbers if the number of measurements is even Range: The difference between the highest and the lowest values Range: The difference between the highest and the lowest values Range µ ± 1σ µ ± 2σ µ ± 3σ Gaussian Distribution (%) 68.3 95.5 99.7 Probability Range of measurements for ideal Gaussian distribution The percentage of measurements lying within the given range Probability Range of measurements for ideal Gaussian distribution The percentage of measurements lying within the given range
25
Confidence Interval (CI) or Confidence Coefficient: 25 The true mean of a set of experimental results is generally unknown except where a certified reference material is being checked or analyzed for calibration purposes. In all other cases, an estimate of the accuracy of the experimental mean must be made. This can be done by defining a range of values on either side of within which the true mean, , is expected to lie with a defined level of probability. This range, which ideally should be as narrow as possible, is based on the standard deviation and is known as the confidence interval, CI, and the upper and lower limits of the range as confidence limits, CL t is a statistical factor that depends upon the number of degrees of freedom.
26
Confidence Interval (CI) or Confidence Coefficient: 26 The chloride content of water samples has been determined a very large number of times using a particular method, and the standard deviation found to be 7 ppm. Further analysis of a particular sample gave experimental values of 350 ppm for a single determination, for the mean of two replicates and for the mean of four replicates. At, 95% probability level, t=1.96 and the confidence limits are, To demonstrate good accuracy, the confidence interval, CI, should be as small as possible and increasing the number of replicates will clearly achieve this.
27
Significance of testing 27 These are statistical tests used to compare individual values or sets of values for significant differences. Outliers: A measurement or result that appears to differ significantly from others in the same set of replicates is described as an outlier. Q-test: The Q-test is used to determine whether to reject or retain a suspected outlier. F-test: The F-test enables the precisions of two sets of data to be compared using their variances. t-test: The t-test is used to compare two experimental means, an experimental mean with a known value or sets of pairs of experimental values. Analysis of variance : F-tests can be applied to several sets of data to assess and compare different sources of variability.
28
Significance of testing: Q-test 28 It involves the calculation of a ratio, Q exptl, defined as the absolute difference between a suspect value and the value closest to it divided by the spread of all the values in the set: Q exptl is then compared with a tabulated value, Qt ab, at a selected level of probability, usually 90% or 95%, for a set of n values. If Q exptl is less than Q tab, then the null hypothesis that there is no significant difference between the suspect value and the other values in the set is accepted, and the suspect value is retained for further data processing. However, if Q exptl is greater than Qt ab, then the suspect value is regarded as an outlier and is rejected. A rejected value should NOT be used in the remaining calculations.
29
Significance of testing: Q-test – Table values 29
30
Significance of testing: Q-test 30
31
Significance of testing: Q-test 31
32
Significance of testing: F-test 32 This test is used to compare the precisions of two sets of data which may originate from two analysts in the same laboratory, two different methods of analysis for the same analyte or results from two different laboratories. A statistic, F, is defined as the ratio of the population variances, s 1 2 /s 2 2 of the two sets of data where the larger variance is always placed in the numerator so that F ≥1 If the null hypothesis is true, the variances are equal and the value of F will be one or very close to it. As for the Q-test, an experimental value, F exptl, is calculated and compared with a tabulated value, F tab, at a defined probability level, usually 90% or 95%, and for the number of degrees of freedom, N- 1, for each set of data. If F exptl is less than F tab, then the null hypothesis that there is no significant difference between the two variances and hence between the precision of the two sets of data, is accepted. However, if F exptl is greater than F tab, there is a significant difference between the two variances and hence between the precisions of the two sets of data.
33
Significance of testing: F-test – Table values 33
34
Significance of testing: F-test 34
35
Significance of testing: t-test 35 This test is used to compare the experimental means of two sets of data or to compare the experimental mean of one set of data with a known or reference value. A statistic, t, is defined, depending on the circumstances, by one of three alternative equations. S pooled is the pooled estimated standard deviation for sets A & B N and M are the numbers of values in sets A and B respectively If t exptl is less than t tab, then the null hypothesis that there is no significant difference between the two experimental means or between the experimental mean and a known value is accepted, i.e. there is no evidence of a bias. However, if t exptl is greater than t tab, there is a significant difference indicating a bias.
36
Significance of testing: t-test – Table values 36
37
Significance of testing: t-test – Example 37
38
Significance of testing: paired t-test 38 paired t-test Paired t-test is useful if ● the amount of sample is so restricted as to allow only one determination by each of the two methods; ● the methods are to be compared for a series of samples containing different levels of analyte rather than replicating the analysis at one level only; ● samples are to be analyzed over a long period of time when the same experimental conditions cannot be guaranteed. is mean difference between paired values and S d the estimated standard deviation of the differences.
39
Significance of testing: paired t-test – Example 39
40
Significance of testing: t-test 40 For t-test, Using the appropriate equation, an experimental value, t exptl, is calculated and compared with a tabulated value, t tab, at a defined probability level, usually between 90% and 99%, and for N- 1 degrees of freedom or (N + M - 2) degrees of freedom (with respect to the appropriate t-test formula).
41
Significance of testing: t-test – Example 41
42
Analysis of Variance - ANOVA 42 Analysis of variance, also known as ANOVA, is a statistical technique for investigating different sources of variability associated with a series of results. It enables the effect of each source to be assessed separately and compared with the other(s) using F-tests. Indeterminate or random errors affect all measurements, but additional sources of variability may also arise. The additional sources can be divided into two types, additional random effects, described as random-effect factors specific effects from determinate sources, described as controlled or fixed effect factors. Where one additional effect may be present, a one-way ANOVA is used, whilst for two additional effects, two-way ANOVA is appropriate. Both involve much lengthier calculations * than the simpler tests of significance, but facilities for these are available with computer packages such as Microsoft Excel & Minitab. Typical examples of the use of ANOVA are: ● analysis of a heterogeneous material where variation in composition is an additional random factor; ● analysis of samples by several laboratories, methods or analysts where the laboratories, methods or analysts are additional fixed-effect factors; ● analysis of a material stored under different conditions to investigate stability where the storage conditions provide an additional fixed-effect factor. Example Calculation: Vogel’s Quantitative Chemical Analysis, (6 th Edition) pp.168-170. Pearson Education
43
Calibration and Linear Regression 43 Calibration: Calibration is the process of establishing a relation between a detection or measurement system and known amounts or concentrations of an analyte under specified conditions. Correlation coefficient: The coefficient is used to assess the degree of linearity between two variables, e.g. an instrument response and an analyte mass or concentration. Linear regression: Calculations to define the best straight line through a series of calibration points represented graphically are described as linear regression. Limit of detection: The smallest mass or concentration of an analyte that can be measured quantitatively at a defined level of probability defines a limit of detection. Standard addition: This is a calibration procedure that avoids matrix interference by measuring instrument response for an analyte in both the sample and a sample to which known amounts of an analyte standard have been added. Internal standardization: This is a calibration procedure where the ratio of the instrument response for an analyte to that of an added standard is measured for a series of analyte standards and samples.
44
Calibration 44 Before using a particular analytical procedure to analyze samples, it is first necessary to establish the detector responses to known amounts of the analyte (calibration standards) over a selected mass or concentration range, a process known as calibration. A calibration curve is constructed by measuring the instrumental signal for each standard and plotting this response against concentration. Calibration: Calibration is the process of establishing a relation between a detection or measurement system and known amounts or concentrations of an analyte under specified conditions.
45
Correlation coefficient (r) 45 Correlation coefficient: The coefficient is used to assess the degree of linearity between two variables, e.g. an instrument response and an analyte mass or concentration. The correlation coefficient, r, indicates the degree of linearity between x and y and is given by the expression where x 1 y 1 ; x 2 y 2 ; x 3 y 3 ;….x n,y n are the co-ordinates of the plotted points are the means of the x and y values respectively indicates sums of terms The value of r must lie between +1 and -1. The nearer it is to, the greater the probability that linear relationship exists between the variables x and y. value close to +1 indicates POSITIVE correlation value close to -1 indicates NEGATIVE correlation A value of zero indicates no linear correlation between x and y
46
Correlation coefficient (r)… 46 The value of r must lie between +1 and -1. The nearer it is to, the greater the probability that linear relationship exists between the variables x and y. value close to +1 indicates POSITIVE correlation value close to -1 indicates NEGATIVE correlation A value of zero indicates no linear correlation between x and y
47
Linear regression 47 Linear regression: Calculations to define the best straight line through a series of calibration points represented graphically are described as linear regression. When inspection of the calibration data and the value of the correlation coefficient show that there is a linear relation between the detector response and the mass or concentration of the analyte, it is necessary to draw a line of best fit through the plotted points before they can be used as a working curve. Linear regression enables a line of best fit through the points to be defined by calculating values for the slope and y-axis intercept for the straight line equation
48
Linear regression 48
49
Linear regression 49
50
Limit of Detection 50 Limit of detection: The smallest mass or concentration of an analyte that can be measured quantitatively at a defined level of probability defines a limit of detection. defined as the smallest amount of an analyte giving a detector response significantly different from a blank or background response (i.e. the response from standards containing the same reagents and having the same overall composition (matrix) as the samples, where this is known, but containing no analyte). Detection limits are usually based on estimates of the standard deviation of replicate measurements of prepared blanks.
51
Standard addition Standard addition: This is a calibration procedure that avoids matrix interference by measuring instrument response for an analyte in both the sample and a sample to which known amounts of an analyte standard have been added. standard addition is used in instrumental analysis to determine concentration of a substance (analyte) in an unknown sample by comparison to a set of samples of known concentration, similar to using a calibration curve. Standard addition can be applied to most analytical techniques and is used instead of a calibration curve to solve the matrix effect problem.
52
Standard addition
53
Internal Standardization A calibration procedure whereby a constant amount of a selected substance, the internal standard is added to all samples and analyte standards alike compensates for variations in sample size and other parameters. The ratio of the detector response for the analyte in each standard to the corresponding response for the added internal standard is plotted on the y-axis of a calibration graph against the mass or concentration of the analyte on the x-axis. The correlation coefficient, slope and intercept values can be computed as shown previously, and response ratios for the analyte and added internal standard in the samples can then be used to determine the amount of analyte in the samples by interpolation on the graph. If only one or two analyte standards are prepared, the amount of analyte in a sample can be calculated by simple proportion,
54
54
55
Good Luck! Dr. K. SIVAKUMAR Department of Chemistry SCSVMV University chemshiva@gmail.com +91 98423 61378 The End
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.