Exploring Chemical Analysis Fourth Edition

Exploring Chemical Analysis Fourth Edition
4 Statistics 歐亞書局

4-1 The Gaussian Distribution
The example given in Figure 4-1 introduces the Gaussian distribution.

As shown in Figure 4-1, neurotransmitters bind to membrane proteins of the muscle cell and open up channels that permit cations to diffuse into the cell. In the absence of neurotransmitter, the ion channel is closed and cations cannot enter the muscle cell. (b) In the presence of neurotransmitter, the channel opens, cations enter the cell, and muscle action is initiated.

Figure 4-2 Observed cation current passing through individual channels of a frog muscle cell.
The smooth, bellshaped curve superimposed on the data is Gaussian distribution. Of the 922 ion-channel responses recorded, 190 are in narrow range from 2.64 to 2.68 pA. P.84

Mean and Standard Deviation
A Gaussian distribution is characterized by mean standard deviation.

The arithmetic mean, x, also called the average, is the sum of the measured values divided by the number of measurements. mean (average)

 The smaller the standard deviation, the narrower the distribution.
The standard deviation, s, is a measure of the width of the distribution.  The smaller the standard deviation, the narrower the distribution. degrees of freedom P.84

degrees of freedom Initially we have n independent data points, which represent n pieces of information.  After computing the average, there are only n-1 independent pieces of information.

Figure 4-3 If the standard deviation were doubled
 the Gaussian curve for the same number of observations would be shorter and broader. * The total number of observations is fixed Figure 4-3 Gaussian curves showing the effect of doubling the standard deviation. P.85

Relative standard deviation
The relative standard deviation is the standard deviation divided by the average. standard deviation average relative standard deviation P.85

Variance Variance is the square (平方值) of the standard deviation.

Example: Mean and Standard Deviation
Find the mean, standard deviation, and relative standard deviation for the set of measurements (7, 18, 10, 15). SOLUTION: P.85

Standard Deviation and Probability
In an ideal Gaussian distribution,  68.3% of the measurement lie within μ± 1σ region 95.5% of the measurement lie within μ± 2σ region 99.7% of the measurement lie within μ± 3σ region

對任何的常態分佈而言平均數上下0.5個標準差之間的面積為 0.38 上下1個標準差之間的面積為 0.68 上下1.645個標準差之間的面積為 0.90 上下1.96個標準差之間的面積為 0.95 上下3個標準差之間的面積為 0.997

Standard Deviation and Probability
Table 4-1 shows the correspondence between ideal Gaussian behavior and the observations in Figure 4-2. Ideally, there should be 68.3% of measurement fall in μ±1σ region. In Figure 4-2, 71% of measurement fall in m μ±1σ region.

4-2 Student’s t t test

2 applications of Student’s t
(i) it is the statistical tool used to express confidence intervals (ii) It can be used to compare results from different experiments. 信賴區間 P.86

Confidence Intervals t is Student’s t, taken from Table 4-2
The confidence interval is a range of values within which there is a specified probability of finding the true mean. the measured standard deviation measurement # t is Student’s t, taken from Table 4-2 P.86

Table 4-2 P.87

Example : Calculating Confidence Intervals
In replicate analyses, the carbohydrate content of a glycoprotein (a protein with sugars attached to it) is found to be 12.6, 11.9, 13.0, 12.7, and 12.5 g of carbohydrate per 100 g of protein. Find the 50% and 90% confidence intervals for the carbohydrate content. P.87

SOLUTION: the 50 % confidence interval is 5 - 1

SOLUTION: the 90 % confidence interval is 5 - 1

Better precision gives smaller confidence intervals.
There is 50% chance that the true mean lies in the range ± 0.13, i.e., to There is 90% chance that the true mean lies in the range ± 0.38, i.e., to Better precision gives smaller confidence intervals. A higher chance to find the true mean within a small interval.

公式用measurement # : n 查表用degree of freedom : n-1

Improving the Reliability of Your Measurements
Better precision gives smaller confidence intervals. The confidence interval is ± ts/√n. To reduce the size of the confidence interval we may make more measurements to increase n we may improve measurement procedure to decrease s The only way to reduce s is to improve your experimental procedure. In the absence of a procedural change, the way to reduce the confidence interval is to increase the number of measurements.

t-test Comparison of Means with Student’s t
Student’s t can be used to compare two sets of measurements to decide whether they are “statistically different.”

An example comes from the work of Lord Rayleigh (John W
An example comes from the work of Lord Rayleigh (John W. Strutt), who received the Nobel Prize in 1904 for discovering the inert gas argon. Figure 4-4 Figure 4-4 Lord Rayleigh’s measurements of the mass of nitrogen isolated from air or generated by decomposition of nitrogen compounds.

Here Spooled is a pooled standard deviation making use of both sets of data.
If the calculated t is greater than the tabulated t at the 95% confidence level  the two results are considered to be significantly different.

Degrees of freedom of two sets of measurements

Is Rayleigh’s N2 from Air Denser than N2 from Chemicals?
Example : Is Rayleigh’s N2 from Air Denser than N2 from Chemicals? Are the two masses significantly different? 由兩種不同氣體來源來測氮氣質量答案不同嗎? 可由t-test來檢查 P.90

SOLUTION: 7 measurements 8 measurements

Degrees of freedom = 7 + 8 – 2 = 13
For 13 degrees of freedom, t lies between and at the 95% confidence level. The calculated t = 20.2 is greater than the tabulated t  so the difference is significant.

4-3 A Spreadsheet for the t Test
skip

Gcalculated questionable value mean /s
Gcalculated questionable value mean /s. If Gcalculated Gtable, the value in question can be rejected with 95% confidence. Values in this table are for a one-sided test, as recommended by ASTM.SOURCE: ASTM E Standard Practice for Dealing with Outlying Observations; F. E. Grubbs and G. Beck, Technometrics 1972, 14, 847.

4-4 Grubbs Test for an Outlier

It appears that the value of 7.8 by Cheryl is questionable.
Four students performed the experiment in triplicate and pooled their results: 10.2, 10.8, , 9.4, , 9.2, , 10.6, 11.6 Sidney Cheryl Tien Dick It appears that the value of 7.8 by Cheryl is questionable. Should we retain this measurement? To answer this question, use Grubbs test.

Grubbs test Compute the Grubbs statistic G, defined as Eq. 4-6 If G calculated from Equation 4-6 is greater than G in Table 4-4, the questionable point should be discarded.

Gcalculated < Gtable
10.2, 10.8, , 9.4, , 9.2, , 10.6, 11.6 Sidney Cheryl Tien Dick Gcalculated = ( ) / 1.11 = 2.13 Gtable = for 12 observation in Table 4-4 Gcalculated < Gtable The observation of value 7.8 should be retained.  There is 5% chance that the value 7.8 is a member of the same population as the other measurements.

 the data should be discarded.
10.2, 10.8, , 9.4, , 9.2, , 10.6, 11.6 Sidney Cheryl Tien Dick Gcalculated (2.285) < Gtable (2.13)  The observation of value 7.8 should be retained.  There is 5% chance that the value 7.8 is a member of the same population as the other measurements. Common sense must always prevail. If Cheryl knows that her measurement was low because she spilled some of her unknown, then the probability that the result is wrong is 100%  the data should be discarded. Any data based on a faulty procedure should be discarded, no matter how well it fits the rest of data.

4-5 Finding the “Best” Straight Line

The method of least squares
 finds the “best” straight line through experimental data points. m: the slope b: the y-intercept Figure 4-6

Figure 4-6 P.93

One example

The method of least squares finds the “best” line by adjusting the line to minimize the vertical deviations between the points and the line.

Figure 4-7 Figure 4-7 Least-squares curve fitting minimizes the sum of the squares of the vertical deviations of the measured points from the line. The Gaussian curve drawn over the point (3, 3) is a schematic indication of the distribution of measured y values about the straight line. The most probable value of y falls on the line, but there is a finite probability of measuring y some distance from the line. P.93

vertical deviation = di = yi (實際測量值) － y (直線模型預測值)
= yi － (mxi＋b) Σdi2 = Σ (yi－y)2 = Σ ( yi－mxi－b)2 Because we minimize the squares of the deviations, this is called the method of least squares.

You may find out how to derive the slope and intercept in your calculus textbook.

Table 4-5 sets out an example in which the four points (n = 4) in Figure 4-7 are treated.
4 data points: (1,2), (3,3), (4,4), and (6,5) Figure 4-7

How Reliable Are Least-Squares Parameters?
The uncertainties in m and b are related to the uncertainty in measuring each value of y. standard deviation of y: y (± sy) = [m (± sm) ]x + [b (± sb) ]

The first decimal place of the standard deviation is the last significant figure of the slope or intercept.

4-6 Constructing a Calibration Curve

A calibration curve is a graph showing how the experimentally measured property depends on the known concentrations of the standards. Linear range The detection limit (DL) is the smallest concentration that can be reported with a certain level of confidence. Skoog

Use data from a spectrophotometric analysis to show how to construct a calibration curve.
Table 4-6 A result obtained with zero analyte is called a blank (or a reagent blank), because it measures effects due to the analytical reagents. A solution containing a known quantity of analyte (or other reagent) is called a standard solution.

Table 4-6 Subtract the average absorbance of blanks (0.0993)

This data point is removed because it is a bad data.

Theses three data points are not included because they lie below the straight line.
The linear range is between 0 and 2.0 mg but not between 0 and 2.5 mg.

Figure 4-8 Once the calibration line is constructed, you may estimate the unknown’s concentration by measuring its absorbance.

method of least squares

Finding the Protein in a Unknown
Suppose that the measured absorbance of an unknown sample is How many micrograms of protein does it contain? What uncertainty is associated with the answer? 0.373 – blank

What uncertainty is associated with the answer?
mean values of x and y for the points in the calibration line k: the number of replicate measurements of the unknown n: the number of data points included in the calibration line

4-7 A Spreadsheet for Least Squares

Figure 4-9 uses built-in power of Excel for least-squares calculations of straight lines.
skip Figure 4-9 Spreadsheet for least-squares calculations.

Figure 4-10 Adding error bars corresponding to the 95% confidence interval for each data point

Figure 4-11 Figure 4-11 Format Data Series window for
Adding error bars to a graph.

Exploring Chemical Analysis Fourth Edition

Similar presentations

Presentation on theme: "Exploring Chemical Analysis Fourth Edition"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Exploring Chemical Analysis Fourth Edition

Similar presentations

Presentation on theme: "Exploring Chemical Analysis Fourth Edition"— Presentation transcript:

Similar presentations

About project

Feedback