Presentation is loading. Please wait.

Presentation is loading. Please wait.

Statistical Methods II&III: Confidence Intervals ChE 477 (UO Lab) Lecture 5 Larry Baxter, William Hecker, & Ron Terry Brigham Young University.

Similar presentations


Presentation on theme: "Statistical Methods II&III: Confidence Intervals ChE 477 (UO Lab) Lecture 5 Larry Baxter, William Hecker, & Ron Terry Brigham Young University."— Presentation transcript:

1 Statistical Methods II&III: Confidence Intervals ChE 477 (UO Lab) Lecture 5 Larry Baxter, William Hecker, & Ron Terry Brigham Young University

2 Population vs. Sample Statistics Population statistics –Characterizes the entire population, which is generally the unknown information we seek –Mean generally designated as  –Variance & standard deviation generally designated as  , and , respectively Sample statistics –Characterizes the random sample we have from the total population –Mean generally designated –Variance and standard deviation generally designated as s 2 and s, respectively

3 Overall Approach Use sample statistics to estimate population statistics Use statistical theory to indicate the accuracy with which the population statistics have been estimated Use trends indicated by theory to optimize experimental design

4 Data Come From pdf

5 Histogram Approximates a pdf

6 All Statistical Info Is in pdf Probabilities are determined by integration. Moments (means, variances, etc.) Are obtained by simple means. Most likely outcomes are determined from values.

7 Gaussian or Normal pdf Pervasive

8 Properties of a Normal pdf About 68.26%, 95.44%, and 99.74% of data lie within 1, 2, and 3 standard deviations of the mean, respectively. When mean is zero and standard deviation is 1, it is referred to as a standard normal distribution. Plays fundamental role in statistical analysis because of the Central Limit Theorem.

9 Lognormal Distributions Used for non-negative random variables. –Particle size distributions. –Drug dosages. –Concentrations and mole fractions. –Duration of time periods. Similar to normal pdf when variance is < 0.04.

10 Student’s t Distribution Widely used in hypothesis testing and confidence intervals Equivalent to normal distribution for large sample size

11 Central Limit Theorem Possibly most important single theory in applied statistics Deals with distributions of normalized sample and population means Not quite applicable because it assumes population mean and variance are known

12 Central Limit Theorem Distribution of means calculated from data from most distributions is approximately normal –Becomes more accurate with higher number of samples –Assumes distributions are not peaked close to a boundary

13 Student’s t Distribution Used to compute confidence intervals according to Assumes mean and variance estimated by sample values

14 Values of Student’s t Distribution Depends on both confidence level being sought and amount of data. Degrees of freedom generally n -1, with n = number of data points (assumes mean and variance are estimated from data and estimation of population mean only). This table assumes two- tailed distribution of area.

15 Sample Size Is Important Confidence interval decreases proportional to inverse of square root of sample size and proportional to decrease in t value. Limit of t value is normal distribution. Limit of confidence interval is 0.

16 Theory Can Be Taken Too Far Accuracy of measurement ultimately limits confidence interval to something greater than 0. Not all sample means are appropriately treated using central limit theorem and t distribution.

17 Typical Numbers Two-tailed analysis Population mean and variance unknown Estimation of population mean only Calculated for 95% confidence interval Based on number of data points, not degrees of freedom

18 An Example Five data points with sample mean and standard deviation of 713.6 and 107.8, respectively. The estimated population mean and 95% confidence interval is:

19 Point vs. Model Estimation Point estimation –Characterizes a single, usually global value –Generally simple mathematics and statistical analysis –Procedures are unambiguous Model development –Characterizes a function of dependent variables –Complexity of parameter estimation and statistical analysis depend on model complexity –Parameter estimation and especially statistics somewhat ambiguous

20 Overall Approach Assume model Estimate parameters Check residuals for bias or trends Estimate parameter confidence intervals Consider alternative models

21 Data Come From pdf

22 Histogram Approximates a pdf

23 General Confidence Interval Degrees of freedom generally = n-p, where n is number of data points and p is number of parameters Confidence interval for parameter given by

24 Linear Fit Confidence Interval For intercept: For slope:

25 Definition of Terms

26 Confidence Interval for Y at a Given X


Download ppt "Statistical Methods II&III: Confidence Intervals ChE 477 (UO Lab) Lecture 5 Larry Baxter, William Hecker, & Ron Terry Brigham Young University."

Similar presentations


Ads by Google