Download presentation
Presentation is loading. Please wait.
Published byDoreen Clark Modified over 9 years ago
1
Statistical Intervals for a Single Sample From only one sample, An interval has been found. Because the sample was ample, The results were quite profound! - author unknown circa 2007 Chapter 8A
2
What to Look Forward to this Week Today only!
3
A Diversion – the sampling distributions or distributions arising from the normal If Z 1, Z 2,..., Z n are independent standard normal random variables, then chi-square distribution with n degrees freedom t-distribution with n degrees freedom F-distribution with n degrees freedom in the numerator and m degrees of freedom in the denominator
4
What will be Chi-square? Let X i be the i th sample value from a normal population
5
The Chi-square Distribution k = degrees of freedom
6
The t-distribution also known as Student’s t f(x) = v = k = df
7
More about Student The t statistic was introduced by William Sealy Gosset for cheaply monitoring the quality of beer brews. "Student" was his pen name. Gosset was a statistician for the Guinness brewery in Dublin, Ireland, and was hired due to Claude Guinness's innovative policy of recruiting the best graduates from Oxford and Cambridge to apply biochemistry and statistics to Guinness' industrial processes. Gosset published the t test in Biometrika in 1908, but was forced to use a pen name by his employer who regarded the fact that they were using statistics as a trade secret.
8
The F-distribution f(x) B(m,n) is the Beta function an interesting property:
9
F-Distribution named after R.A. Fisher Born 17 February 1890(1890-02-17) East Finchley, London, England Died 29 July 1962 (aged 72) Adelaide, Australia Residence England, Australia Nationality British, Field Statistics, Genetics, Natural selection Institutions Rothamsted Experimental Station University College London, Cambridge University Alma mater Cambridge University Academic advisor Sir James Jeans F.J.M. Stratton Notable students C.R. Rao Known for Maximum likelihood Fisher information Analysis of variance Notable prizes Royal Medal (1938) Copley Medal (1955)
10
Three types of Intervals -Confidence Interval – bound population parameter or distribution parameter. -Tolerance Interval – bound a proportion of the distribution at a certain confidence level. - Prediction Interval – bound a single observation => assumptions on population distribution critical here.
11
Overheard at a rest stop I know that my average driving time on this daily route has been 2.3 hours over the last 7 days. However, that is based on a sample and therefore is unlikely to equal my population mean. What I really need is some way to measure how precise this estimate is.
12
A typical prob-stat graduate Confidence Interval - A statement consisting of two values between which the population parameter is estimated to lie. Reliability – degree of confidence – the probability with which the population parameter will be “captured by the two values. Precision – the length of the confidence interval (a measure of the error in estimating the parameter. I am 95% confident that my mean driving time to work is between 37.4 minutes and 41.2 minutes. 41.2 – 37.4 = 3.8 Best estimate is midpoint = 39.3 Error = 1.9 minutes
13
The Big Picture of a Confidence Interval (L,U) is 100(1- )% CI for the population parameter
14
The Bigger Picture of a Confidence Interval General Approach: Estimate reliability Factor x Standard Error our point estimate our confidence our precision
15
The Big gest Picture of a Confidence Interval Measure of Risk Measure of Uncertainty (precision) (Random Variables; i.e. statistics population parameter The length of a confidence interval is a measure of the precision of estimation. (1 – )% of the C.I.s constructed this way contain the mean. Watch the interpretation of this concept.
16
Confidence Interval on Mean of Normal Distribution Variance Known
17
Our Very First Real Confidence Interval A sample of 100 batteries are tested for their operating life. They averaged (mean) 10 hours before failing. The manufacturer has assured us that the population variance is 16 hours. Find a 95 percent confidence interval for the mean life of this particular type of battery.
18
Sample Size and Precision Think of E as a measure of practical significance.
19
Our Very First Real Confidence Interval Revisited For our battery problem, what sample size is required to reduce the error to.5 hr. with a 99% confidence?
20
Problem 8-12 Life of a 75 watt bulb is normally distributed with std dev = 25 hrs. Suppose we want to be 95% confident that the error in estimating mean life is less than 5 hours. Find a sample size.
21
Problem 8-10 Diameter of holes for a cable harness is normally distributed with a standard deviation of.01 in. A random sample of 10 yields average diameter of 1.5045 in. Find a 99% two-sided confidence interval.
22
Interpreting a Confidence Interval The confidence interval is a random interval The appropriate interpretation of a confidence interval (for example on ) is: The observed interval [l, u] brackets the true value of , with confidence 100(1- ). Examine Figure 8-1 on the next slide.
23
Figure 8-1 Repeated construction of a confidence interval for
24
Repeated Confidence Intervals, gen. samples Random samples from a standard normal distribution, N(0,1). Generated in Excel as NORMSINV(RAND())
25
Repeated Confidence Intervals, hits and misses Since these are not 10 and 5, respectively, is there an error?
26
Repeated Confidence Intervals,.90 How are these probabilities being generated? Let X = RV, number of misses. Then X ~ Bin(100,.1) E[X] = np = 10
27
Repeated Confidence Intervals,.95 Major Point: Can you see how probability helps us assess the risk associated with statistical inference?
28
One-sided Confidence Bounds
29
Our Very First Real Confidence Interval Revisited Again A One-Sided Confidence Interval Based upon the sample of 100 batteries averaging (mean) 10 hours to failure. The manufacturer continues to assure us that the population variance is 16 hours. Find a 95 percent lower confidence interval for the mean life of this particular type of battery.
30
A Transition… The previous development of a confidence interval was limited in two ways: - Needed a Normal population - Needed to know the standard deviation of the Normal distribution The Central Limit Theorem eliminates the need to explicitly know the population is normal – Z will still be approximately standard normal We can estimate using the sample standard deviation, s.
31
Confidence Interval on Mean of Normal – Variance Unknown Same form as for the normal – measure of risk is now from the t distribution, and we boldly use the sample standard deviation – protected by the heavy-tailed t distribution-even when sample size is small! Remember we are still assuming that observations from the underlying population are normally distributed.
32
Where Does it Come From? Do we care? From earlier: numerator is standard normal denominator is chi- square divided by d.f. (n-1) T has a t distribution with n-1 degrees of freedom.
33
Are we still caring?
34
Confidence Interval on Mean of Normal – Variance Unknown For large samples the distributional assumption is not critical. If sample size is not large use the t distribution As n ∞, t distribution becomes standard normal.
35
t Distribution Converges to Standard Normal For large sample size, use the normal distribution even if the variance is unknown.
36
The t distribution
37
Our Very First CI using the t-distribution Sulfur dioxide and nitrogen oxide are products of fossil fuel consumption. These compounds can be carried long distances and converted to acid before being deposited in the form of “acid rain.” The following sulfur dioxide concentrations (in micrograms per cubic meter) were obtained from different locations in a forest though to have been damaged by acid rain. Estimate the mean concentration in the forest. Average concentration in undamaged areas is 20 g/m 3
38
It’s Official Now
39
Stay Tuned – next time…
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.