Stat 13, Thu 5/10/12. 1. CLT again. 2. CIs. 3. Interpretation of a CI. 4. Examples. 5. Margin of error and sample size. 6. CIs using the t table. 7. When.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Statistics and Quantitative Analysis U4320
Ch. 8 – Practical Examples of Confidence Intervals for z, t, p.
Confidence Intervals for Proportions
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 18, Slide 1 Chapter 18 Confidence Intervals for Proportions.
Math 161 Spring 2008 What Is a Confidence Interval?
Estimation Procedures Point Estimation Confidence Interval Estimation.
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 23 = Finish Chapter “Confidence Interval Estimation” (CIE)
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 22 = More of Chapter “Confidence Interval Estimation”
Inference about a Mean Part II
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Estimation 8.
Chapter 19: Confidence Intervals for Proportions
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Confidence Intervals for Means. point estimate – using a single value (or point) to approximate a population parameter. –the sample mean is the best point.
Estimates and Sample Sizes Lecture – 7.4
AP Statistics Chap 10-1 Confidence Intervals. AP Statistics Chap 10-2 Confidence Intervals Population Mean σ Unknown (Lock 6.5) Confidence Intervals Population.
AP STATISTICS LESSON 10 – 1 (DAY 2)
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
When σ is Unknown The One – Sample Interval For a Population Mean Target Goal: I can construct and interpret a CI for a population mean when σ is unknown.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Stat 13, Tue 5/8/ Collect HW Central limit theorem. 3. CLT for 0-1 events. 4. Examples. 5.  versus  /√n. 6. Assumptions. Read ch. 5 and 6.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Stat 13, Tue 5/15/ Hand in HW5 2. Review list. 3. Assumptions and CLT again. 4. Examples. Hand in Hw5. Midterm 2 is Thur, 5/17. Hw6 is due Thu, 5/24.
STA Lecture 181 STA 291 Lecture 18 Exam II Next Tuesday 5-7pm Memorial Hall (Same place) Makeup Exam 7:15pm – 9:15pm Location TBA.
STA291 Statistical Methods Lecture 18. Last time… Confidence intervals for proportions. Suppose we survey likely voters and ask if they plan to vote for.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Chapter 8 Parameter Estimates and Hypothesis Testing.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Confidence intervals. Want to estimate parameters such as  (population mean) or p (population proportion) Obtain a SRS and use our estimators, and Even.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Intervals for a Population Mean, Standard Deviation Unknown.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
Many times in statistical analysis, we do not know the TRUE mean of a population on interest. This is why we use sampling to be able to generalize the.
Copyright © 2010 Pearson Education, Inc. Slide
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Statistics 19 Confidence Intervals for Proportions.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Confidence Intervals Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Chapter 8 Confidence Interval Estimation Statistics For Managers 5 th Edition.
Solution: D. Solution: D Confidence Intervals for Proportions Chapter 18 Confidence Intervals for Proportions Copyright © 2010 Pearson Education, Inc.
Confidence Intervals for Proportions
Confidence Interval Estimation
Confidence Intervals for Proportions
Confidence Intervals for Proportions
Estimating
CONCEPTS OF ESTIMATION
WARM - UP 1. How do you interpret a Confidence Interval?
Chapter 8: Estimating with Confidence
Confidence Interval Estimation
Chapter 6 Confidence Intervals
Confidence Intervals for Proportions
Confidence Intervals for Proportions
Estimates and Sample Sizes Lecture – 7.4
Confidence Intervals for Proportions
Presentation transcript:

Stat 13, Thu 5/10/ CLT again. 2. CIs. 3. Interpretation of a CI. 4. Examples. 5. Margin of error and sample size. 6. CIs using the t table. 7. When to use z* and t*. Read ch. 5 and 6. Hw5 is due Tue, 5/15. Midterm 2 is Thur, 5/17. On Thur, 5/17, I won’t be able to have my usual office hour from 230 to 3:30, so it will be instead from 1:30 to 2:15pm. 1

1. Central Limit Theorem (CLT). If you have a SRS (or observations are iid), and n is large (or the population is normally distributed), then is normally distributed with mean µ and std deviation, where  is the std deviation of the population and n is the sample size. 2

2. CIs. The examples from last class were a little artificial, because we KNEW the population mean µ. Usually you take a sample because you don't know µ. We then use the sample mean to estimate the population mean µ. But what if we want a range, or interval, where we think µ is likely to fall, based on ? That's called a confidence interval (CI). We know from the CLT that is normally distributed with mean µ and std deviation. This means the difference between and µ is typically around. So from this info, we can tell given where µ seems likely to lie. For instance, if we know = 10, and = 1, then it seems pretty likely that µ is between 9 and 11, and very likely between 8 and 12. The way to get a c%-confidence interval using the Z table: * First find the values from the table that contain the middle c% of the area under the standard normal curve. If c = 95, that means 2.5% is to the right of the region, and 2.5% (0.025) is to the left, so you look in Table A til you find and you see the appropriate value is We call this z* = (or see bottom row of table 4 or in back of book: 95% corresponds to % would correspond to ) 3

The way to get a c%-confidence interval using the Z table: * First find the values from the table that contain the middle c% of the area under the standard normal curve. If c = 95, that means 2.5% is to the right of the region, and 2.5% (0.025) is to the left, so you look in Table A til you find and you see the appropriate value is We call this z* = (or see bottom row of table 4 or in back of book: 95% corresponds to c = 80 would correspond to z* = ) * Now, just use the formula: +/- z*, and you have your CI. For a different confidence level besides 95%, the value of z* would change. The use of this formula is based on the CLT. It can only be used if the following assumptions are met: (i) SRS (or somehow you know that the observations are iid), AND (ii) n is large (or population is ~ normal and  is known). Typically you don't know . If n is large you can just plug in s, the standard deviation of the observations in your SAMPLE. In the case of 0-1 data, s =, where and are the proportion of 0's and 1's in the sample. 4

3. Interpretation of a 95% CI: there's a 95% chance that the CI contains the true population mean µ. The CI is a random variable (statistic, estimate): If another sample were taken, there'd be a different sample mean, and therefore a different CI. Unless we're really unlucky, our CI will contain µ. That is, if we kept sampling over and over, and each time we got a different and a different 95%-CI, then 95% of these CIs would contain µ. 4. Examples. Suppose we don't know the mean amount of wet manure produced by the avg cow. We sample 400 cows and find that in our sample, the mean is = 18 pounds, and the sample standard deviation is s = 5 pounds. Find a 92%-CI for the population mean. Answer: It’s a SRS and n = 400 is large, so the standard formulas apply, but we don’t know  so we will plug in s. For a 92%-CI, we want the values containing 92% of the area, which means 4% is to the right and 4% is to the left, so from the table, z* = The CI is +/- (z*)s/√n = 18 +/- (1.75)(5) ÷ √400 = 18 +/

Another example. Suppose we don't know the percentage of people with peanut allergies. We take a SRS of 900 people. We find that 72 of them (8.0%) of them have peanut allergies. Find a 90%-CI for the population percentage of people with peanut allergies. Answer: This is a 0-1 question. It’s a SRS and n is large because there are 72 with allergies and 828 without, and both of these are ≥ 10. So the standard formulas apply. For a 90%-CI, z* = from the bottom row of Table 4. The formula for the 90%-CI is +/- z*  /√n. We don't know  so use s = = √ (8.0% x 92.0%) ~ Our 90%-CI is 8.0% +/- (1.645) (0.271) / √900 which is 8.0% +/ %. 5. Margin of error and sample size. This +/- part is called a margin of error. 6

5. Margin of error and sample size. This +/- part is called a margin of error (m in the book). m = z*  /√n. Suppose you know what margin of error, m, you want. But you don't know what sample size n you need. Just let m = z*  /√n. Solving for n, we get n = (z*  / m) 2. This tells you how large the sample size needs to be to achieve the margin of error. Typically for margin of error you want a 95%-confidence level, so z* = 1.96, unless otherwise specified. Example: Continuing with peanut allergies, we took a SRS of 900 people and found that 72 of them (8.0%) of them had peanut allergies and a 90%-CI for the population percentage of people with peanut allergies was 8.0% +/ %. How many more people are needed to get this margin of error for the 90%-CI down to 1%? Answer: n = (z*  / m) 2. Here it’s a 90%-CI so z* =  is unknown so use s = √ (8.0% x 92.0%) ~ m = 1%. So, n = (1.645 x /.01) 2 ~ We already have 900 so we need 1087 more. 7

6. Using the t table. Assumptions for CIs using the Z (std normal) table: (i) SRS (or somehow you know that the observations are iid), AND (ii) n is large (or the population is normal and  is known). Under these conditions, the CLT says that is normally distributed with mean µ and std deviation, so a CI is +/- z*, and you can substitute s for . If n is small and you know the population is normal, then s might be substantially different from . If  is unknown but estimated using s, then use of the t table is appropriate, rather than the Z table. Specifically, if you have: (i) SRS (or the observations are iid), AND (ii) population is normal, AND (iii)  is unknown, and estimated with s, then is t n-1 distributed with mean µ and std deviation, so a CI is +/- t* s/√n. t* is given in Table 4 or the back of the book. n-1 is the “degrees of freedom” (df). Can't use the Z table when n is small and distribution of the population is unknown. 8

Example using the t table. Suppose you take a SRS of 10 patients with hand, foot and mouth disease and record their ages. You find that is 12 and s = 7. Find a 95% CI for µ, the mean age among the whole population of patients with hand, foot and mouth disease, assuming the ages in this population are normally distributed. Answer. Here we have a SRS, the pop. is normal, and  is unknown, so use the t table. df = n-1 = 10-1 = 9. From Table 4, for a 95% CI, with df = 9, t* = So, the 95% CI is +/- t* s/√n = 12 +/ (7)/√10 = 12 +/- 5.01, or the interval (6.99,17.01). Note that if the population is 0s and 1s, then this contradicts the assumption that the population is normal, so you’d never use the t table with this type of data. 9

7. When to use z* and t*. The book seems to always recommend using t* rather than z*. a) If it's a simple random sample (SRS) and the population is normal,  is unknown, and n is small (< 25), then use t*. b) If it's a SRS and the population is normal,  is known, and n is small (< 25), then use z*. c) If it's a SRS and n is large, then t* and z* are very close together, so it doesn't really matter which you use. The book recommends t*, but I'm going to suggest you use z* since it's easier to determine, especially when the sample size is such that the df isn't a value in the table on the last page of the book. On the hw, I will tell the reader to accept either t* or z* for this case, and similarly on my exams. d) One thing that's crucial to me is that you understand that, if the population might NOT be normal and n is NOT large, then neither t* nor z* is appropriate. 10