Chapter 10: Estimating With Confidence. 10.1 – Confidence Intervals: The basics.

Slides:



Advertisements
Similar presentations
Estimating a Population Proportion
Advertisements

Estimating with Confidence Estimating an unknown parameter  Ex: The admissions director at a University proposes using the IQ scores of current.
Conditions with σ Unknown Note: the same as what we saw before.
Confidence Intervals with Means Chapter 9. What is the purpose of a confidence interval? To estimate an unknown population parameter.
Confidence Intervals Chapter 7. Rate your confidence Guess my mom’s age within 10 years? –within 5 years? –within 1 year? Shooting a basketball.
Confidence Intervals Chapter 10. Rate your confidence Name my age within 10 years? 0 within 5 years? 0 within 1 year? 0 Shooting a basketball.
Chapter 11- Confidence Intervals for Univariate Data Math 22 Introductory Statistics.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 10 th Edition.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
Fall 2006 – Fundamentals of Business Statistics 1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 7 Estimating Population Values.
8-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Statistics for Managers using Microsoft.
Copyright ©2011 Pearson Education 8-1 Chapter 8 Confidence Interval Estimation Statistics for Managers using Microsoft Excel 6 th Global Edition.
Chapter 11: Inference for Distributions
Confidence Intervals Chapter 10. Rate your confidence Name my age within 10 years? within 5 years? within 1 year? Shooting a basketball at a wading.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Copyright © 2005 by Evan Schofer
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
+ DO NOW What conditions do you need to check before constructing a confidence interval for the population proportion? (hint: there are three)
Chapter 6 Confidence Intervals.
10.1 Estimating With Confidence
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Dan Piett STAT West Virginia University
Confidence Interval Estimation
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
AP Statistics Chap 10-1 Confidence Intervals. AP Statistics Chap 10-2 Confidence Intervals Population Mean σ Unknown (Lock 6.5) Confidence Intervals Population.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Estimating the Value of a Population Parameter 9.
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
Lesson Estimating a Population Mean. Knowledge Objectives Identify the three conditions that must be present before estimating a population mean.
PARAMETRIC STATISTICAL INFERENCE
CHAPTER 18: Inference about a Population Mean
When σ is Unknown The One – Sample Interval For a Population Mean Target Goal: I can construct and interpret a CI for a population mean when σ is unknown.
1 Chapter 6 Estimates and Sample Sizes 6-1 Estimating a Population Mean: Large Samples / σ Known 6-2 Estimating a Population Mean: Small Samples / σ Unknown.
Confidence Intervals with Means Unit 12, Notes p 189.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Section 10.1 Confidence Intervals
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Confidence Intervals For a Sample Mean. Point Estimate singleUse a single statistic based on sample data to estimate a population parameter Simplest approach.
Steps to Construct ANY Confidence Interval:
Matched Pairs t-procedures: Subjects are matched according to characteristics that affect the response, and then one member is randomly assigned to treatment.
AP Statistics Chapter 10 Notes. Confidence Interval Statistical Inference: Methods for drawing conclusions about a population based on sample data. Statistical.
Confidence Intervals. Rate your confidence Name my age within 10 years? within 5 years? within 1 year? Shooting a basketball at a wading pool,
Estimation of a Population Mean
Lesson 10 - R Review of Chapter 10 Confidence Intervals.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Chapter 8: Estimating With Confidence. 8.3 – Estimating a Population Mean In the previous examples, we made an unrealistic assumption that the population.
Chap 8-1 Chapter 8 Confidence Interval Estimation Statistics for Managers Using Microsoft Excel 7 th Edition, Global Edition Copyright ©2014 Pearson Education.
Confidence Intervals with Means. Rate your confidence Name my age within 10 years? Name my age within 10 years? within 5 years? within 5 years?
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.3 Estimating a Population Mean.
Department of Quantitative Methods & Information Systems
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
+ Unit 5: Estimating with Confidence Section 8.3 Estimating a Population Mean.
+ Z-Interval for µ So, the formula for a Confidence Interval for a population mean is To be honest, σ is never known. So, this formula isn’t used very.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
+ Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population Mean.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Confidence Intervals with Means Chapter 9. What is the purpose of a confidence interval? To estimate an unknown population parameter.
Confidence Intervals Chapter 9. How confident are you? Guess my weight… –within 50 pounds –within 20 pounds –within 1 pound Shoot a basketball… –at a.
Chapter 8: Estimating With Confidence
Review of Chapter 10 Confidence Intervals
Lecture 15 Sections 7.3 – 7.5 Objectives:
Chapter 8: Estimating With Confidence
8.3 – Estimating a Population Mean
Click the mouse button or press the Space Bar to display the answers.
Confidence Interval for a Population mean ( known)
Chapter 8: Estimating With Confidence
Confidence Interval for a Population mean ( known)
Presentation transcript:

Chapter 10: Estimating With Confidence

10.1 – Confidence Intervals: The basics

Statistical Inference: Using sample data to draw conclusions about a population Note:Each sample may vary, but the population parameter doesn’t!

Sampling Distribution: If population is approximately normal, so is the sample distribution If population is skewed, the sample distribution is approximately normal if n  30 by the central limit theorem If given sample data, look at the distribution to assess normality if needed. (Normal Prob. Plot)

Confidence Interval: Uses the sample distribution to predict population parameter It is an interval of numbers above and below the sample statistic

Confidence Level: The probability the interval will capture the true parameter value in repeated samples Critical Value: The probability p lying to its right under the standard Normal curve. ( Z* )

Margin of Error: How accurate our estimate is based on the variability of the sample distribution. We add and subtract this from our estimate. estimate  margin of error Caution! Margin of error is only from random sampling errors. This does not include errors in collecting the data!

Most Common Critical Values Confidence Level (C)Upper tail prob.Z* Value 90% Z=? 0.05 

Most Common Critical Values Confidence Level (C)Upper tail prob.Z* Value 95% Z=? 

Most Common Critical Values Confidence Level (C)Upper tail prob.Z* Value 99% Z=? 

Calculator Tip:Critical Values 2 nd Dist – invNorm( (1 + C)/2 ) OR: Look at the T-Tables for the most common ones! (You will learn more about them later)

(Z-Interval) estimate  margin of error Confidence Interval for a Population mean (  known)  estimate  critical value  standard error

Properties of Confidence Intervals The interval is always centered around the statistic The higher the confidence level, the wider the interval becomes If you increase n, then the margin of error decreases

Calculator Tip:Z-Interval Stat – Tests – ZInterval Data: If given actual values Stats: If given summary of values

Interpreting a Confidence Interval: I am C% confident that the true parameter is captured in the interval What you will say: If we took many, many, SRS from a population and calculated a confidence interval for each sample, C% of the confidence intervals will contain the true mean What it means:

CAUTION! Never Say: The interval will capture the true mean C% of the time. It either does or does not!

Conditions for a Z-Interval: 1.SRS (CLT or population approx normal) (should say) (Population 10x sample size) 2. Normality 3. Independence

Steps to Construct ANY Confidence Interval: PANIC P:Parameter of Interest (what are you looking for?) A:Assumptions (what are the conditions?) N:Name the type of interval (what type of data do we have?) I:Interval (Finally! You can calculate!) C:Conclusion in context (I am ___% confident the true parameter lies between ________ and _________)

Example #1 Serum Cholesterol-Dr. Paul Oswick wants to estimate the true mean serum HDL cholesterol for all of his year old female patients. He randomly selects 30 patients and computes the sample mean to be Assume from past records, the population standard deviation for the serum HDL cholesterol for year old female patients is  =13.4. a.Construct a 95% confidence interval for the mean serum HDL cholesterol for all of Dr. Oswick’s year old female patients. P:The true mean serum HDL cholesterol for all of Dr. Oswick’s year old female patients.

A: SRS: Normality: Independence: N:One sample Z-Interval Says randomly selected Approximately normal by the CLT (n  30) I am assuming that Dr. Oswick has 300 patients or more.

I:

C:I am 95% confident the true mean serum HDL cholesterol for all of Dr. Oswick’s year old female patients is between and

Example #1 Serum Cholesterol-Dr. Paul Oswick wants to estimate the true mean serum HDL cholesterol for all of his year old female patients. He randomly selects 30 patients and computes the sample mean to be Assume from past records, the population standard deviation for the serum HDL cholesterol for year old female patients is  =13.4. b. If the US National Center for Health Statistics reports the mean serum HDL cholesterol for females between years old to be  = 53, do Dr. Oswick’s patients appear to have a different serum level compared to the general population? Explain. No, 53 is contained in the interval.

Example #1 Serum Cholesterol-Dr. Paul Oswick wants to estimate the true mean serum HDL cholesterol for all of his year old female patients. He randomly selects 30 patients and computes the sample mean to be Assume from past records, the population standard deviation for the serum HDL cholesterol for year old female patients is  =13.4. c. What two things could you do to decrease your margin of error? Increase n Lower confidence level

Example #2 Suppose your class is investigating the weights of Snickers 1- ounce Fun-Size candy bars to see if customers are getting full value for their money. Assume that the weights are Normally distributed with standard deviation = ounces. Several candy bars are randomly selected and weighed with sensitive balances borrowed from the physics lab. The weights are ounces. Determine a 90% confidence interval for the true mean, µ. Can you say that the bars weigh 1oz on average? P:The true mean weight of Snickers 1-oz Fun-size candy bars

A: SRS: Normality: Independence: N:One sample Z-Interval Says randomly selected Approximately normal because the population is approximately normal I am assuming that Snickers has 80 bars or more in the 1-oz size

I:

C:I am 90% confident the true mean weight of Snickers 1-oz Fun-size candy bars is between.9921 and.9979 ounces. I am not confident that the candy bars weigh as advertised at the 90% level.

Choosing a Sample Size for a specific margin of error Note: Always round up! You can’t have part of a person! Ex: rounds up to 164.

Example #3 A statistician calculates a 95% confidence interval for the mean income of the depositors at Bank of America, located in a poverty stricken area. The confidence interval is $18,201 to $21,799. a.What is the sample mean income?

Example #3 A statistician calculates a 95% confidence interval for the mean income of the depositors at Bank of America, located in a poverty stricken area. The confidence interval is $18,201 to $21,799. b. What is the margin of error? m = 21,799 – 20,000 m = 1,799 m

Example #4 A researcher wishes to estimate the mean number of miles on four-year-old Saturn SCI’s. How many cars should be in a sample in order to estimate the mean number of miles within a margin of error of  1000 miles with 99% confidence assuming  =19,700.

10.2 – Estimating a Population Mean

In the 10.1 we made an unrealistic assumption that the population standard deviation was known and could be used to calculate confidence intervals.

When the standard deviation of a statistic is estimated from the data Standard Error:

When we know  we can use the Z-table to make a confidence interval. But, when we don’t know it, then we have to use something else!

Properties of the t-distribution: σ is unknown Degrees of Freedom = n – 1 More variable than the normal distribution (it has fatter tails than the normal curve) Approaches the normal distribution when the degrees of freedom are large (sample size is large). Area is found to the right of the t-value

Properties of the t-distribution: If n < 15, if population is approx normal, then so is the sample distribution. If the data are clearly non-Normal or if outliers are present, don’t use! If n > 15, sample distribution is normal, except if population has outliers or strong skewness If n  30, sample distribution is normal, even if population has outliers or strong skewness

Example #1 Determine the degrees of freedom and use the t-table to find probabilities for each of the following: DFPictureProbability P(t > 1.093)n =

DFPictureProbability P(t > 1.093)n = Example #1 Determine the degrees of freedom and use the t-table to find probabilities for each of the following:

Example #1 Determine the degrees of freedom and use the t-table to find probabilities for each of the following: DFPictureProbability P(t < 1.093)n =

Example #1 Determine the degrees of freedom and use the t-table to find probabilities for each of the following: DF Picture Probability P(t > 0.685)n =

Example #1 Determine the degrees of freedom and use the t-table to find probabilities for each of the following: DF Picture Probability P(t > 0.685)n =

Example #1 Determine the degrees of freedom and use the t-table to find probabilities for each of the following: DF Picture Probability P(t > 0.685)n =

Example #1 Determine the degrees of freedom and use the t-table to find probabilities for each of the following: DF Picture Probability P(0.70<t<1.093) n =

.25 –.15 = 0.1 Example #1 Determine the degrees of freedom and use the t-table to find probabilities for each of the following: DF Picture Probability P(0.70<t<1.093) n =

Calculator Tip:Finding P(t) 2 nd – Dist – tcdf( lower bound, upper bound, degrees of freedom)

One-Sample t-interval:

Calculator Tip:One sample t-Interval Stat – Tests – TInterval Data: If given actual values Stats: If given summary of values

Conditions for a t-interval: 1.SRS (population approx normal and n<15, or moderate size (15≤ n < 30) with moderate skewness or outliers, or large sample size n ≥ 30) (should say) (Population 10x sample size) 2. Normality 3. Independence

Robustness: The probability calculations remain fairly accurate when a condition for use of the procedure is violated The t-distribution is robust for large n values, mostly because as n increases, the t-distribution approaches the Z-distribution. And by the CLT, it is approx normal.

Example #2 Practice finding t* nDegrees of Freedom Confidence Interval t* n = 1099% CI n = 2090% CI n = 4095% CI n = 3099% CI 9

Example #2 Practice finding t* nDegrees of Freedom Confidence Interval t* n = 1099% CI n = 2090% CI n = 4095% CI n = 3099% CI

Example #2 Practice finding t* nDegrees of Freedom Confidence Interval t* n = 1099% CI n = 2090% CI n = 4095% CI n = 3099% CI

Example #2 Practice finding t* nDegrees of Freedom Confidence Interval t* n = 1099% CI n = 2090% CI n = 4095% CI n = 3099% CI

Example #2 Practice finding t* nDegrees of Freedom Confidence Interval t* n = 1099% CI n = 2090% CI n = 4095% CI n = 3099% CI

Example #3 As part of your work in an environmental awareness group, you want to estimate the mean waste generated by American adults. In a random sample of 20 American adults, you find that the mean waste generated per person per day is 4.3 pounds with a standard deviation of 1.2 pounds. Calculate a 99% confidence interval for  and explain it’s meaning to someone who doesn’t know statistics. P:The true mean waste generated per person per day.

A: SRS: Normality: Independence: N:One Sample t-interval Says randomly selected 15<n<30. We must assume the population doesn’t have strong skewness. Proceeding with caution! It is safe to assume that there are more than 200 Americans that create waste.

I: df = 20 – 1 =19

I: df = 20 – 1 = 19

C:I am 99% confident the true mean waste generated per person per day is between and pounds.

Matched Pairs t-procedures: Subjects are matched according to characteristics that affect the response, and then one member is randomly assigned to treatment 1 and the other to treatment 2. Recall that twin studies provide a natural pairing. Before and after studies are examples of matched pairs designs, but they require careful interpretation because random assignment is not used. Apply the one-sample t procedures to the differences

Confidence Intervals for Matched Pairs

Example #4 Archaeologists use the chemical composition of clay found in pottery artifacts to determine whether different sites were populated by the same ancient people. They collected five random samples from each of two sites in Great Britain and measured the percentage of aluminum oxide in each. Based on these data, do you think the same people used these two kiln sites? Use a 95% confidence interval for the difference in aluminum oxide content of pottery made at the sites and assume the population distribution is approximately normal. Can you say there is no difference between the sites? New Forrest Ashley Trails Difference

P:μ n = New Forrest percentage of aluminum oxide The true mean difference in aluminum oxide levels between the New Forrest and Ashley Trails. μ a = Ashley Trails percentage of aluminum oxide μ d = μ n - μ a = Difference in aluminum oxide levels

A: SRS: Normality: Independence: N:Matched Pairs t-interval Says randomly selected Says population is approx normal It is safe to assume that there are more than 50 samples available

I: df = 5 – 1 =4

I: df = 20 – 1 = 19

C:I am 95% confident the true mean difference in aluminum oxide levels between the New Forrest and Ashley Trails is between –1.754 and Can you say there is no difference between the sites? Yes, zero is in the confidence interval, so it is safe to say there is no difference.

Example #5 The National Endowment for the Humanities sponsors summer institutes to improve the skills of high school language teachers. One institute hosted 20 Spanish teachers for four weeks. At the beginning of the period, the teachers took the Modern Language Association’s listening test of understanding of spoken Spanish. After four weeks of immersion in Spanish in and out of class, they took the listening test again. (The actual spoken Spanish in the two tests was different, so that simply taking the first test should not improve the score on the second test.) Below is the pretest and posttest scores. Give a 90% confidence interval for the mean increase in listening score due to attending the summer institute. Can you say the program was successful?

SubjectPretestPosttestSubjectPretestPosttest

P:μ B = Pretest score The true mean difference in test scores between the Pretest and Posttest μ d = μ B - μ A = Difference in test scores μ A = Posttest score

A: SRS: Normality: We must assume the 20 teachers are randomly selected

A: SRS: Normality: Independence: N:Matched Pairs t-interval We must assume the 20 teachers are randomly selected 15<n<30 and distribution is approximately normal, so safe to assume It is safe to assume that there are more than 200 Spanish teachers

I: df = 20 – 1 = 19

I: df = 20 – 1 = 19

C:I am 90% confident the true mean difference in test scores between the Pretest and Posttest is between –2.689 and – Can you say the program was successful? Yes, zero is not in the confidence interval, so the pretest score is lower than the posttest score.

10.3 – Estimating a Population Proportion

Properties of :

Confidence Interval for a Population Proportion: Notice! We use Z* and not t*

Conditions for a p-interval: 1.SRS (should say) (Population 10x sample size) 2. Normality 3. Independence

Calculator Tip:One sample p-Interval Stat – Tests – 1–PropZInt x = # of successes in the sample

Example #1 A news release by the IRS reported 90% of all Americans fill out their tax forms correctly. A random sample of 1500 returns revealed that 1200 of them were correctly filled out. Calculate a 92% confidence interval for the proportion of Americans who correctly fill out their tax forms. Is the IRS correct in their report? P:The true percent of Americans who fill out their tax forms correctly

A: SRS: Normality: Independence: Says randomly selected It is safe to assume that there are more than 15,000 people who file their taxes 0.80 Yes, safe to assume an approximately normally distribution

N:One Sample Proportion Interval I: Z* = ?

?

Confidence Level (C)Upper tail prob.Z* Value 92% Z=?

Confidence Level (C)Upper tail prob.Z* Value 92% Z=? 0.04 

N:One Sample Proportion Interval I:

C:I am 92% confident the true percent of Americans who fill out their tax forms correctly is between 78.19% and 81.8% Is the IRS correct in their report? No, 90% is not in the interval!

Sample size for a Desired Margin of Error If we want the margin of error in a level C confidence interval for p to be m, then we need n subjects in the sample, where: n  p* = An estimate for

Note: If p is unknown use the most conservative value of p = 0.5. Since n is the sample size, it must be a whole number!!! Round up!

Example #2 You wish to estimate with 95% confidence; the proportion of computers that need repairs or have problems by the time the product is three years old. Your estimate must be accurate within 3.5% of the true proportion. a. Find the sample size needed if a prior study found that 19% of computers needed repairs or had problems by the time the product as three years old.

Example #2 You wish to estimate with 95% confidence; the proportion of computers that need repairs or have problems by the time the product is three years old. Your estimate must be accurate within 3.5% of the true proportion. b. If no preliminary estimate is available, find the most conservative sample size required.

Example #2 You wish to estimate with 95% confidence; the proportion of computers that need repairs or have problems by the time the product is three years old. Your estimate must be accurate within 3.5% of the true proportion. c. Compare the results from a and b. Using 0.5 makes the sample size very large, ensuring that enough people will be surveyed.