Stat 13, Tue 5/15/12. 1. Hand in HW5 2. Review list. 3. Assumptions and CLT again. 4. Examples. Hand in Hw5. Midterm 2 is Thur, 5/17. Hw6 is due Thu, 5/24.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Mean, Proportion, CLT Bootstrap
Chapter 8: Estimating with Confidence
Chapter 10: Estimating with Confidence
Statistics and Quantitative Analysis U4320
Ch. 8 – Practical Examples of Confidence Intervals for z, t, p.
STA291 Statistical Methods Lecture 13. Last time … Notions of: o Random variable, its o expected value, o variance, and o standard deviation 2.
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 23 = Finish Chapter “Confidence Interval Estimation” (CIE)
BCOR 1020 Business Statistics
Inference on averages Data are collected to learn about certain numerical characteristics of a process or phenomenon that in most cases are unknown. Example:
Standard error of estimate & Confidence interval.
Review of normal distribution. Exercise Solution.
Interval estimation ASW, Chapter 8 Economics 224, Notes for October 8, 2008.
1. Homework #2 2. Inferential Statistics 3. Review for Exam.
Confidence Interval Estimation
Stat 13, Tue 6/5/ Hand in hw7. 2. Practice problems. Final exam is Thur, 6/7, in class. It has 20 multiple choice questions. Only about 50% are on.
ESTIMATING with confidence. Confidence INterval A confidence interval gives an estimated range of values which is likely to include an unknown population.
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
AP Statistics Chapter 9 Notes.
Stat 13, Thu 5/10/ CLT again. 2. CIs. 3. Interpretation of a CI. 4. Examples. 5. Margin of error and sample size. 6. CIs using the t table. 7. When.
Estimates and Sample Sizes Lecture – 7.4
Chapter 8 Confidence Intervals Statistics for Business (ENV) 1.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
AP STATISTICS LESSON 10 – 1 (DAY 2)
1 Sampling Distributions Lecture 9. 2 Background  We want to learn about the feature of a population (parameter)  In many situations, it is impossible.
When σ is Unknown The One – Sample Interval For a Population Mean Target Goal: I can construct and interpret a CI for a population mean when σ is unknown.
Stat 13, Tue 5/8/ Collect HW Central limit theorem. 3. CLT for 0-1 events. 4. Examples. 5.  versus  /√n. 6. Assumptions. Read ch. 5 and 6.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.2 Estimating a Population Proportion.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Sampling Distribution of the Sample Mean. Example a Let X denote the lifetime of a battery Suppose the distribution of battery battery lifetimes has 
Statistical estimation, confidence intervals
STA Lecture 181 STA 291 Lecture 18 Exam II Next Tuesday 5-7pm Memorial Hall (Same place) Makeup Exam 7:15pm – 9:15pm Location TBA.
STA291 Statistical Methods Lecture 18. Last time… Confidence intervals for proportions. Suppose we survey likely voters and ask if they plan to vote for.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
1 Sampling Distributions. Central Limit Theorem*
Estimation Chapter 8. Estimating µ When σ Is Known.
Confidence intervals: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.2 Estimating a Population Proportion.
STA Lecture 191 STA 291 Lecture 19 Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm – 9:15pm Location CB 234.
Confidence Intervals Lecture 3. Confidence Intervals for the Population Mean (or percentage) For studies with large samples, “approximately 95% of the.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.2 Estimating a Population Proportion.
Chapter 8: Confidence Intervals based on a Single Sample
Chapter 10: Confidence Intervals
Confidence intervals. Want to estimate parameters such as  (population mean) or p (population proportion) Obtain a SRS and use our estimators, and Even.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Chapter 8: Confidence Intervals based on a Single Sample
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Stat 13, Thu 4/19/ Hand in HW2! 1. Resistance. 2. n-1 in sample sd formula, and parameters and statistics. 3. Probability basic terminology. 4. Probability.
1 Probability and Statistics Confidence Intervals.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.2 Estimating a Population Proportion.
CHAPTER 8 (4 TH EDITION) ESTIMATING WITH CONFIDENCE CORRESPONDS TO 10.1, 11.1 AND 12.1 IN YOUR BOOK.
Independent Samples: Comparing Means Lecture 39 Section 11.4 Fri, Apr 1, 2005.
CHAPTER 8 Estimating with Confidence
FINAL EXAMINATION STUDY MATERIAL II
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Inference: Conclusion with Confidence
Stat 35b: Introduction to Probability with Applications to Poker
Chapter 6: Sampling Distributions
Estimating the Population Mean Income of Lexus Owners
Stat 35b: Introduction to Probability with Applications to Poker
Chapter 8: Estimating with Confidence
Confidence Intervals: The Basics
Lecture 7 Sampling and Sampling Distributions
1. Homework #2 (not on posted slides) 2. Inferential Statistics 3
Confidence Intervals for Proportions
Estimates and Sample Sizes Lecture – 7.4
Presentation transcript:

Stat 13, Tue 5/15/ Hand in HW5 2. Review list. 3. Assumptions and CLT again. 4. Examples. Hand in Hw5. Midterm 2 is Thur, 5/17. Hw6 is due Thu, 5/24. On Thur, 5/17, I won’t be able to have my usual office hour from 230 to 3:30, so it will be instead from 1:30 to 2:15pm. The midterm is primarily on chapters 4-6, though it has a bit of probability also. The std normal and t tables will be provided. Again, you can have 1 page, double-sided, of notes, plus a calculator and a pen or pencil. 1

2. Review list, of stuff since the first midterm. a) More probability (i) Expected value. (ii) Expected value of sums of random variables. (iii) Bernoulli random variables. (iv) Binomial random variables. (v) Independence. b) Normal calculations. (i) Calculating the area under the normal curve between a and b. (ii) Normal percentiles. (iii) Normal probability plots. c) CLT and CIs. (i) CLT. (ii) Construction of CIs, for the mean and for proportions. (iii) Interpretation of CIs. (iii) SE versus . (iv) Margin of error. (v) Sample size calculations. (vi) CIs using the t table. (vii) Assumptions behind CIs. 2

3. Assumptions and Central Limit Theorem (CLT), again. If you have a SRS (or observations are iid with mean µ), and n is large (or the population is normally distributed), then is approximately normally distributed with mean µ and std deviation, where  is the std deviation of the population and n is the sample size. Equivalently, we can say [( - µ) ÷ ] is approximately standard normally distributed. When to use z* and t*. a) If it's a simple random sample (SRS) and the population is normal,  is unknown, and n is small (< 25), then use t*. b) If it's a SRS and the population is normal,  is known, and n is small (< 25), then use z*. c) If it's a SRS and n is large, then t* and z* are very close together, so it doesn't really matter which you use. Use z*, though the book recommends t*. d) If the population might NOT be normal and n is NOT large, then neither t* nor z* is appropriate. 3

4. Examples. In order to see what tv shows Americans watch, the Nielsen corporation surveys a sample of approximately 50,000 Americans. They recently (May 20, 2009) reported that the average American watches approximately 5.1 hours of tv a day. Suppose it’s a SRS and that the sample standard deviation is s = 2 hours per day. Find an 70%-CI for the population mean. Answer: It’s a SRS and n = 50,000 is large, so the standard formulas apply, but we don’t know  so we will plug in s. For a 70%-CI, we look for a value close to 15% (or 85%) in the table, and find the closest is (or 1.04), so z* = The 70% CI is +/- (z*)s/√n = 5.1 +/- (1.04)(2) ÷ √50,000 = 5.1 +/ What does this 5.1 +/ mean? It’s a range likely to contain µ. Here, n = 50,000 is so large that we can be confident that µ is likely close to 5.1. Q. A typical American watches ___ +/- ___ hours of tv per day? Q. If we took another SRS of 50,000 Americans, we’d expect to get a sample mean of around _____ +/- _____ hours of tv per day? Q. If we change this 70% CI to a 95% CI, will the margin of error increase or decrease? 4

4. Examples. In order to see what tv shows Americans watch, the Nielsen corporation surveys a sample of approximately 50,000 Americans. They recently (May 20, 2009) reported that the average American watches approximately 5.1 hours of tv a day. Suppose it’s a SRS and that the sample standard deviation is s = 2 hours per day. Find an 70%-CI for the population mean. Answer: It’s a SRS and n = 50,000 is large, so the standard formulas apply, but we don’t know  so we will plug in s. For a 70%-CI, we look for a value close to 15% (or 85%) in the table, and find the closest is (or 1.04), so z* = The 70% CI is +/- (z*)s/√n = 5.1 +/- (1.04)(2) ÷ √50,000 = 5.1 +/ What does this 5.1 +/ mean? It’s a range likely to contain µ. Here, n = 50,000 is so large that we can be confident that µ is likely close to 5.1. Q. A typical American watches _5.1_ +/- _2_ hours of tv per day? Q. If we took another SRS of 50,000 Americans, we’d expect to get a sample mean of around _____ +/- _____ hours of tv per day? Q. If we change this 70% CI to a 95% CI, will the margin of error increase or decrease? 5

4. Examples. In order to see what tv shows Americans watch, the Nielsen corporation surveys a sample of approximately 50,000 Americans. They recently (May 20, 2009) reported that the average American watches approximately 5.1 hours of tv a day. Suppose it’s a SRS and that the sample standard deviation is s = 2 hours per day. Find an 70%-CI for the population mean. Answer: It’s a SRS and n = 50,000 is large, so the standard formulas apply, but we don’t know  so we will plug in s. For a 70%-CI, we look for a value close to 15% (or 85%) in the table, and find the closest is (or 1.04), so z* = The 70% CI is +/- (z*)s/√n = 5.1 +/- (1.04)(2) ÷ √50,000 = 5.1 +/ What does this 5.1 +/ mean? It’s a range likely to contain µ. Here, n = 50,000 is so large that we can be confident that µ is likely close to 5.1. Q. A typical American watches _5.1_ +/- _2_ hours of tv per day? Q. If we took another SRS of 50,000 Americans, we’d expect to get a sample mean of around _5.1_ +/- _0.009_ hours of tv per day? Q. If we change this 70% CI to a 95% CI, will the margin of error increase or decrease? 6

4. Examples. In order to see what tv shows Americans watch, the Nielsen corporation surveys a sample of approximately 50,000 Americans. They recently (May 20, 2009) reported that the average American watches approximately 5.1 hours of tv a day. Suppose it’s a SRS and that the sample standard deviation is s = 2 hours per day. Find an 70%-CI for the population mean. Answer: It’s a SRS and n = 50,000 is large, so the standard formulas apply, but we don’t know  so we will plug in s. For a 70%-CI, we look for a value close to 15% (or 85%) in the table, and find the closest is (or 1.04), so z* = The 70% CI is +/- (z*)s/√n = 5.1 +/- (1.04)(2) ÷ √50,000 = 5.1 +/ What does this 5.1 +/ mean? It’s a range likely to contain µ. Here, n = 50,000 is so large that we can be confident that µ is likely close to 5.1. Q. A typical American watches _5.1_ +/- _2_ hours of tv per day? Q. If we took another SRS of 50,000 Americans, we’d expect to get a sample mean of around _5.1_ +/- _0.009_ hours of tv per day? Q. If we change this 70% CI to a 95% CI, will the margin of error increase or decrease? Increase. z* will go from 1.04 to Margin of error will almost double. 7

According to the CDC, the largest number of reported cases in the U.S. for any condition is chlamydia, a sexually transmitted disease reported each year in about 0.4% of people overall. In , the National Health and Nutrition Examination Survey (NHANES) took a SRS of 10,253 Americans, and among the (roughly) 510 females age 15-24, they found the prevalence of chlamydia to be 8.0%. Find a 95%-CI for the population percentage of females age with chlamydia. Answer: This is a 0-1 question. It’s a SRS and n is large because the number of females with chlamydia in the sample = 8% x 510 = 41 ≥ 10, and the number without = 92% x 520 = 469 ≥ 10. For a 95%-CI, z* = 1.96 from the bottom row of Table 4. The formula for the 95%-CI is +/- z*  /√n. We don't know  so use s = = √ (8.0% x 92.0%) ~ Our 95%-CI is 8.0% +/- (1.96) (0.271) / √510 which is 8.0% +/- 2.35%. 8

Suppose you take a SRS of 17 UCLA students and find their IQs. You find that is 120 and s = 20. Find a 95% CI for µ, the mean IQ for UCLA, assuming these IQs are normally distributed. Here we have a SRS, the pop. is normal, and  is unknown, so use the t table. df = n-1 = 17-1 = 16. From Table 4, for a 95% CI, with df = 16, t* = So, the 95% CI is +/- t* s/√n = 120 +/ (20)/√17 = 120 +/ What is the standard error? s/√n = 20/√17 = A typical UCLA student has an IQ of about ____ +/- _______. If you take other samples each of 17 UCLA students, you’d expect your samples typically to have a mean of about ______ +/- _______. 9

Suppose you take a SRS of 17 UCLA students and find their IQs. You find that is 120 and s = 20. Find a 95% CI for µ, the mean IQ for UCLA, assuming these IQs are normally distributed. Here we have a SRS, the pop. is normal, and  is unknown, so use the t table. df = n-1 = 17-1 = 16. From Table 4, for a 95% CI, with df = 16, t* = So, the 95% CI is +/- t* s/√n = 120 +/ (20)/√17 = 120 +/ What is the standard error? s/√n = 20/√17 = A typical UCLA student has an IQ of about _120_ +/- __20___. If you take other samples each of 17 UCLA students, you’d expect your samples typically to have a mean of about ______ +/- ______. 10

Suppose you take a SRS of 17 UCLA students and find their IQs. You find that is 120 and s = 20. Find a 95% CI for µ, the mean IQ for UCLA, assuming these IQs are normally distributed. Here we have a SRS, the pop. is normal, and  is unknown, so use the t table. df = n-1 = 17-1 = 16. From Table 4, for a 95% CI, with df = 16, t* = So, the 95% CI is +/- t* s/√n = 120 +/ (20)/√17 = 120 +/ What is the standard error? s/√n = 20/√17 = A typical UCLA student has an IQ of about _120_ +/- __20___. If you take other samples each of 17 UCLA students, you’d expect your samples typically to have a mean of about _120__ +/- _4.85__. 11