Week 8 Confidence Intervals for Means and Proportions.

Slides:



Advertisements
Similar presentations
Estimation in Sampling
Advertisements

1 Chapter 12 Inference About One Population Introduction In this chapter we utilize the approach developed before to describe a population.In.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Estimating Proportions with Confidence Chapter 10.
Chapter 19 Confidence Intervals for Proportions.
Confidence Intervals for Proportions
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 18, Slide 1 Chapter 18 Confidence Intervals for Proportions.
Week 10 Comparing Two Means or Proportions. Generalising from sample IndividualsMeasurementGroupsQuestion Children aged 10 Mark in maths test Boys & girls.
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Estimating Means with Confidence
6.1 Confidence Intervals for the Mean (Large Samples)
Chapter 19: Confidence Intervals for Proportions
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
10.3 Estimating a Population Proportion
Copyright ©2011 Brooks/Cole, Cengage Learning Understanding Sampling Distributions: Statistics as Random Variables Chapter 9 1.
CONFIDENCE INTERVALS of Means AP STATISTICS, CHAPTER 19 Mrs. Watkins.
QBM117 Business Statistics Estimating the population mean , when the population variance  2, is known.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Estimation PowerPoint Prepared by Alfred P. Rovai.
Chapter 8 Confidence Intervals Statistics for Business (ENV) 1.
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
When σ is Unknown The One – Sample Interval For a Population Mean Target Goal: I can construct and interpret a CI for a population mean when σ is unknown.
June 25, 2008Stat Lecture 14 - Two Means1 Comparing Means from Two Samples Statistics 111 – Lecture 14 One-Sample Inference for Proportions and.
Section 2 Part 2.   Population - entire group of people or items for which we are collecting data  Sample – selections of the population that is used.
Estimating and Constructing Confidence Intervals.
CHAPTER SIX Confidence Intervals.
1 How do we interpret Confidence Intervals (Merit)? A 95% Confidence Interval DOES NOT mean that there is a 95 % probability that the population mean lies.
Section 10.1 Confidence Intervals
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Estimation PowerPoint Prepared by Alfred P. Rovai.
Confidence Intervals Lecture 3. Confidence Intervals for the Population Mean (or percentage) For studies with large samples, “approximately 95% of the.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
1 Section 10.1 Estimating with Confidence AP Statistics January 2013.
6.1 Inference for a Single Proportion  Statistical confidence  Confidence intervals  How confidence intervals behave.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Economics 173 Business Statistics Lecture 3 Fall, 2001 Professor J. Petry
Chapter 10: Confidence Intervals
Sample Means & Proportions
STT 315 Ashwini maurya This lecture is based on Chapter 5.4 Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil.
Copyright © 2009 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Chapter 19 Confidence intervals for proportions
Chapter 7 Estimates, Confidence Intervals, and Sample Sizes
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
Measuring change in sample survey data. Underlying Concept A sample statistic is our best estimate of a population parameter If we took 100 different.
1 Probability and Statistics Confidence Intervals.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Section 9.1 Sampling Distributions AP Statistics January 31 st 2011.
Chapter 7: Inferences Based on a Single Sample: Estimation with Confidence Interval Statistics.
Estimating a Population Proportion ADM 2304 – Winter 2012 ©Tony Quon.
Statistics 19 Confidence Intervals for Proportions.
O Make Copies of: o Areas under the Normal Curve o Appendix B.1, page 784 o (Student’s t distribution) o Appendix B.2, page 785 o Binomial Probability.
Warm Up- Class work Activity 10 Pg.534 -each person tosses the pin 50 times
Chapter 8 Confidence Intervals Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Dr.Theingi Community Medicine
Chapter 10 Confidence Intervals for Proportions © 2010 Pearson Education 1.
More About Confidence Intervals
Confidence Intervals for Proportions
Statistics Chapter 7: Inferences Based on a Single Sample: Estimation with Confidence Interval.
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Confidence Intervals for Proportions
Confidence Intervals for Proportions
Statistics Chapter 7: Inferences Based on a Single Sample: Estimation with Confidence Interval.
Week 10 Chapter 16. Confidence Intervals for Proportions
Statistics Chapter 7: Inferences Based on a Single Sample: Estimation with Confidence Interval.
CHAPTER 22: Inference about a Population Proportion
Confidence Intervals for Proportions
Confidence Intervals for Proportions
Statistics Chapter 7: Inferences Based on a Single Sample: Estimation with Confidence Interval.
Confidence Intervals for Proportions
Presentation transcript:

Week 8 Confidence Intervals for Means and Proportions

Inference Data are a single sample Interested in underlying population, not specific sample Sample gives information about population Randomness of sample means uncertainty Called inference about population

Types of inference Focus on value of population parameter e.g. mean or proportion (probability) Estimation What is the value of the parameter? Hypothesis testing Is the parameter equal to a specific value (usually zero)?

Point estimate To estimate population parameter, use corresponding sample statistic e.g. Likely to be an error in estimate e.g. How big is error likely to be?

Error distribution Error is random Simulation from an ‘approx’ population could build up error distribution Shows how large error from actual sample data is likely to be

Example Silkworm survival after arsenic poisoning How long will 1 / 4 survive? What is upper quartile?

Simulation Approx population (same mean & sd as data) Target = UQ from normal = sec

Simulation (cont) Sample UQs ≠ target Simulation shows error distribution Error in estimate (292 sec) unlikely to be more than 10 sec.

Error distn for proportion Simulation is not needed DistributionMeanSt devn # success x binomial (n,  )n  Propn(success) p = x / n  Error p -  0

Standard error of proportion Approx error distn bias = 0 standard error =

Teens and interracial dating Point estimate: Bias = 0 Standard error = 1997 USA Today/Gallup Poll of teenagers across country: 57% of the 497 teens who go out on dates say they’ve been out with someone of another race or ethnic group. = 0.57

Error distn (interracial dating)      = 0,  = General normal Error distn Error in estimate, p = 0.57, unlikely to be more than 0.05 almost certainly less than 0.07

Interval estimates Survey 150 randomly selected students and 41% think marijuana should be legalized. If we report between 33% and 49% of all students at the college think that marijuana should be legalized, how confident can we be that we are correct? Confidence interval: an interval of estimates that is ‘likely’ to capture the population value.

95% confidence interval Legalise? p = 0.41, n = rule of thumb Prob(error < 2 x ) is approx 95% We are 95% confident that  is between 0.41 – and and % Conf Interval

Interpreting 95% C.I. Confidence interval is function of sample data Random It may not include population parameter (  here) In repeated samples, about 95% of CIs calculated as described will include  We therefore say we are 95% confident that our single CI will include 

Teens and interracial dating Point estimate: Standard error = 95% C.I. is to to USA Today/Gallup Poll of teenagers across country: 57% of the 497 teens who go out on dates say they’ve been out with someone of another race or ethnic group. = 0.57 We would prefer more decimals!

Teens and interracial dating 95% C.I. is to We do not know whether  is between and However 95% of CIs calculated in this way will work We are therefore 95% confident that is in (0.526, 0.614)

St error & width of 95% C.I. Smallest s.e. and C.I. width when: n is large p is close to 0 or 1 Biggest s.e. and C.I. width when: n is small p is close to 0.5

Margin of error Public opinion polls usually estimate several popn proportions. Each has its own “± 2 s.e.” describing accuracy n = 350 propn± 2 x s.e. Will vote for A0.45± Will vote for X0.04± Happy with govt0.66± Wants tax cut0.87±

Margin of error (cont) n = 350 Maximum possible is propn± 2 x s.e. Will vote for A0.45± Will vote for X0.04± Happy with govt0.66± Wants tax cut0.87± “Margin of error” for poll

Requirements for C.I. Sample should be randomly selected from population “Large” sample size — at least 10 success and 10 failure (though some say only 5 needed) If finite population, at least 10 times sample size

Case Study : Nicotine Patches vs Zyban Study: New England Journal of Medicine 3/4/99) 893 participants randomly allocated to four treatment groups: placebo, nicotine patch only, Zyban only, and Zyban plus nicotine patch. Participants blinded: all used a patch (nicotine or placebo) all took a pill (Zyban or placebo). Treatments used for nine weeks.

Nicotine Patches vs Zyban (cont) Conclusions: Zyban is effective (no overlap of Zyban and not Zyban CIs) Nicotine patch is not particularly effective (overlap of patch and no patch CIs)

Error distribution for mean Again, a simulation is unnecessary to find the error distribution (approx) DistributionMeanSt devn Sample mean Approx normal  Error Approx normal 

Standard error of mean Approx error distn bias = 0 standard error =

Poll: Class of 175 students. In a typical day, about how much time to you spend watching television? Mean hours watching TV n Mean MedianStDev Point estimate: Bias = 0 Standard error, = 2.09 hours

Standard devn & standard error Sample standard deviation is approx  stay similar if n increases Standard error of mean is usually less than  decreases as n increases Don’t get mixed up between the two!

Error distn (hours watching TV)      = 0,  = General normal Error distn Error in estimate, = 2.09 hours, unlikely to be more than 0.25 hrs almost certainly less than 0.4 hrs

General form for 95% C.I.  se  se  se Error distn If error distn is normal zero bias & we can find s.e. Prob( error is in ± 2 s.e.) is approx % confidence interval: estimate ± 2 s.e. 95% confidence interval: estimate ± 1.96 s.e. (if really sure error distn is normal)

95% confidence interval Mean hrs watching TV? rule of thumb Prob(error < 2 x 0.124) is approx 95% We are 95% confident that  is between 2.09 – and and 2.34 hours 95% C. I. = 2.09 hrs, n = 175

Requirements for C.I. Sample should be randomly selected from population “Large” sample size — n > 30 is often recommended If finite population, at least 10 times sample size

Problem with small n Known  Unknown  Variable width Less likely to include  Confidence level less than 95% works fine

C.I. for mean, small n Solution is to replace 1.96 (or 2) by a bigger number. Look up t-tables with (n - 1) ‘degrees of freedom’ Sample size, nd.f., n – 1t n

Example: Mean Forearm Length Data:From random sample of n = 9 men 25.5, 24.0, 26.5, 25.5, 28.0, 27.0, 23.0, 25.0, % C.I.: 25.5  2.31(.507) => 25.5  1.17 => to cm df = 8 t 8 = 2.31

What Students Sleep More? Q: How many hours of sleep did you get last night, to the nearest half hour? Notes:  CI for Stat 10 is wider (smaller sample size)  Two intervals do not overlap Class n Mean StDev SE Mean Stat 10 (stat literacy) Stat 13 (stat methods)

Interpreting 95% C.I. Confidence interval is function of sample data Random It may not include population parameter (  here) In repeated samples, about 95% of CIs calculated as described will include  We therefore say we are 95% confident that our single CI will include 