Inference for Proportions

Slides:



Advertisements
Similar presentations
Chapter 12: Inference for Proportions BY: Lindsey Van Cleave.
Advertisements

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 10/23/12 Sections , Single Proportion, p Distribution (6.1)
Hypothesis Testing: Intervals and Tests
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
Statistics: Unlocking the Power of Data Lock 5 Inference Using Formulas STAT 101 Dr. Kari Lock Morgan Chapter 6 t-distribution Formulas for standard errors.
STAT 101 Dr. Kari Lock Morgan Exam 2 Review.
Connecting Simulation- Based Inference with Traditional Methods Kari Lock Morgan, Penn State Robin Lock, St. Lawrence University Patti Frazer Lock, St.
Dr. Kari Lock Morgan Department of Statistics Penn State University Teaching the Common Core: Making Inferences and Justifying Conclusions ASA Webinar.
Inference for Categorical Variables 2/29/12 Single Proportion, p Distribution Intervals and tests Difference in proportions, p 1 – p 2 One proportion or.
Introducing Inference with Simulation Methods; Implementation at Duke University Kari Lock Morgan Department of Statistical Science, Duke University
Statistics: Unlocking the Power of Data Lock 5 Inference for Proportions STAT 250 Dr. Kari Lock Morgan Chapter 6.1, 6.2, 6.3, 6.7, 6.8, 6.9 Formulas for.
One Sample  M ean μ, Variance σ 2, Proportion π Two Samples  M eans, Variances, Proportions μ1 vs. μ2 σ12 vs. σ22 π1 vs. π Multiple.
Statistics: Unlocking the Power of Data Lock 5 Normal Distribution STAT 250 Dr. Kari Lock Morgan Chapter 5 Normal distribution Central limit theorem Normal.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
Using Lock5 Statistics: Unlocking the Power of Data
Estimation: Sampling Distribution
Introducing Inference with Simulation Methods; Implementation at Duke University Kari Lock Morgan Department of Statistical Science, Duke University
Statistics: Unlocking the Power of Data Lock 5 Normal Distribution STAT 101 Dr. Kari Lock Morgan 10/18/12 Chapter 5 Normal distribution Central limit theorem.
Confidence Intervals: Bootstrap Distribution
1 When we free ourselves of desire, we will know serenity and freedom.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
1 When we free ourselves of desire, we will know serenity and freedom.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Statistics: Unlocking the Power of Data Lock 5 Inference for Means STAT 250 Dr. Kari Lock Morgan Sections 6.4, 6.5, 6.6, 6.10, 6.11, 6.12, 6.13 t-distribution.
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Statistics: Unlocking the Power of Data Lock 5 Normal Distribution STAT 250 Dr. Kari Lock Morgan Chapter 5 Normal distribution (5.1) Central limit theorem.
Inference for Proportions Section Starter Do dogs who are house pets have higher cholesterol than dogs who live in a research clinic? A.
Statistics: Unlocking the Power of Data Lock 5 Inference for Means STAT 250 Dr. Kari Lock Morgan Sections 6.4, 6.5, 6.6, 6.10, 6.11, 6.12, 6.13 t-distribution.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 3 – Slide 1 of 27 Chapter 11 Section 3 Inference about Two Population Proportions.
Statistics: Unlocking the Power of Data Lock 5 Inference for Proportions STAT 250 Dr. Kari Lock Morgan Chapter 6.1, 6.2, 6.3, 6.7, 6.8, 6.9 Formulas for.
+ Unit 5: Estimating with Confidence Section 8.3 Estimating a Population Mean.
Dr.Theingi Community Medicine
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Inference: Conclusion with Confidence
Review of Power of a Test
Lecture #25 Tuesday, November 15, 2016 Textbook: 14.1 and 14.3
CHAPTER 8 Estimating with Confidence
CHAPTER 8 Estimating with Confidence
CHAPTER 8 Estimating with Confidence
More on Inference.
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Significance Test for the Difference of Two Proportions
Normal Distribution Chapter 5 Normal distribution
Chapter 4. Inference about Process Quality
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
This Week Review of estimation and hypothesis testing
AP STATISTICS REVIEW INFERENCE
When we free ourselves of desire,
Daniela Stan Raicu School of CTI, DePaul University
Measuring Evidence with p-values
More on Inference.
Connecting Intuitive Simulation-Based Inference to Traditional Methods
Introduction to Inference
CHAPTER 22: Inference about a Population Proportion
YOU HAVE REACHED THE FINAL OBJECTIVE OF THE COURSE
Confidence Intervals: The Basics
Daniela Stan Raicu School of CTI, DePaul University
STAT Z-Tests and Confidence Intervals for a
Chapter 10: Estimating with Confidence
CHAPTER 6 Statistical Inference & Hypothesis Testing
CHAPTER 6 Statistical Inference & Hypothesis Testing
Estimating a Population Proportion
CHAPTER 12 Inference for Proportions
CHAPTER 12 Inference for Proportions
Chapter 8: Estimating with Confidence
Inference for Proportions
Presentation transcript:

Inference for Proportions STAT 250 Dr. Kari Lock Morgan Inference for Proportions Sections 6.1, 6.3 Formulas for standard errors Normal based inference

Confidence Interval Formula IF SAMPLE SIZES ARE LARGE… From N(0,1) From original data From bootstrap distribution

From randomization distribution Formula for p-values IF SAMPLE SIZES ARE LARGE… From original data From H0 From randomization distribution Compare z to N(0,1) for p-value

Standard Error Wouldn’t it be nice if we could compute the standard error without doing thousands of simulations? We can!!!

Standard Error Formulas Parameter Distribution Standard Error Proportion Normal Difference in Proportions Mean t, df = n – 1 Difference in Means t, df = min(n1, n2) – 1

SE Formula Observations n is always in the denominator (larger sample size gives smaller standard error) Standard error related to square root of 1/n Standard error formulas use population parameters… (uh oh!) For intervals, plug in the sample statistic(s) as your best guess at the parameter(s)

Standard Error Formulas Parameter Distribution Standard Error Proportion Normal Difference in Proportions Mean t, df = n – 1 Difference in Means t, df = min(n1, n2) – 1

Hitchhiker Snails A type of small snail (Tornatellides boeningi) is very widespread in Japan, and colonies of the snails that are genetically very similar have been found very far apart. How could the snails travel such long distances? Biologist Shinichiro Wada fed 174 live snails to birds, and found that 26 were excreted live out the other end. (The snails are apparently able to seal their shells shut to keep the digestive fluids from getting in). Yong, E. “The Scatological Hitchhiker Snail,” Discover, October 2011, 13.

Question #1 of the Day What proportion of “hitchhiker” snails survive when ingested by a bird?

Interval for Proportion statistic ± z* × SE Calculate sample statistic Find z* for desired level of confidence Calculate standard error Use statistic ± z* × SE Interpret in context

Sample Statistic 26 of 174 snails survived

z* for 90%

Difference in Proportions Standard Error Parameter Distribution Standard Error Proportion Normal Difference in Proportions Mean t, df = n – 1 Difference in Means t, df = min(n1, n2) – 1

Bootstrap Standard Error

Interval for Proportion

Bootstrap Interval

Different Birds The snails were fed to two different birds 14.3% of the 119 snails fed to Japanese white-eyes survived 16.4% of the 55 snails fed to brown-eared bulbuls survived Wada, S., Kawakami, K., & Chiba, S. Snails can survive passage through a bird’s digestive system. Journal of Biogeography, June 21, 2011.

Your Turn! Give a 95% confidence interval for the difference in proportions, white-eye – bulbul. Japanese white-eye: 14.3% out of 119 Brown-eared bulbul: 16.4% out of 55 (-0.10, 0.06) (-0.12, 0.08) (-0.14, 0.10) (-0.17, 0.13)

Your Turn

Question #2 of the Day Does hormone replacement therapy cause increased risk of breast cancer?

Hormone Replacement Therapy Until 2002, hormone replacement therapy (HRT), estrogen and/or progesterone, was commonly prescribed to post-menopausal women. This changed in 2002, when the results of a large clinical trial were published 8506 women were randomized to take HRT, 8102 were randomized to placebo. 166 HRT and 124 placebo women developed any kind of cancer.

Hypothesis Test State hypotheses Calculate sample statistic Calculate SE Calculate z = (statistic – null)/SE Compare z to standard normal distribution to find p-value. Make a conclusion.

SE Formulas What to plug in for the unknown population parameters? For testing, Best: plug in the null value for the parameter(s), because you want the distribution assuming H0 true (if the parameters have anything to do with H0) Acceptable for this course: just use sample statistics (this is different from the textbook)

Hypotheses H0: p1= p2, Ha: p1≠ p2 H0: p1= p2, Ha: p1> p2 Does hormone replacement therapy cause increased risk of invasive breast cancer? p1 = proportion of women taking HRT who get invasive breast cancer p2 = proportion of women not taking HRT who get invasive breast cancer H0: p1= p2, Ha: p1≠ p2 H0: p1= p2, Ha: p1> p2 H0: p1= p2, Ha: p1< p2

Calculate Sample Statistic

Calculate SE

Standard Error

Calculate z-statistic

p-value

p-value

Conclusion If there were no difference between HRT and placebo regarding invasive breast cancer, we would only see results this extreme about 2 out of 100 times. We have evidence that HRT increases risk of invasive breast cancer. (The trial was terminated because of this, and hormone replacement therapy is now no longer routinely recommended)

Your Turn! Same trial, different variable of interest. 8506 women were randomized to take HRT, 8102 were randomized to placebo. 502 HRT and 458 placebo women developed any kind of cancer. Does hormone replacement therapy cause increased risk of cancer in general? Yes No

Your Turn

Margin of Error For a single proportion, what is the margin of error?

Margin of Error You can choose your sample size in advance, depending on your desired margin of error! Given this formula for margin of error, solve for n.

Margin of Error

Margin of Error Suppose we want to estimate a proportion with a margin of error of 0.03 with 95% confidence. How large a sample size do we need? About 100 About 500 About 1000 About 5000

To Do HW 5.1, 5.2, 6.13 (due Monday, 3/27)