Chapter 17 Comparing Two Proportions

Slides:



Advertisements
Similar presentations
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
Advertisements

Comparing Two Proportions (p1 vs. p2)
Lecture 3 Outline: Thurs, Sept 11 Chapters Probability model for 2-group randomized experiment Randomization test p-value Probability model for.
Lecture 6 Outline – Thur. Jan. 29
Chap 9: Testing Hypotheses & Assessing Goodness of Fit Section 9.1: INTRODUCTION In section 8.2, we fitted a Poisson dist’n to counts. This chapter will.
HS 1678: Comparing Two Means1 Two Independent Means Unit 8.
HS 167Basics of Hypothesis Testing1 (a)Review of Inferential Basics (b)Hypothesis Testing Procedure (c)One-Sample z Test (σ known) (d)One-sample t test.
Sample size computations Petter Mostad
Lecture 5 Outline – Tues., Jan. 27 Miscellanea from Lecture 4 Case Study Chapter 2.2 –Probability model for random sampling (see also chapter 1.4.1)
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Lecture 5 Outline: Thu, Sept 18 Announcement: No office hours on Tuesday, Sept. 23rd after class. Extra office hour: Tuesday, Sept. 23rd from 12-1 p.m.
8 - 10: Intro to Statistical Inference
HS 1679: Inference About a Proportion1 Inference about a proportion Unit 9.
7/2/2015Basics of Significance Testing1 Chapter 15 Tests of Significance: The Basics.
Inferences About Process Quality
Chapter 17 Comparing Two Proportions
Chapter 9 Hypothesis Testing.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
How Can We Test whether Categorical Variables are Independent?
One Sample  M ean μ, Variance σ 2, Proportion π Two Samples  M eans, Variances, Proportions μ1 vs. μ2 σ12 vs. σ22 π1 vs. π Multiple.
September 15. In Chapter 18: 18.1 Types of Samples 18.2 Naturalistic and Cohort Samples 18.3 Chi-Square Test of Association 18.4 Test for Trend 18.5 Case-Control.
14. Introduction to inference
More About Significance Tests
Inference for a Single Population Proportion (p).
September 15. In Chapter 11: 11.1 Estimated Standard Error of the Mean 11.2 Student’s t Distribution 11.3 One-Sample t Test 11.4 Confidence Interval for.
Evidence-Based Medicine 3 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Confidence Intervals Nancy D. Barker, M.S.. Statistical Inference.
FRAMING RESEARCH QUESTIONS The PICO Strategy. PICO P: Population of interest I: Intervention C: Control O: Outcome.
October 15. In Chapter 9: 9.1 Null and Alternative Hypotheses 9.2 Test Statistic 9.3 P-Value 9.4 Significance Level 9.5 One-Sample z Test 9.6 Power and.
Essential Statistics Chapter 131 Introduction to Inference.
INTRODUCTION TO INFERENCE BPS - 5th Ed. Chapter 14 1.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
Chapter 14Introduction to Inference1 Chapter 14 Introduction to Inference.
Chapter 20 Testing hypotheses about proportions
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
October 15. In Chapter 19: 19.1 Preventing Confounding 19.2 Simpson’s Paradox 19.3 Mantel-Haenszel Methods 19.4 Interaction.
1October In Chapter 17: 17.1 Data 17.2 Risk Difference 17.3 Hypothesis Test 17.4 Risk Ratio 17.5 Systematic Sources of Error 17.6 Power and Sample.
The binomial applied: absolute and relative risks, chi-square.
November 15. In Chapter 12: 12.1 Paired and Independent Samples 12.2 Exploratory and Descriptive Statistics 12.3 Inference About the Mean Difference 12.4.
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
Essential Statistics Chapter 141 Thinking about Inference.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Issues concerning the interpretation of statistical significance tests.
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
1 Chapter 11: Analyzing the Association Between Categorical Variables Section 11.1: What is Independence and What is Association?
A short introduction to epidemiology Chapter 9: Data analysis Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
Mystery 1Mystery 2Mystery 3.
CHAPTER 27: One-Way Analysis of Variance: Comparing Several Means
AP Statistics Unit 5 Addie Lunn, Taylor Lyon, Caroline Resetar.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
© Copyright McGraw-Hill 2004
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
MATB344 Applied Statistics I. Experimental Designs for Small Samples II. Statistical Tests of Significance III. Small Sample Test Statistics Chapter 10.
1 Probability and Statistics Confidence Intervals.
A short introduction to epidemiology Chapter 6: Precision Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
Estimating a Population Proportion ADM 2304 – Winter 2012 ©Tony Quon.
16/23/2016Inference about µ1 Chapter 17 Inference about a Population Mean.
Inference for a Single Population Proportion (p)
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
The binomial applied: absolute and relative risks, chi-square
Chapter 8: Inference for Proportions
Epidemiology Kept Simple
Chapter 12: Comparing Independent Means
Risk ratios 12/6/ : Risk Ratios 12/6/2018 Risk ratios StatPrimer.
Essential Statistics Introduction to Inference
CHAPTER 6 Statistical Inference & Hypothesis Testing
AP Statistics Chapter 12 Notes.
Presentation transcript:

Chapter 17 Comparing Two Proportions 4/16/2017 4/16/2017 Chapter 17 Comparing Two Proportions April 17 Basic Biostat 1

In Chapter 17: 17.1 Data [17.2 Risk Difference] [17.3 Hypothesis Test] 17.4 Risk Ratio [17.5 Systematic Sources of Error] [17.6 Power and Sample Size]

Data conditions Binary response variables (“success/failure”) Binary explanatory variable Notation:

Incidence proportion ≡ average risk Sample Proportions Incidence proportion, exposed group: Incidence proportion, non-exposed group: Incidence proportion ≡ average risk

Example: WHI Estrogen Trial Group 1 n1 = 8506 Estrogen Treatment Compare risks of index disease Random Assignment Group 2 n2 = 8102 Placebo

2-by-2 Table Disease+ Disease− Total Exposure + a1 b1 n1 Exposure − a2 m1 m2 N Risk, non-exposed Risk, exposed

WHI Data D+ D− Total E+ 751 7755 8506 E- 623 7479 8102 1374 15234 16608 Compare these risks

Proportion Difference (Risk Difference) Quantifies excess risk in absolute terms

In large samples, the sampling distribution of the risk difference is approximately Normal

(1 – α)100% CI for p1 – p2 Plus-four method

Estrogen Trial, 95% CI for p1−p2 Data: a1 = 751, n1 = 8506, a2 = 623, n2 = 8102

Excess risk of between 0.3% and 2.0% (in absolute terms) 95% CI for p1−p2 Excess risk of between 0.3% and 2.0% (in absolute terms)

95% CI for p1 – p2 Plus-four method similar to Wilson’s score method. Output from WinPepi > Compare 2 program:

§17.3 Hypothesis Test C. P-value D. Interpret  evidence against H0 A. H0: p1 = p2 (equivalently H0: RR = 1) B. Test statistic (three options) z (large samples) Chi-square (large samples, next chapter) Fisher’s exact (any size sample) C. P-value D. Interpret  evidence against H0

z Test A. H0: p1 = p2 vs.Ha:p1 ≠ p2 (two-sided) B. C. One-sided P = Pr(Z ≥ |zstat|) Two-sided P = 2 × one-sided P

z Test Example A. H0: p1 = p2 against vs. Ha:p1 ≠ p2 B. Test statistic

One-sided P = Pr(Z ≥ 2.66) = .0039 Two-sided P = 2 × .0039 = .0078 The evidence against H0 is v. significant  proportions (average risks) differ significantly

z Test: Notes z statistic Numerator = observed difference Denominator = standard error when p1 = p2 A continuity correction can be optionally applied (p. 382) Equivalent to the chi-square test of association (HS 267) Avoid z tests in small samples; use exact binomial procedure (HS 267)

Fisher’s Exact Test All purpose test for testing H0: p1 = p2 Based on exact binomial probabilities Calculation intensive, but easy with modern software Comes in original and Mid-Probability corrected forms

Example: Fisher’s Test Data. The incidence of colonic necrosis in an exposed group is 2 of 117. The incidence in a non-exposed group is 0 of 862. Ask: Is this difference statistically significant? Hypothesis statements. Under the null hypothesis, there is no difference in risks in the two populations. Thus: H0: p1 = p2 Ha: p1 > p2 (one-sided) or Ha: p1 ≠ p2 (two-sided)

Fisher’s Test, Example B. Test statistic  none per se C. P-value. Use WinPepi > Compare2.exe > A. D. Interpret. P-value = .014  strong (“significant”) evidence against H0 D+ D− E+ 2 115 E− 862

§17.4 Proportion Ratio (Relative Risk) Compare incidences from the two groups in form of a RATIO Quantifies effect of the exposure in relative terms Relative Risk Parameter Relative Risk Estimator (“RR hat”)

Example: RR (WHI Data) + − Total Estrogen + 751 7755 8506 Estrogen − 623 7479 8102

Interpretation When p1 = p2, RR = 1  indicating “no association” RR > 1  positive association RR < 1  negative association The RR indicates how much the exposure multiplies the risk over the baseline risk of the non-exposed group RR of 1.15 suggests risk in exposed group is “1.15 times” that of non-exposed group Baseline RR is 1! Thus, an RR of 1.15 is 0.15 (15%) above the baseline

Confidence Interval for the RR To derive information about the precision of the estimate, calculate a (1– α)100% CI for the RR with this formula: ln ≡ natural log, base e

90% CI for RR, WHI D+ D− Total E+ 751 7755 8506 E− 623 7479 8102

WinPepi > Compare2.exe > Program B D+ D− Total E+ 751 7755 8506 E − 623 7479 8102 See prior slide for hand calculations

Confidence Interval for the RR Interpretation similar to other confidence intervals Interval intends to capture the parameter (in this case the RR parameter) Confidence level refers to confidence in the procedure CI length quantifies the precision of the estimate

§17.5 Systematic Error CIs and P-values address random error only In observational studies, systematic errors are more important than random error Consider three types of systematic errors: Confounding Information bias Selection bias

Confounding Confounding = mixing together of the effects of the explanatory variable with the extraneous factors. Example: WHI trial found 15% increase in risk in estrogen exposed group. Earlier observational studies found 40% lower in estrogen exposed groups. Plausible explanation: Confounding by extraneous lifestyles factors in observational studies

Information Bias Information bias - mismeasurement (misclassification) leading to overestimation or underestimation in risk Nondifferential misclassification (occurs to the same extent in the groups)  tends to bias results toward the null or have no effect Differential misclassification (one groups experiences a greater degree of misclassification than the other)  bias can be in either direction.

Nondifferential & Differential Misclassification - Examples

Selection Bias Selection bias ≡ systematic error related to manner in which study participants are selected Example. If we shoot an arrow into the broad side of a barn and draw a bull’s-eye where it had landed, have we identified anything that is nonrandom?

Sample Size & Power for Comparing Proportions Three approaches: n needed to estimate given effect with margin of error m (not covered in Ch 17) n needed to test H0 at given α and power Power of test of H0 under given conditions

Sample Size Requirements for Comparing Proportions Depends on: r ≡ sample size ratio = n2 / n1 1−β ≡ power (acceptable type II error rate) α ≡ significance level (type I error rate) p1 ≡ expected proportion, group 1 p2 ≡ expected proportion in group 2 Instead of specifying p1 and p2, we can specify p2 and the expected RR

Calculation Formulas on pp. 396 – 402 (complex) In practice  use WinPEPI > Compare2.exe > Sample size

WinPepi > Compare2 > S1