Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 1: Study Design for Demonstrating Lack of Treatment.

Slides:



Advertisements
Similar presentations
Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample data Elements of a hypothesis test: Null hypothesis.
Advertisements

Equivalence Testing Dig it!.
Biostatistics in Practice Session 3: Testing Hypotheses Peter D. Christenson Biostatistician
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
+ Chapter 10 Section 10.4 Part 2 – Inference as Decision.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 5: Reporting Subgroup Results.
Confidence Intervals © Scott Evans, Ph.D..
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
PSY 307 – Statistics for the Behavioral Sciences
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
Sample Size Determination In the Context of Hypothesis Testing
Sample Size Determination
Determining Statistical Significance
STAT 3130 Statistical Methods II Session 6 Statistical Power.
Sample Size Determination Ziad Taib March 7, 2014.
Biostatistics in Clinical Research Peter D. Christenson Biostatistician January 12, 2005IMSD U*STAR RISE.
Biostatistics for Coordinators Peter D. Christenson REI and GCRC Biostatistician GCRC Lecture Series: Strategies for Successful Clinical Trials Session.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Academic Viva POWER and ERROR T R Wilson. Impact Factor Measure reflecting the average number of citations to recent articles published in that journal.
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 2: Sample Size & Power for Inequality and Equivalence Studies.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Randomized Trial of Preoperative Chemoradiation Versus Surgery Alone in Patients with Locoregional Esophageal Carcinoma, Ursa et al. Statistical Methods:
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 4: Study Size and Power.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 4: Study Size and Power.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 6: Case Study.
Using Inference to MAKE DECISIONS The Type I and Type II Errors in Hypothesis Testing.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 3: Testing Hypotheses.
Copyright © 2011 Pearson Education, Inc. Putting Statistics to Work.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 4: Study Size for Precision or Power.
Chapter 21: More About Tests
Power & Sample Size Dr. Andrea Benedetti. Plan  Review of hypothesis testing  Power and sample size Basic concepts Formulae for common study designs.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 1: Demonstrating Equivalence of Active Treatments:
© Copyright McGraw-Hill 2004
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 2: Aging and Survival.
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 3: Testing Hypotheses.
Multiple comparisons 郭士逢 輔大生科系 2008 Advanced Biostatistics.
Biostatistics Case Studies 2009 Peter D. Christenson Biostatistician Session 4: The Logic Behind Statistical Adjustment.
The Practice of Statistics Third Edition Chapter 11: Testing a Claim Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 1: Demonstrating Equivalence of Active Treatments:
Biostatistics Case Studies 2016 Youngju Pak, PhD. Biostatistician Session 2 Understanding Equivalence and Noninferiority testing.
Critical Appraisal Course for Emergency Medicine Trainees Module 2 Statistics.
6. Statistical Inference: Significance Tests
More on Inference.
Sample Size Determination
CHAPTER 9 Testing a Claim
Biostatistics Case Studies 2007
The Importance of Adequately Powered Studies
P-values.
CHAPTER 9 Testing a Claim
Chapter 8: Inference for Proportions
More on Inference.
P-value Approach for Test Conclusion
Introduction to Inference
Statistical inference
Statistical Inference
Volume 46, Issue 5, Pages (May 2007)
Psych 231: Research Methods in Psychology
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Chapter 9: Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
AP STATISTICS LESSON 10 – 4 (DAY 2)
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
Statistical inference
Presentation transcript:

Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 1: Study Design for Demonstrating Lack of Treatment Effect: Equivalence or Non-inferiority

Case Study

p ASA+PPI = 1.5% Demonstrate: p clop – p ASA+PPI ≤ 4% N=145/group Power=80% for what?

Typical Analysis: Inferiority or Superiority H 0 : p clop – p ASA+PPI = 0% H 1 : p clop – p ASA+PPI ≠ 0% H 1 → therapies differ α = 0.05 Power = 80% for Δ=|p clop - p ASA+PPI | =? Clop inferior = 95% CI for p clop – p ASA+PPI Clop superior 0 0 p clop – p ASA+PPI [Not used in this paper] 0 p clop – p ASA+PPI No diff detected* * and 80% chance that a Δ of (?) or more would be detected.

Typical Analysis: Inferiority or Superiority H 0 : p clop – p ASA+PPI = 0% H 1 : p clop – p ASA+PPI ≠ 0% H 1 → therapies differ α = 0.05 Power = 80% for Δ=|p clop - p ASA+PPI | =? [Not used in this paper] So, N=331/group → 80% chance that a Δ of 4% or more would be detected. Detectable Δ = 5.5%-1.5%=4%

Typical Analysis: Inferiority or Superiority H 0 : p clop – p ASA+PPI = 0% H 1 : p clop – p ASA+PPI ≠ 0% H 1 → therapies differ α = 0.05 Power = 80% for Δ=|p clop - p ASA+PPI | =4% [Not used in this paper] H 0 : p clop – p ASA+PPI ≤ 0% H 1 : p clop – p ASA+PPI > 0% H 1 → clop inferior Note that this could be formulated as two one-sided tests (TOST): α = Power = 80% for p clop - p ASA+PPI =4% H 0 : p clop – p ASA+PPI ≥ 0% H 1 : p clop – p ASA+PPI < 0% H 1 → clop superior α = Power = 80% for p clop - p ASA+PPI =-4%

Demonstrating Equivalence H 0 : |p clop – p ASA+PPI | ≥ E% H 1 : |p clop – p ASA+PPI | < E% H 1 → therapies “equivalent”, within E [Not used in this paper] H 0 : p clop – p ASA+PPI ≤ -4% H 1 : p clop – p ASA+PPI > -4% H 1 → clop non-superior Note that this could be formulated as two one-sided tests (TOST): α = Power = 80% for p clop - p ASA+PPI = 0% H 0 : p clop – p ASA+PPI ≥ 4% H 1 : p clop – p ASA+PPI < 4% H 1 → clop non-inferior α = Power = 80% for p clop - p ASA+PPI = 0%

Demonstrating Equivalence H 0 : |p clop – p ASA+PPI | ≥ 4% H 1 : |p clop – p ASA+PPI | < 4% H 1 → equivalence α = 0.05 Power = 80% for p clop - p ASA+PPI = 0 Clop non-superior = 95% CI for p clop – p ASA+PPI Clop non-inferior 0 0 p clop – p ASA+PPI 0 Equivalence* * both non-superior and non-inferior.

This Paper: Inferiority and Non-Inferiority H 0 : p clop – p ASA+PPI ≤ 0% H 1 : p clop – p ASA+PPI > 0% H 1 → clop inferior Apparently, two one-sided tests (TOST), but only one explicitly powered: α = Power = 80% for p clop - p ASA+PPI = ?% H 0 : p clop – p ASA+PPI ≥ 4% H 1 : p clop – p ASA+PPI < 4% H 1 → clop non-inferior α = Power = 80% for p clop - p ASA+PPI = 0% The authors chose E=4% as the maximum therapy difference that therapies are considered equivalent.

This Paper: Inferiority and Non-Inferiority Clop inferior = 95% CI for p clop – p ASA+PPI Clop non-inferior 0 0 p clop – p ASA+PPI 0 “Non-clinical” inferiority* * clop is statistically inferior, but not enough for clinical significance. Decisions: Observed Results: p clop = 8.6%; p ASA+PPI = 0.7%; 95% CI = 3.4 to p clop – p ASA+PPI Clop inferior

Power for Test of Clopidrogrel Non-Inferiority H 0 : p clop – p ASA+PPI ≥ 4% H 1 : p clop – p ASA+PPI < 4% H 1 → clop non-inferior α = Power = 80% for p clop - p ASA+PPI = 0%

Power for Test of Clopidrogrel Inferiority H 0 : p clop – p ASA+PPI ≤ 0% H 1 : p clop – p ASA+PPI > 0% H 1 → clop inferior α = Power = 80% for p clop - p ASA+PPI = 7.3% Detectable Δ = 8.8%-1.5%=7.3%

Conclusions: This Paper In this paper, clop was so inferior that investigators were apparently lucky to have enough power for detecting it. The CI was too wide with this N for detecting a smaller therapy difference. Investigators justify testing non-inferiority of clop only (and not of Aspirin + Nexium) with the lessened desirability of combination therapy (?). I feel that this is a good approach for size and power for a new competing therapy against a standard, if the N for clop inferiority had been considered also. Note that power calculations were based on actual %s of subjects, whereas cumulative 12-month incidence was used in the analysis. There are not power calculations for equivalency tests using survival analysis, that I know of.

Conclusions: General “Negligibly inferior” would be better than non-inferior. All inference can be based on confidence intervals. Pre-specify the comparisons to be made, which can be defined as where confidence intervals lie. Ns are smaller for equivalence tests, but study may be underpowered to detect differences if that is the case, unless specifically designed for that. Power for only one or for multiple comparisons. Power can be different for different comparisons. For large N, reversing α and β=1-power for the typical test gives the same N as for equivalence test.

Appendix: Possible Errors in Study Conclusions Truth: H 0 : No EffectH 1 : Effect No Effect Effect Study Claims: Correct Error (Type I) Error (Type II) Power: Maximize Choose N for 80% Set α=0.05 Specificity=95% Specificity Sensitivity Typical study to demonstrate superiority/inferiority

Appendix: Graphical Representation of Power H0H0 HAHA H 0 : true effect=0 H A : true effect=3 Effect in study=1.13 \\\ = Probability of concluding H A if H 0 is true. 41% 5% Effect (Group B mean – Group A mean) /// = Probability of concluding H 0 if H A is true. Power=100-41=59% Note greater power if larger N, and/or if true effect>3, and/or less subject heterogeneity. N=100 per Group Larger Ns give narrower curves Typical study to demonstrate superiority/inferiority

Appendix: Online Study Size / Power Calculator Does NOT include tests for equivalence or non- inferiority or non- superiority