Biostatistics Case Studies 2016 Youngju Pak, PhD. Biostatistician Session 2 Understanding Equivalence and Noninferiority testing.

Slides:



Advertisements
Similar presentations
ISSUES THAT PLAGUE NON- INFERIORITY TRIALS PAST AND FUTURE RALPH B. DAGOSTINO, SR. BOSTON UNIVERSITY HARVARD CLINICAL RESEARCH INSTITUTE.
Advertisements

Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample data Elements of a hypothesis test: Null hypothesis.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 1: Study Design for Demonstrating Lack of Treatment.
The equivalence trial Didier Concordet NATIONAL VETERINARY S C H O O L T O U L O U S E.
LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.
Introduction to Statistics
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 5: Reporting Subgroup Results.
Confidence Intervals © Scott Evans, Ph.D..
1 Equivalence and Bioequivalence: Frequentist and Bayesian views on sample size Mike Campbell ScHARR CHEBS FOCUS fortnight 1/04/03.
Statistical inference form observational data Parameter estimation: Method of moments Use the data you have to calculate first and second moment To fit.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Sample Size and Power.
Horng-Chyi HorngStatistics II41 Inference on the Mean of a Population - Variance Known H 0 :  =  0 H 0 :  =  0 H 1 :    0, where  0 is a specified.
Sample Size Determination Ziad Taib March 7, 2014.
Power and Non-Inferiority Richard L. Amdur, Ph.D. Chief, Biostatistics & Data Management Core, DC VAMC Assistant Professor, Depts. of Psychiatry & Surgery.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
1 Non-inferiority designs for relapse prevention of schizophrenia Gene Laska Ph.D. Department of Psychiatry NYU School of Medicine Nathan Kline Institute.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Assessing the Evidence: Statistical Inference Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
NY, 14 December 2007 Emmanuel Lesaffre Biostatistical Centre, K.U.Leuven, Leuven, Belgium Dept of Biostatistics, Erasmus MC, Rotterdam, the Netherlands.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Understanding the Concept of Equivalence and Non-Inferiority Trials CM Gibson, 2000.
Dependent Samples: Hypothesis Test For Hypothesis tests for dependent samples, we 1.list the pairs of data in 2 columns (or rows), 2.take the difference.
Biostatistics for Coordinators Peter D. Christenson REI and GCRC Biostatistician GCRC Lecture Series: Strategies for Successful Clinical Trials Session.
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 2: Sample Size & Power for Inequality and Equivalence Studies.
EDUC 200C Friday, October 26, Goals for today Homework Midterm exam Null Hypothesis Sampling distributions Hypothesis testing Mid-quarter evaluations.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
Challenges of Non-Inferiority Trial Designs R. Sridhara, Ph.D.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 1: Sample Size & Power for Inequality and Equivalence Studies.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 5: Choices for Longitudinal Data Analysis.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Biostatistics in practice Session 3 Youngju Pak, Ph.D. UCLA Clinical and Translational Science Institute LA BioMed/Harbor-UCLA Medical Center LA BioMed/Harbor-UCLA.
What is a non-inferiority trial, and what particular challenges do such trials present? Andrew Nunn MRC Clinical Trials Unit 20th February 2012.
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 4: Study Size and Power.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 4: Study Size and Power.
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 6: Case Study.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Ch11: Comparing 2 Samples 11.1: INTRO: This chapter deals with analyzing continuous measurements. Later, some experimental design ideas will be introduced.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 3: Testing Hypotheses.
Equivalence Trials: Understanding the Statistical and Clinical Issues Christopher Cannon, M.D. C. Michael Gibson, M.S., M.D. Brigham and Women’s HospitalBeth.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 4: Study Size for Precision or Power.
Math 4030 – 9a Introduction to Hypothesis Testing
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 1: Demonstrating Equivalence of Active Treatments:
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Joel Singer, Programme Head, Methodology and Statistics, CIHR Canadian HIV Trials Network.
Sample Size Determination
Results: How to interpret and report statistical findings Today’s agenda: 1)A bit about statistical inference, as it is commonly described in scientific.
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 3: Testing Hypotheses.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 1: Demonstrating Equivalence of Active Treatments:
Biostatistics Case Studies 2016 Youngju Pak, PhD. Biostatistician Session 1 Understanding hypothesis testing, P values, and sample size.
Testing for Equivalence or Noninferiority chapter 21.
Remaining Challenges in Assessing Non-Inferiority Steven Snapinn DIA Statistics Community Virtual Journal Club December 16, 2014 Based on Paper with Qi.
Sample Size Determination
Biostatistics Case Studies 2007
Statistical Core Didactic
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Chapter 2 Simple Comparative Experiments
Hypothesis Testing: Hypotheses
Hypothesis Testing Summer 2017 Summer Institutes.
Inference on the Mean of a Population -Variance Known
Volume 46, Issue 5, Pages (May 2007)
Presentation transcript:

Biostatistics Case Studies 2016 Youngju Pak, PhD. Biostatistician Session 2 Understanding Equivalence and Noninferiority testing

Testing Inequality v.s. Equality Testing Inequality Ha: | mean(treatment ) - mean (control ) | ≠ 0 H0: | mean(treatment ) - mean (control ) | = 0 Testing equivalence H a : δ 1 < mean(trt 1) – mean (trt2) < δ 2 H 0 : mean(trt 1) – mean (trt2) ≤ δ 1 or mean(trt 1) – mean (trt2) ≥ δ 2, δ 1 & δ 2 is called “Equivalence Margin Testing Noninferority, noninfiority ity margin

Case Study Ophthalmology 2006; 113:70-76.

Abstract

Primary Outcome and Study Size Study Size - Page 72 bottom of column 1: Primary Outcome - Page 72 middle of column 1: Needs Consensus PI’s Gamble

Non-Inferiority Study Usually a new treatment or regimen is compared with an accepted treatment or regimen or standard of care. The new treatment is assumed inferior to the standard and the study is designed to show overwhelming evidence that it is at least nearly as good, i.e., non- inferior. It usually has other advantages, e.g., oral vs. inj. A negative inferiority study fails to detect inferiority, but does not necessarily give evidence for non-inferiority. The accepted treatment is usually known to be efficacious already, but an added placebo group may also be used.

How to determine Sample Size? For IOP study, we have –H a : mean IOP change uf – mean IOP change f < 1.5 –H 0 : mean IOP change uf – mean IOP change f ≥ 1.5 thus, we are only interested in the upper limit of the difference  Non-inferiority  one-sided T-test Thus we reject the H 0 if Signal/ Noise < some clinical value. But N for a non-inferiority test require more complicated parameters such as the non- centrality parameter of the t-distribution (a Two One Sided T-test is usually used for the equivalence test ).

Let’s run a software from Information you will need –Equivalence Margin Non-Inferiority Margin(NIM) =1.5 for the IOP study –Assumed mean difference in change of IOP between two groups -> usually zero difference assumed but it is assumed 0.5 for the IOP study –SD of changes of IOP = 3.5 –α (usually set to 2.5%) since the confidence level of the confidence interval is (100-2 x α) %

Sample size for IOP study

Three dimensional power curve for a non-inferiority test

How do we determine if the fixed method is non-inferior to the unfixed method? Regardless of study aim – to prove treatments equivalent or to prove them different - inference can be based on: Primary Outcome: IOP reduction D= D uf – D f, where D f = mean IOP reduction with fixed therapy Typical superiority/inferiority study: Compare to 0. Non-inferiority study: Compare to δ 2, a pre-specified margin of equivalence (1.5 here). = 95% CI for D(= D uf – D f ) = “true (population) values for D”

Typical Analysis: Inferiority or Superiority H 0 : D uf – D f = 0 H 1 : D uf – D f ≠ 0 Aim: H 1 → therapies differ α = 0.05 & N=2194 Power = 80% when Δ=1, SD=3.5 Fixed is inferior = 95% CI for D = “true (population) values for D” Fixed is superior 0 0 D u – D f [Not used in this paper] 0 No difference detected D u – D f

Typical Analysis: Inferiority Only H 0 : D u – D f ≤ 0 H 1 : D u – D f > 0 Aim: H 1 → fixed is inferior α = & N=2194 Power = 80% for when Δ=1, SD=3.5 Fixed is inferior = 95% CI for D u – D f = “true (population) values for D” 0 0 D uf – Df [Not used in this paper] 0 Inferiority not detected D uf – D f ( α = 0.05 → N=2153 )

Non-Inferiority H 0 : D u – D f ≥ 1.5 H 1 : D u – D f < 1.5 Aim: H 1 → fixed is non-inferior α = & N=2194 Power = 80% for When Δ= 0.5, NIM=1.5 Fixed is non-inferior = 95% CI for D u – D f = “true (population) values for D” 0 0 D uf – D f [As in this paper] 0 Non-Inferiority not detected D uf – D f 1.5 Fixed is inferior 1.5

Inferiority and Non-Inferiority Fixed is non-inferior = 95% CI for D u – D f = “true (population) values for D” Neither is detected D uf – D f 1.5 Fixed is inferior 01.5 Fixed is “non-clinically” inferior D^ uf = 9.0 D^ f = 8.7 D^ = % CI = -0.1 to 0.7 Observed Results: Fixed is non-inferior 01.5

Conclusions: General “Negligibly inferior” would be a better term than non- inferior. All inference can be based on confidence intervals. Pre-specify the comparisons to be made. Cannot test for both non-inferiority and superiority. Power for only one or for multiple comparisons, e.g., non-inferiority and inferiority. Power can be different for different comparisons. Very careful consideration must be given to choice of margin of equivalence (1.5 here). You can be risky and gamble on what expected differences will be (0.5 here), but the study is worthless if others in the field would find your margin too large.

FDA Guidelines : on/guidances/ucm pdf on/guidances/ucm pdf Where, M 1 = Full effect of the active control compare with the test drug M 2 = NI Margin

Self-Quiz 1.Give an example in your specialty area for a superiority /inferiority study. Now modify it to an equivalence study. Now modify it to a non-inferiority study. 2.T or F: The main point about non-inferiority studies is that we are asking whether a treatment is as good or better vs. worse than another treatment, so it uses a one-sided test. 3.Power for a typical superiority test is the likelihood that you will declare treatment differences (p<0.05) if treatments really differ by some magnitude Δ. Explain what power means for a non-inferiority study. 4.T or F: Last-value-carried-forward is a good way to handle drop-outs in a non-inferiority study. Explain. continued

Self-Quiz 5.T or F: In a non-inferiority study, you should first test for non- inferiority with a confidence interval, and then use a t-test to test for superiority, but only if non-inferiority was established at the first step. 6.What is the meaning of the equivalence margin, and how do you determine it? continued

Self-Quiz 7.Suppose the primary outcome for a study is a serum inflammatory marker. If it’s assay is poor (low reproducibility), then it is more difficult to find treatment differences in a typical superiority/inferiority study than for a better assay, due to this noise. Would it be easier or more difficult to find non-inferiority with this assay, compared to a better assay? 8.Does the assumed treatment difference (0.5 here) for power calculations have the same meaning as the difference used for power calculations in a typical superiority/inferiority study?