DSBS Discussion: Multiple Testing 28 May 2009 Discussion on Multiple Testing Prepared and presented by Lars Endahl.

Slides:



Advertisements
Similar presentations
ISSUES THAT PLAGUE NON- INFERIORITY TRIALS PAST AND FUTURE RALPH B. DAGOSTINO, SR. BOSTON UNIVERSITY HARVARD CLINICAL RESEARCH INSTITUTE.
Advertisements

Gatekeeping Testing Strategies in Clinical Trials Alex Dmitrienko, Ph.D. Eli Lilly and Company FDA/Industry Statistics Workshop September 2004.
Robert T. O’Neill, Ph.D. Director, Office of Biostatistics CDER, FDA
The equivalence trial Didier Concordet NATIONAL VETERINARY S C H O O L T O U L O U S E.
Synopsis of FDA Colorectal Cancer Endpoints Workshop Michael J. O’Connell, MD Director, Allegheny Cancer Center Associate Chairman, NSABP Pittsburgh, PA.
By Trusha Patel and Sirisha Davuluri. “An efficient method for accommodating potentially underpowered primary endpoints” ◦ By Jianjun (David) Li and Devan.
Basic Design Consideration. Previous Lecture Definition of a clinical trial The drug development process How different aspects of the effects of a drug.
Design and analysis of clinical trials MULTIPLE COMPARISONS.
Optimal Drug Development Programs and Efficient Licensing and Reimbursement Regimens Neil Hawkins Karl Claxton CENTRE FOR HEALTH ECONOMICS.
ODAC May 3, Subgroup Analyses in Clinical Trials Stephen L George, PhD Department of Biostatistics and Bioinformatics Duke University Medical Center.
New or existing slides are easily formatted using built-in layouts that can be applied via the Home tab EMA DRAFT GUIDELINE ON SUBGROUPS DISCUSSION April.
Estimation and Reporting of Heterogeneity of Treatment Effects in Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare.
Locating Variance: Post-Hoc Tests Dr James Betts Developing Study Skills and Research Methods (HL20107)
1 A Bayesian Non-Inferiority Approach to Evaluation of Bridging Studies Chin-Fu Hsiao, Jen-Pei Liu Division of Biostatistics and Bioinformatics National.
ANOVA Determining Which Means Differ in Single Factor Models Determining Which Means Differ in Single Factor Models.
Statistics for the Social Sciences Psychology 340 Spring 2005 Analysis of Variance (ANOVA)
Today Concepts underlying inferential statistics
Meeting Agenda Presentations on endpoints –Regulatory issues –Scientific issues Pros and cons of end points –Classical end points –Non-classical end points.
Adaptive Designs for Clinical Trials
Sample Size Determination Ziad Taib March 7, 2014.
Power and Non-Inferiority Richard L. Amdur, Ph.D. Chief, Biostatistics & Data Management Core, DC VAMC Assistant Professor, Depts. of Psychiatry & Surgery.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
NDA Study MP-US-M01. Division of Oncology Drug Products 2 Federal Food, Drug, and Cosmetic Act of 1962 Substantial Evidence = Adequate and well-controlled.
Research Methods. Research Projects  Background Literature  Aims and Hypothesis  Methods: Study Design Data collection approach Sample Size and Power.
Discussion on Randomisation and Blinding
Proprietary and Confidential © AstraZeneca 2009 FOR INTERNAL USE ONLY 1 O Guilbaud, FMS+Cramér Society, AZ-Södertälje, Alpha Recycling in Confirmatory.
Gil Harari Statistical considerations in clinical trials
Analysis of Clinical Trials with Multiple Outcomes Changchun Xie, PhD Assistant Professor of Biostatistics Division of Biostatistics and Bioinformatics.
Intermediate Applied Statistics STAT 460
NY, 14 December 2007 Emmanuel Lesaffre Biostatistical Centre, K.U.Leuven, Leuven, Belgium Dept of Biostatistics, Erasmus MC, Rotterdam, the Netherlands.
Biostatistics, statistical software V
1 Statistical Review Dr. Shan Sun-Mitchell. 2 ENT Primary endpoint: Time to treatment failure by day 50 Placebo BDP Patients randomized Number.
Decision-Theoretic Views on Switching Between Superiority and Non-Inferiority Testing. Peter Westfall Director, Center for Advanced Analytics and Business.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
1 OTC-TFM Monograph: Statistical Issues of Study Design and Analyses Thamban Valappil, Ph.D. Mathematical Statistician OPSS/OB/DBIII Nonprescription Drugs.
1 Statistics in Drug Development Mark Rothmann, Ph. D.* Division of Biometrics I Food and Drug Administration * The views expressed here are those of the.
What is a non-inferiority trial, and what particular challenges do such trials present? Andrew Nunn MRC Clinical Trials Unit 20th February 2012.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
A Comparison of Some Methods for Detection of Safety Signals in Randomised Controlled Clinical Trials Raymond Carragher Project Supervisors: Prof. Chris.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
One-way ANOVA: - Comparing the means IPS chapter 12.2 © 2006 W.H. Freeman and Company.
Federal Institute for Drugs and Medical Devices The BfArM is a Federal Institute within the portfolio of the Federal Ministry of Health (BMG) The use of.
The Multiple Comparisons Problem in IES Impact Evaluations: Guidelines and Applications Peter Z. Schochet and John Deke June 2009, IES Research Conference.
Evidence Based Advertising Part II Beyond the TMA: From clinical trials to real world evidence.
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
1.  What inferential statistics does best is allow decisions to be made about populations based on the information about samples.  One of the most useful.
1 BLA Sipuleucel-T (APC-8015) FDA Statistical Review and Findings Bo-Guang Zhen, PhD Statistical Reviewer, OBE, CBER March 29, 2007 Cellular, Tissue.
One-way ANOVA Example Analysis of Variance Hypotheses Model & Assumptions Analysis of Variance Multiple Comparisons Checking Assumptions.
Division of Oncology Drug Products 1 AREAS OF MAJOR STATISTICAL CONCERNS IN THE M01 STUDY Overall (ITT Population) Finding Liver Metastasis Subgroup Finding.
Regulatory Considerations for Endpoints Ann T. Farrell, M.D. FDA/CDER/DODP.
Guidelines for Multiple Testing in Impact Evaluations Peter Z. Schochet June 2008.
ANalysis Of VAriance (ANOVA) Used for continuous outcomes with a nominal exposure with three or more categories (groups) Result of test is F statistic.
Zometa for Prostate Cancer Bone Metastases Protocol 039 Amna Ibrahim, M.D. Oncology Drug Products FDA.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 1: Demonstrating Equivalence of Active Treatments:
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Remaining Challenges in Assessing Non-Inferiority Steven Snapinn DIA Statistics Community Virtual Journal Club December 16, 2014 Based on Paper with Qi.
Double-blind, randomized trial in 4,162 patients with Acute Coronary Syndrome
European Obesity Academy Assmannshausen 2016 Statistics; power calculation and randomization Johan Bring Statisticon AB.
Critical Appraisal Course for Emergency Medicine Trainees Module 2 Statistics.
Estimating the False Discovery Rate in Genome-wide Studies BMI/CS 576 Colin Dewey Fall 2008.
Sample Journal Club Your Name Here.
Biostatistics Case Studies 2007
Deputy Director, Division of Biostatistics No Conflict of Interest
Multiple Endpoint Testing in Clinical Trials – Some Issues & Considerations Mohammad Huque, Ph.D. Division of Biometrics III/Office of Biostatistics/OPaSS/CDER/FDA.
Critical Reading of Clinical Study Results
Crucial Statistical Caveats for Percutaneous Valve Trials
Aiying Chen, Scott Patterson, Fabrice Bailleux and Ehab Bassily
Medical Statistics Exam Technique and Coaching, Part 2 Richard Kay Statistical Consultant RK Statistics Ltd 22/09/2019.
Presentation transcript:

DSBS Discussion: Multiple Testing 28 May 2009 Discussion on Multiple Testing Prepared and presented by Lars Endahl

DSBS Discussion: Multiple TestingSlide no 228 May 2009 Outline The multiplicity issue Questions to the audience Regulatory guidance Discussion

DSBS Discussion: Multiple TestingSlide no 328 May 2009 Rejecting a hypothesis The p-value leads to a decision whether to reject the hypothesis or not Decision Hypothesis Do not RejectReject TrueCorrect decisionType 1 error FalseType 2 errorCorrect decision

DSBS Discussion: Multiple TestingSlide no 428 May 2009 Multiplicity problem Type 1 error for each comparison = comparison-wise error the entire experiment = experiment-wise error #tests in experiment Comparison-wise error Experiment-wise error 15% 5 25% 105%40% 205%64% 505%92%

DSBS Discussion: Multiple TestingSlide no 528 May 2009 Regulatory Perspective Regulatory authorities requires the type 1 error frequency to be low do not want to grant labelling claims that are not a true reflection of the drug capabilities

DSBS Discussion: Multiple TestingSlide no 628 May 2009 Show of Hands How many of you have analysed data from a confirmatory trial? except for bioequivalence trials routinely provided p-values as inferential statistics? routinely adjusted the analyses for multiplicity?

DSBS Discussion: Multiple TestingSlide no 728 May 2009 Show of Hands For those of you who do NOT adjust for multiplicity, what’s the reason I only test one hypothesis per trial I hardly know anything about multiplicity It’s not needed to get the wanted label claims Project management does not want it It’s too difficult to achieve significance It prevents me from cherry picking

DSBS Discussion: Multiple TestingSlide no 828 May 2009 Show of Hands For those of you who do adjust for multiplicity, on what endpoints do you adjust All efficacy and safety endpoints All efficacy endpoints (but not safety endpoints) All safety endpoints (but not efficacy endpoints) Selected efficacy and safety endpoints Selected efficacy endpoints Selected safety endpoints

DSBS Discussion: Multiple TestingSlide no 928 May 2009 Show of Hands For those of you who do adjust for multiplicity, what is your preferred method Bonferroni methods (incl. Holm, Hochberg etc.) Closed testing Hierarchical testing (incl. Win-on-All) ANOVA methods (Tukey, Fisher, Duncan etc.) O’Brien & Fleming (and related methods) Controlling the false discovery rate

DSBS Discussion: Multiple TestingSlide no 1028 May 2009 Regulatory Guidelines ICH-E9: when multiplicity is present, the usual frequentist approach to the analysis of clinical trial data may necessitate an adjustment to the type 1 error the method of controlling type 1 error should be given in the protocol.

DSBS Discussion: Multiple TestingSlide no 1128 May 2009 Regulatory Guidelines CPMP Points-to-consider on multiplicity issues Significant effects in secondary variables can be considered for an additional claim only after the primary objective of the clinical trial has been achieved, and if they were part of the confirmatory strategy There are many situations where no multiplicity concern arises, for example, having predefined primary variables and all secondary variable are declared supportive In case of adverse effects, p-values are of very limited value as substantial difference will raise concern,…, irrespective of the p-value

DSBS Discussion: Multiple TestingSlide no 1228 May 2009 Regulatory Guidelines FDA Guidance on Diabetes Mellitus (draft) If statistical significance is achieved on the primary endpoint, secondary assessments of efficacy can be considered. Type 1 error should be controlled across all clinically relevant secondary efficacy endpoints that may be intended for product labelling to provide statistical support for their inclusion in the label.

DSBS Discussion: Multiple TestingSlide no 1328 May 2009 Example 1 Primary objective: Efficacy of a new chemotherapy by confirming superiority of survival compared to placebo Primary endpoint: time to death 25 secondary endpoints If all endpoints have a power of 80% hierarchical power of endpoint number 5 = 33% number 10 = 11% number 26 = 0.3% Bonferroni power for all endpoints = 38% must test with a significance level of 0.2% using two-sample t-test

DSBS Discussion: Multiple TestingSlide no 1428 May 2009 Example 2 Primary objective: Efficacy of a new chemotherapy by confirming superiority of survival compared to placebo Primary endpoint: time to death Prioritised secondary endpoint: Quality of life Hierarchical testing Can only confirm superiority of QoL if chemotherapy improves survival any chance of getting this in the label??? any relevance??

DSBS Discussion: Multiple TestingSlide no 1528 May 2009 Example 3 Primary objective: Efficacy of a new chemotherapy confirm superiority of the new chemotherapy in one of two possible regimens compared to placebo Primary endpoint: time to death Two options design a three-armed trial design two different two armed trials When, how and why should we adjust for multiplicity in these two different options?

DSBS Discussion: Multiple TestingSlide no 1628 May 2009 Discussion points Is multiplicity just a regulatory issue or do we also need to adjust for it in e.g. publications? is it possible to publish any difference in medical journals without providing a (unadjusted) p-value? is it possible to publish p-values > 1? Should we adjust the p-values or significance levels and/or the confidence intervals for multiplicity?? for multiple trials testing the same hypothesis? for previous testing of the same hypothesis in individual trials in a pre-planned meta-analysis?

DSBS Discussion: Multiple TestingSlide no 1728 May 2009 More discussion points What should we do when testing for superiority in a non-inferiority trial and we wish to continue if non- inferiority is confirmed for the primary endpoint avoid using hierarchical approaches? test all relevant intersections? regard the superiority test as part of the non- inferiority comparison? Should we report p-values for hypotheses that has not be tested? default in e.g. SAS

DSBS Discussion: Multiple TestingSlide no 1828 May 2009