Testing and Estimation Procedures in Multi-Armed Designs with Treatment Selection Gernot Wassmer, PhD Institut für Medizinische Statistik, Informatik und.

Slides:



Advertisements
Similar presentations
Emerging Issues with Adaptation of Clinical Trial Design in Drug Development* H.M. James Hung Division of Biometrics I, Office of Biostatistics, OPaSS,
Advertisements

Interim Analysis in Clinical Trials: A Bayesian Approach in the Regulatory Setting Telba Z. Irony, Ph.D. and Gene Pennello, Ph.D. Division of Biostatistics.
Phase II/III Design: Case Study
1 O Guilbaud, MCP2007, July 8-11, Vienna, v.1 Simultaneous Confidence Regions corresponding to Holm’s Step-down MTP (and other CTPs) Olivier Guilbaud.
Statistical Analysis for Two-stage Seamless Design with Different Study Endpoints Shein-Chung Chow, Duke U, Durham, NC, USA Qingshu Lu, U of Science and.
By Trusha Patel and Sirisha Davuluri. “An efficient method for accommodating potentially underpowered primary endpoints” ◦ By Jianjun (David) Li and Devan.
A new group-sequential phase II/III clinical trial design Nigel Stallard and Tim Friede Warwick Medical School, University of Warwick, UK
Design and analysis of clinical trials MULTIPLE COMPARISONS.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Evaluating Hypotheses
PSY 307 – Statistics for the Behavioral Sciences
An Experimental Paradigm for Developing Adaptive Treatment Strategies S.A. Murphy Univ. of Michigan February, 2004.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Chapter 9 Hypothesis Testing.
1Carl-Fredrik Burman, 11 Nov 2008 RSS / MRC / NIHR HTA Futility Meeting Futility stopping Carl-Fredrik Burman, PhD Statistical Science Director AstraZeneca.
Adaptive Designs for Clinical Trials
1/2555 สมศักดิ์ ศิวดำรงพงศ์
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
CI - 1 Cure Rate Models and Adjuvant Trial Design for ECOG Melanoma Studies in the Past, Present, and Future Joseph Ibrahim, PhD Harvard School of Public.
Background to Adaptive Design Nigel Stallard Professor of Medical Statistics Director of Health Sciences Research Institute Warwick Medical School
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistics 11 Confidence Interval Suppose you have a sample from a population You know the sample mean is an unbiased estimate of population mean Question:
1 Lecture 19: Hypothesis Tests Devore, Ch Topics I.Statistical Hypotheses (pl!) –Null and Alternative Hypotheses –Testing statistics and rejection.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
1 10 Statistical Inference for Two Samples 10-1 Inference on the Difference in Means of Two Normal Distributions, Variances Known Hypothesis tests.
CHAPTER 17: Tests of Significance: The Basics
1 An Interim Monitoring Approach for a Small Sample Size Incidence Density Problem By: Shane Rosanbalm Co-author: Dennis Wallace.
Adaptive Designs for U-shaped ( Umbrella) Dose-Response Yevgen Tymofyeyev Merck & Co. Inc September 12, 2008.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
1 Keaven Anderson, Ph.D. Amy Ko, MPH Nancy Liu, Ph.D. Yevgen Tymofyeyev, Ph.D. Merck Research Laboratories June 9, 2010 Information-Based Sample Size Re-estimation.
Bayesian Approach For Clinical Trials Mark Chang, Ph.D. Executive Director Biostatistics and Data management AMAG Pharmaceuticals Inc.
MPS/MSc in StatisticsAdaptive & Bayesian - Lect 41 Lecture 4 Sample size reviews 4.1A general approach to sample size reviews 4.2Binary data 4.3Normally.
1 Chihiro HIROTSU Meisei (明星) University Estimating the dose response pattern via multiple decision processes.
1. Homework due Wed 2. Project Proposal due Friday
Introduction to hypothesis testing Hypothesis testing is about making decisions Is a hypothesis true or false? Ex. Are women paid less, on average, than.
MPS/MSc in StatisticsAdaptive & Bayesian - Lect 51 Lecture 5 Adaptive designs 5.1Introduction 5.2Fisher’s combination method 5.3The inverse normal method.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
BIOSTATISTICS Hypotheses testing and parameter estimation.
Adaptive Designs P. Bauer Medical University of Vienna June 2007.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 –Multiple hypothesis testing Marshall University Genomics.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Adaptive trial designs in HIV vaccine clinical trials Morenike Ukpong Obafemi Awolowo University Ile-Ife, Nigeria.
Chapter 10: The t Test For Two Independent Samples.
A Parametrized Strategy of Gatekeeping, Keeping Untouched the Probability of Having at Least One Significant Result Analysis of Primary and Secondary Endpoints.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 15: Sample size and Power Marshall University Genomics.
Elizabeth Garrett-Mayer, PhD Associate Professor of Biostatistics
Chapter 9 -Hypothesis Testing
Strategies for Implementing Flexible Clinical Trials Jerald S. Schindler, Dr.P.H. Cytel Pharmaceutical Research Services 2006 FDA/Industry Statistics Workshop.
When we free ourselves of desire,
Sue Todd Department of Mathematics and Statistics
Aiying Chen, Scott Patterson, Fabrice Bailleux and Ehab Bassily
Multiplicity Testing Procedure Selection in Clinical Trials Rachael Wen Ph.D JSM 2018 of 8.
OMGT LECTURE 10: Elements of Hypothesis Testing
Tobias Mielke QS Consulting Janssen Pharmaceuticals
Hui Quan, Yi Xu, Yixin Chen, Lei Gao and Xun Chen Sanofi June 28, 2019
Exact Inference for Adaptive Group Sequential Clinical Trials
Oncology Biostatistics
Presentation transcript:

Testing and Estimation Procedures in Multi-Armed Designs with Treatment Selection Gernot Wassmer, PhD Institut für Medizinische Statistik, Informatik und Epidemiologie Universität zu Köln ADDPLAN GmbH Adaptive Design KOL Lecture Series, August 14th, 2009

Introduction  Confirmatory adaptive designs are a generalization of group sequential designs, where - in interim analyses - confirmatory analysis is performed under control of the Type I error rate and data dependent changes of design are allowed.  Three particular applications –Sample size reassessment –Treatment arm selection –Subset selection (“enrichment designs”)  This talk shows –how to reach a test decision in an adaptive multi-armed trial with treatment selection at interim –how to calculate confidence intervals and overall p-values

Confirmatory adaptive designs can be based on –the combination testing principle –the conditional error approach Combination testing principle Combination of p-values with a specific combination function (Bauer, 1989; Bauer & Köhne, 1994) Inverse normal method: The test decision is based on Lehmacher & Wassmer, 1999 where the weights w k are prefixed

The conditional error approach Plan a trial with reasonable (optimum) design, including sample size calculation and timing of interim analyses. Calculate the conditional Type I error rate  (x 1,…,x k ) at any time during the course of the trial  (x 1,…,x k ) = conditional probability, under H 0, of rejecting H 0 in one of the subsequent stages, given x 1,…,x k x 1,…,x k : data up to stage k Remainder of the trial can be defined as a test at level  (x 1,…,x k ) where the design of this test is arbitrary. Müller & Schäfer (2001): “CRP principle” Brannath, Posch & Bauer (2002): “Recursice testing principle”

The situation  Consider many-to-one comparisons, e.g., G treatment arms and one control, normal case.  Throughout this talk, we consider one-sided testing.  In an interim stage a treatment arm is selected based on data observed so far.  Not only selection procedures, but also other adaptive strategies (e.g., sample size reassessment) can be performed.  Application within “Adaptive seamless designs” using the combination testing principle

Sources for alpha inflation  Interim analyses  Sample size reassessment  Multiple arms The proposed adaptive procedure fulfils the regulatory requirements for the analysis of adaptive trials in that it strongly controls the prespecified Type I error rate. This procedure will be based on the application of the closed test procedure together with combination tests (e.g., Bauer & Kieser, 1999; Hellmich, 2001; Posch et al., 2005, Bretz et al., 2009). Other approaches: Thall et al., 1988; Follmann et al, 1994; Stallard and Todd, 2003; Stallard and Friede, 2008;

Stage II … Simple “trick”: Test of intersection hypotheses are formally performed as tests for ? ? Stage I Closed testing procedure

At the first interim analysis, consider a test statistic for e.g., the test statistic Closed testing procedure That is, compute Dunnett’s adjusted p-value for each intersection hypothesis, critical values are according to Or compute the p-value using Dunnett’s t distribution.

where p J is the p-value of the Dunnett test for testing q S is the second stage test statistic for the selected treatment arm, and u 2 is the critical value for the second stage. Test decision for the second stage: is rejected if This is the use of the inverse normal method for the Dunnett test situation.

can be rejected if all combination tests exceed the critical value u 2. Stage I Stage II … Example S = 3 Simple shortcut: If the treatment arm with the largest test statistic is selected, it suffices to combine the test for H 0 : with the test for H 0 :

Properties of the Procedure  Choice of tests for intersection hypotheses is free, i.e., you might select, e.g., Dunnett‘s test, Bonferroni-, Simes or Sidak‘s test.  The procedure may become inconsonant and, hence, conservative. I.e., you can reject the global hypothesis, but no single hypothesis (Friede and Stallard, 2008).  A hypothesis can be rejected at a later stage even it was not selected for the current stage (and not rejected before). This can happen if, e.g., the test statistic for the global hypothesis exceeds u 2 in the second stage but not u 1 in the first stage, and the test statistic for the de-selected hypothesis exceeds u 1 in the first stage. 12

An alternative procedure (König et al., 2008) Compute conditional error at first stage: In the second stage, perform a  Conditional second-stage Dunnett test  Separate second-stage Dunnett test This is the application of the CRP principle (Müller & Schäfer, 2001). It assumes the variance to be known

A comparison shows that  the conditional second-stage Dunnett test performs best but is hardly better if a treatment arm selection was performed (cf., Friede and Stallard, 2008)  it is identical with the conventional Dunnett test if no adaptations were performed  becomes complicated if, e.g., –allocation is not constant –variance is unknown  the inverse normal technique is not optimum but enables early stopping and more general adaptations  is straightforward if, e.g., –allocation is not constant –variance is unknown

A comparison shows that  the conditional second-stage Dunnett test performs best but is hardly better if a treatment arm selection was performed (cf., Friede and Stallard, 2008)  it is identical with the conventional Dunnett test if no adaptations were performed  becomes complicated if, e.g., –allocation is not constant –variance is unknown  the inverse normal technique is not optimum but enables early stopping and more general adaptations  is straightforward if, e.g., –allocation is not constant –variance is unknown

Overall p-values  Defined as smallest p-value for which the test results yield rejection of the considered (single) hypothesis  Repeated overall p-value can be calculated at any stage of the trial.  That is,  p-values account for the step-down nature of the closed testing principle and are completely consistent with the test decision.

Overall confidence intervals  Confidence intervals based on stepwise testing are difficult to construct. This is a specific feature of multiple testing procedures and not of adaptive testing.  Posch et al. (2005) proposed to construct confidence intervals based on the single step adjusted overall p-values. These can also be applied for the conditional Dunnett test.  The RCIs are not, in general, consistent with the test decision. It might happen that, e.g., a hypothesis is rejected but the lower bound of the CI is smaller 0.  They can be provided for each step of the trial.  In general, they may fail to become narrower for increasing sample size (e.g., if Bonferroni or Simes intersection tests are used).

Illustration  Two-stage design with G treatment arms  Selection of treatment arm with highest respone, no efficacy stop at interim  Bonferroni (or Simes) correction is used for first stage  Lower bound lb j of 95% confidence intervals for effect  j =  j -  0 of selected treatment arm at second stage is calculated through 21 It is easy to see that

Summary  The adaptive procedures fulfil the regulatory requirements for the analysis of adaptive trials in that they control the prespecified Type I error rate. For regulatory purposes, the class of envisaged decisions after stage 1 should be stated in the protocol.  The “rules” for adaptation and stopping for futility –not need to be pre-specified –Adaptations may depend on all interim data including secondary and safety endpoints. –can make use of Bayesian principles integrating all information available, also external to the study –should be evaluated (e.g. via simulations) and preferred version recommended, e.g., in DMC charter  Software ADDPLAN MC is available for designing and analyzing these trials 22

23 Bauer, P. (1989). Multistage testing with adaptive designs (with Discussion). Biometrie und Informatik in Medizin und Biologie 20, 130–148. Bauer, P., Köhne, K. (1994). Evaluation of experiments with adaptive interim analyses. Biometrics 50, 1029–1041. Bauer, P., Kieser, M. (1999). Combining different phases in the development of medical treatments within a single trial. Statistics in Medicine 18,1833–1848. Brannath, W., Posch, M., Bauer, P., 2002: Recursive combination tests. J. Amer. Stat. Ass. 97, 236–244. Follmann, D. A., Proschan, M. A., Geller, N. L., 1994: Monitoring pairwise comparisons in multi-armed clinical trials. Biometrics 50, 325–336. Friede, T., Stallard, N., 2008: A comparison of methods for adaptive treatment selection. Biometrical J. 50, 767–781. Hellmich, M., 2001: Monitoring clinical trials with multiple arms. Biometrics 57, 892–898. König, F., Brannath, W., Bretz, F., Posch, M. (2008). Adaptive Dunnett tests for treatment selection. Statistics in Medicine 27, 1612–1625. Lehmacher, W., Wassmer, G. (1999). Adaptive sample size calculations in group sequential trials. Biometrics 55, 1286– Müller, H.H., Schäfer, H. (2001). Adaptive group sequential designs for clinical trials, combining the advantages of adaptive and of classical group sequential approaches. Biometrics 57,886–891. Posch, M., König, F., Branson, M., Brannath, W., Dunger-Baldauf, C., Bauer, P. (2005). Testing and estimation in flexible group sequential designs with adaptive treatment selection. Statistics in Medicine 24, 3697–3714. Posch, M., Wassmer, G., Brannath, W. (2008). A note on repeated p-values for group sequential designs. Biometrika 95, Stallard, N., Friede, T. (2008). A group-sequential design for clinical trials with treatment selection. Statistics in Medicine 27, 6209–6227. Stallard, N., Todd, S. (2003). Sequential designs for phase III clinical trials incorporating treatment selection. Statistics in Medicine 22, Thall, P.F., Simon, R., Ellenberg, S.S. (1988). Two-stage selection and testing designs for comparative clinical trials. Biometrika 75, References