By Trusha Patel and Sirisha Davuluri. “An efficient method for accommodating potentially underpowered primary endpoints” ◦ By Jianjun (David) Li and Devan.

Slides:



Advertisements
Similar presentations
Phase II/III Design: Case Study
Advertisements

A Flexible Two Stage Design in Active Control Non-inferiority Trials Gang Chen, Yong-Cheng Wang, and George Chi † Division of Biometrics I, CDER, FDA Qing.
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
1 Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Type I and Type II Errors One-Tailed Tests About a Population Mean: Large-Sample.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
1 1 Slide Chapter 9 Hypothesis Tests Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type I and Type II Errors.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
Chapter 9 Hypothesis Testing
1 1 Slide MA4704Gerry Golding Developing Null and Alternative Hypotheses Hypothesis testing can be used to determine whether Hypothesis testing can be.
1 1 Slide Hypothesis Testing Chapter 9 BA Slide Hypothesis Testing The null hypothesis, denoted by H 0, is a tentative assumption about a population.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Section 7.1 Hypothesis Testing: Hypothesis: Null Hypothesis (H 0 ): Alternative Hypothesis (H 1 ): a statistical analysis used to decide which of two competing.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Statistics Are Fun! Analysis of Variance
Lecture 2: Thu, Jan 16 Hypothesis Testing – Introduction (Ch 11)
Pengujian Hipotesis Nilai Tengah Pertemuan 19 Matakuliah: I0134/Metode Statistika Tahun: 2007.
1 1 Slide © 2006 Thomson/South-Western Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses.
Major Points Formal Tests of Mean Differences Review of Concepts: Means, Standard Deviations, Standard Errors, Type I errors New Concepts: One and Two.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Sample Size Determination Ziad Taib March 7, 2014.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Statistical hypothesis testing – Inferential statistics I.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Business Statistics - QBM117 Introduction to hypothesis testing.
1 1 Slide © 2005 Thomson/South-Western Chapter 9, Part A Hypothesis Tests Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Hypothesis Testing.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Chapter 8 Introduction to Hypothesis Testing
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2003 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.
Chapter 8 Introduction to Hypothesis Testing ©. Chapter 8 - Chapter Outcomes After studying the material in this chapter, you should be able to: 4 Formulate.
All-or-None procedure: An outline Nanayaw Gyadu-Ankama Shoubhik Mondal Steven Cheng.
What is a non-inferiority trial, and what particular challenges do such trials present? Andrew Nunn MRC Clinical Trials Unit 20th February 2012.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Example 10.2 Measuring Student Reaction to a New Textbook Hypothesis Tests for a Population Mean.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
© Copyright McGraw-Hill 2004
Formulating the Hypothesis null hypothesis 4 The null hypothesis is a statement about the population value that will be tested. null hypothesis 4 The null.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
1 Section 8.2 Basics of Hypothesis Testing Objective For a population parameter (p, µ, σ) we wish to test whether a predicted value is close to the actual.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Sample Size and Power Considerations.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
CHAPTER 7: TESTING HYPOTHESES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Chapter 10: The t Test For Two Independent Samples.
A Parametrized Strategy of Gatekeeping, Keeping Untouched the Probability of Having at Least One Significant Result Analysis of Primary and Secondary Endpoints.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Hypothesis Testing Chapter Hypothesis Testing  Developing Null and Alternative Hypotheses  Type I and Type II Errors  One-Tailed Tests About.
Multiple Endpoint Testing in Clinical Trials – Some Issues & Considerations Mohammad Huque, Ph.D. Division of Biometrics III/Office of Biostatistics/OPaSS/CDER/FDA.
Covering Principle to Address Multiplicity in Hypothesis Testing
Incorporating the sample correlation between two test statistics to adjust the critical points for the control of type-1 error Dror Rom and Jaclyn McTague.
Presentation transcript:

By Trusha Patel and Sirisha Davuluri

“An efficient method for accommodating potentially underpowered primary endpoints” ◦ By Jianjun (David) Li and Devan V. Mehrotra” 2

1. Introduction 2. Procedure 1: PAAS Method 3. Procedure 2: 4A Method a.Independent Primary Endpoints b.Correlated Primary Endpoints 4. Performance of both Procedures (Probability of achieving positive trial) 5. Conclusion 6. Appendix 7. References 8. Questions? 3

Clinical trials generally classify the endpoints into primary, secondary and exploratory types. Primary Endpoints:  Primary endpoints address primary objectives of the trial. They are usually few but are clinically most relevant to the disease and the treatment under study. They assess the main clinical benefits of the treatment. 4

Secondary Endpoints:  Secondary endpoints characterize extra benefits of the treatment under study after it has been demonstrated that the primary endpoints show clinically meaningful benefits of the treatment. Explanatory Endpoints:  Exploratory endpoints are usually not prospectively planned and are generally not rigorously evaluated like primary and secondary endpoints. 5

Different approaches have been used to specify a clinical decision rule for trials that have more than one primary endpoint. Significant results can be required for each of several primary endpoints to consider a trial ‘‘positive.” 6

 To demonstrate the treatment’s superiority on at least one endpoint.  If each multiple endpoint is independently clinical relevant, the multiple endpoint problem can be formulated as a multiple testing problem, and the trial is declared positive if at least one significant effect is detected.  When significant results are required for more than one but not all of multiple primary endpoints for a trial to be considered positive, correction for multiplicity is also necessary, and this must take into account the total number of endpoints and the number required for the trial to be considered positive. 7

 Weighted Bonferroni procedure  Prospective Alpha Allocation Scheme (PAAS Method)  Adaptive Alpha Allocation Approach (4A Method)  Bonferroni-type parametric procedure  Fallback-type parametric procedure For this project, we are mainly focusing on PAAS Method and 4A Method by considering two Primary Endpoints (A : well powered, B : potentially underpowered). 8

Procedure:  Moye’(2000) proposed the ‘‘prospective alpha allocation scheme” for preserving Type I error rates at acceptable levels when there are multiple endpoints.  Assume that p-values for the individual endpoints are independent.  The experiment-wise type I error rate = α should be capped, say at 0.05, and the fraction of the Type I error rate α allocating α 1 * = α − ε, ε= 0.01 to endpoint A α 2 * = 1 - (1−α)/(1−α 1 *) to endpoint B 9

Advantages:  PAAS is a simple and appealing method.  Prospectively allocating alpha in this method preserves the experiment-wise Type I error rate and makes it possible to consider a treatment efficacious when the null hypothesis is not rejected for the primary endpoint but is rejected for one or more of the secondary endpoints. Disadvantage: The prospective alpha allocation scheme preserves the experiment-wise Type I error rate at a higher rate than is customarily accepted. 10

Procedure:  Li and Mehrotra (2008) proposed a multiple testing procedure, which they referred to as the adaptive alpha allocation approach or 4A procedure.  Consider a clinical trial with m endpoints and assume that the endpoints are grouped into two families. The first family includes m 1 endpoints that are adequately powered and the second family includes m 2 potentially underpowered endpoints (m 1 + m 2 = m). 11

Let’s consider 2 multiple points for illustration of the procedure:  Let p A and p B denote the p-value for endpoint A and endpoint B, respectively.  In the article, they assume that p-values are two-tailed (to mimic common practice) and that ‘statistically significant’ results are in the direction of interest; however all the methods discussed can also be used with one-tailed p-values.  In their proposed adaptive alpha allocation approach, endpoint A is tested at the pre specified level α 1 = α − ε, and endpoint B is tested at the adaptive level. 12

 If A achieves statistical significance (i.e. p A ≤α 1 ), then B is tested at level α 2 (p A,α 1, α) = α;  If A fails to achieve statistical significance (i.e. p A >α 1 ), then B is tested at level 0 <α 2 (p A,α 1,α) ≤ α 1, which is close (or equal) to α 1 if p A is not much greater than α 1, but approaches zero as p A increases.  Sample values of α 2 as a function of observed p A are provided in Table I (ρ = 0 column)  Table I shows the alpha allocation for Primary endpoint B. 13

14 Table I

 Let there be M primary end points out of which the first m are assumed to be powered by type A endpoints and the rest of the M-m are underpowered or type B endpoints.  Here, they suppose, the p-values for the type A endpoints are tested for statistical significance at an overall level α 1 = α – ε using Hochberg’s method.  The p-values for the type B primary endpoints are then measured at and overall adaptive level α 2 (p (m), α 1, α,m) using Hochberg’s method 15

Note : (3) and (4) are generalizations of (1) and (2); setting m=1 in (3) and (4) leads to (1) and (2), respectively. Advantage: The remaining endpoints are tested at a generally higher significance level, which improves their power. 16 Extending 4A to the scenario of three or more independent primary endpoints

Procedure:  If the endpoints are correlated, then formula (1) and (3) cannot be used directly because the FWER may be inflated. Therefore, an adjustment may be needed in the formulas presented for correlated endpoints.  The 4A procedure is readily implemented using Table I if ρ is known.  For the case of unknown ρ, See Appendix-I. Extending 4A to the relatively uncommon scenario of M>2 primary endpoints is notably more challenging in the case of correlated endpoints compared with independent endpoints. 17

Let’s compare the performances of the PAAS and 4A methods for our motivating scenario of two primary endpoints: A-well powered and B-underpowered. Goal: To improve the probability of correctly achieving a positive trial, i.e. of rejecting at least one of the two null hypotheses, while ensuring that the family wise type I error rate is at most α. Table A summarizes the probabilities of achieving a positive trial using the PAAS and 4A methods when the two endpoints are either independent (ρ=0) or correlated (ρ=0.5); results are based on 10,000,000 simulations. (See Table III in Appendix-II) 18

19 Table A

Case 1-7: Primary endpoint A: Adequately Powered Primary endpoint B: Underpowered Case 8-17: Primary endpoint A: Underpowered Primary endpoint B: Underpowered For PAAS method, α 1 * = 0.04 and α 2 * = 0.010; and For 4A method, α 1 = 0.04 and α 2 ≡α 2 (p A,α 1,α,ρ) is calculated adaptively as described before. 20

 Use of 4A method increased the probability of achieving a positive trial compared with PAAS. o For Independent endpoints, the absolute gain in power ranged from 0.9 % (case 7; 77.9 versus 78.8%) to 3.7% (case 8; 78.8 versus 82.5%) compared with PAAS. o For Correlated endpoints A and B (ρ=0.5); the power for 4A was numerically higher than that for PAAS, For example, the power gains of 4A ranged from 0.1 per cent (case 7; 77.4 versus 77.5 per cent) to 2.9 per cent (case 8, case 14) compared with PAAS. 21

 The probability of achieving a positive trial by pre specifying both A and B as primary endpoints and using 4A for the analysis is about as good as (cases 7 and 13) or notably better than (cases 1 and 8) the marginal power for endpoint A. o In other words, 4A enables us to accommodate the underpowered endpoint B in the primary family while essentially preserving or substantially enhancing the likelihood of achieving a positive trial compared with the strategy of using only A as the primary endpoint with α=

 There is no optimal strategy for the use of significance testing with multiple endpoints, especially when dealing with clinically persuasive endpoints that may be underpowered, but based on the simulation results that are provided in the article, we can conclude that 4A method performs better than PAAS method. o The power advantage of 4A over PAAS was greater when the marginal power for endpoint B was lower (higher), regardless of whether the endpoints were independent or correlated. 23

24

25

  Multiple Testing Problems in Pharmaceutical Statistics, by Alex Dmitrienko, Ajit Tamhane and Frank Bretz. 26

27