1 Controlling False Positive Rate Due to Multiple Analyses Controlling False Positive Rate Due to Multiple Analyses Unstratified vs. Stratified Logrank.

Slides:



Advertisements
Similar presentations
COMPUTER INTENSIVE AND RE-RANDOMIZATION TESTS IN CLINICAL TRIALS Thomas Hammerstrom, Ph.D. USFDA, Division of Biometrics The opinions expressed are those.
Advertisements

Introduction to Hypothesis Testing
Lesson #24 Multiple Comparisons. When doing ANOVA, suppose we reject H 0 :  1 =  2 =  3 = … =  k Next, we want to know which means differ. This does.
Categorical Data Analysis
A Flexible Two Stage Design in Active Control Non-inferiority Trials Gang Chen, Yong-Cheng Wang, and George Chi † Division of Biometrics I, CDER, FDA Qing.
The equivalence trial Didier Concordet NATIONAL VETERINARY S C H O O L T O U L O U S E.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Controlling the Experimentwise Type I Error Rate When Survival Analyses Are Planned for Subsets of the Sample. Greg Yothers, MA National Surgical Adjuvant.
Estimation and Reporting of Heterogeneity of Treatment Effects in Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare.
Fundamentals of Forensic DNA Typing Slides prepared by John M. Butler June 2009 Appendix 3 Probability and Statistics.
Testing means, part III The two-sample t-test. Sample Null hypothesis The population mean is equal to  o One-sample t-test Test statistic Null distribution.
Survival analysis1 Every achievement originates from the seed of determination.
Introduction to Chi-Square Procedures March 11, 2010.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Sample Size Determination Ziad Taib March 7, 2014.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
1 Efficacy Results NDA (MTP-PE) Laura Lu Statistical Reviewer Office of Biostatistics FDA/CDER.
1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.
Chapter 8 Introduction to Hypothesis Testing
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
Health and Disease in Populations 2001 Sources of variation (2) Jane Hutton (Paul Burton)
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 Statistical Review Dr. Shan Sun-Mitchell. 2 ENT Primary endpoint: Time to treatment failure by day 50 Placebo BDP Patients randomized Number.
PSY2004 Research Methods PSY2005 Applied Research Methods Week Five.
1 Statistics in Drug Development Mark Rothmann, Ph. D.* Division of Biometrics I Food and Drug Administration * The views expressed here are those of the.
All-or-None procedure: An outline Nanayaw Gyadu-Ankama Shoubhik Mondal Steven Cheng.
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
August, 2002MCP A General Multiplicity Adjustment Approach and Its Application to Evaluating Several Independently Conducted Studies Qian Li and Mohammad.
Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 18 Random Effects.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
1 BLA Sipuleucel-T (APC-8015) FDA Statistical Review and Findings Bo-Guang Zhen, PhD Statistical Reviewer, OBE, CBER March 29, 2007 Cellular, Tissue.
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
Division of Oncology Drug Products 1 AREAS OF MAJOR STATISTICAL CONCERNS IN THE M01 STUDY Overall (ITT Population) Finding Liver Metastasis Subgroup Finding.
European Patients’ Academy on Therapeutic Innovation The Purpose and Fundamentals of Statistics in Clinical Trials.
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
1 Pulminiq™ Cyclosporine Inhalation Solution Pulmonary Drug Advisory Committee Meeting June 6, 2005 Statistical Evaluation Statistical Evaluation Jyoti.
Pulmonary-Allergy Drugs Advisory Committee May 1, 2007 OutlineOutline History of development programHistory of development program –Dr. Carol Bosken Introduction.
Lec. 19 – Hypothesis Testing: The Null and Types of Error.
1 Chapter 6 SAMPLE SIZE ISSUES Ref: Lachin, Controlled Clinical Trials 2:93-113, 1981.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Methods of Presenting and Interpreting Information Class 9.
Carolinas Medical Center, Charlotte, NC Website:
BIOST 513 Discussion Section - Week 10
Chapter 10: Comparing Two Populations or Groups
Part Four ANALYSIS AND PRESENTATION OF DATA
Test for Goodness of Fit
Chapter 2 Simple Comparative Experiments
Chapter 10: Comparing Two Populations or Groups
Multiple Endpoint Testing in Clinical Trials – Some Issues & Considerations Mohammad Huque, Ph.D. Division of Biometrics III/Office of Biostatistics/OPaSS/CDER/FDA.
Review of Chapter 11 Comparison of Two Populations
Inferences on Two Samples Summary
Analysis of Variance (ANOVA)
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Chapter 9: Hypothesis Testing
Saturday, August 06, 2016 Farrokh Alemi, PhD.
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Presentation transcript:

1 Controlling False Positive Rate Due to Multiple Analyses Controlling False Positive Rate Due to Multiple Analyses Unstratified vs. Stratified Logrank Test Peiling Yang, Gang Chen, George Y.H. Chi DBI/OB/OPaSS/CDER/FDA The view expressed in this talk are those of the authors and may not necessarily represent those of the Food and Drug Administration.

2 Motivation: Example of Drug X

3 Issues to Explore Implication of these tests/analyses. Eligibility of efficacy claim based on these tests/analyses. Practicability of multiple testing/analyses.

4 Outline Notations / Settings Introduction to logrank test –Unstratified, stratified Comparisons –Hypotheses, test statistic, test procedure, inference Practicability of hypotheses Testing Multiple testing/analyses Example of Drug X Summary

5 Settings / Notations 2 arms (control j=1; experimental: j=2). K strata: k=1,.., K Patients randomized within strata t 1 < t 2 < …< t D : distinct death times d ijk : # of deaths & Y ijk : # of patients at risk at death time t i, in j th arm & k th stratum.

6 Settings / Notations

7 Hazard ratio (ctrl./exper.): constant –Across strata: c –Within stratum: c k Non-informative censoring

8 Introduction: Unstratified Logrank

9 W u ~ N(0,1) under least favorable parameter configuration (c=1) in. Reject if W u > z . Type I error rate is controlled at level .

10 Introduction: Stratified Logrank

11 Introduction: Stratified Logrank W s ~ N(0,1) under least favorable parameter configuration (c k = 1 for all k) in. Reject if W s > z . Type I error rate is controlled at level .

12 Comparison of Hypotheses Different hypotheses formulations:

13 Comparison of Test Statistics Corr(W u, W s ) = 1 because of same r.v. d.1. W s = a W u + b, wherewhere W u ~ N(0, 1)  W s ~ N(b, a 2 )

14 Comparison of Test Procedure

15 Comparison of Test Procedure

16 Comparison of Inference Rejection of : –Infer overall positive treatment effect in entire population. Rejection of : –Can only infer positive treatment effect in "at least one stratum". –Further testing to identify those strata required to make claim & error rate for identifying wrong strata also needs to be controlled.

17 Practicability of Hypotheses Testing Unstratified hypotheses are tested when desired to infer overall positive treatment effect in entire population. Stratified hypotheses are tested when desired to infer positive treatment effect in certain strata. Multiple testing of both unstratified & stratified hypotheses ok when not sure whether treatment is effective in entire population or certain strata (but both nulls need to be prespecified in protocol).

18 Multiple Testing/Analyses Multiple testing unstratified (use W u ) & stratified (use W s ) hypotheses. Error to control: strong familywise error (SFE), including the following: –When c  1 & all c k  1: falsely infer c or some c k ’s>1. –When c  1 & some c k ’s>1: falsely infer c>1 or wrong c k ’s>1 Note: parameter space of “all c k  1 but c>1” impossible.

19 Multiple Testing/Analyses c  1 & all c k  1 c>1 & at least one c k >1 impossible space c  1 & at least one c k >1 Property of SFE: FE nested in another FE. FE Which c k >1? Nested FE

20 Example -- Drug X W s = aW u +b, where a = 1.039, b=0.409 Critical value using W s should be adjusted to az  +b. False positive error rate using W s w/o adjustment = 0.066; –Inflation = = Ans.: This finding is not statistically significant. for

21 Figure 1: False positive rate vs. desired level (w/o adjustment)

22 Summary Hypotheses (unstratified or stratified or both) –should reflect what is desired to claim. –need to be prespecified in protocol. If stratified null is rejected, further testing required to identify in which strata treatment effect is positive. Strong family error rate needs to be controlled regardless of single or multiple testing.