Closed Testing and the Partitioning Principle Jason C. Hsu The Ohio State University MCP 2002 August 2002 Bethesda, Maryland.

Slides:



Advertisements
Similar presentations
Gatekeeping Testing Strategies in Clinical Trials Alex Dmitrienko, Ph.D. Eli Lilly and Company FDA/Industry Statistics Workshop September 2004.
Advertisements

Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample data Elements of a hypothesis test: Null hypothesis.
Issues of Simultaneous Tests for Non-Inferiority and Superiority Tie-Hua Ng*, Ph. D. U.S. Food and Drug Administration Presented at MCP.
STA305 Spring 2014 This started with excerpts from STA2101f13
The equivalence trial Didier Concordet NATIONAL VETERINARY S C H O O L T O U L O U S E.
CHAPTER 2 Building Empirical Model. Basic Statistical Concepts Consider this situation: The tension bond strength of portland cement mortar is an important.
Chapter 10.  Real life problems are usually different than just estimation of population statistics.  We try on the basis of experimental evidence Whether.
From the homework: Distribution of DNA fragments generated by Micrococcal nuclease digestion mean(nucs) = bp median(nucs) = 110 bp sd(nucs+ = 17.3.
A new group-sequential phase II/III clinical trial design Nigel Stallard and Tim Friede Warwick Medical School, University of Warwick, UK
Design and analysis of clinical trials MULTIPLE COMPARISONS.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Chapter Seventeen HYPOTHESIS TESTING
Elementary hypothesis testing
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
1 Equivalence and Bioequivalence: Frequentist and Bayesian views on sample size Mike Campbell ScHARR CHEBS FOCUS fortnight 1/04/03.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Inference about a Mean Part II
Chapter 2 Simple Comparative Experiments
Lecture 13: Tues., Feb. 24 Comparisons Among Several Groups – Introduction (Case Study 5.1.1) Comparing Any Two of the Several Means (Chapter 5.2) The.
Chapter 5 Inferences Regarding Population Central Values.
Chapter 7 Inferences Regarding Population Variances.
WHO Prequalification Program Workshop, Kiev, Ukraine, June 25-27,2007.
Qian H. Li, Lawrence Yu, Donald Schuirmann, Stella Machado, Yi Tsong
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Hypothesis Testing (Statistical Significance). Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample.
Classification of clinical trials
Testing and Estimation Procedures in Multi-Armed Designs with Treatment Selection Gernot Wassmer, PhD Institut für Medizinische Statistik, Informatik und.
Statistical Inference Decision Making (Hypothesis Testing) Decision Making (Hypothesis Testing) A formal method for decision making in the presence of.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
January 31 and February 3,  Some formulae are presented in this lecture to provide the general mathematical background to the topic or to demonstrate.
Design and Analysis of Experiments Dr. Tai-Yue Wang Department of Industrial and Information Management National Cheng Kung University Tainan, TAIWAN,
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
Decision-Theoretic Views on Switching Between Superiority and Non-Inferiority Testing. Peter Westfall Director, Center for Advanced Analytics and Business.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Logistic and Nonlinear Regression Logistic Regression - Dichotomous Response variable and numeric and/or categorical explanatory variable(s) –Goal: Model.
1 Chapter 1: Introduction to Design of Experiments 1.1 Review of Basic Statistical Concepts (Optional) 1.2 Introduction to Experimental Design 1.3 Completely.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
All-or-None procedure: An outline Nanayaw Gyadu-Ankama Shoubhik Mondal Steven Cheng.
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 9 1 MER301:Engineering Reliability LECTURE 9: Chapter 4: Decision Making for a Single.
Statistical considerations Drs. Jan Welink Training workshop: Assessment of Interchangeable Multisource Medicines, Kenya, August 2009.
Unit 8 Section 8-1 & : Steps in Hypothesis Testing- Traditional Method  Hypothesis Testing – a decision making process for evaluating a claim.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
August, 2002MCP A General Multiplicity Adjustment Approach and Its Application to Evaluating Several Independently Conducted Studies Qian Li and Mohammad.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
WHO Prequalification Programme June 2007 Training Workshop on Dissolution, Pharmaceutical Product Interchangeability and Biopharmaceutical Classification.
Comparison of 2 Population Means Goal: To compare 2 populations/treatments wrt a numeric outcome Sampling Design: Independent Samples (Parallel Groups)
1 Chihiro HIROTSU Meisei (明星) University Estimating the dose response pattern via multiple decision processes.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
BIOSTATISTICS Hypotheses testing and parameter estimation.
PEP-PMMA Training Session Statistical inference Lima, Peru Abdelkrim Araar / Jean-Yves Duclos 9-10 June 2007.
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
A Parametrized Strategy of Gatekeeping, Keeping Untouched the Probability of Having at Least One Significant Result Analysis of Primary and Secondary Endpoints.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 8: Inferences Based on a Single Sample: Tests of Hypotheses
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
STATISTICAL INFERENCE PART IV
Statistical inference
DOSE SPACING IN EARLY DOSE RESPONSE CLINICAL TRIAL DESIGNS
Confidence Intervals.
Covering Principle to Address Multiplicity in Hypothesis Testing
Presentation transcript:

Closed Testing and the Partitioning Principle Jason C. Hsu The Ohio State University MCP 2002 August 2002 Bethesda, Maryland

Principles of Test-Construction Union-Intersection Testing UIT Union-Intersection Testing UIT S. N. Roy Intersection-Union Testing IUT Intersection-Union Testing IUT Roger Berger (1982) Technometrics Closed testing Closed testing Marcus, Peritz, Gabriel (1976) Biometrika Partitioning Partitioning Stefansson, Kim, and Hsu (1984) Statistical Decision Theory and Related Topics, Berger & Gupta eds., Springer-Verlag. Finner and Strassberger (2002) Annals of Statistics Equivariant confidence set Equivariant confidence set Tukey (1953) Scheffe (195?) Dunnett (1955)

Partitioning confidence sets Multiple Comparison with the Best Multiple Comparison with the Best Gunnar Stefansson & Hsu 1-sided stepdown method (sample-determined steps) = Naik/Marcus-Peritz-Gabriel closed test 1-sided stepdown method (sample-determined steps) = Naik/Marcus-Peritz-Gabriel closed testHsu Multiple Comparison with the Sample Best Multiple Comparison with the Sample Best Woochul Kim & Hsu & Stefansson Bioequivalence Bioequivalence Ruberg & Hsu & G. Hwang & Liu & Casella & Brown 1-sided stepdown method (pre-determined steps) 1-sided stepdown method (pre-determined steps) Roger Berger & Hsu

Partitioning 1. Formulate hypotheses H 0i :    i * for i  I  i  I  i * = entire parameter space  i  I  i * = entire parameter space {  i * : i  I } partitions the parameter space {  i * : i  I } partitions the parameter space 2. Test each H 0i * :    i *, i  I, at  3. Infer    i if H 0i * is rejected 4. Pivot in each  i a confidence set C i for  5.  i  I C i is a 100(1  )% confidence set for 

Partitioning 1. Formulate hypotheses H 0i :    i for i  I  i  J  I = entire parameter space  i  J  I = entire parameter space 2. For each J  I, let  J * =  i  J  i  (  j  J  j ) c 3. Test each H 0J * :    J *, J  I, at  {  J * : J  I} partitions the parameter space {  J * : J  I} partitions the parameter space 4. Infer    J if H 0J * is rejected 5. Pivot in each  J a confidence set C J for  6.  J  J C J is a 100(1  )% confidence set for 

MCB confidence intervals  i  max j  i  j  [  (Y i  max j  i Y j  W) , (Y i  max j  i Y j + W) + ], i = 1, 2, …, k Upper bounds imply subset selection Upper bounds imply subset selection Lower bounds imply indifference zone selection Lower bounds imply indifference zone selection

Multiple Comparison with the Best 1. H 01 : Treatment 1 is the best 2. H 02 : Treatment 2 is the best 3. H 03 : Treatment 3 is the best 4. … Test each at  using 1-sided Dunnett’s Test each at  using 1-sided Dunnett’s Collate the results Collate the results

Partitioning picture

Union-Intersection Testing UIT 1. Form H a :  H ai (an “or” thing) 2. Test H 0 :  H 0i, the complement of H a 1. If reject, infer at least one H 0i false 2. Else, infer nothing

Closed Testing 1. Formulate hypotheses H 0i :    i for i  I 2. For each J  I, let  J =  i  J  i 3. Form closed family of null hypotheses {H 0J :    J : J  I} 4. Test each H 0J at  5. Infer    i  J  i if all H 0J’ with J  J’ rejected 6. Infer    i if all H 0J’ with i  J’ rejected

Oneway model Y ir =  i +  ir, i = 0, 1, 2, …, k, r = 1, …, n i  ir are i.i.d. Normal(0,  2 ) Dose i “efficacious” if  i >  1 +  ICH E10 (2000) Superiority if   0 Superiority if   0 Non-inferiority if  < 0 Non-inferiority if  < 0 Equivalence is 2-sided Equivalence is 2-sided Non-inferiority is 1-sided Non-inferiority is 1-sided

Closed testing null hypotheses (sample-determined steps) 1. H 02 : Dose 2 not efficacious 2. H 03 : Dose 3 not efficacious 3. H 01 : Doses 2 and 3 not efficacious Test each at  Test each at  Collate the results Collate the results

Partitioning null hypotheses (sample-determined steps) 1. H 01 : Doses 2 and 3 not efficacious 2. H 02 : Dose 2 not efficacious but dose 3 is 3. H 03 : Dose 3 not efficacious but dose 2 is Test each at  Test each at  Collate the results Collate the results

Partitioning implies closed testing Partitioning implies closed testing because A size  test for H 0i is a size  test for H 0i A size  test for H 0i is a size  test for H 0i Reject H 01 : Doses 2 and 3 not efficacious implies either dose 2 or dose 3 efficacious Reject H 01 : Doses 2 and 3 not efficacious implies either dose 2 or dose 3 efficacious Reject H 02 : Dose 2 not efficacious but dose 3 efficacious implies it is not the case dose 3 is efficacious but not dose 2 Reject H 02 : Dose 2 not efficacious but dose 3 efficacious implies it is not the case dose 3 is efficacious but not dose 2 Reject H 01 and H 02 thus implies dose 2 efficacious Reject H 01 and H 02 thus implies dose 2 efficacious

Intersection-Union Testing IUT 1. Form H a :  H ai (an “and” thing) 2. Test H 0 :  H 0i, the complement of H a 1. If reject, infer all H 0i false 2. Else, infer nothing

PK concentration in blood plasma curve

Bioequivalence defined Bioequivalence: clinical equivalence between 1. Brand name drug 2. Generic drug Bioequivalence parameters AUC = Area Under the Curve AUC = Area Under the Curve C max = maximum Concentration C max = maximum Concentration T max = Time to maximum concentratin T max = Time to maximum concentratin

Average bioequivalence Notation   = expected value of brand name drug  2 = expected value of generic drug Average bioequivalence means.8 <   /  2 < 1.25 for AUC and.8 <   /  2 < 1.25 for C max

Bioequivalence in practice If log of observations are normal with means   and  2 and equal variances, then average bioequivalence becomes log(.8) <    2 < log(1.25) for AUC and log(.8) <    2 < log(1.25) for C max

Partitioning Partition the parameter space as 1. H 0< :    2 <  log(0.8) 2. H 0> :    2 > log(1.25) 3. H a : log(.8) <    2 < log(1.25) Test H 0 each at . Infer log(.8) rejected. Controls P{incorrect decision} at .

Dose-Response (Phase II)

Anti-psychotic drug efficacy trial Dose of Seroquel (mg) Dose of Seroquel (mg) n iiii SE Arvanitis et al. (1997 Biological Psychiatry) CGI = Clinical Global Impression

Minimum Effective Dose (MED) Minimu Effective Dose =MED =smallest i so that  i >  1 +  for all j, i  j  k Want an upper confidence bound MED + so that P{MED < MED + }  100(1  )%

Closed testing inference Infer nothing if H 01 is accepted Infer nothing if H 01 is accepted Infer at least one of doses 2 and 3 effective if H 01 is rejected Infer at least one of doses 2 and 3 effective if H 01 is rejected Infer dose 2 effective if, in addition to H 01, H 02 is rejected Infer dose 2 effective if, in addition to H 01, H 02 is rejected Infer dose 3 effective if, in addition to H 01, H 03 is rejected Infer dose 3 effective if, in addition to H 01, H 03 is rejected

Closed testing method (sample-determined steps) Start from H 01 to H 02 and H 03 Start from H 01 to H 02 and H 03 Stepdown from smallest p-value to largest p-value Stepdown from smallest p-value to largest p-value Stop as soon as one fails to reject Stop as soon as one fails to reject Multiplicity adjustment decreases from k to k  1 to k  2  to 2 from step 1 to 2 to 3 … to step k  1 Multiplicity adjustment decreases from k to k  1 to k  2  to 2 from step 1 to 2 to 3 … to step k  1

Partitioning picture

Tests of equalities (pre-determined steps) 1. H 0k :  1 =  2 =  =  k H ak :  1 =  2 =  <  k 2. H 0(k  1) :  1 =  2 =  =  k  1 H a(k  1) :  1 =  2 =  <  k  1 3. H 0(k  2) :  1 =  2 =  =  k  2 H a(k  2) :  1 =  2 =  <  k  2 4.  5. H 02 :  1 =  2 H a2 :  1 <  2

Closed testing of equalities Null hypotheses are nested Null hypotheses are nested 1. Closure of family remains H 0k  H Test each H 0i at  3. Stepdown from dose k to dose k  1 to  to dose 2 4. Stop as soon as one fails to reject 5. Multiplicity adjustment not needed

Testing equalities is easy 1. H 0k :  1 =  =  k 2.  3. H 02 :  1 =  2 H 0i  H 0i H 0i  H 0i 1. H 0k :  1   k 2.  3. H 02 :  1   2 H 0i  H 0i

Partitioning null hypotheses (for pre-determined steps) 1. H 0k :Dose k not efficacious 2. H 0(k-1) :Dose k efficacious but dose k  1 not efficacious 3. H 0(k-1) :Doses k and k  1 efficacious but dose k  2 not efficacious 4.  5. H 02 :Doses k  3 efficacious but dose 2 not efficacious Test each at  Test each at  Collate the results Collate the results

Partitioning inference 1. Infer nothing if H 0k is accepted 2. Infer dose k effective if H 0k is rejected 3. Infer dose k  1 effective if, in addition to H 0k, H 0(k-1) is rejected 4. Infer dose k  2 effective if, in addition to H 0k and H 0(k-1), H 03 is rejected 5. 

Partitioning method (for pre-determined steps) Stepdown from dose k to dose k  1 to  to dose 2 Stepdown from dose k to dose k  1 to  to dose 2 Stop as soon as one fails to reject Stop as soon as one fails to reject Multiplicity adjustment not needed Multiplicity adjustment not needed Any pre-determined sequence of doses can be used Any pre-determined sequence of doses can be used Confidence set given in Hsu and Berger (1999 JASA) Confidence set given in Hsu and Berger (1999 JASA)

Pairwise t tests for partitioning Size  tests for H 0k  H 02 are also size  test for H 0k  H H 0k :Dose k not efficacious 2. H 0(k-1) :Dose k  1 not efficacious 3. H 0(k-2) :Dose k  2 not efficacious 4.  5. H 02 :Dose 2 not efficacious Test each with a size-  2-sample 1-sided t-test Test each with a size-  2-sample 1-sided t-test

Testing equalities is easy 1. H 0k :  1 =  =  k 2.  3. H 02 :  1 =  2 H 0i  H 0i H 0i  H 0i 1. H 0k :  1   k 2.  3. H 02 :  1   2 H 0i  H 0i

Could reject for the wrong reason H0H0H0H0 HaHaHaHaneither