Outline of Stratification Lectures Definitions, examples and rationale (credibility) Implementation –Fixed allocation (permuted blocks) –Adaptive (minimization)

Slides:



Advertisements
Similar presentations
FDA/Industry Workshop September, 19, 2003 Johnson & Johnson Pharmaceutical Research and Development L.L.C. 1 Uses and Abuses of (Adaptive) Randomization:
Advertisements

COMPUTER INTENSIVE AND RE-RANDOMIZATION TESTS IN CLINICAL TRIALS Thomas Hammerstrom, Ph.D. USFDA, Division of Biometrics The opinions expressed are those.
When Should a Clinical Trial Design with Pre-Stratification be Used? Group 1.
Introduction Simple Random Sampling Stratified Random Sampling
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
PRE-STRATIFIED RANDOMIZATION IS NOT NECESSARY FOR LARGE CLINICAL TRIALS Brent Leininger, Patrick Kurkiewicz, Lifeng Lin, Xiang Li, Bryan Trottier Jr, Yuanyuan.
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
Lecture 11 (Chapter 9).
LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.
Many Important Issues Covered Current status of ICH E5 and implementation in individual Asian countries Implementation at a regional level (EU) and practical.
Chapter 5 Stratified Random Sampling n Advantages of stratified random sampling n How to select stratified random sample n Estimating population mean and.
When Is Stratification Detrimental to a Clinical Trial Design? Part II Katherine L. Monti, Ph.D. Senior Statistical Scientist and Director of the Massachusetts.
1 Multifactor ANOVA. 2 What We Will Learn Two-factor ANOVA K ij =1 Two-factor ANOVA K ij =1 –Interaction –Tukey’s with multiple comparisons –Concept of.
Sampling.
Chapter 10 Sampling and Sampling Distributions
Sampling and Randomness
Chapter 28 Design of Experiments (DOE). Objectives Define basic design of experiments (DOE) terminology. Apply DOE principles. Plan, organize, and evaluate.
11 Populations and Samples.
A new sampling method: stratified sampling
Stratified Simple Random Sampling (Chapter 5, Textbook, Barnett, V
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
PROBABILITY SAMPLING: CONCEPTS AND TERMINOLOGY
Stratified Sampling Lecturer: Chad Jensen. Sampling Methods SRS (simple random sample) SRS (simple random sample) Systematic Systematic Convenience Convenience.
Course Content Introduction to the Research Process
Formalizing the Concepts: STRATIFICATION. These objectives are often contradictory in practice Sampling weights need to be used to analyze the data Sampling.
Common Designs for Controlled Clinical Trials A.Parallel Group Trials 1.Simplest example - 2 groups, no stratification 2.Stratified design 3.Matched pairs.
Stratification and Adjustment
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
1 Basic Scientific Research Topic 6: Sampling methods Dr Jihad ABDALLAH Source: Research Methods Knowledge Base
Sampling Techniques LEARNING OBJECTIVES : After studying this module, participants will be able to : 1. Identify and define the population to be studied.
Chapter 1: The Nature of Statistics
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
Part III Gathering Data.
Lecture 6 Forestry 3218 Forest Mensuration II Lecture 6 Double Sampling Cluster Sampling Sampling for Discrete Variables Avery and Burkhart, Chapter 3.
Validation / citations. Validation u Expert review of model structure u Expert review of basic code implementation u Reproduce original inputs u Correctly.
Sampling Design and Analysis MTH 494 Lecture-30 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
1 Chapter Two: Sampling Methods §know the reasons of sampling §use the table of random numbers §perform Simple Random, Systematic, Stratified, Cluster,
Lohr 2.2 a) Unit 1 is included in samples 1 and 3.  1 is therefore 1/8 + 1/8 = 1/4 Unit 2 is included in samples 2 and 4.  2 is therefore 1/4 + 3/8 =
Adaptive randomization
1 THE ROLE OF COVARIATES IN CLINICAL TRIALS ANALYSES Ralph B. D’Agostino, Sr., PhD Boston University FDA ODAC March 13, 2006.
Use of Candidate Predictive Biomarkers in the Design of Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait,
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
1 Lecture 10: Meta-analysis of intervention studies Introduction to meta-analysis Selection of studies Abstraction of information Quality scores Methods.
Sample Size Determination
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
Instructor Resource Chapter 15 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
When Is Stratification Detrimental to a Clinical Trial Design? Part I Gretchen Marcucci, M.S. Biostatistician, Rho, Inc. and Katherine L. Monti, Ph.D.
Sampling Design and Analysis MTH 494 Lecture-21 Ossam Chohan Assistant Professor CIIT Abbottabad.
Probability Sampling. Simple Random Sample (SRS) Stratified Random Sampling Cluster Sampling The only way to ensure a representative sample is to obtain.
1 Lecture 10: Meta-analysis of intervention studies Introduction to meta-analysis Selection of studies Abstraction of information Quality scores Methods.
1. 2 DRAWING SIMPLE RANDOM SAMPLING 1.Use random # table 2.Assign each element a # 3.Use random # table to select elements in a sample.
Sampling Concepts Nursing Research. Population  Population the group you are ultimately interested in knowing more about “entire aggregation of cases.
Dr.N.K.Tyagi, SAMPLE SIZE The average in the form of estimate ‘p’ or mean should be of known along with its precision and tolerable error,
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Topic 2: Types of Statistical Studies
Statistical Core Didactic
The 4th ICTMC & 38th Annual Meeting of SCT
Elaine M Pascoe, Darsy Darssan, Liza A Vergara
STRATIFIED SAMPLING.
2. Stratified Random Sampling.
Joanna Romaniuk Quanticate, Warsaw, Poland
Sampling and estimation
Björn Bornkamp, Georgina Bermann
Presentation transcript:

Outline of Stratification Lectures Definitions, examples and rationale (credibility) Implementation –Fixed allocation (permuted blocks) –Adaptive (minimization) Rationale - variance reduction

Stratification A procedure in which factors known to be associated with the response (prognostic factors) are taken into account in the design (e.g., randomization) Pre-stratification refers to a stratified design; post-stratification refers to the analysis

Pre- versus Post Stratification and Precision (Variance Reduction) As a general rule, the precision gained with pre- versus post-stratification is less than one might expect The gain in precision is greatest in small studies (where you need it the most) because the risk of chance imbalance is greater. Covariate adjustment for prognostic factors is usually carried out with regression (e.g., linear, logistic, or proportional hazards regression.

Stratification Can Increase Precision Simple versus stratified random sampling. Snedecor and Cochran note (p. 520): “If we form strata so that a heterogeneous population is divided into parts each of which is fairly homogeneous, we may expect a gain in precision over simple random sampling”. Ref. Snedecor and Cochran, Statistical Methods

Stratification Can Increase Precision Randomized block versus completely random design. Snedecor and Cochran note (p. 299): “Knowledge (about predictors or response) can be used to increase the accuracy of experiments. If there are a treatments to be compared,…first arrange the experimental units in groups of a, often called replications. The rule is that units assigned to the same replication should be as similar in responsiveness as possible. Each treatment is then allocated by randomization to one unit in each replication…Replications are therefore usually compact areas of land…This experimental plan is called randomized blocks.”

Pre-stratification Does Not Matter. Peto et al note: “As long as good statistical methods,…,are used to analyze data from clinical trials, there is no need for randomization to be stratified by prognostic features.” Keep it simple so investigators are not discouraged from participating. Post-stratified analysis is needed with pre- stratification anyway. Improvement in sensitivity (precision) with pre- stratification compared to letting stratum sizes be determined by chance is small. Peto R et al., Br. J Cancer, pp ,1976

Stratified Design for Comparing Treatments StratumAB m1m1 m2m2 m3m3 m4m4 nana nbnb Treatment m 1A m 2A m 3A m 4A m 1B m 2B m 3B m 4B Typical situation:m1m1 ≠m2m2 m3m3 m4m4 ≠≠ Study is designed/powered based on n a and n b Goal: m iA = m iB for all i.

How much of a price does one pay with respect to precision by trusting randomization to achieve reasonable balance? Consider the relative efficiency of a stratified design to an unstratified design: Var (treatment contrast with stratification) Var (treatment contrast with no stratification in design, but post-stratified analyses) RE =

Pooling Estimates Estimates: E, E Var (E ) =  w E + w E Pooled Estimate: Best Pooled Est: w = , w =  w  + w  Variance Pooled Est: w + w (w + w )

Continuous response, equal variance - effect of chance imbalance n A =total number randomly assigned to A n B =total number randomly assigned to B g =fraction of those given A with prognostic factor h =fraction of those given B with prognostic factor Treatment Stratum A B S1S1 S2S2 A n g B n h nAnA nBnB (1-g)nAnA (1-h)nBnB RE = A n g B n h (1-g) (1-h) A n g B n h + + A n g B n h + A n B n+ 1 - 

RE obtained by noting: 1)Var( y - y ) = 1A1B 1 + n g A 1 n h + ()  2 2)Var(y - y ) = 2A 2B n A n B ( )  2 (1-g) (1-h) 3)Pooled variance is: Var Pooled (y - y ) = AB wiwi 2 Var (y - y ) iAiB 2  w i  2 w 1 = A n g B n h + A n g B n h A n B n (1-g) (1-h) + w 2 = A n B n (1-g)(1-h) B 1 1

For Stratified Design, g = h w = 1 n A n B + n A n B g 2 n A n B + n A n B (1 - g)

Assume A n B n = RE = g(1-g)h(1-h) g + h       e.g., block randomization used Consider the case of g = 2h: 0.10, , , , g, h RE

Bernouli Response Loss of efficiency = h2n2h2n2 1 - h = lack of balance 2n = number in each stratum Ref: Meier (Controlled Clinical Trials, 1981)

This can be seen by noting: 1)Stratified design, for stratum 1 Var (p - p ) = AB since n = n = n A1 B n1n A 1 + 1n1n B 1 p1q1p1q1 () 2n2n = p1p1 q1q1 (()) Note: q 1 = 1- p 1

The ratio of these variances is proportional to: 2)No stratification in design; post-stratification in analysis Var (p - p ) = A B1 1 1 n+h + 1 n-h p1q1p1q1 () 1/n+h + 1/n-h 2/n = h2n2h2n2 1 -

n = 10 1(11, 9)0.99 2(12, 8)0.96 4(14, 6)0.86 5(15, 5)0.75 h RE (n, n ) 1 A 1 B

Example: Brown et al. Clinical Trial of Tetanus Anti-toxin in Treatment of Tetanus. Lancet, ;1960 (see also Meier, Cont Clin Trials, 1981; a slightly different approach is taken here). Anti- Toxin (A) Alive Dead No Anti- Toxin (B) p= overall death rate = = ^

= 20/41 = p ^ A = 29/38 = p ^ B p ^ A p ^ B - = Var p ^ A p ^ B - () = + A 1n1n B 1n1n ][ p ^ 1p ^ - () = [(())] + = SE p ^ A p ^ B - () =

Time from first symptoms to admission turned out to be an important prognostic factor; therefore, post-stratification was carried out. A Alive Dead B A Alive Dead B 13 8 < 72 Hours ≥ 72 Hours

Stratum 1: < 72 hours Stratum 2: ≥ 72 hours p ^ 1A p ^ 1B - = p ^ 1A = p ^ 1B = p ^ 1 = SE( ) = p ^ 1A p ^ 1B - p ^ 2A p ^ 2B - = p ^ 2 = SE( ) = p ^ 2A p ^ 2B -

Weighted diff. ( ˆ p A - ˆ p B ) w ˆ Let G = fraction of patients in Stratum 1 = 58/79 = ( ˆ p A - ˆ p B ) w = ˆ G( ˆ p 1A - ˆ p 1B )+ (1- G)( ˆ p 2A - ˆ p 2B ) = compared to unweighted VAR(p - p ) = G VAR(p - p ) ˆ A ˆ B w ˆ 1A ˆ 1B ˆ SE(p - p ˆ A - ˆ B ) w = (1- G ) VAR( ˆ p 2A - ˆ p 2B )= ˆ ˆ

Gain in precision achieved with post- stratification Var(post-stratification) Var(no stratification) RE = = = % reduction

How much gain in precision would be achieved if stratification was used in the design?

Force balance within stratum Assume ˆ p ‘s don’t change ij ˆ 1A ˆ 1B ˆ 2A ˆ 2B A Alive Dead B 29 A Alive Dead B < 72 Hours≥ 72 Hours SE(p - p ) = instead of SE(p - p ) = instead of 0.191

Var(stratified design) Var(no stratification) RE 1 = (0.096) (0.109) = % reduction = 2 2 Var(stratified design) Var(post-stratification) RE 2 = (0.096) (0.097) = % reduction = 2 2 SE stratified design = (same weights are used)

GREP 1. P 2. Gp 1. (1-p 1. ) + (1-G)p 2. (1-p 2. ) [ Gp 1. + (1-G)p 2. ] [1 – Gp 1. – (1-G)p 2. ] RE = If p 1. = p 2. Then RE = 1

1)the distribution of the prognostic factor in the population; 2)the relative strength of the prognostic factor; and 3)the expected endpoint rate in the group studied. The reduction in variance achieved with post- stratification depends on:

Scott’s Survey of Trials Published in Lancet and N Eng J Med in 2001 Stratification Permuted block43/150 Minimization6/150 Other adaptive3/150 Other19/150 Unspecified79/150 Scott et al. Cont Clin Trials 2002; 23:

Kahan’s Survey of 258 Trials Published in Four Major Medical Journals in 2010 Method of Randomization Simple: 4 Permuted blocks, no stratification: 40 Permuted blocks, stratification: 85 Minimization: 29 Other: 4 Unclear: 96 Kahan BC et al. BMJ 2012; 345:e5840

Conclusions 1.Usually there is little loss of efficiency with post- stratification as compared to a stratified design. 2.Loss of efficiency results from large chance imbalances for important prognostic factors, which are more likely in small studies. 3.Stratified designs should be considered in small studies (n < 50) with important prognostic factors. 4.Strictly speaking, analysis should account for pre-stratification.

Recommendation for Multi-Center Trials: Always Consider Stratification on Center 1.Clinic populations differ. 2.Treatment differs from clinic to clinic. 3.Each center represents a replicate of overall trial – can investigate treatment x clinic interactions. 4.In some trials (surgery), it may be better to stratify on surgeon within clinic. 5.If there are a very large number of clinical sites, small block size may have to be used and site combined into a priori defined larger strata (e.g., region or country) for analysis

General Recommendations Large trials –Block randomization with stratification by center –Stratification on other factors not necessary (I am a lumper) –If needed, usually okay to carry out block randomization within each stratum Small trials –Block randomization with stratification by center –If stratification on other factors is considered, may have to use an adaptive approach These are consistent with Freidman, Furberg and DeMets (see page 111)