Published by Ralph Knight. Modified over 8 years ago.
1
Outline of Stratification Lectures
Definitions, examples and rationale (credibility)
Implementation
–Fixed allocation (permuted blocks)
–Adaptive (minimization)
Rationale - variance reduction
2
Stratification
A procedure in which factors known to be associated with the response (prognostic factors) are taken into account in the design (e.g., randomization).
Pre-stratification refers to a stratified design; post-stratification refers to the analysis.
3
Pre- versus Post-Stratification and Precision (Variance Reduction)
As a general rule, the precision gained with pre- versus post-stratification is less than one might expect.
The gain in precision is greatest in small studies (where you need it the most) because the risk of chance imbalance is greater.
Covariate adjustment for prognostic factors is usually carried out with regression (e.g., linear, logistic, or proportional hazards regression).
4
Stratification Can Increase Precision
Simple versus stratified random sampling.
Snedecor and Cochran note (p. 520): “If we form strata so that a heterogeneous population is divided into parts each of which is fairly homogeneous, we may expect a gain in precision over simple random sampling”.
Ref. Snedecor and Cochran, Statistical Methods
5
Stratification Can Increase Precision
Randomized block versus completely random design.
Snedecor and Cochran note (p. 299): “Knowledge (about predictors or response) can be used to increase the accuracy of experiments. If there are a treatments to be compared,…first arrange the experimental units in groups of a, often called replications. The rule is that units assigned to the same replication should be as similar in responsiveness as possible. Each treatment is then allocated by randomization to one unit in each replication…Replications are therefore usually compact areas of land…This experimental plan is called randomized blocks.”
6
Pre-stratification Does Not Matter.
Peto et al. note: “As long as good statistical methods,…,are used to analyze data from clinical trials, there is no need for randomization to be stratified by prognostic features.”
Keep it simple so investigators are not discouraged from participating.
Post-stratified analysis is needed with pre-stratification anyway.
Improvement in sensitivity (precision) with pre-stratification compared to letting stratum sizes be determined by chance is small.
Peto R et al., Br. J Cancer, pp. 585-612, 1976
7
Stratified Design for Comparing Treatments

Stratum   Treatment A   Treatment B   Total
1         $m_{1A}$      $m_{1B}$      $m_1$
2         $m_{2A}$      $m_{2B}$      $m_2$
3         $m_{3A}$      $m_{3B}$      $m_3$
4         $m_{4A}$      $m_{4B}$      $m_4$
Total     $n_a$         $n_b$

Typical situation: $m_1 \neq m_2 \neq m_3 \neq m_4$
Study is designed/powered based on $n_a$ and $n_b$
Goal: $m_{iA} = m_{iB}$ for all $i$.
8
How much of a price does one pay with respect to precision by trusting randomization to achieve reasonable balance?
Consider the relative efficiency of a stratified design to an unstratified design:
$$RE = \frac{\operatorname{Var}(\text{treatment contrast with stratification})}{\operatorname{Var}(\text{treatment contrast with no stratification in design, but post-stratified analyses})}$$
9
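Pooling Estimates
Estimates: $E_1$, $E_2$, with $\operatorname{Var}(E_1) = \sigma_1^2$ and $\operatorname{Var}(E_2) = \sigma_2^2$
Pooled estimate: $$\frac{w_1 E_1 + w_2 E_2}{w_1 + w_2}$$
Best pooled estimate: $w_1 = 1/\sigma_1^2$, $w_2 = 1/\sigma_2^2$
Variance of pooled estimate: $$\frac{w_1^2 \sigma_1^2 + w_2^2 \sigma_2^2}{(w_1 + w_2)^2}$$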
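The pooling rule on this slide can be sketched in a few lines of Python. This is an illustration only: the function name `pool` and the example numbers are hypothetical, and the estimates are assumed to be independent.

```python
def pool(estimates, variances):
    """Inverse-variance pooling of independent estimates.

    Returns the pooled estimate and its variance, using weights
    w_i = 1 / variance_i (the 'best' weights on the slide).
    """
    weights = [1.0 / v for v in variances]
    total = sum(weights)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / total
    # General formula: sum(w_i^2 * var_i) / (sum w_i)^2.
    pooled_var = sum(w ** 2 * v for w, v in zip(weights, variances)) / total ** 2
    return pooled, pooled_var


# Hypothetical inputs: two estimates with variances 1 and 4.
estimate, variance = pool([1.0, 3.0], [1.0, 4.0])
print(estimate, variance)
```

With inverse-variance weights the pooled variance simplifies to 1 / (w_1 + w_2), which is why the more precise estimate dominates the result.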
10
Continuous response, equal variance - effect of chance imbalance
$n_A$ = total number randomly assigned to A
$n_B$ = total number randomly assigned to B
$g$ = fraction of those given A with prognostic factor
$h$ = fraction of those given B with prognostic factor

Stratum   Treatment A     Treatment B
$S_1$     $n_A g$         $n_B h$
$S_2$     $n_A (1-g)$     $n_B (1-h)$
Total     $n_A$           $n_B$

$$RE = \frac{\dfrac{n_A g \, n_B h}{n_A g + n_B h} + \dfrac{n_A (1-g) \, n_B (1-h)}{n_A (1-g) + n_B (1-h)}}{\dfrac{n_A n_B}{n_A + n_B}}$$
11
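RE obtained by noting:
1) $$\operatorname{Var}(\bar{y}_{1A} - \bar{y}_{1B}) = \sigma^2 \left( \frac{1}{n_A g} + \frac{1}{n_B h} \right)$$
2) $$\operatorname{Var}(\bar{y}_{2A} - \bar{y}_{2B}) = \sigma^2 \left( \frac{1}{n_A (1-g)} + \frac{1}{n_B (1-h)} \right)$$
3) Pooled variance is: $$\operatorname{Var}_{\text{Pooled}}(\bar{y}_A - \bar{y}_B) = \frac{\sum_i w_i^2 \operatorname{Var}(\bar{y}_{iA} - \bar{y}_{iB})}{\left( \sum_i w_i \right)^2}$$
$$w_1 = \frac{n_A g \, n_B h}{n_A g + n_B h}, \qquad w_2 = \frac{n_A n_B (1-g)(1-h)}{n_A (1-g) + n_B (1-h)}$$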
12
For Stratified Design, $g = h$
$$w_1 = \frac{n_A n_B \, g}{n_A + n_B}, \qquad w_2 = \frac{n_A n_B \, (1-g)}{n_A + n_B}$$
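A short worked step may help connect these weights to the relative efficiency. It is a sketch under the equal-variance assumption, with $\sigma^2$ factored out of the weights so that $w_i = \sigma^2 / \operatorname{Var}_i$ (that normalization is assumed here, not stated on the slides). The subscripts "strat" and "post" are labels introduced for this illustration.

```latex
\begin{align*}
\operatorname{Var}_{\text{Pooled}}
  &= \frac{\sum_i w_i^2 \,(\sigma^2 / w_i)}{\left(\sum_i w_i\right)^2}
   = \frac{\sigma^2}{w_1 + w_2}, \\
\text{stratified design } (g = h):\quad
w_1 + w_2 &= \frac{n_A n_B\, g}{n_A + n_B} + \frac{n_A n_B\,(1-g)}{n_A + n_B}
   = \frac{n_A n_B}{n_A + n_B}, \\
\operatorname{Var}_{\text{strat}}
  &= \sigma^2 \left( \frac{1}{n_A} + \frac{1}{n_B} \right), \\
RE = \frac{\operatorname{Var}_{\text{strat}}}{\operatorname{Var}_{\text{post}}}
  &= \frac{(w_1 + w_2)_{\text{post}}}{(w_1 + w_2)_{\text{strat}}}
   = \frac{n_A + n_B}{n_A n_B}
     \left[ \frac{n_A g\, n_B h}{n_A g + n_B h}
          + \frac{n_A (1-g)\, n_B (1-h)}{n_A (1-g) + n_B (1-h)} \right].
\end{align*}
```

When $g = h$ the bracket reduces to $n_A n_B / (n_A + n_B)$ and $RE = 1$, as expected for a balanced design.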
13
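Assume $n_A = n_B$ (e.g., block randomization used)
$$RE = 2 \left[ \frac{g h}{g + h} + \frac{(1-g)(1-h)}{2 - g - h} \right] = 1 - \frac{(g - h)^2}{(g + h)(2 - g - h)}$$
Consider the case of $g = 2h$:

g, h            RE
0.10, 0.05      0.99
0.25, 0.125     0.97
0.50, 0.25      0.93
0.75, 0.375     0.86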
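The tabulated values can be checked numerically. The sketch below assumes the equal-variance, inverse-variance-weighted setup from the preceding slides. The function name `relative_efficiency` is hypothetical, and the defaults `n_a = n_b = 1` simply encode equal allocation, since only the ratio of the two sample sizes affects the result.

```python
def relative_efficiency(g, h, n_a=1.0, n_b=1.0):
    """RE of a stratified design versus post-stratification only.

    g, h: fractions of arms A and B that have the prognostic factor.
    Weights are inverse variances with sigma^2 factored out.
    Not defined when a stratum is empty in both arms (g = h = 0 or 1).
    """
    w1 = (n_a * g * n_b * h) / (n_a * g + n_b * h)
    w2 = (n_a * (1 - g) * n_b * (1 - h)) / (n_a * (1 - g) + n_b * (1 - h))
    w_stratified = n_a * n_b / (n_a + n_b)  # total weight when g == h
    return (w1 + w2) / w_stratified


for g, h in [(0.10, 0.05), (0.25, 0.125), (0.50, 0.25), (0.75, 0.375)]:
    print(f"g={g}, h={h}: RE={relative_efficiency(g, h):.2f}")
```

Even when one arm has twice the prevalence of the factor, the loss stays below 15% across these cases.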
14
Bernoulli Response
$$RE = 1 - \frac{h^2}{n^2}$$ (loss of efficiency = $h^2 / n^2$)
$h$ = lack of balance ($n + h$ assigned to A and $n - h$ to B within the stratum)
$2n$ = number in each stratum
Ref: Meier (Controlled Clinical Trials, 1981)
15
This can be seen by noting:
1) Stratified design, for stratum 1:
$$\operatorname{Var}(\hat{p}_{1A} - \hat{p}_{1B}) = p_1 q_1 \left( \frac{1}{n_{1A}} + \frac{1}{n_{1B}} \right) = p_1 q_1 \left( \frac{2}{n} \right)$$
since $n_{1A} = n_{1B} = n$
Note: $q_1 = 1 - p_1$
16
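2) No stratification in design; post-stratification in analysis:
$$\operatorname{Var}(\hat{p}_{1A} - \hat{p}_{1B}) = p_1 q_1 \left( \frac{1}{n + h} + \frac{1}{n - h} \right)$$
The ratio of these variances (stratified design to unstratified design) is:
$$\frac{2/n}{\dfrac{1}{n+h} + \dfrac{1}{n-h}} = 1 - \frac{h^2}{n^2}$$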
17
$n = 10$

h    ($n_{1A}$, $n_{1B}$)    RE
1    (11, 9)                 0.99
2    (12, 8)                 0.96
4    (14, 6)                 0.84
5    (15, 5)                 0.75
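A minimal sketch of the Bernoulli calculation, assuming a single stratum of size 2n with n + h patients on A and n - h on B. The function name `re_bernoulli` is hypothetical. The common factor p_1 q_1 cancels in the ratio, so it is omitted.

```python
def re_bernoulli(n, h):
    """Variance ratio: balanced (n, n) versus imbalanced (n + h, n - h).

    Requires 0 <= h < n so that both arms are non-empty.
    """
    var_balanced = 1.0 / n + 1.0 / n
    var_imbalanced = 1.0 / (n + h) + 1.0 / (n - h)
    return var_balanced / var_imbalanced


n = 10
for h in (1, 2, 4, 5):
    # Each value should agree with the closed form 1 - h^2 / n^2.
    print(h, (n + h, n - h), round(re_bernoulli(n, h), 2))
```

The loss grows with the square of the imbalance, so modest imbalances cost very little.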
18
Example: Brown et al. Clinical Trial of Tetanus Anti-toxin in Treatment of Tetanus. Lancet, 227-30; 1960 (see also Meier, Cont Clin Trials, 1981; a slightly different approach is taken here).

                     Alive   Dead   Total
Anti-Toxin (A)         21     20      41
No Anti-Toxin (B)       9     29      38
Total                  30     49      79

$\hat{p}$ = overall death rate = 49/79 = 0.620
19
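$\hat{p}_A = 20/41 = 0.488$
$\hat{p}_B = 29/38 = 0.763$
$\hat{p}_A - \hat{p}_B = -0.275$
$$\operatorname{Var}(\hat{p}_A - \hat{p}_B) = \hat{p}(1 - \hat{p}) \left[ \frac{1}{n_A} + \frac{1}{n_B} \right] = \frac{49}{79} \left( 1 - \frac{49}{79} \right) \left[ \frac{1}{41} + \frac{1}{38} \right] = 0.01195$$
$SE(\hat{p}_A - \hat{p}_B) = 0.109$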
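The unstratified calculation can be reproduced directly from the 2x2 table. The variable names below are illustrative. Unrounded arithmetic gives a variance of about 0.01194, so the slide's 0.01195 presumably reflects rounding of intermediate values.

```python
from math import sqrt

# Deaths and arm sizes from the overall tetanus table.
deaths_a, n_a = 20, 41   # anti-toxin
deaths_b, n_b = 29, 38   # no anti-toxin

p_a = deaths_a / n_a
p_b = deaths_b / n_b
diff = p_a - p_b

# Pooled death rate, used for both arms as on the slide.
p = (deaths_a + deaths_b) / (n_a + n_b)
var_unstratified = p * (1 - p) * (1 / n_a + 1 / n_b)
se_unstratified = sqrt(var_unstratified)

print(f"diff={diff:.3f}, var={var_unstratified:.5f}, se={se_unstratified:.3f}")
```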
20
Time from first symptoms to admission turned out to be an important prognostic factor; therefore, post-stratification was carried out.

< 72 Hours
       Alive   Dead   Total
A        10     18      28
B         4     26      30

≥ 72 Hours
       Alive   Dead   Total
A        11      2      13
B         5      3       8
21
Stratum 1: < 72 hours
$\hat{p}_{1A} = 0.643$
$\hat{p}_{1B} = 0.866$
$\hat{p}_{1A} - \hat{p}_{1B} = -0.223$
$\hat{p}_1 = 0.759$
$SE(\hat{p}_{1A} - \hat{p}_{1B}) = 0.112$

Stratum 2: ≥ 72 hours
$\hat{p}_{2A} - \hat{p}_{2B} = -0.221$
$\hat{p}_2 = 0.238$
$SE(\hat{p}_{2A} - \hat{p}_{2B}) = 0.191$
22
Weighted diff. $(\hat{p}_A - \hat{p}_B)_w$
Let $G$ = fraction of patients in Stratum 1 = 58/79 = 0.734
$$(\hat{p}_A - \hat{p}_B)_w = G(\hat{p}_{1A} - \hat{p}_{1B}) + (1 - G)(\hat{p}_{2A} - \hat{p}_{2B}) = -0.223$$
compared to -0.275 unweighted
$$\widehat{\operatorname{Var}}(\hat{p}_A - \hat{p}_B)_w = G^2 \, \widehat{\operatorname{Var}}(\hat{p}_{1A} - \hat{p}_{1B}) + (1 - G)^2 \, \widehat{\operatorname{Var}}(\hat{p}_{2A} - \hat{p}_{2B}) = 0.00938$$
$\widehat{SE}(\hat{p}_A - \hat{p}_B)_w = 0.097$
23
Gain in precision achieved with post-stratification
$$RE = \frac{\operatorname{Var}(\text{post-stratification})}{\operatorname{Var}(\text{no stratification})} = \frac{0.00938}{0.01195} = 0.78$$
22% reduction
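The post-stratified analysis can be sketched end to end from the two stratum tables. The data structure and function names are hypothetical. As on the slides, each stratum uses its pooled death rate for the variance, and strata are weighted by their share of patients. Unrounded arithmetic gives a weighted variance near 0.0094 and an RE near 0.79, so small differences from the slide's 0.00938 and 0.78 presumably come from rounding of intermediate values.

```python
from math import sqrt

# (deaths, patients) per arm within each stratum, read off the 2x2 tables.
strata = {
    "<72h": {"A": (18, 28), "B": (26, 30)},
    ">=72h": {"A": (2, 13), "B": (3, 8)},
}


def stratum_summary(arms):
    """Return (difference in death rates, its variance, stratum size)."""
    (d_a, n_a), (d_b, n_b) = arms["A"], arms["B"]
    diff = d_a / n_a - d_b / n_b
    p = (d_a + d_b) / (n_a + n_b)            # pooled rate within stratum
    var = p * (1 - p) * (1 / n_a + 1 / n_b)
    return diff, var, n_a + n_b


summaries = [stratum_summary(arms) for arms in strata.values()]
total = sum(n for _, _, n in summaries)

# Weight each stratum by its fraction of all patients (G and 1 - G).
weighted_diff = sum((n / total) * d for d, _, n in summaries)
weighted_var = sum((n / total) ** 2 * v for _, v, n in summaries)
weighted_se = sqrt(weighted_var)

# Unstratified variance from the overall table, for comparison.
p_all = 49 / 79
var_unstratified = p_all * (1 - p_all) * (1 / 41 + 1 / 38)
re_post = weighted_var / var_unstratified

print(f"diff={weighted_diff:.3f}, se={weighted_se:.3f}, RE={re_post:.2f}")
```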
24
How much gain in precision would be achieved if stratification was used in the design?
25
Force balance within stratum
Assume the $\hat{p}_{ij}$'s don't change

              < 72 Hours   ≥ 72 Hours
A (total)         29           11
B (total)         29           10

$SE(\hat{p}_{1A} - \hat{p}_{1B}) = 0.109$ instead of 0.112
$SE(\hat{p}_{2A} - \hat{p}_{2B}) = 0.186$ instead of 0.191
26
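SE stratified design = 0.096 (same weights are used)
$$RE_1 = \frac{\operatorname{Var}(\text{stratified design})}{\operatorname{Var}(\text{no stratification})} = \frac{(0.096)^2}{(0.109)^2} = 0.77$$
23% reduction
$$RE_2 = \frac{\operatorname{Var}(\text{stratified design})}{\operatorname{Var}(\text{post-stratification})} = \frac{(0.096)^2}{(0.097)^2} = 0.98$$
2% reduction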
27
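$$RE = \frac{G p_{1\cdot}(1 - p_{1\cdot}) + (1 - G) p_{2\cdot}(1 - p_{2\cdot})}{\left[ G p_{1\cdot} + (1 - G) p_{2\cdot} \right] \left[ 1 - G p_{1\cdot} - (1 - G) p_{2\cdot} \right]}$$
If $p_{1\cdot} = p_{2\cdot}$, then $RE = 1$

G       $p_{1\cdot}$   $p_{2\cdot}$   RE
0.50    0.60           0.30           0.91
0.20    0.60           0.30           0.94
0.10    0.60           0.30           0.96
0.50    0.60           0.20           0.83
0.20    0.60           0.20           0.87
0.10    0.60           0.20           0.92
0.50    0.10           0.05           0.991
0.20    0.10           0.05           0.992
0.10    0.10           0.05           0.996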
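The formula on this slide is easy to evaluate for other scenarios. In this sketch the function name `re_post_stratification` is hypothetical. `G` is the fraction of patients in stratum 1, and `p1`, `p2` are the stratum-specific event rates.

```python
def re_post_stratification(G, p1, p2):
    """Within-stratum binomial variance relative to overall binomial variance."""
    within = G * p1 * (1 - p1) + (1 - G) * p2 * (1 - p2)
    p_overall = G * p1 + (1 - G) * p2
    return within / (p_overall * (1 - p_overall))


for G, p1, p2 in [(0.50, 0.60, 0.30), (0.50, 0.60, 0.20), (0.50, 0.10, 0.05)]:
    print(G, p1, p2, round(re_post_stratification(G, p1, p2), 3))
```

The gain is largest when the strata are similar in size and the event rates differ substantially; with rare events the gain is negligible.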
28
The reduction in variance achieved with post-stratification depends on:
1) the distribution of the prognostic factor in the population;
2) the relative strength of the prognostic factor; and
3) the expected endpoint rate in the group studied.
29
Scott’s Survey of Trials Published in Lancet and N Eng J Med in 2001

Stratification
Permuted block     43/150
Minimization        6/150
Other adaptive      3/150
Other              19/150
Unspecified        79/150

Scott et al. Cont Clin Trials 2002; 23:662-674
30
Kahan’s Survey of 258 Trials Published in Four Major Medical Journals in 2010

Method of Randomization
Simple: 4
Permuted blocks, no stratification: 40
Permuted blocks, stratification: 85
Minimization: 29
Other: 4
Unclear: 96

Kahan BC et al. BMJ 2012; 345:e5840
31
Conclusions
1. Usually there is little loss of efficiency with post-stratification as compared to a stratified design.
2. Loss of efficiency results from large chance imbalances for important prognostic factors, which are more likely in small studies.
3. Stratified designs should be considered in small studies (n < 50) with important prognostic factors.
4. Strictly speaking, analysis should account for pre-stratification.
32
Recommendation for Multi-Center Trials: Always Consider Stratification on Center
1. Clinic populations differ.
2. Treatment differs from clinic to clinic.
3. Each center represents a replicate of the overall trial – can investigate treatment x clinic interactions.
4. In some trials (surgery), it may be better to stratify on surgeon within clinic.
5. If there are a very large number of clinical sites, a small block size may have to be used and sites combined into a priori defined larger strata (e.g., region or country) for analysis.
33
General Recommendations
Large trials
–Block randomization with stratification by center
–Stratification on other factors not necessary (I am a lumper)
–If needed, usually okay to carry out block randomization within each stratum
Small trials
–Block randomization with stratification by center
–If stratification on other factors is considered, may have to use an adaptive approach
These are consistent with Friedman, Furberg and DeMets (see page 111)
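Block randomization with stratification by center can be sketched as follows. This is a minimal illustration, not a production allocation system: the class name `StratifiedBlockRandomizer`, the fixed block size of 4, the two arms, and the seed are all assumptions made for the example. Real trials typically add allocation concealment and often vary the block size.

```python
import random


class StratifiedBlockRandomizer:
    """Permuted-block randomization carried out separately in each stratum."""

    def __init__(self, block_size=4, arms=("A", "B"), seed=None):
        if block_size % len(arms) != 0:
            raise ValueError("block_size must be a multiple of the number of arms")
        self.block_size = block_size
        self.arms = tuple(arms)
        self.rng = random.Random(seed)
        self.pending = {}  # stratum -> assignments left in the current block

    def _new_block(self):
        # Equal numbers of each arm, in random order.
        block = list(self.arms) * (self.block_size // len(self.arms))
        self.rng.shuffle(block)
        return block

    def assign(self, stratum):
        """Return the next treatment assignment for a patient in `stratum`."""
        queue = self.pending.setdefault(stratum, [])
        if not queue:
            queue.extend(self._new_block())
        return queue.pop()


# Hypothetical usage: strata are centers; eight patients enroll at one center.
randomizer = StratifiedBlockRandomizer(block_size=4, seed=1)
assignments = [randomizer.assign("center-1") for _ in range(8)]
print(assignments)
```

Because each center fills its own blocks, the arms are balanced within a center after every completed block, which is the property the stratified design relies on.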