Part IV The General Linear Model Multiple Explanatory Variables Chapter 13.4 Fixed * Random Effects Randomized block
Statistical control The effect of one variable (random: subject in last lecture) are removed to arrive at a better test for the variable of interest (fixed: drug in last lecture) It is used when manipulative control is not possible Field Not possible at large scales Expensive Can generate artifacts study with well designed statistical control can be more informative
Randomized block design "Block what you can, randomize what you cannot.“ Blocking to remove effects of the most important nuisance variables Randomization to reduce contaminating effects of remaining nuisance variables Within each block elements must be homogenous with respect to response variable Heterogeneity among blocks Order blocks perpendicular to gradient of extraneous variable Treatments are randomized within each block
Randomized block design Example from Netter & Wasserman (1974) Applied Linear Statistical Models Response: sales volume Explanatory: level of newspaper advertising Size of city correlated with response block according to population size perpendicular to gradient=from large to small randomize level of advertising within each block
Randomized block design Some blocking criteria: Characteristics associated with the unit – For persons: gender, age, etc – For geographic areas: population size, average income, etc Characteristics associated with the experimental setting observer, time of processing, machine, tank, batch, measuring instrument, etc
Data from Sokal & Rohlf Dry weights of 3 genotypes (wild, hetrozygote mutant, homozygote mutnt) of flour beetle Tribolium in 4 experiments Does weight vary among genotypes? GLM | Randomized block design BlocksGtypeWt
1. Construct Model Response variable: M = beetle mass Explanatory variables: 1. Genotype Fixed effect 2. Experiment Random effect Experiments were laborious and carried out several months apart Each experiment is a block
1. Construct Model Verbal: Does weight vary among genotypes, after controlling for differences among genotypes? Graphical:
1. Construct Model Formal: Can we have an interaction term? Same situation as last lecture. If we were to include an interaction term then df res =0 MS res = SS res / df res = X
1. Construct Model Formal: X
2. Execute analysis lm1 <- lm(M~G+B, data=trib) BlocksGtypeWt
2. Execute analysis GtypeBlock Effect FitsRes BlocksGtypeWtβ0β0 βGβG βBβB β 0 + β B + β G
3. Evaluate model a. Straight line □ Straight line model ok? b. Need to revise model? □ Errors homogeneous? c. Assumptions for computing p-values □ Errors normal? □ Errors independent?
a. Straight line □ Straight line model ok? b. Need to revise model? □ Errors homogeneous? c. Assumptions for computing p-values □ Errors normal? □ Errors independent? 3. Evaluate model NA ?
3. Evaluate model a. Straight line □ Straight line model ok? b. Need to revise model? □ Errors homogeneous? c. Assumptions for computing p-values □ Errors normal? □ Errors independent? NA ? X
3. Evaluate model a. Straight line □ Straight line model ok? b. Need to revise model? □ Errors homogeneous? c. Assumptions for computing p-values □ Errors normal? □ Errors independent? NA X ?
4.State the population and whether the sample is representative. Genotype fixed effects We will infer only to those genotypes Experiment, i.e. time and other condition random effects All possible measurements that could have been made on Tribolium, given the mode of collection
5.Decide on mode of inference. Is hypothesis testing appropriate? 6.State H A / H o pair, test statistic, distribution, tolerance for Type I error. Interaction Term: Removed by experimental design (genotypes weighed in random order) Block Term experiment: We are interested in this effect. Only included in model to remove the variance from the error term
6.State H A / H o pair, test statistic, distribution, tolerance for Type I error. Genotype Term: H A : E(M I ) ≠ E(M II ) ≠ E(M III )H A :Var(β G ) > 0 H 0 : E(M I ) = E(M II ) = E(M III ) H 0 :Var(β G ) = 0 Test Statistic Distribution of test statitstic Tolerance for Type I error
7. ANOVA n = 12 SourcedfSSMSFp Blocks Genotype Res______ Total
7. ANOVA n = 12 SourcedfSSMSFp Blocks Genotype Res__6__ Total
7. ANOVA n = 12 SourcedfSSMSFp Blocks Genotype Res__6__ Total
7. ANOVA n = 12 SourcedfSSMSFp Blocks Genotype Res__6__ Total What would it look like had we not controlled for experiment? SourcedfSSMSFp Genotype Res__9__ Total
7. ANOVA n = 12 SourcedfSSMSFp Blocks Genotype Res__6__ Total What would it look like had we not controlled for experiment? SourcedfSSMSFp Genotype Res__9__ Total
7. ANOVA n = 12 SourcedfSSMSFp Blocks Genotype Res__6__ Total What would it look like had we not controlled for experiment? SourcedfSSMSFp Genotype Res__9__ Total
7. ANOVA n = 12 SourcedfSSMSFp Blocks Genotype Res__6__ Total What would it look like had we not controlled for experiment? SourcedfSSMSFp Genotype Res__9__ Total
7. ANOVA n = 12 SourcedfSSMSFp Blocks Genotype Res__6__ Total What would it look like had we not controlled for experiment? SourcedfSSMSFp Genotype Res__9__ Total
7. ANOVA n = 12 SourcedfSSMSFp Blocks Genotype Res__6__ Total What would it look like had we not controlled for experiment? SourcedfSSMSFp Genotype Res__9__ Total
7. ANOVA n = 12 SourcedfSSMSFp Blocks Genotype Res__6__ Total What would it look like had we not controlled for experiment? SourcedfSSMSFp Genotype Res__9__ Total STATISTICAL CONTROL
8. Decide whether to recompute p-value Residuals homogenous not independent deviated slightly from normality n=12 p=0.027 needs to change 2-fold to change decision Randomization ( runs) p ran = change in p: 0.027/0.186 = 1.45
9.Declare decision about terms Only the fixed term was tested p=0.0186< α =0.05 Reject H 0 There is significant variation in mean dry weight among genotypes
10.Report and interpret parameters of biological interest Means per genotype with 95% CI, not controlled for among experiments variation Genotype III Homozygote mutant Genotype I Heterozygote mutant Genotype II Wild
10.Report and interpret parameters of biological interest Means per genotype with 95% CI, not controlled for among experiments variation library(effects) effect("G", lm2,se=TRUE, confidence.level=.95) Genotype III Homozygote mutant Genotype I Heterozygote mutant Genotype II Wild
Genotype III Homozygote mutant 10.Report and interpret parameters of biological interest Means per genotype with 95% CI, controlled for among experiments variation Genotype I Heterozygote mutant Genotype II Wild
10.Report and interpret parameters of biological interest Means per genotype with 95% CI, controlled for among experiments variation effect("G", lm1,se=TRUE, confidence.level=.95) Genotype III Homozygote mutant Genotype I Heterozygote mutant Genotype II Wild