Fixed vs. Random Effects Fixed effect –we are interested in the effects of the treatments (or blocks) per se –if the experiment were repeated, the levels.

Slides:



Advertisements
Similar presentations
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Advertisements

Data: Crab mating patterns Data: Typists (Poisson with random effects) (Poisson Regression, ZIP model, Negative Binomial) Data: Challenger (Binomial with.
Design and Analysis of Augmented Designs in Screening Trials
CHAPTER 25: One-Way Analysis of Variance Comparing Several Means
i) Two way ANOVA without replication
Design of Experiments and Analysis of Variance
PROC GLIMMIX: AN OVERVIEW
Linear statistical models 2009 Models for continuous, binary and binomial responses  Simple linear models regarded as special cases of GLMs  Simple linear.
Analysis of Variance: ANOVA. Group 1: control group/ no ind. Var. Group 2: low level of the ind. Var. Group 3: high level of the ind var.
The Statistical Analysis Partitions the total variation in the data into components associated with sources of variation –For a Completely Randomized Design.
Generalised linear models
Lesson #23 Analysis of Variance. In Analysis of Variance (ANOVA), we have: H 0 :  1 =  2 =  3 = … =  k H 1 : at least one  i does not equal the others.
ANalysis Of VAriance (ANOVA) Comparing > 2 means Frequently applied to experimental data Why not do multiple t-tests? If you want to test H 0 : m 1 = m.
Final Review Session.
Chapter 2 Simple Comparative Experiments
OLS versus MLE Example YX Here is the data:
Incomplete Block Designs
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Assumptions of the ANOVA The error terms are randomly, independently, and normally distributed, with a mean of zero and a common variance. –There should.
Transforming the data Modified from: Gotelli and Allison Chapter 8; Sokal and Rohlf 2000 Chapter 13.
Assumptions of the ANOVA
SAS Lecture 5 – Some regression procedures Aidan McDermott, April 25, 2005.
1 Advances in Statistics Or, what you might find if you picked up a current issue of a Biological Journal.
Fixed vs. Random Effects
Experimental Design An Experimental Design is a plan for the assignment of the treatments to the plots in the experiment Designs differ primarily in the.
Chapter 3: Generalized Linear Models 3.1 The Generalization 3.2 Logistic Regression Revisited 3.3 Poisson Regression 1.
The Triangle of Statistical Inference: Likelihoood
Random Sampling, Point Estimation and Maximum Likelihood.
Applications The General Linear Model. Transformations.
23-1 Analysis of Covariance (Chapter 16) A procedure for comparing treatment means that incorporates information on a quantitative explanatory variable,
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
The Scientific Method Formulation of an H ypothesis P lanning an experiment to objectively test the hypothesis Careful observation and collection of D.
Today: Lab 9ab due after lecture: CEQ Monday: Quizz 11: review Wednesday: Guest lecture – Multivariate Analysis Friday: last lecture: review – Bring questions.
Chapter 13 ANOVA The Design and Analysis of Single Factor Experiments - Part II Chapter 13B Class will begin in a few minutes. Reaching out to the internet.
Part III The General Linear Model Chapter 10 GLM. ANOVA.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Different Distributions David Purdie. Topics Application of GEE to: Binary outcomes: – logistic regression Events over time (rate): –Poisson regression.
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
The Completely Randomized Design (§8.3)
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Control of Experimental Error Blocking - –A block is a group of homogeneous experimental units –Maximize the variation among blocks in order to minimize.
Mixed Effects Models Rebecca Atkins and Rachel Smith March 30, 2015.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Chapter 13 Complete Block Designs. Randomized Block Design (RBD) g > 2 Treatments (groups) to be compared r Blocks of homogeneous units are sampled. Blocks.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
1 Introduction to Mixed Linear Models in Microarray Experiments 2/1/2011 Copyright © 2011 Dan Nettleton.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
1 STA 617 – Chp12 Generalized Linear Mixed Models SAS for Model (12.3) with Matched Pairs from Table 12.1.
Statistics……revisited
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
Dependent Variable Discrete  2 values – binomial  3 or more discrete values – multinomial  Skewed – e.g. Poisson Continuous  Non-normal.
Topic 22: Inference. Outline Review One-way ANOVA Inference for means Differences in cell means Contrasts.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
G Lecture 71 Revisiting Hierarchical Mixed Models A General Version of the Model Variance/Covariances of Two Kinds of Random Effects Parameter Estimation.
Chapter 13 Understanding research results: statistical inference.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
F73DA2 INTRODUCTORY DATA ANALYSIS ANALYSIS OF VARIANCE.
Model Comparison. Assessing alternative models We don’t ask “Is the model right or wrong?” We ask “Do the data support a model more than a competing model?”
Transforming the data Modified from:
Generalized Linear Models
Joanna Romaniuk Quanticate, Warsaw, Poland
I. Statistical Tests: Why do we use them? What do they involve?
Interval Estimation and Hypothesis Testing
Introduction to Mixed Linear Models in Microarray Experiments
Fixed, Random and Mixed effects
Confidence Intervals for Proportions and Variances
Presentation transcript:

Fixed vs. Random Effects Fixed effect –we are interested in the effects of the treatments (or blocks) per se –if the experiment were repeated, the levels would be the same –conclusions apply to the treatment (or block) levels that were tested –treatment (or block) effects sum to zero Random effect –represents a sample from a larger reference population –the specific levels used are not of particular interest –conclusions apply to the reference population inference space may be broad (all possible random effects) or narrow (just the random effects in the experiment) –goal is generally to estimate the variance among treatments (or other groups) Need to know which effects are fixed or random to determine appropriate F tests in ANOVA

Fixed or Random? lambs born from common parents (same ram and ewe) are given different formulations of a vitamin supplement comparison of new herbicides for potential licensing comparison of herbicides used in different decades (1980’s, 1990’s, 2000’s) nitrogen fertilizer treatments at rates of 0, 50, 100, and 150 kg N/ha years of evaluation of new canola varieties (2008, 2009, 2010) location of a crop rotation experiment that is conducted on three farmers’ fields in the Willamette valley (Junction City, Albany, Woodburn) species of trees in an old growth forest

Fixed and random models for the CRD Fixed Model (Model I) Random Model (Model II) Y ij = µ +  i +  ij Expected Source df Mean Square Treatment t-1 Error tr-t 2 T 2 e r  2 e  Expected  Te  r 22  Source df Mean Square Treatment t -1 Error tr - t 2 e  variance among fixed treatment effects

Models for the RBD Fixed Model Random Model Y ij = µ +  i +  j +  ij Mixed Model

RBD Mixed Model Analyses with SAS Mixed Models - contain both random and fixed effects Note that PROC GLM will only handle LM! PROC GLIMMIX can handle all of the situations above Distribution Treatments Fixed Blocks Fixed Treatments Fixed Blocks Random Normal (continuous) (PROC GLM) Linear Model (LM) (PROC MIXED) Linear Mixed Model (LMM) Non-normal (categories or counts) (PROC GENMOD) Generalized Linear Model (GLM) (PROC GLIMMIX) Generalized Linear Mixed Model (GLMM)

Generalized Linear Models An alternative to data transformations Principle is to make the model fit the data, rather than changing the data to fit the model Models include link functions that allow heterogeneous variances and nonlinearity Analysis and estimation are based on maximum likelihood methods Becoming more widely used - recommended by the experts Need some understanding of the underlying theory to implement properly Notes adapted from ASA GLMM Workshop, Long Beach, CA, 2010

Generalized Linear Models ANOVA/Regression model is fit to a non-normal data set Three elements: 1. Random component – a probability distribution for Y i from the exponential family of distributions 2. Systematic component – represent the linear predictors (X variables) in the model 3. Link function – links the random and systematic elements Form is mean + trt effect No error term

Generalized Linear Models ANOVA/Regression model is fit to a non-normal data set Three elements: 1. Random component – a probability distribution for Y i from the exponential family of distributions (this is known) 2. Systematic component – represent the linear predictors (X variables) in the model 3. Link function – links the random and systematic elements Form is mean + trt effect No error term

Log of Distribution = “Log-Likelihood” Binary responses (0 or 1) Probability of success follows a binomial distribution “canonical parameter” Takes the form Y * function of P

Example – logit link µ can only vary from 0 to 1  can take on any value Use an inverse function to convert means to the original scale

Some Common Distributions & Link(s)

Linear Models for an RBD in SAS Treatments fixed, Blocks fixed –PROC GLM (normal) or PROC GENMOD (non-normal) –all effects appear in model statement Model Response = Block Treatment; Treatments fixed, Blocks random –PROC MIXED (normal) or PROC GLIMMIX (non-normal) –Only fixed effects appear in model statement Model Response = Treatment; Random Block;

GLIMMIX basic syntax for an RBD fixed effects go in the model statement random effects go in the random statement default means and standard errors from lsmeans statement are on a log scale ilink option gives back-transformed means on original scale and estimates standard errors on original scale diff option requests significant tests between all possible pairs of treatments in the trial, proc glimmix; class treatment block; model response = treatment / link=log s dist=poisson; random block; lsmeans treatment/ilink diff;

Estimation in LMM, GLM, and GLMM Does not use Least Squares estimation Does not calculate Sums of Squares or Mean Squares Estimates are by Maximum Likelihood Output includes Source of variation degrees of freedom F tests and p-values Treatment means and standard errors Comparisons of means and standard errors