Analysis of Clustered and Longitudinal Data Module 3 Linear Mixed Models (LMMs) for Clustered Data – Two Level Part A 1 Biostat 512: Module 3A - Kathy.

Slides:



Advertisements
Similar presentations
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
Advertisements

Lecture 11 (Chapter 9).
Statistical Analysis Overview I Session 2 Peg Burchinal Frank Porter Graham Child Development Institute, University of North Carolina-Chapel Hill.
3-Dimensional Gait Measurement Really expensive and fancy measurement system with lots of cameras and computers Produces graphs of kinematics (joint.
Regression Inferential Methods
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
SC968: Panel Data Methods for Sociologists Random coefficients models.
Lecture 6 (chapter 5) Revised on 2/22/2008. Parametric Models for Covariance Structure We consider the General Linear Model for correlated data, but assume.
Correlation and regression
Lecture 4 (Chapter 4). Linear Models for Correlated Data We aim to develop a general linear model framework for longitudinal data, in which the inference.
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.
Objectives (BPS chapter 24)
Multilevel Models 4 Sociology 8811, Class 26 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
From last time….. Basic Biostats Topics Summary Statistics –mean, median, mode –standard deviation, standard error Confidence Intervals Hypothesis Tests.
School of Veterinary Medicine and Science Multilevel modelling Chris Hudson.
DATA VISUALIZATION UNIVARIATE (no review- self study) STEM & LEAF BOXPLOT BIVARIATE SCATTERPLOT (review correlation) Overlays; jittering Regression line.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Multilevel Models 2 Sociology 8811, Class 24
ANCOVA Psy 420 Andrew Ainsworth. What is ANCOVA?
Clustered or Multilevel Data
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
Mixed models Various types of models and their relation
Multilevel Models 3 Sociology 8811, Class 25 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
Introduction to Linear Mixed Effects Kiran Pedada PhD Student (Marketing) March 26, 2015.
Experimental Group Designs
Standard error of estimate & Confidence interval.
Analysis of Clustered and Longitudinal Data
Inference for regression - Simple linear regression
1 Basic Definitions Greg C Elvers, Ph.D.. 2 Statistics Statistics are a set of tools that help us to summarize large sets of data data -- set of systematic.
1 BINARY CHOICE MODELS: PROBIT ANALYSIS In the case of probit analysis, the sigmoid function is the cumulative standardized normal distribution.
PARAMETRIC STATISTICAL INFERENCE
Hierarchical Linear Modeling (HLM): A Conceptual Introduction Jessaca Spybrook Educational Leadership, Research, and Technology.
Scientific question: Does the lunch intervention impact cognitive ability? The data consists of 4 measures of cognitive ability including:Raven’s score.
Introduction Multilevel Analysis
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Regression. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other words, there is a distribution.
Regression. Height Weight Suppose you took many samples of the same size from this population & calculated the LSRL for each. Using the slope from each.
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Regression with Inference Notes: Page 231. Height Weight Suppose you took many samples of the same size from this population & calculated the LSRL for.
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
Biostat 200 Lecture Simple linear regression Population regression equationμ y|x = α +  x α and  are constants and are called the coefficients.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
T tests comparing two means t tests comparing two means.
FIXED AND RANDOM EFFECTS IN HLM. Fixed effects produce constant impact on DV. Random effects produce variable impact on DV. F IXED VS RANDOM EFFECTS.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest.
1 BINARY CHOICE MODELS: LINEAR PROBABILITY MODEL Economists are often interested in the factors behind the decision-making of individuals or enterprises,
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.
1 Consider the k studies as coming from a population of effects we want to understand. One way to model effects in meta-analysis is using random effects.
From t-test to multilevel analyses Del-3
Analysis for Designs with Assignment of Both Clusters and Individuals
Simple Linear Regression
Regression.
From t-test to multilevel analyses Del-2
Stephen W. Raudenbush University of Chicago December 11, 2006
Linear Mixed Models in JMP Pro
Regression.
ANOVA Advanced Statistical Methods: Continuous Variables ANOVA
Chapter 12 Regression.
Regression.
Regression.
Regression.
Regression.
Bayesian Multilevel/Longitudinal Models Using Stata
Regression.
Advanced Algebra Unit 1 Vocabulary
Introduction to Regression
Introduction to Econometrics, 5th edition
Presentation transcript:

Analysis of Clustered and Longitudinal Data Module 3 Linear Mixed Models (LMMs) for Clustered Data – Two Level Part A 1 Biostat 512: Module 3A - Kathy Welch, Heidi Reichert

The Linear Mixed Model (LMM) A Linear Mixed Model is a parametric model for a continuous outcome. The model is linear in the parameters. The model contains both fixed and random effects. LMMs can be used to analyze both clustered and longitudinal/repeated measures data. We will discuss the analysis clustered data using LMMs in this module and cover the analysis of longitudinal and repeated measures data using LMMs in later modules. 2

Data Example: Rat Pup Data 30 female rats were randomly assigned to one of three treatment groups, high dose, low dose and control. The objective of the study was to compare the birth weights of pups from litters born to female rats that received the drug treatment at high and low doses to the birth weights of pups from litters that received the control treatment. Research question: Is there an effect of drug treatment (High, Low, Control) on birth weight? 3

Clustered Data Example: Rat Pup Data The design is unbalanced –Number of rats receiving each treatment varies by treatment group (3 rats in the high-dose group died) –Number of rat pups per litter varies across the litters Variables include –Litter (litter ID number) –Pup_ID (rat pup ID number) –Weight (birth weight of the rat pup: the outcome) –Sex (sex of the rat pup: female or male) –Treatment (dose: high, low, or control) 4

The Rat Pup Data is Multilevel Level 2 (Litter) Level 1 (Rat Pup) Level 1 Variables: Birth Weight, Sex Level 2 Variables: Treatment Pup 2 1 Pup n 1 Litter 1 Pup Pup 2 2 Pup n 2 Pup Litter 2 5

Weights Vary Within and Between Litters 6 Rat weights vary from rat to rat within the same litter. The average litter weight ( ) varies between litters.

Weights are Correlated Within Litters 7 The weights of rats from within the same litter tend to be pretty similar. For some litters, the rat weights lie entirely above or below the overall average (-).

Summarize the Level 1 Covariate(s) Level 1 covariate is sex sex | Freq. Percent Cum Female | Male | Total |

Summarize Weight by the Level 1 Covariate(s) Y is Weight Level 1 covariate is sex Summary for variables: weight by categories of: female (Sex) female | N mean sd min max | | Total |

Use boxplots to assess the effect of sex 10 Visualize Weight by the Level 1 Covariate(s)

Summarize the Level 2 Covariates Level 2 covariate is treatment group treatment | Freq. Percent Cum Control | High | Low | Total|

12 Visualize Weight by the Level 2 Covariates Use boxplots to assess the effect of treatment

The Linear Mixed Model (LMM) for Clustered Data LMMs for clustered data allow for both fixed and random effects. Fixed effects may be modeled at any level of the data. –In the rat pup data, we are interested in the fixed effects of sex and treatment. –Sex can vary from rat to rat. It is measured at Level 1. –Treatment is constant for rats within the same litter. It is measured at Level 2. Random effects usually include a random intercept for each level of clustering to account for possible correlation within clusters, and to make inference to the larger population of clusters. –In the rat pup model, we will include a random intercept term. 13

The LMM for the Rat Pup Data We start with the simplest mixed model : where i denotes a rat pup j denotes the litter is the overall intercept term is the random deviation from the fixed intercept for litter j is the random error for the ith rat pup in the jth litter fixedrandom 14

The LMM for the Rat Pup Data 15

The LMM for the Rat Pup Data weight | Coef. Std. Err. z P>|z| [95% Conf. Interval] _cons | Random-effects Parameters | Estimate Std. Err. [95% Conf. Interval] litter: Identity | var(_cons) | var(Residual) |

The LMM for the Rat Pup Data The random portion of the model now involves two parts – the cluster-specific random deviations (the b 0j ), and the subject-within-cluster-specific error (the ). This LMM is commonly referred to as the Variance Components model, because it partitions the total variation in the outcome into between-cluster variation and within-cluster variation. –The variance of the random intercepts is the between-cluster variation. Also referred to as the Level 2 variance. –The variance of the residuals is the within-cluster variation, also known as the Level 1 variance. 17

The LMM for the Rat Pup Data We now add the dummy variables for the Level 2 covariate, Treatment: where i denotes a rat pup j denotes the litter is the overall intercept term, and represents the mean for Control group are the difference in effect of treatment for the High and Low treatment groups, respectively, compared to Control is the random deviation from the treatment-specific intercept for litter j is the random error for the ith rat pup in the jth litter fixedrandom 18

The LMM for the Rat Pup Data 19

The LMM for the Rat Pup Data weight | Coef. Std. Err. z P>|z| [95% Conf. Interval] treatnum | 1 | | | _cons | Random-effects Parameters | Estimate Std. Err. [95% Conf. Interval] litter: Identity | var(_cons) | var(Residual) |

The LMM for the Rat Pup Data The addition of the Level 2 dummies for treatment has reduced the Level 2 between- cluster variance. –The variance of the random intercepts (or the b 0j s) is smaller because the systematic variation due to treatment has been removed. 21

The LMM for the Rat Pup Data We now add the dummy variable for the Level 1 covariate, Sex: where i denotes a rat pup j denotes the litter is the overall intercept term, and represents the mean for Males in the Control group are the difference in effect of treatment for the High and Low treatment groups, respectively, compared to Control is the effect being Female compared to Male is the random deviation from the treatment-specific intercept for litter j is the random error for the ith rat pup in the jth litter fixedrandom 22

The LMM for the Rat Pup Data 23

The LMM for the Rat Pup Data weight | Coef. Std. Err. z P>|z| [95% Conf. Interval] treatnum | 1 | | | 1.female | _cons | Random-effects Parameters | Estimate Std. Err. [95% Conf. Interval] litter: Identity | var(_cons) | var(Residual) |

The LMM for the Rat Pup Data The addition of the Level 1 dummy for sex has reduced the Level 1 within-cluster variance. –The residual variance is smaller because the systematic variation due to sex has been removed. 25

The LMM Accounts for Correlation We say that given the s, the s within a cluster are independent. 26

The Linear Mixed Model (LMM) for Clustered Data LMMs for clustered data generally include both fixed and random effects. –We include random intercepts for each level of clustering. In LMMs the random part of the model now involves two parts – the b 0j s and the s –The variance of the random intercepts (the b 0j s) quantifies the between-cluster variation in the outcome. –The residual variance (variance of the s) quantifies the within-cluster variation in the outcome. 27

Data Setup for LMM Analysis: Long Form 28

Lab Example Rat Pup Data 29