HLM – ESTIMATING MULTI-LEVEL MODELS Hierarchical Linear Modeling.


1. These models incorporate a nested design.
2. This allows responses to be more similar within a group than across groups.
3. HLM allows for fixed effects, random effects, and variance components.
4. Oftentimes in experimental settings, the random effects are nuisances that necessitate statistical controls. For example, the effect of a drug may be the primary interest, whereas the nurse factor can be potentially confounding but theoretically uninteresting. It is nonetheless necessary to include the relevant random effects in the model; omitting them risks false inferences about the drug effect (and about any interaction between the drug and a random effect). In other applications the random effects are of substantive interest.

ANOVA AND HLM
Hox and Kreft (1994) make the connection clearly:
“An effect in ANOVA is said to be fixed when inferences are to be made only about the treatments actually included. An effect is random when the treatment groups are sampled from a population of treatment groups and inferences are to be made to the population of which these treatments are a sample. Random effects need random effects ANOVA models (Hays 1973). Multilevel models assume a hierarchically structured population, with random sampling of both groups and individuals within groups. Consequently, multilevel analysis models must incorporate random effects” (pgs ).

WHY? CORRELATED ERROR TERMS!
Note that the motivation for utilizing mixed models for multilevel data does not rest on the different number of observations at each level, as any model including a dummy variable involves nesting (e.g. survey respondents are nested within gender).
The justification instead lies in the fact that the errors within each randomly sampled level-2 unit are likely correlated, necessitating the estimation of a random effects model.
Once the researcher has accounted for error non-independence it is possible to make more accurate inferences about the fixed effects of interest.
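The point about correlated errors can be made concrete with a small simulation. The sketch below is illustrative only: it assumes a balanced design, simulated (not HSB) data, and hypothetical function names `simulate_clustered` and `anova_icc`. It generates outcomes that share a group-level effect and then estimates the intraclass correlation (the share of variance due to groups) with the one-way ANOVA estimator.

```python
import random


def simulate_clustered(n_groups=200, n_per_group=20, sd_group=1.0,
                       sd_resid=1.0, seed=1):
    """Simulate y_ij = u_j + r_ij for a balanced two-level design."""
    rng = random.Random(seed)
    groups = []
    for _ in range(n_groups):
        u_j = rng.gauss(0.0, sd_group)  # effect shared by everyone in group j
        groups.append([u_j + rng.gauss(0.0, sd_resid)
                       for _ in range(n_per_group)])
    return groups


def anova_icc(groups):
    """One-way ANOVA estimate of the intraclass correlation (balanced data)."""
    n = len(groups[0])
    n_groups = len(groups)
    grand_mean = sum(sum(g) for g in groups) / (n_groups * n)
    means = [sum(g) / n for g in groups]
    ms_between = n * sum((m - grand_mean) ** 2 for m in means) / (n_groups - 1)
    ms_within = sum((y - m) ** 2
                    for g, m in zip(groups, means)
                    for y in g) / (n_groups * (n - 1))
    # Negative estimates are truncated at zero.
    var_between = max((ms_between - ms_within) / n, 0.0)
    return var_between / (var_between + ms_within)


groups = simulate_clustered()
icc = anova_icc(groups)
print(round(icc, 3))  # close to 0.5 when both standard deviations equal 1
```

With equal group and residual variances the true intraclass correlation is 0.5, so two students from the same group have outcomes correlated at about that level. A model that ignores the grouping treats that correlation as zero.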

The Model
Y_ij = [A_00 + A_01(MEANSES_j) + A_02(SECTOR_j) + A_10(SES_ij)] (Fixed Effects)
     + [A_11(MEANSES_j * SES_ij) + A_12(SECTOR_j * SES_ij)] (Fixed Effects)
     + [B_0j + B_1j(SES_ij)] (Random Effects)
     + r_ij
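One way to read the combined equation is as the result of substituting two school-level equations into a student-level equation. The decomposition below is a sketch under that assumption. The symbols \beta_{0j} and \beta_{1j} do not appear in the slides and are introduced here only as the school-specific intercept and SES slope.

```latex
\begin{align*}
\text{Level 1 (student):}\quad
  Y_{ij} &= \beta_{0j} + \beta_{1j}\,\mathrm{SES}_{ij} + r_{ij} \\
\text{Level 2 (school):}\quad
  \beta_{0j} &= A_{00} + A_{01}\,\mathrm{MEANSES}_j + A_{02}\,\mathrm{SECTOR}_j + B_{0j} \\
  \beta_{1j} &= A_{10} + A_{11}\,\mathrm{MEANSES}_j + A_{12}\,\mathrm{SECTOR}_j + B_{1j} \\
\text{Combined:}\quad
  Y_{ij} &= A_{00} + A_{01}\,\mathrm{MEANSES}_j + A_{02}\,\mathrm{SECTOR}_j + A_{10}\,\mathrm{SES}_{ij} \\
         &\quad + A_{11}\,\mathrm{MEANSES}_j\,\mathrm{SES}_{ij} + A_{12}\,\mathrm{SECTOR}_j\,\mathrm{SES}_{ij} \\
         &\quad + B_{0j} + B_{1j}\,\mathrm{SES}_{ij} + r_{ij}
\end{align*}
```

Multiplying the slope equation through by SES_ij is what produces the two cross-level interaction terms and the random slope term B_1j(SES_ij).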

How to read in data:

SPSS
The command for estimating multilevel models is MIXED, followed immediately by the dependent variable. PRINT = SOLUTION requests that SPSS report the fixed effects estimates and standard errors. FIXED and RANDOM specify which variables to treat as fixed and random effects, respectively. The SUBJECT option following the vertical bar (|) identifies the grouping variable, in this case school ID.

    MIXED mathach WITH meanses sector centses
      /PRINT = SOLUTION TESTCOV
      /FIXED = INTERCEPT meanses sector centses meanses*centses sector*centses
      /RANDOM = INTERCEPT centses | SUBJECT(id) COVTYPE(UN).

Covariances
- The TESTCOV keyword requests hypothesis tests for the variance components. The null hypothesis for each random effect is that its variance is equal to zero.
- The COVTYPE(UN) option specifies a structure for the level-2 covariance matrix. When a single school-level variance component is estimated it is unnecessary to deal with covariances. When there is more than one level-2 variance component, SPSS will assume a particular covariance structure. In many cross-sectional applications of multilevel models, the researcher does not wish to put any constraints on this covariance matrix. Thus the UN in the COVTYPE option specifies an unstructured matrix.
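To see what the unstructured matrix buys, note that with a random intercept B_0j and a random slope B_1j, the variance of the random part B_0j + B_1j(x) is tau00 + 2*tau01*x + tau11*x^2, where tau00 and tau11 are the two variances and tau01 is their covariance. The sketch below is only an illustration: the tau names and the numeric values are hypothetical, not estimates from the HSB data.

```python
def random_part_variance(x, tau00, tau01, tau11):
    """Var(B_0j + B_1j * x) under an unstructured 2x2 level-2 covariance."""
    return tau00 + 2.0 * tau01 * x + tau11 * x ** 2


def is_valid_covariance(tau00, tau01, tau11):
    """A 2x2 covariance matrix needs non-negative variances and determinant."""
    return tau00 >= 0 and tau11 >= 0 and tau00 * tau11 - tau01 ** 2 >= 0


# Hypothetical values chosen for easy arithmetic, not estimated from data.
tau00, tau01, tau11 = 2.0, 0.5, 0.25

print(is_valid_covariance(tau00, tau01, tau11))        # True
print(random_part_variance(0.0, tau00, tau01, tau11))  # 2.0
print(random_part_variance(1.0, tau00, tau01, tau11))  # 3.25
print(random_part_variance(-1.0, tau00, tau01, tau11)) # 1.25
```

Because tau01 is left free, between-school variance can differ for students above and below their school mean. A structure that forces tau01 to zero would make the variance symmetric around x = 0.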

Results
The fixed effects are all significant. Given the inclusion of the group-mean centered SES variable, the intercept is interpreted as the expected math achievement in a public school with average SES levels for a student at his or her school's average SES. In this model, that expected outcome is the intercept estimate, A_00.
Because there are interactions in the model, the marginal fixed effects of each variable will depend on the value of the other variable(s) involved in the interaction. The marginal effect of a one-unit change in a student's SES score on math achievement depends on whether a school is public or private as well as on the school's average SES score.
For a public school (where sector=0), the marginal effect of a one-unit change in the group-mean centered student SES variable is equal to A_10 + A_11(MEANSES). For a private school (where sector=1), the marginal effect of a one-unit change in student SES is equal to A_10 + A_12 + A_11(MEANSES).
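The dependence of the SES effect on sector and school mean SES follows directly from the model: the marginal effect is A_10 + A_11(MEANSES) + A_12(SECTOR). A minimal sketch of that arithmetic follows. The function name is hypothetical, and the coefficient values are placeholders chosen for easy checking, not the estimates from the fitted model.

```python
def ses_marginal_effect(meanses, sector, a10, a11, a12):
    """Effect of a one-unit change in student SES on the expected outcome.

    a10: main effect of student SES
    a11: coefficient on the MEANSES x SES interaction
    a12: coefficient on the SECTOR x SES interaction (sector: 0 public, 1 private)
    """
    return a10 + a11 * meanses + a12 * sector


# Placeholder coefficients for illustration only (not fitted values).
coefs = {"a10": 3.0, "a11": 1.0, "a12": -1.5}

public_avg = ses_marginal_effect(meanses=0.0, sector=0, **coefs)
private_avg = ses_marginal_effect(meanses=0.0, sector=1, **coefs)
public_high = ses_marginal_effect(meanses=0.5, sector=0, **coefs)

print(public_avg, private_avg, public_high)  # 3.0 1.5 3.5
```

Evaluating the function over a grid of MEANSES values for each sector gives the lines one would plot to display a cross-level interaction.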

Results Continued
When cross-level interactions are present, graphical means may be appropriate for exploring the contingent nature of marginal effects in greater detail. Here the simplest interpretation is that the effect of student-level SES is significantly higher in wealthier schools and significantly lower in private schools.
The variance component for the random intercept continues to be significant, suggesting that there remains some variation in average school performance not accounted for by the variables in the model. The variance component for the random slope, however, is not significant. Thus the researcher may be justified in estimating an alternative model that constrains this variance component to equal zero.

STATA
A final model introduces the student socioeconomic status variable. Because it is possible that the effect of individual SES varies across schools, this slope is treated as random. In addition, a school's average SES score and its sector (public or private) may interact with student-level SES, accounting for some of the variance in the slope. In order to include these cross-level interactions in the model, however, it is necessary to first explicitly create the interaction variables in Stata:

    . gen ses_mses = meanses*centses
    . gen ses_sect = sector*centses
    . xtmixed mathach meanses sector centses ses_mses ses_sect || id: centses, var cov(un)

SAS
The COVTEST option requests hypothesis tests for the random effects. The CLASS statement identifies id as a categorical variable. The MODEL statement defines the model, which in this case includes meanses, sector, cses, and the two cross-level interactions, and the SOLUTION option asks SAS to print the fixed effects estimates in the output. The next statement, RANDOM, identifies the elements of the model to be specified as random effects. The SUBJECT=id option (abbreviated SUB=id below) identifies id as the grouping variable.

    PROC MIXED COVTEST DATA=hsb2;
      CLASS id;
      MODEL mathach = meanses sector cses meanses*cses sector*cses / SOLUTION;
      RANDOM intercept cses / TYPE=UN SUB=id;
    RUN;

R

    install.packages("lme4")
    library(lme4)
    HSBdata <- read.table("C:/user/temp/hsbALL.txt", header = TRUE, sep = ",")
    # School mean of SES, repeated for every student in the school
    HSBdata$meanses <- ave(HSBdata$ses, HSBdata$id)
    # Group-mean centered student SES
    HSBdata$centses <- HSBdata$ses - HSBdata$meanses

Within the lme4 package, the lmer() function estimates linear mixed effects models. To use lmer(), specify the dependent variable before the tilde sign, the fixed components after it, and the random components in parentheses. Indicate which dataset R should use with the data argument. To fit the model described above, use the following syntax:

    results3 <- lmer(mathach ~ meanses + sector + centses + meanses*centses + sector*centses + (1 + centses | id), data = HSBdata)
    summary(results3)

R - continued
R saves the results of the model in an object called results3, which is stored in memory and may be summarized with the function summary(). The function lmer() estimates a model in which mathach is the dependent variable. The fixed-effects intercept is included by default; it could also be written explicitly as 1 immediately following the tilde sign.
Within the parentheses, 1 denotes the random intercept, centses the random slope, and the variable id after the vertical bar is specified as the level-2 grouping variable. R uses the HSB data for this analysis.
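For readers working outside R, the data preparation step that creates meanses and centses (group-mean centering) can be sketched in plain Python. This is an illustration on toy records, not the HSB file. The function name `group_mean_center` and the dictionary-based row layout are assumptions made for the example.

```python
from collections import defaultdict


def group_mean_center(rows, group_key="id", value_key="ses"):
    """Add the group mean (meanses) and the centered value (centses) to each row."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for row in rows:
        totals[row[group_key]] += row[value_key]
        counts[row[group_key]] += 1

    centered = []
    for row in rows:
        mean = totals[row[group_key]] / counts[row[group_key]]
        # Copy the row so the input records are left unchanged.
        centered.append({**row, "meanses": mean,
                         "centses": row[value_key] - mean})
    return centered


# Toy records: two students in school 1, one student in school 2.
students = [
    {"id": 1, "ses": 1.0},
    {"id": 1, "ses": 3.0},
    {"id": 2, "ses": 0.5},
]

for row in group_mean_center(students):
    print(row["id"], row["meanses"], row["centses"])
# 1 2.0 -1.0
# 1 2.0 1.0
# 2 0.5 0.0
```

After centering, centses sums to zero within every school, which is why the intercept can be read as the expected outcome for a student at his or her school's average SES.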