Kelvyn Jones, University of Bristol Wednesday 2nd July 2008, Session 29 WHAT IS: multilevel modelling?

Slides:



Advertisements
Similar presentations
Questions From Yesterday
Advertisements

Contextual effects In the previous sections we found that when regressing pupil attainment on pupil prior ability schools vary in both intercept and slope.
What is multilevel modelling? Realistically complex modelling Structures that generate dependent data Dataframes for modelling Distinguishing between.
THREE-LEVEL MODEL Two views The intractable statistical complexity that is occasioned by unduly ambitious three-level models (Bickel, 2007, 246) AND higher.
Multilevel modelling short course
What is multilevel modelling?
Advanced Lazarsfeldian Methodology Conference From Lazarsfeldian Contextual analysis to Multilevel models (Strategies for analysis of individual and/or.
Structural Equation Modeling
By Zach Andersen Jon Durrant Jayson Talakai
Multilevel Modeling in Health Research April 11, 2008.
The choice between fixed and random effects models: some considerations for educational research Claire Crawford with Paul Clarke, Fiona Steele & Anna.
3-Dimensional Gait Measurement Really expensive and fancy measurement system with lots of cameras and computers Produces graphs of kinematics (joint.
SC968: Panel Data Methods for Sociologists Random coefficients models.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 12: Analysis of Variance: Differences among Means of Three or More Groups.
2005 Hopkins Epi-Biostat Summer Institute1 Module 2: Bayesian Hierarchical Models Francesca Dominici Michael Griswold The Johns Hopkins University Bloomberg.
Lecture 4 Linear random coefficients models. Rats example 30 young rats, weights measured weekly for five weeks Dependent variable (Y ij ) is weight for.
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
School of Veterinary Medicine and Science Multilevel modelling Chris Hudson.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
1 BA 275 Quantitative Business Methods Residual Analysis Multiple Linear Regression Adjusted R-squared Prediction Dummy Variables Agenda.
A multilevel approach to geography of innovation Martin Srholec TIK Centre University of Oslo DIME International Workshop.
Clustered or Multilevel Data
Using Hierarchical Growth Models to Monitor School Performance: The effects of the model, metric and time on the validity of inferences THE 34TH ANNUAL.
Treatment Effects: What works for Whom? Spyros Konstantopoulos Michigan State University.
Multilevel Modeling Soc 543 Fall Presentation overview What is multilevel modeling? Problems with not using multilevel models Benefits of using.
Multilevel Modelling of PLASC data Harvey Goldstein University of Bristol.
Longitudinal Data Analysis: Why and How to Do it With Multi-Level Modeling (MLM)? Oi-man Kwok Texas A & M University.
Foster Care Reunification: The use of hierarchical modeling to account for sibling and county correlation Emily Putnam-Hornstein, MSW Center for Social.
Experimental Group Designs
Unit 3b: From Fixed to Random Intercepts © Andrew Ho, Harvard Graduate School of EducationUnit 3b – Slide 1
Analysis of Clustered and Longitudinal Data
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Introduction to Multilevel Modeling Using SPSS
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 25 Categorical Explanatory Variables.
School Dropout in Rural Vietnam: Does Gender Matter?
Lecture 5 “additional notes on crossed random effects models”
Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.
Modelling non-independent random effects in multilevel models William Browne Harvey Goldstein University of Bristol.
Workshop 1 Specify a multilevel structure for EITHER a response variable of your choice OR for a model to explain house prices OR voting behaviour Template.
Hierarchical Linear Modeling (HLM): A Conceptual Introduction Jessaca Spybrook Educational Leadership, Research, and Technology.
Scientific question: Does the lunch intervention impact cognitive ability? The data consists of 4 measures of cognitive ability including:Raven’s score.
Introduction Multilevel Analysis
Funded through the ESRC’s Researcher Development Initiative Prof. Herb MarshMs. Alison O’MaraDr. Lars-Erik Malmberg Department of Education, University.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Introduction to Multilevel Modeling Stephen R. Porter Associate Professor Dept. of Educational Leadership and Policy Studies Iowa State University Lagomarcino.
Widening Participation in Higher Education: A Quantitative Analysis Institute of Education Institute for Fiscal Studies Centre for Economic Performance.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
A short introduction to epidemiology Chapter 4: More complex study designs Neil Pearce Centre for Public Health Research Massey University Wellington,
The Choice Between Fixed and Random Effects Models: Some Considerations For Educational Research Clarke, Crawford, Steele and Vignoles and funding from.
Multiple Linear Regression ● For k>1 number of explanatory variables. e.g.: – Exam grades as function of time devoted to study, as well as SAT scores.
HLM Models. General Analysis Strategy Baseline Model - No Predictors Model 1- Level 1 Predictors Model 2 – Level 2 Predictors of Group Mean Model 3 –
And now for something completely different, or is it? Modelling contextuality and heterogeneity Or Realistically complex modelling (
Analysis Overheads1 Analyzing Heterogeneous Distributions: Multiple Regression Analysis Analog to the ANOVA is restricted to a single categorical between.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Data Analysis in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine October.
Talk by William Browne Slides by Kelvyn Jones In memory of Jon Rasbash all University of Bristol Monday 5th July 2010, Session 2 WHAT IS: multilevel modelling?
Sampling and Nested Data in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
Chapter 5 Multilevel Models
Instructor: Dr. Amery Wu
Jessaca Spybrook Western Michigan University Multi-level Modeling (MLM) Refresher.
Funded through the ESRC’s Researcher Development Initiative Department of Education, University of Oxford Session 2.1 – Revision of Day 1.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Multilevel modelling: general ideas and uses
Multiple Regression Analysis and Model Building
HLM with Educational Large-Scale Assessment Data: Restrictions on Inferences due to Limited Sample Sizes Sabine Meinck International Association.
Brief Introduction to Multilevel Analysis
From GLM to HLM Working with Continuous Outcomes
An Introductory Tutorial
Rachael Bedford Mplus: Longitudinal Analysis Workshop 23/06/2015
Presentation transcript:

Kelvyn Jones, University of Bristol Wednesday 2nd July 2008, Session 29 WHAT IS: multilevel modelling?

What is multilevel modelling? Realistically complex modelling Structures that generate dependent data Data-frames for modelling Distinguishing between variables and levels (fixed and random classifications) Why should we use multilevel modelling as compared to other approaches? Going further

Realistically complex modelling Statistical models as a formal framework of analysis with a complexity of structure that matches the system being studied Three KEY Notions Modelling contextuality: micro & macro eg individual house prices varies from neighbourhood to n’hood eg individual house prices varies differentially from neighbourhood to neighbourhood according to size of property Modelling heterogeneity standard regression models ‘averages’, ie the general relationship ML model variances Eg between-n’hood AND between-house, within-n’hood variation Modelling dependent data deriving from complex structure series of structures that ML can handle routinely, ontological depth!

1:Hierarchical structures : model all levels simultaneously a) People nested within places: two-level model b) People nested within households within places: three-level model Modelling data with complex structure Note imbalance allowed! 2

So far unit diagrams now…… b) multiple membership with weights a) cross-classified structure Non- Hierarchical structures

CLASSIFICATION DIAGRAMS a) 3-level hierarchical structure b) cross-classified structure c) multiple membership structure People Neighbourhoods Regions Students Neighbourhoods Schools Neighbourhoods People

School S1 S2 S3 S4 Pupils P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 P11 P12 Area A1 A2 A3 Combining structures: crossed-classifications and multiple membership relationships P1 Pupil 1 moves in the course of the study from residential area 1 to 2 and from school 1 to 2 Now in addition to schools being crossed with residential areas pupils are multiple members of both areas and schools. Pupil 8 has moved schools but still lives in the same area P8 Pupil 7 has moved areas but still attends the same school P7 Student School Area

ALSPAC All children born in Avon in 1990 followed longitudinally Multiple attainment measures on a pupil Pupils span 3 school-year cohorts (say 1996,1997,1998) Pupils move between teachers,schools,neighbourhoods Pupils progress potentially affected by their own changing characteristics, the pupils around them, their current and past teachers, schools and neighbourhoods occasions Pupil TeacherSchool Cohort Primary school Area

IS SUCH COMPLEXITY NEEDED? Complex models are NOT reducible to simpler models Confounding of variation across levels (eg primary and secondary school variation) M. occasions Pupil TeacherSchool Cohort Primary school Area

A data-frame for examining neighbourhood effects on price of houses Classifications or levels ResponseExplanatory variables House i N’hood j House Price ij No of Rooms ij House type ij N’hood Type j 11756SemiSuburb 21718SemiSuburb 31917DetSuburb 12684TerCentral 22376DetCentral 32676TerCentral 13827SemiSuburb 23855DetSuburb 14549TerrCentral 24917TerrCentral 34434SemiCentral DetCentral Questions for multilevel (random coefficient) models What is the between-neighbourhood variation in price taking account of size of house? Are large houses more expensive in central areas? Are detached houses more variable in price Form needed for MLwiN

P1 P2 P O1 O2 O3 O4 O1 O2 O1 O2 O3 Person Measurement Occasion Classification diagram Unit diagram Two level repeated measures design: classifications, units and dataframes a) in long form Classifications or levels ResponseExplanatory variables Occasion i Person j Income ij Age ij Gender j F F F M M F F F b) in short form : Person Inc- Occ1 Inc- Occ2 Inc- Occ3 Age- Occ1 Age- Occ2 Age- Occ3 Gender F 28291*3233*M F Form needed for MLwiN

House H1 H2 H3 H1 H2 H3 H1 H2 H1 H2 H3 H4 N’hood N1 N2 N1 N2 N’hood type Suburb Central Distinguishing Variables and Levels Classifications or levelsResponseExplanatory Variables House I Nhood jType k Price ijkRooms ijkHouse type ijk ijk 11Suburb756Det 21Suburb714Det 31Suburb917F 12Central689F 22Central376M Etc N’hood type is not a random classification but a fixed classification, and therefore an attribute of a level; ie a VARIABLE Random classification: if units can be regarded as a random sample from a wider population of units. Eg houses and n’hoods Fixed classification is a small fixed number of categories. Eg Suburb and central are not two types sampled from a large number of types, on the basis of these two we cannot generalise to a wider population of types of n’hoods, NO!

Analysis Strategies for Multilevel Data I Group-level analysis. Aggregate to level 2 and fit standard regression model. Problem: Cannot infer individual-level relationships from group-level relationships (ecological or aggregation fallacy) Robinson (1950) calculated the correlation between illiteracy and ethnicity in the USA. 2 scales of analysis for 1930 USA - Individual: for 97 million people; - States: 48 units

Analysis Strategies continued IIIndividual-level analysis. Fit standard OLS regression model Problem: Assume independence of residuals, but may expect dependency between individuals in the same group; leads to underestimation of SEs; Type I errors Bennet’s (1976) “teaching styles” study uses a single-level model: test scores for English, Reading and Maths aged 11 were significantly influenced by teaching style; PM calls for a return to ‘traditional’ or formal methods Re-analysis: Aitkin, M. et al (1981) Statistical modelling of data on teaching styles (with Discussion). J. Roy. Statist. Soc. A 144, Using proto- multilevel models to handle dependence of pupils within classes; no significant effect Also atomistic fallacy………….

What does an individual analysis miss? Re-analysis as a two level model (97m in 48 States) Who is illiterate? Individual model Does this vary from State to State? States People Cross-level interactions?

Analysis Strategies (cont.) III Contextual analysis. Analysis individual-level data but include group-level predictors Problem: Assumes all group-level variance can be explained by group-level predictors; incorrect SE’s for group-level predictors Do pupils in single-sex school experience higher exam attainment? Structure: 4059 pupils in 65 schools Response: Normal score across all London pupils aged 16 Predictor: Girls and Boys School compared to Mixed school Parameter Single level Multilevel Cons (Mixed school) (0.021) (0.070) Boy school (0.049) (0.149) Girl school (0.034) (0.117) Between school variance(  u 2 ) (0.030) Between student variance (  e 2 ) (0.022) (0.019) SEs

Analysis Strategies (cont.) IV Analysis of covariance (fixed effects model). Include dummy variables for groups Problems What if number of groups very large, eg households? No single parameter assess between group differences Cannot make inferences beyond groups in sample Cannot include group-level predictors as all degrees of freedom at the group-level have been consumed

Analysis Strategies (cont.) VFit single-level model but adjust standard errors for clustering. Problems: Treats groups as a nuisance rather than of substantive interest; no estimate of between-group variance; not extendible to more levels and complex heterogeneity VI Multilevel (random effects) model. Partition residual variance into between- and within-group (level 2 and level 1) components. Allows for un-observables at each level, corrects standard errors, Micro AND macro models analysed simultaneously, avoids ecological fallacy and atomistic fallacy: richer set of research questions

Type of questions tackled by ML: fixed AND random effects Even with only ‘simple’ hierarchical 2-level structure EG 2-level model: current attainment given prior attainment of pupils(1) in schools(2) Do Boys make greater progress than Girls (F: ie averages) Are boys more or less variable in their progress than girls? (R: modelling variances) What is the between-school variation in progress? (R) Is School X different from other schools in the sample in its effect? (F)……….

Type of questions tackled by ML cont. Are schools more variable in their progress for pupils with low prior attainment? (R) Does the gender gap vary across schools? (R) Do pupils make more progress in denominational schools? (F) ) (correct SE’s) Are pupils in denominational schools less variable in their progress? (R) Do girls make greater progress in denominational schools? (F) (cross-level interaction) (correct SE’s) More generally a focus on variances: segregation, inequality are all about differences between units

Sometimes: single level models can be seriously misleading! Why should we use multilevel models?

Resources Centre for Multilevel Modelling Provides access to general information about multilevel modelling and MlwiN. discussion group: Lemma training repository php php

Texts There is also a ‘Useful Books’ guide on the website.

The MLwiN manuals are another training resource