What is multilevel modelling?

Slides:



Advertisements
Similar presentations
MANOVA (and DISCRIMINANT ANALYSIS) Alan Garnham, Spring 2005
Advertisements

Contextual effects In the previous sections we found that when regressing pupil attainment on pupil prior ability schools vary in both intercept and slope.
Mark Tranmer Cathie Marsh Centre for Census and Survey Research Multilevel models for combining macro and micro data Unit 5.
Multilevel modelling short course
The Census Area Statistics Myles Gould Understanding area-level inequality & change.
Gender and Educational Attainment in Schools Stephen Machin and Sandra McNally.
Levels of causation and the interpretation of probability Seminar 2 Federica Russo Philosophy, Louvain & Kent.
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
Advanced Lazarsfeldian Methodology Conference From Lazarsfeldian Contextual analysis to Multilevel models (Strategies for analysis of individual and/or.
By Zach Andersen Jon Durrant Jayson Talakai
Multilevel Modeling in Health Research April 11, 2008.
HLM PSY 515 Jim Graham Fall Fixed Effects So far, most of we have dealt with uses what are called fixed effects. So far, most of we have dealt with.
3-Dimensional Gait Measurement Really expensive and fancy measurement system with lots of cameras and computers Produces graphs of kinematics (joint.
Random effects as latent variables: SEM for repeated measures data Dr Patrick Sturgis University of Surrey.
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
Advanced Methods and Models in Behavioral Research – 2014 Been there / done that: Stata Logistic regression (……) Conjoint analysis Coming up: Multi-level.
School of Veterinary Medicine and Science Multilevel modelling Chris Hudson.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
A multilevel approach to geography of innovation Martin Srholec TIK Centre University of Oslo DIME International Workshop.
Clustered or Multilevel Data
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
Multilevel Modelling of PLASC data Harvey Goldstein University of Bristol.
Topic 3: Regression.
Chapter 7 Correlational Research Gay, Mills, and Airasian
Foster Care Reunification: The use of hierarchical modeling to account for sibling and county correlation Emily Putnam-Hornstein, MSW Center for Social.
How Institutional Context Affects Degree Production and Student Aspirations in STEM Kevin Eagan, Ph.D. University of California, Los Angeles January 28,
Experimental Group Designs
Analysis of Clustered and Longitudinal Data
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
What influences English and Mathematics attainment at age 11? Evidence from the EPPSE project.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
An Introduction to HLM and SEM
Lecture 5 “additional notes on crossed random effects models”
Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.
Modelling non-independent random effects in multilevel models William Browne Harvey Goldstein University of Bristol.
Workshop 1 Specify a multilevel structure for EITHER a response variable of your choice OR for a model to explain house prices OR voting behaviour Template.
Advanced Methods and Models in Behavioral Research – 2010/2011 AMMBR course design CONTENT METHOD Y is 0/1 conjoint analysis logistic regression multi-level.
Hierarchical Linear Modeling (HLM): A Conceptual Introduction Jessaca Spybrook Educational Leadership, Research, and Technology.
Chapter 14 Introduction to Multiple Regression
Introduction Multilevel Analysis
Variables, sampling, and sample size. Overview  Variables  Types of variables  Sampling  Types of samples  Why specific sampling methods are used.
Funded through the ESRC’s Researcher Development Initiative Prof. Herb MarshMs. Alison O’MaraDr. Lars-Erik Malmberg Department of Education, University.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Chap 14-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics.
HLM Models. General Analysis Strategy Baseline Model - No Predictors Model 1- Level 1 Predictors Model 2 – Level 2 Predictors of Group Mean Model 3 –
And now for something completely different, or is it? Modelling contextuality and heterogeneity Or Realistically complex modelling (
META-ANALYSIS, RESEARCH SYNTHESES AND SYSTEMATIC REVIEWS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
Talk by William Browne Slides by Kelvyn Jones In memory of Jon Rasbash all University of Bristol Monday 5th July 2010, Session 2 WHAT IS: multilevel modelling?
Multilevel Modeling. Multilevel Question Turns out the Simple Random Sampling is very expensive Travel to Moscow, Idaho to give survey to a single student.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Advanced Methods and Models in Behavioral Research – 2009/2010 AMMBR course design CONTENT METHOD Y is 0/1 conjoint analysis logistic regression multi-level.
Classification Ensemble Methods 1
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Funded through the ESRC’s Researcher Development Initiative Department of Education, University of Oxford Session 2.1 – Revision of Day 1.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Kelvyn Jones, University of Bristol Wednesday 2nd July 2008, Session 29 WHAT IS: multilevel modelling?
[Part 5] 1/43 Discrete Choice Modeling Ordered Choice Models Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
Multivariate Statistics Latent Growth Curve Modelling. Random effects as latent variables: SEM for repeated measures data Dr Patrick Sturgis University.
1 The Training Benefits Program – A Methodological Exposition To: The Research Coordination Committee By: Jonathan Adam Lind Date: 04/01/16.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Methods of Presenting and Interpreting Information Class 9.
Department of Politics and International Relations
Multiple Regression Analysis and Model Building
An introduction to basic multilevel modeling
Moving Beyond Frontiers:
Brief Introduction to Multilevel Analysis
An Introductory Tutorial
Presentation transcript:

What is multilevel modelling? Kelvyn Jones, School of Geographical Sciences, LEMMA, University of Bristol 2nd Oxford Research Methods Festival July 2006

MULTILEVEL MODELS AKA random-effects models, hierarchical models, variance-components models, random-coefficient models, mixed models

Two-level hierarchical model Micro model Macro models Combined multilevel model Level 2 variance Level 1 variance

Three KEY Notions Modelling contextuality: firms as contexts eg discrimination varies from firm to firm eg discrimination varies differentially for employees of different ages from firm to firm Modelling heterogeneity standard regression models ‘averages’, ie the general relationship ML models variances Eg between-firm AND between-employee, within-firm variation Modelling data with complex structure - series of structures that ML can handle routinely

Structures: UNIT DIAGRAMS 1: Hierarchical structures a) Pupils nested within schools: modelling progress NB imbalance More examples follow…...

Examples of strict hierarchy Education pupils (1) in schools (2) pupils (1) in classes( 2) in schools (3) Surveys: 3 stage sampling respondents (1) in neighbourhoods(2) in regions(3) Business individuals(1) within teams(2) within organizations(3) Psychology individuals(1) within family(2) individuals(1) within twin sibling pair(2) Economics employees(1) within firms(2) NB all are structures in the POPULATION (ie exist in reality)

1: Multi-stage samples as hierarchies Two-level structure imposed by design Respondents nested within PSU’s Usually generates dependent data with individuals living within the same PSU can be expected to be more alike than a random sample If not allowed for, get incorrect estimates of SE’s and therefore Type 1 errors: Multilevel models model this dependency

1: Hierarchical structures (continued) b) Repeated measures of voting behaviour at the UK general election

1: Hierarchical structures (continued) c) Multivariate design for health-related behaviours Extreme case of rotational designs

2: Non- Hierarchical structures a) cross-classified structure b) multiple membership with weights Can represent reality by COMBINATIONS of different types of structures But can get complex so….

CLASSIFICATION DIAGRAMS a) 3-level hierarchical structure b) cross-classified structure

CLASSIFICATION DIAGRAMS(cont) c) multiple membership structure d) spatial structure

ALSPAC All children born in Avon in 1990 followed longitudinally occasions Pupil Teacher School Cohort Primary school Area ALSPAC All children born in Avon in 1990 followed longitudinally Multiple attainment measures on a pupil Pupils span 3 school-year cohorts (say 1996,1997,1998) Pupils move between teachers,schools,neighbourhoods Pupils progress potentially affected by their own changing characteristics, the pupils around them, their current and past teachers, schools and neighbourhoods

IS SUCH COMPLEXITY NEEDED? M. occasions Pupil Teacher School Cohort Primary school Area IS SUCH COMPLEXITY NEEDED? Complex models are NOT reducible to simpler models Confounding of variation across levels (eg primary and secondary school variation)

Summary Multilevel models can handle social science research problems with “realistic complexity” Complexity takes on two forms and two types As ‘Structure’ ie dependencies - naturally occurring dependencies Eg: pupils in schools ; measurements over time - ‘imposed-by-design’ dependencies Eg: multistage sample As ‘Missingness’ ie imbalance - naturally occurring imbalances Eg: not answering in a panel study - ‘imposed-by-design’ imbalances Eg: rotational questions Most (all?) social science research problems and designs are a combination of strict hierarchies, cross-classifications and multiple memberships

So what? Substantive reasons: richer set of research questions To what extent are pupils affected by school context in addition to or in interaction with their individual characteristics? What proportion of the variability in achievement at aged 16 can be accounted for by primary school, secondary school and neighbourhood characteristics? Technical reasons: Individuals drawn from a particular ‘groupings’ can be expected to be more alike than a random sample Incorrect estimates of precision, standard errors, confidence limits and tests; increased risk of finding relationships and differences where none exists

Varying relationships: what are random effects? “There are NO general laws in social science that are constant over time and independent of the context in which they are embedded” Rein (quoted in King, 1976)

VARYING RELATIONS 3 2 1 -1 -2 -3 -4 8 7 6 5 4 Rooms Multilevel modelling can handle - multiple outcomes - categorical & continuous predictors - categorical and continuous responses But KISS……… Single response: house price Single predictor - size of house, number of rooms Two level hierarchy - houses at level 1 nested within - neighbourhoods at level 2 are the contexts Set of characteristic plots……………… 3 2 1 -1 -2 -3 -4 8 7 6 5 4 Rooms

Example of varying relations (BJPS 1992) Stucture: 3 levels strict hierarchy individuals within constituencies within regions Response: Voting for labour in 1987 Predictors 1 age, class, tenure, employment status 2 %unemployed, employment change, % in mining in 1981 Expectation: coal mining areas vote for the left Allow: mining parameters for mining effect(2) to vary over region(3) in a 3-level logistic model

Varying relations for Labour voting and % mining

Higher-level variables So far all predictors have been level 1 (Math3, boy/girl); (size,type of property) Now higher level predictors (contextual,ecological) - global occurs only at the higher level; -aggregate based on summarising a level 1 attribute Example: pupils in classes progress affected by previous score (L1); class average score (A:L2); class homogeneity (SD, A:L2); teaching style (G:L2) NOW: trying to account for between school differences

Propensity for left vote Main and cross-level relationships: a graphical typology The individual and the ecological - 1 Low SES Propensity for left vote High SES % Working class

The individual and the ecological - 2 Low SES High SES Propensity for left vote % Working class

The individual and the ecological - 3 consensual Low SES High SES Propensity for left vote % Working class

A graphical typology of cross-level interactions (Jones & Duncan 1993) Individual Ecological Reactive Consensual Reactive for W; Consensual for M Non-linear cross-level interactions

STRUCTURE: 2275 voters in 218 constituencies, 1992 RESPONSE: vote Labour not Conservative PREDICTORS: Level - individual: age, sex, education, tenure, income 1 : 8-fold classification of class - constituency:% Local authority renters 2 % Employers and managers;100 - % Unemployed MODEL: cross-level interactions between INDIVIDUAL&CONSTITUENCY characteristics Fixed part main effects: 8 fold division of class Random part at level 2: 2 fold division of class Working class: unskilled and skilled manual, foreman Non-working class: public and private-sector salariat, routine non- manual, petty-bourgeoisie, ‘unstated’

Cross-level interactions

Type of questions tackled by multilevel modelling I 2-level model: current attainment given prior attainment of pupils(1) in schools(2) NB assuming a random sample of pupils from a random samples of schools Do Boys make greater progress than Girls (F) Are boys more or less variable in their progress than girls?(R) What is the between-school variation in progress? (R) Is School X different from other schools in the sample in its effect? (F) continued…….

Type of questions tackled by multilevel modelling II Are schools more variable in their progress for pupils with low prior attainment? (R) Does the gender gap vary across schools? (R) Do pupils make more progress in denominational schools?(F) Are pupils in denominational schools less variable in their progress? (R) Do girls make greater progress in denominational schools? (F) (cross-level interaction)

Fixed and Random classifications Levels and Variables Why are schools a level but gender a variable? Schools = Level = a population of units from which we have taken a random sample Gender = Variable ≠ a sample out of all possible gender categories Fixed and Random classifications Fixed classification Discrete categories of a variable (eg Gender) Not sample from a population Specific categories only contribute to their respective means Information on Females does contribute to the estimate for Males Random classification Generalization of a level (e.g., schools) Random effects come from a distribution All schools contribute to between-school variance Information is exchangeable between schools

When levels become variables... Schools can be treated as a variable and placed in the fixed part; achieved by a set of dummy variables one for each school; target of inference is each specific school; each one treated as an ‘island unto itself’ Schools in the random part, treated as a level, with generalization possible to ALL schools (or ‘population’ of schools), in addition to predicting specific school effects given that they come from an overall distribution

Conclusions 3 Substantive advantages 1 Modelling contextuality and heterogeneity 2 Micro AND macro models analysed simultaneously -avoids ecological fallacy and atomistic fallacy 3 Social contexts maintained in the analysis; permits intensive, qualitative research on ‘interesting’ cases “The complexity of the world is not ignored in the pursuit of a single universal equation, but the specific of people and places are retained in a model which still has a capacity for generalisation” And finally

LEMMA: http://www.ncrm.ac.uk/nodes/lemma/about.php Going Further! Learning Environment for Multilevel Methodology and Applications NCRM node based at University of Bristol LEMMA: http://www.ncrm.ac.uk/nodes/lemma/about.php