Modelling non-independent random effects in multilevel models William Browne Harvey Goldstein University of Bristol.

Slides:



Advertisements
Similar presentations
Handling attrition and non- response in longitudinal data Harvey Goldstein University of Bristol.
Advertisements

Modelling social and educational segregation Harvey Goldstein University of Bristol We start from a suitable measure – here we consider FSM A Model based.
Latent normal models for missing data Harvey Goldstein Centre for Multilevel Modelling University of Bristol.
Topic 12: Multiple Linear Regression
Kin 304 Regression Linear Regression Least Sum of Squares
Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.
Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.
Logistic Regression Psy 524 Ainsworth.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
BA 275 Quantitative Business Methods
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Multilevel survival models A paper presented to celebrate Murray Aitkin’s 70 th birthday Harvey Goldstein ( also 70 ) Centre for Multilevel Modelling University.
Linear regression models
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Lecture 6 (chapter 5) Revised on 2/22/2008. Parametric Models for Covariance Structure We consider the General Linear Model for correlated data, but assume.
Qualitative Variables and
Lecture 4 (Chapter 4). Linear Models for Correlated Data We aim to develop a general linear model framework for longitudinal data, in which the inference.
Growth Curve Model Using SEM
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
School of Veterinary Medicine and Science Multilevel modelling Chris Hudson.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Correlation and Regression. Spearman's rank correlation An alternative to correlation that does not make so many assumptions Still measures the strength.
9. SIMPLE LINEAR REGESSION AND CORRELATION
Multivariate Data Analysis Chapter 4 – Multiple Regression.
The Simple Regression Model
Log-linear and logistic models
Multilevel Modelling of PLASC data Harvey Goldstein University of Bristol.
Statistical Analysis SC504/HS927 Spring Term 2008 Session 7: Week 23: 7 th March 2008 Complex independent variables and regression diagnostics.
Chapter 11 Multiple Regression.
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 11 Notes Class notes for ISE 201 San Jose State University.
Simple Linear Regression Analysis
Introduction to Linear and Logistic Regression. Basic Ideas Linear Transformation Finding the Regression Line Minimize sum of the quadratic residuals.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Lecture II-2: Probability Review
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Generalized Linear Models
Objectives of Multiple Regression
Introduction to Multilevel Modeling Using SPSS
Multilevel Modeling: Other Topics
Regression and Correlation Methods Judy Zhong Ph.D.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Chapter 12 Multiple Regression and Model Building.
Simple Linear Regression Models
The Examination of Residuals. The residuals are defined as the n differences : where is an observation and is the corresponding fitted value obtained.
Applications The General Linear Model. Transformations.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Modelling non-independent random effects in multilevel models Harvey Goldstein and William Browne University of Bristol NCRM LEMMA 3.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Centre for Market and Public Organisation School Markets & Correlated Random Effects Paul Clarke (with Fiona Steele, Harvey Goldstein and Rich Harris)
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Simple and multiple regression analysis in matrix form Least square Beta estimation Beta Simple linear regression Multiple regression with two predictors.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Estimating standard error using bootstrap
The simple linear regression model and parameter estimation
Chapter 4 Basic Estimation Techniques
ESTIMATION.
CHAPTER 7 Linear Correlation & Regression Methods
Linear Mixed Models in JMP Pro
CH 5: Multivariate Methods
Linear and generalized linear mixed effects models
Generalized Linear Models
Regression Analysis Week 4.
CHAPTER 29: Multiple Regression*
Fixed, Random and Mixed effects
3.2. SIMPLE LINEAR REGRESSION
Presentation transcript:

Modelling non-independent random effects in multilevel models William Browne Harvey Goldstein University of Bristol

A standard multilevel (VC) model Fixed Random level 2 level 1 residual The are assumed independent. But this is sometimes unrealistic: Repeated measures growth models with closely spaced occasions Schools competing for resources in a ‘zero-sum’ environment

Repeated measures growth curves A simple model of linear growth with random slopes: A model for (non-independent) level 1 residuals might be written: Leading to an exponential decay function. (Goldstein and Healy 1994)

Schools in competition where this correlation is inversely proportional to the (resource) distance between the schools. If we can specify a suitable (set of ) distance functions then we can estimate the relevant parameters. One possibility is to use the extent of overlap between appropriately defined catchment areas. Work using the ALSPAC cohort is currently underway.

Other link functions These have the following forms Logit: Log:

Link functions Link function f(s). From left to right; hyperbolic, logit, log

Parameters and estimation We need to estimate the parameters of the correlation function, the variances and the fixed effects. We propose an MCMC algorithm and have programmed this for general 2- level models where correlations can exist at either or both levels and responses can be normal or binary. Steps are a mixture of Gibbs and MH sampling with adaptive proposal distributions and suitable diffuse priors

Example 1: Growth data The data are 9 measurements on 20 boys around age 13, approximately 3 months apart Fitting a 2-level model with random linear and quadratic coefficients does not remove residual autocorrelation among level 1 residuals. We model the correlation as a negative exponentially decreasing function of the time difference We use a log (exponential) link since correlations should be positive In discrete time (equal intervals) this is a standard first order autoregressive model We fit a 4-th degree polynomial with and without random linear coefficient

Results random slope Intercept only For model A the correlation between measurements 0.25 years apart is 0.73 and for model B is

An equivalence For a 2-level variance components model the full covariance matrix among the level 1 units in a level 2 unit can be written in the form where in this case there are 4 level 1 units. For the model with an equal correlation structure at level 1 and no level 2 variation the corresponding covariance matrix is equivalent, namely

A level 2 example: dependence based on distance apart We have a three level model consisting of schools at level 3, cohorts or year groups at level 2 (2004,2005,2006) and students at level 1. The data are taken from the PLASC/NPD database response is GCSE score and predictors include 11 year KS test score We fit as a 2-level model (school cohorts at level 2) with a correlation structure between cohorts within schools and dummies for years : So that the correlation is modelled by a constant + decreasing function of time difference

A two level model for examination data with correlation structure at level 2. Inverse tanh link function. Burn in =500, Sample =5000. Year 1 (2004) chosen as base category. Uniform priors for variances. Second covariance matrix is unretricted estimate. ParameterEstimateStandard error Intercept Year Year Pretest Level 1 variance Level 2 covariance matrix DIC (PD) (126.4) RESULTS

Conclusions These models provide a useful generalisation to standard ‘independence’ models and are readily extended to non-normal responses, cross classifications etc. They allow us to more realistically describe the behaviour of institutions that are interactive rather than independently behaving units