Lecture 5 “additional notes on crossed random effects models”

Slides:



Advertisements
Similar presentations
Introduction Simple Random Sampling Stratified Random Sampling
Advertisements

Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
By Zach Andersen Jon Durrant Jayson Talakai
Cognitive Modelling – An exemplar-based context model Benjamin Moloney Student No:
Statistical Analysis Overview I Session 2 Peg Burchinal Frank Porter Graham Child Development Institute, University of North Carolina-Chapel Hill.
3-Dimensional Gait Measurement Really expensive and fancy measurement system with lots of cameras and computers Produces graphs of kinematics (joint.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Validity In our last class, we began to discuss some of the ways in which we can assess the quality of our measurements. We discussed the concept of reliability.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
CHAPTER 23: Two Categorical Variables The Chi-Square Test ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
Lecture 4 Linear random coefficients models. Rats example 30 young rats, weights measured weekly for five weeks Dependent variable (Y ij ) is weight for.
Complex Surveys Sunday, April 16, 2017.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
ANOVA: ANalysis Of VAriance. In the general linear model x = μ + σ 2 (Age) + σ 2 (Genotype) + σ 2 (Measurement) + σ 2 (Condition) + σ 2 (ε) Each of the.
Part I – MULTIVARIATE ANALYSIS
Clustered or Multilevel Data
Lecture 9: One Way ANOVA Between Subjects
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 5-2.
Lecture Slides Elementary Statistics Twelfth Edition
Lecture 6: Descriptive Statistics: Probability, Distribution, Univariate Data.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Two-Way Analysis of Variance STAT E-150 Statistical Methods.
Analysis of Clustered and Longitudinal Data
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Lecture Slides Elementary Statistics Twelfth Edition
Hypothesis Testing II The Two-Sample Case.
5-2 Probability Distributions This section introduces the important concept of a probability distribution, which gives the probability for each value of.
Introduction to plausible values National Research Coordinators Meeting Madrid, February 2010.
Statistics Used In Special Education
Simple Linear Regression
Multiple Regression. In the previous section, we examined simple regression, which has just one independent variable on the right side of the equation.
H IERARCHICAL B AYESIAN M ODELLING OF THE S PATIAL D EPENDENCE OF I NSURANCE R ISK L ÁSZLÓ M ÁRKUS and M IKLÓS A RATÓ Eötvös Loránd University Budapest,
Student Engagement Survey Results and Analysis June 2011.
Introduction ANOVA Mike Tucker School of Psychology B209 Portland Square University of Plymouth Drake Circus Plymouth, PL4 8AA Tel: +44 (0)
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Chapter 5 Discrete Probability Distributions 5-1 Review and Preview 5-2.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
© 2003 Prentice-Hall, Inc.Chap 6-1 Business Statistics: A First Course (3 rd Edition) Chapter 6 Sampling Distributions and Confidence Interval Estimation.
Scientific question: Does the lunch intervention impact cognitive ability? The data consists of 4 measures of cognitive ability including:Raven’s score.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Chapter 14 Multiple Regression Models. 2  A general additive multiple regression model, which relates a dependent variable y to k predictor variables.
Funded through the ESRC’s Researcher Development Initiative Prof. Herb MarshMs. Alison O’MaraDr. Lars-Erik Malmberg Department of Education, University.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Sub-regional Workshop on Census Data Evaluation, Phnom Penh, Cambodia, November 2011 Evaluation of Age and Sex Distribution United Nations Statistics.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Regression Analysis: Part 2 Inference Dummies / Interactions Multicollinearity / Heteroscedasticity Residual Analysis / Outliers.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Sampling and Nested Data in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
Chapter 5 Multilevel Models
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Chapter 5 Probability Distributions 5-1 Overview 5-2 Random Variables 5-3 Binomial Probability Distributions 5-4 Mean, Variance and Standard Deviation.
Lab 4 Multiple Linear Regression. Meaning  An extension of simple linear regression  It models the mean of a response variable as a linear function.
Inferential statistics PSY Central concepts in inferential statistics: Sampling error Sampling distribution Standard error Null hypothesis and alternative.
Methods of Presenting and Interpreting Information Class 9.
Stats Methods at IC Lecture 3: Regression.
Lecture Slides Elementary Statistics Twelfth Edition
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
An Example of {AND, OR, Given that} Using a Normal Distribution
LESSON 4.4. MULTIPLE LINEAR REGRESSION. Residual Analysis
From GLM to HLM Working with Continuous Outcomes
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Consider the following problem
Mathematical Expectation
Presentation transcript:

Lecture 5 “additional notes on crossed random effects models”

Clustered versus non clustered random effects (Chap 11, new edition) We have discussed higher-level hierarchical models where units are classified by some factors (for example schools) into top level clusters at level L. The units in each top level cluster are then (sub)classified by a further factor (for example class) into clusters at level L-1. The factors defining the classifications are nested in the same sense that a lower-level cluster can only belong to one higher level cluster (for example a class can only belong to one school)

Non hierarchical models but random effects models We now discuss non hierarchical models where units are cross-classified by two or more factors, with each unit potentially belonging to any combination of levels of the different factors

Non Hierarchical Models So far, we have treated occasions nested within individuals However, if all individuals are affected similarly by some events or characteristics associated with the occasions, such as weather conditions, strikes, new legislation etc.. It seems reasonable to treat occasions as crossed with individuals, or to consider a “main effect” of time.

Non hierarchical models Factors are not always completely crossed. For example, the high schools and elementary schools attended by students are not clustered, but there are many combinations of high school and elementary school that do not occur in practice, perhaps because the schools are in different geographical regions.

A psychological experiment with two potentially interacting factors (Gelman, sec 13.5) Let denotes the success rate of a pilots training on a flight simulator (j=1,2,3,4,5) in airport (k=1,….,8). These 40 data points have two groupings - treatments and airports - which are not nested

Non nested random effects model Treatment random effectsAirport random effects

Estimates of the variance components The variance of the success rates is huge among airports - even larger than among the individual measurements. Whereas there is almost no differences across treatments

How much do primary and secondary schools afflict attainment at age 16? Data are cross-classified by 148 primary schools (elementary schools) and 19 secondary schools (middle/high schools) (fife.dta) attain: attainment score at age 16 pid: identifier for primary school (up to age 12) sid: identifier for secondary school (from age 12) vrq: verbal reasoning score from test taken in the last year of primary school sex: gender (1:female; 0:male)

Data characteristics First, not every combination of primary and secondary school exists. Second, many combinations of primary and secondary schools occur multiple times For instance, students that attend elementary school 1 ended up in 3 secondary schools (1,9,18) There are at most 6 secondary schools per primary schools, and for 90% of the primary schools there are at most 3 secondary schools per primary school There are between 7 and 32 primary schools per secondary school, the median being between 13 and 14

An additive crossed random effects model Attainment score at age 16 for student i who went to secondary school j and primary school k Estimation using xtmixed Variance across secondary schools Variance across primary schools Residual variance Average score Random effects

Results for the additive model The estimated standard deviation of the primary school random effect ( ) is 1.06, which is considerably larger than the estimated standard deviation of the secondary school random effect, given by 0.59 ( ) Therefore elementary schools appear to be more variable in their effects than secondary schools. However neither of these estimates are precise The standard deviation of the ( ) is estimated as 2.85( ). This number reflects any interactions between primary and secondary schools from the means implied by the additive effects and variability within groups of children belonging to the same combination of primary and secondary school

Including a random interaction For many combinations of primary and secondary school, we have several observations because more than one child attended that combination of schools

The random interaction term The interaction term takes on a different value for each combination of secondary and primary school to allow the assumption of additive random effects to be relaxed. For example, some secondary schools might be more beneficial to children who attended particular elementary schools, perhaps because of similar instructional practices We could not include interaction terms in the pilot example, because there we have only one observation for each treatment, airport combination

Intraclass correlations IC among children for the same primary schools but different secondary schools IC among children for the same secondary schools but different primary schools IC among children for the same primary and secondary schools Given the secondary school, this denotes the IC correlation among children that had the same primary school

Diagnostics We can obtain the empirical Bayes estimates of both primary and secondary school random effects. If the model is correct, there EB estimates should have a normal distribution We assess the normality of the EB estimates using a QQ plot