Analysis for Designs with Assignment of Both Clusters and Individuals

Slides:



Advertisements
Similar presentations
Multilevel Models with Latent Variables Daniel J. Bauer Department of Psychology University of North Carolina 9/13/04 SAMSI Workshop.
Advertisements

Review bootstrap and permutation
Properties of Least Squares Regression Coefficients
Statistical Analysis Overview I Session 2 Peg Burchinal Frank Porter Graham Child Development Institute, University of North Carolina-Chapel Hill.
3-Dimensional Gait Measurement Really expensive and fancy measurement system with lots of cameras and computers Produces graphs of kinematics (joint.
Factorial ANOVA More than one categorical explanatory variable.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Analysis of Clustered and Longitudinal Data Module 3 Linear Mixed Models (LMMs) for Clustered Data – Two Level Part A 1 Biostat 512: Module 3A - Kathy.
Objectives (BPS chapter 24)
2005 Hopkins Epi-Biostat Summer Institute1 Module 2: Bayesian Hierarchical Models Francesca Dominici Michael Griswold The Johns Hopkins University Bloomberg.
Missing Data Issues in RCTs: What to Do When Data Are Missing? Analytic and Technical Support for Advancing Education Evaluations REL Directors Meeting.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
EVAL 6970: Meta-Analysis Fixed-Effect and Random- Effects Models Dr. Chris L. S. Coryn Spring 2011.
EVAL 6970: Meta-Analysis Fixed-Effect and Random- Effects Models Dr. Chris L. S. Coryn Spring 2011.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Chapter 4 Multiple Regression.
Simulation Modeling and Analysis Session 12 Comparing Alternative System Designs.
Clustered or Multilevel Data
Chapter 11 Multiple Regression.
Quantitative Genetics
1 Psych 5500/6500 Statistics and Parameters Fall, 2008.
School of Education Archive Analysis: On EEF Trials Adetayo Kasim, ZhiMin Xiao, and Steve Higgins.
Regression and Correlation Methods Judy Zhong Ph.D.
Inference for regression - Simple linear regression
Introduction to plausible values National Research Coordinators Meeting Madrid, February 2010.
Moving from Development to Efficacy & Intervention Fidelity Topics National Center for Special Education Research Grantee Meeting: June 28, 2010.
Sampling and Nested Data in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Sampling Distributions.
Montecarlo Simulation LAB NOV ECON Montecarlo Simulations Monte Carlo simulation is a method of analysis based on artificially recreating.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Funded through the ESRC’s Researcher Development Initiative Prof. Herb MarshMs. Alison O’MaraDr. Lars-Erik Malmberg Department of Education, University.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling Distributions.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
The Campbell Collaborationwww.campbellcollaboration.org C2 Training: May 9 – 10, 2011 Introduction to meta-analysis.
Dependencies Complex Data in Meta-analysis. Common Dependencies Independent subgroups within a study (nested in lab?) Multiple outcomes on the same people.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
Week 101 ANOVA F Test in Multiple Regression In multiple regression, the ANOVA F test is designed to test the following hypothesis: This test aims to assess.
Analysis of Experiments
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
Tutorial I: Missing Value Analysis
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
An Application of Multilevel Modelling to Meta-Analysis, and Comparison with Traditional Approaches Alison O’Mara & Herb Marsh Department of Education,
WELCOME TO BIOSTATISTICS! WELCOME TO BIOSTATISTICS! Course content.
1 One goal in most meta-analyses is to examine overall, or typical effects We might wish to estimate and test the value of an overall (general) parameter.
Sampling Distributions
Module 2: Bayesian Hierarchical Models
RDI Meta-analysis workshop - Marsh, O'Mara, & Malmberg
Approaches to Incorporating Late Pretests in Experiments: Evaluation of Two Early Mathematics and Self-Regulation Interventions Presenter: Fatih Unlu,
Stephen W. Raudenbush University of Chicago December 11, 2006
Linear Mixed Models in JMP Pro
Mixed models and their uses in meta-analysis
Sampling Distributions
Statistical Models for the Analysis of Single-Case Intervention Data
Meta-analysis statistical models: Fixed-effect vs. random-effects
Improving the Design of STEM Impact Studies: Considerations for Statistical Power Discussant Notes Cristofer Price SREE
12 Inferential Analysis.
Ungraded quiz Unit 6.
Ratio and regression estimation STAT262, Fall 2017
Estimation of Sampling Errors, CV, Confidence Intervals
Statistical Models for the Analysis of Single-Case Intervention Data
12 Inferential Analysis.
Sample Sizes for IE Power Calculations.
3 basic analytical tasks in bivariate (or multivariate) analyses:
MetaForest Using random forests to explore heterogeneity in meta-analysis Caspar J. van Lissa, Utrecht University NL
Meta-analysis, systematic reviews and research syntheses
Statistical Models for the Analysis of Single-Case Intervention Data
Presentation transcript:

Analysis for Designs with Assignment of Both Clusters and Individuals Cristofer Price SREE - March 2017

The Problem How to analyze data from designs where: Part of sample has assignment of clusters e.g. random assignment (RA) of districts, schools, classes Part of sample has assignment of individuals within clusters e.g., RA of schools within districts e.g., RA of students within schools

Example Special education intervention in rural schools In some schools, there are enough eligible students to fill treatment slots and also have students in the business-as-usual control condition So there is within-school assignment In other schools, there are not enough eligible students to fill the slots So there is between-school assignment (T schools and C schools)

Prototypical Design with RA of Schools and RA of Students within Schools

Desired Outcome of Analysis Approach Produce a point estimate and standard error of the average treatment effect where Effect is averaged across both RA of schools and RA of students within schools designs Impact estimate is unbiased Impact standard error is unbiased

This Presentation is NOT About Partially nested RCTs Individuals randomized, but in T group individuals are placed into clusters, C group individuals are not Roberts & Roberts (2005); Bauer et al (2008; Baldwin et al. (2011) Lohr et al., (2014); Sterba et al. (2014) Individually randomized grouped treatment designs Inability to get SE.s exactly right in individual RA designs where T and C individuals are placed into clusters (post- RA), and similarity of experiences within clusters induces correlation within clusters Weiss et al. (2015); Lee & Thompson (2005)

This Presentation IS About Two approaches to analyzing the mixed assignment design to produce a point estimate and standard error of the impact Approach 1: Perform two separate analyses, get estimates and SE.s, calculate a weighted average of the two estimates Approach 2: Analyze all data in a single multilevel model that has two separate sets of intercept terms One set of random intercept terms for schools in the school RA design One set of intercept terms for schools in the RA of students within schools design

Take Away Points I show how to implement the two approaches I show how I assessed the extent to which they produced unbiased impact estimates and standard errors I conclude that both approaches: Work equally well to each other Work about as well as standard approaches to analyzing the two samples separately Produce more precise estimates than those obtained in separate analyses

Approach 1: Model For Sample with RA of Schools Two level model with students (level-1) nested in schools (level-2)

Approach 1: Model For Sample with RA of Schools Two level model with students (level-1) nested in schools (level-2) And written in combined form

Approach 1: Model For Sample with RA of Schools (For simplicity, I’m showing models with no covariates)

Approach 1: Model For Sample with RA of Schools

Approach 1: Model For Sample with RA of Schools

Approach 1: Model For Sample with RA of Schools

Approach 1: Model For Sample with RA of Schools

Approach 1: Model for Sample with RA of Students within Schools I’m going to describe three commonly used models for designs with random assignment of students within schools

Approach 1: Model for Sample with RA of Students within Schools

Approach 1: Model for Sample with RA of Students within Schools Fixed Treatment effect

Approach 1: Model for Sample with RA of Students within Schools Fixed Treatment effect Fixed dummies for schools

Approach 1: Model for Sample with RA of Students within Schools

Approach 1: Model for Sample with RA of Students within Schools Fixed dummies for schools

Approach 1: Model for Sample with RA of Students within Schools Fixed dummies for schools Treatment by school interaction terms.

Approach 1: Model for Sample with RA of Students within Schools Fixed dummies for schools Treatment by school interaction terms. This model will produce a separate impact estimate for each school.

Approach 1: Model for Sample with RA of Students within Schools Fixed dummies for schools Treatment by school interaction terms. This model will produce a separate impact estimate for each school. To get an overall estimate, need to calculate an average of the individual estimates (perhaps a weighted average)

Approach 1: Model for Sample with RA of Students within Schools

Approach 1: Model for Sample with RA of Students within Schools Random Intercepts for Schools

Approach 1: Model for Sample with RA of Students within Schools Random Intercepts for Schools Random treatment effects

Approach 1: Get the Relevant Estimates from Models 1 and 2

Approach 1: Get the Relevant Estimates from Models 1 and 2

Approach 1: Create Weights Create weights that are proportional to the inverse of the variance of the estimates

Approach 1: Calculate the Combined Treatment Effect Estimate

Approach 1: Calculate the Combined Treatment Effect Estimate I call this the “meta-analytic” approach

Approach 1: Calculate the Combined Standard Error

Approach 2: Analysis of Both Samples in a Single Model I will show: Model 3: A single model that produces separate estimates for the two samples This is to facilitate comparisons of the single model results to analyzing the data from the two sample separately Model 4a: A combined model that produces a single combined fixed treatment estimate Model 4b: A combined model that has a random treatment effect for the sample where students were randomized within schools

Fit a Single Model to Both Samples Get Two Separate Treatment Estimates Combined model for full sample with separate treatment estimates for subsets with RA of schools and RA of Students

Fit a Single Model to Both Samples Get Two Separate Treatment Estimates Random intercept terms for part of sample where schools were RA’d Combined model for full sample with separate treatment estimates for subsets with RA of schools and RA of Students

Fit a Single Model to Both Samples Get Two Separate Treatment Estimates Random intercept terms for part of sample where schools were RA’d Combined model for full sample with separate treatment estimates for subsets with RA of schools and RA of Students Fixed intercept terms for part of sample where students were RA’d

Fit a Single Model to Both Samples Get Two Separate Treatment Estimates Random intercept terms for part of sample where schools were RA’d Combined model for full sample with separate treatment estimates for subsets with RA of schools and RA of Students Fixed intercept terms for part of sample where students were RA’d Treatment effect for part of sample where students were RA’d

Fit a Single Model to Both Samples Get Two Separate Treatment Estimates Random intercept terms for part of sample where schools were RA’d Combined model for full sample with separate treatment estimates for subsets with RA of schools and RA of Students Fixed intercept terms for part of sample where students were RA’d Treatment effect for part of sample where students were RA’d =1 of Schools RA’d =0 of Students RA’d

Fit a Single Model to Both Samples Get Two Separate Treatment Estimates Random intercept terms for part of sample where schools were RA’d Combined model for full sample with separate treatment estimates for subsets with RA of schools and RA of Students Fixed intercept terms for part of sample where students were RA’d Treatment effect for part of sample where students were RA’d Difference between treatment effect for part of sample where schools where RA’d and part of sample were students were RA’d =1 of Schools RA’d =0 of Students RA’d

Fit a Single Model to Both Samples Get Two Separate Treatment Estimates Random intercept terms for part of sample where schools were RA’d Combined model for full sample with separate treatment estimates for subsets with RA of schools and RA of Students I’ll show you that this model produces the same estimates and SE’s as Models 1 and 2a Fixed intercept terms for part of sample where students were RA’d Treatment effect for part of sample where students were RA’d Difference between treatment effect for part of sample where schools where RA’d and part of sample were students were RA’d =1 of Schools RA’d =0 of Students RA’d

Approach 2: Obtain a Combined Estimate from a Single Model

Approach 2: Obtain a Combined Estimate from a Single Model Fixed treatment effect

Approach 2: Obtain a Combined Estimate from a Single Model Fixed treatment effect Fixed dummies (intercepts) for schools where students were RA’d

Approach 2: Obtain a Combined Estimate from a Single Model Fixed treatment effect Random intercepts for schools where schools were RA’d Fixed dummies (intercepts) for schools where students were RA’d

Approach 2: Using Software to Fit Model 4a e.g., in SAS:

Approach 2: Using Software to Fit Model 4a e.g., in SAS: Two Set of school IDs

Approach 2: Using Software to Fit Model 4a e.g., in SAS: Two Set of school IDs “class” statement creates dummy variables from the IDs

Approach 2: Using Software to Fit Model 4a e.g., in SAS: Two Set of school IDs “class” statement creates dummy variables from the IDs Fixed dummies (intercepts) for schools where students were RA’d

Approach 2: Using Software to Fit Model 4a e.g., in SAS: Two Set of school IDs “class” statement creates dummy variables from the IDs Fixed dummies (intercepts) for schools where students were RA’d Random intercepts for schools where schools were RA’d

Approach 2: How to Code Data to Get the Two Sets of Intercepts

Approach 2: How to Code Data to Get the Two Sets of Intercepts Here are the original school IDs

Approach 2: How to Code Data to Get the Two Sets of Intercepts Create two new sets of school IDs

Approach 2: How to Code Data to Get the Two Sets of Intercepts Create two new sets of school IDs

Approach 2: How to Code Data to Get the Two Sets of Intercepts Create two new sets of school IDs

Approach 2: How to Code Data to Get the Two Sets of Intercepts For the part of the sample were schools were RA’d to T & C

Approach 2: How to Code Data to Get the Two Sets of Intercepts For the part of the sample were schools were RA’d to T & C Assign the original IDs to “Schid1”

Approach 2: How to Code Data to Get the Two Sets of Intercepts For the part of the sample were schools were RA’d to T & C Assign the original IDs to “Schid1” And assign zeros to “Schid2”

Approach 2: How to Code Data to Get the Two Sets of Random Intercepts For the part of the sample were students were RA’d to T & C within schools

Approach 2: How to Code Data to Get the Two Sets of Random Intercepts For the part of the sample were students were RA’d to T & C within schools Assign the original IDs to “Schid2”

Approach 2: How to Code Data to Get the Two Sets of Intercepts For the part of the sample were students were RA’d to T & C within schools And assign zeros to “Schid1” Assign the original IDs to “Schid2”

Approach 2: Obtain a Combined Estimate from a Single Model Model 4b

Approach 2: Obtain a Combined Estimate from a Single Model Model 4b Has random treatment effects for the part of the sample where students were RA’d within schools

Approach 2: Obtain a Combined Estimate from a Single Model Model 4b Has random treatment effects for the part of the sample where students were RA’d within schools And two sets of random intercepts

Approach 2: Obtain a Combined Estimate from a Single Model Model 4b Has random treatment effects for the part of the sample where students were RA’d within schools And two sets of random intercepts = 1 if schools RA’d = 0 if students RA’d

Approach 2: Obtain a Combined Estimate from a Single Model Model 4b Has random treatment effects for the part of the sample where students were RA’d within schools And two sets of random intercepts = 1 if schools RA’d = 0 if students RA’d Average treatment effect

How do these Approaches Perform? Used simulated data to assess the extent to which each approach produces treatment effect estimates and standard errors that are unbiased. Three sets of simulations (10,000 replications each): Set 1: Fixed set of schools and students. In each replication, schools and students assigned to different conditions (T or C) Set 2: Fixed set of schools, but different students within each school in each replication. Distribution of student’s underlying abilities within each school stays the same across replications Set 3: Infinite population of schools. In each replication, different schools and students selected, randomly assigned.

Simulated Data In each simulation: True average treatment effect = 0.20 1/5 of schools or students true effect = 0.10 1/5 of schools or students true effect = 0.15 1/5 of schools or students true effect = 0.20 1/5 of schools or students true effect = 0.25 1/5 of schools or students true effect = 0.30 ICC = 0.20

Mean of impact estimates over the 10,000 simulations. True impact is 0.20

Mean of standard error estimates over the 10,000 simulations.

True SE = square root of variance of impact estimates over the 10,000 simulations

Difference between mean of SE estimates and True SEs

Difference between mean of SE estimates and True SEs as a percent of the True SE

Same

Same

Same

The models that produce a single estimate from the two samples

The models that produce a single estimate from the two samples are more precise…

The models that produce a single estimate from the two samples are more precise than the models that produce separate impact estimates in each sample

But the extent to which there is bias in the SEs is about the same

Take Away Points I presented two easy-to-implement approaches to analyzing data from a mixed assignment design I showed how I assessed the extent to which they produced unbiased impact estimates and standard errors I concluded that both approaches: Work equally well to each other Work about as well as standard approaches to analyzing the two samples separately Produce more precise estimates than those obtained in separate analyses

Which to Use Either seems good Meta-analysis approach would estimate different relationships between covariates and outcomes for the two samples Single model combined approach would have common covariate effects across the two samples (unless the covariates were interacted with the “SchoolAssign” dummy variable)

Thank you!