An Empirical Test of the Regression Discontinuity Design

Slides:



Advertisements
Similar presentations
EMR 6550: Experimental and Quasi- Experimental Designs Dr. Chris L. S. Coryn Kristin A. Hobson Fall 2013.
Advertisements

Regression Discontinuity. Basic Idea Sometimes whether something happens to you or not depends on your ‘score’ on a particular variable e.g –You get a.
Thomas D. Cook Northwestern University
Sources of bias in experiments and quasi-experiments sean f. reardon stanford university 11 december, 2006.
Analyzing Regression-Discontinuity Designs with Multiple Assignment Variables: A Comparative Study of Four Estimation Methods Vivian C. Wong Northwestern.
Regression-Discontinuity Design
Regression Discontinuity Design William Shadish University of California, Merced.
Regression Discontinuity Design Thanks to Sandi Cleveland and Marc Shure (class of 2011) for some of these slides.
Introduction of Regression Discontinuity Design (RDD)
Threats to Conclusion Validity. Low statistical power Low statistical power Violated assumptions of statistical tests Violated assumptions of statistical.
Reliability and Validity in Experimental Research ♣
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Empirical methods take real-world data estimate size of relationship between variables two types  regression analysis  natural experiments take real-world.
Regression Discontinuity Design 1. Motivating example Many districts have summer school to help kids improve outcomes between grades –Enrichment, or –Assist.
Regression Discontinuity Design 1. 2 Z Pr(X i =1 | z) 0 1 Z0Z0 Fuzzy Design Sharp Design.
Basic Mathematics for Portfolio Management. Statistics Variables x, y, z Constants a, b Observations {x n, y n |n=1,…N} Mean.
Today Concepts underlying inferential statistics
Statistical Analysis of the Regression-Discontinuity Design.
1 Exploring Quasi-experiments Lab Assignment #5 Due May 16, 2008 Locate an empirical journal article that employs a quasi-experimental design. Closely.
Decision analysis and Risk Management course in Kuopio
Inferential statistics Hypothesis testing. Questions statistics can help us answer Is the mean score (or variance) for a given population different from.
Chapter 11 Experimental Designs
Session III Regression discontinuity (RD) Christel Vermeersch LCSHD November 2006.
A Randomized Experiment Comparing Random to Nonrandom Assignment William R Shadish University of California, Merced and M.H. Clark Southern Illinois University,
MSRP Year 1 (Preliminary) Impact Research for Better Schools RMC Corporation.
Using A Regression Discontinuity Design (RDD) to Measure Educational Effectiveness: Howard S. Bloom
Using Regression Discontinuity Analysis to Measure the Impacts of Reading First Howard S. Bloom
WWC Standards for Regression Discontinuity Study Designs June 2010 Presentation to the IES Research Conference John Deke ● Jill Constantine.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Lecture 7: Parametric Models for Covariance Structure (Examples)
What is randomization and how does it solve the causality problem? 2.3.
Experimental Research Methods in Language Learning Chapter 5 Validity in Experimental Research.
Analysis and Interpretation: Analysis of Variance (ANOVA)
Generalized Additive Models: An Introduction and Example
Analysis of Covariance KNNL – Chapter 22. Analysis of Covariance Goal: To Compare treatments (1-Factor or Multiple Factors) after Controlling for Numeric.
Rerandomization to Improve Covariate Balance in Randomized Experiments Kari Lock Harvard Statistics Advisor: Don Rubin 4/28/11.
Regression Discontinuity Design Case Study : National Evaluation of Early Reading First Peter Z. Schochet Decision Information Resources, Inc.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
The influence of forgetting rate on complex span and academic performance Debbora Hall, Chris Jarrold, John Towse and Amy Zarandi.
Issues in Selecting Covariates for Propensity Score Adjustment William R Shadish University of California, Merced.
Inferential Statistics Psych 231: Research Methods in Psychology.
Impact Evaluation Methods Regression Discontinuity Design and Difference in Differences Slides by Paul J. Gertler & Sebastian Martinez.
CHAPTER 13: THE NUTS AND BOLTS OF QUASI- EXPERIMENTS.
8 Experimental Research Design.
Detecting and understanding interviewer effects on survey data using a cross-classified mixed-effects location scale model Ian Brunton-Smith, University.
Dependent-Samples t-Test
Analysis for Designs with Assignment of Both Clusters and Individuals
1University of Oklahoma 2Shaker Consulting
Lurking inferential monsters
Lezione di approfondimento su RDD (in inglese)
Regression in Practice: Observational studies with controls for pretests work better than you think Shadish, W. R., Clark, M. H., & Steiner, P. M. (2008).
Multiple Imputation using SOLAS for Missing Data Analysis
William R. Shadish University of California, Merced
Inference for Regression
Evidence of a Program's Effectiveness in Improving Colorectal Cancer Screening Rates in Federally Qualified Health Centers Robert L. Stephens, PhD, MPH1;
Statistical Models for the Analysis of Single-Case Intervention Data
Friendship Quality as a Moderator
Introduction to the General Linear Model (GLM)
12 Inferential Analysis.
Null Hypothesis Testing
Introduction to Design
Mingyu Feng Neil Heffernan Joseph Beck
Statistical Models for the Analysis of Single-Case Intervention Data
کارگاه حجم نمونه با نرم افزار G*Power
12 Inferential Analysis.
Explanation of slide: Logos, to show while the audience arrive.
Analysis of Covariance
Regression in Practice: Observational studies with controls for pretests work better than you think Shadish, W. R., Clark, M. H., & Steiner, P. M. (2008).
Statistical Models for the Analysis of Single-Case Intervention Data
Nonlinear associations between anxiety-depression and SA of the genetic cluster 6R. Nonlinear associations between anxiety-depression and SA of the genetic.
Presentation transcript:

An Empirical Test of the Regression Discontinuity Design William R. Shadish and Rodolfo Galindo University of California, Merced and Vivian Wong, Peter Steiner, and Thomas Cook Northwestern University

Regression Discontinuity Design Units are assigned to conditions based on a cutoff score on a measured covariate, The effect is measured as the discontinuity between treatment and control regression lines at the cutoff (it is not the group mean difference).

Does It Work? When properly implemented and analyzed, RD yields an unbiased estimate of treatment effect (see Rubin, 1977). But proper analysis is not easy; it requires correct modeling of functional form. If so, RDD should yield the same effect estimate as a randomized experiment. Three prior studies (Aiken et al.; Buddlemeyer & Skoufias; Black, Galdo & Smith) found reasonable agreement between RE and RDD but Two did not test the RE/RDD difference for statistical significance The one that did found a substantial minority of significant differences depending on how the analysis was done. To explore this further, we tested this by randomly assigning participants to be in a randomized experiment or a regression discontinuity design:

N = 588/5691 Undergraduate Psychology Students Pretest and then Random Assignment to: Randomized Experiment2 N = 197/189 Regression Discontinuity Design2 N = 391/380 Mathematics Training N = 110/107 Vocabulary Training N = 87/82 Mathematics Training N = 259/257 Vocabulary Training N = 132/123 1 N = a/b is number assigned (a) and number after attrition (b). Total attrition = 3.2%. 2 Allows two hypothesis tests: (1) effects of vocabulary training on vocabulary outcome, (2) effects of mathematics training on mathematics outcome

Analysis Team 1 (Shadish/Galindo) did initial analysis of the RE. Team 2 (Wong/Steiner/Cook) analyzed the RDD. Must correctly model functional form of relationship between AV and outcome, so used: Parametric (with nonlinear and interaction terms) Nonparametric (local polynomial regression) Semiparametric (Generalized Additive Model) Masked to each others results initially. All analyses used the same set of pretest covariates (not included in equations that follow)

Mathematics Results: No Difference RE-ATE Effect = 2.53 (.60) RDD Effect = 1.94 (se = .59): z = 0.70, p = .48

Vocabulary Results: No Difference RDD Effect = 5.91 (se = .77): Quadratic Model RE-ATE Effect = 4.46 (.49) z = 1.59, p = .11

Problem These analyses make sense from a policy perspective—RE’s are nearly always analyzed for the average treatment effect (ATE). From a scientific point of view, however, ATE is a not what RDD is estimating. RDD estimates LATE at cutoff, taking nonlinearities and interactions into account. So, does RDD = RE when estimating LATE for RE?

Constrained Analyses Since participants were randomly assigned to RDD and RE, they should have the same functional form within sampling error. So we compared RDD-LATE to RE-LATE constraining both to have the same functional form. Linear Quadratic Cubic Across six different models (three functional forms each for vocabulary and mathematics), RE and RDD gave effects that were not significantly different in every case.

Discussion RE = RDD across all of these tests Consistent with previous studies (Aiken et al.; Buddlemeyer & Skoufias; Black, Galdo & Smith). We continue to explore this data set (e.g., we have a measure of treatment receipt we could use to improve the analysis).