Using A Regression Discontinuity Design (RDD) to Measure Educational Effectiveness: Howard S. Bloom

Slides:



Advertisements
Similar presentations
REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Advertisements

Multiple Regression and Model Building
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
Statistical Analysis Overview I Session 2 Peg Burchinal Frank Porter Graham Child Development Institute, University of North Carolina-Chapel Hill.
Presented by Malte Lierl (Yale University).  How do we measure program impact when random assignment is not possible ?  e.g. universal take-up  non-excludable.
Analyzing Regression-Discontinuity Designs with Multiple Assignment Variables: A Comparative Study of Four Estimation Methods Vivian C. Wong Northwestern.
Heterogeneous impact of the social program Oportunidades on contraceptive methods use in young adult women living in rural areas: limitations of the regression.
Regression Discontinuity Design Thanks to Sandi Cleveland and Marc Shure (class of 2011) for some of these slides.
Introduction of Regression Discontinuity Design (RDD)
Regression Discontinuity Design Saralyn Miller Southern Methodist University ____________________ Paper presented at the annual meeting of the Southwest.
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.
Sample Selection Issues in Experiment Random sampling (difficult) Convenience & purposive sampling Volunteers External validity Representativeness & generalizability.
Sessions 22 & 23: (Second Best) Alternatives to Randomized Designs Mark W. Lipsey Vanderbilt University IES/NCER Summer Research Training Institute, 2008.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Simulation Modeling and Analysis Session 12 Comparing Alternative System Designs.
Clustered or Multilevel Data
Using Between-Subjects and Within-Subjects Experimental Designs
Treatment Effects: What works for Whom? Spyros Konstantopoulos Michigan State University.
L1 Chapter 11 Experimental and Quasi- experimental Designs Dr. Bill Bauer.
Nasih Jaber Ali Scientific and disciplined inquiry is an orderly process, involving: problem Recognition and identification of a topic to.
Applied Business Forecasting and Planning
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Objectives of Multiple Regression
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
Chapter 12 Multiple Regression and Model Building.
Chapter 11 Experimental Designs
Descriptive Statistics: Variability Lesson 5. Theories & Statistical Models n Theories l Describe, explain, & predict real- world events/objects n Models.
1 B IVARIATE AND MULTIPLE REGRESSION Estratto dal Cap. 8 di: “Statistics for Marketing and Consumer Research”, M. Mazzocchi, ed. SAGE, LEZIONI IN.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Correlational Research Chapter Fifteen Bring Schraw et al.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Session III Regression discontinuity (RD) Christel Vermeersch LCSHD November 2006.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
RCTs and instrumental variables Anna Vignoles University of Cambridge.
MSRP Year 1 (Preliminary) Impact Research for Better Schools RMC Corporation.
Using Regression Discontinuity Analysis to Measure the Impacts of Reading First Howard S. Bloom
1 G Lect 14M Review of topics covered in course Mediation/Moderation Statistical power for interactions What topics were not covered? G Multiple.
Analysis of Variance.
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
McGraw-Hill/Irwin Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. Using Between-Subjects and Within- Subjects Experimental Designs.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
The Simple Linear Regression Model: Specification and Estimation ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
AADAPT Workshop Latin America Brasilia, November 16-20, 2009 Laura Chioda.
Chapter Eight: Quantitative Methods
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Randomized Assignment Difference-in-Differences
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
Common Pitfalls in Randomized Evaluations Jenny C. Aker Tufts University.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Experimental Evaluations Methods of Economic Investigation Lecture 4.
1/25 Introduction to Econometrics. 2/25 Econometrics Econometrics – „economic measurement“ „May be defined as the quantitative analysis of actual economic.
WELCOME TO BIOSTATISTICS! WELCOME TO BIOSTATISTICS! Course content.
Module 5 Inference with Panel Data
General belief that roads are good for development & living standards
Threats and Analysis.
Multiple Regression and Model Building
Simple Linear Regression - Introduction
Matching Methods & Propensity Scores
Matching Methods & Propensity Scores
Overview of Major Kinds of Educational Research
1 Causal Inference Counterfactuals False Counterfactuals
Matching Methods & Propensity Scores
Lecture 1 Cameron Kaplan
Explanation of slide: Logos, to show while the audience arrive.
Explanation of slide: Logos, to show while the audience arrive.
Presentation transcript:

Using A Regression Discontinuity Design (RDD) to Measure Educational Effectiveness: Howard S. Bloom

This Talk Will: introduce the history and logic of RDD, introduce the history and logic of RDD, consider conditions for its internal validity, consider conditions for its internal validity, considers its sample size requirements, considers its sample size requirements, consider its dependence on functional form, consider its dependence on functional form, illustrate some specification tests for it, illustrate some specification tests for it, consider limits to its external validity, consider limits to its external validity, consider how to deal with noncompliance, consider how to deal with noncompliance, describe an application. describe an application.

RDD History In the beginning there was Thislethwaite and Campbell (1960) In the beginning there was Thislethwaite and Campbell (1960) This was followed by a flurry of applications to Title I (Trochim, 1984) This was followed by a flurry of applications to Title I (Trochim, 1984) Only a few economists were involved initially (Goldberger, 1972) Only a few economists were involved initially (Goldberger, 1972) Then RDD went into hibernation Then RDD went into hibernation It recently experienced a renaissance among economists (e.g. Hahn, Todd and van der Klaauw, 2001; Jacob and Lefgren, 2002) It recently experienced a renaissance among economists (e.g. Hahn, Todd and van der Klaauw, 2001; Jacob and Lefgren, 2002) Tom Cook has written about this story Tom Cook has written about this story

RDD Logic Selection on an observable (a rating) Selection on an observable (a rating) A tie-breaking experiment A tie-breaking experiment Modeling close to the cut-point Modeling close to the cut-point Modeling the full distribution of ratings Modeling the full distribution of ratings

RDD As A Linear Regression

Conditions for Internal Validity The outcome-by-rating regression is a continuous function (absent treatment). The outcome-by-rating regression is a continuous function (absent treatment). The cut-point is determined independently of knowledge about ratings. The cut-point is determined independently of knowledge about ratings. Ratings are determined independently of knowledge about the cut-point. Ratings are determined independently of knowledge about the cut-point. The functional form of the outcome-by- rating regression is specified properly. The functional form of the outcome-by- rating regression is specified properly.

RDD Statistical Model where: Y i = outcome for subject i, T i = one for subjects in the treatment group and zero otherwise, R i = rating for subject i, e i = random error term for subject i, which is independently and identically distributed

Variance of the Impact Estimator  2 = variance of mean outcomes across subjects in the treatment group or comparison group treatment group or comparison group R 1 2 = square of the correlation between outcomes and ratings within the treatment and comparison group ratings within the treatment and comparison group R 2 2 = square of the correlation between treatment status and the rating the rating P = P = proportion of subjects in the treatment group, N = total number of subjects

Sample Size Implications Because of the substantial multi-collinearity that exists between its rating variable and treatment indicator, an RDD requires 3 to 4 times as many sample members as a corresponding randomized experiment Because of the substantial multi-collinearity that exists between its rating variable and treatment indicator, an RDD requires 3 to 4 times as many sample members as a corresponding randomized experiment

Specification Tests Using the RDD to compare baseline characteristics of the treatment and comparison groups Using the RDD to compare baseline characteristics of the treatment and comparison groups Re-estimating impacts and sequentially deleting subjects with the highest and lowest ratings Re-estimating impacts and sequentially deleting subjects with the highest and lowest ratings Re-estimating impacts and adding: Re-estimating impacts and adding:  a treatment status/rating interaction  a quadratic rating term  interacting the quadratic with treatment status Using non-parametric estimation Using non-parametric estimation

External Validity Estimating impacts at the cut-point Estimating impacts at the cut-point Extrapolating impacts beyond the cut-point with a simple linear model Extrapolating impacts beyond the cut-point with a simple linear model Estimating varying impacts beyond the cut- point with more complex functional forms Estimating varying impacts beyond the cut- point with more complex functional forms

Dealing With Noncompliance Sharp and fuzzy RDDs Sharp and fuzzy RDDs No-shows and crossovers No-shows and crossovers The effect of intent to treat (ITT) The effect of intent to treat (ITT) The local average treatment effect (LATE) The local average treatment effect (LATE) The effect of treatment on the treated (TOT) The effect of treatment on the treated (TOT) Where r T and r C = the proportion of the treatment and control groups receiving treatment, respectively

Application of RDD To Reading First Reading First (RF) is a cornerstone of No Child Left Behind Reading First (RF) is a cornerstone of No Child Left Behind RF resources are allocated purposefully to schools that need it most and will benefit most RF resources are allocated purposefully to schools that need it most and will benefit most Some districts allocated RF resources based on quantitative indicators Some districts allocated RF resources based on quantitative indicators We chose a sample of 251 schools near the cut- points for 17 such districts and 1 state We chose a sample of 251 schools near the cut- points for 17 such districts and 1 state

References Cook, T. D. (in press) “Waiting for Life to Arrive: A History of the Regression-discontinuity Design in Psychology, Statistics and Economics” Journal of Econometrics. Goldberger, A. S. (1972) “Selection Bias in Evaluating Treatment Effects: Some Formal Illustrations” (Discussion Paper , Madison WI: University of Wisconsin, Institute for Research on Poverty, June). Hahn, H., P. Todd and W. van der Klaauw (2001) “Identification and Estimation of Treatment Effects with a Regression-Discontinuity Design” Econometrica, 69(3): 201 – 209. Jacob, B. and L. Lefgren (2004) “Remedial Education and Student Achievement: A Regression-Discontinuity Analysis” Review of Economics and Statistics, LXXXVI.1: Thistlethwaite, D. L. and D. T. Campbell (1960) “Regression Discontinuity Analysis: An Alternative to the Ex Post Facto Experiment” Journal of Educational Psychology, 51(6): 309 – 317. Trochim, W. M. K. (1984) Research Designs for Program Evaluation: The Regression-Discontinuity Approach (Newbury Park, CA: Sage Publications).