Using School Choice Lotteries to Test Measures of School Effectiveness David Deming Harvard University and NBER.

Slides:



Advertisements
Similar presentations
Teacher Training, Teacher Quality and Student Achievement Douglas Harris Tim R. Sass Dept. of Educational Dept. of Economics Policy Studies Florida State.
Advertisements

What Does Research Tell Us About Identifying Effective Teachers? Jonah Rockoff Columbia Business School Nonprofit Leadership Forum, May 2010.
Are Teacher-Level Value- Added Estimates Biased? An Experimental Validation of Non-Experimental Estimates Thomas J. KaneDouglas O. Staiger HGSEDartmouth.
Teacher Productivity & Models of Employer Learning Economic Models in Education Research Workshop University of Chicago April 7, 2011 Douglas O. Staiger.
Estimating the Effects of Treatment on Outcomes with Confidence Sebastian Galiani Washington University in St. Louis.
Teacher Quality, Teacher Evaluation, and “Value-Added”
Doing Experiments An introduction 1. Empirical social science, including economics, is largely nonexperimental, using data from situations occurring in.
Chapter 13 Additional Topics in Regression Analysis
Session 4: Sampling and Data Collection. Objectives for Session 4: Finish in-class data activity – delivery of services in ten food pantries Sampling.
CHAPTER 22 Reliability of Ordination Results From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach,
Justine Hastings, Yale University and NBER Jeffrey Weinstein, Yale University Information, School Choice, and Academic Achievement: Evidence from Two Experiments.
Using Growth Models for Accountability Pete Goldschmidt, Ph.D. Assistant Professor California State University Northridge Senior Researcher National Center.
Magnet Schools and Peers: Effects on Student Achievement Dale Ballou Vanderbilt University November, 2007 Thanks to Steve Rivkin, Julie Berry Cullen, Adam.
How Do We Identify Causes? The criteria of causation © Pine Forge Press, an imprint of Sage Publications, 2006.
TOOLS OF POSITIVE ANALYSIS
Production Functions and Measuring the Effect of Teachers on Student Achievement With Value-Added HSE March 20, 2012.
-- Preliminary, Do Not Quote Without Permission -- VALUE-ADDED MODELS AND THE MEASUREMENT OF TEACHER QUALITY Douglas HarrisTim R. Sass Dept. of Ed. LeadershipDept.
Model Assessment and Selection Florian Markowetz & Rainer Spang Courses in Practical DNA Microarray Analysis.
Chapter 8 Introduction to Hypothesis Testing
Hypothesis Testing in Linear Regression Analysis
1 Comments on: “New Research on Training, Growing and Evaluating Teachers” 6 th Annual CALDER Conference February 21, 2013.
CHAPTER 21: Comparing Two Proportions
Human Capital Policies in Education: Further Research on Teachers and Principals 5 rd Annual CALDER Conference January 27 th, 2012.
Validity in an Era of Accountability Daniel Koretz CRESST/ Harvard Graduate School of Education.
Sensitivity of Teacher Value-Added Estimates to Student and Peer Control Variables October 2013 Matthew Johnson Stephen Lipscomb Brian Gill.
Association for Behavior Analysis International 36 th Annual Convention Teacher Induction: Where the Rubber Meets the Road Randy Keyworth.
The Evaluation of Charter School Impacts June 30, 2010 Presentation at the 2010 IES Research Conference Philip Gleason ● Melissa Clark Christina Clark.
PolMeth2009: Freedman Panel Regression Adjustments to Experimental Data: Do David Freedman’s Concerns Apply to Political Science? Donald P. Green Yale.
Evaluating Job Training Programs: What have we learned? Haeil Jung and Maureen Pirog School of Public and Environmental Affairs Indiana University Bloomington.
The Inter-temporal Stability of Teacher Effect Estimates J. R. Lockwood Daniel F. McCaffrey Tim R. Sass The RAND Corporation The RAND Corporation Florida.
Key issues from yesterday and for panel Lorraine Dearden.
Measuring Student Growth and its Role in the Evaluation Process.
Chap 14-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 14 Additional Topics in Regression Analysis Statistics for Business.
Validity and Reliability of Teacher Performance Measures Presentation to the Forum on Measuring Teacher Effectiveness Woodrow Wilson School of Pubic Affairs,
Sensitivity of Teacher Value-Added Estimates to Student and Peer Control Variables March 2012 Presentation to the Association of Education Finance and.
Selection Bias
Issues in Assessment Design, Vertical Alignment, and Data Management : Working with Growth Models Pete Goldschmidt UCLA Graduate School of Education &
Final Reports from the Measures of Effective Teaching Project Tom Kane Harvard University Steve Cantrell, Bill & Melinda Gates Foundation.
Assumptions of value-added models for estimating school effects sean f reardon stephen w raudenbush april, 2008.
Modeling Student Growth Using Multilevel Mixture Item Response Theory Hong Jiao Robert Lissitz University of Maryland Presentation at the 2012 MARCES Conference.
The Policy Choices of Effective Principals David Figlio, Northwestern U/NBER Tim Sass, Florida State U July 2010.
Julian Betts, Department of Economics, UCSD and NBER.
Public Policy Analysis ECON 3386 Anant Nyshadham.
Research on teacher pay-for-performance Patrick McEwan Wellesley College (Also see Victor Lavy, “Using performance-based pay to improve.
A discussion of Comparing register and survey wealth data ( F. Johansson and A. Klevmarken) & The Impact of Methodological Decisions around Imputation.
Evaluating Results of Learning Blaž Zupan
Public Finance Seminar Spring 2015, Professor Yinger Public Production Functions.
Impediments to the estimation of teacher value added Steven Rivkin Jun Ishii April 2008.
7.4 DV’s and Groups Often it is desirous to know if two different groups follow the same or different regression functions -One way to test this is to.
CREATE – National Evaluation Institute Annual Conference – October 8-10, 2009 The Brown Hotel, Louisville, Kentucky Research and Evaluation that inform.
5. Consistency We cannot always achieve unbiasedness of estimators. -For example, σhat is not an unbiased estimator of σ -It is only consistent -Where.
Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Human Capital Policies in Education: Further Research on Teachers and Principals 5 rd Annual CALDER Conference January 27 th, 2012.
Teacher effectiveness. Kane, Rockoff and Staiger (2007)
Characteristics of Studies that might Meet the What Works Clearinghouse Standards: Tips on What to Look For 1.
Are There Fast Tracks in Economics Departments? Valerie Smeets (UC3M & CCP, ASB) Madrid, October 2007.
“CEO Louis Gerstner added $40 billion to IBM’s stock market value.” “By himself?” Linking Educational Leadership to Student Achievement 2008 IES Conference.
Teacher Quality/Effectiveness: Defining, Developing, and Assessing Policies and Practices Part III: Setting Policies around Teacher Quality/Effectiveness.
Using Prior Scores to Evaluate Bias in Value-Added Models Raj Chetty, Stanford University and NBER John N. Friedman, Brown University and NBER Jonah Rockoff,
 Mark D. Reckase.  Student achievement is a result of the interaction of the student and the educational environment including each teacher.  Teachers.
Analysis for Designs with Assignment of Both Clusters and Individuals
Lurking inferential monsters
Regression in Practice: Observational studies with controls for pretests work better than you think Shadish, W. R., Clark, M. H., & Steiner, P. M. (2008).
School Quality and the Black-White Achievement Gap
Dan Goldhaber1,2, Vanessa Quince2, and Roddy Theobald1
Why use marginal model when I can use a multi-level model?
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Evaluating Impacts: An Overview of Quantitative Methods
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Regression in Practice: Observational studies with controls for pretests work better than you think Shadish, W. R., Clark, M. H., & Steiner, P. M. (2008).
Presentation transcript:

Using School Choice Lotteries to Test Measures of School Effectiveness David Deming Harvard University and NBER

Measuring School Effectiveness School rankings, ratings, league tables Gain score or “value-added” modeling approach (VAMs) – School VAMs now in ~30 U.S. states (Blank 2010) – Teacher VAMs used in evaluation, retention Accuracy of VAMs is important for incentive design and student welfare (Baker 2002, Rothstein 2010)

VAM Research Large literature on measurement / technical issues First order issue mostly untested - is assignment of teachers to classes within schools as good as random? Kane and Staiger (2008), Kane et al (2013) randomly assign classes to teachers, test validity of VAMs – Chetty, Friedman and Rockoff (2013) use quasi-experimental design with teacher mobility School VAMs require conditionally exogenous sorting of students across schools – Would you consent to this experiment?

Using School Choice Lotteries Oversubscribed public schools in Charlotte- Mecklenburg – Random assignment, within a self-selected group of applicants Estimate VAMs using data from prior cohorts – Vary model specification, sample, outcome – Out-of-sample predictions of “school effects” Use VAM estimates to predict the treatment effect of winning the lottery

Data and VAMs

Lottery Data and Sample 2,599 students in 118 separate lotteries for “marginal” priority groups Top 3 choices but nearly all randomization was over 1 st choices

Empirical Strategy

4 Possible Explanations for Bias 1.Sorting on unobserved determinants of student achievement (Rothstein 2010) 2.Estimation error 3.“True” school effects may vary over time independent of estimation error 4.Lottery sample is self-selected, so treatment effect is different for them

No correlation between average test scores (in levels) and lottery impacts

1.Huge improvement from adding lagged scores (gains model) 2.Need 2+ years of data to fail to reject unbiasedness (triple negative!)

“Shrinkage”

1.Shrinkage improves prediction when only one year of prior data is used – with longer panel unshrunken more accurate 2.“Drift” adjustment is best here 3.Shrinkage overcompensates if there is true variation in school effects – CFR call this “teacher bias”

Concluding thoughts Despite sorting concerns, school VAMs are surprisingly accurate – Best fit was gains only on full panel, but that’s not a theorem – Large standard errors admit non-trivial bias – need to replicate in other settings Good news for policies that use VAMs, but: 1.Biases might be offsetting 2.Other outcomes are important too! 3.“School effects” may include contextual factors that are beyond school’s control (peers, neighborhoods)

Thanks!