What you don’t see can’t matter? The effects of unobserved differences on causal attributions Robert Coe CEM, Durham University Randomised Controlled Trials.

Slides:



Advertisements
Similar presentations
Implementing an Online Homework Program in a Large Organic Chemistry Lecture Course Jaime Curtis-Fisk Kevin Walker Michigan State University.
Advertisements

The Application of Propensity Score Analysis to Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Lilly Yue, Ph.D.* CDRH, FDA,
Stephen Meek Director of Strategy, DfES 26 April 2006
BUS 220: ELEMENTARY STATISTICS
1 Eloise E. Kaizar The Ohio State University Combining Information From Randomized and Observational Data: A Simulation Study June 5, 2008 Joel Greenhouse.
Impact analysis and counterfactuals in practise: the case of Structural Funds support for enterprise Gerhard Untiedt GEFRA-Münster,Germany Conference:
Undergraduates in Minnesota: Who are they and how do they finance their education? Tricia Grimes Shefali Mehta Minnesota Office of Higher Education November.
Alaska Accountability Adequate Yearly Progress January 2008, Updated.
STNE Seminar Pupil Engagement and Pupil Gains School of Education, University of Aberdeen Thursday 18th September 10.30am – 4.00pm.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Leading the Geography Department GA Secondary Education Section Committee SESC 2005.
1 Correlation and Simple Regression. 2 Introduction Interested in the relationships between variables. What will happen to one variable if another is.
Is Free School Meal Status a Valid Proxy for Socio-Economic Status? Graham Hobbs and Anna Vignoles Centre for the Economics of Education (funded by Department.
The educational gender gap, catch up and labour market performance Martyn Andrews (University of Manchester), Steve Bradley, Dave Stott & Jim Taylor (Lancaster.
The impact of good student selection on retention. Mary Crawford Florence Nightingale School of Nursing and Midwifery.
Experimental and Quasiexperimental Designs Chapter 10 Copyright © 2009 Elsevier Canada, a division of Reed Elsevier Canada, Ltd.
1 The Social Survey ICBS Nurit Dobrin December 2010.
PLAN Interpretive Visuals 9/2009 Interpretive Visuals.
The Effective use of Digital Technologies for Learning in Schools Professor Steve Higgins Durham University
Gender and Educational Attainment in Schools Stephen Machin and Sandra McNally.
REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Correlation and Regression
Raise Your Childs Attendance, -Raise their Chances! Raise Your Childs Attendance, -Raise their Chances! What does “Good attendance” mean?
Knowledge for Knowledge Translation Jeremy Grimshaw MD, PhD Clinical Epidemiology Program, OHRI Department of Medicine, University of Ottawa Canada Research.
Module 16: One-sample t-tests and Confidence Intervals
Effect Size Robert ResearchED 2013, Dulwich College, 7 Sept 2013.
Faculty of Development & Society - Affirming Achievement 1 Multiverse Project Analysis of High Attaining Black Students: Factors and Conditions that Affect.
Evaluation of the Effectiveness of the Letterbox Club in Improving Educational Outcomes among Children Aged 7-11 Years in Foster Care in Northern Ireland.
Putting Statistics to Work
Detecting Spam Zombies by Monitoring Outgoing Messages Zhenhai Duan Department of Computer Science Florida State University.
USING EVIDENCE TO INFORM YOUR LEADERSHIP APPROACH AND SUPPORT SCHOOL IMPROVEMENT ROB CARPENTER 26 TH SEPTEMBER 2013
Multiple Regression and Model Building
Mental Health Matters Lauren Kazee, LMSW Mental Health Consultant Michigan Dept of Education Coordinated School Health and Safety Programs Michigan Dept.
Evidence-Based Education (EBE) Grover J. (Russ) Whitehurst Assistant Secretary Educational Research and Improvement United States Department of Education.
1 Review Lecture: Guide to the SSSII Assignment Gwilym Pryce 5 th March 2006.
#ieGovern Impact Evaluation Workshop Istanbul, Turkey January 27-30, 2015 Measuring Impact 1 Non-experimental methods 2 Experiments Vincenzo Di Maro Development.
The choice between fixed and random effects models: some considerations for educational research Claire Crawford with Paul Clarke, Fiona Steele & Anna.
Income and Child Development Lawrence Berger, University of Wisconsin Christina Paxson, Princeton University Jane Waldfogel, Columbia Univerity.
Robert Coe Neil Appleby Academic mentoring in schools: a small RCT to evaluate a large policy Randomised Controlled trials in the Social Sciences: Challenges.
The counterfactual logic for public policy evaluation Alberto Martini hard at first, natural later 1.
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
AS Achievement Standard.
Correlation AND EXPERIMENTAL DESIGN
Using evidence to raise the attainment of children facing disadvantage James Richardson Senior Analyst, Education Endowment Foundation 1 st April 2014.
Production Functions and Measuring the Effect of Teachers on Student Achievement With Value-Added HSE March 20, 2012.
Beyond test scores: the role of primary schools in improving multiple child outcomes Claire Crawford and Anna Vignoles Institute of Education, University.
-- Preliminary, Do Not Quote Without Permission -- VALUE-ADDED MODELS AND THE MEASUREMENT OF TEACHER QUALITY Douglas HarrisTim R. Sass Dept. of Ed. LeadershipDept.
Raising Academic Standards for all School Development Planning Initiative.
EVAL 6970: Cost Analysis for Evaluation Dr. Chris L. S. Coryn Nick Saxton Fall 2014.
Global Workshop on Development Impact Evaluation in Finance and Private Sector Rio de Janeiro, June 6-10, 2011 Mattea Stein Quasi Experimental Methods.
McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.
Quasi Experimental Methods I Nethra Palaniswamy Development Strategy and Governance International Food Policy Research Institute.
S-005 Intervention research: True experiments and quasi- experiments.
Level 7 Research Project Laura Bridge Robert Owen EBITT.
Chapter 8 Causal-Comparative Research Gay, Mills, and Airasian
Interpreting Feedback from Baseline Tests – Predictive Data Course: CEM Information Systems for Beginners and New Users Day 1 Session 3 Wednesday 17 th.
How do we know what works? Robert Coe ResearchEd, London, 5 Sept 2015.
Widening Participation in Higher Education: A Quantitative Analysis Institute of Education Institute for Fiscal Studies Centre for Economic Performance.
Mike Treadaway Director of Research Fischer Family Trust Using FFT Live Secondary Schools.
The Choice Between Fixed and Random Effects Models: Some Considerations For Educational Research Clarke, Crawford, Steele and Vignoles and funding from.
Non-experimental methods Markus Goldstein The World Bank DECRG & AFTPM.
Impact of two teacher training programmes on pupils’ development of literacy and numeracy ability: a randomised trial Jack Worth National Foundation for.
The Coseley School A Co-operative Trust Closing the Gap Strategies – 2015/16 Believe, Achieve, Excel Closing the Gap Strategies – 2015/16 Believe, Achieve,
Seven Sins of Regression Evaluation Research (8521) Prof. Jesse Lecy Lecture 7 1.
Public Finance and Public Policy Jonathan Gruber Third Edition Copyright © 2010 Worth Publishers 1 of 24 Copyright © 2010 Worth Publishers.
S-005 Intervention research: True experiments and quasi- experiments.
7 th June 2012 Is it better to fail than to succeed? A quantitative analysis of ‘just’ failing an English school inspection Rebecca Allen, Institute of.
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
Quasi Experimental Methods I
Impact Evaluation Methods
Presentation transcript:

What you don’t see can’t matter? The effects of unobserved differences on causal attributions Robert Coe CEM, Durham University Randomised Controlled Trials in the Social Sciences, York, September 2009

2 The problem … Evaluations of education policy initiatives often use regression methods to control for initial differences They often seem to make clear and confident causal claims about programme effects They seldom seem to consider other possible explanations (eg unobserved differences) If regression models incorporate lots of explanatory variables and have high R 2, does it matter if there might have been initial unobserved differences?

3 Are you a researcher or a policy maker? Would you rather have an answer to your question that is –Simple, clear and confident, but probably wrong –Complex, nuanced, and carefully justified, but completely useless for making a decision

4 Model simulated Measured variable (M) Unmeasured variable (U) Outcome variable (Y) r Treatment group (X) p q s d Y =  0 M + d (X –X) + e – Observed relationship Y =  0 M +  1 U + e True relationship

5 The simulation 1.Generate random variables Y, M, U with desired inter-correlations (p, q, r) 2.Assign group membership (X) to be correlated with U 3.Apply regression model (Y on M and X) to estimate spurious ‘effect’ of X on Y if U were omitted

6 Example evaluations Effects of study support on achievement –MacBeath et al (2001) ‘The Impact of Study Support: A report of a longitudinal study into the impact of participation in out-of- school-hours learning on the academic attainment, attitudes and school attendance of secondary school students’. Published by the Department for Education and Skills Effects of gifted and talented provision on achievement –Kendall, et al (2005) ‘Excellence in Cities: The National Evaluation of a Policy to Raise Standards in Urban Schools ’. Published by the Department for Education and Skills. Effects of Assisted Places on achievement –Power et al (2006) ‘The Educational and Career Trajectories of Assisted Place Holders’. Published by the Sutton Trust.

7 Summary of the examples StudyMacBeath et al. (2001) Kendall et al. (2005) Power et al. (2006) Intervention / programme Study support (Y11 Easter School) Gifted & Talented provision Assisted Places Scheme Outcome(s)GCSE English; GCSE maths KS3 average level; GCSE capped 8 score A-level points CovariatesKS3 SATs average, Gender, School type Prior attainment, FSM status, gender, ethnicity Parents’ SES and education R 2 in the model63%;70%66%;80%30% Estimate of the effect, from regression model 0.18; ;

8 Results of the simulation

9 Summary of simulations Possible relevant unmeasured variable(s) Socioeconomic status; Motivation; Self-discipline Attainment used to identify G&T status Performance on entrance test Range of possible spurious effects 0.0 – – – – 1.0 Range of likely spurious effects 0.05 – – – – 0.5 Best guess at spurious effect

10 Interpretations Interpretation given by the researchers ‘Study support can improve attainment in Maths and English by half a grade’ ‘Pupils designated as gifted and talented had higher levels of attainment at the end of Key Stages 3 and 4 than those of otherwise similar pupils not designated.’ ‘AP holders did better attending a private school than if they had gone to a state school’ Justified conclusion, taking account of bias due to omitted factors Possible small residual effect (0.10) on English but pretty much no genuine effect on maths Any genuine effect for both outcomes is very close to zero Possible positive effect (0.14), but much caution and uncertainty surrounds this

11 Conclusions Even high R 2 with the variables you have got does not necessarily mean you can ignore others you haven’t The size of artefactual spurious effects is quite sensitive to assumptions about parameter values, which are themselves contentious Which is all the more reason to consider unobserved variables The examples considered are otherwise of relatively high quality

12 Recommendations Need to replicate, simplify application, extend to other methods Be more cautious about making – and believing – causal claims based on statistical control Before interpreting as causal: –List possible alternative factors –Evaluate the case for their impact Use stronger designs