Explanation of slide: Logos, to show while the audience arrive.

Slides:



Advertisements
Similar presentations
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation Muna Meky Impact Evaluation Cluster, AFTRL Slides by Paul J.
Advertisements

Impact analysis and counterfactuals in practise: the case of Structural Funds support for enterprise Gerhard Untiedt GEFRA-Münster,Germany Conference:
Graduate Methods Master Class
Explanation of slide: Logos, to show while the audience arrive.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Designing an impact evaluation: Randomization, statistical power, and some more fun…
The World Bank Human Development Network Spanish Impact Evaluation Fund.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Explanation of slide: Logos, to show while the audience arrive.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Mywish K. Maredia Michigan State University
The World Bank Human Development Network Spanish Impact Evaluation Fund.
The counterfactual logic for public policy evaluation Alberto Martini hard at first, natural later 1.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
3.2 OLS Fitted Values and Residuals -after obtaining OLS estimates, we can then obtain fitted or predicted values for y: -given our actual and predicted.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Lecture 12 (Ch16) Simultaneous Equations Models (SEMs)
SREE workshop march 2010sean f reardon using instrumental variables in education research.
The Generalized IV Estimator IV estimation with a single endogenous regressor and a single instrument can be naturally generalized. Suppose that there.
Simultaneous Equations Models
1Prof. Dr. Rainer Stachuletz Simultaneous Equations y 1 =  1 y 2 +  1 z 1 + u 1 y 2 =  2 y 1 +  2 z 2 + u 2.
Chapter 4 Multiple Regression.
Chapter 9 Simultaneous Equations Models. What is in this Chapter? In Chapter 4 we mentioned that one of the assumptions in the basic regression model.
Impact Evaluation: The case of Bogotá’s concession schools Felipe Barrera-Osorio World Bank 1 October 2010.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Designing a Random Assignment Social Experiment In the U.K.; The Employment Retention and Advancement Demonstration (ERA)
Has Public Health Insurance for Older Children Reduced Disparities in Access to Care and Health Outcomes? Janet Currie, Sandra Decker, and Wanchuan Lin.
Instrumental Variables: Problems Methods of Economic Investigation Lecture 16.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Session III Regression discontinuity (RD) Christel Vermeersch LCSHD November 2006.
Managerial Economics Demand Estimation & Forecasting.
RCTs and instrumental variables Anna Vignoles University of Cambridge.
Public Policy Analysis ECON 3386 Anant Nyshadham.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
Nigeria Impact Evaluation Community of Practice Abuja, Nigeria, April 2, 2014 Measuring Program Impacts Through Randomization David Evans (World Bank)
Application 2: Minnesota Domestic Violence Experiment Methods of Economic Investigation Lecture 6.
Applying impact evaluation tools A hypothetical fertilizer project.
Instrumental Variables: Introduction Methods of Economic Investigation Lecture 14.
Christel M. J. Vermeersch November 2006 Session V Instrumental Variables.
Review Section on Instrumental Variables Economics 1018 Abby Williamson and Hongyi Li October 11, 2006.
Randomized Assignment Difference-in-Differences
10-1 MGMG 522 : Session #10 Simultaneous Equations (Ch. 14 & the Appendix 14.6)
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
Impact Evaluation Methods Randomization and Causal Inference Slides by Paul J. Gertler & Sebastian Martinez.
INSTRUMENTAL VARIABLES Eva Hromádková, Applied Econometrics JEM007, IES Lecture 5.
Experimental Evaluations Methods of Economic Investigation Lecture 4.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Instrumental Variable (IV) Regression
Gustavo Angeles MEASURE Evaluation
Impact Evaluation Methods
Explanation of slide: Logos, to show while the audience arrive.
Explanation of slide: Logos, to show while the audience arrive.
Instrumental Variables and Two Stage Least Squares
Impact evaluation: The quantitative methods with applications
Instrumental Variables and Two Stage Least Squares
Development Impact Evaluation in Finance and Private Sector
Impact Evaluation Methods
Instrumental Variables
Instrumental Variables and Two Stage Least Squares
Impact Evaluation Methods: Difference in difference & Matching
Evaluating Impacts: An Overview of Quantitative Methods
Explanation of slide: Logos, to show while the audience arrive.
Explanation of slide: Logos, to show while the audience arrive.
Module 3: Impact Evaluation for TTLs
Simultaneous Equations Models
Measuring the Wealth of Nations
Presentation transcript:

Explanation of slide: Logos, to show while the audience arrive.

INSTRUMENTAL VARIABLES Technical Track Session IV This material constitutes supporting material for the "Impact Evaluation in Practice" book. This additional material is made freely but please acknowledge its use as follows: Gertler, P. J.; Martinez, S., Premand, P., Rawlings, L. B. and Christel M. J. Vermeersch, 2010, Impact Evaluation in Practice: Ancillary Material, The World Bank, Washington DC (www.worldbank.org/ieinpractice). The content of this presentation reflects the views of the authors and not necessarily those of the World Bank.

Instrumental Variables and IE Day 3 - Technical Track Session IV: Instrumental Variables Instrumental Variables and IE Instrumental variables have many uses IV can be generated ex ante: Randomized promotion (or encouragement design) “Randomized offering” of a program IV can be used ex post to correct for non-compliance or conduct retrospective IE: Correction for non-compliance to recover TOT from ITT E.g. Randomized Assignment with non-compliers E.g. Fuzzy Regression Discontinuity Look for exogenous variation to evaluate the impact of a program in absence of a prospective design. Here: General Principles behind IVs Ex ante focus on randomized promotion IV, non-compliance and randomized offering

An example to start off with… Day 3 - Technical Track Session IV: Instrumental Variables An example to start off with… Say we wish to evaluate a voluntary job training program Any unemployed person is eligible (Universal eligibility) Some people choose to register (Participants) Other people choose not to register (Non-participants) Some simple (but not-so-good) ways to evaluate the program: Compare before and after situation in the participant group Compare situation of participants and non-participants after the intervention Compare situation of participants and non-participants before and after (DD).

Voluntary job training program Day 3 - Technical Track Session IV: Instrumental Variables Voluntary job training program Say we decide to compare outcomes for those who participate to the outcomes of those who do not participate: A simple model to do this: y = α + β1 P + β2 x + ε 1 If person participates in training P = 0 If person does not participate in training x = Control variables (exogenous & observed) Why is this not working? 2 problems: Variables that we omit (for various reasons) but that are important Decision to participate in training is endogenous.

Problem #1: Omitted Variables Day 3 - Technical Track Session IV: Instrumental Variables Problem #1: Omitted Variables Even if we try to control for “everything”, we’ll miss: Characteristics that we didn’t know they mattered, and Characteristics that are too complicated to measure (not observables or not observed): Talent, motivation Level of information and access to services Opportunity cost of participation Full model would be: y = γ0 + γ1 x + γ2 P + γ3 M1 + η But we cannot observe M1 , the “missing” and unobserved variables.

Omitted variable bias y = γ0 + γ1 x + γ2 P + γ3 M1 + η True model is: But we estimate: y = β0 + β1 x + β2 P + ε If there is a correlation between M1 and P, then the OLS estimator of β2 will not be a consistent estimator of γ2, the true impact of P. Why? When M1 is missing from the regression, the coefficient of P will “pick up” some of the effect of M1

Problem #2: Endogenous Decision to Participate Day 3 - Technical Track Session IV: Instrumental Variables Problem #2: Endogenous Decision to Participate y = γ0 + γ1 x + γ2 P + η True model is: with P = π0 + π 1 x + π 2 M2 +ξ M2 = Vector of unobserved / missing characteristics (i.e. we don’t fully know why people decide to participate) Since we don’t observe M2 , we can only estimate a simplified model: y = β0 + β 1 x + β2 P + ε Is β2, OLS an unbiased estimator of γ2?

Problem #2: Endogenous Decision to Participate Day 3 - Technical Track Session IV: Instrumental Variables Problem #2: Endogenous Decision to Participate We estimate: y = β0 + β1 x + β2 P + ε But true model is: y = γ0 + γ1 x + γ2 P + η with P = π0 + π1 x + π2 M2 +ξ Is β2, OLS an unbiased estimator of γ2? Corr (ε, P) = corr (ε, π0 + π 1 x + π 2 M2 +ξ) = π 1 corr (ε, x)+ π 2 corr (ε, M2) = π 2 corr (ε, M2) If there is a correlation between the missing variables that determine participation (e.g. Talent) and outcomes not explained by observed characteristics, then the OLS estimator will be biased.

What can we do to solve this problem? Day 3 - Technical Track Session IV: Instrumental Variables What can we do to solve this problem? We estimate: y = β0 + β1 x + β2 P + ε So the problem is the correlation between P and ε How about we replace P with “something else”, call it Z: Z needs to be similar to P But is not correlated with ε

Back to the job training program P = participation ε = that part of outcomes that is not explained by program participation or by observed characteristics I’m looking for a variable Z that is: Closely related to participation P but doesn’t directly affect people’s outcomes Y, other than through its effect on participation. So this variable must be coming from outside.

Generating an outside variable for the job training program Say that a social worker visits unemployed persons to encourage them to participate. She only visits 50% of persons on her roster, and She randomly chooses whom she will visit If she is effective, many people she visits will enroll. There will be a correlation between receiving a visit and enrolling But visit does not have direct effect on outcomes (e.g. income) apart from its effect through enrollment in the training program. Randomized “encouragement” or “promotion” visits are an Instrumental Variable.

Characteristics of an instrumental variable Define a new variable Z 1 If person was randomly chosen to receive the encouragement visit from the social worker Z = 0 If person was randomly chosen not to receive the encouragement visit from the social worker Corr ( Z , P ) > 0 People who receive the encouragement visit are more likely to participate than those who don’t Corr ( Z , ε ) = 0 No correlation between receiving a visit and benefit to the program apart from the effect of the visit on participation. Z is called an instrumental variable

Two-stage least squares (2SLS) Day 3 - Technical Track Session IV: Instrumental Variables Two-stage least squares (2SLS) Remember the original model with endogenous P: y = β0 + β1 x + β2 P + ε Step 1 Regress the endogenous variable P on the instrumental variable(s) Z and other exogenous variables P = δ0 + δ1 x + δ2 Z + τ Calculate the predicted value of P for each observation: P Since Z and x are not correlated with ε, neither will be P. You will need one instrumental variable for each potentially endogenous regressor.

Two-stage least squares (2SLS) Day 3 - Technical Track Session IV: Instrumental Variables Two-stage least squares (2SLS) Step 2 Regress y on the predicted variable P and the other exogenous variables y = β0 + β1 x + β2 P + ε Note: The standard errors of the second stage OLS need to be corrected because P is not a fixed regressor. In Practice: Use STATA ivreg command, which does the two steps at once and reports correct standard errors. Intuition: By using Z for P, we cleaned P of its correlation with η It can be shown that (under certain conditions) β2,IV yields a consistent estimator of γ2 (large sample theory)

Where do we find instrumental variables? Searching for an IV ex post … Hard and risky! Generating an IV with information campaign designed ex ante If everyone is eligible to participate in treatment But some have more information than others (Who has more information will be more likely to participate) Provision of “additional information” on a random basis

Example 1: voluntary job training program Population eligible for job training program Random Sample Randomized assignment Standard Information Package only Standard Information Package + Additional visit Monthly income 1 year later = 700 Monthly income 1 year later = 850 30% take-up 90% take-up Question: what is the impact of the job training program?

Question: what is the impact of the job training program? Standard Information Package only Standard + Additional Information Package Monthly income 1 year later = 700 Monthly income 1 year later = 850 30% take-up 90% take-up Question: what is the impact of the job training program? Difference between the “well informed” and “not well informed” group: ……………………………………………………………………………….. Corrected for the differential take-up rate: ………………………………………………………………………………. Practically: Impact = ……………………………………………………………………

Link back to the estimation formula Stage 1 Regress the participation on training on a dummy for whether person received additional visit (linear model) Compute predicted value of participation Stage 2 Regress wages on the predicted value of participation

Example 2: School autonomy in Nepal Goal To Evaluate: Autonomous school management by communities School report cards Data You can include 1000 schools in the evaluation Each community freely chooses to participate or not School report cards done by NGOs Each community has exactly one school Task Design the implementation of the program so it can be evaluated –propose method of evaluation.

School autonomy in Nepal Intervention B: School report card intervention by NGO. Yes No Total Instrumental variable for Intervention A: NGO visits community to inform on procedures for transfer of the school to community management. 300 600 200 400 500 1000

Reminder and a word of caution… Day 3 - Technical Track Session IV: Instrumental Variables Reminder and a word of caution… corr (Z,ε) =0 If corr (Z , ε) ≠ 0, “Bad instrument” “Finding” a good instrument is hard! But you can build one yourself with a randomized encouragement design corr (Z,P) ≠0 “Weak instruments”: the correlation between Z and P needs to be sufficiently strong. If not, the bias stays large even for large sample sizes.

Recovering TOT from ATE in case of non-compliance Sometimes eligible units are selected randomly into the treatment group, are offered treatment, but not all of them accept it. Computing the Average Treatment Effect (ATE) Straight difference in average outcomes between the group to whom you offered treatment, and the group to whom you did not offer treatment Computing the Effect of Treatment on the Treated (TOT) Use the randomized offering as an instrumental variable (Z) for whether people accepted the treatment (P)

Note: IV is a ‘local’ effect IV methods identify the average gains to persons induced to change their choice by a change of the instrument (referred to as compliers) … however we cannot identify who these people are (“local average treatment effect” or LATE) … different instruments will identify different parameters and answer different questions Caution in extrapolating to the whole population

Day 3 - Technical Track Session VI: Matching References Angrist, J. D. and A. Krueger (2001). “Instrumental Variables and the Search for Identification: From Supply and Demand to Natural Experiments”, Journal of Economic Perspectives, 15(4). Angrist, J. D., G. W. Imbens and D. B. Rubin (1996). “Identification of Causal Effects Using Instrumental Variables”, Journal of the American Statistical Association, Vol. 91, 434. Angrist, J., Bettinger, E., Bloom, E., King, E. and M. Kremer (2002). “Vouchers for Private Schooling in Colombia: Evidence from a Randomized Natural Experiment”, American Economic Review, 92, 5. Bradlow, E., (1998). “Encouragement Designs: An Approach to Self-Selected Samples in an Experimental Design”, Marketing Letters, 9(4) Imbens, G. W. and J. D. Angrist, (1994). “Identification and Estimation of Local Average Treatment Effects.” Econometrica, 62(2). Newman, J., M. Pradhan, L. B. Rawlings, G. Ridder, R. Coa, J. L. Evia, (2002). “An Impact Evaluation of Education, Health, and Water Supply Investments by the Bolivian Social Investment Fund.”, World Bank Economic Review, vol. 16(2).