Download presentation
Presentation is loading. Please wait.
1
Matching Methods & Propensity Scores
Garret Christensen (Taken from Kenny Ajayi) October 27, 2009 Global Poverty and Impact Evaluation
2
Program Evaluation Methods
Randomization (Experiments) Quasi-Experiments Regression Discontinuity Matching, Propensity Score Difference-in-Differences
3
Matching Methods Creating a counterfactual
To measure the effect of a program, we want to measure E[Y | D = 1, X] - E[Y | D = 0, X] but we only observe one of these outcomes for each individual.
4
Evaluation Exercise Argentine Antipoverty Program
5
Basic Idea Match each participant (treated) with one or more nonparticipants (untreated) with similar observed characteristics Counterfactual = matched comparison group (i.e. nonparticipants with same characteristics as participants) Illustrate Example
6
Basic Idea This assumes that there is no selection bias based on unobserved characteristics i.e. there is “selection on observables” and participation is independent of outcomes once we control for observable characteristics (X) What might some of these unobserved characteristics be?
7
Propensity Score When the set of observed variables is large, we match participants with non participants using a summary measure: the propensity score: the probability of participating in the program (being treated), as a function of the individual’s observed characteristics P(X) = Prob(D = 1|X) D indicates participation in project X is the set of observable characteristics In practice, matching individual characteristics is very hard. The entire vector of X observed characteristics could be huge. Instead of attempting to create a match for each participant with exactly the same value of X, we can instead match on the probability of participation.
8
if there had not been a program
Propensity Score We maintain the assumption of selection on observables: i.e., assume that participation is independent of outcomes conditional on Xi E (Y|X, D = 1) = E (Y|X, D = 0) if there had not been a program This is false if there are unobserved outcomes affecting participation
9
Evaluation Exercise Argentine Antipoverty Program
10
Propensity Score Matching
Get representative and comparable data on participants and nonparticipants (ideally using the same survey & a similar time period)
11
Propensity Score Matching
Get representative and comparable data on participants and nonparticipants (ideally using the same survey & a similar time period) Estimate the probability of program participation as a function of observable characteristics (using a logit or other discrete choice model)
12
Jalan and Ravallion (2003)
14
Propensity Score Matching
Get representative and comparable data on participants and nonparticipants (ideally using the same survey & a similar time period) Estimate the probability of program participation as a function of observable characteristics (using a logit or other discrete choice model) Use predicted values from estimation to generate propensity score p(xi) for all treatment and comparison group members
15
Propensity Score Matching
Match Participants: Find a sample of non-participants with similar p(xi) Restrict samples to ensure common support
16
Density of scores for participants
Common Support Density Density of scores for non- participants Density of scores for participants Region of common support High probability of participating, given X Low probability of participating, given X 1 Propensity score
17
Propensity Score Matching
Match Participants: Find a sample of non-participants with similar p(xi) Restrict samples to ensure common support Determine a tolerance limit: how different can matched control individuals or villages be? Decide on a matching technique Nearest neighbors, nonlinear matching, multiple matches
18
Propensity Score Matching
Once matches are made, we can calculate impact by comparing the means of outcomes across participants and their matches The difference in outcomes for each participant and its match is the estimate of the gain due to the program for that observation. Calculate the mean of these individual gains to obtain the average overall gain.
19
Possible Scenarios Case 1: Baseline Data Exists
Arrive at baseline, we can match participants with nonparticipants using baseline characteristics. Case 2: No Baseline Data. Arrive afterwards, we can only match participants with nonparticipants using time-invariant characteristics.
20
Extensions Be cautious of ex-post matching
Matching on variables that change due to program participation (i.e. endogenous variables) What are some invariable characteristics?
21
Key Factors Identification Assumption Data Requirements
Selection on Observables: After controlling for observables, treated and control groups are not systematically different Data Requirements Rich data on as many observable characteristics as possible Large sample size (so that it is possible to find appropriate match)
22
Additional Considerations
Advantages Might be possible to do with existing survey data Doesn’t require randomization/experiment/baseline data Allows estimation of heterogeneous treatment effects because we have individual counterfactuals, instead of just having group averages.
23
Additional Considerations
Disadvantages Strong (if not heroic) identifying assumption: that there are no unobserved differences but if individuals are otherwise identical, then why did some participate and others not? Requires good quality data Need to match on as many characteristics as possible Requires sufficiently large sample size Need a match for each participant in the treatment group
24
Jalan & Ravallion (2003b) Does piped water reduce diarrhea for children in rural India?
25
Data Rural Household Survey What would you use for D, Y, and X?
No baseline data Detailed information on: Health status of household members Education levels of household members Household income Access to piped water What would you use for D, Y, and X?
26
Propensity Score Regression
27
Propensity Score Regression
28
Matching Prior to matching, the estimated propensity scores for those with and without piped water were, respectively, and After matching there was negligible difference in the mean propensity scores of the two groups 0.3743, for those with piped water 0.3742, for the matched control group
29
Results “Prevalence and duration of diarrhea among children under five in rural India are significantly lower on average for families with piped water than for observationally identical households without it.” “However, our results indicate that the health gains largely by-pass children in poor families, particularly when the mother is poorly educated.”
30
Matching is a useful way to control for OBSERVABLE heterogeneity
Conclusion Matching is a useful way to control for OBSERVABLE heterogeneity Especially when randomization or RD approach is not possible However, it requires relatively strong assumptions
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.