Presentation is loading. Please wait.

Presentation is loading. Please wait.

Causal Inference: experimental and quasi-experimental methods Draft ©G. Mason (2005)

Similar presentations


Presentation on theme: "Causal Inference: experimental and quasi-experimental methods Draft ©G. Mason (2005)"— Presentation transcript:

1

2 Causal Inference: experimental and quasi-experimental methods Draft ©G. Mason (2005)

3 Scientific truth always goes through three stages. First, people say it conflicts with the Bible; next they say it has been discovered before; and lastly they say that they always believed it Louis Agassiz, Swiss naturalist We do not now a truth without knowing its cause Aristotle, Nicomachean Ethics Development of Western science is based on two great achievements: the invention of the formal logical system (Euclidean geometry) by the Greek philosophers, and the discovery of the possibility to find out causal relationships by systematic experiment (during the Renaissance) Albert Einstein

4 Models of cause and effect 1 Fishbone Effect A B C

5 Path Diagram (Regression sentence) Y X1 X2 XK : B1 B2 BK Error Models of cause and effect 2

6 Concept of causality Causality often implies inevitability, but the reality is that causal statements usually reflect degrees of uncertainty. Causality and probability are fundamentally connected because we want to : –Know the causes of an event –Measure the relative strength of these causes

7 Randomized experiments Classic experiment is the random, double- blind experiment (RDE): –subjects are selected randomly into a treatment and control group –each subject received a code –an independent third party assigns codes randomly to treatment and control group members. –the treatment is not identifiable (i.e., the real and fake pill are identical. –those administering the treatments and placebo have no knowledge of what subject receives.

8 Key benefits of the RDE randomization creates statistically equivalent groups in the absence of any interventions (the drug under tests), the incidence of disease is the same for both groups the groups are the “same” (statistically, except that one gets the drug and the other a placebo analysis can be done by difference of means tests or other basic techniques.

9

10 Limits of RDE In social science, randomized double blind experiments are often not feasible: –human subjects are unreliable (they move, die or otherwise fail to participate in the full experiment). –many see the administration of a placebo as withholding a treatment. –social policy cannot be masked (creating a placebo is difficult).

11 Quasi-experimental designs Most policy testing in social sciences uses a quasi-experimental design. Two approaches exist –Multivariate (regression) models specify dependent variable outcome, and include dummy variables to identify those in the program. Other covariates are included to control for the interventions. –Matching: Program participants and non- participants serve as the basis for the treatment and control groups.

12 Four potential models for evaluating policy 1.Randomized control (RC) 2.Natural experiments (difference-in- difference, discontinuity) 3.Quasi-experimental methods –Heckman two step –Statistical Matching 4.Instrumental variables (treated separately)

13 Randomized control Attempts to create a situation where Cov (X’,  ) = 0, or E(T’,  ) = E(T”, W), where W are the omitted variables that determine selection into treatment.

14 Natural experiments Create a “split” in the sample, where treated and untreated are classified by a variable that is not related to the the treatment. This split occur “naturally” where the program change occurs in one area/jurisdiction, not in others that are “closely similar.” Difference-in-differences (DID) methods are a common evaluation framework.

15 Difference in Differences The DID estimator uses the average before and after values for an outcome variable for the program and comparison group. DID = [Y p (t=a) – Y p (t=b) ] – [Y c (t=a) - Y c (t=b) ] Example: –Avg. earned income before - program group = $4500 –Avg. earned income after - program group = $6500 –Avg. earned income before - comparison group = $10,500 –Avg. earned income after - comparison group = $11,000 DID = [6500 - 4500] – [11,000 – 10,500] = $1,500 = net impact attributable to the program (treatment)

16 Net impact using DID

17 Causal Inference – comparison in regression Problem: Estimate effect of treatment (T) on observed outcome (Y), or estimate B in Y i = B 0 + B 1 T i +  i = X i B +  I (where X = [1, T] Assume –dichotomous treatment variable: T=1 if treated, 0 otherwise –homogeneous treatment effect (B) (every i experiences same effect) “average treatment effect” –Linear (no dose) –no covariates mediate the outcome

18 B hat = Y bar(T=1) – Y Bar (T=0) The simple comparison group model

19 OLS assumption E (X’,  ) = 0, or E (X’,  ) = X’ (Y- B OLS X) which then creates the OLS estimator B OLS = (X’X) –1 X’y But, with omitted variables, the validity of OLS requires the omitted variables to be uncorrelated with T (the treatment). This is the essence of the self-selection problem. No covariates is the key assumption

20 Selection and attrition Random samples may be upset by self-selection and attrition (or both)

21 The original motivation for this procedure was to correct evaluation studies for sample distortion caused by self-selection. Two steps: 1.Estimate the probability of participation for each participant and non-participant. Y i = B 0 + B 1 X i1 + B 2 X i2 + … B k X ik + e i (Y=1 for a participant, for a non-participant). 2.For each participant/non-participant a unique probability of participation will be estimated. Call this λ i Now, this is inserted into the outcome regression W i = B 0 + B 1 X i1 + B 2 X i2 + … B k X ik + B l λ i + e i, where Wi is the outcome for person i (wages, hours worked, etc.) Heckman two step procedure (basic)

22 Matching In social experiments, participants differ from non- participants because: –failure to hear of program –constraints on participation or completion –selection by staff Matching participants and non-participants can be accomplished via –pair-wise –statistical

23 Matching Process

24 Pair wise matching The theory will indicate those attributes that are likely to make a difference in the quasi- experiment. –For labour markets, gender, education and rural- urban location are important –For health policy, age, rural-urban, and family history might be important. The analyst starts with the first variable, and divides the participants and non-participants into two sets. Within the sets the samples are classified with respect to the second, variable and so on.

25 Pair wise Matching

26 Statistical Matching Matching is needed because we cannot randomly allocate clients to the program and comparison groups. Program benefits cannot be withheld. Logit model provides the estimate of the propensity to participate for participants and non-participants. The key idea is that we estimate that propensity to participate is based on observed attributes of the participants and non-participants. Participants are assigned a “Y”value of 1 and non-participants are assigned a “Y” value of 0. A logistic regression then estimates the propensity to participate. Note that even though a non-participant actually did not participate the model will assign a score between 0 and 1. Typically non-participants will have lower scores than participants, but there will be an overlap. The overlap is termed the region of common support.

27 Rationale for statistical matching Matching is needed because we cannot randomly allocate EI clients to the program and comparison groups. Part II benefits cannot be withheld. Logit model provides the estimate of the propensity to participate for participants and non-participants. The key idea is that we estimate that propensity to participate is based on observed attributes of the participants and non-participants. Participants are assigned a “Y”value of 1 and non-participants are assigned a “Y” value of 0. A logistic regression then estimates the propensity to participate. Note that even though a non-participant actually did not participate the model will assign a score between 0 and 1. Typically non-participants will have lower scores than participants, but there will be an overlap. The overlap is termed the region of common support.

28 Statistical matching simplified

29 The logit model LPM Model P i = E(Y = 1| X i ) = B 1 + B 2 X 2 + B 3 X 3 +..+B k X k Logit Model P i = E(Y = 1| X i ) = 1/[1+ e – (  BiXi) ] In Log Odds format L i = ln(P i /1-P i ) = Z i =  B i X i

30 Region of Common Support Each participant has the value of 1 for P and each non-participant has the value 0. However, once the model is estimated, each participant and non-participant has a score between 0 and 1. Participants tend to have scores closer to 1 and non-participants are closer to 0. The distribution of scores can be graphed.

31 Statistical Matching

32 Statistical Matching: The region of common support

33 Issues in Matching The matching is limited to the variables available in the administrative files. The balancing test compares the program and participant groups for each covariate using a t test for differences in means. Two key weaknesses are: –matching on the observed variables may not align the program and comparison groups on the non-observed variables. –The statistical quality of the match is very important Every additional variable that is introduced to the matching equation process, potentially improves the closeness of the match.

34 Application of the DID estimator in a matching context When combined with control, it measures the impact of the observed differences between the two groups, which is participation in treatment (program) Cannot measure the net impact of different interventions unless these are added as covariates. This requires a 1 – 1 match between a program participant and a non-participant (i.e.., matched program and comparison groups) DID i-j = B 1 T 1,i-j + B 2 T 2,i-j +..B k T k,i-j + u i-j


Download ppt "Causal Inference: experimental and quasi-experimental methods Draft ©G. Mason (2005)"

Similar presentations


Ads by Google