Explanation of slide: Logos, to show while the audience arrive.

Slides:



Advertisements
Similar presentations
Designing an impact evaluation: Randomization, statistical power, and some more fun…
Advertisements

The World Bank Human Development Network Spanish Impact Evaluation Fund.
Holland on Rubin’s Model Part II. Formalizing These Intuitions. In the 1920 ’ s and 30 ’ s Jerzy Neyman, a Polish statistician, developed a mathematical.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Study Design Data. Types of studies Design of study determines whether: –an inference to the population can be made –causality can be inferred random.
TOOLS OF POSITIVE ANALYSIS
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Chapter 2: The Research Enterprise in Psychology
Chapter 2: The Research Enterprise in Psychology
Understanding Statistics
Chapter 1: The Research Enterprise in Psychology.
Africa Impact Evaluation Program on AIDS (AIM-AIDS) Cape Town, South Africa March 8 – 13, Causal Inference Nandini Krishnan Africa Impact Evaluation.
CAUSAL INFERENCE Presented by: Dan Dowhower Alysia Cohen H 615 Friday, October 4, 2013.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Ch. 2 Tools of Positive Economics. Theoretical Tools of Public Finance theoretical tools The set of tools designed to understand the mechanics behind.
Propensity Score Matching for Causal Inference: Possibilities, Limitations, and an Example sean f. reardon MAPSS colloquium March 6, 2007.
LECTURE 1 - SCOPE, OBJECTIVES AND METHODS OF DISCIPLINE "ECONOMETRICS"
Africa Program for Education Impact Evaluation Dakar, Senegal December 15-19, 2008 Experimental Methods Muna Meky Economist Africa Impact Evaluation Initiative.
Impact Evaluation Sebastian Galiani November 2006 Causal Inference.
Public Finance and Public Policy Jonathan Gruber Third Edition Copyright © 2010 Worth Publishers 1 of 24 Copyright © 2010 Worth Publishers.
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
Impact Evaluation Methods Randomization and Causal Inference Slides by Paul J. Gertler & Sebastian Martinez.
Cross-Country Workshop for Impact Evaluations in Agriculture and Community Driven Development Addis Ababa, April 13-16, Causal Inference Nandini.
Psy B07 Chapter 1Slide 1 BASIC CONCEPTS. Psy B07 Chapter 1Slide 2  Population  Random Sampling  Random Assignment  Variables  What do we do with.
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
Kenya Evidence Forum - June 14, 2016 Using Evidence to Improve Policy and Program Designs How do we interpret “evidence”? Aidan Coville, Economist, World.
Copyright © 2009 Pearson Education, Inc. Chapter 13 Experiments and Observational Studies.
Intro to Research Methods
Chapter 2: The Research Enterprise in Psychology
CHAPTER 4 Designing Studies
Prepared by Lloyd R. Jaisingh
Ch. 2 Tools of Positive Economics
Inference and Tests of Hypotheses
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Chapter Six Normal Curves and Sampling Probability Distributions
Statistical Data Analysis
Explanation of slide: Logos, to show while the audience arrive.
Chapter 4: The Nature of Regression Analysis
Impact evaluation: The quantitative methods with applications
CHAPTER 4 Designing Studies
Cross Sectional Designs
Ten things about Experimental Design
Introduction to Econometrics
Experiments with Two Groups
Explanation of slide: Logos, to show while the audience arrive.
Empirical Tools of Public Finance
CHAPTER 4 Designing Studies
Evaluating Impacts: An Overview of Quantitative Methods
Chapter 2 What is Research?
Statistical Data Analysis
Explanation of slide: Logos, to show while the audience arrive.
Experimental Design: The Basic Building Blocks
Explanation of slide: Logos, to show while the audience arrive.
CHAPTER 4 Designing Studies
Intro to Confidence Intervals Introduction to Inference
CHAPTER 4 Designing Studies
CHAPTER 10 Comparing Two Populations or Groups
Positive analysis in public finance
CHAPTER 4 Designing Studies
Financial Econometrics Fin. 505
CHAPTER 4 Designing Studies
Primary Data Collection: Experimentation
Introduction to Producing Data
Chapter 4: The Nature of Regression Analysis
Sample Sizes for IE Power Calculations.
CHAPTER 4 Designing Studies
CHAPTER 10 Comparing Two Populations or Groups
Correlation and Causality
CHAPTER 4 Designing Studies
Chapter Ten: Designing, Conducting, Analyzing, and Interpreting Experiments with Two Groups The Psychologist as Detective, 4e by Smith/Davis.
Presentation transcript:

Explanation of slide: Logos, to show while the audience arrive.

Technical Track Session I CAUSAL INFERENCE Technical Track Session I Christel Vermeersch The World Bank

Policy questions are causal in nature Day 2 - Technical Track Session I: Causal Inference Policy questions are causal in nature Cause-effect relationships are a part of what policy makers do: Does school decentralization improve school quality? But the statistics you learnt in school/university do not address this… Does one more year of education cause higher income? Does conditional cash transfers cause better health outcomes in children? How do we improve student learning?

Standard Statistical Analysis Day 2 - Technical Track Session I: Causal Inference Standard Statistical Analysis Tools Likelihood and other estimation techniques. Aim To infer parameters of a distribution from samples drawn of that distribution. Uses With the help of such parameters, one can: Infer association among variables, Estimate the likelihood of past and future events, Update the likelihood of events in light of new evidence or new measurement.

Standard Statistical Analysis Condition For this to work well, experimental conditions must remain the same. But our policy questions were: If I decentralize schools, will quality improve? If I find a way to make a child go to school longer, will she earn more money? If I start giving cash to families, will their children become healthier? If I train teachers, will students learn more? The conditions change!!

Day 2 - Technical Track Session I: Causal Inference Causal Analysis For causal questions, we need to infer aspects of the data generation process. We need to be able to deduce: the likelihood of events under static conditions, (as in Standard Statistical Analysis) and also the dynamics of events under changing conditions.

Day 2 - Technical Track Session I: Causal Inference Causal Analysis “dynamics of events under changing conditions” includes: Predicting the effects of interventions. Predicting the effects of spontaneous changes. Identifying causes of reported events.

Causation vs. Correlation Day 2 - Technical Track Session I: Causal Inference Causation vs. Correlation Standard statistical analysis/probability theory: The word “cause” is not in its vocabulary. Allows us to say is that two events are mutually correlated, or dependent. This is not enough for policy makers They look at rationales for policy decisions: if we do XXX, then will we get YYY? We need a vocabulary for causality.

Vocabulary for Causality THE RUBIN CAUSAL MODEL Vocabulary for Causality

Population & Outcome Variable Day 2 - Technical Track Session I: Causal Inference Population & Outcome Variable Define the population by U. Each unit in U is denoted by u. The outcome of interest is Y. Also called the response variable. For each u  U, there is an associated value Y(u).

Day 2 - Technical Track Session I: Causal Inference “ Causes/Treatment Causes are those things that could be treatments in hypothetical experiments. - Rubin For simplicity, we assume that there are just two possible states: Unit u is exposed to treatment and Unit u is exposed to comparison.

The Treatment Variable Let D be a variable that indicates the state to which each unit in U is exposed. 1 If unit u is exposed to treatment D = 0 If unit u is exposed to comparison Where does D come from? In a controlled study: constructed by the experimenter. In an uncontrolled study: determined by factors beyond the experimenter’s control.

Linking Y and D Y=response variable D= treatment variable Day 2 - Technical Track Session I: Causal Inference Linking Y and D Y=response variable D= treatment variable The response Y is potentially affected by whether u receives treatment or not. Thus, we need two response variables: Y1(u) is the outcome if unit u is exposed to treatment. Y0(u) is the outcome if unit u is exposed to comparison.

The effect of treatment on outcome Treatment variable D 1 If unit u is exposed to treatment D = 0 If unit u is exposed to comparison Response variable Y Y1(u) is the outcome if unit u is exposed to treatment Y0(u) is the outcome if unit u is exposed to comparison For any unit u, treatment causes the effect δu = Y1 (u) - Y0 (u)

But there is a problem: For any unit u, treatment causes the effect δu = Y1 (u) - Y0 (u) Fundamental problem of causal inference For a given unit u, we observe either Y1 (u) or Y0 (u), it is impossible to observe the effect of treatment on u by itself! We do not observe the counterfactual If we give u treatment, then we cannot observe what would have happened to u in the absence of treatment.

So what do we do? Instead of measuring the treatment effect on unit u, we identify the average treatment effect for the population U (or for sub-populations)

Day 2 - Technical Track Session I: Causal Inference Estimating the ATE So, Replace the impossible-to-observe treatment effect of D on a specific unit u, with the possible-to-estimate average treatment effect of D over a population U of such units. Although EU (Y1 ) and EU (Y0 ) cannot both be calculated, they can be estimated. Most econometrics methods attempt to construct from observational data consistent estimators of: EU (Y1 ) = Y̅1 and EU (Y0 )= Y̅0

A simple estimator of ATEU Day 2 - Technical Track Session I: Causal Inference A simple estimator of ATEU So we are trying to estimate: ATEU = EU (Y1) - EU (Y0) = Y̅1 - Y̅0 (1) Consider the following simple estimator: δ̅̂ = [ Y̅̂1 | D = 1] - [ Y̅̂0 | D =0 ] (2) Note Equation (1) is defined for the whole population. Equation (2) is an estimator to be computed on a sample drawn from that population.

Day 2 - Technical Track Session I: Causal Inference An important lemma

Fundamental Conditions Day 2 - Technical Track Session I: Causal Inference Fundamental Conditions Thus, a sufficient condition for the simple estimator to consistently estimate the true ATE is that: [Y̅1 |D=1]=[Y̅1 |D=0] The average outcome under treatment Y̅1 is the same for the treatment (D=1) and the comparison (D=0) groups And [Y̅0 |D=1]=[Y̅0 |D=0] The average outcome under comparison Y̅0 is the same for the treatment (D=1) and the comparison (D=0) groups

When will those conditions be satisfied? It is sufficient that treatment assignment D be uncorrelated with the potential outcome distributions of Y0 and Y1 Intuitively, there can be no correlation between (1) Whether someone gets the treatment and (2) How much that person potentially benefits from the treatment. The easiest way to achieve this uncorrelatedness is through random assignment of treatment.

Another way of looking at it Day 2 - Technical Track Session I: Causal Inference Another way of looking at it With some algebra, it can be shown that:

Another way of looking at it (in words) Day 2 - Technical Track Session I: Causal Inference Another way of looking at it (in words) There are two sources of bias that need to be eliminated from estimates of causal effects : Baseline difference/Selection bias Heterogeneous response to the treatment Most of the methods available only deal with selection bias.

Treatment on the Treated Day 2 - Technical Track Session I: Causal Inference Treatment on the Treated “ Average Treatment Effect is not always the parameter of interest… often, it is the average treatment effect for the treated that is of substantive interest:

Treatment on the Treated Day 2 - Technical Track Session I: Causal Inference Treatment on the Treated If we need to estimate Treatment on the Treated: Then the simple estimator (2) consistently estimates Treatment on the Treated if: “No baseline difference between the treatment and comparison groups.”

Day 2 - Technical Track Session I: Causal Inference References Judea Pearl (2000): Causality: Models, Reasoning and Inference, Cambridge University press. (Book) Chapters 1, 5 and 7. Trygve Haavelmo (1944): “The probability approach in econometrics”, Econometrica 12, pp. iii-vi+1-115. Arthur Goldberger (1972): “Structural Equations Methods in the Social Sciences”, Econometrica 40, pp. 979-1002. Donald B. Rubin (1974): “Estimating causal effects of treatments in randomized and nonrandomized experiments”, Journal of Educational Psychology 66, pp. 688-701. Paul W. Holland (1986): “Statistics and Causal Inference”, Journal of the American Statistical Association 81, pp. 945-70, with discussion.