1 THE SYMBIOTIC APPROACH TO CAUSAL INFERENCE Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/)

Slides:



Advertisements
Similar presentations
Department of Computer Science
Advertisements

From Propensity Scores And Mediation To External Validity
RELATED CLASS CS 262 Z – SEMINAR IN CAUSAL MODELING CURRENT TOPICS IN COGNITIVE SYSTEMS INSTRUCTOR: JUDEA PEARL Spring Quarter Monday and Wednesday, 2-4pm.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
1 WHAT'S NEW IN CAUSAL INFERENCE: From Propensity Scores And Mediation To External Validity Judea Pearl University of California Los Angeles (
Omitted Variable Bias Methods of Economic Investigation Lecture 7 1.
CAUSES AND COUNTERFACTUALS Judea Pearl University of California Los Angeles (
Linear Regression and Binary Variables The independent variable does not necessarily need to be continuous. If the independent variable is binary (e.g.,
TRYGVE HAAVELMO AND THE EMERGENCE OF CAUSAL CALCULUS Judea Pearl University of California Los Angeles (
PHSSR IG CyberSeminar Introductory Remarks Bryan Dowd Division of Health Policy and Management School of Public Health University of Minnesota.
THE MATHEMATICS OF CAUSAL MODELING Judea Pearl Department of Computer Science UCLA.
RECONSIDERING DEFINITIONS OF DIRECT AND INDIRECT EFFECTS IN MEDIATION ANALYSIS WITH A SOLUTION FOR A CONTINUOUS FACTOR Ilya Novikov, Michal Benderly, Laurence.
COMMENTS ON By Judea Pearl (UCLA). notation 1990’s Artificial Intelligence Hoover.
Causal Networks Denny Borsboom. Overview The causal relation Causality and conditional independence Causal networks Blocking and d-separation Excercise.
Specifying a Purpose, Research Questions or Hypothesis
Judea Pearl University of California Los Angeles CAUSAL REASONING FOR DECISION AIDING SYSTEMS.
Sampling Theory Determining the distribution of Sample statistics.
1 CAUSAL INFERENCE: MATHEMATICAL FOUNDATIONS AND PRACTICAL APPLICATIONS Judea Pearl University of California Los Angeles (
CAUSES AND COUNTERFACTUALS OR THE SUBTLE WISDOM OF BRAINLESS ROBOTS.
1 WHAT'S NEW IN CAUSAL INFERENCE: From Propensity Scores And Mediation To External Validity Judea Pearl University of California Los Angeles (
Objectives of Multiple Regression
CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (
1 REASONING WITH CAUSES AND COUNTERFACTUALS Judea Pearl UCLA (
Judea Pearl Computer Science Department UCLA DIRECT AND INDIRECT EFFECTS.
Judea Pearl University of California Los Angeles ( THE MATHEMATICS OF CAUSE AND EFFECT.
Lecture 8: Generalized Linear Models for Longitudinal Data.
Judea Pearl University of California Los Angeles ( THE MATHEMATICS OF CAUSE AND EFFECT.
THE MATHEMATICS OF CAUSE AND EFFECT: With Reflections on Machine Learning Judea Pearl Departments of Computer Science and Statistics UCLA.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
V13: Causality Aims: (1) understand the causal relationships between the variables of a network (2) interpret a Bayesian network as a causal model whose.
REASONING WITH CAUSE AND EFFECT Judea Pearl Department of Computer Science UCLA.
BIOST 536 Lecture 11 1 Lecture 11 – Additional topics in Logistic Regression C-statistic (“concordance statistic”)  Same as Area under the curve (AUC)
Chapter 13 Multiple Regression
CAUSES AND COUNTERFACTIALS IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (
REASONING WITH CAUSE AND EFFECT Judea Pearl Department of Computer Science UCLA.
THE MATHEMATICS OF CAUSE AND COUNTERFACTUALS Judea Pearl University of California Los Angeles (
Chapter 8: Simple Linear Regression Yang Zhenlin.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
1 Chapter 16 logistic Regression Analysis. 2 Content Logistic regression Conditional logistic regression Application.
Impact Evaluation Sebastian Galiani November 2006 Causal Inference.
Applying Causal Inference Methods to Improve Identification of Health and Healthcare Disparities, and the Underlying Mediators and Moderators of Disparities.
POPLHLTH 304 Regression (modelling) in Epidemiology Simon Thornley (Slides adapted from Assoc. Prof. Roger Marshall)
Judea Pearl Computer Science Department UCLA ROBUSTNESS OF CAUSAL CLAIMS.
CAUSAL REASONING FOR DECISION AIDING SYSTEMS COGNITIVE SYSTEMS LABORATORY UCLA Judea Pearl, Mark Hopkins, Blai Bonet, Chen Avin, Ilya Shpitser.
Mediation: The Causal Inference Approach David A. Kenny.
1 CONFOUNDING EQUIVALENCE Judea Pearl – UCLA, USA Azaria Paz – Technion, Israel (
Summary: connecting the question to the analysis(es) Jay S. Kaufman, PhD McGill University, Montreal QC 26 February :40 PM – 4:20 PM National Academy.
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
Variable selection in Regression modelling Simon Thornley.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
Identification in Econometrics: A Way to Get Causal Information from Observations? Damien Fennell, LSE UCL, May 27, 2005.
THE MATHEMATICS OF CAUSE AND EFFECT: Thinking Nature and Talking Counterfactuals Judea Pearl Departments of Computer Science and Statistics UCLA.
CAUSAL INFERENCE IN STATISTICS: A Gentle Introduction Judea Pearl Departments of Computer Science and Statistics UCLA.
Explanation of slide: Logos, to show while the audience arrive.
University of California
Department of Computer Science
Regression Analysis Week 4.
Chen Avin Ilya Shpitser Judea Pearl Computer Science Department UCLA
Department of Computer Science
A MACHINE LEARNING EXERCISE
Computer Science and Statistics
CAUSAL INFERENCE IN STATISTICS
From Propensity Scores And Mediation To External Validity
THE MATHEMATICS OF PROGRAM EVALUATION
Counterfactual models Time dependent confounding
Department of Computer Science
CAUSAL REASONING FOR DECISION AIDING SYSTEMS
Presentation transcript:

1 THE SYMBIOTIC APPROACH TO CAUSAL INFERENCE Judea Pearl University of California Los Angeles (

2 Inference: Statistical vs. Causal, distinctions, and mental barriers Unified conceptualization of counterfactuals, structural-equations, and graphs The logical equivalence of SEM and potential outcomes How to use the best features of both Direct and Indirect effects The Mediation Formula and its ramifications OUTLINE

3 TRADITIONAL STATISTICAL INFERENCE PARADIGM Data Inference Q(P) (Aspects of P ) P Joint Distribution e.g., Infer whether customers who bought product A would also buy product B. Q = P(B | A)

4 What happens when P changes? e.g., Infer whether customers who bought product A would still buy A if we were to double the price. FROM STATISTICAL TO CAUSAL ANALYSIS: 1. THE DIFFERENCES Probability and statistics deal with static relations Data Inference Q( P ) (Aspects of P ) P Joint Distribution P Joint Distribution change

5 FROM STATISTICAL TO CAUSAL ANALYSIS: 1. THE DIFFERENCES Note: P (v)  P (v | price = 2) P does not tell us how it ought to change e.g. Curing symptoms vs. curing diseases e.g. Analogy: mechanical deformation What remains invariant when P changes say, to satisfy P (price=2)=1 Data Inference Q( P ) (Aspects of P ) P Joint Distribution P Joint Distribution change

6 FROM STATISTICAL TO CAUSAL ANALYSIS: 1. THE DIFFERENCES (CONT) CAUSAL Spurious correlation Randomization / Intervention Confounding / Effect Instrumental variable Exogeneity / Ignorability Mediation STATISTICAL Regression Association / Independence “Controlling for” / Conditioning Odd and risk ratios Collapsibility / Granger causality Propensity score 1.Causal and statistical concepts do not mix

7 CAUSAL Spurious correlation Randomization / Intervention Confounding / Effect Instrumental variable Exogeneity / Ignorability Mediation STATISTICAL Regression Association / Independence “Controlling for” / Conditioning Odd and risk ratios Collapsibility / Granger causality Propensity score 1.Causal and statistical concepts do not mix Causal assumptions cannot be expressed in the mathematical language of standard statistics. FROM STATISTICAL TO CAUSAL ANALYSIS: 2. MENTAL BARRIERS 2.No causes in – no causes out (Cartwright, 1989) statistical assumptions + data causal assumptions causal conclusions  }

8 4.Non-standard mathematics: a)Structural equation models (Wright, 1920; Simon, 1960) b)Counterfactuals (Neyman-Rubin (Y x ), Lewis (x Y)) CAUSAL Spurious correlation Randomization / Intervention Confounding / Effect Instrumental variable Exogeneity / Ignorability Mediation STATISTICAL Regression Association / Independence “Controlling for” / Conditioning Odd and risk ratios Collapsibility / Granger causality Propensity score 1.Causal and statistical concepts do not mix. 3.Causal assumptions cannot be expressed in the mathematical language of standard statistics. FROM STATISTICAL TO CAUSAL ANALYSIS: 2. MENTAL BARRIERS 2.No causes in – no causes out (Cartwright, 1989) statistical assumptions + data causal assumptions causal conclusions  }

9 Data Inference Q(M) (Aspects of M ) Data Generating Model M – Invariant strategy (mechanism, recipe, law, protocol) by which Nature assigns values to variables in the analysis. Joint Distribution THE STRUCTURAL MODEL PARADIGM M “Think Nature, not experiment!”

10 STRUCTURAL CAUSAL MODELS Definition: A structural causal model is a 4-tuple  V,U, F, P(u) , where V = {V 1,...,V n } are endogeneas variables U = {U 1,...,U m } are background variables F = {f 1,..., f n } are functions determining V, v i = f i (v, u) P(u) is a distribution over U P(u) and F induce a distribution P(v) over observable variables e.g.,

11 CAUSAL MODELS AND COUNTERFACTUALS Definition: The sentence: “ Y would be y (in unit u ), had X been x,” denoted Y x (u) = y, means: The solution for Y in a mutilated model M x, (i.e., the equations for X replaced by X = x ) with input U=u, is equal to y. U X (u)Y (u) M U X = xY X (u) MxMx

12 CAUSAL MODELS AND COUNTERFACTUALS Definition: The sentence: “ Y would be y (in unit u ), had X been x,” denoted Y x (u) = y, means: The solution for Y in a mutilated model M x, (i.e., the equations for X replaced by X = x ) with input U=u, is equal to y. The Fundamental Equation of Counterfactuals:

13 In particular: CAUSAL MODELS AND COUNTERFACTUALS Definition: The sentence: “ Y would be y (in situation u ), had X been x,” denoted Y x (u) = y, means: The solution for Y in a mutilated model M x, (i.e., the equations for X replaced by X = x ) with input U=u, is equal to y. Joint probabilities of counterfactuals:

14 TWO PARADIGMS FOR CAUSAL INFERENCE Observed: P(X, Y, Z,...) Conclusions needed: P(Y x =y), P(X y =x | Z=z)... How do we connect observables, X,Y,Z,… to counterfactuals Y x, X z, Z y,… ? N-R model Counterfactuals are primitives, new variables Super-distribution Structural model Counterfactuals are derived quantities Subscripts modify the model and distribution

15 ARE THE TWO PARADIGMS EQUIVALENT? Yes (Galles and Pearl, 1998; Halpern 1998) In the N-R paradigm, Y x is defined by consistency: In SCM, consistency is a theorem. Moreover, a theorem in one approach is a theorem in the other.

16 Define: Assume: Identify: Estimate: Test: THE FIVE NECESSARY STEPS OF CAUSAL ANALYSIS Express the target quantity Q as a function Q ( M ) that can be computed from any model M. Formulate causal assumptions A using some formal language. Determine if Q is identifiable given A. Estimate Q if it is identifiable; approximate it, if it is not. Test the testable implications of A (if any).

17 THE FIVE NECESSARY STEPS FOR EFFECT ESTIMATION Define: Assume: Identify: Estimate: Test: Express the target quantity Q as a function Q ( M ) that can be computed from any model M. Formulate causal assumptions A using some formal language. Determine if Q is identifiable given A. Estimate Q if it is identifiable; approximate it, if it is not. Test the testable implications of A (if any).

18 FORMULATING ASSUMPTIONS THREE LANGUAGES 1. English: Smoking ( X ), Cancer ( Y ), Tar ( Z ), Genotypes ( U ) Not too friendly: consistent? complete? redundant? arguable? ZXY 3. Structural: 2. Counterfactuals:

19 Define: Assume: Identify: Estimate: IDENTIFYING CAUSAL EFFECTS IN POTENTIAL-OUTCOME FRAMEWORK Express the target quantity Q as a counterfactual formula, e.g., E(Y(1) – Y(0)) Formulate causal assumptions using the distribution: Determine if Q is identifiable using P* and Y=x Y (1) + (1 – x) Y (0). Estimate Q if it is identifiable; approximate it, if it is not.

20 IDENTIFICATION IN SCM Find the effect of X on Y, P ( y | do ( x )), given the causal assumptions shown in G, where Z 1,..., Z k are auxiliary variables. Z6Z6 Z3Z3 Z2Z2 Z5Z5 Z1Z1 X Y Z4Z4 G Can P ( y | do ( x )) be estimated if only a subset, Z, can be measured?

21 ELIMINATING CONFOUNDING BIAS THE BACK-DOOR CRITERION P(y | do(x)) is estimable if there is a set Z of variables such that Z d -separates X from Y in G x. Z6Z6 Z3Z3 Z2Z2 Z5Z5 Z1Z1 X Y Z4Z4 Z6Z6 Z3Z3 Z2Z2 Z5Z5 Z1Z1 X Y Z4Z4 Z z GxGx G Moreover, (“adjusting” for Z )  Ignorability

22 Front Door EFFECT OF WARM-UP ON INJURY (After Shrier & Platt, 2008) No, no! Watch out! Warm-up Exercises ( X ) Injury ( Y ) ???

23 GRAPHICAL – COUNTERFACTUALS SYMBIOSIS Every causal graph expresses counterfactuals assumptions, e.g., X  Y  Z consistent, and readable from the graph. Express assumption in graphs Derive estimands by graphical or algebraic methods 1.Missing arrows Y  Z 2. Missing arcs YZ

24 EFFECT DECOMPOSITION (direct vs. indirect effects) 1.Why decompose effects? 2.What is the definition of direct and indirect effects? 3.What are the policy implications of direct and indirect effects? 4.When can direct and indirect effect be estimated consistently from experimental and nonexperimental data?

25 WHY DECOMPOSE EFFECTS? 1.To understand how Nature works 2.To comply with legal requirements 3.To predict the effects of new type of interventions: Signal routing, rather than variable fixing

26 XZ Y LEGAL IMPLICATIONS OF DIRECT EFFECT What is the direct effect of X on Y ? (averaged over z ) (Qualifications) (Hiring) (Gender) Can data prove an employer guilty of hiring discrimination? Adjust for Z? No! No!

27 z = f (x, u) y = g (x, z, u) XZ Y NATURAL INTERPRETATION OF AVERAGE DIRECT EFFECTS Natural Direct Effect of X on Y: The expected change in Y, when we change X from x 0 to x 1 and, for each u, we keep Z constant at whatever value it attained before the change. In linear models, DE = Controlled Direct Effect Robins and Greenland (1992) – “Pure”

28 z = f (x, u) y = g (x, z, u) XZ Y DEFINITION OF INDIRECT EFFECTS Indirect Effect of X on Y: The expected change in Y when we keep X constant, say at x 0, and let Z change to whatever value it would have attained had X changed to x 1. In linear models, IE = TE - DE

29 POLICY IMPLICATIONS OF INDIRECT EFFECTS f GENDERQUALIFICATION HIRING What is the indirect effect of X on Y? The effect of Gender on Hiring if sex discrimination is eliminated. XZ Y IGNORE Blocking a link – a new type of intervention

30 1.The natural direct and indirect effects are identifiable in Markovian models (no confounding), 2.And are given by: 3.Applicable to linear and non-linear models, continuous and discrete variables, regardless of distributional form. MEDIATION FORMULAS

31 MEDIATION FORMULAS IN UNCONFOUNDED MODELS X Z Y

32 COMPUTING THE MEDIATION FORMULA X Z YXZY E(Y|x,z)= E(Y|x,z)=g xz E(Z|x)=h x n1n1n1n1000 n2n2n2n2001 n3n3n3n3010 n4n4n4n4011 n5n5n5n5100 n6n6n6n6101 n7n7n7n7110 n8n8n8n8111

33 I TOLD YOU CAUSALITY IS SIMPLE Formal basis for causal and counterfactual inference (complete) Unification of the graphical, potential-outcome and structural equation approaches Friendly and formal solutions to century-old problems and confusions. He is wise who bases causal inference on an explicit causal structure that is defensible on scientific grounds. (Aristotle B.C.) From Charlie Poole CONCLUSIONS

34 RAMIFICATIONS OF THE MEDIATION FORMULA Effect measures should not be defined in terms of ML estimates of regressional coefficients (e.g, polynomials, logistic, or probit,) but in terms of population distributions. The difference and product heuristics resemble TE-DE and IE, but: DE should be averaged over mediator levels, IE should NOT be averaged over exposure levels. TE-DE need not equal IE TE-DE = proportion for whom mediation is necessary IE = proportion for whom mediation is sufficient. TE-DE informs interventions on indirect pathways IE informs intervention on direct pathways.

35 They will be answered QUESTIONS???