Judea Pearl Computer Science Department UCLA www.cs.ucla.edu/~judea DIRECT AND INDIRECT EFFECTS.

Slides:



Advertisements
Similar presentations
Department of Computer Science
Advertisements

From Propensity Scores And Mediation To External Validity
Introduction Simple Random Sampling Stratified Random Sampling
Holland on Rubin’s Model Part II. Formalizing These Intuitions. In the 1920 ’ s and 30 ’ s Jerzy Neyman, a Polish statistician, developed a mathematical.
1 WHAT'S NEW IN CAUSAL INFERENCE: From Propensity Scores And Mediation To External Validity Judea Pearl University of California Los Angeles (
ASSESSING CAUSAL QUANTITIES FROM EXPERIMENTAL AND NONEXPERIMENTAL DATA Judea Pearl Computer Science and Statistics UCLA
Omitted Variable Bias Methods of Economic Investigation Lecture 7 1.
CAUSES AND COUNTERFACTUALS Judea Pearl University of California Los Angeles (
1 THE SYMBIOTIC APPROACH TO CAUSAL INFERENCE Judea Pearl University of California Los Angeles (
THE MATHEMATICS OF CAUSAL MODELING Judea Pearl Department of Computer Science UCLA.
COMMENTS ON By Judea Pearl (UCLA). notation 1990’s Artificial Intelligence Hoover.
Positive Semantics of Projections in Venn-Euler Diagrams Joseph Gil – Technion Elena Tulchinsky – Technion.
Judea Pearl University of California Los Angeles CAUSAL REASONING FOR DECISION AIDING SYSTEMS.
Introduction to Management Science
Education 795 Class Notes Non-Experimental Designs ANCOVA Note set 5.
1 CAUSAL INFERENCE: MATHEMATICAL FOUNDATIONS AND PRACTICAL APPLICATIONS Judea Pearl University of California Los Angeles (
SIMPSON’S PARADOX, ACTIONS, DECISIONS, AND FREE WILL Judea Pearl UCLA
CAUSES AND COUNTERFACTUALS OR THE SUBTLE WISDOM OF BRAINLESS ROBOTS.
1 WHAT'S NEW IN CAUSAL INFERENCE: From Propensity Scores And Mediation To External Validity Judea Pearl University of California Los Angeles (
Solve Systems of Equations By Graphing
Accountability through Information Flow Experiments Michael Carl Tschantz UC Berkeley Amit Datta, CMU Anupam Datta, CMU Jeannette M. Wing, MSR
CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (
1 REASONING WITH CAUSES AND COUNTERFACTUALS Judea Pearl UCLA (
Sampling Theory Determining the distribution of Sample statistics.
Flows and Networks Plan for today (lecture 5): Last time / Questions? Blocking of transitions Kelly / Whittle network Optimal design of a Kelly / Whittle.
Judea Pearl University of California Los Angeles ( THE MATHEMATICS OF CAUSE AND EFFECT.
Section 4.1 Vectors in ℝ n. ℝ n Vectors Vector addition Scalar multiplication.
Judea Pearl University of California Los Angeles ( THE MATHEMATICS OF CAUSE AND EFFECT.
Statistics and Quantitative Analysis U4320 Segment 12: Extension of Multiple Regression Analysis Prof. Sharyn O’Halloran.
SIMPSON’S PARADOX, ACTIONS, DECISIONS, AND FREE WILL Judea Pearl UCLA
MA Day 25- February 11, 2013 Review of last week’s material Section 11.5: The Chain Rule Section 11.6: The Directional Derivative.
Networks of Queues Plan for today (lecture 6): Last time / Questions? Product form preserving blocking Interpretation traffic equations Kelly / Whittle.
Statistical Applications Binominal and Poisson’s Probability distributions E ( x ) =  =  xf ( x )
REASONING WITH CAUSE AND EFFECT Judea Pearl Department of Computer Science UCLA.
GG 313 Geological Data Analysis Lecture 13 Solution of Simultaneous Equations October 4, 2005.
Wage Discrimination: MBAs Powell chapter in Moe book. Reviews theories of discrimination arising from prejudice: –employers –fellow employees –customers.
CAUSES AND COUNTERFACTIALS IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (
REASONING WITH CAUSE AND EFFECT Judea Pearl Department of Computer Science UCLA.
§ 3.6 Solving Quadratic Equations by Factoring. Martin-Gay, Developmental Mathematics 2 Zero Factor Theorem Quadratic Equations Can be written in the.
AUTOMATIC CONTROL THEORY II Slovak University of Technology Faculty of Material Science and Technology in Trnava.
THE MATHEMATICS OF CAUSE AND COUNTERFACTUALS Judea Pearl University of California Los Angeles (
Chapter 8: Simple Linear Regression Yang Zhenlin.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Impact Evaluation Sebastian Galiani November 2006 Causal Inference.
1 Management Science. 2 Chapter Topics The Management Science Approach to Problem Solving Model Building: Break-Even Analysis Computer Solution Management.
Applying Causal Inference Methods to Improve Identification of Health and Healthcare Disparities, and the Underlying Mediators and Moderators of Disparities.
Solving Quadratic Equations by Factoring. Martin-Gay, Developmental Mathematics 2 Zero Factor Theorem Quadratic Equations Can be written in the form ax.
Getting a Job. Definitions Self-employed: people who are not employees since they work for themselves. Employer: is a person or a company that hires one.
Judea Pearl Computer Science Department UCLA ROBUSTNESS OF CAUSAL CLAIMS.
CAUSAL REASONING FOR DECISION AIDING SYSTEMS COGNITIVE SYSTEMS LABORATORY UCLA Judea Pearl, Mark Hopkins, Blai Bonet, Chen Avin, Ilya Shpitser.
Key Concepts for Sect. 7.1 *A system of equations is two or more equations in two or more variables. *Numerically, a solution to a system of equations.
Mediation: The Causal Inference Approach David A. Kenny.
1 CONFOUNDING EQUIVALENCE Judea Pearl – UCLA, USA Azaria Paz – Technion, Israel (
1 BN Semantics 2 – Representation Theorem The revenge of d-separation Graphical Models – Carlos Guestrin Carnegie Mellon University September 17.
A Mechanism-Based Approach to the Identification of Age, Period, Cohort Models Christopher Winship and David J. Harding June 2008.
Sampling Theory Determining the distribution of Sample statistics.
Gender discrimination
Chapter 4 Basic Estimation Techniques
Department of Computer Science
Chen Avin Ilya Shpitser Judea Pearl Computer Science Department UCLA
Department of Computer Science
A MACHINE LEARNING EXERCISE
University of California
Computer Science and Statistics
CAUSAL INFERENCE IN STATISTICS
From Propensity Scores And Mediation To External Validity
THE MATHEMATICS OF PROGRAM EVALUATION
Department of Computer Science
CAUSAL REASONING FOR DECISION AIDING SYSTEMS
Mathematical Expectation
Presentation transcript:

Judea Pearl Computer Science Department UCLA DIRECT AND INDIRECT EFFECTS

QUESTIONS ASKED Why decompose effects? What is the semantics of direct and indirect effects? What are the policy implications of direct and indirect effects? Can path-analytic techniques be extended to nonlinear and nonparametric models? When can direct and indirect effect be estimated consistently from experimental and nonexperimental data.

1.Direct (or indirect) effect may be more transportable. 2.Indirect effects may be prevented or controlled. 3.Direct (or indirect) effect may be forbidden WHY DECOMPOSE EFFECTS? Pill Thrombosis Pregnancy +  + Gender Hiring Qualification

EFFECT-DECOMPOSITION IN LINEAR MODELS XZ Y c a b abc Definition:

COUNTERFACTUALS: STRUCTURAL SEMANTICS Notation: Y x (u) = y Abbreviation: y x Formal: Y has the value y in the solution to a mutilated system of equations, where the equation for X is replaced by a constant X=x. u Y x (u)=y Z W X=x u Y Z W X Probability of Counterfactuals: Functional Bayes Net

TOTAL, DIRECT, AND INDIRECT EFFECTS HAVE SIMPLE SEMANTICS IN LINEAR MODELS XZ Y c a b z = bx +  1 y = ax + cz +  2 a + bc bc a

z = f (x,  1 ) y = g (x, z,  2 ) XZ Y SEMANTICS BECOMES NONTRIVIAL IN NONLINEAR MODELS (even when the model is completely specified) Dependent on z ? Void of operational meaning?

Indirect Effect? NEED OF FORMALIZATION XZ AND Y = What is the direct effect of X on Y?

Starting from X=0, (and Z=0 and Y=0) Total Effect: Change X from 0 to 1, and test the change in Y. Controlled DE: Keep Z constant at Z=0, or Z=1, and change X=0 to X=1. Controlled IE: None. Natural DE: Keep Z constant at its current value, and change X to 1. Natural IE: Keep X at 0, but set Z to what it would be if X were 1. TWO CONCEPTIONS OF DIRECT AND INDIRECT EFFECTS: Controlled vs. Natural XZ AND Y =

``The central question in any employment-discrimination case is whether the employer would have taken the same action had the employee been of different race (age, sex, religion, national origin etc.) and everything else had been the same’’ [ Carson versus Bethlehem Steel Corp. (70 FEP Cases 921, 7 th Cir. (1996))] x = male, x = female y = hire, y = not hire z = applicant’s qualifications LEGAL DEFINITIONS TAKE THE NATURAL CONCEPTION (FORMALIZING DISCRIMINATION) NO DIRECT EFFECT

Starting from X=x *, (and Z=Z x* (u) and Y= Y x* (u)) Total Effect: TE(x,x * ;Y) = E(Y x ) – E(Y x* ) Controlled DE: CDE Z (x,x * ;Y) = E(Y xz ) – E(Y x*z ) Controlled IE: None. Natural DE: NDE(x,x * ;Y) = E(Y xZ x* ) – E(Y x* ) Natural IE: NIE(x,x * ;Y) = E(Y x*Z x ) – E(Y x* ) TWO CONCEPTIONS OF AVERAGE DIRECT AND INDIRECT EFFECTS: POPULATION-LEVEL DEFINITIONS XZ y = f (x,z,u) Y u2u2 u3u3 u1u1 Probabilistic causal model:  P(u)M, (all other parents of Y)

z = f (x,  1 ) y = g (x, z,  2 ) XZ Y THE OPERATIONAL MEANING OF AVERAGE DIRECT EFFECTS “Natural” Direct Effect of X on Y: The expected change in Y per unit change of X, when we keep Z constant at whatever value it attains before the change. In linear models, NDE = Controlled Direct Effect

POLICY IMPLICATIONS (Who cares?) f GENDERQUALIFICATION HIRING What is the direct effect of X on Y? The effect of Gender on Hiring if sex discrimination is eliminated. indirect XZ Y IGNORE

z = f (x,  1 ) y = g (x, z,  2 ) XZ Y THE OPERATIONAL MEANING OF INDIRECT EFFECTS “Natural” Indirect Effect of X on Y: The expected change in Y when we keep X constant, say at x 0, and let Z change to whatever value it would have under a unit change in X. In linear models, NIE = TE - DE

Example: Theorem: If there exists a set W such that GRAPHICAL CONDITION FOR EXPERIMENTAL IDENTIFICATION OF AVERAGE NATURAL DIRECT EFFECTS

HOW THE PROOF GOES? Proof: Each factor is identifiable by experimentation.

GRAPHICAL CRITERION FOR COUNTERFACTUAL INDEPENDENCE U3U3 U1U1 X Y Z U2U2 U3U3 U1U1 XZ Y U2U2 U3U3 U1U1 X Y U2U2 Z

GRAPHICAL CONDITION FOR NONEXPERIMENTAL IDENTIFICATION OF AVERAGE NATURAL DIRECT EFFECTS Identification conditions 1.There exists a W such that (Y Z | W) G XZ 2.There exist additional covariates that render all counterfactual terms identifiable.

Corollary 3: The average natural direct effect in Markovian models is identifiable from nonexperimental data, and it is given by where S stands for all parents of X (or another sufficient set). IDENTIFICATION IN MARKOVIAN MODELS X Z Example: S =  Y

How effective would the drug be if we eliminate its side-effect (Headache)? POLICY QUESTION ANSWERED BY NATURAL DIRECT EFFECT Outcome Drug Aspirin X Y Z W Headache

NIE(x,x * ;Y) = Expected increase in sales, if we bluff the competitor into believing that X is about to change from x * to x. For Markovian models: POLICY-BASED INTERPRETATION OF INDIRECT EFFECTS X Y Z (Sales) (Advertisement Budget) (Competitor’s Budget)

Theorem 5: The total, direct and indirect effects obey The following equality In words, the total effect (on Y) associated with the transition from x * to x is equal to the difference between the direct effect associated with this transition and the indirect effect associated with the reverse transition, from x to x *. RELATIONS BETWEEN TOTAL, DIRECT, AND INDIRECT EFFECTS

Y Z X W x*x* z * = Z x* (u) Nonidentifiable even in Markovian models GENERAL PATH-SPECIFIC EFFECTS (Def.) Y Z X W Form a new model,, specific to active subgraph g Definition: g -specific effect

SUMMARY OF RESULTS 1.New formulation of path-specific effects, based on signal blocking, instead of value fixing. 2.Path-analytic techniques extended to nonlinear and nonparametric models. 3.Conditions for estimating direct and indirect effects from experimental and nonexperimental data. 4.Estimability conditions hold in Markovian models. 5.Graphical techniques of inferring effects of nonstandard policies, involving signal blocking.