From Propensity Scores And Mediation To External Validity

Slides:

Advertisements

Similar presentations

Advertisements

PDAs Accept Context-Free Languages

Angstrom Care 培苗社 Quadratic Equation II

AP STUDY SESSION 2.

STATISTICS Linear Statistical Models

STATISTICS HYPOTHESES TEST (I)

STATISTICS INTERVAL ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.

STATISTICS POINT ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.

David Burdett May 11, 2004 Package Binding for WS CDL.

Create an Application Title 1Y - Youth Chapter 5.

CHAPTER 18 The Ankle and Lower Leg

The 5S numbers game..

Department of Computer Science

A Fractional Order (Proportional and Derivative) Motion Controller Design for A Class of Second-order Systems Center for Self-Organizing Intelligent.

Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)

Media-Monitoring Final Report April - May 2010 News.

Biostatistics Unit 5 Samples Needs to be completed. 12/24/13.

Stationary Time Series

Break Time Remaining 10:00.

The basics for simulations

Factoring Quadratics — ax² + bx + c Topic

EE, NCKU Tien-Hao Chang (Darby Chang)

Turing Machines.

PP Test Review Sections 6-1 to 6-6

Regression with Panel Data

Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)

Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.

Biology 2 Plant Kingdom Identification Test Review.

Chapter 1: Expressions, Equations, & Inequalities

2.5 Using Linear Models Month Temp º F 70 º F 75 º F 78 º F.

Adding Up In Chunks.

MaK_Full ahead loaded 1 Alarm Page Directory (F11)

Facebook Pages 101: Your Organization’s Foothold on the Social Web A Volunteer Leader Webinar Sponsored by CACO December 1, 2010 Andrew Gossen, Senior.

1 Termination and shape-shifting heaps Byron Cook Microsoft Research, Cambridge Joint work with Josh Berdine, Dino Distefano, and.

Artificial Intelligence

When you see… Find the zeros You think….

Slide R - 1 Copyright © 2009 Pearson Education, Inc. Publishing as Pearson Prentice Hall Active Learning Lecture Slides For use with Classroom Response.

12 October, 2014 St Joseph's College ADVANCED HIGHER REVISION 1 ADVANCED HIGHER MATHS REVISION AND FORMULAE UNIT 2.

1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)

1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.

12 System of Linear Equations Case Study

Converting a Fraction to %

Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)

Clock will move after 1 minute

famous photographer Ara Guler famous photographer ARA GULER.

Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.

Physics for Scientists & Engineers, 3rd Edition

Select a time to count down from the clock above

16. Mean Square Estimation

Copyright Tim Morris/St Stephen's School

1.step PMIT start + initial project data input Concept Concept.

9. Two Functions of Two Random Variables

1 Dr. Scott Schaefer Least Squares Curves, Rational Representations, Splines and Continuity.

1 Decidability continued…. 2 Theorem: For a recursively enumerable language it is undecidable to determine whether is finite Proof: We will reduce the.

1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)

Chapter 4 FUGACITY.

1 WHAT'S NEW IN CAUSAL INFERENCE: From Propensity Scores And Mediation To External Validity Judea Pearl University of California Los Angeles (

1 THE SYMBIOTIC APPROACH TO CAUSAL INFERENCE Judea Pearl University of California Los Angeles (

1 CAUSAL INFERENCE: MATHEMATICAL FOUNDATIONS AND PRACTICAL APPLICATIONS Judea Pearl University of California Los Angeles (

1 WHAT'S NEW IN CAUSAL INFERENCE: From Propensity Scores And Mediation To External Validity Judea Pearl University of California Los Angeles (

1 REASONING WITH CAUSES AND COUNTERFACTUALS Judea Pearl UCLA (

Judea Pearl Computer Science Department UCLA DIRECT AND INDIRECT EFFECTS.

CAUSES AND COUNTERFACTIALS IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (

Chen Avin Ilya Shpitser Judea Pearl Computer Science Department UCLA

From Propensity Scores And Mediation To External Validity

Presentation transcript:

From Propensity Scores And Mediation To External Validity WHAT'S NEW IN CAUSAL INFERENCE: From Propensity Scores And Mediation To External Validity And Selection Bias Judea Pearl UCLA (www.cs.ucla.edu/~judea/)

OUTLINE Unified conceptualization of counterfactuals, structural-equations, and graphs Propensity scores demystified Direct and indirect effects (Mediation) External validity mathematized

TRADITIONAL STATISTICAL INFERENCE PARADIGM Data Inference Q(P) (Aspects of P) P Joint Distribution e.g., Infer whether customers who bought product A would also buy product B. Q = P(B | A)

THE STRUCTURAL MODEL PARADIGM Joint Distribution Data Generating Model Q(M) (Aspects of M) Data M Inference M – Invariant strategy (mechanism, recipe, law, protocol) by which Nature assigns values to variables in the analysis. “Think Nature, not experiment!”

ORACLE FOR MANIPILATION FAMILIAR CAUSAL MODEL ORACLE FOR MANIPILATION X Y Z Here is a causal model we all remember from high-school -- a circuit diagram. There are 4 interesting points to notice in this example: (1) It qualifies as a causal model -- because it contains the information to confirm or refute all action, counterfactual and explanatory sentences concerned with the operation of the circuit. For example, anyone can figure out what the output would be like if we set Y to zero, or if we change this OR gate to a NOR gate or if we perform any of the billions combinations of such actions. (2) Logical functions (Boolean input-output relation) is insufficient for answering such queries (3)These actions were not specified in advance, they do not have special names and they do not show up in the diagram. In fact, the great majority of the action queries that this circuit can answer have never been considered by the designer of this circuit. (4) So how does the circuit encode this extra information? Through two encoding tricks: 4.1 The symbolic units correspond to stable physical mechanisms (i.e., the logical gates) 4.2 Each variable has precisely one mechanism that determines its value. INPUT OUTPUT

STRUCTURAL CAUSAL MODELS Definition: A structural causal model is a 4-tuple V,U, F, P(u), where V = {V1,...,Vn} are endogeneas variables U = {U1,...,Um} are background variables F = {f1,..., fn} are functions determining V, vi = fi(v, u) P(u) is a distribution over U P(u) and F induce a distribution P(v) over observable variables e.g.,

CAUSAL MODELS AND COUNTERFACTUALS Definition: The sentence: “Y would be y (in situation u), had X been x,” denoted Yx(u) = y, means: The solution for Y in a mutilated model Mx, (i.e., the equations for X replaced by X = x) with input U=u, is equal to y. The Fundamental Equation of Counterfactuals:

READING COUNTERFACTUALS FROM SEM Data shows: a = 0.7, b = 0.5, g = 0.4 A student named Joe, measured X=0.5, Z=1, Y=1.9 Q1: What would Joe’s score be had he doubled his study time?

READING COUNTERFACTUALS Q1: What would Joe’s score be had he doubled his study time? Answer: Joe’s score would be 1.9 Or, In counterfactual notation:

READING COUNTERFACTUALS Q2: What would Joe’s score be, had the treatment been 0 and had he studied at whatever level he would have studied had the treatment been 1?

POTENTIAL AND OBSERVED OUTCOMES PREDICTED BY A STRUCTURAL MODEL

CAUSAL MODELS AND COUNTERFACTUALS Definition: The sentence: “Y would be y (in situation u), had X been x,” denoted Yx(u) = y, means: The solution for Y in a mutilated model Mx, (i.e., the equations for X replaced by X = x) with input U=u, is equal to y. Joint probabilities of counterfactuals: In particular:

THE FIVE NECESSARY STEPS OF CAUSAL ANALYSIS Define: Assume: Identify: Estimate: Test: Express the target quantity Q as a function Q(M) that can be computed from any model M. Formulate causal assumptions A using some formal language. Determine if Q is identifiable given A. Estimate Q if it is identifiable; approximate it, if it is not. Test the testable implications of A (if any).

THE FIVE NECESSARY STEPS OF CAUSAL ANALYSIS Define: Assume: Identify: Estimate: Test: Express the target quantity Q as a function Q(M) that can be computed from any model M. Formulate causal assumptions A using some formal language. Determine if Q is identifiable given A. Estimate Q if it is identifiable; approximate it, if it is not. Test the testable implications of A (if any).

THE LOGIC OF CAUSAL ANALYSIS CAUSAL MODEL (MA) CAUSAL MODEL (MA) A - CAUSAL ASSUMPTIONS A* - Logical implications of A Causal inference Q Queries of interest Q(P) - Identified estimands T(MA) - Testable implications Statistical inference Data (D) Q - Estimates of Q(P) Goodness of fit Provisional claims Model testing

IDENTIFICATION IN SCM G Z1 Z2 Z3 Z4 Z5 X Z6 Y Find the effect of X on Y, P(y|do(x)), given the causal assumptions shown in G, where Z1,..., Zk are auxiliary variables. G Z1 Z2 Z3 Z4 Z5 X Z6 Y Can P(y|do(x)) be estimated if only a subset, Z, can be measured?

ELIMINATING CONFOUNDING BIAS THE BACK-DOOR CRITERION P(y | do(x)) is estimable if there is a set Z of variables such that Z d-separates X from Y in Gx. G Gx Z1 Z1 Z2 Z2 Z Z3 Z3 Z4 Z5 Z5 Z4 X Z6 Y X Z6 Y Moreover, (“adjusting” for Z)  Ignorability

EFFECT OF WARM-UP ON INJURY (After Shrier & Platt, 2008) Watch out! ??? Front Door No, no! Warm-up Exercises (X) Injury (Y)

PROPENSITY SCORE ESTIMATOR (Rosenbaum & Rubin, 1983) Z1 Z2 P(y | do(x)) = ? L Z4 Z3 Z5 X Z6 Y Can L replace {Z1, Z2, Z3, Z4, Z5} ? Adjustment for L replaces Adjustment for Z Theorem:

WHAT PROPENSITY SCORE (PS) PRACTITIONERS NEED TO KNOW The asymptotic bias of PS is EQUAL to that of ordinary adjustment (for same Z). Including an additional covariate in the analysis CAN SPOIL the bias-reduction potential of others. X Y Z Z In particular, instrumental variables tend to amplify bias. Choosing sufficient set for PS, requires knowledge of the model.

Instrumental variables are Bias-Amplifiers in linear SURPRISING RESULT: Instrumental variables are Bias-Amplifiers in linear models (Bhattarcharya & Vogt 2007; Wooldridge 2009) Z U (Unobserved) c3 c1 c2 “Naive” bias Adjusted bias X c0 Y

When Z is allowed to vary, it absorbs (or explains) INTUTION: When Z is allowed to vary, it absorbs (or explains) some of the changes in X. U c1 X Y Z c2 c3 c0 U c1 X Y Z c2 c3 c0 When Z is fixed the burden falls on U alone, and transmitted to Y (resulting in a higher bias) Z U c3 c1 c2 X c0 Y

WHAT’S BETWEEN AN INSTRUMENT AND A CONFOUNDER? Should we adjust for Z? Y X ANSWER: CONCLUSION: Yes, if No, otherwise Adjusting for a parent of Y is safer than a parent of X

WHICH SET TO ADJUST FOR Should we adjust for {T}, {Z}, or {T, Z}? Answer 1: (From bias-amplification considerations) {T} is better than {T, Z} which is the same as {Z} Answer 2: (From variance considerations) {T} is better than {T, Z} which is better than {Z} Z T X Y

CONCLUSIONS The prevailing practice of adjusting for all covariates, especially those that are good predictors of X (the “treatment assignment,” Rubin, 2009) is totally misguided. The “outcome mechanism” is as important, and much safer, from both bias and variance viewpoints As X-rays are to the surgeon, graphs are for causation

REGRESSION VS. STRUCTURAL EQUATIONS (THE CONFUSION OF THE CENTURY) Regression (claimless, nonfalsifiable): Y = ax + Y Structural (empirical, falsifiable): Y = bx + uY Claim: (regardless of distributions): E(Y | do(x)) = E(Y | do(x), do(z)) = bx  The mothers of all questions: Q. When would b equal a? A. When all back-door paths are blocked, (uY X) Q. When is b estimable by regression methods? A. Graphical criteria available

TWO PARADIGMS FOR CAUSAL INFERENCE Observed: P(X, Y, Z,...) Conclusions needed: P(Yx=y), P(Xy=x | Z=z)... How do we connect observables, X,Y,Z,… to counterfactuals Yx, Xz, Zy,… ? N-R model Counterfactuals are primitives, new variables Super-distribution Structural model Counterfactuals are derived quantities Subscripts modify the model and distribution

“SUPER” DISTRIBUTION IN N-R MODEL X Y Z Yx=0 Yx=1 Xz=0 Xz=1 Xy=0 U 1 Y 1 Z 1 Yx=0 1 Yx=1 1 Xz=0 1 Xz=1 1 Xy=0 0 1 U u1 u2 u3 u4 inconsistency: x = 0  Yx=0 = Y Y = xY1 + (1-x) Y0

ARE THE TWO PARADIGMS EQUIVALENT? Yes (Galles and Pearl, 1998; Halpern 1998) In the N-R paradigm, Yx is defined by consistency: In SCM, consistency is a theorem. Moreover, a theorem in one approach is a theorem in the other. Difference: Clarity of assumptions and their implications

AXIOMS OF STRUCTURAL COUNTERFACTUALS Yx(u)=y: Y would be y, had X been x (in state U = u) (Galles, Pearl, Halpern, 1998): Definiteness Uniqueness Effectiveness Composition (generalized consistency) Reversibility

FORMULATING ASSUMPTIONS THREE LANGUAGES 1. English: Smoking (X), Cancer (Y), Tar (Z), Genotypes (U) X Y Z U 2. Counterfactuals: Z X Y 3. Structural:

COMPARISON BETWEEN THE N-R AND SCM LANGUAGES Expressing scientific knowledge Recognizing the testable implications of one's assumptions Locating instrumental variables in a system of equations Deciding if two models are equivalent or nested Deciding if two counterfactuals are independent given another Algebraic derivations of identifiable estimands

GRAPHICAL – COUNTERFACTUALS SYMBIOSIS Every causal graph expresses counterfactuals assumptions, e.g., X  Y  Z Missing arrows Y  Z 2. Missing arcs Y Z consistent, and readable from the graph. Express assumption in graphs Derive estimands by graphical or algebraic methods

(direct vs. indirect effects) EFFECT DECOMPOSITION (direct vs. indirect effects) Why decompose effects? What is the definition of direct and indirect effects? What are the policy implications of direct and indirect effects? When can direct and indirect effect be estimated consistently from experimental and nonexperimental data?

WHY DECOMPOSE EFFECTS? To understand how Nature works To comply with legal requirements To predict the effects of new type of interventions: Signal routing, rather than variable fixing

LEGAL IMPLICATIONS OF DIRECT EFFECT X Z Y Can data prove an employer guilty of hiring discrimination? (Gender) X Z (Qualifications) Y (Hiring) What is the direct effect of X on Y ? (averaged over z) Adjust for Z? No! No!

FISHER’S GRAVE MISTAKE (after Rubin, 2005) What is the direct effect of treatment on yield? (Soil treatment) X Z (Plant density) (Latent factor) Y (Yield) Compare treated and untreated lots of same density No! No! Proposed solution (?): “Principal strata”

NATURAL INTERPRETATION OF AVERAGE DIRECT EFFECTS Robins and Greenland (1992) – “Pure” X Z z = f (x, u) y = g (x, z, u) Y Natural Direct Effect of X on Y: The expected change in Y, when we change X from x0 to x1 and, for each u, we keep Z constant at whatever value it attained before the change. In linear models, DE = Natural Direct Effect

DEFINITION AND IDENTIFICATION OF NESTED COUNTERFACTUALS Consider the quantity Given M, P(u), Q is well defined Given u, Zx*(u) is the solution for Z in Mx*, call it z is the solution for Y in Mxz Can Q be estimated from data? Experimental: nest-free expression Nonexperimental: subscript-free expression

DEFINITION OF INDIRECT EFFECTS X Z z = f (x, u) y = g (x, z, u) Y No Controlled Indirect Effect Indirect Effect of X on Y: The expected change in Y when we keep X constant, say at x0, and let Z change to whatever value it would have attained had X changed to x1. In linear models, IE = TE - DE

POLICY IMPLICATIONS OF INDIRECT EFFECTS What is the indirect effect of X on Y? The effect of Gender on Hiring if sex discrimination is eliminated. GENDER QUALIFICATION HIRING X Z IGNORE f Y Deactivating a link – a new type of intervention

MEDIATION FORMULAS The natural direct and indirect effects are identifiable in Markovian models (no confounding), And are given by: Applicable to linear and non-linear models, continuous and discrete variables, regardless of distributional form.

Z g xz Linear + interaction m1 m2 X Y In linear systems

IN UNCONFOUNDED MODELS MEDIATION FORMULAS IN UNCONFOUNDED MODELS X Z Y

Z m1 m2 X Y In linear systems Disabling mediation direct path DE TE - DE TE IE Is NOT equal to:

MEDIATION FORMULA FOR BINARY VARIABLES X Z Y E(Y|x,z)=gxz E(Z|x)=hx n1 n2 1 n3 n4 n5 n6 n7 n8

RAMIFICATION OF THE MEDIATION FORMULA DE should be averaged over mediator levels, IE should NOT be averaged over exposure levels. TE-DE need not equal IE TE-DE = proportion for whom mediation is necessary IE = proportion for whom mediation is sufficient TE-DE informs interventions on indirect pathways IE informs intervention on direct pathways.

TRANSPORTABILITY -- WHEN CAN WE EXTRAPOLATE EXPERIMENTAL FINDINGS TO DIFFERENT POPULATIONS? X Y Z = age X Y Z = age Experimental study in LA Measured: Problem: We find (LA population is younger) What can we say about Intuition: Observational study in NYC Measured:

TRANSPORT FORMULAS DEPEND ON THE STORY X Y Z (a) X Y Z (b) X Y (c) Z a) Z represents age b) Z represents language skill c) Z represents a bio-marker

(Pearl and Bareinboim, 2010) TRANSPORTABILITY (Pearl and Bareinboim, 2010) Definition 1 (Transportability) Given two populations, denoted  and *, characterized by models M = <F,V,U> and M* = <F,V,U+S>, respectively, a causal relation R is said to be transportable from  to * if 1. R() is estimable from the set I of interventional studies on , and 2. R(*) is identified from I, P*, G, and G + S. S = external factors responsible for M  M*

TRANSPORT FORMULAS DEPEND ON THE STORY X Y Z (a) S X Y Z (b) S X Y (c) Z S a) Z represents age b) Z represents language skill c) Z represents a bio-marker ? ?

WHICH MODEL LICENSES THE TRANSPORT OF THE CAUSAL EFFECT Y X S (b) Y X S (c) X Y Z S X Y (d) Z S W X Y (f) Z S S S S X X X W W W Z Z Z Y Y Y (e)

DETERMINE IF THE CAUSAL EFFECT IS TRANSPORTABLE V What measurements need to be taken in the study and in the target population? T S X W Z Y The transport formula

SURROGATE ENDPOINTS – A CAUSAL PERSPECTIVE The problem: Infer effects of (randomized) treatment (X) on outcome (Y) from measurements taken on a surrogate variable (Z), which is more readily measurable, and is sufficiently well correlated with the first (Ellenberg and Hamilton 1989). Prentice 1989: "strong correlation is not sufficient," there should be "no pathways that bypass the surrogate" (2005). 1989-2011 - Everyone agrees that correlation is not sufficient, and no one explains why.

WHY STRONG CORRELATION IS NOT SUFFICIENT FOR SURROGACY Joffe and Green (2009): “A surrogate outcome is an outcome for which knowing the effect of treatment on the surrogate allows prediction of the effect of treatment on the more clinically relevant outcome.” Two effects = Two experiments conducted under two different conditions. Surrogacy = “Strong correlation,” + robustness to the new conditions. New condition = Interventions to change the surrogate Z.

WHO IS A GOOD SURROGATE? S Y (c) Z X Y (a) Z X Y (b) Z S X S S S S X Y (f) Z U W X (d) Z Y X Y (e) Z

CORRELATIONS AND ROBUSTNESS SURROGACY: CORRELATIONS AND ROBUSTNESS Definition (Pearl and Bareinboim, 2011): A variable Z is said to be a surrogate endpoint relative the effect of X on Y if and only if: P(y|do(x), z) is highly sensitive to Z in the experimental study, and P(y|do(x), z, s) = P(y|do(x), z, s ) where S is a selection variable added to G and directed towards Z. In words, the causal effect of X on Y can be reliably predicted from measurements of Z, regardless of the mechanism responsible for variations in Z.

SURROGACY: A GRAPHICAL CRITERION Z and X d-separate S from Y. S Y (c) (b) Z X S S S S S X Y (f) Z U W X (d) Z Y X Y (e) Z

THE ORIGIN OF SELECTION BIAS: UX UY c0 X Y b1 b2 S No selection bias Selection bias activated by a virtual collider Selection bias activated by both a virtual collider and real collider

CONTROLLING SELECTION BIAS BY ADJUSTMENT UY X Y S2 S3 S1 Can be eliminated by randomization or adjustment

CONTROLLING SELECTION BIAS BY ADJUSTMENT UY X Y S2 S3 S1 Cannot be eliminated by randomization, requires adjustment for U2

CONTROLLING SELECTION BIAS BY ADJUSTMENT UY X Y S2 S3 S1 Cannot be eliminated by randomization or adjustment

CONTROLLING SELECTION BIAS BY ADJUSTMENT UX UY c0 X Y b1 b2 S Cannot be eliminated by adjustment or by randomization

CONTROLLING BY ADJUSTMENT MAY REQUIRE EXTERNAL INFORMATION UY X Y S2 S3 S1 Adjustment for U2 gives If all we have is P(u2 | S2 = 1), not P(u2), then only the U2-specific effect is recoverable

WHEN WOULD THE ODDS RATIO BE RECOVERABLE? (a) (b) (c) Y X Z S Y X W C OR is recoverable, despite the virtual collider at Y. whenever (X Y | Y,Z)G or (Y S | X , Z)G, giving: OR (X,Y | S = 1) = OR(X,Y) (Cornfield 1951, Wittemore 1978; Geng 1992) (b) the Z-specific OR is recoverable, but is meaningless. OR (X ,Y | Z) = OR (X,Y | Z, S = 1) (c) the C-specific OR is recoverable, which is meaningful. OR (X ,Y | C) = OR (X,Y | W,C, S = 1)

GRAPHICAL CONDITION FOR ODDS-RATIO RECOVERABILITY Theorem (Bareinboim and Pearl, 2011) Let graph G contain the arrow X  Y and a selection node S. A necessary condition for G to permit the G-recoverability of OR(Y,X | C) for some set C of pre-treatment covariates is that every ancestor Ai of S that is also a descendant of X have a separating set Ti that either d-separates Ai from X given Y, or d-separates Ai from Y given X. Moreover, For every C s.t. where

EXAMPLES OF ODDS RATIO RECOVERABILITY X Y S Y X S {W4, W3, W1} Separator X Y W3 W4 S Y X W2 W1 W4 W3 W2 W1 {W1, W2, W4} S {W4, W3, W1} Recoverable Non-recoverable

EXAMPLES OF ODDS RATIO RECOVERABILITY X Y S Y X S Separator W3 W4 W2 Recoverable Non-recoverable

EXAMPLES OF ODDS RATIO RECOVERABILITY X Y S Y X S Separator W3 W4 W2 W2 W2 W1 W1 S Recoverable Non-recoverable

EXAMPLES OF ODDS RATIO RECOVERABILITY X Y S Y X S Separator W3 W4 W2 Recoverable Non-recoverable

EXAMPLES OF ODDS RATIO RECOVERABILITY X Y S Y X S Separator W3 W4 W2 W4 W3 W2 W2 W1 S Recoverable Non-recoverable

CONCLUSIONS I TOLD YOU CAUSALITY IS SIMPLE Formal basis for causal and counterfactual inference (complete) Unification of the graphical, potential-outcome and structural equation approaches Friendly and formal solutions to century-old problems and confusions. No other method can do better (theorem)

Thank you for agreeing with everything I said.