SIMPSON’S PARADOX, ACTIONS, DECISIONS, AND FREE WILL Judea Pearl UCLA www.cs.ucla.edu/~judea.

Slides:



Advertisements
Similar presentations
CAN I KILL MY YOUNGER SELF? Time Travel and the Retro-Suicide Paradox Peter B. M. Vranas The University of Michigan 15 September 2000.
Advertisements

The World Bank Human Development Network Spanish Impact Evaluation Fund.
Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Section 1.3 Experimental Design © 2012 Pearson Education, Inc. All rights reserved. 1 of 61.
Section 1.3 Experimental Design.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Tests for Homogeneity.
COMMENTS ON By Judea Pearl (UCLA). notation 1990’s Artificial Intelligence Hoover.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 18, Slide 1 Chapter 18 Confidence Intervals for Proportions.
Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
 Once you know the correlation coefficient for your sample, you might want to determine whether this correlation occurred by chance.  Or does the relationship.
8 TECHNIQUES OF INTEGRATION. In defining a definite integral, we dealt with a function f defined on a finite interval [a, b] and we assumed that f does.
Chapter 51 Experiments, Good and Bad. Chapter 52 Experimentation u An experiment is the process of subjecting experimental units to treatments and observing.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Richard M. Jacobs, OSA, Ph.D.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Overview Definition Hypothesis
Hypothesis testing – mean differences between populations
CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (
Judea Pearl Computer Science Department UCLA DIRECT AND INDIRECT EFFECTS.
Judea Pearl University of California Los Angeles ( THE MATHEMATICS OF CAUSE AND EFFECT.
Chapter 2 -Test for one and two means
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Chapter 8 Introduction to Hypothesis Testing
SIMPSON’S PARADOX, ACTIONS, DECISIONS, AND FREE WILL Judea Pearl UCLA
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Chapter 20 Testing hypotheses about proportions
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 20 Testing Hypotheses About Proportions.
V13: Causality Aims: (1) understand the causal relationships between the variables of a network (2) interpret a Bayesian network as a causal model whose.
Mediation: Solutions to Assumption Violation
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
Copyright © 2010 Pearson Education, Inc. Slide Beware: Lots of hidden slides!
Copyright © 2010 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
CAUSES AND COUNTERFACTIALS IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (
Copyright © 2009 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
SIMPSON’S PARADOX Any statistical relationship between two variables may be reversed by including additional factors in the analysis. Application: The.
Feb. 13 Chapter 12, Try 1-9 Read Ch. 15 for next Monday No meeting Friday.
Tuesday, April 8 n Inferential statistics – Part 2 n Hypothesis testing n Statistical significance n continued….
Significance Tests Martin Bland Professor of Health Statistics University of York
© Copyright McGraw-Hill 2004
01/20151 EPI 5344: Survival Analysis in Epidemiology Confounding and Effect Modification March 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
Section 1.3 Experimental Design.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 2 – Slide 1 of 21 Chapter 1 Section 2 Observational Studies, Experiments,
C HAPTER 4  Hypothesis Testing -Test for one and two means -Test for one and two proportions.
Judea Pearl Computer Science Department UCLA ROBUSTNESS OF CAUSAL CLAIMS.
CAUSAL REASONING FOR DECISION AIDING SYSTEMS COGNITIVE SYSTEMS LABORATORY UCLA Judea Pearl, Mark Hopkins, Blai Bonet, Chen Avin, Ilya Shpitser.
Descriptive and Inferential Statistics Descriptive statistics The science of describing distributions of samples or populations Inferential statistics.
1 CONFOUNDING EQUIVALENCE Judea Pearl – UCLA, USA Azaria Paz – Technion, Israel (
Copyright © 2009 Pearson Education, Inc. Chapter 11 Understanding Randomness.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
Variable selection in Regression modelling Simon Thornley.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
SECTION 1 TEST OF A SINGLE PROPORTION
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Displaying and Describing Categorical Data
Lecture #8 Thursday, September 15, 2016 Textbook: Section 4.4
Statistics 200 Lecture #9 Tuesday, September 20, 2016
Two-Sample Hypothesis Testing
Sampling Population: The overall group to which the research findings are intended to apply Sampling frame: A list that contains every “element” or.
Testing Hypotheses About Proportions
Two-Way Tables and The Chi-Square Test*
AP Statistics Chapter 3 Part 3
Displaying and Describing Categorical Data
Risk ratios 12/6/ : Risk Ratios 12/6/2018 Risk ratios StatPrimer.
Testing Hypotheses About Proportions
Methods of Psychological Research
Chapter 4: More on Two-Variable Data
Presentation transcript:

SIMPSON’S PARADOX, ACTIONS, DECISIONS, AND FREE WILL Judea Pearl UCLA

SIMPSON’S PARADOX (Pearson et al. 1899; Yule 1903; Simpson 1951) Any statistical relationship between two variables may be reversed by including additional factors in the analysis.Any statistical relationship between two variables may be reversed by including additional factors in the analysis. Application: The adjustment problem Which factors should be included in the analysis.Which factors should be included in the analysis.

SIMPSON’S PARADOX (1951 – 1994) M R R T T M R R T T R R T T = 0.6 < < > 0.4 T – Treated T – Not treated R – Recovered R – Dead M – Males M – Females Easy question ( ) When / why the reversal? Harder questions (1994) Is the treatment useful? Which table to consult? Why is Simpson’s reversal a paradox?

SIMPSON’S REVERSAL Pr(recovery | drug, male) < Pr(recovery | no-drug, male) Pr(recovery | drug, female) < Pr(recovery | no-drug, female) Group behavior: Overall behavior: Pr(recovery | drug) > Pr(recovery | no-drug)

TO ADJUST OR NOT TO ADJUST? Treatment Recovery Gender T F Mediating factor F Treatment T R Recovery R Solution: Adjust iff F blocks all back-door paths

TWO PROOFS: 1. 2.When two causal models generate the same statistical data and in one we decide to use the drug yet in the other not to use it, our decision must be driven by causal and not by statistical considerations. Thus, there is no statistical criterion to warn us against consulting the wrong table. Q.Can Temporal information help? A. No!, see Figure (c). F = Car Type THE INEVITABLE CONCLUSION: THE PARADOX STEMS FROM CAUSAL INTERPRETATION 1.Surprise surfaces only when we speak about “efficacy,” not about evidence for recovery.

WHY TEMPORAL INFORMATION DOES NOT HELP In (c), F may occur before or after T, and the correct answer is to consult the combined table. In (d), F may occur before or after T, and the correct answer is to consult the F -specific tables. Treatment Recovery Gender T F R Treatment Recovery Blood Pressure Treatment Recovery T F R T F R T F R (a)(b)(c)(d) Car

WHY “CAR TYPE” SHOULD NOT MATTER Listening to the Bell creates association between Treatment and Recovery though Treatment does not affect Recovery Treatment efficacy is distorted in the Bell-specific tables. Treatment efficacy is properly represented in the combined table. TreatmentRecovery Coin-1Coin-2 Bell

THE SOLUTION OF THE ADJUSTMENT PROBLEM P(y | do(x)) is estimable if there is a set Z of variables such that Z d -separates X from Y in G x. Z6Z6 Z3Z3 Z2Z2 Z5Z5 Z1Z1 X Y Z4Z4 Z6Z6 Z3Z3 Z2Z2 Z5Z5 Z1Z1 X Y Z4Z4 Z Moreover, P(y | do(x)) =   P(y | x,z) P(z) (“adjusting” for Z ) z GxGx G

WHY TEMPORAL INFORMATION DOES NOT HELP In (c), F may occur before or after T, and the correct answer is to consult the combined table. In (d), F may occur before or after T, and the correct answer is to consult the F -specific tables. Treatment Recovery Gender T F R Treatment Recovery Blood Pressure Treatment Recovery T F R T F R T F R (a)(b)(c)(d) Car

WHY SIMPSON’S PARADOX EVOKES SURPRISE 1.People think causes, not proportions. 2."Reversal" is possible in the calculus of proportions but impossible in the calculus of causes.

CAUSAL CALCULUS PROHIBITS REVERSAL Pr(recovery | drug) < Pr(recovery | no-drug) Pr (male | do{drug}) = Pr (male | do{no-drug}) Pr(recovery | drug, male) < Pr(recovery | no-drug, male) Pr(recovery | drug, female) < Pr(recovery | no-drug, female) Group behavior: Entailed overall behavior: Assumption: do{drug}do{no-drug} do{drug}do{no-drug} do{drug}do{no-drug}

THE SURE THING PRINCIPLE Theorem An action A that increases the probability of an event E in each subpopulation must also increase the probability of E in the population as a whole, provided that the action does not change the distribution of the subpopulations.

Female 0.5 Male 0.5 TTTT R 0.6 R 0.2 R 0.7 R 0.3 RR 0.7 RR < < 0.15 T F Gender R (a) DECISION TREE FOR F = Gender F

-HBP 0.75 R 0.6 R 0.7 R 0.2 E 0.3 R 0.4 -E 0.7 R 0.3 R T F Blood Pressure R (b) TT HBP HBP 0.25 HBP 0.75 > DECISION TREE FOR F = Blood Pressure FF

R 0.5 R 0.4 R 0.5 T F Blood Pressure R (b) TT R > 0.4 COMPRESSED TREE FOR F = Blood Pressure

WHAT DECISION TREE SHOULD WE BUILD FOR F = Car ? Treatment Recovery T F R Car (c) ??? Should F = Car be handled like F = Gender or F = Blood Pressure ???

Cheap 0.5 Expensive 0.5 TTTT R 0.2 R 0.7 R 0.3 RR 0.7 RR < < 0.15 TEMPORAL ORDER IS DECEPTIVE (c) Treatment Recovery T F R Car F R 0.6 Wrong 0.3

R 0.5 R 0.4 R 0.5 (c) TT R > 0.4 Treatment Recovery T F R Car CORRECT DECISION TREE

Cheap 0.5 Expensive 0.5 TTTT R 0.2 R 0.7 R 0.3 RR 0.7 RR = P(R | T,F)  P(R | do(T), F) WHAT WENT WRONG WITH F = Car ? (c) Treatment Recovery T F R Car F R 0.6 Wrong 0.3

0.6 Right 0.3 Female 0.5 Male 0.5 TTTT R R 0.2 R 0.7 R 0.3 RR 0.7 RR T F Gender R (a) WHAT WENT RIGHT WITH F = Gender ? 0.6 = P(R | T,F) = P(R | do(T), F) F

Treatment Recovery T F R Car (c) Compelled Act Treatment Recovery T F R Car (c) Free Act THE PERCEPTION OF FREE WILL (Explained reaction)(Decision)

DISAMBIGUATING THE TWO ROLES OF ACT Whatever evidence an act might provide On motives that cause such acts, Should never be used to help one decide On whether to choose that same act. (p. 109)

Treatment Recovery T F R Car (c) Compelled Act Treatment Recovery T F R Car (c) Free Act THE PARADOX OF FREE WILL (Explained reaction)(Decision) God Physics

Free will is an illusion that demands a computational model, to explain: 1.What mental activity generates the illusion of acting freely? 2.Why/how is such illusion generated from the activity above? 3.How can we survive the distortion generated by the illusion? Is the distortion harmless? Is the illusion dispensable? 4.What is the computational usefulness of the illusion? COMPUTATIONAL RESOLUTION OF THE PARADOX OF FREE WILL

CONCLUSIONS 1.Simpson's Paradox is resolved through the calculus of causation. 2.People think causally, not probabilistically 3.Decision trees cannot replace causal models 4.Free-will is a software illusion to be explained in computational terms.