University of California

Slides:



Advertisements
Similar presentations
Agenda of Week V Review of Week IV Inference on MV Mean Vector One population Two populations Multi-populations: MANOVA.
Advertisements

Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Structural Equation Modeling (SEM) Niina Kotamäki.
Outline 1)Motivation 2)Representing/Modeling Causal Systems 3)Estimation and Updating 4)Model Search 5)Linear Latent Variable Models 6)Case Study: fMRI.
1 WHAT'S NEW IN CAUSAL INFERENCE: From Propensity Scores And Mediation To External Validity Judea Pearl University of California Los Angeles (
1 THE SYMBIOTIC APPROACH TO CAUSAL INFERENCE Judea Pearl University of California Los Angeles (
Ch11 Curve Fitting Dr. Deshi Ye
Definition  Regression Model  Regression Equation Y i =  0 +  1 X i ^ Given a collection of paired data, the regression equation algebraically describes.
RECONSIDERING DEFINITIONS OF DIRECT AND INDIRECT EFFECTS IN MEDIATION ANALYSIS WITH A SOLUTION FOR A CONTINUOUS FACTOR Ilya Novikov, Michal Benderly, Laurence.
0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Chapter 7 Sampling and Sampling Distributions
REGRESSION What is Regression? What is the Regression Equation? What is the Least-Squares Solution? How is Regression Based on Correlation? What are the.
Regression and Correlation
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Principles of the Global Positioning System Lecture 10 Prof. Thomas Herring Room A;
1 CAUSAL INFERENCE: MATHEMATICAL FOUNDATIONS AND PRACTICAL APPLICATIONS Judea Pearl University of California Los Angeles (
Standard error of estimate & Confidence interval.
1 WHAT'S NEW IN CAUSAL INFERENCE: From Propensity Scores And Mediation To External Validity Judea Pearl University of California Los Angeles (
1 REASONING WITH CAUSES AND COUNTERFACTUALS Judea Pearl UCLA (
1 G Lect 8b G Lecture 8b Correlation: quantifying linear association between random variables Example: Okazaki’s inferences from a survey.
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Propensity Score Matching for Causal Inference: Possibilities, Limitations, and an Example sean f. reardon MAPSS colloquium March 6, 2007.
BIOST 536 Lecture 11 1 Lecture 11 – Additional topics in Logistic Regression C-statistic (“concordance statistic”)  Same as Area under the curve (AUC)
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
Geology 5670/6670 Inverse Theory 21 Jan 2015 © A.R. Lowry 2015 Read for Fri 23 Jan: Menke Ch 3 (39-68) Last time: Ordinary Least Squares Inversion Ordinary.
Chapter 16 Social Statistics. Chapter Outline The Origins of the Elaboration Model The Elaboration Paradigm Elaboration and Ex Post Facto Hypothesizing.
Psychology 202a Advanced Psychological Statistics October 1, 2015.
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
Causal Model Ying Nian Wu UCLA Department of Statistics July 13, 2007 IPAM Summer School.
Judea Pearl Computer Science Department UCLA ROBUSTNESS OF CAUSAL CLAIMS.
Class 5 Multiple Regression CERAM February-March-April 2008 Lionel Nesta Observatoire Français des Conjonctures Economiques
CAUSAL REASONING FOR DECISION AIDING SYSTEMS COGNITIVE SYSTEMS LABORATORY UCLA Judea Pearl, Mark Hopkins, Blai Bonet, Chen Avin, Ilya Shpitser.
Mediation: The Causal Inference Approach David A. Kenny.
1 CONFOUNDING EQUIVALENCE Judea Pearl – UCLA, USA Azaria Paz – Technion, Israel (
1/61: Topic 1.2 – Extensions of the Linear Regression Model Microeconometric Modeling William Greene Stern School of Business New York University New York.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
Alexander Spermann University of Freiburg, SS 2008 Matching and DiD 1 Overview of non- experimental approaches: Matching and Difference in Difference Estimators.
Econometrics III Evgeniya Anatolievna Kolomak, Professor.
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
Chapter 7 Sampling and Sampling Distributions
Advanced Higher Statistics
Probability Theory and Parameter Estimation I
Inference: Conclusion with Confidence
CH 5: Multivariate Methods
Comparison of Correlation and % Bias to Invalidate Frameworks
The Centre for Longitudinal Studies Missing Data Strategy
Chapter 5 STATISTICS (PART 4).
Two Sample Tests When do use independent
Simple Linear Regression - Introduction
LESSON 21: REGRESSION ANALYSIS
Methods of Economic Investigation Lecture 12
LESSON 24: INFERENCES USING REGRESSION
Sampling Distribution
Sampling Distribution
Generalized Spatial Dirichlet Process Models
Explanation of slide: Logos, to show while the audience arrive.
Microeconometric Modeling
Bootstrapping Jackknifing
The European Statistical Training Programme (ESTP)
(AERA on-line video – cost is $105)
Chapter: 9: Propensity scores
Multivariate Methods Berlin Chen
Multivariate Methods Berlin Chen, 2005 References:
ASV Chapters 1 - Sample Spaces and Probabilities
From Propensity Scores And Mediation To External Validity
Alternative Scenarios and Related Techniques
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
(AERA on-line video – cost is $105)
Presentation transcript:

University of California ON MEASUREMENT BIAS IN CAUSAL INFERENCE Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/)

THE MEASUREMENT BIAS PROBLEM Given causal diagram Z-unobserved (latent) Find P(y | do (x)) We know that But Minimize the bias Z W X Y

OUTLINE Effect Restoration using Matrix Inversion Example: Restoration in binary models Extension to multivariate confounders Effect Restoration in linear models The 3-proxy principle and its variations Model testing with measurement errors

MEASUREMENT BIAS AND EFFECT RESTORATION Unobserved Z P(w|z) P(y | do(x)) is identifiable from measurement of W, if P(w | z) is given (Selen, 1986; Greenland & Lash, 2008) W X Y Assume: (local independence) Solution:

EFFECT RESTORATION IN BINARY MODELS Z 1 W X Y 1 Weight distribution from cell (x,y) To cell (x,y,z0) W X Y To cell (x,y,z1) undefined undefined 1

WHAT IF Z IS MULTI-VARIATE? G Z1 Z2 W1 W2 Z3 W3 Z4 Z5 W4 W5 X Z6 Y If Z is high-dimensional, most cells will be empty of samples and, even if P(w | z) is known, P(x,w,y) cannot be estimated

PROPENSITY SCORE ESTIMATOR (Rosenbaum & Rubin, 1983) Z1 Z2 P(y | do(x)) = ? L Z4 Z3 Z5 X Z6 Y Adjustment for L replaces Adjustment for Z Theorem:

PROPENSITY SCORE RESTORATION G Z1 Z2 W1 W2 Z3 W3 Z4 Z5 W4 W5 X Z6 Y From observed samples (x,w,y) to synthetic samples (x,z,y), to L(z), to

EFFECT RESTORATION IN LINEAR MODELS Z c1 c2 c3 W X c0 Y The pivotal parameter needed is

EFFECT RESTORATION FROM A SECOND PROXY W Z X Y c1 c2 c3 c0 V Z X Y c1 (b)

THE THREE-PROXIES PRINCIPLE Z Y c2 c0 c1 c3 c4 W V X Cai and Kuroki (2008) c0 is identifiable

MODEL TESTING WITH MEASUREMENT ERRORS Z unobserved Problem: Test if c1 Solution: Test if c0 = 0 Theorem 1: If a latent variable Z d-separates two measured variables, X and Y, and Z has a proxy W, W = cZ + , then cov(XY) must satisfy: cov(XY)=cov(XW) cov(WY) / c2 var(Z) Corollary: is testable if k=c2 var(Z) is estimable Example: Given W and V, k is estimable, and X c0 Y V Z X Y c1 c2 c4 c0 W c3

CONCLUSIONS Effect restoration is feasible Rests on two principles 2.1 Matrix inversion in discrete models Requires synthetic population    and propensity score estimation 2.2 3-proxies per latent in linear models proxies can be decoupled by clever conditioning 3. Conditional independence tests can be replaced by tetrad-like tests (using 2SLS) for model testing