RECONSIDERING DEFINITIONS OF DIRECT AND INDIRECT EFFECTS IN MEDIATION ANALYSIS WITH A SOLUTION FOR A CONTINUOUS FACTOR Ilya Novikov, Michal Benderly, Laurence.

Slides:



Advertisements
Similar presentations
Mediation: Multiple Variables David A. Kenny. 2 Mediation Webinars Four Steps Indirect Effect Causal Assumptions.
Advertisements

Topic 12: Multiple Linear Regression
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Improving health worldwide George B. Ploubidis The role of sensitivity analysis in the estimation of causal pathways from observational.
Correlation and regression Dr. Ghada Abo-Zaid
1 THE SYMBIOTIC APPROACH TO CAUSAL INFERENCE Judea Pearl University of California Los Angeles (
Linear Regression and Binary Variables The independent variable does not necessarily need to be continuous. If the independent variable is binary (e.g.,
Introduction to Probability
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
A Short Introduction to Curve Fitting and Regression by Brad Morantz
Variance and covariance M contains the mean Sums of squares General additive models.
SIMPLE LINEAR REGRESSION
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Simple Linear Regression Analysis
SIMPLE LINEAR REGRESSION
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Business Statistics - QBM117 Statistical inference for regression.
Linear regression models in matrix terms. The regression function in matrix terms.
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Multiple imputation using ICE: A simulation study on a binary response Jochen Hardt Kai Görgen 6 th German Stata Meeting, Berlin June, 27 th 2008 Göteborg.
SIMPLE LINEAR REGRESSION
Systems of Nonlinear Equations in Two Variables
Simple Linear Regression
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
1 G Lect 10a G Lecture 10a Revisited Example: Okazaki’s inferences from a survey Inferences on correlation Correlation: Power and effect.
1 Multiple Imputation : Handling Interactions Michael Spratt.
Andrew Thomson on Generalised Estimating Equations (and simulation studies)
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
G Lect 21 G Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.
MTH 161: Introduction To Statistics
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
1 G Lect 14M Review of topics covered in course Mediation/Moderation Statistical power for interactions What topics were not covered? G Multiple.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Discussion of time series and panel models
Analysis of Variance (ANOVA) Brian Healy, PhD BIO203.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Chapter 6 Simple Regression Introduction Fundamental questions – Is there a relationship between two random variables and how strong is it? – Can.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
–The shortest distance is the one that crosses at 90° the vector u Statistical Inference on correlation and regression.
© Buddy Freeman, 2015 Let X and Y be two normally distributed random variables satisfying the equality of variance assumption both ways. For clarity let.
Psychology 202a Advanced Psychological Statistics November 12, 2015.
01/20151 EPI 5344: Survival Analysis in Epidemiology Confounding and Effect Modification March 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
CAUSAL REASONING FOR DECISION AIDING SYSTEMS COGNITIVE SYSTEMS LABORATORY UCLA Judea Pearl, Mark Hopkins, Blai Bonet, Chen Avin, Ilya Shpitser.
Mediation: The Causal Inference Approach David A. Kenny.
MathematicalMarketing Slide 5.1 OLS Chapter 5: Ordinary Least Square Regression We will be discussing  The Linear Regression Model  Estimation of the.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Week 1 Real Numbers and Their Properties (Section 1.6, 1.7, 1.8)
Mediation: Assumptions David A. Kenny davidakenny.net.
Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.
Bivariate & Multivariate Regression Analysis
CH 5: Multivariate Methods
Copyright © Cengage Learning. All rights reserved.
Regression Models - Introduction
Statistical Assumptions for SLR
Chapter 4, Regression Diagnostics Detection of Model Violation
SIMPLE LINEAR REGRESSION
Cases. Simple Regression Linear Multiple Regression.
Implementation of the Bayesian approach to imputation at SORS Zvone Klun and Rudi Seljak Statistical Office of the Republic of Slovenia Oslo, September.
Wednesday, October 05, 2016 Farrokh Alemi, PhD.
Introduction to Econometrics, 5th edition
Presentation transcript:

RECONSIDERING DEFINITIONS OF DIRECT AND INDIRECT EFFECTS IN MEDIATION ANALYSIS WITH A SOLUTION FOR A CONTINUOUS FACTOR Ilya Novikov, Michal Benderly, Laurence Freedman Gertner Institute for Epidemiology and Health Policy Research, Tel Hashomer, Israel 1IBS-EMR 2013

Introduction Analysis of mediation is a part of causal inferences in statistics During the last two decades the area was substantially developed by Pearl, Robbins, Greenland and others We present an implementation of the modified Pearl’s approach for different situations The modification was proposed recently by several authors but the implementation for continuous factor is apparently new 2IBS-EMR 2013

Mediation Situation -There are at least two factors (X, Z) affecting the outcome ( Y) -There may be confounders ( C ) associated with X,Z, and Y Causal Model Factor X does not depend on Z and Y. Mediator Z depends on X but not on Y Outcome Y depends of X and Z HOWEVER F(X|Z,Y,C)≠F(X|C), F(X,Z|Y,C) ≠F(X,Y|C) XY Z C 3IBS-EMR 2013

Effects – concept. An effect is defined as the difference in the expectations of the outcome in two situations. Total effect of X on Y = effect on Y of changes in X Direct effect of X on Y = effect on Y of changes in X while Z remains unchanged Indirect effect of X on Y =effect on Y of changes in Z, that were induced by changes in X, while X remains unchanged 4IBS-EMR 2013 XY Z

Counterfactual approach. Binary factor DATA: there are two data sets with X=0 and X=1 (red) In order to estimate the direct and indirect effects we need two unobservable data sets (gray) Y(X=0,Z(0)) Y(X=0, Z(1)) Y(X=1,Z(0 )) Y(X=1, Z(1)) 5IBS-EMR 2013

Effects – elaboration of the definitions. What is the meaning of UNCHANGED? In comparison with what situation? Placing Z on causal path from X to Y provides the answer. Direct effect: Z remains as it was at the initial values of X Indirect effect: X remains at its new values Y(X=0,Z(0)) Y(X=0, Z(1)) Y(X=1,Z(0))Y(X=1, Z(1)) Total Indirect Direct 6IBS-EMR 2013

Estimation of the effects. Linear model. For a linear model there is an exact solution. Let x,y,z are continuous and all assumptions of linear regression are fulfilled. y=b0+b1*x+b2*z+e1 z=a0+a1*x + e2 e1,e2 – independent random errors, non-correlated with x Then Total effect = E(Y|X+1)-E(Y|X)= b1+b2*a1 (product formula) Direct effect = E(Y|X+1,Z(X))-E(Y|X,Z(X))=b1 Indirect effect= E(Y|X+1,Z(X+1))-E(Y|X+1,Z(X))= b2*a1 7IBS-EMR 2013

Problems with non-linear regression In non-linear situation (for example, for a binary outcome Y) the product formula is not applicable. Various attempts to estimate indirect effect using coefficients of non-linear regression were not commonly accepted The source of the problem is that the effect can not be expressed using only regression coefficients but needs also the distribution of the covariates The solution was found using a counterfactual approach 8IBS-EMR 2013

Mediation formula Total effect=E(Y|X=1,Z(1))-E(Y|X=0,Z(0)) Direct effect =E(Y|X=1,Z(0))-E(Y|X=0,Z(0)) Indirect effect(Pearl)=E(Y|X=0,Z(1))-E(X=0,Z(0)) Indirect effect (Modified)= E(Y|X=1,Z(1))-E(Y|X=1,Z(0)) In general the total effect is not equal to the sum of the direct and Pearl’s indirect effect However the total effect is always equal to the sum of the direct and modified indirect effect 9IBS-EMR 2013

Estimation of the effects. Binary factor. Since Y(X=1,Z(0)) does not exist, we estimate it using a multiple imputation technique for missing values Z(0) for X=1 is estimated using regression Z(0) on C when X=0 Y(X=1,Z(0)) is estimated using regression Y on C,Z when X=1 Y(X=0,Z(0)) Y(X=1,Z(0))Y(X=1, Z(1)) Total Indirect Direct 10IBS-EMR 2013

Continuous factor. Definitions Data: For continuous X we have only one data set Requirement: in linear model it should lead to the exact solution Definition (reduction to the binary situation): For each object j, and “intervention” replaces x(j) by x’(j)=x(j)+1 Y(X,Z(X)) Y(X’,Z(X))Y(X’, Z(X’1)) Total Indirect Direct Y(X’,Z(X’)) 11IBS-EMR 2013

Continuous factor. Estimation For each object “j” the potential outcome y’(j)=y(x+1,z(x)) is estimated and imputed using the appropriate model y(x,z) For each object “j” the mediator z’(j)=z(x’) is estimated and imputed using the appropriate model z(x) For each object “j” the potential outcome y’’(j)=y(x’,z(x’)) is estimated and imputed using the appropriate model y(x,z) Y(X,Z(X)) Y(X’,Z(X)) Y(X’, Z(X’)) Total Indirect Direct 12IBS-EMR 2013

Realization. General logic We wrote a SAS macro that implements the general logic described here and covers all 8 potential combinations of binary or continuous types of variables X,Y,Z For binary X it assumes two data sets with X=0 and X=1 For continuous X it uses one data set For imputation of Z the macro uses linear regression Z on X and covariates C, and for binary Z it uses logistic regression on X and C For imputation of Y the macro uses linear regression of Y on X,Z and covariates C, and for binary Y it uses logistic regression on X,Z and C 13IBS-EMR 2013

MACRO CALL (example) %mediation8(data=all, ep=new_chf, mediator=totMIn, predictor=CRPq, covbin=, covcont=age, RTF=y); Thanks for your attention. 14IBS-EMR 2013