Causal Diagrams for Epidemiological Research

Slides:



Advertisements
Similar presentations
Case-control study 3: Bias and confounding and analysis Preben Aavitsland.
Advertisements

How would you explain the smoking paradox. Smokers fair better after an infarction in hospital than non-smokers. This apparently disagrees with the view.
Structural Equation Modeling
M2 Medical Epidemiology
Presentation, data and programs at:
“Personality, Socioeconomic Status, and All-Cause Mortality in the United States” - Chapman BP et al. Journal Club 02/24/11.
Using causal graphs to understand bias in the medical literature.
Unit 14: Measures of Public Health Impact.
Chance, bias and confounding
Estimation and Reporting of Heterogeneity of Treatment Effects in Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare.
What is a sample? Epidemiology matters: a new introduction to methodological foundations Chapter 4.
Causal Networks Denny Borsboom. Overview The causal relation Causality and conditional independence Causal networks Blocking and d-separation Excercise.
Causal Diagrams: Directed Acyclic Graphs to Understand, Identify, and Control for Confounding Maya Petersen PH 250B: 11/03/04.
ANOVA: PART II. Last week  Introduced to a new test:  One-Way ANOVA  ANOVA’s are used to minimize family-wise error:  If the ANOVA is statistically.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
 Confounders are usually controlled with the “standard” response regression model.  The standard model includes confounders as covariates in the response.
Causation – Seminar Jørn Olsen 20 November, 2014 Causation – Seminar Jørn Olsen 20 November, 2014.
Intermediate methods in observational epidemiology 2008 Confounding - I.
Cohort Studies Hanna E. Bloomfield, MD, MPH Professor of Medicine Associate Chief of Staff, Research Minneapolis VA Medical Center.
Chapter 15: Probability Rules
Covariate Selection for Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Chapter 8: Bivariate Regression and Correlation
INTRODUCTION TO EPIDEMIOLO FOR POME 105. Lesson 3: R H THEKISO:SENIOR PAT TIME LECTURER INE OF PRESENTATION 1.Epidemiologic measures of association 2.Study.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
Comparing high-dimensional propensity score versus lasso variable selection for confounding adjustment in a novel simulation framework Jessica Franklin.
Causal Graphs, epi forum
Multiple Choice Questions for discussion
CHP400: Community Health Program - lI Research Methodology. Data analysis Hypothesis testing Statistical Inference test t-test and 22 Test of Significance.
Evidence-Based Medicine 3 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Oct-15H.S.1Oct-15H.S.1Oct-151 H.S.1Oct-15H.S.1Oct-15H.S.1 Causal Graphs, epi forum Hein Stigum
Oct-15H.S.1Oct-15H.S.1Oct-15H.S.1 Directed Acyclic Graphs DAGs Hein Stigum
 Is there a comparison? ◦ Are the groups really comparable?  Are the differences being reported real? ◦ Are they worth reporting? ◦ How much confidence.
Amsterdam Rehabilitation Research Center | Reade Multiple regression analysis Analysis of confounding and effectmodification Martin van de Esch, PhD.
What is a causal diagram?
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Correlational Research Chapter Fifteen Bring Schraw et al.
Introduction to confounding and DAGs
Article Review Cara Carty 09-Mar-06. “Confounding by indication in non-experimental evaluation of vaccine effectiveness: the example of prevention of.
Causation ? Tim Wiemken, PhD MPH CIC Assistant Professor Division of Infectious Diseases University of Louisville, Kentucky.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
Copyright © 2010 Pearson Education, Inc. Chapter 15 Probability Rules!
Case-control study Chihaya Koriyama August 17 (Lecture 1)
Analytical epidemiology Disease frequency Study design: cohorts & case control Choice of a reference group Biases Alain Moren, 2006 Impact Causality Effect.
Case-Control Study Duanping Liao, MD, Ph.D
11/20091 EPI 5240: Introduction to Epidemiology Confounding: concepts and general approaches November 9, 2009 Dr. N. Birkett, Department of Epidemiology.
1 Basic epidemiological study designs and its role in measuring disease exposure association M. A. Yushuf Sharker Assistant Scientist Center for Communicable.
A short introduction to epidemiology Chapter 9: Data analysis Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
Assessing the Total Effect of Time-Varying Predictors in Prevention Research Bethany Bray April 7, 2003 University of Michigan, Dearborn.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
01/20151 EPI 5344: Survival Analysis in Epidemiology Cox regression: Introduction March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
01/20151 EPI 5344: Survival Analysis in Epidemiology Confounding and Effect Modification March 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
Matching. Objectives Discuss methods of matching Discuss advantages and disadvantages of matching Discuss applications of matching Confounding residual.
POPLHLTH 304 Regression (modelling) in Epidemiology Simon Thornley (Slides adapted from Assoc. Prof. Roger Marshall)
Probability and odds Suppose we a frequency distribution for the variable “TB status” The probability of an individual having TB is frequencyRelative.
Matched Case-Control Study Duanping Liao, MD, Ph.D Phone:
The research process Psych 231: Research Methods in Psychology.
1 Causation in epidemiology, confounding and bias Imre Janszky Faculty of Medicine NTNU.
Variable selection in Regression modelling Simon Thornley.
{Title of Your Presentation} {YOUR NAME} BENEDICTINE UNIVERSITY MPH 606 METHODS OF RESEARCH IN PUBLIC HEALTH.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Using Directed Acyclic Graphs (DAGs) to assess confounding Glenys Webster & Anne Harris May 14, 2007 St Paul’s Hospital Statistical “Rounds”
Matched Case-Control Study
Observational Studies
Evaluating Effect Measure Modification
DAGs intro without exercises 1h Directed Acyclic Graph
Improving Overlap Farrokh Alemi, Ph.D.
Presentation transcript:

Causal Diagrams for Epidemiological Research Eyal Shahar, MD, MPH Professor Division of Epidemiology & Biostatistics Mel and Enid Zuckerman College of Public Health The University of Arizona

What is it and why does it matter? A tool (method) that: clarifies our wordy or vague causal thoughts about the research topic helps us to decide which covariates should enter the statistical model—and which should not unifies our understanding of confounding bias, selection bias, and information bias

What is the key question in a non-randomized study? When estimating the effect of E (“exposure”) on D (“disease”), what should we adjust for? or Confounder selection strategy

Adjusting for Confounders Common Practice The “change-in-estimate” method List “potential confounders” Adjust for (condition on) potential confounders Compare adjusted estimate to crude estimate (or “fully adjusted” to “partially adjusted”) Decide whether “potential confounders” were “real confounders” Decide how much confounding existed Premise: The data informs us about confounding. Are we asking too much from the data?

Adjusting for Confounders Common Practice What is “a potential confounder”? Typically, “a cause of the disease that is associated with the exposure” Confounder E D What is the effect of a confounder? Contributes to the crude (observed, marginal) association between E and D

Adjusting for Confounders Common Practice Extension to multiple confounders C1 C2 C3 E D E D E D C4 C5 C6 E D E D E D

Adjusting for Confounders Common Practice Problems A sequence of isolated, independent, causal diagrams but C1, C2, C3, C4, C5,.. might be connected causally Unidirectional arrow = a causal direction but what is the meaning of the bidirectional arrow? Even with a single confounder, the “change-in-estimate” method could fail

Adjusting for Confounders Problems An example where the “change-in-estimate” method fails U1 U2 C E D The crude estimate may be closer to the truth than the C-adjusted estimate To be explained

Alternative A Causal Diagram A method for selecting covariates Extension of the confounder triangle Premises displayed in the diagram New terms: Path Collider on a path Confounding path

Selected references Pearl J. Causality: models, reasoning, and inference. 2000. Cambridge University Press Greenland S et al. Causal diagrams for epidemiologic research. Epidemiology 1999;10:37-48 Robins JM. Data, design, and background knowledge in etiologic inference. Epidemiology 2001;11:313-320 Hernan MA et al. A structural approach to selection bias. Epidemiology 2004;15:615-625 Shahar E. Causal diagrams for encoding and evaluation of information bias. J Eval Clin Pract (forthcoming)

A Causal Diagram Notation and Terms An arrow=causal direction between two variables E D An arrow could abbreviate both direct and indirect effects U1 E D E D could summarize U2 U3

A Causal Diagram Notation and Terms A path between E and D: any sequence of causal arrows that connects E to D E D E U1 U2 D E U1 U2 D E U1 U2 D

A Causal Diagram Notation and Terms Circularity (self-causation) does not exist: Directed Acyclic Graph E U1 D U2 A collider on the path between E and D E U1 U2 D E and U2 collide at U1

A Causal Diagram Notation and Terms A confounding path for the effect of E on D: Any path between E and D that meets the following criteria: The arrow next to E points to E There are no colliders on the path C U1 V1 U2 V2 U3 E D In short: a path showing a common cause of E and D

C U1 V1 U2 C V2 U3 U1 V1 E D U2 V2 C U3 U1 V1 E D U2 V2 U3 E D The paths below are NOT confounding paths for the effect of E on D C U1 V1 U2 C V2 U3 U1 V1 E D U2 V2 C U3 U1 E V1 D U2 V2 U3 E D

What can affect the association between E and D What can affect the association between E and D? (Why do we observe an association between two variables?) Causal path: E causes D Causal path: D causes E Confounding paths Adjustment for colliders on a path from E to D E D D E C E D Later…

Why does a confounding path affect the crude (marginal) association between E and D? Intuitively: Association= being able to “guess” the value of one variable (D) from the value of another (E) ED allows us to guess D from E (and E from D) A confounding path allows for sequential guesses along the path C U1 V1 U2 V2 U3 E D

How can we block a confounding path between E and D? Condition on a variable on the path (on any variable) Methods for conditioning Restriction Stratification Regression C U1 V1 U2 V2 U3 E D

A point to remember C U1 V1 U2 V2 U3 E D We don’t need to adjust for confounders (the top of the triangle.) Adjustment for any U or V below will do. U and V are surrogates for the confounder C C U1 V1 U2 V2 U3 E D

Example If the diagram below corresponds to reality, then we have several options for conditioning For example: On C and U2 Only on U2 Only on U3 C U1 V1 U2 V2 U3 E D

What can affect the association between E and D? Causal path: E causes D Causal path: D causes E Confounding paths Adjustment for colliders on a path from E to D E D D E C E D NOW!

Conditioning on a Collider A Trap A collider may be viewed as the opposite of a confounder Collider and confounder are symmetrical entities, like matter and anti-matter Confounder Collider C U1 V1 U2 V2 U3 E D

Conditioning on a Collider A Trap A path from E to D that contains a collider is NOT a confounding path. There is no transfer of “guesses” across a collider. A path from E to D that contains a collider does NOT generate an association between E and D Conditioning on the collider, however, will turn that path into a confounding path. Why?

Conditioning on a Collider A Trap V1 U1 U2 V2 U3 E D The horizontal line indicates an association (the possibility of “guesses”) that was induced by conditioning on a collider

Properties of a Collider Intuitive Explanation A dataset contains three variables for N cars: Brake condition (good/bad) Street condition in the owner’s town (good/bad) Involved in an accident in the owner’s town? (yes/no) Brake condition (good, bad) Accident (yes, no) Street condition (good, bad) Accident is a collider. Brake condition and street condition are not associated in the dataset. We cannot use the data to guess one from the other.

Properties of a Collider Intuitive Explanation Why can’t we make a guess from the data? Let’s try. Suppose we are told: Car A has good brakes and car B has bad brakes. This information tells us nothing about the street condition in each owner’s town. Brake condition Street condition Car A Good ? Car B Bad Intuition: a common effect (collider) does not induce an association between its causes (colliding variables)

Properties of a Collider Intuitive Explanation If, however, we condition (stratify) on the collider “accident”, we can make some guesses about the street condition from the brake condition. Stratum #1 Accident = yes Brake condition Accident Street condition Car A Good Yes Bad (a guess) Car B ?

Properties of a Collider Intuitive Explanation Similarly, in the other stratum Stratum #2 Accident = no Brake condition Accident Street condition Car A Good No (a guess) Car B Bad ?

Properties of a Collider In summary: Conditioning on a collider creates an association between the colliding variables and, therefore, may open a confounding path Before conditioning on C After conditioning on C U1 U2 U1 U2 C C E D E D

Derivations The “change-in-estimate” method could fail if we condition on colliders, and thereby open confounding paths To (rationally) select covariates for adjustment, we must commit to a causal diagram (premises) (But we often say that we don’t know and can’t commit, and hope that the change-in-estimate method will work.) Causal inference, like all scientific inference, is conditional on premises (which may be false)—not on ignorance

Derivations U1 U2 U1 U2 C C E D E D Do not condition on colliders, if possible If you condition on a collider, Connect the colliding variables by a line Check if you opened a new confounding path Condition on another variable to block that new path Conditioning on C alone Conditioning on C and (U1 or U2) U1 U2 U1 U2 C C E D E D

Practical advice Study one exposure at a time A model that may be good for exposure A might not be good for exposure B (even if B is in the model) Never adjust for an effect of the exposure Never adjust for an effect of the disease Never select covariates by stepwise regression Never look at p-values to decide on confounding (actually, never look at p-values…)

Extension to other problems of causal inquiry Causation always remains uncertain, even if we deal with a single confounder Unbeknown to us the reality happens to be We draw U1 U2 C C E D E D And naively condition on C And our adjustment may fail

Extension to other problems of causal inquiry Estimating the “direct” effect by conditioning on an intermediary variable, I I E D We should remember that variable I may be a collider D U E I D U I E

Extension to other problems of causal inquiry Causal diagrams explain the mechanism of selection bias Example: What happens if we estimate the effect of marital status on dementia in a sample of nursing home residents? Assume: no effect both variables affect “place of residence” (home, or nursing home)

Extension to other problems of causal inquiry Marital status Dementia Place of residence (home, nursing home) By studying a sample of nursing home residents, we are conditioning on a collider (on a “sampling collider”) and might create an association between marital status and dementia in that stratum

Extension to other problems of causal inquiry Marital status Dementia Place of residence (home, nursing home) “Stratification” Home Nursing home Marital status Dementia

Extensions: control selection bias (Source: Hernan et al, Epidemiology 2004) Source cohort: no effect Estrogen MI DS because disease status affects selection. Diseased members of the cohort are over-sampled (cases) relative to non-diseased (controls) S (0,1) Selection into a case-control sample S=1 E D S (0,1) Estrogen MI Suppose: F is hip fracture F Suppose: EF Suppose: Controls preferentially selected from women with hip fracture

Extensions: control selection bias (Source: Hernan et al, Epidemiology 2004) Estrogen MI E D F S (0,1) S=0 (remainder of the source cohort) S=1 (our case-control sample) HRT MI E D Association of E and D was created

Extensions: information bias (LAST EXAMPLE) D Estrogen use Endometrial cancer D* Diagnosed endometrial cancer ? Z Frequency of exams Vaginal bleeding

Summary Points The “change-in-estimate” method could fail if we condition on colliders, and thereby open confounding paths The theory of causal diagrams extends the idea of a confounder to the multi-confounder case Unification of confounding bias, selection bias, and information bias under a single theoretical framework

“Back-door algorithm” Sufficient set for adjustment Minimally sufficient set Differential losses to follow-up Time-dependent confounders Interpretation of hazard ratios Conditioning on a common effect always induced an association between its causes, but this association could be restricted to some levels of the common effect

? Age (young, old) Smoking drive (low, high) Sex Physical activity Asthma (yes, no) Smoking status ? FEV1

? Age (young, old) Smoking drive (low, high) Sex Physical activity Asthma (yes, no) Smoking status ? FEV1

? Age (young, old) Smoking drive (low, high) Sex Physical activity Asthma (yes, no) Smoking status ? FEV1

? ? Pneumonia Ulcer Hospitalization Status hospitalized not hospitalized Coughing ? Abdominal Pain Stratification other patients hospitalized patients Pneumonia Ulcer Coughing ? Abdominal Pain

Example: Do men have higher systolic blood pressure than women Example: Do men have higher systolic blood pressure than women? (In other words: estimate the gender effect on systolic blood pressure) The following table summarizes the answer to this question from two regression models Association with gender Mean SBP in men Mean SBP in women Mean difference (coefficient) Inference Marginal (crude) 123.8 mmHg 122.1 mmHg +1.7 mmHg Men’s SBP is higher Conditional on (“adjusted for”) BMI and WHR 121.2 mmHg 124.3 mmHg -3.1 mmHg Women’s SBP is higher   So, which is the true estimate and which is biased?

WHR Gender SBP BMI Z1 Z2 .

U WHR Gender SBP BMI Z1 Z2 .

U WHR Gender SBP BMI Z1 Z2 .