Tim Wiemken PhD MPH CIC Assistant Professor Division of Infectious Diseases University of Louisville, Kentucky Confounding.

Slides:



Advertisements
Similar presentations
Confounding and effect modification
Advertisements

How would you explain the smoking paradox. Smokers fair better after an infarction in hospital than non-smokers. This apparently disagrees with the view.
Survival Analysis In many medical studies, the primary endpoint is time until an event occurs (e.g. death, remission) Data are typically subject to censoring.
Two-sample tests. Binary or categorical outcomes (proportions) Outcome Variable Are the observations correlated?Alternative to the chi- square test if.
M2 Medical Epidemiology
1 2 Two-samples tests, X 2 Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
Third training Module, EpiSouth: Multivariate analysis, 15 th to 19 th June 20091/29 Multivariate analysis: Introduction Third training Module EpiSouth.
Case-Control Studies (Retrospective Studies). What is a cohort?
1 Case-Control Study Design Two groups are selected, one of people with the disease (cases), and the other of people with the same general characteristics.
Chance, bias and confounding
Tim Wiemken, PhD MPH CIC Assistant Professor Division of Infectious Diseases University of Louisville, Kentucky Planning Your Study Statistical Issues.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Journal Club Alcohol, Other Drugs, and Health: Current Evidence July–August 2009.
BIOST 536 Lecture 9 1 Lecture 9 – Prediction and Association example Low birth weight dataset Consider a prediction model for low birth weight (< 2500.
1June In Chapter 19: 19.1 Preventing Confounding 19.2 Simpson’s Paradox (Severe Confounding) 19.3 Mantel-Haenszel Methods 19.4 Interaction.
Statistics for Health Care
Intermediate methods in observational epidemiology 2008 Confounding - I.
THREE CONCEPTS ABOUT THE RELATIONSHIPS OF VARIABLES IN RESEARCH
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Case Control Study Manish Chaudhary BPH, MPH
Absolute, Relative and Attributable Risks. Outcomes or differences that we are interested in:  Differences in means or proportions  Odds ratio (OR)
INTRODUCTION TO EPIDEMIOLO FOR POME 105. Lesson 3: R H THEKISO:SENIOR PAT TIME LECTURER INE OF PRESENTATION 1.Epidemiologic measures of association 2.Study.
Logistic Regression. Outline Review of simple and multiple regressionReview of simple and multiple regression Simple Logistic RegressionSimple Logistic.
HSTAT1101: 27. oktober 2004 Odd Aalen
Multiple Choice Questions for discussion
1 Journal Club Alcohol, Other Drugs, and Health: Current Evidence January–February 2014.
Tim Wiemken PhD MPH CIC Assistant Professor Division of Infectious Diseases University of Louisville, Kentucky Confounding.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Coffee Consumption and Risk of Myocardial Infarction among Older Swedish Women SA Rosner, A Akesson,MJ. Stampfer, A Wolk; AJE; :
Evidence-Based Medicine 3 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Gitanjali Batmanabane
Measures of Association
POTH 612A Quantitative Analysis Dr. Nancy Mayo. © Nancy E. Mayo A Framework for Asking Questions Population Exposure (Level 1) Comparison Level 2 OutcomeTimePECOT.
Amsterdam Rehabilitation Research Center | Reade Multiple regression analysis Analysis of confounding and effectmodification Martin van de Esch, PhD.
Week 6: Model selection Overview Questions from last week Model selection in multivariable analysis -bivariate significance -interaction and confounding.
Racial and Ethnic Disparities in the Knowledge of Shaken Baby Syndrome among Recent Mothers Findings from the Rhode Island PRAMS Hanna Kim, Samara.
October 15. In Chapter 19: 19.1 Preventing Confounding 19.2 Simpson’s Paradox 19.3 Mantel-Haenszel Methods 19.4 Interaction.
Causation ? Tim Wiemken, PhD MPH CIC Assistant Professor Division of Infectious Diseases University of Louisville, Kentucky.
Contingency tables Brian Healy, PhD. Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomous t-test, Wilcoxon test ContinuousCategorical.
R Programming Risk & Relative Risk 1. Session 2 Overview 1.Risk 2.Relative Risk 3.Percent Increase/Decrease Risk 2.
Analytical epidemiology Disease frequency Study design: cohorts & case control Choice of a reference group Biases Alain Moren, 2006 Impact Causality Effect.
Research Techniques Made Simple: Multivariable Analysis Marlies Wakkee Loes Hollestein Tamar Nijsten Department of Dermatology, Erasmus University Medical.
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
Case-Control Study Duanping Liao, MD, Ph.D
11/20091 EPI 5240: Introduction to Epidemiology Confounding: concepts and general approaches November 9, 2009 Dr. N. Birkett, Department of Epidemiology.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
BC Jung A Brief Introduction to Epidemiology - XIII (Critiquing the Research: Statistical Considerations) Betty C. Jung, RN, MPH, CHES.
1 Chapter 16 logistic Regression Analysis. 2 Content Logistic regression Conditional logistic regression Application.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Contingency Tables.
Meta-analysis of observational studies Nicole Vogelzangs Department of Psychiatry & EMGO + institute.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
(www).
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Understanding Statistics for Research Staff!
Study Designs Group Work
April 18 Intro to survival analysis Le 11.1 – 11.2
Epidemiologic Measures of Association
Sec 9C – Logistic Regression and Propensity scores
Epidemiological Methods
Epidemiology 503 Confounding.
Statistics 103 Monday, July 10, 2017.
Lecture 3: Introduction to confounding (part 1)
Chapter 18 Cross-Tabulated Counts
Multiple logistic regression
Evaluating Effect Measure Modification
The Aga Khan University
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
Effect Modifiers.
Presentation transcript:

Tim Wiemken PhD MPH CIC Assistant Professor Division of Infectious Diseases University of Louisville, Kentucky Confounding

1. Define and Identify Confounding 3. Identify How to Select Confounding Variables for Multivariate Analysis 2. Calculate Risk Ratio and Stratified Risk Ratio Overview

1. Define and Identify Confounding 3. Identify How to Select Confounding Variables for Multivariate Analysis 2. Calculate Risk Ratio and Stratified Risk Ratio Overview

A variable related to the exposure (predictor) and outcome but not in the causal pathway Definition: Confounding

Confounding

Risk factor that has different prevalence in two study populations… e.g. Coffee drinking and lung cancer Why does this happen? Confounding

Men vs Women Example…. Men vs Women Example…. 25% Risk of lung cancer 5% Risk of Lung Cancer Example

Men vs Women Example…. Men vs Women Example…. 25% Risk of lung cancer 5% Risk of Lung Cancer Example Conclusion: People who drink coffee die more therefore coffee causes lung cancer

Men vs Women Example…. Men vs Women Example…. 25% Risk of lung cancer 5% Risk of Lung Cancer Example Truth: Coffee drinkers are more likely to smoke. Smoking is associated with a higher risk of lung cancer. mortality.

Example Outcome: Lung cancer Confounder: Smoking Predictor: Coffee

Example Outcome: Lung cancer Confounder: Smoking Predictor: Coffee Smoking associated with coffee drinking and lung cancer. Smoking is not caused by drinking coffee.

1. Define and Identify Confounding 3. Identify How to Select Confounding Variables for Multivariate Analysis 3. Identify How to Select Confounding Variables for Multivariate Analysis 2. Calculate Risk Ratio and Stratified Risk Ratio 2. Calculate Risk Ratio and Stratified Risk Ratio Overview

Question: Are coffee drinkers more likely to get lung cancer? Example Warning: The upcoming data are made up. Do not make any decisions based on the outcomes of our example!

3154 subjects 2648 Enrolled 506 Excluded 1307 coffee coffee- 178 cancer cancer- 79 cancer cancer- Example Flowchart

What Type of Study is That? Example

What is the correct measure of association? Example

What Type of Study is That? What is the correct measure of association? Example OK. Now Calculate the Correct Measure of Association

Data Do coffee drinkers get lung cancer more than non coffee drinkers? Cancer+Cancer- Coffee+ Coffee- Example

3154 Subjects 2648 Enrolled 506 Excluded 1307 coffee coffee- 178 cancer cancer- 79 cancer cancer- Example Flowchart

Do coffee drinkers get lung cancer more than non coffee drinkers? Cancer+Cancer- Coffee Coffee Example Data

? Well? Do coffee drinkers get lung cancer more than non coffee drinkers? Example

Yes! RR: 2.31, P=<0.001, 95% CI: 1.79 – 2.98 Yes! RR: 2.31, P=<0.001, 95% CI: 1.79 – 2.98 Do coffee drinkers get lung cancer more than non coffee drinkers? Example

Is this a true relationship or is another variable confounding that relationship? Example

We noticed a lot of coffee drinkers also smoke, much more than those patients who didn’t drink coffee. Could this be a confounder? Example

Input your data in the 2x2 Example: Step 1 Cancer+Cancer- Coffee Coffee This gives you a ‘crude’ odds or risk ratio

Stratify on the potential confounder Stratified data: Smoker+ Coffee+/ Cancer+: 168 Coffee -/Cancer+: 34 Coffee+/Cancer-: 880 Coffee-/Cancer-: 177 Stratified data: Smoker- Coffee+/ Cancer+: 10 Coffee -/Cancer+: 45 Coffee+/Cancer-: 249 Coffee-/Cancer-: 1085 Example: Step 2

Compute Risk Ratios for Both, Separately Example: Step 2 Smoker-Cancer+Cancer- Coffee+ Coffee- Smoker+Cancer+Cancer- Coffee+ Coffee-

Calculate the adjusted measure of association Example: Step 2 Stratified data: Smoker+ Coffee+/ Cancer+: 168 Coffee -/Cancer+: 34 Coffee+/Cancer-: 880 Coffee-/Cancer-: 177 Stratified data: Smoker- Coffee+/ Cancer+: 10 Coffee -/Cancer+: 45 Coffee+/Cancer-: 249 Coffee-/Cancer-: 1085

2. Compute Risk Ratios for Both, Separately Example: Step 2 Smoker-Cancer+Cancer- Coffee Coffee Smoker+Cancer+Cancer- Coffee Coffee

What do you see? Example

Ensure that, in the group without the outcome, the potential confounder is associated with the predictor Example: Step 3

Adjusted Ratio Must be >10% Different than the Crude Ratio Example: Step 4 Compute the adjusted odds/risk ratios Compute the percent difference between the ‘crude’ and adjusted ratios.

If the criteria are met, you have a confounder Example

As in our example, a confounder can create an apparent association between the predictor and outcome. Issues with Confounding

As in our example, a confounder can create an apparent association between the predictor and outcome. A confounder can also mask an association, so it does not look like there is an association originally, but when you stratify, you see there is one. Issues with Confounding

1. Define and Identify Confounding Identify How to Select Confounding Variables for Multivariate Analysis Identify How to Select Confounding Variables for Multivariate Analysis 2. Calculate Risk Ratio and Stratified Risk Ratio 2. Calculate Risk Ratio and Stratified Risk Ratio Overview

Regression methods adjust for multiple confounding variables at once – less time consuming. Logistic Regression Linear Regression Cox Proportional Hazards Regression … and many others Logistic Regression Linear Regression Cox Proportional Hazards Regression … and many others Multiple Confounding Variables

1: The way we just did it. This is probably the most reliable method with a few more steps. Multiple Confounding Variables

2. Include all clinically significant variables or those that are previously identified as confounders. Issues: May have too many confounders Confounding in other studies does NOT mean it is a confounder in yours. Issues: May have too many confounders Confounding in other studies does NOT mean it is a confounder in yours. Multiple Confounding Variables

3: If that variable is significantly associated with the outcome (chi- squared) then include it. Multiple Confounding Variables Sun, G. W., Shook, T. L., & Kay, G. L. (1996). Inappropriate use of bivariable analysis to screen risk factors for use in multivariable analysis. J Clin Epidemiol, 49(8),

3: If that variable is significantly associated with the outcome (chi- squared) then include it. Many issues with this method. Multiple Confounding Variables Sun, G. W., Shook, T. L., & Kay, G. L. (1996). Inappropriate use of bivariable analysis to screen risk factors for use in multivariable analysis. J Clin Epidemiol, 49(8), What is significant?

3: If that variable is significantly associated with the outcome (chi- squared) then include it. Many issues with this method. Multiple Confounding Variables Sun, G. W., Shook, T. L., & Kay, G. L. (1996). Inappropriate use of bivariable analysis to screen risk factors for use in multivariable analysis. J Clin Epidemiol, 49(8), Just because the ‘confounder’ is associated with the predictor doesn’t mean it is associated with the outcome and not in the causal pathway!

4. Automatic Selection Regression Methods Many ways to do this, and relatively reliable with certain methods. Forward Selection Backward Selection Stepwise Many ways to do this, and relatively reliable with certain methods. Forward Selection Backward Selection Stepwise Multiple Confounding Variables

Caveats Need to control for as few confounding variables as possible. Multiple Confounding Variables

Caveats Need to control for as few confounding variables as possible. You are limited by the number of cases of the outcome you have (10:1 Rule) Multiple Confounding Variables

Caveats Need to control for as few confounding variables as possible. You are limited by the number of cases of the outcome you have (10:1 Rule) Some journals just want it done a certain way. Multiple Confounding Variables

1. Define and Identify Confounding Identify How to Select Confounding Variables for Multivariate Analysis Identify How to Select Confounding Variables for Multivariate Analysis 2. Calculate Risk Ratio and Stratified Risk Ratio 2. Calculate Risk Ratio and Stratified Risk Ratio Overview