A Conceptual Approach to Multivariable Analysis Mitchell H. Katz, MD Director of Health San Francisco Department of Public Health

Slides:



Advertisements
Similar presentations
If we use a logistic model, we do not have the problem of suggesting risks greater than 1 or less than 0 for some values of X: E[1{outcome = 1} ] = exp(a+bX)/
Advertisements

Interpreting regression for non-statisticians Colin Fischbacher.
Calvet, Guilherme 1 ; Joao, Esau 1* ; Cunha, Cynthia 1 ; Silva, Raulino 1 ; Sidi, Leon 1 ; Silva, Sonia 1 ; Medeiros, Adriana 1 ; Martins, Ezequias 1 ;
Logistic Regression.
Perinatal HIV Case Series Reports for Births in Presenter: Elvia Ledezma Texas Department of State Health Services.
UOG Journal Club: September 2012 Perinatal outcome in women treated with progesterone for the prevention of preterm birth: a meta-analysis Sotiriadis A,
Progesterone Therapy for Preterm Labor Perinatal Conference April 14, 2006.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Maternal and Perinatal Outcomes Associated with a Trial of Labor after Prior Cesarean Delivery Mark B. Landon, M.D., John C. Hauth, M.D., Kenneth J. Leveno,
Journal Club Alcohol, Other Drugs, and Health: Current Evidence January–February 2009.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Statistical presentation in international scientific publications 6. Reporting more complicated findings Malcolm Campbell Lecturer in Statistics, School.
Risk Factors for Recurrent Shoulder Dystocia, Washington State Hillary Moore, MD University of Washington School of Public Health and Community.
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
BIOST 536 Lecture 9 1 Lecture 9 – Prediction and Association example Low birth weight dataset Consider a prediction model for low birth weight (< 2500.
Predictive Analysis in Marketing Research
Correlation and Regression Analysis
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Regression and Correlation
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Gastroschisis and Mode of Delivery: National Trends and Evaluation of Outcomes, 1991 to 2005 T. Mac Bird University of Arkansas for Medical Sciences College.
Presenter : Dr T. G. Nematadzira on behalf of The IMPAACT PROMISE 1077BF/1077FF Team Efficacy and Safety of Two Strategies to Prevent Perinatal HIV Transmission.
Inference for regression - Simple linear regression
Condom Use Questions as Predictors of Urogenital Gonorrhea Adrianne M. Williams, MD, Philana Liang, PA-C, MPH, Renee M. Gindi, MPH, Khalil G. Ghanem, MD,
1 Journal Club Alcohol, Other Drugs, and Health: Current Evidence January–February 2014.
Simple Linear Regression
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 4: Regression Models and Multivariate Analyses.
Racial Disparity in Correlates of Late Preterm Births: A Population-Based Study Shailja Jakhar, Christine Williams, Louis Flick, Jen Jen Chang, Qian Min,
Pediatric ID Previous presentation by Susan Schuval, MD
Impact of Smoke-free laws on Preterm Birth Kristin Ashford, PhD, APRN Joyce Robl, EdD, MS, CGC Ruth Ann Shepherd, MD, FAAP.
Herpes in Pregnancy Max Brinsmead MB BS PhD May 2015.
L 1 Chapter 12 Correlational Designs EDUC 640 Dr. William M. Bauer.
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Week 6: Model selection Overview Questions from last week Model selection in multivariable analysis -bivariate significance -interaction and confounding.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Logistic and Nonlinear Regression Logistic Regression - Dichotomous Response variable and numeric and/or categorical explanatory variable(s) –Goal: Model.
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
Preterm Birth Hazem Al-Mandeel, M.D Course 481 Obstetrics and Gynecology Rotation.
Research Techniques Made Simple: Multivariable Analysis Marlies Wakkee Loes Hollestein Tamar Nijsten Department of Dermatology, Erasmus University Medical.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
HIV DISEASE IN PREGNANCY
Simple linear regression Tron Anders Moger
Transmission of HIV from mother to fetus. - is not simply one of the major health problems today, but also a big problem in the field of human rights.
Multiple Logistic Regression STAT E-150 Statistical Methods.
The Impact of Birth Spacing on Subsequent Feto-Infant Outcomes among Community Enrollees of a Federal Healthy Start Project Hamisu M. Salihu, MD, PhD Euna.
1 Introduction to Modeling Beyond the Basics (Chapter 7)
Instrument design Essential concept behind the design Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics and Demography Faculty of Public.
HIV and Pregnancy. Introduction In the general obstetrical population in the United States, the frequency of HIV infection is about 1 per The prevalence.
Additional Regression techniques Scott Harris October 2009.
Meta-analysis of observational studies Nicole Vogelzangs Department of Psychiatry & EMGO + institute.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Mei-Chun LU, Song-Shan HUANG, Yuan-Horng YAN, Panchalli WANG, Yueh-Han HSU, Wei CHEN Ditmanson Medical Foundation Chia-Yi Christian Hospital, Chia-Yi,
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
Stats Methods at IC Lecture 3: Regression.
UOG Journal Club: July 2016 Ability of a preterm surveillance clinic to triage risk of preterm birth: a prospective cohort study J Min, HA Watson, NL Hezelgrave,
Inonu University, Turgut Ozal Medical Centre
REGRESSION G&W p
Journal Club Notes.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
BMTRY 747: Introduction Jeffrey E. Korte, PhD
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Stats Club Marnie Brennan
Effect Modifiers.
Presentation transcript:

A Conceptual Approach to Multivariable Analysis Mitchell H. Katz, MD Director of Health San Francisco Department of Public Health

Table 2. Bivariate Risk Factors for Perinatal transmission of HIV Maternal Variable No. of Women* No. of Infected Infants (%) Odds Ratio 95% Confidence IntervalP Diagnosis of genital HSV infection during pregnancy Yes216 (28.6) No38140 (10.5) Lack of zidovudine prophylaxis during pregnancy / delivery Yes12421 (16.9) No26625 (9.4) Duration of membrane rupture (h) >4> (19.2) < (7.4) Gestational age at delivery (wk) < (27.3) <.001 > (8.6) * Numbers for some variables do not add up to total because of missing data. From:Chen KT, et al. Genital herpes simplex virus infection and perinatal transmission of human immunodeficiency virus. Obstet Gynecol. 2005;106:

Table 3. Risk Factors for Perinatal Transmission of HIV Risk FactorAdjusted OR95% CIP Diagnosis of genital HSV infection during pregnancy Lack of zidovudine prophylaxis during pregnancy / delivery Rupture of membranes >4 hours Delivery at < 37 weeks of gestation

Why do Multivariable Analyses? Identify independent predictors of outcome.

Table 4. Does indicated pre-term delivery decrease neonatal morbidity in infants <1000 grams? Intraventricular hemorrhage (III/IV) YesNoTotal Indicated pre-term delivery 9 (5.8 %) 147 (94.2 %) 156 Spontaneous rupture of membranes 38 (14.9%) 217 (85.1 %) OR = 0.35 (0.16 – 0.76)

Indicated Pre-term labor Spontaneous Pre-term labor Gestational age28 weeks26 weeks

Multivariable analysis adjusting for gestational age Indicated pre-term versus spontaneous delivery Bivariate odds ratio for intraventricular hemorrhage = 0.35 ( ) Indicated pre-term versus spontaneous delivery Multivariable odds ratio for intraventricular hemorrhage = 0.66 ( ) (adjusting for gestational age)

Why do Multivariable Analyses? Identify independent predictors of outcome. Apparent associations between a risk factor and an outcome may actually be due to a third factor: a confounder.

Randomized group assignment Outcome Potential confounder

Randomized controlled trial of preoperative vaginal preparation with povidone-lodine versus abdominal scrub only prior to cesarean section Vaginal scrubAbdominal scrub only EndometritisYes10 (7%)24 (14%) No132 (93%)142 (86%) OR =.45 ( ) From:Starr R, et al. Preoperative vaginal preparation with povidone-lodine and the risk of postcesarean endometritis. Obstet Gynecol. 2005;105:

Table 5. Multivariate Analysis of Factors Affecting Risk for Postcesarean Endometritis (N=308) Variable Adjusted Odds Ratio 95% Confidence Interval Vaginal scrub Severe anemia (hematocrit <30%) Use of intrapartum internal monitors History of antenatal genitourinary infections

Does the child with CSF pleocytosis have bacterial meningitis? Multivariable Logistic Regression Analysis* Predictor β Coefficient P Value 95% CI Gram stain5.6< CSF protein >80 mg / dL2.2< Peripheral ANC > cells / mm Seizure at or before presentation CSF ANC >1000 cells / mm From:Nigrovic LE, et al. Development and validation of a multivariable predictive model to distinguish bacterial from aseptic meningitis in children in the post- Haemophilus influenzae era. Pediatrics. 2002;110:

Causality versus Prediction Models (Diagnostic/Prognostic) For prediction models it does matter whether variables cause outcome. Association is all that is needed. Prediction models must be stronger models because decision will be made based on them. For causal models small percent of variance is enough. Prediction models will not generally be used unless they are replicated successfully.

Why do Multivariable Analyses? Identify independent predictors of outcome. Apparent associations between a risk factor and an outcome may actually be due to a third factor: a confounder. Construct predictive and diagnostic models. Variation in subjects’ risk factors within study groups can result in incorrect estimates of treatment effect in nonlinear models.

Why is multivariable analysis harder? Multidimensional instead of a flat plane. Harder to assess whether the model fits the data.

Table 3.1 Type of outcome variable determines choice of multivariable analysis. Type of outcomeExample of outcome variableType of multivariable analysis* IntervalBlood pressure, weight, temperatureMultiple linear regression Analysis of variance (and related procedures) Dichotomous Death, cancer, intensive care unit admission Multiple logistic regression Time to occurrence of a dichotomous event Time to death, time to cancerProportional hazards analysis Rare outcomes and counts Time to leukemia, number of infectionsPoisson regression * This text focuses on those procedures that are bolded.

What type of independent variables can be included in multivariable analysis? YesNo DichotomousNominal IntervalOrdinal Create multiple dichotomous variables for nominal and ordinal variables.

What independent variables should be entered into model? YesNo Variable of interest Intervening variables Known confounders Variables not on the causal path Potential confounders Variables with lots of missing data

Figure 5. Maternal viral HIV viral load is an intervening variable between antiretroviral treatment and perinatal transmission.

Prediction of Preterm delivery Multivariable Odds Ratio Low plasma protein A2.2 Elevated AFP2.1 If both risk factors, risk for preterm delivery 2.2 x 2.1 = 4.6 From:Smith GCS, et al. Pregnancy-associated plasma protein A and alpha-fetoprotein and prediction of adverse perinatal outcome. Obstet Gynecol. 2006;107:161-6.

Important assumptions of multivariable analysis Observations independent Censoring is random Proportionality assumption Multiplicative assumption

Checking the assumptions of the model Plot the residuals (the difference between the observed and the estimated value). Assess the pattern.

Limitations of multivariable analysis Can only statistically adjust for those confounders you know (have measured). Models are chosen that we believe fit the data, but the fit is always imperfect.