EPI 809/Spring 20081 Multiple Logistic Regression.

Slides:



Advertisements
Similar presentations
EPI 809/Spring Two main complications of analysis of single exposure effect (1) Effect modifier (2) Confounding factor - useful information - useful.
Advertisements

Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Simple Logistic Regression
Trashball: A Logistic Regression Classroom Activity Christopher Morrell (Joint work with Richard Auer) Mathematics and Statistics Department Loyola University.
1 If we live with a deep sense of gratitude, our life will be greatly embellished.
Exact Logistic Regression Larry Cook. Outline Review the logistic regression model Explore an example where model assumptions fail –Brief algebraic interlude.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
EPI 809 / Spring 2008 Final Review EPI 809 / Spring 2008 Ch11 Regression and correlation  Linear regression Model, interpretation. Model, interpretation.
Adjusting for extraneous factors Topics for today Stratified analysis of 2x2 tables Regression Readings Jewell Chapter 9.
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Nemours Biomedical Research Statistics April 23, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
1 Modeling Ordinal Associations Section 9.4 Roanna Gee.
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
WLS for Categorical Data
Adjusting for extraneous factors Topics for today More on logistic regression analysis for binary data and how it relates to the Wolf and Mantel- Haenszel.
Generalized Linear Models
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Logistic Regression In logistic regression the outcome variable is binary, and the purpose of the analysis is to assess the effects of multiple explanatory.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
MODELS OF QUALITATIVE CHOICE by Bambang Juanda.  Models in which the dependent variable involves two ore more qualitative choices.  Valuable for the.
Analysis of Categorical Data
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
Simple Linear Regression
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
April 11 Logistic Regression –Modeling interactions –Analysis of case-control studies –Data presentation.
EIPB 698E Lecture 10 Raul Cruz-Cano Fall Comments for future evaluations Include only output used for conclusions Mention p-values explicitly (also.
Assessing Survival: Cox Proportional Hazards Model
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
2 December 2004PubH8420: Parametric Regression Models Slide 1 Applications - SAS Parametric Regression in SAS –PROC LIFEREG –PROC GENMOD –PROC LOGISTIC.
Applied Epidemiologic Analysis - P8400 Fall 2002
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
1 היחידה לייעוץ סטטיסטי אוניברסיטת חיפה פרופ’ בנימין רייזר פרופ’ דוד פרג’י גב’ אפרת ישכיל.
LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Introduction to Biostatistics (ZJU 2008) Wenjiang Fu, Ph.D Associate Professor Division of Biostatistics, Department of Epidemiology Michigan State University.
1 Topic 2 LOGIT analysis of contingency tables. 2 Contingency table a cross classification Table containing two or more variables of classification, and.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Multiple Regression  Similar to simple regression, but with more than one independent variable R 2 has same interpretation R 2 has same interpretation.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
1 Topic 4 : Ordered Logit Analysis. 2 Often we deal with data where the responses are ordered – e.g. : (i) Eyesight tests – bad; average; good (ii) Voting.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
LOGISTIC REGRESSION Binary dependent variable (pass-fail) Odds ratio: p/(1-p) eg. 1/9 means 1 time in 10 pass, 9 times fail Log-odds ratio: y = ln[p/(1-p)]
Logistic Regression Analysis Gerrit Rooks
1 Say good things, think good thoughts, and do good deeds.
1 Introduction to Modeling Beyond the Basics (Chapter 7)
Probability and odds Suppose we a frequency distribution for the variable “TB status” The probability of an individual having TB is frequencyRelative.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Nonparametric Statistics
Logistic Regression For a binary response variable: 1=Yes, 0=No This slide show is a free open source document. See the last slide for copyright information.
Analysis of matched data Analysis of matched data.
Logistic Regression.
Notes on Logistic Regression
Chapter 13 Nonlinear and Multiple Regression
Multiple logistic regression
Logistic Regression.
Introduction to Logistic Regression
Presentation transcript:

EPI 809/Spring Multiple Logistic Regression

EPI 809/Spring Logistic regression  Often response variables in health studies are binary, eg. disease vs no disease, damage vs no damage, death vs live, etc.  To explain the variability of the binary variable by other variables, either continuous or categorical, such as age, sex, BMI, marriage status, socio-economic status, etc. use a statistical model to relate the probability of the response event to the explanatory variables.

EPI 809/Spring Logistic regression  Prob of event labeled as binary outcome  Event (Y = 1), no event (Y = 0) model the mean: E(Y) = P(Y=1) * 1 + P(Y=0) *0 = P(Y=1) but π = P(Y=1) is between 0 and 1 while β 0 + x 1 β 1 + x 2 β 2 + x 3 β 3 +… is a linear combination and may take any value. A transformation is required. A transformation is required.  Logistic regression model: logit function. log [ π / (1- π )] = β 0 + x 1 β 1 +… + x p β p = η equivalent to p = exp(η) / [ 1+ exp(η) ]

EPI 809/Spring Assumptions of Logistic Regression  The independent variables are linear in the logit which may contain interaction and power terms  The dependent variable is binary Y=0 or 1  The independent variables may be binary, categorical, continuous Multiple Logistic Regression

EPI 809/Spring Multiple Logistic Regression- Formulation The relationship between π and x is S shaped The logit (log-odds) transformation (link function)

EPI 809/Spring  Individually H o : βk = 0  Globally H o : β m =… β m+t = 0 while controlling for confounders and other important determinants of the event Multiple Logistic Regression Assess risk factors

EPI 809/Spring Interpretation of the parameters  If π is the probability of an event and O is the odds for that event then  The link function in logistic regression gives the log- odds

EPI 809/Spring Interpretation of parameter β in logistic regression Model : logit ( π ) = β 0 + x 1 β 1 Y=1Y=0X=1 X=0

EPI 809/Spring Snoring & Heart Disease: Logistic Example  An epidemiologic study surveyed 2484 subjects to examine whether snoring was a possible risk factor for heart disease. Snoring NearlyEvery Heart Disease NeverOccasional Every night Night Yes No Prop(yes)

EPI 809/Spring Constructing Indicator variables  Let Z 1 =1 if occasional, 0 otherwise  Let Z 2 =1 if nearly every night, 0 otherwise  Let Z 3 =1 if every night, 0 otherwise

EPI 809/Spring SAS Codes data hd; input hd $ snoring $ count; Z1=(snoring="occa");Z2=(snoring="nearly");Z3=(snoring="every");cards; yes never 24 yes occa 35 yes nearly 21 yes every 30 no never 1355 no occa 603 no nearly 192 no every 224 ; run; proc logistic data=hd descending; model hd (event=“yes”) =Z1 Z2 Z3; freq count; run;

EPI 809/Spring SAS OUTPUT The LOGISTIC Procedure Analysis of Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept <.0001 Z <.0001 Z <.0001 Z <.0001 Odds Ratio Estimates Point 95% Wald Effect Estimate Confidence Limits Z Z Z

EPI 809/Spring Calculating Probabilities  The fitted logistic regression function is Logit( π )= Z Z Z 3  So, the probability of heart disease if never snore is exp( )/(1+exp( ))=.0174  If snore occasionally, exp( )/(1+exp( )) exp( )/(1+exp( )) =.0549 =.0549

EPI 809/Spring Calculating Odds Ratios  If Z 1 =Z 2 =Z 3 =0, then odds are exp( )  If Z 2 =Z 3 =0, but Z 1 =1, then odds are exp( )  The ratio of odds is then exp(1.1869)  What is the odds ratio for comparing those who snore nearly every night with occasional snorers?  What is the odds ratio for comparing those who snore every night with those who snore nearly every night?

EPI 809/Spring Old example – Genetic Association study  Idiopathic Pulmonary Fibrosis (IPF) is known to be associated with age and gender (older and male are more likely)  One study had 174 cases and 225 controls found association of IPF with one gene genotype COX (C  T).  P-value by Pearson Chi-squares test: p =  Q: Is this association true? GenotypeCCCTTTtotal Case Control Total

EPI 809/Spring Old example on genetic effect of SNP COX on IPF  Logistic regression model logit [Pr(IPF)] = intercept + snp + sex + age  Results: Wald  EffectDFChi-squareP-value SNP sex age <.0001

EPI 809/Spring Conclusion: What we have learned 1. Define study designs 2. Measures of effects for categorical data (unstratified analysis) 3. Confounders and effects modifications 4. Stratified analysis (Mantel-Haenszel statistic, multiple logistic regression) 5. Use of SAS Proc FREQ and Proc Logistic

EPI 809/Spring Conclusion: Further readings Read textbook for Read textbook for 1. Power and sample size calculations 2. Tests in matched pair studies