Adjusting for extraneous factors Topics for today More on logistic regression analysis for binary data and how it relates to the Wolf and Mantel- Haenszel.

Slides:



Advertisements
Similar presentations
The %LRpowerCorr10 SAS Macro Power Estimation for Logistic Regression Models with Several Predictors of Interest in the Presence of Covariates D. Keith.
Advertisements

M2 Medical Epidemiology
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Logistic Regression.
Simple Logistic Regression
Logistic Regression Example: Horseshoe Crab Data
Chapter 19 Stratified 2-by-2 Tables
Logistic Regression Part I - Introduction. Logistic Regression Regression where the response variable is dichotomous (not continuous) Examples –effect.
1 If we live with a deep sense of gratitude, our life will be greatly embellished.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Header= Verdana 28 pt., Red 1 STA 517 – Chapter 3: Inference for Contingency Tables 3. Inference for Contingency Tables 3.1 Confidence Intervals for Association.
EPI 809 / Spring 2008 Final Review EPI 809 / Spring 2008 Ch11 Regression and correlation  Linear regression Model, interpretation. Model, interpretation.
Adjusting for extraneous factors Topics for today Stratified analysis of 2x2 tables Regression Readings Jewell Chapter 9.
Linear statistical models 2009 Models for continuous, binary and binomial responses  Simple linear models regarded as special cases of GLMs  Simple linear.
1 More on exposure/response associations Readings Jewell Chapters 3 & 7.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
1 Quantifying Disease-exposure association Readings Jewell Chapter 4 Rothman and Greenland Chapter 4.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
EPI 809/Spring Multiple Logistic Regression.
Regression Topics for today Readings Jewell Chapters 12, 13, 14 & 15.
1 Modeling Ordinal Associations Section 9.4 Roanna Gee.
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
Deaths of snails vs exposure by species. Deaths of snails vs exposure by temperature.
1 PH 240A: Chapter 12 Mark van der Laan University of California Berkeley (Slides by Nick Jewell)
Categorical Data Analysis: Stratified Analyses, Matching, and Agreement Statistics Biostatistics March 2007 Carla Talarico.
OLS versus MLE Example YX Here is the data:
WLS for Categorical Data
Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
Analysis of Categorical Data
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Logistic Regression for binary outcomes
April 11 Logistic Regression –Modeling interactions –Analysis of case-control studies –Data presentation.
EIPB 698E Lecture 10 Raul Cruz-Cano Fall Comments for future evaluations Include only output used for conclusions Mention p-values explicitly (also.
POTH 612A Quantitative Analysis Dr. Nancy Mayo. © Nancy E. Mayo A Framework for Asking Questions Population Exposure (Level 1) Comparison Level 2 OutcomeTimePECOT.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
2 December 2004PubH8420: Parametric Regression Models Slide 1 Applications - SAS Parametric Regression in SAS –PROC LIFEREG –PROC GENMOD –PROC LOGISTIC.
October 15. In Chapter 19: 19.1 Preventing Confounding 19.2 Simpson’s Paradox 19.3 Mantel-Haenszel Methods 19.4 Interaction.
Applied Epidemiologic Analysis - P8400 Fall 2002
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Different Distributions David Purdie. Topics Application of GEE to: Binary outcomes: – logistic regression Events over time (rate): –Poisson regression.
1 היחידה לייעוץ סטטיסטי אוניברסיטת חיפה פרופ’ בנימין רייזר פרופ’ דוד פרג’י גב’ אפרת ישכיל.
LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
1 Topic 2 LOGIT analysis of contingency tables. 2 Contingency table a cross classification Table containing two or more variables of classification, and.
© Department of Statistics 2012 STATS 330 Lecture 20: Slide 1 Stats 330: Lecture 20.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
1 Topic 4 : Ordered Logit Analysis. 2 Often we deal with data where the responses are ordered – e.g. : (i) Eyesight tests – bad; average; good (ii) Voting.
Sigmoidal Response (knnl558.sas). Programming Example: knnl565.sas Y = completion of a programming task (1 = yes, 0 = no) X 2 = amount of programming.
A first order model with one binary and one quantitative predictor variable.
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
1 Say good things, think good thoughts, and do good deeds.
1 Introduction to Modeling Beyond the Basics (Chapter 7)
Probability and odds Suppose we a frequency distribution for the variable “TB status” The probability of an individual having TB is frequencyRelative.
Logistic Regression and Odds Ratios Psych DeShon.
Analysis of matched data Analysis of matched data.
1 Linear Regression Model. 2 Types of Regression Models.
March 28 Analyses of binary outcomes 2 x 2 tables
Multiple logistic regression
ביצוע רגרסיה לוגיסטית. פרק ה-2
Saturday, August 06, 2016 Farrokh Alemi, PhD.
Logistic Regression.
Introduction to Logistic Regression
Categorical Data Analysis
Logistic Regression.
Presentation transcript:

Adjusting for extraneous factors Topics for today More on logistic regression analysis for binary data and how it relates to the Wolf and Mantel- Haenszel estimates of a common odds ratios Interpreting logistic regression analysis Estimating a common risk ratio in the presence of a stratification factor. Connection to Poisson regression. Readings Jewell Chapter 9

Stratified analysis for binary data Data from the ith stratum: DNot D Eaiai bibi Not Ecici didi Variance formulae in Jewell Chapter 9

Regression-based stratified analysis for Berkeley data data berkeley; input stratum male a b ; cards; run; data berkeley; set berkeley; n=a+b; proc genmod; class stratum; model a/n=male stratum/dist=binomial; run;

Stratified analysis Standard 95% Conf Chi- Parameter DF Estimate Error Limits Square Pr > ChiSq Intercept <.0001 male stratum <.0001 stratum <.0001 stratum <.0001 stratum <.0001 stratum <.0001 stratum Scale Wolf estimate: SE= CI: ( , ) Mantel-Haenszel (have to take logs of this estimate): CI: ( , ) Lets talk about the rest of these regression results … what do they mean?

Interpreting the logistic regression model for the Berkeley data We are fitting the following model: Based on the fitted model, we can predict the admission probabilities in each cell

Observed and predicted values for Berkeley Data GenderDeptLogit of admit probability Predicted prob Observed prob N Female = %108 Male = %825 Female = %25 Male =63%560 Female %593 Male =37%325 Female %375 Male =33%417 Female %393 Male =28%191 Female %341 Male =.066%373

More on the Berkeley logistic regression analysis In addition to providing an estimate of the overall gender effect, the logistic regression analysis allows us to compare admission rates between the departments. Based on the observed data, what is the log odds ratio for admission to department 5 versus department 6 for females? What about for males? What about department 1 versus department 6 for males? Females?

Another example We can add additional factors into the logistic regression model so as to obtain an estimate of the log-odds ratio, adjusting for these additional factors. Example, smoking in the Epilepsy study. Lets look in SAS: proc freq ; table one3*cig2 /chisq; run;

Epilepsy data in SAS

Standard Wald 95% Confidence Chi- Parameter DF Estimate Error Limits Square Pr > ChiSq Intercept <.0001 DRUG DRUG DRUG Scale Standard Wald 95% Confidence Chi- Parameter DF Estimate Error Limits Square Pr > ChiSq Intercept <.0001 DRUG DRUG DRUG CIG Scale

Why don’t drug estimates change much?? Hint – look at association between drug and smoking proc freq ; table one3*cig2 /chisq; run;

Relative Risks We’ve talked about estimating odds ratios while adjusting for another factor. Several approaches: 1.Cochran-Mantel-Haenszel test 2.Wolf estimate of the adjusted logodds ratio 3.Mantel-Haenszel estimate of adjusted odds ratio 4.Logistic regression Lets turn now to analogous consideration for risk ratios or relative risks

Example from Jewell Table 9.2 Weight in pounds Behavior type CHDNo CHDRR (log RR) <=150A B10305(.924) A B10270(.832) A B21297(.298) A B19253(.825) 180+A B19361(.993) Relationship between behavior (type A vs type B personality) and coronary heart disease events (see p82 in Jewell for description). Unadjusted RR was 2.2 with 95% CI of (1.72, 2.87). Since weight is important, we need to adjust for it too

Wolf’s adjusted relative risk

Mantel-Haenszel adjusted relative risk

CHD example Jewell table 9.7 provides the weights for the Wolf and Mantel- Haenszel methods Lets look at using Poisson regression to do the adjustment.

Fitting the CHD model in SAS data chd; input weight behavior a b; cards; Run; data cdh; set chd; lnn=log(a+b); proc genmod; model a=behavior/dist=poisson offset=lnn; proc genmod; class weight; model a=behavior weight/dist=poisson offset=lnn; run; Notice the inclusion of the offset term corresponding to log of the sample size in each cell

SAS results Unadjusted Adjusted How do we interpret the weight coefficients?

Lets analyze the arsenic data in SAS data mlung; input village conc age atrisk events; cards; ……………………… ………………… run; data mlung; set mlung; latrisk=log(atrisk); proc genmod; class conc; model events= conc /dist=poisson offset=latrisk; where conc=0 | conc>900; run; proc genmod; class age conc; model events= conc age /dist=poisson offset=latrisk; where conc=0 | conc>900; run;

Results Unadjusted analysis Adjusted analysis Why does the concentration effect change? How do we interpret the age effects? How to add in all the concentrations?