Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.

Slides:



Advertisements
Similar presentations
The %LRpowerCorr10 SAS Macro Power Estimation for Logistic Regression Models with Several Predictors of Interest in the Presence of Covariates D. Keith.
Advertisements

A Model to Evaluate Recreational Management Measures Objective I – Stock Assessment Analysis Create a model to distribute estimated landings (A + B1 fish)
Logistic Regression Psy 524 Ainsworth.
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Week 3. Logistic Regression Overview and applications Additional issues Select Inputs Optimize complexity Transforming Inputs.
Simple Logistic Regression
Logistic Regression Example: Horseshoe Crab Data
COPYRIGHT OF: ABHINAV ANAND JYOTI ARORA SHRADDHA RAMSWAMY DISCRETE CHOICE MODELING IN HEALTH ECONOMICS.
Overview of Logistics Regression and its SAS implementation
6.1.4 AIC, Model Selection, and the Correct Model oAny model is a simplification of reality oIf a model has relatively little bias, it tends to provide.
Datamining and statistical learning - lecture 9 Generalized linear models (GAMs)  Some examples of linear models  Proc GAM in SAS  Model selection in.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
1 Experimental design and analyses of experimental data Lesson 6 Logistic regression Generalized Linear Models (GENMOD)
Linear statistical models 2009 Models for continuous, binary and binomial responses  Simple linear models regarded as special cases of GLMs  Simple linear.
Linear statistical models 2008 Binary and binomial responses The response probabilities are modelled as functions of the predictors Link functions: the.
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
EPI 809/Spring Multiple Logistic Regression.
Nemours Biomedical Research Statistics April 23, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
1 Modeling Ordinal Associations Section 9.4 Roanna Gee.
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
Deaths of snails vs exposure by species. Deaths of snails vs exposure by temperature.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
WLS for Categorical Data
Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.
Logistic regression for binary response variables.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
1 G Lect 11W Logistic Regression Review Maximum Likelihood Estimates Probit Regression and Example Model Fit G Multiple Regression Week 11.
Logistic Regression I HRP 261 2/09/04 Related reading: chapters and of Agresti.
April 11 Logistic Regression –Modeling interactions –Analysis of case-control studies –Data presentation.
EIPB 698E Lecture 10 Raul Cruz-Cano Fall Comments for future evaluations Include only output used for conclusions Mention p-values explicitly (also.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
2 December 2004PubH8420: Parametric Regression Models Slide 1 Applications - SAS Parametric Regression in SAS –PROC LIFEREG –PROC GENMOD –PROC LOGISTIC.
Applied Epidemiologic Analysis - P8400 Fall 2002
1 היחידה לייעוץ סטטיסטי אוניברסיטת חיפה פרופ’ בנימין רייזר פרופ’ דוד פרג’י גב’ אפרת ישכיל.
LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
Linear vs. Logistic Regression Log has a slightly better ability to represent the data Dichotomous Prefer Don’t Prefer Linear vs. Logistic Regression.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
GEE Approach Presented by Jianghu Dong Instructor: Professor Keumhee Chough (K.C.) Carrière.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 9 Survival Analysis Henian Chen, M.D., Ph.D.
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
Logistic Regression. Linear Regression Purchases vs. Income.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
1 Everyday is a new beginning in life. Every moment is a time for self vigilance.
1 Topic 4 : Ordered Logit Analysis. 2 Often we deal with data where the responses are ordered – e.g. : (i) Eyesight tests – bad; average; good (ii) Voting.
Sigmoidal Response (knnl558.sas). Programming Example: knnl565.sas Y = completion of a programming task (1 = yes, 0 = no) X 2 = amount of programming.
LOGISTIC REGRESSION Binary dependent variable (pass-fail) Odds ratio: p/(1-p) eg. 1/9 means 1 time in 10 pass, 9 times fail Log-odds ratio: y = ln[p/(1-p)]
Logistic Regression Saed Sayad 1www.ismartsoft.com.
1 Say good things, think good thoughts, and do good deeds.
EIPB 698D Lecture 5 Raul Cruz-Cano Spring Midterm Comments PROC MEANS VS. PROS SURVEYMEANS For non–parametric: Kriskal-Wallis.
Exact Logistic Regression
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 3 Type I, II Error, Sample Size, and Power Henian Chen, M.D., Ph.D.
Birthweight (gms) BPDNProp Total BPD (Bronchopulmonary Dysplasia) by birth weight Proportion.
Analysis of matched data Analysis of matched data.
Chapter 13 LOGISTIC REGRESSION. Set of independent variables Categorical outcome measure, generally dichotomous.
Logistic Regression APKC – STATS AFAC (2016).
Notes on Logistic Regression
ביצוע רגרסיה לוגיסטית. פרק ה-2
Applied Epidemiologic Analysis - P8400 Fall 2002
Logistic Regression.
Introduction to Logistic Regression
Case-control studies: statistics
Modeling Ordinal Associations Bin Hu
Presentation transcript:

Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.

Applied Epidemiologic Analysis - P8400 Fall 2002 Logistic Regression for Intercept only SAS Program proc logistic data=case_control978 descending; model status=; run; * Descending: to get the probability and OR for dependent variable=1 SAS Output The LOGISTIC Procedure Model Information Data Set WORK.CASE_CONTROL978 Response Variable status Number of Response Levels 2 Number of Observations 978 Model binary logit Optimization Technique Fisher's scoring

Applied Epidemiologic Analysis - P8400 Fall 2002 Logistic Regression for Intercept only SAS Output Response Profile Ordered Total Value status Frequency Probability modeled is status=1. Model Convergence Status Convergence criterion (GCONV=1E-8) satisfied. -2 Log L = Analysis of Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept <.0001

Applied Epidemiologic Analysis - P8400 Fall 2002 Logistic Regression for Intercept only Log [Y/(1-Y)] = α Y = e α / (1+ e α ) = exp(α) / [1 + exp(α)] In our model, α = , is the log odds of cancer for total sample. The odds ( e α ) is Y = exp( ) / [1 + exp( )] = =200/( ) Y is related to α in Logistic Model

Applied Epidemiologic Analysis - P8400 Fall 2002 Logistic Regression for Dichotomous Predictor Alcohol Consumption (alcgrp): 0=0-39 gm/day; 1=40+ gm/day SAS Program proc logistic data=case_control978 descending; model status=alcgrp; run; SAS Output Model Fit Statistics Criterion Intercept Only Intercept and Covariates -2 Log L Likelihood Ratio Test G = – = df = 1 The model with variable ‘alcgrp’ is significantly.

Applied Epidemiologic Analysis - P8400 Fall 2002 Logistic Regression for Dichotomous Predictor SAS Output Analysis of Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept <.0001 alcgrp <.0001 Odds Ratio Estimates Point 95% Wald Effect Estimate Confidence Limits alcgrp is the log odds of cancer for light drinkers (alcgrp=0). Log odds of cancer for heavy drinkers (alcgrp=1) is –0.827 ( ). Y = for light drinkers, and for heavy drinkers. OR = exp(β) = exp(1.7641) = Heavy drinkers (alcgrp=1) are about 6 times more likely to get cancer than light drinkers (alcgrp=0). OR is not related to α in Logistic Model

Applied Epidemiologic Analysis - P8400 Fall 2002 Logistic Regression for Ordinal Predictor Alcohol Consumption (alcgrp4): 0=0-39 gm/day; 1=40-79 gm/day 2= gm/day; 3=120+ gm/day SAS Program proc logistic data=case_control978 descending; model status=alcgrp4; run; SAS Output Model Fit Statistics Criterion Intercept Only Intercept and Covariates -2 Log L Likelihood Ratio Test G = – = df = 1 The model with variable ‘alcgrp4’ is significantly.

Applied Epidemiologic Analysis - P8400 Fall 2002 Logistic Regression for Ordinal Predictor SAS Output Analysis of Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept <.0001 alcgrp <.0001 Odds Ratio Estimates Point 95% Wald Effect Estimate Confidence Limits alcgrp OR = exp(1.0453) = Men with alcgrp4=1 are about 3 times more likely to get cancer than men with alcgrp4=0. This OR is also for alcgrp4= 1 vs. alcgrp4=2; or alcgrp4=2 vs. alcgrp4=3. OR = exp[(3-1)*1.0453] = exp(2.0906) = for alcgrp4=1 vs. alcgrp4=3 OR = exp[(3-0)*1.0453] = exp(3.1359) = for alcgrp4=0 vs. alcgrp4=3

Applied Epidemiologic Analysis - P8400 Fall 2002 Logistic Regression for Continuous Predictor Alcohol Consumption (alcohol): daily consumption in grams SAS Program proc logistic data=case_control978 descending; model status=alcohol; run; SAS Output Analysis of Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept <.0001 alcohol <.0001 Odds Ratio Estimates Point 95% Wald Effect Estimate Confidence Limits alcohol

Applied Epidemiologic Analysis - P8400 Fall 2002 Logistic Regression for Continuous Predictor OR = exp(0.0261) = The odds of cancer increase by a factor of for each unit in alcohol consumption OR = exp[40*(0.0261)] = exp(1.044) = for a 40-grams increase in alcohol consumption per day OR = exp[120*(0.0261)] = for a man who drinks 160 grams per day compare with a man who is similar in other respects but drinks 40 grams per day.

Applied Epidemiologic Analysis - P8400 Fall 2002 Interaction in Logistic Regression model status = α + β 1 alcgrp + β 2 tobgrp β 1 : the effect of alcohol on cancer, controlling for tobacco (i.e., the same OR across levels of tobacco) β 2 :the effect of tobacco on cancer, controlling for alcohol (i.e., the same OR across levels of alcohol) model status = α + β 1 alcgrp + β 2 tobgrp + β 3 alcgrp*tobgrp β 1 : the effect of alcohol on cancer among non-smokers (tobgrp=0) β 2 :the effect of tobacco on cancer among non-drinkers (alcgrp=0) β 3 : interaction between smokers and drinkers

Applied Epidemiologic Analysis - P8400 Fall 2002 Interaction in Logistic Regression model status = (alcgrp) (tobgrp) –0.98 (alcgrp*tobgrp) Log odds odds A: alcgrp=0 & tobgrp=0 2.28* *0 – 0.98*0*0 = B: alcgrp=1 & tobgrp=0 2.28* *0 – 0.98*1*0 = C: alcgrp=0 & tobgrp=1 2.28* *1 – 0.98*0*1 = D: alcgrp=1 & tobgrp=1 2.28* *1 – 0.98*1*1 = Odds Ratio A vs. B9.78 = 9.78/1.00 A vs. C3.97 = 3.97/1.00 A vs. D14.59 = 14.59/1.00 B vs. D1.49 = 14.59/9.78 C vs. D3.68 = 14.59/3.97