Proportional Hazard Regression Cox Proportional Hazards Modeling (PROC PHREG)

Slides:



Advertisements
Similar presentations
Allison Dunning, M.S. Research Biostatistician
Advertisements

Surviving Survival Analysis
Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
One-sample T-Test Matched Pairs T-Test Two-sample T-Test
Survival Analysis In many medical studies, the primary endpoint is time until an event occurs (e.g. death, remission) Data are typically subject to censoring.
Exploring the Shape of the Dose-Response Function.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Simple Logistic Regression
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
HSRP 734: Advanced Statistical Methods July 24, 2008.
Does Background Music Influence What Customers Buy?
Multiple Linear Regression
Multiple Logistic Regression RSQUARE, LACKFIT, SELECTION, and interactions.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
بسم الله الرحمن الرحیم. Generally,survival analysis is a collection of statistical procedures for data analysis for which the outcome variable of.
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
Survival analysis1 Every achievement originates from the seed of determination.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
Regression Diagnostics Using Residual Plots in SAS to Determine the Appropriateness of the Model.
Plots, Correlations, and Regression Getting a feel for the data using plots, then analyzing the data with correlations and linear regression.
BIOST 536 Lecture 9 1 Lecture 9 – Prediction and Association example Low birth weight dataset Consider a prediction model for low birth weight (< 2500.
Mean Comparison With More Than Two Groups
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Two-Way ANOVA in SAS Multiple regression with two or
Introduction to Survival Analysis PROC LIFETEST and Survival Curves.
Model Checking in the Proportional Hazard model
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Cox Proportional Hazards Regression Model Mai Zhou Department of Statistics University of Kentucky.
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
Analysis of Complex Survey Data
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
STT : BIOSTATISTICS ANALYSIS Dr. Cuixian Chen Chapter 8: Fitting Parametric Regression Models STT
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
G Lecture 121 Analysis of Time to Event Survival Analysis Language Example of time to high anxiety Discrete survival analysis through logistic regression.
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
Assessing Survival: Cox Proportional Hazards Model
Time-dependent covariates and further remarks on likelihood construction Presenter Li,Yin Nov. 24.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
Testing Hypotheses about Differences among Several Means.
01/20151 EPI 5344: Survival Analysis in Epidemiology Survival curve comparison (non-regression methods) March 3, 2015 Dr. N. Birkett, School of Epidemiology,
HSRP 734: Advanced Statistical Methods July 17, 2008.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 9 Survival Analysis Henian Chen, M.D., Ph.D.
+ Chi Square Test Homogeneity or Independence( Association)
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
Survival Analysis 1 Always be contented, be grateful, be understanding and be compassionate.
Lecture 12: Cox Proportional Hazards Model
1 STA 617 – Chp9 Loglinear/Logit Models 9.7 Poisson regressions for rates  In Section 4.3 we introduced Poisson regression for modeling counts. When outcomes.
Survival Analysis approach in evaluating the efficacy of ARV treatment in HIV patients at the Dr GM Hospital in Tshwane, GP of S. Africa Marcus Motshwane.
We’ll now look at the relationship between a survival variable Y and an explanatory variable X; e.g., Y could be remission time in a leukemia study and.
01/20151 EPI 5344: Survival Analysis in Epidemiology Estimating S(t) from Cox models March 24, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
01/20151 EPI 5344: Survival Analysis in Epidemiology Confounding and Effect Modification March 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
Handout Twelve: Design & Analysis of Covariance
X Treatment population Control population 0 Examples: Drug vs. Placebo, Drugs vs. Surgery, New Tx vs. Standard Tx  Let X = decrease (–) in cholesterol.
01/20141 EPI 5344: Survival Analysis in Epidemiology Estimating S(t) from Cox models April 1, 2014 Dr. N. Birkett, Department of Epidemiology & Community.
Additional Regression techniques Scott Harris October 2009.
Practical Solutions Additional Regression techniques.
03/20161 EPI 5344: Survival Analysis in Epidemiology Estimating S(t) from Cox models March 29, 2016 Dr. N. Birkett, School of Epidemiology, Public Health.
Bivariate analysis. * Bivariate analysis studies the relation between 2 variables while assuming that other factors (other associated variables) would.
Methods and Statistical analysis. A brief presentation. Markos Kashiouris, M.D.
Carolinas Medical Center, Charlotte, NC Website:
Logistic Regression APKC – STATS AFAC (2016).
April 18 Intro to survival analysis Le 11.1 – 11.2
Table 1 Characteristics of study population, by pneumococcal vaccination status. From: Prior Pneumococcal Vaccination Is Associated with Reduced Death,
Introduction to Logistic Regression
If we can reduce our desire,
Additional Regression techniques
Presentation transcript:

Proportional Hazard Regression Cox Proportional Hazards Modeling (PROC PHREG)

Consider the following data: Drug addicts are enrolled in two different residential treatment programs that differ in length (treat = 0 is short, treat = 1 is long). The patients are assigned to two different sites (site = 0 is site A, site = 1 is site B). Herco indicates heroine and cocaine use in the past three months (1= heroine and cocaine use, 2 = heroine or cocaine use, 3 = neither heroine or cocaine use). Other variables recorded were age at time of enrollment, ndrugtx (number of previous drug treatments), time until return to drug use, and censor (1=return to drug use, 0 = censored).

Reading a SAS Data Set into SAS You will need to save the data set uis_small to your computer. It is a SAS data set, and it can be read into a SAS program using the following code (making the appropriate adjustment to the file location):

To make sure the data set was read in properly, print out the first 10 observations: The SAS System Obs ID age ndrugtx treat site time censor herco

First compare survival rates for the three categorical variables of treat, site and herco:

The Wilcoxon and Log-Rank Tests (output not shown) are statistically significant (p = , p = , respectively). Treatment affects risk of returning to drug use.

The Wilcoxon and Log-Rank Tests (output not shown) are not statistically significant (p = , p = , respectively). Site does not affect risk of returning to drug use.

The Wilcoxon and Log-Rank Tests (output not shown) are not statistically significant (p = , p = , respectively). Herco does not affect risk of returning to drug use, although the curves do cross initially, so this may affect these statistical tests.

Now examine if ndrugtx and age affect the risk of returning to drug use. Because these are continuous variables, we will use proportional hazard regression (PROC PHREG):

Output from PHREG: ndrugtx

Interpreting the Output The proportional hazards regression model for these data with ndrugtx as the predictor is: λ (t) = λ o (t)exp( *ndrugtx) The relative risk of a 1 unit increase in the number of previous drug treatments is: = λ o (t)exp( *1)/ λ o (t)exp( *0) = exp( ) = exp( ) = 1.03 With each increase in the number of prior drug treatments, the risk of relapsing increases by 3% ( ). Notice that the SAS output also gives you this relative risk under “Hazard Ratio.” This term is significant (p<0.0001), which indicates that prior drug treatments affect risk of relapse.

Output from PHREG: age

Interpreting the Output: Age The proportional hazards regression model for these data with age as the predictor is: λ (t) = λ o (t)exp( *age) The relative risk of a 1 year increase in age at enrollment is: = λ o (t)exp( *1)/ λ o (t)exp( *0) = exp( ) = exp( ) = With each year increase in age of enrollment, the risk of relapsing decreases by 1.3% ( ). Notice that the SAS output also gives you this relative risk under “Hazard Ratio.” Age is not significantly related to risk, however (p=0.735).

The Full Model First consider the full model with all of the predictor variables. As part of the PHREG procedure, we will create 2 new variables: herco2 and herco3. In addition, we will conduct a test labeled “herco” to determine whether both of these variables together are significant.

Results from “herco” test

The test of our two new variables, herco2 and herco3, is non-significant (p = ), so we will drop herco from our model and run the refitted model.

Output from Model w/o herco

All of the terms in the model are significant, except for site, which is approaching significance. Because we know from previous research that site is important, we will leave it in our model. We will now check six different interactions in our model, to see if any significant ones exist: ndrugtx*age, ndrugtx*treat, ndrugtx*site, age*treat, age*site, treat*site

Adding ndrugtx*age to the model (notice you can create the interaction term within the PHREG procedure):

ndrugtx*age interaction not significant

Adding ndrugtx*treat to the model

ndrugtx*treat not significant

Adding ndrugtx*site to the model

ndrugtx*site not significant

Adding age*treat to the model

age*treat not significant

Adding age*site to the model

age*site interaction IS significant

Adding treat*site to the model

treat*site not significant

Final Model Selection Not only was the age*site interaction significant, but once we included it in our model, the site term also became statistically significant. The final proportional hazard model is: λ (t) = λ o (t)exp( β 1 *age + β 2 *ndrugtx + β 3 *treat + β 4 *site + β 5 *treatsite) λ (t) = λ o (t)exp(-0.034*age *ndrugtx – 0.267*treat – 1.246*site *treatsite)

Testing Proportionality The Cox proportional hazard regression we have just conducted assumes that the risks are proportional, that is, that the proportion is constant over time. To test this assumption of proportionality, we use time-dependent variables and test whether they are significant. If they are not significant, it means that time does not affect the relative risk, and we can conclude that the risks in our model are proportional.

Creating and testing time-dependent varibles (on the log scale):

Testing Proportionality Output

The test we labeled “test_proportionality” is not significant (p = ), which means that none of our time-dependent variables are significant. We can assume proportionality over time.

If we cannot assume proportionality… If the assumption of proportionality was not met, we could stratify across the variable that does not have a proportionate risk. For example, if we found the variable treat to be not proportional, we could stratify on that variable:

Output stratifying on treat