03/20161 EPI 5344: Survival Analysis in Epidemiology Estimating S(t) from Cox models March 29, 2016 Dr. N. Birkett, School of Epidemiology, Public Health.

Slides:



Advertisements
Similar presentations
HSRP 734: Advanced Statistical Methods July 24, 2008.
Advertisements

April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
Proportional Hazard Regression Cox Proportional Hazards Modeling (PROC PHREG)
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Using time-dependent covariates in the Cox model THIS MATERIAL IS NOT REQUIRED FOR YOUR METHODS II EXAM With some examples taken from Fisher and Lin (1999)
Introduction to Survival Analysis PROC LIFETEST and Survival Curves.
Model Checking in the Proportional Hazard model
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Cox Proportional Hazards Regression Model Mai Zhou Department of Statistics University of Kentucky.
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
Analysis of Complex Survey Data
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
SAS Lecture 5 – Some regression procedures Aidan McDermott, April 25, 2005.
STT : BIOSTATISTICS ANALYSIS Dr. Cuixian Chen Chapter 8: Fitting Parametric Regression Models STT
01/20141 EPI 5344: Survival Analysis in Epidemiology Quick Review and Intro to Smoothing Methods March 4, 2014 Dr. N. Birkett, Department of Epidemiology.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
1 Survival Analysis Biomedical Applications Halifax SAS User Group April 29/2011.
HSRP 734: Advanced Statistical Methods July 10, 2008.
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Inference for a Single Population Proportion (p).
1 Introduction to medical survival analysis John Pearson Biostatistics consultant University of Otago Canterbury 7 October 2008.
Assessing Survival: Cox Proportional Hazards Model
EHA: More On Plots and Interpreting Hazards Sociology 229A: Event History Analysis Class 9 Copyright © 2008 by Evan Schofer Do not copy or distribute without.
01/20151 EPI 5344: Survival Analysis in Epidemiology Maximum Likelihood Estimation: An Introduction March 10, 2015 Dr. N. Birkett, School of Epidemiology,
01/20151 EPI 5344: Survival Analysis in Epidemiology Interpretation of Models March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive.
Time-dependent covariates and further remarks on likelihood construction Presenter Li,Yin Nov. 24.
01/20151 EPI 5344: Survival Analysis in Epidemiology SAS code and output February 24, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive.
01/20151 EPI 5344: Survival Analysis in Epidemiology Time varying covariates March 24, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive.
01/20151 EPI 5344: Survival Analysis in Epidemiology Epi Methods: why does ID involve person-time? March 10, 2015 Dr. N. Birkett, School of Epidemiology,
01/20141 EPI 5344: Survival Analysis in Epidemiology Epi Methods: why does ID involve person-time? March 13, 2014 Dr. N. Birkett, Department of Epidemiology.
01/20151 EPI 5344: Survival Analysis in Epidemiology Survival curve comparison (non-regression methods) March 3, 2015 Dr. N. Birkett, School of Epidemiology,
Lecture 13: Cox PHM Part II Basic Cox Model Parameter Estimation Hypothesis Testing.
HSRP 734: Advanced Statistical Methods July 17, 2008.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
HSRP 734: Advanced Statistical Methods July 31, 2008.
1 THE ROLE OF COVARIATES IN CLINICAL TRIALS ANALYSES Ralph B. D’Agostino, Sr., PhD Boston University FDA ODAC March 13, 2006.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 9 Survival Analysis Henian Chen, M.D., Ph.D.
School of Epidemiology, Public Health &
01/20141 EPI 5344: Survival Analysis in Epidemiology SAS code and output March 4, 2014 Dr. N. Birkett, Department of Epidemiology & Community Medicine,
Lecture 3 Topic - Descriptive Procedures Programs 3-4 LSB 4:1-4.4; 4:9:4:11; 8:1-8:5; 5:1-5.2.
Lecture 12: Cox Proportional Hazards Model
Lecture 16: Regression Diagnostics I Proportional Hazards Assumption -graphical methods -regression methods.
Chapter 4 concerns various SAS procedures (PROCs). Every PROC operates on: –the most recently created dataset –all the observations –all the appropriate.
Lecture 10: Hypothesis Testing II Weight Functions Trend Tests.
Survival Analysis approach in evaluating the efficacy of ARV treatment in HIV patients at the Dr GM Hospital in Tshwane, GP of S. Africa Marcus Motshwane.
01/20151 EPI 5344: Survival Analysis in Epidemiology Actuarial and Kaplan-Meier methods February 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
01/20151 EPI 5344: Survival Analysis in Epidemiology Cox regression: Introduction March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
12/20091 EPI 5240: Introduction to Epidemiology Incidence and survival December 7, 2009 Dr. N. Birkett, Department of Epidemiology & Community Medicine,
Satistics 2621 Statistics 262: Intermediate Biostatistics Jonathan Taylor and Kristin Cobb April 20, 2004: Introduction to Survival Analysis.
We’ll now look at the relationship between a survival variable Y and an explanatory variable X; e.g., Y could be remission time in a leukemia study and.
01/20151 EPI 5344: Survival Analysis in Epidemiology Estimating S(t) from Cox models March 24, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
01/20151 EPI 5344: Survival Analysis in Epidemiology Confounding and Effect Modification March 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
01/20151 EPI 5344: Survival Analysis in Epidemiology Quick Review from Session #1 March 3, 2015 Dr. N. Birkett, School of Epidemiology, Public Health &
01/20151 EPI 5344: Survival Analysis in Epidemiology Hazard March 3, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive Medicine,
SAS ® is a very powerful tool when producing Graphics. A single graphical data step can easily create a Kaplan Meier Plot, but there is no single graphical.
INTRODUCTION TO CLINICAL RESEARCH Survival Analysis – Getting Started Karen Bandeen-Roche, Ph.D. July 20, 2010.
01/20151 EPI 5344: Survival Analysis in Epidemiology SAS Code for Cox models March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive.
01/20141 EPI 5344: Survival Analysis in Epidemiology Estimating S(t) from Cox models April 1, 2014 Dr. N. Birkett, Department of Epidemiology & Community.
02/20161 EPI 5344: Survival Analysis in Epidemiology Hazard March 8, 2016 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive Medicine,
Additional Regression techniques Scott Harris October 2009.
EPI 5344: Survival Analysis in Epidemiology Week 6 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive Medicine, University of Ottawa 03/2016.
SURVIVAL ANALYSIS WITH STATA. DATA INPUT 1) Using the STATA editor 2) Reading STATA (*.dta) files 3) Reading non-STATA format files (e.g. ASCII) - infile.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
03/20161 EPI 5344: Survival Analysis in Epidemiology Testing the Proportional Hazard Assumption April 5, 2016 Dr. N. Birkett, School of Epidemiology, Public.
Statistics 262: Intermediate Biostatistics
Presentation transcript:

03/20161 EPI 5344: Survival Analysis in Epidemiology Estimating S(t) from Cox models March 29, 2016 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive Medicine, University of Ottawa

03/20162 Objectives Theory of estimating S(t) SAS methods

03/20163 Cox methods do not require an equation for h(t) –does not give direct estimates of h(t), H(t) or S(t). If we could estimate any of these, we could get the others: Introduction (1) Assuming h(t) is constant in interval

03/20164 Introduction (2) With Cox, normally have explanatory/predictor variables Would like S(t) for different values of predictors. Proportional Hazards assumption makes this ‘easy’ once we get over basic hurdle

03/20165 Since hazards are proportional, all we need to know is h(t) or S(t) for the baseline group: h 0 (t) or S 0 (t) Introduction (3) So, estimate S 0 (t) and we can get S(t) for any ‘x’s’ S(t|x) is covariate adjusted survival curve

03/20166 How do we get estimate of S 0 (t)? Must use methods outside Cox regression. Two common approaches are used: –Generalize the Kaplan-Meier method to estimate S 0 (t) –Generalize Nelson-Aalen method to estimate H 0 (t) Implemented by using the BASELINE statement in SAS. First, a review of the technical background Introduction (4)

A bit of technical stuff We assume a piecewise constant hazard model based on risk sets –It keeps coming up, doesn’t it 03/20167

A bit of technical stuff What we want to do is estimate the piecewise hazard for the interval ending at each risk set. There are a number of ways to estimate this We’ll just look at one to get a general idea of what is being done 03/20168

9  Formula gives estimate of h(t)  We don’t care about the origin.  It assumes there are no ties.  Risk set ’m’ occurs at time ‘t j ’ Where: ‘l’ is subject having event at t j ‘m’ is an index for the risk set at t j Collett, p 96

03/ Then, to get S(t), we use: Where: ‘k’ is subject whose survival curve we want

03/ Non-RCT study of therapy Hypernephroma (type of kidney cancer) All patients treated with –chemotherapy and –Immunotherapy Some also had their affected kidney removed. Questions –Does having a nephrectomy affect survival? –Does age affect survival? Example (1)

03/ Answer is YES A 2i = 1 if age is = 0 otherwise A 3i = 1 if age is 70+ = 0 otherwise N i = 1 if had nephrectomy = 0 if no nephrectomy A 3 & N are statistically significant. What do survival curves look like? Example (2)

03/201613

03/201614

03/201615

03/ Could try smoothing the h(t) curve Estimated baseline hazard curve

How did we produce these curves? Use the formulae given earlier. In SAS, getting one S(t) is easy. –ODS produces graph Having more flexibility requires more work –SAS uses the BASELINE statement –COVARIATES option is useful –ODS graphs or plot your own 03/ Example (3)

03/ The data

03/ To start, we need S(t) for one set of covariates –Can be ‘0’ for all variables –mean values –anything else. Can generate any survival curve once we have this. Here’s the simplest way to get an S(t) curve SAS Methods (1)

proc phreg data=njb1 plots=survival; class nephrec/param=ref ref=first order=internal; model time*vstatus(0) = nephrec ; run; 03/201620

03/201621

If you leave off the CLASS statement, you get a different curve (the ‘why’ comes later’) proc phreg data=njb1 plots=survival; model time*vstatus(0) = nephrec ; run; 03/201622

03/201623

03/201624

03/ How to go beyond this simple curve? SAS uses the BASELINE statement Needs to select covariate values What covariate values are used? –Can use the default –Can define your own –Can use more than one set at a time SAS Methods (2)

03/ Here’s the ‘why’ By default, SAS produces an S(t) using these covariate values: –The reference level of any variable mentioned in a class statement –The mean value of each other variable Let’s look at what the BASELINE statement does SAS Methods (3)

proc phreg data=njb1 plots=survival; model time*vstatus(0) = nephrec; baseline out = a survival = s lower = lcl upper = ucl ; run; proc print data=a (obs=10); run; 03/201627

03/ Why don’t we get the same curve?

03/ Content of data set ‘a’

Hmm? ‘nephrectomy’ is an ordinal variable (yes/no for surgery) Why is it given the value in generating S(t)? –Failed to put it in a CLASS statement –It had numerical values assigned (0/1) –SAS treated it as an interval variable = prob(nephrectomy= 1) 03/201630

Let’s fix the nephrectomy covariate value: proc phreg data=njb1 plots=survival; class nephrec/param=ref ref=first order=internal; model time*vstatus(0) = nephrec; baseline out = b survival = s lower = lcl upper = ucl ; run; proc print data=b (obs=10); run; 03/201631

03/ Same curve as before

03/ Content of data set ‘b’

BASELINE output What can we do with the output from the BASELINE statement? –Use as input to your own graphic plots –Use as data within PHREG to create more curves Next, we need to learn about the COVARIATES option 03/201634

What if you want S(t) for some other set of covariates? –Interested in specific target group –Want to contrast extremes of the range of variables Need to tell SAS what values to use for covariates. Use COVARIATES option in SAS 03/ COVARIATES option

data covals2a; input nephrec; format nephrec nephrec. ; datalines; 1 ; run; proc phreg data=njb1 plots=survival; class nephrec/param=ref ref=first order=internal; model time*vstatus(0) = nephrec; baseline covariates = covals2a; run; 03/201636

03/201637

What if you want to display the results of multiple covariate sets on the same graph? –Method #1 Run several of previous models, for different covariates. Combine the output datasets into one dataset Plot using SAS Graph, etc. –Method #2 list more than one set of covariates in the covariate data set ODS graphics has complex options to overlay plots 03/ COVARIATES

data covals2; input nephrec; format nephrec nephrec. ; datalines; 0 1 ; run; proc phreg data=njb1 plots(overlay)=survival; class nephrec/param=ref ref=first order=internal; model time*vstatus(0) = nephrec; baseline covariates = covals2; run; 03/201639

03/201640

data covals3; input nephrec agegrp; format agegrp agegrp. nephrec nephrec. ; datalines; ; run; 03/201641

proc phreg data=njb1 plots(overlay)=survival; class nephrec agegrp/param=ref ref=first order=internal; model time*vstatus(0) = nephrec agegrp; baseline covariates = covals3; run; 03/201642

03/201643

03/ With multiple covariates, ODS doesn’t give nice labels by default. Can get labels with ODS but it needs more code Producing your own graphs lets you label things. ODS graphs

03/ Allison produces S(t) using the ‘STRATA’ statement STRATA used to allow different baseline hazards for the variable in the statement STRATA sex; –allows different baseline hazard for men and women Can produce S(t) curves Using STRATA (1)

proc phreg data=njb1 plots(overlay=row)=survival; class nephrec/param=ref ref=first order=internal; model time*vstatus(0) = nephrec; strata nephrec; run; 03/201646

03/201647

03/201648

03/ Curves are different. Could generate the STRATA curves by running sex-specific Cox models Using STRATA (2)

proc phreg data=njb1 plots(range=0,115)=survival; class nephrec/param=ref ref=first order=internal; model time*vstatus(0) = nephrec; where nephrec = 0; run; proc phreg data=njb1 plots=survival; class nephrec/param=ref ref=first order=internal; model time*vstatus(0) = nephrec; where nephrec = 1; run; 03/201650

03/201651

03/ Two different sets of S(t) curves STRATA –Curves have different baseline hazards –Do not follow PH for the variable in the STRATA statement BASELINE –Curves have same baseline hazards –Follow PH for the variables defining curves. Using STRATA (3)

03/201653