Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.

Slides:



Advertisements
Similar presentations
The analysis of survival data in nephrology. Basic concepts and methods of Cox regression Paul C. van Dijk 1-2, Kitty J. Jager 1, Aeilko H. Zwinderman.
Advertisements

Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
Continued Psy 524 Ainsworth
Interpreting regression for non-statisticians Colin Fischbacher.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Departments of Medicine and Biostatistics
HSRP 734: Advanced Statistical Methods July 24, 2008.
Multiple Regression [ Cross-Sectional Data ]
SC968: Panel Data Methods for Sociologists
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
© 2000 Prentice-Hall, Inc. Chap Multiple Regression Models.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
Biostatistics in Research Practice Time to event data Martin Bland Professor of Health Statistics University of York
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
EPI 809/Spring Multiple Logistic Regression.
An Introduction to Logistic Regression
Logistic Regression Chapter 8.
Model Checking in the Proportional Hazard model
Linear Regression and Correlation Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and the level of.
STAT E-150 Statistical Methods
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Simple Linear Regression
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
Assessing Survival: Cox Proportional Hazards Model
Entering Multidimensional Space: Multiple Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
01/20151 EPI 5344: Survival Analysis in Epidemiology Interpretation of Models March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
HSRP 734: Advanced Statistical Methods July 17, 2008.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Statistical Inference for more than two groups Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Regression: Checking the Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Entering Multidimensional Space: Multiple Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Lecture 12: Cox Proportional Hazards Model
Lecture 6: Multiple Linear Regression Adjusted Variable Plots BMTRY 701 Biostatistical Methods II.
Lecture 6: Multiple Linear Regression Adjusted Variable Plots BMTRY 701 Biostatistical Methods II.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
We’ll now look at the relationship between a survival variable Y and an explanatory variable X; e.g., Y could be remission time in a leukemia study and.
Logistic Regression Analysis Gerrit Rooks
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
1 Introduction to Modeling Beyond the Basics (Chapter 7)
Using Microsoft Excel to Conduct Regression Analysis.
Correlation and Linear Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Additional Regression techniques Scott Harris October 2009.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
Stats Methods at IC Lecture 3: Regression.
BINARY LOGISTIC REGRESSION
BIOST 513 Discussion Section - Week 10
Logistic Regression APKC – STATS AFAC (2016).
CHAPTER 7 Linear Correlation & Regression Methods
Applied Biostatistics: Lecture 2
Statistical Inference for more than two groups
Statistics 103 Monday, July 10, 2017.
Multiple logistic regression
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
CHAPTER 29: Multiple Regression*
Regression and Categorical Predictors
Presentation transcript:

Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research

Objectives of Workshop Understand the general form of Cox PH model Understand the general form of Cox PH model Understand the need for adjusted Hazard Ratios (HR) Understand the need for adjusted Hazard Ratios (HR) Implement the Cox model in SPSS Implement the Cox model in SPSS Understand and interpret the output from SPSS Understand and interpret the output from SPSS

Modelling: Detecting signal from background noise

Survival Regression Models Expressed in terms of the hazard function formally defined as: The instantaneous risk of event (mortality) in next time interval t, conditional on having survived to start of the interval t

Survival Regression Models The Cox model expresses the relationship between the hazard and a set of variables or covariates These could be arm of trial, age, gender, social deprivation, Dukes stage, co-morbidity, etc….

How is the relationship formulated? Simplest equation is: h is the hazard K is a constant e.g. 0.3 per Person-year Age in years HazardHazardHazardHazard

How is the relationship formulated? Next Simplest is linear equation: h is the outcome; a is the intercept; β is the slope related to x the explanatory variable and; e is the error term or ‘noise’

Linear model of hazard Hazard Age in years

Cox Proportional Hazards Model (1972) h 0 is the baseline hazard; r ( β, x) function reflects how the hazard function changes (β) according to differences in subjects’ characteristics (x)

Exponential model of hazard Hazard Age in years

Cox Proportional Hazards Model: Hazard Ratio Consider hazard ratio for men vs. women, then -

Cox Proportional Hazards Model: Hazard Ratio If coding for gender is x=1 (men) and x=0 (women) then: where β is the regression coefficient for gender

Hazard ratios in SPSS SPSS gives hazard ratios for a binary factor coded (0,1) automatically from exponentiation of regression coefficients (95% CI are also given as an option) Note that the HR is labelled as EXP(B) in the output

Fitting Gender in Cox Model in SPSS

Output from Cox Model in SPSS Variable in model Regression Coefficient Standard error Test Statistic ( β/se(β) ) 2 Degrees of freedom p- value HR for men vs. women

Logrank Test: Null Hypothesis The Null hypothesis for the logrank test: Hazard Rate group A = Hazard Rate group A = Hazard Rate for group B Hazard Rate for group B = HR = O A / E A = 1 = HR = O A / E A = 1 O B / E B O B / E B

Wald Test: Null Hypothesis The Null hypothesis for the Wald test: Hazard Ratio = 1 Hazard Ratio = 1 Equivalent to regression coefficient β=0 Note that if the 95% CI for the HR includes 1 then the null hypothesis cannot be rejected

Hazard ratios for categorical factors in SPSS Enter factor as before Enter factor as before Click on ‘categorical’ and choose the reference category (usually first or last) Click on ‘categorical’ and choose the reference category (usually first or last) E.g. Duke’s staging may choose Stage A as the reference category E.g. Duke’s staging may choose Stage A as the reference category HRs are now given in output for survival in each category relative to Stage A HRs are now given in output for survival in each category relative to Stage A Hence there will be n-1 HRs for n categories Hence there will be n-1 HRs for n categories

Fitting a categorical variable: Duke’s Staging Reference category B vs. A C vs. A D vs. A UK vs. A

One Solution to Confounding Use multiple Cox regression with both predictor and confounder as explanatory variables i.e fit: x 1 is Duke’s Stage and x 2 is Age

Fitting a multiple regression: Duke’s Staging and Age Age adjusted for Duke’s Stage

Interpretation of the Hazard Ratio For a continuous variable such as age, HR represents the incremental increase in hazard per unit increase in age i.e HR=1.024, increase 2.4% for a one year increase in age For a categorical variable the HR represents the incremental increase in hazard in one category relative to the reference category i.e. HR = 6.66 for Stage D compared with A represents a 6.7 fold increase in hazard

First steps in modelling What hypotheses are you testing?What hypotheses are you testing? If main ‘exposure’ variable, enter first and assess confounders one at a timeIf main ‘exposure’ variable, enter first and assess confounders one at a time Assess each variable on statistical significance and clinical importance.Assess each variable on statistical significance and clinical importance. It is acceptable to have an ‘important’ variable without statistical significanceIt is acceptable to have an ‘important’ variable without statistical significance

Summary The Cox Proportional Hazards model is the most used analytical tool in survival research The Cox Proportional Hazards model is the most used analytical tool in survival research It is easily fitted in SPSS It is easily fitted in SPSS Model assessment requires some thought Model assessment requires some thought Next step is to consider how to select multiple factors for the ‘best’ model Next step is to consider how to select multiple factors for the ‘best’ model

Check assumption of proportional hazards (PH) Proportional hazards assumes that the ratio of hazard in one group to another remains the same throughout the follow-up period Proportional hazards assumes that the ratio of hazard in one group to another remains the same throughout the follow-up period For example, that the HR for men vs. women is constant over time For example, that the HR for men vs. women is constant over time Simplest method is to check for parallel lines in the Log (-Log) plot of survival Simplest method is to check for parallel lines in the Log (-Log) plot of survival

Check assumption of proportional hazards for each factor. Log minus log plot of survival should give parallel lines if PH holds Hint: Within Cox model select factor as CATEGORICAL and in PLOTS select log minus log function for separate lines of factor

Check assumption of proportional hazards for each factor. Log minus log plot of survival should give parallel lines if PH holds Hint: Within Cox model select factor as CATEGORICAL and in PLOTS select log minus log function for separate lines of factor

Proportional hazards holds for Duke’s Staging Categorical Variable Codings(b) Frequency(1)(2)(3)(4) dukes(a)0=A =B =C =D =UK a Indicator Parameter Coding b Category variable: dukes (Dukes Staging)

Proportional hazards holds for Duke’s Staging

Summary Selection of factors for Multiple Cox regression models requires some judgement Selection of factors for Multiple Cox regression models requires some judgement Automatic procedures are available but treat results with caution Automatic procedures are available but treat results with caution They are easily fitted in SPSS They are easily fitted in SPSS Check proportional hazards assumption Check proportional hazards assumption Parsimonious models are better Parsimonious models are better

Practical Read in Colorectal.sav and try to fit a multiple proportional hazards model Read in Colorectal.sav and try to fit a multiple proportional hazards model Check proportional hazards assumption Check proportional hazards assumption