Assessing Survival: Cox Proportional Hazards Model

Slides:



Advertisements
Similar presentations
The analysis of survival data in nephrology. Basic concepts and methods of Cox regression Paul C. van Dijk 1-2, Kitty J. Jager 1, Aeilko H. Zwinderman.
Advertisements

Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
Interpreting regression for non-statisticians Colin Fischbacher.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Departments of Medicine and Biostatistics
HSRP 734: Advanced Statistical Methods July 24, 2008.
Multiple Regression [ Cross-Sectional Data ]
SC968: Panel Data Methods for Sociologists
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
Biostatistics in Research Practice Time to event data Martin Bland Professor of Health Statistics University of York
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
EPI 809/Spring Multiple Logistic Regression.
An Introduction to Logistic Regression
Logistic Regression Chapter 8.
Model Checking in the Proportional Hazard model
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
STAT E-150 Statistical Methods
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
Survival Analysis: From Square One to Square Two
Logistic Regression. Outline Review of simple and multiple regressionReview of simple and multiple regression Simple Logistic RegressionSimple Logistic.
Simple Linear Regression
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Survival Data John Kornak March 29, 2011
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
Entering Multidimensional Space: Multiple Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
01/20151 EPI 5344: Survival Analysis in Epidemiology Interpretation of Models March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
HSRP 734: Advanced Statistical Methods July 17, 2008.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Statistical Inference for more than two groups Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Regression: Checking the Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Entering Multidimensional Space: Multiple Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Lecture 9: Analysis of intervention studies Randomized trial - categorical outcome Measures of risk: –incidence rate of an adverse event (death, etc) It.
Lecture 12: Cox Proportional Hazards Model
Lecture 6: Multiple Linear Regression Adjusted Variable Plots BMTRY 701 Biostatistical Methods II.
Lecture 6: Multiple Linear Regression Adjusted Variable Plots BMTRY 701 Biostatistical Methods II.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
We’ll now look at the relationship between a survival variable Y and an explanatory variable X; e.g., Y could be remission time in a leukemia study and.
Logistic Regression Analysis Gerrit Rooks
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Using Microsoft Excel to Conduct Regression Analysis.
Correlation and Linear Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Additional Regression techniques Scott Harris October 2009.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
03/20161 EPI 5344: Survival Analysis in Epidemiology Estimating S(t) from Cox models March 29, 2016 Dr. N. Birkett, School of Epidemiology, Public Health.
Stats Methods at IC Lecture 3: Regression.
BINARY LOGISTIC REGRESSION
BIOST 513 Discussion Section - Week 10
Logistic Regression APKC – STATS AFAC (2016).
Applied Biostatistics: Lecture 2
Statistical Inference for more than two groups
Statistics 103 Monday, July 10, 2017.
Multiple logistic regression
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
CHAPTER 29: Multiple Regression*
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Regression and Categorical Predictors
Presentation transcript:

Assessing Survival: Cox Proportional Hazards Model Statistics for Health Research Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics

Objectives of Workshop Understand the general form of Cox PH model Understand the need for adjusted Hazard Ratios (HR) Implement the Cox model in SPSS Understand and interpret the output from SPSS

Modelling: Detecting signal from background noise

Survival Regression Models Expressed in terms of the hazard function formally defined as: The instantaneous risk of event (mortality) in next time interval t, conditional on having survived to start of the interval t

What is hazard? Hazard rate is an instantaneous rate of events as a function of time Survival is one measure of outcome but cancer studies also use hazards. Hazards intuitively sounds like the inverse of survival which is true but it is not quite as simple as that as they are related through the exponential function The hazard is an instantaneous rate of events. The hazard ratio (HR) (see later) is the ratio of hazards in two groups, often trial arms and is a measure of the efficacy in a cancer trial. A HR = 1 indicates no difference between the two arms, a HR > 1 indicates a greater hazard for the numerator arm and a HR < 1 indicates a lower hazard. Often in cancer trials of a new drug the trialists would be seeking a HR < 1 indicating a survival benefit by reducing deaths relative to the comparator.

Plot of hazard Note that the hazard changes over time denoted by h(t) The hazard rate is a function of time and tends to increase over time. Think of the risk of death from birth to old age which is high just after birth, declines through childhood and increases from middle-age. time Birth Old age

Survival Regression Models The Cox model expresses the relationship between the hazard and a set of variables or covariates These could be arm of trial, age, gender, social deprivation, Dukes stage, co-morbidity, etc….

How is the relationship formulated? Simplest equation is: Age in years Hazard h is the hazard K is a constant e.g. 0.3 per Person-year

How is the relationship formulated? Next Simplest is linear equation: h is the outcome; a is the intercept; β is the slope related to x the explanatory variable and; e is the error term or ‘noise’

Linear model of hazard Age in years Hazard

Cox Proportional Hazards Model (1972) h0 is the baseline hazard; r ( β, x) function reflects how the hazard function changes (β) according to differences in subjects’ characteristics (x)

Exponential model of hazard Age in years

What is Hazard Ratio? Hazard Ratio (HR) is ratio of hazards in two groups e.g. men vs women, new drug vs. BSC N.B. It is the improvement in one group over the other in terms of rate at which events will occur from a particular time point to another time point Note that although the hazard is changing over time the ratio of the hazard in two groups is often assumed to be constant for most of the models we use in cancer research. This is the property of proportional hazards.

What is Hazard Ratio? Hazard Ratio (HR) is ratio of hazards in two groups and remains constant over time (n.b. survival curve widens) Survival Constant ratio of hazards means the survival curves gradually widen. Hazard and survival are related through the log/ exponential functions. The HR is a summary of the difference in effect across the whole time course but at any time point the HR is the same if we have proportional hazards. Time

Interpretation of HR comparing two groups HR = 1 ; Do NOT reject null hypothesis (i.e. no difference) HR < 1; Reduction in Hazard relative to comparator (e.g. HR = 0.6 is 40% reduction) HR > 1; Increase in Hazard relative to comparator (e.g. HR = 1.7 is 70% increase) A HR = 1 indicates no difference between the two groups. HR < 1 is generally what we aim for in cancer trials as mainly seeking to reduce events such as death. A HR > 1 indicates a factor that increases the hazard in relation to the comparator.

Cox Proportional Hazards Model: Hazard Ratio Consider hazard ratio for men vs. women, then -

Cox Proportional Hazards Model: Hazard Ratio If coding for gender is x=1 (men) and x=0 (women) then: where β is the regression coefficient for gender

Hazard ratios in SPSS SPSS gives hazard ratios for a binary factor coded (0,1) automatically from exponentiation of regression coefficients (95% CI are also given as an option) Note that the HR is labelled as EXP(B) in the output

Fitting Gender in Cox Model in SPSS

Output from Cox Model in SPSS p-value Standard error Degrees of freedom Variable in model HR for men vs. women Regression Coefficient Test Statistic ( β/se(β) )2

Logrank Test: Null Hypothesis The Null hypothesis for the logrank test: Hazard Rate group A = Hazard Rate for group B = HR = OA / EA = 1 OB / EB

Wald Test: Null Hypothesis The Null hypothesis for the Wald test: Hazard Ratio = 1 Equivalent to regression coefficient β=0 Note that if the 95% CI for the HR includes 1 then the null hypothesis cannot be rejected

Hazard ratios for categorical factors in SPSS Enter factor as before Click on ‘categorical’ and choose the reference category (usually first or last) E.g. Duke’s staging may choose Stage A as the reference category HRs are now given in output for survival in each category relative to Stage A Hence there will be n-1 HRs for n categories

Fitting a categorical variable: Duke’s Staging Reference category B vs. A C vs. A D vs. A UK vs. A

One Solution to Confounding Use multiple Cox regression with both predictor and confounder as explanatory variables i.e fit: x1 is Duke’s Stage and x2 is Age

Fitting a multiple regression: Duke’s Staging and Age Age adjusted for Duke’s Stage

Interpretation of the Hazard Ratio For a continuous variable such as age, HR represents the incremental increase in hazard per unit increase in age i.e HR=1.024, increase 2.4% for a one year increase in age For a categorical variable the HR represents the incremental increase in hazard in one category relative to the reference category i.e. HR = 6.66 for Stage D compared with A represents a 6.7 fold increase in hazard

First steps in modelling What hypotheses are you testing? If main ‘exposure’ variable, enter first and assess confounders one at a time Assess each variable on statistical significance and clinical importance. It is acceptable to have an ‘important’ variable without statistical significance

Summary The Cox Proportional Hazards model is the most used analytical tool in survival research It is easily fitted in SPSS Model assessment requires some thought Next step is to consider how to select multiple factors for the ‘best’ model

Check assumption of proportional hazards (PH) Proportional hazards assumes that the ratio of hazard in one group to another remains the same throughout the follow-up period For example, that the HR for men vs. women is constant over time Simplest method is to check for parallel lines in the Log (-Log) plot of survival

Check assumption of proportional hazards for each factor Check assumption of proportional hazards for each factor. Log minus log plot of survival should give parallel lines if PH holds Hint: Within Cox model select factor as CATEGORICAL and in PLOTS select log minus log function for separate lines of factor

Check assumption of proportional hazards for each factor Check assumption of proportional hazards for each factor. Log minus log plot of survival should give parallel lines if PH holds Hint: Within Cox model select factor as CATEGORICAL and in PLOTS select log minus log function for separate lines of factor

Proportional hazards holds for Duke’s Staging Categorical Variable Codings(b) Frequency (1) (2) (3) (4) dukes(a) 0=A 18 1 0 0 0 1=B 107 0 1 0 0 2=C 188 0 0 1 0 3=D 123 0 0 0 1 9=UK 40 0 0 0 0 a Indicator Parameter Coding b Category variable: dukes (Dukes Staging)

Proportional hazards holds for Duke’s Staging

Summary Selection of factors for Multiple Cox regression models requires some judgement Automatic procedures are available but treat results with caution They are easily fitted in SPSS Check proportional hazards assumption Parsimonious models are better

Practical Read in Colorectal.sav and try to fit a multiple proportional hazards model Check proportional hazards assumption