Building multivariable survival models with time-varying effects: an approach using fractional polynomials Willi Sauerbrei Institut of Medical Biometry.

Slides:



Advertisements
Similar presentations
Residuals Residuals are used to investigate the lack of fit of a model to a given subject. For Cox regression, there’s no easy analog to the usual “observed.
Advertisements

Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Interactions With Continuous Variables – Extensions of the Multivariable Fractional Polynomial Approach Willi Sauerbrei Institut of Medical Biometry and.
HSRP 734: Advanced Statistical Methods July 24, 2008.
Introduction Cure models within the framework of flexible parametric survival models T.M-L. Andersson1, S. Eloranta1, P.W. Dickman1, P.C. Lambert1,2 1.
Detecting an interaction between treatment and a continuous covariate: a comparison between two approaches Willi Sauerbrei Institut of Medical Biometry.
1 Modelling time dependent hazard ratios in relative survival: application to colon cancer. BOLARD P, QUANTIN C, ABRAHAMOWICZ M, ESTEVE J, GIORGI R, CHADHA-BOREHAM.
Modelling continuous variables with a spike at zero – on issues of a fractional polynomial based procedure Willi Sauerbrei Institut of Medical Biometry.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
the Cox proportional hazards model (Cox Regression Model)
Making fractional polynomial models more robust Willi Sauerbrei Institut of Medical Biometry and Informatics University Medical Center Freiburg, Germany.
Model and Variable Selections for Personalized Medicine Lu Tian (Northwestern University) Hajime Uno (Kitasato University) Tianxi Cai, Els Goetghebeur,
Flexible modeling of dose-risk relationships with fractional polynomials Willi Sauerbrei Institut of Medical Biometry and Informatics University Medical.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Multivariable model building with continuous data Willi Sauerbrei Institut of Medical Biometry and Informatics University Medical Center Freiburg, Germany.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Clinical Relevance of HER2 Overexpression/Amplification in Patients with Small Tumor Size and Node-Negative Breast Cancer Curigliano G et al. J Clin Oncol.
Model Checking in the Proportional Hazard model
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Cox Proportional Hazards Regression Model Mai Zhou Department of Statistics University of Kentucky.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Survival Analysis: From Square One to Square Two
DIY fractional polynomials Patrick Royston MRC Clinical Trials Unit, London 10 September 2010.
Modelling continuous exposures - fractional polynomials Willi Sauerbrei Institut of Medical Biometry and Informatics University Medical Center Freiburg,
Simple Linear Regression
Tang G et al. Proc SABCS 2010;Abstract S4-9.
Improved Use of Continuous Data- Statistical Modeling instead of Categorization Willi Sauerbrei Institut of Medical Biometry and Informatics University.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
CI - 1 Cure Rate Models and Adjuvant Trial Design for ECOG Melanoma Studies in the Past, Present, and Future Joseph Ibrahim, PhD Harvard School of Public.
Statistical approaches to analyse interval-censored data in a confirmatory trial Margareta Puu, AstraZeneca Mölndal 26 April 2006.
1 Introduction to medical survival analysis John Pearson Biostatistics consultant University of Otago Canterbury 7 October 2008.
Sgroi DC et al. Proc SABCS 2012;Abstract S1-9.
Assessing Survival: Cox Proportional Hazards Model
Extending Cox Regression Accelerated Failure Time Models Tom Greene & Nan Hu.
Using Neural Networks to Predict Claim Duration in the Presence of Right Censoring and Covariates David Speights Senior Research Statistician HNC Insurance.
D:/rg/folien/ms/ms-USA ppt F 1 Assessment of prediction error of risk prediction models Thomas Gerds and Martin Schumacher Institute of Medical.
Time-dependent covariates and further remarks on likelihood construction Presenter Li,Yin Nov. 24.
Multivariable regression modelling – a pragmatic approach based on fractional polynomials for continuous variables Willi Sauerbrei Institut of Medical.
Use of FP and Other Flexible Methods to Assess Changes in the Impact of an exposure over time Willi Sauerbrei Institut of Medical Biometry and Informatics.
Linear correlation and linear regression + summary of tests
HSRP 734: Advanced Statistical Methods July 17, 2008.
Statistical Review: Recursive Partitioning Identifies Patients at High and Low Risk for Ipsilateral Tumor Recurrence After Breast- Conserving Surgery and.
1 THE ROLE OF COVARIATES IN CLINICAL TRIALS ANALYSES Ralph B. D’Agostino, Sr., PhD Boston University FDA ODAC March 13, 2006.
Lecture 12: Cox Proportional Hazards Model
Lecture 16: Regression Diagnostics I Proportional Hazards Assumption -graphical methods -regression methods.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/20/12 Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory.
1 Using dynamic path analysis to estimate direct and indirect effects of treatment and other fixed covariates in the presence of an internal time-dependent.
Treat everyone with sincerity,
Love does not come by demanding from others, but it is a self initiation. Survival Analysis.
© Department of Statistics 2012 STATS 330 Lecture 24: Slide 1 Stats 330: Lecture 24.
Additional Regression techniques Scott Harris October 2009.
Neuropathy Is Not Associated With Clinical Outcomes in Patients Receiving Adjuvant Taxane-Containing Therapy for Operable Breast Cancer Bryan P. Schneider,
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
J Clin Oncol 30: R2 윤경한 / Prof. 김시영 Huan Jin, Dongsheng Tu, Naiqing Zhao, Lois E. Shepherd, and Paul E. Goss.
03/20161 EPI 5344: Survival Analysis in Epidemiology Testing the Proportional Hazard Assumption April 5, 2016 Dr. N. Birkett, School of Epidemiology, Public.
Mamounas EP et al. Proc SABCS 2012;Abstract S1-10.
Survival Analysis: From Square One to Square Two Yin Bun Cheung, Ph.D. Paul Yip, Ph.D. Readings.
Multivariable regression models with continuous covariates with a practical emphasis on fractional polynomials and applications in clinical epidemiology.
Meta-analysis of joint longitudinal and event-time outcomes
Quantitative Methods What lies beyond?.
Quantitative Methods What lies beyond?.
Love does not come by demanding from others, but it is a self initiation. Survival Analysis.
Additional Regression techniques
Subgroup analysis on time-to-event: a Bayesian approach
Presentation transcript:

Building multivariable survival models with time-varying effects: an approach using fractional polynomials Willi Sauerbrei Institut of Medical Biometry and Informatics University Medical Center Freiburg, Germany Patrick Royston MRC Clinical Trials Unit, London, UK

2 Overview Extending the Cox model Assessing PH assumption Model time-by covariate interaction Fractional Polynomial time algorithm Illustration with breast cancer data

3 Cox model 0 (t) – unspecified baseline hazard Hazard ratio does not depend on time, failure rates are proportional ( assumption 1, PH) λ(t|X) = λ 0 (t)exp(β΄X) Covariates are linked to hazard function by exponential function (assumption 2) Continuous covariates act linearly on log hazard function (assumption 3)

4 Extending the Cox model Relax PH-assumption dynamic Cox model (t | X) = 0 (t) exp (  (t) X) HR(x,t) – function of X and time t Relax linearity assumption (t | X) = 0 (t) exp (  f (X))

5 Causes of non-proportionality Effect gets weaker with time Incorrect modelling omission of an important covariate incorrect functional form of a covariate different survival model is appropriate

6 Non-PH - Does it matter ? - Is it real ? Non-PH is large and real - stratify by the factor (t|X, V=j) = j (t) exp (X  ) effect of V not estimated, not tested for continuous variables grouping necessary - Partition time axis - Model non-proportionality by time-dependent covariate Non-PH - What can be done ?

7  Fractional polynomial of degree m with powers p = (p 1,…, p m ) is defined as Fractional polynomial models ( conventional polynomial p 1 = 1, p 2 = 2,... )  Notation: FP1 means FP with one term (one power), FP2 is FP with two terms, etc.  Powers p are taken from a predefined set S  We use S = {  2,  1,  0.5, 0, 0.5, 1, 2, 3}  Power 0 means log X here

8 Estimation and significance testing for FP models Fit model with each combination of powers –FP1: 8 single powers –FP2: 36 combinations of powers Choose model with lowest deviance (MLE) Comparing FPm with FP(m  1): –compare deviance difference with  2 on 2 d.f. –one d.f. for power, 1 d.f. for regression coefficient –supported by simulations; slightly conservative

9 Data: GBSG-study in node-positive breast cancer Tamoxifen (yes / no), 3 vs 6 cycles chemotherapy 299 events for recurrence-free survival time (RFS) in 686 patients with complete data Standard prognostic factors

10 FP analysis for the effect of age

11 χ 2 df Any effect? Best FP2 versus null Effect linear? Best FP2 versus linear FP1 sufficient? Best FP2 vs. best FP Effect of age at 5% level?

12 Continuous factors - different results with different analyses Age as prognostic factor in breast cancer P-value

13 Rotterdam breast cancer data 2982 patients 1 to 231 months follow-up time 1518 events for RFI (recurrence free interval) Adjuvant treatment with chemo- or hormonal therapy according to clinic guidelines 70% without adjuvant treatment Covariates continuous age, number of positive nodes, estrogen, progesterone categorical menopausal status, tumor size, grade

14 9 covariates, partly strong correlation (age-meno; estrogen-progesterone; chemo, hormon – nodes )  variable selection Use multivariable fractional polynomial approach for model selection in the Cox proportional hazards model Treatment variables ( chemo, hormon) will be analysed as usual covariates

15 -Plots Plots of log(-log(S(t))) vs log t should be parallel for groups Plotting Schoenfeld residuals against time to identify patterns in regression coefficients Many other plots proposed -Tests many proposed, often based on Schoenfeld residuals, most differ only in choice of time transformation -Partition the time axis and fit models seperatly to each time interval -Including time-by-covariate interaction terms in the model and estimate the log hazard ratio function Assessing PH-assumption

16 Smoothed Schoenfeld residuals

17 Selected model with MFP estimates test of time-varying effect for different time transformations

18 Selected model with MFP(time-fixed) Estimates in 3 time periods

19 model  (t) x =  x +  x g(t) calculate time-varying covariate x g(t) fit time-varying Cox model and test for  0 plot  (t) against t g(t) – which form? ‘usual‘ function, eg t, log(t) piecewise splines fractional polynomials Including time – by covariate interaction (Semi-) parametric models for  (t)

20 Motivation

21 Motivation (cont.)

22 MFP-time algorithm (1) Determine (time-fixed) MFP model M 0 possible problems variable included, but effect is not constant in time variable not included because of short term effect only Consider short term period only Additional to M 0 significant variables? This given M1

23 MFP-time algorithm (2) To determine time function for a variable compare deviance of models ( χ 2 ) from FPT2 to null (time fixed effect) 4 DF FPT2 to log 3 DF FPT2 to FPT12 DF Use strategy analogous to stepwise to add time-varying functions to MFP model M1 For all variables (with transformations) selected from full time-period and short time-period Investigate time function for each covariate in forward stepwise fashion - may use small P value Adjust for covariates from selected model

24 First step of the MFPT procedure Varia ble Power(s) of tStep 1 Deviance difference & P-value from FP2 FP2FP1Constant(PH)LogFP1 X1X1 0, X 3a -0.5, X 3b -0.5, X4X4 -2, X 5e (2) -2, logX , X8X8 -2, X9X9 0, o

25 Further steps of the MFPT procedure Varia ble Power(s) of tStep 2Step 3 Deviance difference & P-value from FP2FP2 v null FP2FP1Constant(PH)LogFP1P-value X1X1 0, X 3a -0.5, X 3b 0, X4X4 -1, X 5e (2) -2, logX 6 -[0] X8X8 2, X9X9 0, o

26 Development of the model VariableModel M 0 Model M 1 Model M 2 βSEβ β X1X X 3b X4X X 5e (2) X8X X9X X 3a logX X 3a (log(t)) logX 6 (log(t)) Index Index(log(t))

27 Time-varying effects in final model

28 Final model includes time-varying functions for progesterone ( log(t) ) and tumor size ( log(t) ) Prognostic ability of the Index vanishes in time

29 GBSG data Model III from S&R (1999) Selected with a multivariable FP procedure Model III (tumor grade (0,1), exp(-0.12 * number nodes), (progesterone + 1) ** 0.5, age (-2, -0.5)) Model III – false – replace age-function by age linear p-values for g(t) Mod III Mod III – false t log(t)t log(t) global age nodes

30 Summary Time-varying issues get more important with long term follow-up in large studies Issues related to ´correct´ modelling of non-linearity of continuous factors and of inclusion of important variables  we use MFP MFP-time combines selection of important variables selection of functions for continuous variables selection of time-varying function

31 Beware of ´too complex´ models Our FP based approach is simple, but needs ´fine tuning´ and investigation of properties Another approach based on FPs showed promising results in simulation (Berger et al 2003) Summary (continued)

32 Literature Berger, U., Schäfer, J, Ulm, K: Dynamic Cox Modeling based on Fractional Polynomials: Time-variations in Gastric Cancer Prognosis, Statistics in Medicine, 22: (2003) Hess, K.: Graphical Methods for Assessing Violations of the Proportional Hazard Assumption in Cox Regression, Statistics in Medicine, 14, 1707 – 1723 (1995) Gray, R.: Flexible Methods for Analysing Survival Data Using Splines, with Applications to Breast Cancer Prognosis, Journal of the American Statistical Association, 87, No 420, 942 – 951 (1992) Sauerbrei, W., Royston, P.: Building multivariable prognostic and diagnostic models : Transformation of the predictors by using fractional polynomials, Journal of the Royal Statistical Society, A. 162:71-94 (1999) Sauerbrei, W.,Royston, P., Look,M.: A new proposal for multivariable modelling of time-varying effects in survival data based on fractional polynomial time-transformation, submitted Therneau, T., Grambsch P.: Modeling Survival Data, Springer, 2000