Model Checking in the Proportional Hazard model

Slides:



Advertisements
Similar presentations
Residuals Residuals are used to investigate the lack of fit of a model to a given subject. For Cox regression, there’s no easy analog to the usual “observed.
Advertisements

Continued Psy 524 Ainsworth
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Forecasting Using the Simple Linear Regression Model and Correlation
Stat 112: Lecture 7 Notes Homework 2: Due next Thursday The Multiple Linear Regression model (Chapter 4.1) Inferences from multiple regression analysis.
Inference for Regression
Analysis of variance (ANOVA)-the General Linear Model (GLM)
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
HSRP 734: Advanced Statistical Methods July 24, 2008.
Objectives (BPS chapter 24)
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Chapter 13 Multiple Regression
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Correlation and Regression. Spearman's rank correlation An alternative to correlation that does not make so many assumptions Still measures the strength.
Stat 512 – Lecture 18 Multiple Regression (Ch. 11)
Chapter 12 Multiple Regression
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Regression Diagnostics Checking Assumptions and Data.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Correlation and Regression Analysis
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Simple Linear Regression Analysis
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Correlation & Regression
Inference for regression - Simple linear regression
Simple Linear Regression
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Data Handling & Analysis BD7054 Scatter Plots Andrew Jackson
Assessing Survival: Cox Proportional Hazards Model
Cox Regression II Kristin Sainani Ph.D. Stanford University Department of Health Research and Policy Kristin Sainani Ph.D.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
HSRP 734: Advanced Statistical Methods July 31, 2008.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Lecture 12: Cox Proportional Hazards Model
Correlation & Regression Analysis
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Logistic Regression Analysis Gerrit Rooks
Love does not come by demanding from others, but it is a self initiation. Survival Analysis.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Proportional Hazards Model Checking the adequacy of the Cox model: The functional form of a covariate The link function The validity of the proportional.
Hypothesis Testing and Statistical Significance
Additional Regression techniques Scott Harris October 2009.
BPS - 5th Ed. Chapter 231 Inference for Regression.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
03/20161 EPI 5344: Survival Analysis in Epidemiology Testing the Proportional Hazard Assumption April 5, 2016 Dr. N. Birkett, School of Epidemiology, Public.
Stats Methods at IC Lecture 3: Regression.
Chapter 15 Multiple Regression Model Building
Logistic Regression APKC – STATS AFAC (2016).
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
AP Statistics Chapter 14 Section 1.
Statistics 262: Intermediate Biostatistics
CHAPTER 29: Multiple Regression*
6-1 Introduction To Empirical Models
The greatest blessing in life is
Simple Linear Regression
Basic Practice of Statistics - 3rd Edition Inference for Regression
Ch11 Curve Fitting II.
Love does not come by demanding from others, but it is a self initiation. Survival Analysis.
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
Presentation transcript:

Model Checking in the Proportional Hazard model When we get the data set, sometime we are more interesting in the hazards ration. For example, how is the new treatment or new drug compared to the standard one, so we fit the pH model.

Influential observations PH assumption Model adequacy Influential observations PH assumption Example: leaders data Total 472 observations Total 11 covariates fitted in the initial PH model Manner start age conflict loginc region After we fit the model ,we will ask: how is model fit? How to assess the model? a couple of things we need take care is checking

Assessing model adequacy Residuals checking: Cox-Snell (& modified) Martingale Deviance Schoenfeld Score First to checking the model overall fit , …. These are the widely used residuals, some of them we will talk more detail later.

Residual Plots (a) Cox-Snell residual plot Yielding a straight line passing origin with unity slope When we got the residuals, we can plot them. Some graph will be helpful. As we have learned, If the fitted cox model is correct, the residuals will be a straight line ……..the values rci have a unit exponential distribution.

(b) Index plot of Martingale, Deviance residuals take values between - and one in large samples uncorrelated with each other, with mean zero can be interpreted as the difference the observed and expected number of deaths in (0,ti), for ith individual.

(c) deviance residuals vs. risk score A transformation of martingale residuals More symmetrically distributed about zero The smaller residuals, the better fitted by the model Risk score: the linear predictor in Large negative values a lower than average risk of fail Large positive values a higher than average risk of fail Sometimes the plot of …. Is more helpful. As we know, the deviance residual is ………… the residuals have negative value when their survival time is less than expected.

Plot of the deviance residuals against linear predictor

(3) Checking the functional form of covariate Martingale residual from fitting the null model (contains no covariates) plot residuals vs. covariate of interest a straight line indicates a linear term is needed (but most time need using smoother)

Martingale residuals from null model vs start It shows a roughly straight line

Indentifying influential observations delta-beta statistics the difference in the parameter estimate between the all observations fit and the ith observation omitted from the fit assessing the influence of observations on a parameter estimate To identify the influential observation, there are two basic ways. One is using the statistics called Delta-bets……. An approximation to the change is based on the score residual.

-------Lowest------ ------Highest------ SAS output Approx. delta-betas for manner Extreme Observations -------Lowest------ ------Highest------ Value Obs Value Obs -0.0300321 342 0.0109460 180 -0.0263841 234 0.0121594 212 -0.0180988 428 0.0128754 403 -0.0168336 141 0.0141662 258 -0.0156685 247 0.0154732 286 Approx. LDi --------Lowest------- ------Highest------ 6.21925E-05 302 0.0838694 234 6.33683E-05 345 0.0973350 286 7.60771E-05 346 0.1086945 54 7.62382E-05 395 0.1152216 342 8.00260E-05 373 0.1250329 449 In SAS we can use the dfbeta and lrchange option to do the diagnostic analysis. A negative value that means removing this observation will decrease the hazard of fail relative to the baseline hazard. Positive will increase …………

Likelihood displacement (LD) The changes of the maximized log-likelihood if omitting the ith observation from the fit Assessing the influence of observations on the overall fit( the set of parameter estimates)

Treatment: Check the original data, corrected it if found any mistakes Make inferences based on both situations (full & reduced data), contrast results In recording or transcribing……………..sometimes we can’t confirm the data is valid

Testing PH assumption A crucial assumption when using Cox model The effect of covariates on the hazard rate are the same over time

Before fitting a Cox Model Grouping data according the level of one or more factors Plot Parallel curves if the hazard ratios are proportional across the different groups In SAS, strata option

Checking PH assumption for manner It is roughly parallel.

Checking PH assumption for region Since we include total 11 covariates in the model, this plot takes no account of the values of the others variables, this could be the survive time has been affected by other variants. There is little reason to suspect the assumption.

After fitting a Cox Model Plot weighted Schoenfeld residuals vs. time the time-varying coefficient at the ith failure time the estimate in the fitted Cox PH model When the data has some time-varying effect covariate, we can plot weighted schoenfeld residuals against time to detect such effect. Since the scoenfeld residuals has such properties:

Plot weighted Schoenfeld residuals vs. time (cont’s) Detect if some form of time dependency in particular covariate exist A horizontal line show the coefficient is constant, and PH assumption is valid

Adding a time-dependent covariate if hazard rate varies with time For example, a PH model with one covariate for ith observation assumes If the hazard ratio varies with time between two groups, we can include an interaction term to the model: The relative hazard is now , which depend on t. If the coefficient is significant, the model is no longer a PH model. The test of the hypothesis that =0 is a test of the PH assumption. If the PH assumption is fail,

Summary Assessing model fit: Influence diagnostics: Residuals & residual plots Functional form of the covariates Influence diagnostics: Delta-beta statistics LD statistics Testing PH assumption : Plot of log[-logS] vs. log(t) Plot of Schoenfeld residuals vs. t Adding a time-dependent variable