Univariate Survival Analysis Prof. L. Duchateau Ghent University.

Slides:



Advertisements
Similar presentations
Analysis of multivariate transformations. Transformation of the response in regression The normalized power transformation is: is the geometric mean of.
Advertisements

Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Uncertainty and confidence intervals Statistical estimation methods, Finse Friday , 12.45–14.05 Andreas Lindén.
1 Parametric Sensitivity Analysis For Cancer Survival Models Using Large- Sample Normal Approximations To The Bayesian Posterior Distribution Gordon B.
Workshop 1 Alternative models for clustered survival data.
The General Linear Model. The Simple Linear Model Linear Regression.
Visual Recognition Tutorial
Maximum likelihood estimates What are they and why do we care? Relationship to AIC and other model selection criteria.
Part 21: Hazard Models [1/29] Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business.
Maximum likelihood (ML) and likelihood ratio (LR) test
Maximum likelihood Conditional distribution and likelihood Maximum likelihood estimations Information in the data and likelihood Observed and Fisher’s.
Generalised linear models
Maximum likelihood (ML) and likelihood ratio (LR) test
Log-linear and logistic models Generalised linear model ANOVA revisited Log-linear model: Poisson distribution logistic model: Binomial distribution Deviances.
Log-linear and logistic models
Statistics and Probability Theory Prof. Dr. Michael Havbro Faber
Linear and generalised linear models
Computer vision: models, learning and inference
Linear and generalised linear models
Modeling clustered survival data The different approaches.
Accelerated Failure Time (AFT) Model As An Alternative to Cox Model
Maximum likelihood (ML)
Model Checking in the Proportional Hazard model
Principles of the Global Positioning System Lecture 10 Prof. Thomas Herring Room A;
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
Maximum Likelihood Estimation
17. Duration Modeling. Modeling Duration Time until retirement Time until business failure Time until exercise of a warranty Length of an unemployment.
Multivariate survival analysis Luc Duchateau, Ghent University Paul Janssen, Hasselt University 1.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
HSRP 734: Advanced Statistical Methods July 10, 2008.
ECE 8443 – Pattern Recognition LECTURE 06: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Bias in ML Estimates Bayesian Estimation Example Resources:
The Triangle of Statistical Inference: Likelihoood
Moment Generating Functions
Multivariate Survival Analysis Prof. L. Duchateau Ghent University.
An Empirical Likelihood Ratio Based Goodness-of-Fit Test for Two-parameter Weibull Distributions Presented by: Ms. Ratchadaporn Meksena Student ID:
Learning Theory Reza Shadmehr logistic regression, iterative re-weighted least squares.
1 Advances in the Construction of Efficient Stated Choice Experimental Designs John Rose 1 Michiel Bliemer 1,2 1 The University of Sydney, Australia 2.
2 December 2004PubH8420: Parametric Regression Models Slide 1 Applications - SAS Parametric Regression in SAS –PROC LIFEREG –PROC GENMOD –PROC LOGISTIC.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
HSRP 734: Advanced Statistical Methods July 17, 2008.
ECE 8443 – Pattern Recognition LECTURE 10: HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS AND INDEPENDENT COMPONENT ANALYSIS Objectives: Generalization of.
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
: Chapter 3: Maximum-Likelihood and Baysian Parameter Estimation 1 Montri Karnjanadecha ac.th/~montri.
Maximum Likelihood Estimation Psych DeShon.
Lecture 12: Cox Proportional Hazards Model
Consistency An estimator is a consistent estimator of θ, if , i.e., if
Geology 5670/6670 Inverse Theory 21 Jan 2015 © A.R. Lowry 2015 Read for Fri 23 Jan: Menke Ch 3 (39-68) Last time: Ordinary Least Squares Inversion Ordinary.
Robert Engle UCSD and NYU and Robert F. Engle, Econometric Services DYNAMIC CONDITIONAL CORRELATIONS.
STT : BIOSTATISTICS ANALYSIS Dr. Cuixian Chen Chapter 7: Parametric Survival Models under Censoring STT
Brief Review Probability and Statistics. Probability distributions Continuous distributions.
Lecture 4: Likelihoods and Inference Likelihood function for censored data.
Multilevel and multifrailty models. Overview  Multifrailty versus multilevel Only one cluster, two frailties in cluster e.g., prognostic index (PI) analysis,
Treat everyone with sincerity,
M.Sc. in Economics Econometrics Module I Topic 4: Maximum Likelihood Estimation Carol Newman.
Week 31 The Likelihood Function - Introduction Recall: a statistical model for some data is a set of distributions, one of which corresponds to the true.
Proportional Hazards Model Checking the adequacy of the Cox model: The functional form of a covariate The link function The validity of the proportional.
Learning Theory Reza Shadmehr Distribution of the ML estimates of model parameters Signal dependent noise models.
Computational Intelligence: Methods and Applications Lecture 26 Density estimation, Expectation Maximization. Włodzisław Duch Dept. of Informatics, UMK.
Estimation Econometría. ADE.. Estimation We assume we have a sample of size T of: – The dependent variable (y) – The explanatory variables (x 1,x 2, x.
[Topic 11-Duration Models] 1/ Duration Modeling.
Part 3: Estimation of Parameters. Estimation of Parameters Most of the time, we have random samples but not the densities given. If the parametric form.
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
CHAPTER 18 SURVIVAL ANALYSIS Damodar Gujarati
Parametric Survival Models (ch. 7)
EC 331 The Theory of and applications of Maximum Likelihood Method
Basics of the semiparametric frailty model
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Parametric Methods Berlin Chen, 2005 References:
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Treat everyone with sincerity,
Presentation transcript:

Univariate Survival Analysis Prof. L. Duchateau Ghent University

Model specification  Most survival models are defined in terms of the hazard with the hazard at time t for subject i the baseline hazard at time t the incidence vector for subject i the parameter vector

Hazard function  Density function  Cumulative distribution function  Survival function  Hazard function

Alternative models  Hazard model: Baseline hazard function parametric Baseline hazard function unspecified Summary measure: hazard ratio  Accelerated failure time (AFT) model: Typically parametric Summary measure: accelerator factor

Parametric hazard model: Analytical solution  Assume constant baseline hazard (exponential lifetimes) with only control and treated group with = 0 for control and =1 for treated  Likelihood for exponential:

Likelihood specification constant hazard  Define as number of events in treated (control) group and  Define as at risk time in treated (control) group and

Solution from likelihood specification  Maximise the log likelihood function leading to

Analytical solution reconstitution data #The analytical solution DT<-sum(stat[trt==1]); DC<-sum(stat[trt==0]); yT<-sum(timerec[trt==1]);yC<-sum(timerec[trt==0]); lambda<-DC/yC;HR<-(DT/yT)/(DC/yC) lambda;HR

Exercise Obtain the analytical solution for the diagnosis data set First rework the data: #Read the data diag<-read.table("timetodiag.csv",header=T,sep=";") #Create 5 column vectors, five different variables timetodiag<-c(diag$t1,diag$t2) stat<-c(diag$c1,diag$c2) technique<-c(rep(0,106),rep(1,106)) dogid<-c(diag$dogid,diag$dogid) diagnosis<- data.frame(dogid=dogid,technique=technique,timetodiag=t imetodiag,stat=stat)

Variance of the estimates?  Obtain the Hessian, i.e., the matrix of the second derivatives of the log likelihood which is  The information matrix is then

Inverse of observed information matrix  The observed information matrix is thus and the asymptotic variance-covariance matrix is

Variance estimators #The observed information matrix I<-matrix(data=c((DT+DC)/(lambda^2), yT*HR,yT*HR,lambda*yT*HR), nrow = 2, ncol = 2) V<-solve(I) V;sqrt(V) [,1] [,2] [1,] [2,] [,1] [,2] [1,] NaN [2,] NaN

Exercise Obtain the asymptotic variance estimates for the parameters of the diagnosis data set

Maximizer solution reconstitution data #(negative) loglikelihood exponential with l=exp(p[1]), beta=p[2] loglikelihood.exponential<-function(p){ cumhaz<- exp(p[1])*timerec*(exp(p[2]*trt)) hazard<-stat*log(exp(p[1])*exp(p[2]*trt)) loglik<-sum(hazard)-sum(cumhaz) -loglik} #Apply minimizer to minus loglikelihood function res<-nlm(loglikelihood.exponential,c(-1,0)) res;lambda<-exp(res$estimate[1]);HR<- exp(res$estimate[2]) lambda;HR

Variances from maximizer solution #Apply minimizer to obtain Hessian matrix res<-nlm(loglikelihood.exponential,c(-1,0),hessian=T) solve(res$hessian) [,1] [,2] [1,] [2,]

Use parameters of interest as input #(negative) loglikelihood exponential with l=p[1], HR=p[2] loglikelihood.exponentialHR<-function(p){ cumhaz<- p[1]*timerec*(exp(log( p[2])*trt)) hazard<-stat*log(p[1]*exp(log(p[2])*trt)) loglik<-sum(hazard)-sum(cumhaz) -loglik} #Apply minimizer to obtain Hessian matrix res<- nlm(loglikelihood.exponentialHR,c(lambda,HR),hessian=T,iterl im=1) solve(res$hessian) [,1] [,2] [1,] [2,]

Exercise Obtain the parameter estimates and their variance for the diagnosis data set using the maximizer

Standard software? #Univariate model-exponential library(survival) res.unadjust<- survreg(Surv(timerec,stat)~trt,dist="exponential",data=reconstituti on) res.unadjust summary(res.unadjust) lambda<-res.unadjust$coef[1]; beta<-res.unadjust$coef[2];HR<-exp(beta) lambda;beta;HR

Loglinear model representation  Hazard model with parametric baseline hazard can be rewritten in a loglinear model representation  Most often used:

Examples Weibull distributions – varying 

Survival function for Weibull hazard model  Assume

Survival function for Weibull loglinear model  Assume with  From this follows that and thus  Based on the Gumbel assumption, the survival function becomes

Two presentations for Weibull event times and thus:

Two presentations for exponential event times and thus:

survreg function #unadjusted model-exponential res.unadjust<- survreg(Surv(timerec,stat)~trt,dist="exponential",data=reconstituti on) res.unadjust;summary(res.unadjust) mu<-res.unadjust$coef[1];alpha<-res.unadjust$coef[2]; lambda<- exp(-mu);beta<- -alpha;HR<-exp(beta) lambda;beta;HR

Exercise Obtain the parameter estimates of the diagnosis data set using the survreg function in R

survreg function variances #unadjusted model-exponential res.unadjust<- survreg(Surv(timerec,stat)~trt,dist="exponential",data=reconstituti on) res.unadjust;summary(res.unadjust) mu<-res.unadjust$coef[1];alpha<-res.unadjust$coef[2]; lambda<- exp(-mu);beta<- -alpha;HR<-exp(beta) lambda;beta;HR res.unadjust$var (Intercept) trt (Intercept) trt

The delta method on variance  Obtaining the variance of using the variance of

The delta method - general  Original parameters  Interest in univariate cont. function  Use one term Taylor expansion of with

The delta method - specific  Interest in univariate cont. function  The one term Taylor expansion of With

survreg function variances #unadjusted model-exponential-variances of transformed variables lambda<- exp(-mu);beta<- -alpha;HR<-exp(beta) Vlambda<- res.unadjust$var[1,1]*(lambda^2) Vbeta<- res.unadjust$var[2,2]

Exercise Obtain the variances parameter estimates of the diagnosis data set using the survreg function in R, and applying the delta method