Joint Modelling of Accelerated Failure Time and Longitudinal Data By By Yi-Kuan Tseng Yi-Kuan Tseng Joint Work With Joint Work With Professor Jane-Ling.

Slides:



Advertisements
Similar presentations
Residuals Residuals are used to investigate the lack of fit of a model to a given subject. For Cox regression, there’s no easy analog to the usual “observed.
Advertisements

Image Modeling & Segmentation
Kin 304 Regression Linear Regression Least Sum of Squares
Improving Forecast Accuracy by Unconstraining Censored Demand Data Rick Zeni AGIFORS Reservations and Yield Management Study Group May, 2001.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
Segmentation and Fitting Using Probabilistic Methods
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
Goodness of Fit of a Joint Model for Event Time and Nonignorable Missing Longitudinal Quality of Life Data – A Study by Sneh Gulati* *with Jean-Francois.
Maximum likelihood estimates What are they and why do we care? Relationship to AIC and other model selection criteria.
Parametric Inference.
What is it? When would you use it? Why does it work? How do you implement it? Where does it stand in relation to other methods? EM algorithm reading group.
Bayesian Analysis of Dose-Response Calibration Curves Bahman Shafii William J. Price Statistical Programs College of Agricultural and Life Sciences University.
Linear and generalised linear models Purpose of linear models Least-squares solution for linear models Analysis of diagnostics Exponential family and generalised.
Modeling clustered survival data The different approaches.
Statistical Methods for Missing Data Roberta Harnett MAR 550 October 30, 2007.
1 Adjustment Procedures to Account for Nonignorable Missing Data in Environmental Surveys Breda Munoz Virginia Lesser R
Transforming the data Modified from: Gotelli and Allison Chapter 8; Sokal and Rohlf 2000 Chapter 13.
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
Markov Localization & Bayes Filtering
Correlated Topic Models By Blei and Lafferty (NIPS 2005) Presented by Chunping Wang ECE, Duke University August 4 th, 2006.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
A statistical model Μ is a set of distributions (or regression functions), e.g., all uni-modal, smooth distributions. Μ is called a parametric model if.
1 HMM - Part 2 Review of the last lecture The EM algorithm Continuous density HMM.
Stochastic Linear Programming by Series of Monte-Carlo Estimators Leonidas SAKALAUSKAS Institute of Mathematics&Informatics Vilnius, Lithuania
Bayesian Analysis and Applications of A Cure Rate Model.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Borgan and Henderson:. Event History Methodology
Empirical Efficiency Maximization: Locally Efficient Covariate Adjustment in Randomized Experiments Daniel B. Rubin Joint work with Mark J. van der Laan.
GEE Approach Presented by Jianghu Dong Instructor: Professor Keumhee Chough (K.C.) Carrière.
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Lecture 12: Cox Proportional Hazards Model
Introduction Sample Size Calculation for Comparing Strategies in Two-Stage Randomizations with Censored Data Zhiguo Li and Susan Murphy Institute for Social.
Monte-Carlo method for Two-Stage SLP Lecture 5 Leonidas Sakalauskas Institute of Mathematics and Informatics Vilnius, Lithuania EURO Working Group on Continuous.
HMM - Part 2 The EM algorithm Continuous density HMM.
The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Expectation-Maximization (EM) Algorithm & Monte Carlo Sampling for Inference and Approximation.
Multilevel and multifrailty models. Overview  Multifrailty versus multilevel Only one cluster, two frailties in cluster e.g., prognostic index (PI) analysis,
A shared random effects transition model for longitudinal count data with informative missingness Jinhui Li Joint work with Yingnian Wu, Xiaowei Yang.
Treat everyone with sincerity,
Love does not come by demanding from others, but it is a self initiation. Survival Analysis.
Sensitivity Analysis for Interval-Censored Discrete Failure Time Data: Application to ACTG 181 Daniel Scharfstein Department of Biostatistics Johns Hopkins.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
CWR 6536 Stochastic Subsurface Hydrology
Proportional Hazards Model Checking the adequacy of the Cox model: The functional form of a covariate The link function The validity of the proportional.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
For multivariate data of a continuous nature, attention has focussed on the use of multivariate normal components because of their computational convenience.
Other Models for Time Series. The Hidden Markov Model (HMM)
EM Algorithm 主講人:虞台文 大同大學資工所 智慧型多媒體研究室. Contents Introduction Example  Missing Data Example  Mixed Attributes Example  Mixture Main Body Mixture Model.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
Introduction We consider the data of ~1800 phenotype measurements Each mouse has a given probability distribution of descending from one of 8 possible.
Confidential and Proprietary Business Information. For Internal Use Only. Statistical modeling of tumor regrowth experiment in xenograft studies May 18.
Unsupervised Learning Part 2. Topics How to determine the K in K-means? Hierarchical clustering Soft clustering with Gaussian mixture models Expectation-Maximization.
Bootstrap and Model Validation
If we can reduce our desire,
Comparing Cox Model with a Surviving Fraction with regular Cox model
Hidden Markov Models.
Proportional Hazard Models
Meta-analysis of joint longitudinal and event-time outcomes
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Cox Regression Model Under Dependent Truncation
Basics of the semiparametric frailty model
If we can reduce our desire,
Love does not come by demanding from others, but it is a self initiation. Survival Analysis.
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.
Presentation transcript:

Joint Modelling of Accelerated Failure Time and Longitudinal Data By By Yi-Kuan Tseng Yi-Kuan Tseng Joint Work With Joint Work With Professor Jane-Ling Wang Professor Jane-Ling Wang Professor Fushing Hsieh Professor Fushing Hsieh Tseng Y.K., Hsieh F., and Wang J.L. (2005). 92, pp , Biometrika.

I. Introduction CD4 counts and time to AIDS (or death) CD4 counts and time to AIDS (or death)CD4

Self and Powitan(1992), Degruttola and Tu(1994), Tsiatis et al.(1995), Faucett and Thomas (1996), Wulfsohn and Tsiatis(1997) Bycott and Taylor(1998) Dafni and Tsiatis (1998), Tsiatis and Davidian (2001) Taylor et al.(1994), Lavalley and Degruttola(1996), Henderson et al.(2000),Wang and Taylor(2001), Xu and Zeger(2001) f(t ) :a vector of known functions of time t, U (t) : a stochastic process

Bayesian approaches Bayesian approaches Conditional score approaches Conditional score approaches Two-stage partial likelihood approaches Two-stage partial likelihood approaches -truncation causes bias -truncation causes bias Joint likelihood approaches -robust to the distribution of random effects - unbiased - efficient

Accelerated failure time model is an attractive alternative when the proportional hazard assumption fails. Accelerated failure time model is an attractive alternative when the proportional hazard assumption fails. For time independent covariates X: For time independent covariates X:

For time dependent covariates X(t), we consider For time dependent covariates X(t), we consider the AFT model in Cox and Oakes (1984): the AFT model in Cox and Oakes (1984): Biological meaning: Allows the influence of entire Biological meaning: Allows the influence of entire covariate history on subject specific risk. covariate history on subject specific risk.

For an absolutely continuous S 0, the hazard rate function with covariate history: If baseline hazard is unspecified, the expression If baseline hazard is unspecified, the expression corresponds to a semi-parametric model. corresponds to a semi-parametric model. Robins and Tsiatis (1992)– rank estimating equation Lin and Ying (1995)– asym. consistency and Normality Hsieh (2003)– over-identified estimating equation

Goal of the study: provide an effective estimators for β with unspecified baseline hazard and the parameters β with unspecified baseline hazard and the parameters of longitudinal process of longitudinal process Different assumptions on baseline hazard: -- Wulfsohn and Tsiatis (1997) Discrete baseline hazard with jumps at event times Discrete baseline hazard with jumps at event times -- Our assumption: The baseline hazard is a step function. The baseline hazard is a step function.

II. Joint AFT and Longitudinal model

Model for longitudinal data: Model for longitudinal data:

Model for survival: Model for survival: Joint likelihood: Assumptions -- noninformative censoring -- noninformative measurement schedule t ij, both are independent of future covariate history and random effects b i Joint likelihood: Assumptions -- noninformative censoring -- noninformative measurement schedule t ij, both are independent of future covariate history and random effects b i

III. EM Algorithm M-step: M-step: Complete data likelihood: Complete data likelihood:

Therefore, we have

There’s no closed form expression for. We may There’s no closed form expression for. We may maximize the conditional likelihood by numerical maximize the conditional likelihood by numerical method. method. E-step: E-step: To compute E i (.),we need knowledge of To compute E i (.),we need knowledge of which can be expressed as: which can be expressed as:

E i (.), we may generating M multivariate To derive E i (.), we may generating M multivariate normal sequences for, denoted by normal sequences for, denoted by ThTh The accuracy increases as M increases. In order to The accuracy increases as M increases. In order to have higher accuracy and less computing time, we have higher accuracy and less computing time, we may follow the suggestion in Wei and Tanner (1990) may follow the suggestion in Wei and Tanner (1990). That is, to use small value of M in the initial iterations. That is, to use small value of M in the initial iterations of the algorithm, and increase the values of M as the of the algorithm, and increase the values of M as the algorithm moves closer to convergence. algorithm moves closer to convergence.

We encounter two difficulties when estimating standard error of : EM algorithm involved missing information EM algorithm involved missing information -Remedies in Louis (1982) and McLachlan and -Remedies in Louis (1982) and McLachlan and Krishnan (1997) are valid for finite dimensional Krishnan (1997) are valid for finite dimensional parameter space. parameter space. No explicit profile likelihood No explicit profile likelihood - Need projection onto all other parameters - However, it’s very hard to derive due to λ 0

Bootstrap technique in Efron(1994):

IV. Simulation Studies Sample size n=100 with 100 MC replications Sample size n=100 with 100 MC replications

β μ1μ1μ1μ1 μ2μ2μ2μ2 σ 11 σ 12 σ 22 σeσeσeσe target mean SD β μ1μ1μ1μ1 μ2μ2μ2μ2 σ 11 σ 12 σ 22 σeσeσeσetarget mean SD (i) Normal random effects without censoring (ii) Normal random effects with censoring

β μ1μ1μ1μ1 μ2μ2μ2μ2 σ 11 σ 12 σ 22 σeσeσeσe target empiricaltarget mean SD (iii) Nonnormal random effects with censoring 2

The medfly (Mediterranean fruit fly) data: The medfly (Mediterranean fruit fly) data: --From Carey, et al. (1998) -- We focus on 251 female medflies which have the most egg reproduction (>1150). --Range of event time from 22 to Range of total reproduction from 1151 to No censoring and missing Relationship between daily egg laying and mortality Relationship between daily egg laying and mortality V. Application on Medfly data

--Violate the proportionality (By scaled Schoenfeld residual test with p-value ) --Violate the proportionality (By scaled Schoenfeld residual test with p-value )

Initial model: Initial model: Log transformed model: Log transformed model:

β μ1μ1μ1μ1 μ2μ2μ2μ2 σ 11 σ 12 σ 22 σeσeσeσe fitted values bootstrap mean bootstrap SD The parameter estimates derived from original data and 100 bootstrap samples under the joint AFT

Fitting incomplete medfly data: Fitting incomplete medfly data: --Randomly select 1-7 days as the corresponding schedule times for each individual. times for each individual. --Then, add the day of death as the last schedule time. Therefore, each individual may have 2-8 repeated Therefore, each individual may have 2-8 repeated measurements. measurements. -- The sub data set is further censored by exponential distribution with mean 500 (20% censoring rate) distribution with mean 500 (20% censoring rate)

β μ1μ1μ1μ1 μ2μ2μ2μ2 σ 11 σ 12 σ 22 σeσeσeσe fitted values bootstrap mean bootstrap SD The parameter estimates derived from incomplete data and 100 bootstrap samples under the joint AFT

The End