Goodness of Fit of a Joint Model for Event Time and Nonignorable Missing Longitudinal Quality of Life Data – A Study by Sneh Gulati* *with Jean-Francois.

Slides:



Advertisements
Similar presentations
STATISTICS Joint and Conditional Distributions
Advertisements

Copula Regression By Rahul A. Parsa Drake University &
Unsupervised Learning
Random Variables ECE460 Spring, 2012.
1 Goodness-of-Fit Tests with Censored Data Edsel A. Pena Statistics Department University of South Carolina Columbia, SC [ Research.
Maximum Likelihood And Expectation Maximization Lecture Notes for CMPUT 466/551 Nilanjan Ray.
Outline input analysis input analyzer of ARENA parameter estimation
Chap 8: Estimation of parameters & Fitting of Probability Distributions Section 6.1: INTRODUCTION Unknown parameter(s) values must be estimated before.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
The General Linear Model. The Simple Linear Model Linear Regression.
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
Maximum likelihood (ML) and likelihood ratio (LR) test

Maximum likelihood Conditional distribution and likelihood Maximum likelihood estimations Information in the data and likelihood Observed and Fisher’s.
1 Unsupervised Learning With Non-ignorable Missing Data Machine Learning Group Talk University of Toronto Monday Oct 4, 2004 Ben Marlin Sam Roweis Rich.
AGC DSP AGC DSP Professor A G Constantinides© Estimation Theory We seek to determine from a set of data, a set of parameters such that their values would.
Maximum likelihood (ML) and likelihood ratio (LR) test
Visual Recognition Tutorial
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Probability theory 2010 Conditional distributions  Conditional probability:  Conditional probability mass function: Discrete case  Conditional probability.
Continuous Random Variables and Probability Distributions
Lecture 7 1 Statistics Statistics: 1. Model 2. Estimation 3. Hypothesis test.
Modeling clustered survival data The different approaches.
Maximum likelihood (ML)
Maximum Likelihood Estimation
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
Chapter 5 Sampling and Statistics Math 6203 Fall 2009 Instructor: Ayona Chatterjee.
Modeling Menstrual Cycle Length in Pre- and Peri-Menopausal Women Michael Elliott Xiaobi Huang Sioban Harlow University of Michigan School of Public Health.
Model Inference and Averaging
Functions of Random Variables. Methods for determining the distribution of functions of Random Variables 1.Distribution function method 2.Moment generating.
Bayesian Analysis and Applications of A Cure Rate Model.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Ch5. Probability Densities II Dr. Deshi Ye
Borgan and Henderson:. Event History Methodology
HSRP 734: Advanced Statistical Methods July 17, 2008.
Multiple Random Variables Two Discrete Random Variables –Joint pmf –Marginal pmf Two Continuous Random Variables –Joint Distribution (PDF) –Joint Density.
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
Consistency An estimator is a consistent estimator of θ, if , i.e., if
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
HMM - Part 2 The EM algorithm Continuous density HMM.
Empirical Likelihood for Right Censored and Left Truncated data Jingyu (Julia) Luan University of Kentucky, Johns Hopkins University March 30, 2004.
Estimation Method of Moments (MM) Methods of Moment estimation is a general method where equations for estimating parameters are found by equating population.
Lecture 2: Statistical learning primer for biologists
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Brief Review Probability and Statistics. Probability distributions Continuous distributions.
Lecture 4: Likelihoods and Inference Likelihood function for censored data.
STOCHASTIC HYDROLOGY Stochastic Simulation of Bivariate Distributions Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National.
1 Using dynamic path analysis to estimate direct and indirect effects of treatment and other fixed covariates in the presence of an internal time-dependent.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 6.1: Recurrent event data Intensity processes and rate functions Robust.
Proportional Hazards Model Checking the adequacy of the Cox model: The functional form of a covariate The link function The validity of the proportional.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
Other Models for Time Series. The Hidden Markov Model (HMM)
Joint Modelling of Accelerated Failure Time and Longitudinal Data By By Yi-Kuan Tseng Yi-Kuan Tseng Joint Work With Joint Work With Professor Jane-Ling.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
Presentation : “ Maximum Likelihood Estimation” Presented By : Jesu Kiran Spurgen Date :
DURATION ANALYSIS Eva Hromádková, Applied Econometrics JEM007, IES Lecture 9.
Estimating standard error using bootstrap
Comparing Cox Model with a Surviving Fraction with regular Cox model
Probability Theory and Parameter Estimation I
Model Inference and Averaging
Parameter Estimation 主講人:虞台文.
CH 5: Multivariate Methods
EC 331 The Theory of and applications of Maximum Likelihood Method
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
The Empirical FT. What is the large sample distribution of the EFT?
Basics of the semiparametric frailty model
Lecture 4: Likelihoods and Inference
Lecture 4: Likelihoods and Inference
Presentation transcript:

Goodness of Fit of a Joint Model for Event Time and Nonignorable Missing Longitudinal Quality of Life Data – A Study by Sneh Gulati* *with Jean-Francois Dupuy and Mounir Mesbah

HISTORY  Project is a result of the sabbatical spent at University of South Brittany  In survival studies two variables of interest: terminal event and a covariate (possibly time dependent)

Dupuy and Mesbah (2002) modeled above for unobserved covariates We propose here a test statistic to validate their model Still in progress

BREAKDOWN OF TALK  I) Preliminary Results  II) Missing Observations  III) Dupuy’s Model  IV) Goodness of fit for Dupuy’s Model

Preliminaries  The survival time (or duration time to some terminal event) T is often modeled by the Cox Regression Model : ( t | Z) =  (t)exp{ T Z)}   (t) is baseline hazard rate, Z is the vector of covariates. Survival times are censored on right – one observes X = min (T, C) and  = I { T C}

Types of Covariates  External – Not directly involved with the failure mechanism  Internal – Generated by the individual under study – observed only as long as the individual survives

Solution to the full model  Parameter vector obtained  0 by maximizing the following:

Estimate of the cumulative hazard function:

Goodness of Fit:  Graphical Methods  Chi-Squared Type Tests  Lin’s Method of Weights

Problem – Missing Data  Missing Covariates – often due to drop out

 Let denote the history of the covariate upto time t:  Let T be the time to some event. Then the hazard of T at time t is ((t)| )dt = lim dt →0 Pr( t < T < t + dt)| )

CLASSIFICATION OF THE DROP-OUT PROCESS  Completely Random Dropout – Drop-out process is independent of both observed and non observed measurements.  Random Drop-out – Drop-out process is independent of unobserved measurements, but depends on the observed measurements.  Nonignorable Drop-out – Drop-out process depends on unobserved measurements.

Approaches  Use only the complete observations  Replace missing values with sample mean.  Estimate missing values with consistent estimators so that the likelihood is maximized (IMPUTATION)

Previous Notable Work for Nonignorable Dropout  Diggle and Kenward (1994)  Little (1995), Hogan and Laird (1997) - Essentially one integrates out the unobserved covariates  Martinussen (1999) – uses EM algorithm

Work of Dupuy and Mesbah  Subjects measured at discrete time intervals  Terminal Event – Disease Progression  Patients can dropout and covariate can be unobserved at dropout

The Model  n subjects observed at fixed times t j, t 0 = 0 <... < t j-1 < t j <... <  0 <  0  t = t j – t j-1 <  1 <   Let Z = internal covariate and Z i (t) = value of Z at time t for the i th individual  Z i, j denote the response for the ith subject on (t j, t j+1 ].

Hazard Rate  = (t)exp( T w(t)) where w(t) = (z(t – t), z(t)) T  = (    ) T or  = (    ) T

Assumptions  1) The covariate vector Z is assumed to have uniformly bounded continuous density f(z,   2) The censoring time C has continuous distribution function G C (u)  3) The censoring distribution is assumed to be independent of the unobserved covariate, and of the parameters ,  and .

Likelihood L(  ) = Let us call the above model Equation (1)

Solution  Method of Sieves: Replace original parameter space  of the parameters () by an approximating space  n, called the sieve.

Instead of the hazard function,  one considers;  n,i =  n (T (i) ) T (i), i = 1, 2,..., p(n), where T (1)  T (2) ...  T (p(n) are the order statistics corresponding to the distinct dropout times T 1  T 2 ...  T p(n) Hence the approximating sieve is  n = {  = ( , ,  n ):   R p,   R 2,  n, 1   n, 2 ...   n, p(n) }.

One maximizes the psuedo-likelihood function: Ln(  ) =

here L (i) (  ) =

THE MLE The MLE Obtained via the EM algorithm is identifiable and asymptotically normally distributed

Goodness-of-fit for Dupuy’s Model  Issue of Model Checking Important  PROBLEM – MISSING DATA  Could use DOUBLE SAMPLING or IMPUTATION

SOLUTION: Validate model in Equation (1) – Marginal Model  Done by Using the Weights Method of Lin (1991)

Development of the Test Statistic  Using a random weight function, WG(.) define a class of weighted pseudo-likelihood functions given by  WL n () = Call the above equation (2)

where WL (i) (  ) =

Define the maximizer of equation (2) as: The test statistic is a function of

Asymptotic Results For  Under the model in Equation (1), the vector converges in distribution to a bivariate normal distribution with zero mean and a covariance matrix

If model in Equation (1) is correct, the weighted and the nonweighted MLE’s should be close to each other:

Under the model in Eqn (1), the vector converges in distribution to a bivariate normal distribution with zero mean and a covariance matrix D W =

Note that

Proof still in progress  Proposed method: Show that score function for weighted likelihood and unweighted likelihood are asymptotically joint normal.  Use counting process techniques and martingale theory.

THE TEST STATISTIC Under the model in Equation (1), the above statistic will have a chi-square distribution with 2 degrees of freedom.