STT520-420: BIOSTATISTICS ANALYSIS Dr. Cuixian Chen Chapter 7: Parametric Survival Models under Censoring STT520-420 1.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

The Maximum Likelihood Method
Point Estimation Notes of STAT 6205 by Dr. Fan.
GENERAL LINEAR MODELS: Estimation algorithms
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Chap 8: Estimation of parameters & Fitting of Probability Distributions Section 6.1: INTRODUCTION Unknown parameter(s) values must be estimated before.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Survival analysis 1 The greatest blessing in life is in giving and not taking.
Hypothesis testing Some general concepts: Null hypothesisH 0 A statement we “wish” to refute Alternative hypotesisH 1 The whole or part of the complement.
Linear Regression with One Regression
Biostatistics in Research Practice Time to event data Martin Bland Professor of Health Statistics University of York
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
1 STATISTICAL INFERENCE PART I EXPONENTIAL FAMILY & POINT ESTIMATION.
Generalized Linear Models
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
STT : BIOSTATISTICS ANALYSIS Dr. Cuixian Chen Chapter 8: Fitting Parametric Regression Models STT
Likelihood probability of observing the data given a model with certain parameters Maximum Likelihood Estimation (MLE) –find the parameter combination.
NASSER DAVARZANI DEPARTMENT OF KNOWLEDGE ENGINEERING MAASTRICHT UNIVERSITY, 6200 MAASTRICHT, THE NETHERLANDS 22 OCTOBER 2012 Introduction to Survival Analysis.
HSRP 734: Advanced Statistical Methods July 10, 2008.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
STATISTICAL INFERENCE PART I POINT ESTIMATION
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
STT : Biostatistics Analysis Dr. Cuixian Chen
01/20151 EPI 5344: Survival Analysis in Epidemiology Maximum Likelihood Estimation: An Introduction March 10, 2015 Dr. N. Birkett, School of Epidemiology,
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
2 December 2004PubH8420: Parametric Regression Models Slide 1 Applications - SAS Parametric Regression in SAS –PROC LIFEREG –PROC GENMOD –PROC LOGISTIC.
Chapter 7 Point Estimation
1 Lecture 13: Other Distributions: Weibull, Lognormal, Beta; Probability Plots Devore, Ch. 4.5 – 4.6.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
1 Lecture 16: Point Estimation Concepts and Methods Devore, Ch
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 9 Survival Analysis Henian Chen, M.D., Ph.D.
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
Censoring an observation of a survival r.v. is censored if we don’t know the survival time exactly. usually there are 3 possible reasons for censoring.
Lecture 12: Cox Proportional Hazards Model
01/20151 EPI 5344: Survival Analysis in Epidemiology Actuarial and Kaplan-Meier methods February 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
Lecture 4: Likelihoods and Inference Likelihood function for censored data.
We’ll now look at the relationship between a survival variable Y and an explanatory variable X; e.g., Y could be remission time in a leukemia study and.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Lecture 3: Parametric Survival Modeling
STT : BIOSTATISTICS ANALYSIS Dr. Cuixian Chen Chapter 4: Data Plot STT
Treat everyone with sincerity,
Example x y We wish to check for a non zero correlation.
M.Sc. in Economics Econometrics Module I Topic 4: Maximum Likelihood Estimation Carol Newman.
1 Say good things, think good thoughts, and do good deeds.
R. Kass/W03 P416 Lecture 5 l Suppose we are trying to measure the true value of some quantity (x T ). u We make repeated measurements of this quantity.
Chapter 4 Continuous Random Variables and Probability Distributions  Probability Density Functions.2 - Cumulative Distribution Functions and E Expected.
BPS - 5th Ed. Chapter 231 Inference for Regression.
DURATION ANALYSIS Eva Hromádková, Applied Econometrics JEM007, IES Lecture 9.
Statistics 350 Lecture 3.
Chapter 4 Basic Estimation Techniques
Chapter 4 Continuous Random Variables and Probability Distributions
ASV Chapters 1 - Sample Spaces and Probabilities
STT : Biostatistics Analysis Dr. Cuixian Chen
Chapter 4 Continuous Random Variables and Probability Distributions
STATISTICAL INFERENCE PART I POINT ESTIMATION
Estimation Maximum Likelihood Estimates Industrial Engineering
Statistics 200b. Spring David Brillinger
6-1 Introduction To Empirical Models
Parametric Survival Models (ch. 7)
Estimation Maximum Likelihood Estimates Industrial Engineering
Simple Linear Regression
Basic Practice of Statistics - 3rd Edition Inference for Regression
Parametric Methods Berlin Chen, 2005 References:
STATISTICAL INFERENCE PART I POINT ESTIMATION
The hazard function The hazard function gives the so-called “instantaneous” risk of death (or failure) at time t, assuming survival up to time t. Estimate.
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Treat everyone with sincerity,
Lecture 4: Likelihoods and Inference
Lecture 4: Likelihoods and Inference
Presentation transcript:

STT : BIOSTATISTICS ANALYSIS Dr. Cuixian Chen Chapter 7: Parametric Survival Models under Censoring STT

Recall Chapter 5 and 6: Estimating the survival function with right censoring STT To estimate the survival function in the presence of right censoring:  Method 1: Cohort Lifetable method (Recall advantage/ disadvantage).  Method 2: Product-limit estimator (PLE), (also call Kaplan- Meier estimator (1958)).  Both are univariate Non-parametric/distribution-free estimate of survivor function.  Lifetable method uses fixed endpoints from intervals, while PLE used observed data points.

 Consider survival r.v. Y follows certain survival model with parameters.  It is called parametric survival models under censoring.  The aim is to estimate these parameters by maximizing the (log) likelihoods. STT Parametric Survival Models (ch. 7)

Parametric Survival Models  notations…  Y = survival r.v.; S=survival function, f=density (pdf); F=cdf=1-S. All these functions involve the unknown parameter (or vector of parameters) theta ( .  The 2 examples we’ll consider are:  exponential with  Weibull with

 The maximum likelihood estimator of is a function of the observed Y’s which maximizes the likelihood: Def 7.1 : L(  a constant multiple of the joint distribution of the observed data. So theta-hat maximizes L over all possible values of theta…  The mathematical method is discussed at the top of page note that usually the log of the likelihood is maximized…  Example: Find the log-likelihood function for Y ~ exp( . Parametric Survival Models

 Important results given in this section:  define the score function as the derivative of the log- likelihood  define Fisher’s information as  then  at the bottom of p. 120 and top of p.121, the method of actually finding maximum likelihood estimators is discussed and it is shown that Parametric Survival Models

Example: For Exponential model  With all exact observations, assume Y ~ exp( .  (1) Find the log-likelihood function.  (2) Find the maximum likelihood estimate.  (3) Find the Fisher information.  (4) Find the 95% Confidence Interval for   (5) Find the asymptotic distribution of MLE of  STT

 now go to page 131… Maximum likelihood under Type I censoring.  Suppose we have a survival variable Y ~exp(  ), subject to Type I censoring. Beta is the MTTF and we may use maximum likelihood methods to estimate it…(p.131 and 132) The Exponential Model

 Maximizing the likelihood equation gives the  MLE of  as  so we may use this to construct a 95% CI for the MTTF  The Exponential Model

Example: For Exponential model  With RC observations, assume Y ~ exp( .  (1) Find the log-likelihood function.  (2) Find the maximum likelihood estimate.  (3) Find the Fisher information.  (4) Find the 95% Confidence Interval for   (5) Find the asymptotic distribution of MLE of  STT

 Assume the survival r.v. Y follows Exp(  ). The observed data is given as following:  Flight control package failure time (mins): (n=14).  Observed: 1, 8, 10  Alive: 59, 72, 76, 113, 117, 124, 145, 149, 153, 182, 320.  Q: (a) Determine the maximum likelihood estimate of   (b) Determine the standard error of the estimate of   (c) Determine a 95% confidence interval of  STT Example 7.3, page 132

 Then hypothesis tests about  can be based on any of the 3 statistics below.  To test use one of :  Wald test:  LR test:  Score test: Parametric Survival Models ~ ~ ~

Recall: Weibull Prob Plots (chap4)  Recall Power hazard model:  Substitute, then take logarithm:  Take logarithm of base 10 again to create log-life variable:  We now have:  Note: base-10 log are traditionally used in lifetime, but natural log is equivalent with a difference of scale. STT

Recall: Weibull Prob Plots  If data fit Weibull model:  Weibull Prob plot is:   It follows a straight line with slope and intercept. STT

 Weibull model: assume Y ~ Weibull(  subject to Type I censoring. More generally, take log of survival data, X=log e (Y).  X=log e (Y) follows Extremevalue(u, b):  The survival function is given by  The original Weibull parameters can be estimated if u and b are estimated by u-hat and b-hat: The Weibull Model

 Use SAS to read in the switch failure time data (see website). Then get estimates of the Weibull parameters for both the log-transformed and non- transformed data - use PROC LIFEREG:  Notice that we may use the NOLOG option in the MODEL statement to not take logs of the data… proc lifereg data=switch; model u*censor(0)= /nolog dist=weibull; title 'Modeling u=log(Y) w/ NOLOG option'; proc lifereg data=switch; model y*censor(0)= /dist=weibull; title 'Modeling non-transformed Y'; run; quit;  Return to Section 4.4 (p ) and use SAS to get a probability plot (formula 4.4) of this data… Example 7.4, page 135 Modeling u=log(Y) w/ NOLOG option