Bayesian Analysis and Applications of A Cure Rate Model.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Bayes rule, priors and maximum a posteriori
“Students” t-test.
Lecture 11 (Chapter 9).
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Chapter 7 Title and Outline 1 7 Sampling Distributions and Point Estimation of Parameters 7-1 Point Estimation 7-2 Sampling Distributions and the Central.
Departments of Medicine and Biostatistics
HSRP 734: Advanced Statistical Methods July 24, 2008.
1 Parametric Sensitivity Analysis For Cancer Survival Models Using Large- Sample Normal Approximations To The Bayesian Posterior Distribution Gordon B.
The General Linear Model. The Simple Linear Model Linear Regression.
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
Goodness of Fit of a Joint Model for Event Time and Nonignorable Missing Longitudinal Quality of Life Data – A Study by Sneh Gulati* *with Jean-Francois.
Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Test statistic: Group Comparison Jobayer Hossain Larry Holmes, Jr Research Statistics, Lecture 5 October 30,2008.
Continuous Random Variables and Probability Distributions
Today Today: Chapter 8 Assignment: 5-R11, 5-R16, 6-3, 6-5, 8-2, 8-8 Recommended Questions: 6-1, 6-2, 6-4, , 8-3, 8-5, 8-7 Reading: –Sections 8.1,
Copyright © Cengage Learning. All rights reserved. 6 Point Estimation.
Maximum likelihood (ML)
Review of Lecture Two Linear Regression Normal Equation
SIMULATION MODELING AND ANALYSIS WITH ARENA
Chapter Two Probability Distributions: Discrete Variables
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
Simple Linear Regression
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 8 Continuous.
CI - 1 Cure Rate Models and Adjuvant Trial Design for ECOG Melanoma Studies in the Past, Present, and Future Joseph Ibrahim, PhD Harvard School of Public.
Statistical Decision Theory
Model Inference and Averaging
Bayesian Inference, Basics Professor Wei Zhu 1. Bayes Theorem Bayesian statistics named after Thomas Bayes ( ) -- an English statistician, philosopher.
Bayesian Inference Ekaterina Lomakina TNU seminar: Bayesian inference 1 March 2013.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
Random Sampling, Point Estimation and Maximum Likelihood.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
Modeling Cure Rates Using the Survival Distribution of the General Population Wei Hou 1, Keith Muller 1, Michael Milano 2, Paul Okunieff 1, Myron Chang.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
- 1 - Bayesian inference of binomial problem Estimating a probability from binomial data –Objective is to estimate unknown proportion (or probability of.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Problem: 1) Show that is a set of sufficient statistics 2) Being location and scale parameters, take as (improper) prior and show that inferences on ……
The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.
Lecture 4: Likelihoods and Inference Likelihood function for censored data.
Multilevel and multifrailty models. Overview  Multifrailty versus multilevel Only one cluster, two frailties in cluster e.g., prognostic index (PI) analysis,
Continuous Random Variables and Probability Distributions
Lecture 5 Introduction to Sampling Distributions.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 6.1: Recurrent event data Intensity processes and rate functions Robust.
Copyright © Cengage Learning. All rights reserved. 5 Joint Probability Distributions and Random Samples.
Gibbs Sampling and Hidden Markov Models in the Event Detection Problem By Marc Sobel.
REGRESSION MODEL FITTING & IDENTIFICATION OF PROGNOSTIC FACTORS BISMA FAROOQI.
Virtual University of Pakistan Lecture No. 34 of the course on Statistics and Probability by Miss Saleha Naghmi Habibullah.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Prediction and Missing Data. Summarising Distributions ● Models are often large and complex ● Often only interested in some parameters – e.g. not so interested.
Modeling and Simulation CS 313
Comparing Cox Model with a Surviving Fraction with regular Cox model
Chapter 4 Continuous Random Variables and Probability Distributions
Modeling and Simulation CS 313
Analyzing Redistribution Matrix with Wavelet
Econ 3790: Business and Economics Statistics
Bayesian Inference, Basics
Example Human males have one X-chromosome and one Y-chromosome,
Parametric Methods Berlin Chen, 2005 References:
Pattern Recognition and Machine Learning Chapter 2: Probability Distributions July chonbuk national university.
Mathematical Foundations of BME Reza Shadmehr
Applied Statistics and Probability for Engineers
Presentation transcript:

Bayesian Analysis and Applications of A Cure Rate Model

Introduction  Standard Cure Rate Model: : represents the survivor function for the entire population. : represents the survivor function for the non-cured group in the population. Exponential and Weibull distributions are commonly used : means the cured population.

Drawbacks 1. Cannot have a proportional hazard structure 2. When including covariates through the parameter via a standard binomial regression model, the standard cure rate model yields improper posterior distributions. Then it implies that Bayesian inference with a standard cure rate model essentially requires a proper prior.

New Model  N denote the number of cancer cells left active for that individual after the initial treatment, and have a Poisson distribution with mean  denotes the random time for the ith cancer cell to produce a detectable cancer mass and assumed them to be iid with a common distribution function F ( t ) = 1- S ( t ) and be independent of N.  Y= min (, ) denotes the time to relapse of the cancer, where P ( ) =1.  Hence, the survival function for the population is given by = P ( no cancer by time y) =P ( N = 0 ) + P ( ) =exp( - F(y)).

Property of the New Model , the cure rate fraction, is not a proper model. As, the cure fraction tends to 0, whereas as the cure fraction tends to 1.  The density function The hazard function: They are not proper probability density function or hazard function. However is multiplicative in and f(y); thus, it has the proportional hazard structure which is more appealing than the one from the standard cure rate model and is computationally attractive.

Property of the New Model(Con.)  The model can be written as: where which is a proper survival function. The density function is and the hazard function is: which doesn’t have a proportional hazard structure. It shows a mathematical relationship between the standard cure rate model and the new model.  In this model, let,where x is a vector of covariates and is vector of regression coefficients.

The Likelihood Function  Let n be the number of subjects. be the failure time for the ith subject. be the censoring time. be the censoring indicator. ‘1’ means failure time, ’0’ means right censoring. be the observed data, be the total data. N be an unobserved vector of a latent variable be the number of carcinogenic cells for the ith subject, assuming that are Poisson random variables with mean, i=1,2,…..,n.

The Likelihood Function(Con.)  Suppose that the iid incubation times for the cancer cells for the ith subject and all have cdf  The complete-data likelihood function of the parameter is Where

The Likelihood Function(Con.)  By summing out the unobserved latent vector N, the complete-data likelihood function can be reduced to

The Noninformative Prior Distibution  Suppose a joint noninformative prior for of the form where are the parameters in. This noninformative prior implies that and are independent priors and that is a uniform improper prior. Then, the posterior distribution of based on the observed data is given follows log-logistic, Gompertz and gamma distributions, respectively

The Noninformative Prior Distibution(Con.)  Even though we consider three distributions for, we give the same conditions which are where and are two specified hyperparameters.  Let and be an matrix with rows. If (a) is of full rank, (b) is proper, (c) and,  Based on these condition, we can get the result which is that the posterior is proper.

The Noninformative Prior Distibution(Con.)  Based on the specific property for Gompertz distribution, we can have another conditions and still get the same results. Firstly, we assume throughout that where and are two specified hyperparameters, where and, the posterior distribution of based on the observed data is Secondly, let and be an matrix with rows. If (a) is of full rank, (b) is proper, (c) and, where. Then the posterior is proper.

The Informative Prior Distibution  Kolmogorov-Smirnov Test To determine if two datasets differ significantly The Kolmogorov-Smirnov two sample test statistic is defined as where E1 and E2 are the empirical distribution functions based on Kaplan-Meier survival estimate for the two samples.

The Informative Prior Distibution  is an matrix of covariates corresponding to, let denote the complete historical data. Further, let denotes the initial prior distribution for We propose a joint informative prior distribution of the form where is the complete data likelihood with D being replaced by the historical data and We take a noninformative prior for such as, which implies. A beta prior is chosen for leading to the joint prior distribution. where are the specified prior parameters

Data Analysis  We will demonstrate the application of our proposed models by analyzing the data from a phase III melanoma clinical trial. 1. Observe the maximum likelihood estimates of the parameters for the proposed models under three different distributions. 2. Carry out a Bayesian analysis with covariates using the proposed noninformative priors. In this study, we obtain the posterior estimates of the parameters for the proposed models with log- logistic, Gompertz and gamma distributions. 3. Carry out a Bayesian analysis with covariates using the proposed informative priors. Three covariates are included in the analyses, they are age ( ), gender ( : male, female), and performance status (PS) ( : fully active, other)

Data Analysis(Con.)  Summary of E1684 data Survival time (year) Median 2.91 SD 2.83 Status(frequency) Censored 110 Death 174 Age(year) Mean SD Gender(frequency) Male 171 Female 113 PS (frequency) Fully active 253 Other 31

Data Analysis(Con.)  Summary of E 1673 data Survival time (year) Median 5.72 SD 8.20 Status(frequency) Censored 257 Death 393 Age(year) Mean SD Gender(frequency) Male 375 Female 275 PS (frequency) Fully active 561 Other 89

Data Analysis(Con.)  Figure 1.1 displays a Kaplan-Meier plot for overall survival. The three different distributions fit the data all well.

Data Analysis (Con.)  MLE's of the model parameters for the E1684 data Compare with the maximum likelihood estimates, standard deviations and p-values for the proposed models under log-logistic, Gompertz, and Gamma distributions each other. We find that all results are similar. The p-values associated with the covariates are all greater than This implies that none of age, gender and PS is statistically significant at level =0.05.

Data Analysis (MLE Analysis)  The following is the MLE's of the Model Parameters with Weibull Distribution  Variable MLE SD P-value Age Gender PS

Data Analysis (MLE Analysis)  The following is the MLE's of the Model Parameters with log- logistic Distribution  Variable MLE SD P-value Age Gender PS

Data Analysis (MLE Analysis)  The following is the MLE's of the Model Parameters with Gompertz Distribution  Variable MLE SD P-value Age Gender PS

Data Analysis (MLE Analysis)  The following is the MLE's of the Model Parameters with Gamma Distribution  Variable MLE SD P-value Age Gender PS

Data Analysis (Noninfroamtive and Informative Prior Analysis)  We carry out a Bayesian analysis with covariates using the proposed noninformative or informative priors to demonstrate our second or third application of the proposed model under the three different distributions. E1684 is used as current data., we take an improper uniform prior, and for, we take =1 and =0.01 to ensure a proper prior.The parameter is taken to have a normal distribution with mean 0 and variance 10,000. E1673 serves as the historical data for our Bayesian analysis of E1684.

Data Analysis (Result)  Incorporating historical data can yield more precise posterior estimates for age, gender and PS. Their posterior estimates, SD, and 95% HPD do not change a great deal if a low or moderate weight is given to the historical data. However, if a higher than moderate weight is given, the posterior summaries can change substantially. Then, age and gender are potentially important factors for predicting survival.The posterior estimate for age is positive, implying that as age goes up, the number of carcinogenic cells increases. Therefore, older patients have shorter relapse-free survival. the posterior estimate of gender is negative, implying that the number of carcinogenic cells for females are less than the number of carcinogenic cells for males.

Data Analysis (Result)  as the posterior estimate of increases, the posterior estimate for age becomes larger while the posterior estimates for gender and PS become smaller. The posterior standard deviations of the model parameters become smaller and the 95% HPD become narrower as the posterior estimate of increases. This demonstrates that incorporation of historical data can yield precise posterior estimates of age, gender and PS parameters. We can see that there is a large difference in these estimates, especially in the standard deviations and 95% HPD.

Data Analysis (Result)  When a low weight is given to the historical data, the posterior estimate of PS is negative. It implies that cancer cell counts for the patients whose PS is fully active are more than that when PS is not fully active after the initial treatment. When a higher weight is given to the historical data, the posterior estimate of PS becomes positive which implies that patients whose PS is fully active have longer relapse-free survival than patients whose PS is not fully active. The posterior estimates for age are all positive and their values increase when the posterior estimate of increases. This implies that as age goes up, the number of carcinogenic cells increases. This tells us that incorporation historical data, we can obtain better results. The posterior estimates for gender are all negative and becomes smaller when the power is increasing. There is a gender difference, when we incorporate historical data, the difference becomes significant.

Data Analysis (Table)

Data Analysis (Detailed Sensitivity Analysis by Varying the Hyperparameters)  For illustration purposes, we only show results with a fixed value for. In our study, we fix the hyperparameters for =0.29and vary the hyperparameters for. Firstly, varying the variance of and from small value to large value which implies that shape of the or becomes from narrow to flat. Secondly, varying the mean of or from the small to large. Based on the two conditions, we check the influence on the regression coefficients. Through these detailed sensitivity analysis, we find that the posterior estimates of age, gender and PS are robust for a wide range of hyperparameter values.

Conclusion and Discussion  We have also investigated the melanoma data using three different methods for each distribution: Even though we use different methods and different distributions, the results are almost the same. Incorporation of historical data can improve the posterior estimates, standard deviations and 95% HPD of age, gender and PS. And age and gender are potentially important prognostic factors for predicting overall survival in melanoma. This demonstrates a desirable feature of our model. Such a conclusion is not possible based only on a frequentist or a Bayesian analysis of the current data alone. Thus, incorporating historical data can yield more precise posterior estimates of age, gender and PS.