Imputation method to correct overestimated shape parameter of

Slides:



Advertisements
Similar presentations
METHODOLOGY FOR META- ANALYSIS OF TIME TO EVENT TYPE OUTCOMES TO INFORM ECONOMIC EVALUATIONS Nicola Cooper, Alex Sutton, Keith Abrams Department of Health.
Advertisements

Evidence synthesis of competing interventions when there is inconsistency in how effectiveness outcomes are measured across studies Nicola Cooper Centre.
Sampling Distributions (§ )
Central Limit Theorem.
Simulation Modeling and Analysis
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference.
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
NASSER DAVARZANI DEPARTMENT OF KNOWLEDGE ENGINEERING MAASTRICHT UNIVERSITY, 6200 MAASTRICHT, THE NETHERLANDS 22 OCTOBER 2012 Introduction to Survival Analysis.
POSC 202A: Lecture 9 Lecture: statistical significance.
Graphical Summary of Data Distribution Statistical View Point Histograms Skewness Kurtosis Other Descriptive Summary Measures Source:
On Model Validation Techniques Alex Karagrigoriou University of Cyprus "Quality - Theory and Practice”, ORT Braude College of Engineering, Karmiel, May.
Statistical approaches to analyse interval-censored data in a confirmatory trial Margareta Puu, AstraZeneca Mölndal 26 April 2006.
Continuous Probability Distributions Continuous random variable –Values from interval of numbers –Absence of gaps Continuous probability distribution –Distribution.
2 Input models provide the driving force for a simulation model. The quality of the output is no better than the quality of inputs. We will discuss the.
An Empirical Likelihood Ratio Based Goodness-of-Fit Test for Two-parameter Weibull Distributions Presented by: Ms. Ratchadaporn Meksena Student ID:
Bayesian Analysis and Applications of A Cure Rate Model.
Toward urgent forecasting of aftershock hazard: Simultaneous estimation of b-value of the Gutenberg-Richter ’ s law of the magnitude frequency and changing.
Chapter 11: Inference for Distributions of Categorical Data Section 11.1 Chi-Square Goodness-of-Fit Tests.
Toward a unified approach to fitting loss models Jacques Rioux and Stuart Klugman, for presentation at the IAC, Feb. 9, 2004.
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
Figure 13.1 Relationship to the Previous Chapters & The Marketing Research Process Figure 13.1 Relationship of Sample Size Determination to the Previous.
01/20151 EPI 5344: Survival Analysis in Epidemiology Actuarial and Kaplan-Meier methods February 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Topic 19: Survival Analysis T = Time until an event occurs. Events are, e.g., death, disease recurrence or relapse, infection, pregnancy.
Confidential and Proprietary Business Information. For Internal Use Only. Statistical modeling of tumor regrowth experiment in xenograft studies May 18.
Theoretical distributions: the Normal distribution.
Carolinas Medical Center, Charlotte, NC Website:
Probability distributions and likelihood
Modeling and Simulation CS 313
Where Are You? Children Adults.
Chapter 10: Comparing Two Populations or Groups
Confidence Interval Estimation
Continuous Probability Distributions Part 2
Nature of Estimation.
CHAPTER 10 Comparing Two Populations or Groups
Modeling and Simulation CS 313
Sampling Distributions & Point Estimation
CHAPTER 10 Comparing Two Populations or Groups
Michael Epstein, Ben Calderhead, Mark A. Girolami, Lucia G. Sivilotti 
Statistics Branch of mathematics dealing with the collection, analysis, interpretation, presentation, and organization of data. Practice or science of.
Long Subject Presentation Sojourn Models for Ordinal Data
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Continuous Probability Distributions Part 2
CHAPTER 10 Comparing Two Populations or Groups
Lecture: statistical significance.
Continuous Probability Distributions Part 2
Chapter 7 Sampling Distributions
Fig. 2. Best model fits. Best model fits. Illustration of the best model fits for the (A) basic, (B) continuous, and (C) cluster models. See Table 1 and.
Survival analysis Diagnostic Histopathology
Continuous Probability Distributions Part 2
Continuous Probability Distributions Part 2
CHAPTER 10 Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Sampling Distributions (§ )
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Chapter 10: Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Volume 12, Issue 9, Pages (April 2002)
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Continuous Probability Distributions Part 2
CHAPTER 10 Comparing Two Populations or Groups
Copyright © 2015 Elsevier Inc. All rights reserved.
Generating Random Variates
Kaplan-Meier survival curves and the log rank test
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Presentation transcript:

Imputation method to correct overestimated shape parameter of Weibull hazard function in time-to-event modeling Dong-Seok Yim1 , Sung Min Park1, Kyungmee Choi2, Seunghoon Han1 1Department of Clinical Pharmacology and Therapeutics, Seoul St. Mary’s Hospital, PIPET (Pharmacometrics Institute for Practical Education and Training), South Korea 2Division of Mathematics, College of Science and Technology, Hongik University, Sejong, South Korea BACKGROUND Hyper-acute, mass occurrence of events (e.g., at day 0~1) is not rare in biomedical data. But, the overestimation of maximum likelihood estimator (MLE) of the Weibull shape parameter at survival analysis of such data has been neglected so far (e.g., Fig. 1). This biased result is observed regardless of statistical software (R, SAS, NONMEM…) Without correcting this bias, robust models for time-to-event data with hyper-acutely occurring character may not be developed. Thus, we performed this simulation study and present a useful correction method that is applicable to time-to-event modeling in pharmacometrics as well as conventional survival analysis. RESULTS Iterative imputation worked like this… Iteration Stopped (p>0.999) Figure 4 Improvement of biases by the iterative imputation process (in the case of Weibull-based imputation) –an example case After repeated Kolmogorov-Smirnov test, when the p-value was at last > 0.999, we concluded that the K-M plots of reference dataset and the hybrid dataset (iteratively-renewed) were identical and stopped the iterative imputation. It took 3~20 iterative imputation steps before attaining the p-value > 0.99 for each of reference datasets with differing Weibull shape parameters of 0.2 ~ 2) Figure 1 An example of biased Weibull estimates for hyper-acutely occurring events in biomedical studies. (A part of this topic has been presented in ACoP 2015.) Question: When do we get biased maximum likelihood estimates? Answer: Bias was obvious when the shape parameter β was < 0.5 . (because information loss at day1 worsens.) METHOD Raw dataset simulation Survival datasets (100 subjects / dataset) per shape parameters(β) ranging from 0.2 to 2 were simulated using the rweibull (30, β) function in the R Statistical Software. The scale parameter α was fixed to 30. “Reference datasets” by modification of the raw datasets Discretization: the raw dataset event times (t) were discretized by rounding up. i.e., when 0 < t ≤ 1, t’s were rounded up to 1 (day) to mimic the survival data recorded by the unit of day. Censoring: The raw dataset event times (t) > 28 (days) were discarded. Estimation of α and β of reference datasets using survreg ( ) fuction of R: How big is the bias? The biases from the conventional statistical method were first estimated. Kaplan-Meier (K-M) curves and Bias were presented as results (Fig. 5). Use of hybrid dataset to correct the bias (To impute the lost information at day1) We replaced the day1 data with imputed (simulated) event time data (hybrid dataset): Detailed methods are summarized in Fig. 2. BIASED!!! Figure 5 Comparison of K-M curves from simulated data and its estimates. Black: reference datasets (simulated datasets) Blue: MLE (and 95% P.I.’s in green shade) from conventional statistical analysis See Fig. 3 See Fig. 4 Question: Which distribution to use for imputation, Weibull or Uniform? Answer: Just Use Weibull. Figure 6 Comparison of K-M curves of reference datasets and corresponding hybrid datasets imputed from Weibull or Uniform distribution. -Black: reference dataset - Red: using uniform distribution (bias remained despite iteration: never passed K-M test.) - Blue: using Weibull distribution Table 1 Biases after applying imputation methods. Figure 2 Workflow diagram of simulation study (K-S: Kolmogorov-Smirnov). True value of scale parameter shape parameter Bias (Uniform) (Weibull) 30 0.2 0.20 0.033 0.4 0.11 0.031 0.6 0.06 0.034 0.8 0.02 0.028 CONCLUSION Sometimes, we come across biomedical data with hyper-acute, mass events at the very early observation interval. It is not a rare situation. We confirmed that the conventional survival analysis was not appropriate to estimate time-to-event data when its shape parameter was ≤ 0.5. We suggest the use of the iterative-imputation method presented herein when biased estimates are observed in the analysis of such dataset. Figure 3 Schema of the iterative imputation process (simulation with Weibull parameters)