Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.

Slides:



Advertisements
Similar presentations
Surviving Survival Analysis
Advertisements

Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
If we use a logistic model, we do not have the problem of suggesting risks greater than 1 or less than 0 for some values of X: E[1{outcome = 1} ] = exp(a+bX)/
Survival Analysis-1 In Survival Analysis the outcome of interest is time to an event In Survival Analysis the outcome of interest is time to an event The.
Survival Analysis. Statistical methods for analyzing longitudinal data on the occurrence of events. Events may include death, injury, onset of illness,
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
Departments of Medicine and Biostatistics
CHAPTER 23: Two Categorical Variables: The Chi-Square Test
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
بسم الله الرحمن الرحیم. Generally,survival analysis is a collection of statistical procedures for data analysis for which the outcome variable of.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Survival analysis1 Every achievement originates from the seed of determination.
Final Review Session.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Introduction to Survival Analysis PROC LIFETEST and Survival Curves.
Measures of disease frequency (I). MEASURES OF DISEASE FREQUENCY Absolute measures of disease frequency: –Incidence –Prevalence –Odds Measures of association:
5-3 Inference on the Means of Two Populations, Variances Unknown
Sample Size Determination
ANOVA and Regression Brian Healy, PhD.
Linear regression Brian Healy, PhD BIO203.
Analysis of Complex Survey Data
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
The Bahrain Branch of the UK Cochrane Centre In Collaboration with Reyada Training & Management Consultancy, Dubai-UAE Cochrane Collaboration and Systematic.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Survival Curves Marshall University Genomics Core.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
1 Survival Analysis Biomedical Applications Halifax SAS User Group April 29/2011.
HSRP 734: Advanced Statistical Methods July 10, 2008.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
Lecture 3 Survival analysis.
Sample Size Determination Donna McClish. Issues in sample size determination Sample size formulas depend on –Study design –Outcome measure Dichotomous.
Introduction to Biostatistics/Hypothesis Testing
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
1 Introduction to medical survival analysis John Pearson Biostatistics consultant University of Otago Canterbury 7 October 2008.
CHAPTER 18: Inference about a Population Mean
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
INTRODUCTION TO SURVIVAL ANALYSIS
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
01/20151 EPI 5344: Survival Analysis in Epidemiology Survival curve comparison (non-regression methods) March 3, 2015 Dr. N. Birkett, School of Epidemiology,
HSRP 734: Advanced Statistical Methods July 17, 2008.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
Contingency tables Brian Healy, PhD. Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomous t-test, Wilcoxon test ContinuousCategorical.
Randomized Trial of Preoperative Chemoradiation Versus Surgery Alone in Patients with Locoregional Esophageal Carcinoma, Ursa et al. Statistical Methods:
The binomial applied: absolute and relative risks, chi-square.
Primer on Statistics for Interventional Cardiologists Giuseppe Sangiorgi, MD Pierfrancesco Agostoni, MD Giuseppe Biondi-Zoccai, MD.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Analysis of Variance (ANOVA) Brian Healy, PhD BIO203.
1 Lecture 6: Descriptive follow-up studies Natural history of disease and prognosis Survival analysis: Kaplan-Meier survival curves Cox proportional hazards.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
Survival Analysis approach in evaluating the efficacy of ARV treatment in HIV patients at the Dr GM Hospital in Tshwane, GP of S. Africa Marcus Motshwane.
01/20151 EPI 5344: Survival Analysis in Epidemiology Actuarial and Kaplan-Meier methods February 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
01/20151 EPI 5344: Survival Analysis in Epidemiology Cox regression: Introduction March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
P-values and statistical inference Dr. Omar Aljadaan.
INTRODUCTION TO CLINICAL RESEARCH Survival Analysis – Getting Started Karen Bandeen-Roche, Ph.D. July 20, 2010.
Correlation. u Definition u Formula Positive Correlation r =
+ Section 11.1 Chi-Square Goodness-of-Fit Tests. + Introduction In the previous chapter, we discussed inference procedures for comparing the proportion.
SUMMARY EQT 271 MADAM SITI AISYAH ZAKARIA SEMESTER /2015.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8… Where we are going… Significance Tests!! –Ch 9 Tests about a population proportion –Ch 9Tests.
SURVIVAL ANALYSIS PRESENTED BY: DR SANJAYA KUMAR SAHOO PGT,AIIH&PH,KOLKATA.
Methods and Statistical analysis. A brief presentation. Markos Kashiouris, M.D.
1 Analysis of Survival Data with Demographic Applications (Spring term 2006) Lecture 3: Non-Parametric Comparison of two or more Survival Curves.
Carolinas Medical Center, Charlotte, NC Website:
April 18 Intro to survival analysis Le 11.1 – 11.2
Applied Biostatistics: Lecture 2
Statistics 103 Monday, July 10, 2017.
Statistical Inference about Regression
What’s the plan? First, we are going to look at the correlation between two variables: studying for calculus and the final percentage grade a student gets.
Presentation transcript:

Survival analysis Brian Healy, PhD

Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression

What are we doing today? Survival analysis Survival analysis –Kaplan-Meier curve –Dichotomous predictor –How to interpret results Cox proportional hazards Cox proportional hazards –Continuous predictor –How to interpret results

Big picture In medical research, we often confront continuous, ordinal or dichotomous outcomes In medical research, we often confront continuous, ordinal or dichotomous outcomes One other common outcome is time to event (survival time) One other common outcome is time to event (survival time) –Clinical trials often measure time to death or time to relapse We would like to estimate the survival distribution We would like to estimate the survival distribution

Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomoust-test, Wilcoxon test ContinuousCategoricalANOVA, linear regression Continuous Correlation, linear regression Dichotomous Chi-square test, logistic regression DichotomousContinuousLogistic regression Time to eventDichotomousLog-rank test

Definitions Survival time: time to event Survival time: time to event Survival function: probability survival time is greater than a specific value S(t)=P(T>t) Survival function: probability survival time is greater than a specific value S(t)=P(T>t) Hazard function: risk of having the event (t)=# who had event/# at risk Hazard function: risk of having the event (t)=# who had event/# at risk These two factors are mathematically related These two factors are mathematically related

Example An important marker of disease activity in MS is the occurrence of a relapse An important marker of disease activity in MS is the occurrence of a relapse –This is the presence of new symptoms that lasts for at least 24 hours Many clinical trials in MS have demonstrated that treatments increase the time until the next relapse Many clinical trials in MS have demonstrated that treatments increase the time until the next relapse –How does the time to next relapse look in the clinic? What is the distribution of survival times? What is the distribution of survival times?

Kaplan-Meier curve Each drop in the curve represents an event

Survival data To create this curve, patients placed on treatment were followed and the time of the first relapse on treatment was recorded To create this curve, patients placed on treatment were followed and the time of the first relapse on treatment was recorded –Survival time If everyone had an event, some of the methods we have already learned could be applied If everyone had an event, some of the methods we have already learned could be applied Often, not everyone has event Often, not everyone has event –Loss to follow-up –End of study

Censoring The patients who did not have the event are considered censored The patients who did not have the event are considered censored –We know that they survived a specific amount of time, but do not know the exact time of the event –We believe that the event would have happened if we observed them long enough These patients provide some information, but not complete information These patients provide some information, but not complete information

Censoring How could we account for censoring? How could we account for censoring? –Ignore it and say event occurred at time of censoring  Incorrect because this is almost certainly not true –Remove patient from analysis  Potential bias and loss of power –Survival analysis Our objective is to estimate the survival distribution of patients in the presence of censoring Our objective is to estimate the survival distribution of patients in the presence of censoring

Example For simplicity, let’s focus on 10 patients whose time to relapse is provided here For simplicity, let’s focus on 10 patients whose time to relapse is provided here We assume that no one is censored initially We assume that no one is censored initially We would like to estimate S(t) and (t) We would like to estimate S(t) and (t) PatientTime

What do we see from our curve? 1)Drops in the curve only occur at time of event 2)Between events, the estimated survival remains constant What is the size of the drops?

Calculating size of drop To calculate the hazard at each time point=# events/# at risk To calculate the hazard at each time point=# events/# at risk –If no event, hazard=0 To calculate estimated survival use: To calculate estimated survival use: 1/ /9 1/8 1/7 1/5 1/4 1/3 1/2 1/1 1/ PatientTime

Example-censoring For simplicity, let’s focus on 10 patients whose time to relapse is provided here For simplicity, let’s focus on 10 patients whose time to relapse is provided here We assume that no one is censored initially We assume that no one is censored initially We would like to estimate S(t) and (t) We would like to estimate S(t) and (t) PatientTime

What do we see from our curve? 1)Drops in the curve only occur at time of event 2)Between events, the estimated survival remains constant 3)Survival curve does not drop at censored times

PatientTime Calculating size of drop To calculate the hazard at each time point=# events/# at risk To calculate the hazard at each time point=# events/# at risk –If no event, hazard=0 To calculate estimated survival use: To calculate estimated survival use: 1/ /8 1/7 1/5 1/4 1/3 0 1/

Confidence interval for survival curve A confidence interval can be placed around the estimated survival curve A confidence interval can be placed around the estimated survival curve –Greenwood’s formula

Summary Kaplan-Meier curve represents the distribution of survival times Kaplan-Meier curve represents the distribution of survival times Drops only occur at event times Drops only occur at event times Censoring easily accommodated Censoring easily accommodated If last time is not event, curve does not go to zero If last time is not event, curve does not go to zero

Comparison of survival curve One important aspect of survival analysis is the comparison of survival curves One important aspect of survival analysis is the comparison of survival curves Null hypothesis: S 1 (t)=S 2 (t) Null hypothesis: S 1 (t)=S 2 (t) Method: log-rank test Method: log-rank test

Example Untreated PatientTime TreatedPatientTime

Log-rank test-technical To compare survival curves, a log-rank test creates 2x2 tables at each event time and combines across the tables To compare survival curves, a log-rank test creates 2x2 tables at each event time and combines across the tables –Similar to MH-test Provides a  2 statistic with 1 degree of freedom (for a two sample comparison) and a p-value Provides a  2 statistic with 1 degree of freedom (for a two sample comparison) and a p-value Same procedure for hypothesis testing Same procedure for hypothesis testing

Hypothesis test 1) H 0 : S 1 (t)=S 2 (t) 2) Time to event outcome, dichotomous predictor 3) Log rank test 4) Test statistic:  2 =4.4 5) p-value= ) Since the p-value is less than 0.05, we reject the null hypothesis 7) We conclude that there is a significant difference in the survival time in the treated compared to untreated

p-value

Notes Inspection of Kaplan-Meier curve will allow you to determine which of the groups had the significantly longer survival time Inspection of Kaplan-Meier curve will allow you to determine which of the groups had the significantly longer survival time Other tests are possible Other tests are possible –Gehan’s generalized Wilcoxon test –Tarone-Ware test –Peto-Peto-Prentice test Generally give similar results, but emphasize different parts of survival curve Generally give similar results, but emphasize different parts of survival curve