Satistics 2621 Statistics 262: Intermediate Biostatistics Jonathan Taylor and Kristin Cobb April 20, 2004: Introduction to Survival Analysis.

Slides:



Advertisements
Similar presentations
Surviving Survival Analysis
Advertisements

If we use a logistic model, we do not have the problem of suggesting risks greater than 1 or less than 0 for some values of X: E[1{outcome = 1} ] = exp(a+bX)/
Survival Analysis. Statistical methods for analyzing longitudinal data on the occurrence of events. Events may include death, injury, onset of illness,
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
Tests for time-to-event outcomes (survival analysis)
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Statistics 262: Intermediate Biostatistics
1 Statistics 262: Intermediate Biostatistics Kaplan-Meier methods and Parametric Regression methods.
بسم الله الرحمن الرحیم. Generally,survival analysis is a collection of statistical procedures for data analysis for which the outcome variable of.
Intermediate methods in observational epidemiology 2008 Instructor: Moyses Szklo Measures of Disease Frequency.
Main Points to be Covered
Introduction to Survival Analysis
Introduction to Survival Analysis
BIOST 536 Lecture 3 1 Lecture 3 – Overview of study designs Prospective/retrospective  Prospective cohort study: Subjects followed; data collection in.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Cohort Studies.
Measures of disease frequency (I). MEASURES OF DISEASE FREQUENCY Absolute measures of disease frequency: –Incidence –Prevalence –Odds Measures of association:
Cox Proportional Hazards Regression Model Mai Zhou Department of Statistics University of Kentucky.
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
Analysis of Complex Survey Data
1 Tests for time-to-event outcomes (survival analysis)
The Bahrain Branch of the UK Cochrane Centre In Collaboration with Reyada Training & Management Consultancy, Dubai-UAE Cochrane Collaboration and Systematic.
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
1 Kaplan-Meier methods and Parametric Regression methods Kristin Sainani Ph.D. Stanford University Department of Health.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Survival Curves Marshall University Genomics Core.
Introduction to Survival Analysis August 3 and 5, 2004.
Study Design / Data: Case-Control, Descriptives Basic Medical Statistics Course: Module C October 2010 Wilma Heemsbergen
HSTAT1101: 27. oktober 2004 Odd Aalen
Multiple Choice Questions for discussion
01/20141 EPI 5344: Survival Analysis in Epidemiology Introduction to concepts and basic methods February 25, 2014 Dr. N. Birkett, Department of Epidemiology.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
1 Survival Analysis Biomedical Applications Halifax SAS User Group April 29/2011.
NASSER DAVARZANI DEPARTMENT OF KNOWLEDGE ENGINEERING MAASTRICHT UNIVERSITY, 6200 MAASTRICHT, THE NETHERLANDS 22 OCTOBER 2012 Introduction to Survival Analysis.
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
The life table LT statistics: rates, probabilities, life expectancy (waiting time to event) Period life table Cohort life table.
Prevalence The presence (proportion) of disease or condition in a population (generally irrespective of the duration of the disease) Prevalence: Quantifies.
Design and Analysis of Clinical Study 11. Analysis of Cohort Study Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
INTRODUCTION TO SURVIVAL ANALYSIS
Cox Regression II Kristin Sainani Ph.D. Stanford University Department of Health Research and Policy Kristin Sainani Ph.D.
Chapter 12 Survival Analysis.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
HSRP 734: Advanced Statistical Methods July 31, 2008.
Epidemiologic design from a sampling perspective Epidemiology II Lecture April 14, 2005 David Jacobs.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Measures of Association and Impact Michael O’Reilly, MD, MPH FETP Thailand Introductory Course.
Natural History/ Prognosis Studies Natural history of disease (clinical course; preclinical/ clinical stage) Prognosis (death/survivors of disease) Survival.
Censoring an observation of a survival r.v. is censored if we don’t know the survival time exactly. usually there are 3 possible reasons for censoring.
School of Epidemiology, Public Health &
Lecture 9: Analysis of intervention studies Randomized trial - categorical outcome Measures of risk: –incidence rate of an adverse event (death, etc) It.
1 Lecture 6: Descriptive follow-up studies Natural history of disease and prognosis Survival analysis: Kaplan-Meier survival curves Cox proportional hazards.
Describing the risk of an event and identifying risk factors Caroline Sabin Professor of Medical Statistics and Epidemiology, Research Department of Infection.
Lecture 5: The Natural History of Disease: Ways to Express Prognosis
01/20151 EPI 5344: Survival Analysis in Epidemiology Actuarial and Kaplan-Meier methods February 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
01/20151 EPI 5344: Survival Analysis in Epidemiology Cox regression: Introduction March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
12/20091 EPI 5240: Introduction to Epidemiology Incidence and survival December 7, 2009 Dr. N. Birkett, Department of Epidemiology & Community Medicine,
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
01/20151 EPI 5344: Survival Analysis in Epidemiology Quick Review from Session #1 March 3, 2015 Dr. N. Birkett, School of Epidemiology, Public Health &
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
INTRODUCTION TO CLINICAL RESEARCH Survival Analysis – Getting Started Karen Bandeen-Roche, Ph.D. July 20, 2010.
Topic 19: Survival Analysis T = Time until an event occurs. Events are, e.g., death, disease recurrence or relapse, infection, pregnancy.
EPI 5344: Survival Analysis in Epidemiology Week 6 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive Medicine, University of Ottawa 03/2016.
SURVIVAL ANALYSIS PRESENTED BY: DR SANJAYA KUMAR SAHOO PGT,AIIH&PH,KOLKATA.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Survival time treatment effects
April 18 Intro to survival analysis Le 11.1 – 11.2
Overview What is survival analysis? Terminology and data structure.
Tests for time-to-event outcomes (survival analysis)
Statistics 262: Intermediate Biostatistics
Some Epidemiological Studies
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Presentation transcript:

Satistics 2621 Statistics 262: Intermediate Biostatistics Jonathan Taylor and Kristin Cobb April 20, 2004: Introduction to Survival Analysis

Satistics 2622 What is survival analysis? Statistical methods for analyzing longitudinal data on the occurrence of events. Events may include death, injury, onset of illness, recovery from illness (binary variables) or transition above or below the clinical threshold of a meaningful continuous variable (e.g. CD4 counts). Accommodates data from randomized clinical trial or cohort study design.

Randomized Clinical Trial (RCT) Target population Intervention Control Disease Disease-free Disease Disease-free TIME Random assignment Disease-free, at-risk cohort

Target population Treatment Control Cured Not cured Cured Not cured TIME Random assignment Patient population Randomized Clinical Trial (RCT)

Target population Treatment Control Dead Alive Dead Alive TIME Random assignment Patient population Randomized Clinical Trial (RCT)

Cohort study (prospective/retrospective) Target population Exposed Unexposed Disease Disease-free Disease Disease-free TIME Disease-free cohort

Satistics 2627 Estimate time-to-event for a group of individuals, such as time until second heart- attack for a group of MI patients. To compare time-to-event between two or more groups, such as treated vs. placebo MI patients in a randomized controlled trial. To assess the relationship of co-variables to time-to-event, such as: does weight, insulin resistance, or cholesterol influence survival time of MI patients? Note: expected time-to-event = 1/incidence rate Objectives of survival analysis

Satistics 2628 Examples of survival analysis in medicine

RCT: Women’s Health Initiative (JAMA, 2001) On hormones On placebo Cumulative incidence

Satistics Prospective cohort study: From April 15, 2004 NEJM: Use of Gene-Expression Profiling to Identify Prognostic Subclasses in Adult Acute Myeloid Leukemia

Satistics Retrospective cohort study: From December 2003 BMJ: Aspirin, ibuprofen, and mortality after myocardial infarction: retrospective cohort study

Satistics Why use survival analysis? 1. Why not compare mean time-to-event between your groups using a t-test or linear regression? -- ignores censoring 2. Why not compare proportion of events in your groups using logistic regression? --ignores time

Satistics Cox regression vs.logistic regression Distinction between rate and proportion: Incidence (hazard) rate: number of new cases of disease per population at-risk per unit time (or mortality rate, if outcome is death) Cumulative incidence: proportion of new cases that develop in a given time period

Satistics Cox regression vs.logistic regression Distinction between hazard/rate ratio and odds ratio/risk ratio: Hazard/rate ratio: ratio of incidence rates Odds/risk ratio: ratio of proportions By taking into account time, you are taking into account more information than just binary yes/no. Gain power/precision. Logistic regression aims to estimate the odds ratio; Cox regression aims to estimate the hazard ratio

Satistics Rates vs. risks Relationship between risk and rates:

Satistics Rates vs. risks For example, if rate is 5 cases/1000 person-years, then the chance of developing disease over 10 years is: Compare to.005(10) = 5% The loss of persons at risk because they have developed disease within the period of observation is small relative to the size of the total group.

Satistics Rates vs. risks If rate is 50 cases/1000 person-years, then the chance of developing disease over 10 years is: Compare to.05(10) = 50%

Satistics Rates vs. risks Relationship between risk and rates (derivation): Exponential density function for waiting time until the event (constant hazard rate) Preview: Waiting time distribution will change if the hazard rate changes as a function of time: h(t)

Survival Analysis: Terms Time-to-event: The time from entry into a study until a subject has a particular outcome Censoring: Subjects are said to be censored if they are lost to follow up or drop out of the study, or if the study ends before ends before they die or have an outcome of interest. They are counted as alive or disease-free for the time they were enrolled in the study. If dropout is related to both outcome and treatment, dropouts may bias the results

Satistics Right Censoring (T>t) Common examples Termination of the study Death due to a cause that is not the event of interest Loss to follow-up We know that subject survived at least to time t.

Satistics Left censoring (T<t) The origin time, not the event time, is known only to be less than some value. For example, if you are studying menarche and you begin following girls at age 12, you may find that some of them have already begun menstruating. Unless you can obtain information about the start date for those girls, the age of menarche is left-censored at age 12. *from:Allison, Paul. Survival Analysis. SAS Institute

Satistics Interval censoring (a<T<b) When we know the event has occurred between two time points, but don’t know the exact dates. For example, if you’re screening subjects for HIV infection yearly, you may not be able to determine the exact date of infection.* *from:Allison, Paul. Survival Analysis. SAS Institute

Satistics Data Structure: survival analysis Time variable: t i = time at last disease- free observation or time at event Censoring variable: c i =1 if had the event; c i =0 no event by time t i

Satistics Choice of origin

Satistics 26225

Satistics Describing survival distributions T i the event time for an individual, is a random variable having a probability distribution. Different models for survival data are distinguished by different choice of distribution for T i.

Satistics Survivor function (cumulative distribution function) Survival analysis typically uses complement, or the survivor function: Example: If t=100 years, S(t=100) = probability of surviving beyond 100 years. Cumulative failure function

Satistics Corresponding density function The probability of the failure time occurring at exactly time t (out of the whole range of possible t’s). Also written:

Satistics Hazard function In words: the probability that if you survive to t, you will succumb to the event in the next instant. Derivation:

Satistics Relating these functions:

Satistics Introduction to Kaplan-Meier Non-parametric estimate of survivor function. Commonly used to describe survivorship of study population/s. Commonly used to compare two study populations. Intuitive graphical presentation.

Beginning of studyEnd of study  Time in months  Subject B Subject A Subject C Subject D Subject E Survival Data (right-censored) 1. subject E dies at 4 months X

100%  Time in months  Corresponding Kaplan-Meier Curve Probability of surviving to just before 4 months is 100% = 5/5 Fraction surviving this death = 4/5 Subject E dies at 4 months

Beginning of studyEnd of study  Time in months  Subject B Subject A Subject C Subject D Subject E Survival Data 2. subject A drops out after 6 months 1. subject E dies at 4 months X 3. subject C dies at 7 months X

100%  Time in months  Corresponding Kaplan-Meier Curve subject C dies at 7 months Fraction surviving this death = 2/3

Beginning of studyEnd of study  Time in months  Subject B Subject A Subject C Subject D Subject E Survival Data 2. subject A drops out after 6 months 4. Subjects B and D survive for the whole year-long study period 1. subject E dies at 4 months X 3. subject C dies at 7 months X

100%  Time in months  Corresponding Kaplan-Meier Curve Product limit estimate of survival = P(surviving/at-risk through failure 1) * P(surviving/at-risk through failure 2) = 4/5 * 2/3=.5333

The product limit estimate The probability of surviving in the entire year, taking into account censoring = (4/5) (2/3) = 53% NOTE:  40% (2/5) because the one drop- out survived at least a portion of the year. AND <60% (3/5) because we don’t know if the one drop-out would have survived until the end of the year.

Satistics KM estimator, formally

Comparing 2 groups

Caveats Survival estimates can be unreliable toward the end of a study when there are small numbers of subjects at risk of having an event.

WHI and breast cancer Small numbers left

Satistics Overview of SAS PROCS LIFETEST - Produces life tables and Kaplan-Meier survival curves. Is primarily for univariate analysis of the timing of events. LIFEREG – Estimates regression models with censored, continuous-time data under several alternative distributional assumptions. Does not allow for time-dependent covariates. PHREG– Uses Cox’s partial likelihood method to estimate regression models with censored data. Handles both continuous-time and discrete-time data and allows for time-dependent covariables