G89.2247 Lecture 121 Analysis of Time to Event Survival Analysis Language Example of time to high anxiety Discrete survival analysis through logistic regression.

Slides:



Advertisements
Similar presentations
What is Event History Analysis?
Advertisements

What is Event History Analysis?
Surviving Survival Analysis
Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
If we use a logistic model, we do not have the problem of suggesting risks greater than 1 or less than 0 for some values of X: E[1{outcome = 1} ] = exp(a+bX)/
Survival Analysis-1 In Survival Analysis the outcome of interest is time to an event In Survival Analysis the outcome of interest is time to an event The.
Survival Analysis. Statistical methods for analyzing longitudinal data on the occurrence of events. Events may include death, injury, onset of illness,
HSRP 734: Advanced Statistical Methods July 24, 2008.
SC968: Panel Data Methods for Sociologists
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
Main Points to be Covered
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
BIOST 536 Lecture 3 1 Lecture 3 – Overview of study designs Prospective/retrospective  Prospective cohort study: Subjects followed; data collection in.
Proportional Hazard Regression Cox Proportional Hazards Modeling (PROC PHREG)
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
EPI 809/Spring Multiple Logistic Regression.
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
1 G Lect 11M Binary outcomes in psychology Can Binary Outcomes Be Studied Using OLS Multiple Regression? Transforming the binary outcome Logistic.
Main Points to be Covered Cumulative incidence using life table method Difference between cumulative incidence based on proportion of persons at risk and.
Measures of disease frequency (I). MEASURES OF DISEASE FREQUENCY Absolute measures of disease frequency: –Incidence –Prevalence –Odds Measures of association:
Unit 5c: Adding Predictors to the Discrete Time Hazard Model © Andrew Ho, Harvard Graduate School of EducationUnit 5c– Slide 1
Unit 5c: Adding Predictors to the Discrete Time Hazard Model © Andrew Ho, Harvard Graduate School of EducationUnit 5c– Slide 1
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
Survival Analysis: From Square One to Square Two
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
Estimating cancer survival and clinical outcome based on genetic tumor progression scores Jörg Rahnenführer 1,*, Niko Beerenwinkel 1,, Wolfgang A. Schulz.
Rob Lancaster, Orbitz Worldwide Survival Analysis & TTL Optimization.
HSTAT1101: 27. oktober 2004 Odd Aalen
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
NASSER DAVARZANI DEPARTMENT OF KNOWLEDGE ENGINEERING MAASTRICHT UNIVERSITY, 6200 MAASTRICHT, THE NETHERLANDS 22 OCTOBER 2012 Introduction to Survival Analysis.
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
April 11 Logistic Regression –Modeling interactions –Analysis of case-control studies –Data presentation.
Design and Analysis of Clinical Study 11. Analysis of Cohort Study Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
HSRP 734: Advanced Statistical Methods July 17, 2008.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Lecture 12: Cox Proportional Hazards Model
1 Lecture 6: Descriptive follow-up studies Natural history of disease and prognosis Survival analysis: Kaplan-Meier survival curves Cox proportional hazards.
1 Topic 4 : Ordered Logit Analysis. 2 Often we deal with data where the responses are ordered – e.g. : (i) Eyesight tests – bad; average; good (ii) Voting.
01/20151 EPI 5344: Survival Analysis in Epidemiology Cox regression: Introduction March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
Satistics 2621 Statistics 262: Intermediate Biostatistics Jonathan Taylor and Kristin Cobb April 20, 2004: Introduction to Survival Analysis.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Probability and odds Suppose we a frequency distribution for the variable “TB status” The probability of an individual having TB is frequencyRelative.
INTRODUCTION TO CLINICAL RESEARCH Survival Analysis – Getting Started Karen Bandeen-Roche, Ph.D. July 20, 2010.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Analysis of matched data Analysis of matched data.
Additional Regression techniques Scott Harris October 2009.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Chapter 13 LOGISTIC REGRESSION. Set of independent variables Categorical outcome measure, generally dichotomous.
An introduction to Survival analysis and Applications to Predicting Recidivism Rebecca S. Frazier, PhD JBS International.
Logistic Regression APKC – STATS AFAC (2016).
April 18 Intro to survival analysis Le 11.1 – 11.2
Statistics 103 Monday, July 10, 2017.
Multiple logistic regression
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Survival Analysis {Chapter 12}
Categorical Data Analysis Review for Final
Logistic Regression.
Introduction to Logistic Regression
Kaplan-Meier survival curves and the log rank test
Presentation transcript:

G Lecture 121 Analysis of Time to Event Survival Analysis Language Example of time to high anxiety Discrete survival analysis through logistic regression Examples Final comments

G Lecture 122 Analysis of Time to Event Often we are interested the occurrence of key events in time  Attainment of positive development Speaking first word Taking first step Finishing first doctorate  Occurrence of negative outcome Death Relapse following treatment for smoking, drug use Illness among persons at risk

G Lecture 123 Survival Analysis / Life Table Analysis Demographers and Biostatisticians developed a set of methods to look at time to death (time of survival) that are called Survival Analyses. Key features of methods  Focuses on conditional probability of survival (demise) in a period, given that one has survived to that point  Includes data on persons who survive throughout full observation period  Can take into account “censoring” of data – loss to followup that is unrelated to outcome  Recently include methods that allow multivariate modeling of survival

G Lecture 124 Example from Bar Exam We expect persons preparing for the bar exam to be progressively more anxious as the exam draws near If Anxiety passes some threshold, it may be disruptive On the average, how long can people keep their cool?  Define “cool” as having anxiety POMS scores less than 3.0 on a 0-4 scale. How many people have high anxiety days?

G Lecture 125 Time to High Anxiety Suppose we looked at time to first Anxious Day in males and females  38% of males never had a high anxious day  35% of females never had a high anxious day Average time to anxious day appears to be shorter for females than males  17.7 days for females  22.3 days for males  both estimates are biased (too small), since I included values of 31 days for persons who did not experience anxiety. Bias would have been greater had I excluded those cases This illustrates problem of censoring

G Lecture 126 Survival Analysis Terminology Survival probability is the probability of surviving to time t. It tends to decrease over time. Call this S.  Survival Function describes the survival probabilities over time  Survival probabilities accumulate failure events. Hazard (h) describes the time-specific probability of the event among persons who have survived to time t.  Hazard Function describes the hazard over time  For discrete time periods (such as weeks) it is a probability  For small time periods (continuous time) it is an instantaneous risk. Relation of S to h: S t+1 =S t *(1-h t )

G Lecture 127 Example of Survival and Hazard Functions from Singer & Willett In this example of smoking relapse following smoking cessation treatment, the relapse rate is highest at first and then there is a flattening of the curve. Note that it is monotonic (it never goes back up) The hazard function shows the pattern more dramatically.

G Lecture 128 Discrete Time Survival Analysis Singer and Willet (1991) describe a logistic regression approach to time-to-event data that takes censoring into account  Assumes that time reports can be discretized into categories that do not lose too much information  Person-times become units of analysis Including only persons who had not previously experienced target event  Uses standard logistic regression programs to model the hazard function  Fits baseline probability of event using discrete dummy code variables

G Lecture 129 Specific Steps  Create special version of person-time data set Unequal numbers of times per person. Last record for person is either »time when event occurred »time of last followup Dummy codes for time categories need to be created  Apply regular logistic regression software to model probability of an event Include time dummy codes to take into account the possibly nonlinear change in the baseline hazard Can include both time-invariant predictors and time- varying predictors of hazard

G Lecture 1210 Example of Data Setup DATA NEW; *This step reads data with days as lines; infile 'dayanx.txt'; input id day sex dianx; diaryday=day-1; If day < 32; TITLE1 'G Lecture 12 Analyses'; Data new2; *This step creates file with days as variables, persons as lines; array anx[31] a1-a31; array dday[31] d1-d31; do i=1 to 31 until (last.id); set new; by id; anx[i]=dianx; dday[i]=day; end; *Now we tag days beyond first incidence of high anxiety cl1-cl32 are binary indicators of history of anxiety; array clos[32] cl1-cl32 ; do i=1 to 31; clos[i]=0; end; do i=1 to 30; if (anx[i]=1) or (clos[i]=1) then clos[i+1]=1; end; run;

G Lecture 1211 Example, Continued data new3; *This step creates file with tagged days as lines; set new2; array anx[31] a1-a31; array dday[31] d1-d31; array clos[32] cl1-cl32; past=0; do i=1 to 31; day=dday[i]; dianx=anx[i]; past=clos[i]; output; end; drop i a1-a31 d1-d31 cl1-cl32; data new4; *This creates dummy codes for day and deletes days beyond incident days; set new3; diaryday=day-15; array dumday[31] dm1-dm31; do i=1 to 31; dumday[i]=0; end; dumday[day]=1; if past=0; drop past i; run;

G Lecture 1212 Output: Eight Time Periods PROC LOGIST ESTIMATES OF LOG ODDS Parameter DF Estimate S. E. Chi-Square Pr > ChiSq dm <.0001 dm <.0001 dm <.0001 dm <.0001 dm <.0001 dm <.0001 dm <.0001 dm <.0001 sex

G Lecture 1213 Creating Hazard Function First exponetiate the log odds to get odds  Odds of anxiety in females (reference group) Multiply odds for females by odds ratio estimate [exp(-.23)=.794] to get odds for men  The odds for men are about 80% of what they are for women Transform the odds (W) to probabilities (P)  P = W/(1+W)

G Lecture 1214 Converting Log Odds to Probabilities

G Lecture 1215 Proportional Hazards vs. Non Proportional Hazards If Gender only enters as a main effect in the logistic model, the odds for men and women are proportional Singer and Willett show how to check this assumption by constructing interaction terms In this example, there is no evidence for interaction When proportional hazard assumptions hold, one can also use “Cox Proportional Hazards” Model  Cox models handle continuous time data  Most common form of multivariate survival analysis  Example is included in Handout

G Lecture 1216 Special Issues and Considerations Survival analysis makes most sense with random samples of defined populations Followup time should start from some clear event to avoid selection effects To get stable estimates, collect data so that the target event occurs to more than half the sample Generally, survival analyses need larger samples