Presentation is loading. Please wait.

Presentation is loading. Please wait.

Introduction to Survival Analysis Seminar in Statistics 1 Presented by: Stefan Bauer, Stephan Hemri 28.02.2011.

Similar presentations


Presentation on theme: "Introduction to Survival Analysis Seminar in Statistics 1 Presented by: Stefan Bauer, Stephan Hemri 28.02.2011."— Presentation transcript:

1 Introduction to Survival Analysis Seminar in Statistics 1 Presented by: Stefan Bauer, Stephan Hemri 28.02.2011

2 Definition Survival analysis:  method for analysing timing of events;  data analytic approach to estimate the time until an event occurs. Historically survival time refers to the time that an individual „survives“ over some period until the event of death occurs. Event is also named failure. 2

3 Areas of application Survival analysis is used as a tool in many different settings: proving or disproving the value of medical treatments for diseases; evaluating reliability of technical equipment; monitoring social phenomena like divorce and unemployment. 3

4 Examples Time from... marriage to divorce; birth to cancer diagnosis; entry to a study to relapse. 4

5 Censoring The survival time is not known exactly! This may occur due to the following reasons: a person does not experience the event before the study ends; a person is lost to follow-up during the study period; a person withdraws from the study because of some other reason. 5

6 Right censored 6

7 Left censored 7

8 Outcome variable 8

9 Survival time ≠ calendar time (e.g. follow- up starts for each individual on the day of surgery) Correct starting and ending times may be unknown due to censoring 9

10 Survivor function 10 tS(t) 1S(1) = P(T > 1) 2S(2) = P(T > 2) 3S(3) = P(T > 3)......

11 Survivor function 11

12 Survivor function 12

13 Hazard function 13

14 Example: Hazard function Assume having a huge follow-up study on heart attacks: 600 heart attacks (events) per year; 50 events per month; 11.5 events per week; 0.0011 events per minute. h(t) = rate of events occurring per time unit 14

15 Relation between S(t) and h(t) 15

16 Sketch of Proof i)Find relationship between density f(t) and S(t) ii)Express relationship between h(t) and S(t) as a function of density f(t) i in ii → h(t) as a function of S(t) and vice versa 16

17 Example: Relationship 17

18 Types of hazard functions 18

19 Hazard ratio 19

20 Example: Hazard ratio 20

21 Basic descriptive measures 21

22 Goals (of survival analysis) to estimate and interpret survivor and or hazard function; to compare survivor and or hazard function; to assess the relationship of explanatory variables to survival times -> we need mathematical modelling (Cox model). 22

23 Computer layout 23 individualt (in weeks) δ (failed or censored) 151 2120 33,50 480 560 6 1

24 Computer layout Layout for multivariate data with p explanatory variables: 24 individualt (in weeks) δ (failed or censored) X1X1 X2X2....XpXp 1t1t1 1 X 11 X 12...X 1p 2t2t2 2 X 21 X 22...X 2p... … ntntn n X n1 X n2....X np

25 25 Notation & terminology Ordered failures: Frequency counts: m i = # individuals who failed at t (i) q i = # ind. censored in [t (i),t (i+1) ) Risk set R(t (i) ): Collection of individuals who have survived at least until time t (i) unordered censored t’s failed t’s ordered (t (i) )

26 Manual analysis layout 26 Ordered failure times # of failures m i # censored in [t (i),t (i+1) ) Risk set R(t (i) ) t (0) =0mimi q0q0 R(t (0) ) t (1) m1m1 q1q1 R(t (1) )....... t (n) mnmn qnqn R(t (n) )

27 Manual analysis layout 27 Ordered failure times # of failures m i # censored in [t (i),t (i+1) ) Risk set R(t (i) ) t (0) = 000 6 persons survive ≥ 0 weeks t (1) = 3.5116 persons survive ≥ 3,5 weeks t (2) = 5134 persons survive ≥ 5 weeks

28 Example: Leukaemia remission Extended Remission Data containing: two groups of leukaemia patients: treatment & placebo; log WBC values of each individual; (WBC: white blood cell count) Expected behaviour: The higher the WBC value is the lower the expected survival time. 28

29 Example: Analysis layout 29 Analysis layout for treatment group: t (j) mimi qiqi R(t (j) ) t (0) = 00021 persons t (1) = 63121 persons t (2) = 71117 persons t (3) = 101215 persons t (4) = 131012 persons t (5) = 161311 persons t (6) = 22107 persons t (7) = 23156 persons

30 Example: Confounding 30

31 Example: Confounding 31

32 Example: Interaction 32

33 Example: Conclusion Need to consider confounding and interaction; basic problem: comparing survival of the two groups after adjusting for confounding and interaction; problem can be extended to the multivariate case by adding additional explanatory variables. 33

34 Summary Survival analysis encompasses a variety of methods for analyzing the timing of events; problem of censoring: exact survival time unknown;  mixture of complete and incomplete observations  difference to other statistical data 34

35 Summary 35

36 References A Conceptual Approach to Survival Analysis, Johnson, L.L., 2005. Downloaded from www.nihtraining.com on 19.02.2011. Applied Survival Analysis: Regression Modelling of Time to event Data, Hosmer, D.W., Lemeshow, S., Wiley Series in Probability and Statistics 1999. Lesson 6: Sample Size and Power - Part A, The Pennsylvania State University, 2007. Downloaded from http://www.stat.psu.edu/online/ courses/stat509/06_sample/09_sample_hazard.htm on 24.02.2011 Survival analysis: A self-learning text, Kleinbaum, D.G. & Klein M., Springer 2005. Survival and Event History Analysis: A Process Point of View (Statistics for Biology and Health), Aalen, O., Borgan, O. & Gjessing H., Springer 2010. 36


Download ppt "Introduction to Survival Analysis Seminar in Statistics 1 Presented by: Stefan Bauer, Stephan Hemri 28.02.2011."

Similar presentations


Ads by Google