Download presentation
Presentation is loading. Please wait.
Published byJeffry Allen Modified over 8 years ago
1
02/20161 EPI 5344: Survival Analysis in Epidemiology Hazard March 8, 2016 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive Medicine, University of Ottawa
2
02/20162 Objectives Examine non-parametric methods of estimating hazard –Actuarial formulae –Nelson-Aalen method –Application of smoothing –Piecewise constant hazard models Review limitations of estimating hazard Applications of hazard estimates
3
02/20163 Hazard (1) h(t) –Instantaneous hazard –Rate of event occurring at time ‘t’, conditional having survived event-free until time ‘t’ H(t) –Cumulative hazard –‘Sum’ of all hazards from time ‘0’ to time ‘t’ –Area under the h(t) curve from ‘0’ to ‘t’
4
02/20164 Hazard (2) Simplest survival model assumes a constant hazard –Yields an exponential survival curve –Leads to basic epidemiology formulae for incidence, etc. Can extend it with the piecewise model –Fits a different constant hazard for given follow-up time intervals.
5
02/20165 Hazard estimation (1) If hazard is not constant, how does it vary over time?
6
02/20166 Hazard estimation (2) How can we estimate the hazard? –Parametric methods (not discussed today) –Non-parametric methods We can estimate: –h(t) –H(t) Preference is to estimate H(t) –Nelson-Aalen method is main approach.
7
02/20167 Hazard estimation (3) Direct hazard estimation has issues –h(t) shows much random variation –Unstable estimates due to small event numbers in time intervals Works ‘best’ with actuarial method since intervals are pre-defined –Interval length is generally the same for each interval (u i ).
8
Estimating h(t) 03/20168
9
02/20169 h(t) estimation (1) Let’s start by looking at direct estimation of h(t) Works from a piece-wise constant hazard model Start by dividing follow-up time into intervals –Actuarial has pre-defined intervals –KM uses time between events as intervals.
10
02/201610 h(t) direct estimation
11
02/201611 h(t) estimation (2) Actuarial method to estimate h(t) –Interval length is u i. Standard Incidence Density formula from Epi
12
02/201612 ABCDEFGHI Year# people still alive # lost# people dying in this year Effective # at risk Prob die in year Prob survive this year Cum. Prob of surviving to this year Cum. Prob of dying by this year 199010,0005,0001,5007,5000.20.8 0.2 1991 3,5001,750 5252,6250.20.80.640.36 1992 1,225 612 184 9190.20.80.510.49 1993 429 215 64 3220.20.80.410.59 1994 150 75 23 1130.20.80.330.67 Last week, we used this data to illustrate actuarial method Let’s use it to estimate h(t)
13
02/201613 NtNt WtWt ItIt h(t) Year# people still alive # lost# people dying in this year Effective # at risk Prob die in year 199010,0005,0001,5007,5000.2 1991 3,5001,750 5252,6250.2 1992 1,225 612 184 9190.2 1993 429 215 64 3220.2 1994 150 75 23 1130.2 1990: NtNt WtWt ItIt h(t) Year# people still alive # lost# people dying in this year Effective # at risk Prob die in year 199010,0005,0001,5007,5000.20.222 1991 3,5001,750 5252,6250.2 1992 1,225 612 184 9190.2 1993 429 215 64 3220.2 1994 150 75 23 1130.2 1991: NtNt WtWt ItIt h(t) Year# people still alive # lost# people dying in this year Effective # at risk Prob die in year 199010,0005,0001,5007,5000.20.222 1991 3,5001,750 5252,6250.20.222 1992 1,225 612 184 9190.2 1993 429 215 64 3220.2 1994 150 75 23 1130.2 NtNt WtWt ItIt h(t) Year# people still alive # lost# people dying in this year Effective # at risk Prob die in year 199010,0005,0001,5007,5000.20.222 1991 3,5001,750 5252,6250.20.222 1992 1,225 612 184 9190.20.222 1993 429 215 64 3220.20.222 1994 150 75 23 1130.20.222
14
02/201614 h(t) estimation (3) Look at denominator of this formula –First part is the ‘effective number of people under follow-up during interval’ –Second part is ‘# of years each person followed (on average) in the interval’ –Product is the ‘person-time of follow-up’ in the interval –An approximation since we don’t use data on when each person left follow-up
15
02/201615 h(t) estimation (4) Suppose we had exact time of follow-up for each subject? Person-time variant –Divide follow-up time into fixed intervals –Compute actual person-time in each interval (rather than using approximation). –Gives a slightly smoother curve
16
02/201616 h(t) estimation (5) Kaplan-Meier method to estimate h(t) –‘interval’ is time between death events Varies irregularly –Formula has same structure as person-time estimate given above: d i = # with event u i = t i – t i-1 n i = size of risk set at ‘t’
17
02/201617 h(t) estimation (6) Issues with using KM method to estimate h(t) –Normally, only have 1 or 2 in numerator –Makes estimates ‘unstable’ Liable to considerable random variation and noise –Do not usually estimate h(t) from KM methods Any estimation of h(t) can use a Kernel Smoothing approach to improve estimates
18
Smoothing & hazard (1) What is smoothing? Think of a moving average –Could present monthly rates –Instead, average the rates for 3 month groups Jan/Feb/March Feb/March/April etc. –Present the 3 month averages –Smooths out variation and reveals trends 02/201618
19
Smoothing & hazard (2) Many other methods exist Key idea is to define a ‘sliding window’ –Estimate data point in window –Slide to the next target region LOESS smoothing is very commonly used 02/201619
20
Estimating H(t) 03/201620
21
02/201621 Estimating Cumulative hazard: H(t) –Measures the area under the h(t) curve. Tends to be more stable since it is based on number of events from ‘0’ to ‘t’ rather than number in the last interval H(t) estimation (1)
22
02/201622 Four ways you can do this: Actuarial using ‘epi’ formula Actuarial using Person-time method Kaplan-Meier approach using Nelson-Aalen estimator Kaplan-Meier approach using –log(S(t)) Let’s talk methods 1 & 2 H(t) estimation (2)
23
02/201623 Simple approach –Estimate h(t) assuming a piece-wise constant model –H(t) is the sum of the pieces. –For each ‘piece’ before time ‘t’, compute: Product of the estimated ‘h i ’ for the interval multiplied by the length of the interval it is based on. H(t) estimation (2)
24
02/201624 Simple approach (cont) –Add these up across all ‘pieces’ before time ‘t’. Width of last ‘piece’ is up to ‘t’ only –Relates to the density method from epi H(t) estimation (3)
25
02/201625 H(t) estimation based on piecewise estimation of h(t)
26
H(t) estimation (4) The formula are as follows. The sum is done at the end of each interval. 02/201626
27
02/201627 Nelson-Aalen estimator for H(t) Apply above approach but define intervals by using the time points for events Most commonly used approach to estimate H(t) Related to Kaplan-Meier method Compute H(t) at each time when event happens: H(t) estimation (5) d i = # with event at ‘t i ’ n i = size of risk set at ‘t i ’
28
02/201628 Approach #4 to estimate H(t) Use -log(S(t)) From our basic formulae, we have: Estimate S(t) and convert using this formula H(t) estimation (6)
29
02/201629 For those who care, methods 3 and 4 are very similar From KM, the estimate of S(t) is: H(t) estimation (7)
30
02/201630 Hence, we have: But, for small values, we have: So, we get: H(t) estimation (8)
31
02/201631 Numerical example IDTime(mons)Censored 114XXXXX 222 329 437XXXXX 545XXXXX 646 761 876XXXXX 992XXXXX 10111XXXXX Very coarse: 10 events in 10 years
32
Year# people under follow- up # lost# people dying in this year h(t)H(t) 0-1100000 1-21011 2-3 3-4 4-5 5-6 6-7 7-8 8-9 02/201632 Actuarial Method for h(t) Year# people under follow- up # lost# people dying in this year h(t)H(t) 0-1100000 1-210110.111 2-3801 3-4 4-5 5-6 6-7 7-8 8-9 Year# people under follow- up # lost# people dying in this year h(t)H(t) 0-1100000 1-210110.111 2-38010.1330.244 3-4721 4-5 5-6 6-7 7-8 8-9 Year# people under follow- up # lost# people dying in this year h(t)H(t) 0-1100000 1-210110.111 2-38010.1330.244 3-47210.1820.426 4-540000.426 5-64010.2860.712 6-731000.712 7-821000.712 8-911000.712
33
Nelson-Aalen estimate of H(t) IntervalComputationH(t) from actuarial method 0-22H(t) = 00.111 22 + -29H(t) =0.111 29 + -46H(t) =0.426 46 + -51H(t) =0.712 51 + H(t) =0.712 02/201633
34
02/201634 A new example
35
02/201635 H(t) has many uses, largely based on: Applications of H(t) (1)
36
Applications of H(t) (2) Nelson-Aalen estimate of H(t) gives another way to estimate S(t). Uses formula: 02/201636
37
Applications of H(t) (3) 02/201637 IntervalH(t)S(t)Cum Incid(t) 0-2201.00.0 22 + -290.1110.8950.105 29 + -460.2360.7900.210 46 + -510.4360.6470.353 51 + 0.6860.5040.496
38
02/201638
39
Applications of H(t) (4) log(-log(S(t)) and Proportional Hazards Can plot this in SAS using the ‘p=ls’ option 02/201639
40
02/201640 Key for testing proportional hazards assumption (later) Applications of H(t) (5)
41
Applications of H(t) (6) 02/201641 Suppose the hazard is a constant (λ), then we have: Plot ‘ln(S(t))’ against ‘t’. A straight line indicates a constant hazard. Approach can be used to test other models (e.g. Weibull).
42
Example (from Allison) Recidivism data set –432 male inmates released from prison –Followed for 52 weeks –Dates of re-arrests were recorded –Study designed to examine the impact of a financial support programme on reducing re- arrest 02/201642
43
02/201643
44
02/201644 Simple hazard estimates using actuarial method Adjusted hazard estimates using actuarial method: last interval ends at 53 weeks, not 60 weeks
45
02/201645
46
02/201646
47
02/201647
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.