Main Points to be Covered
- Cumulative incidence using the life table method
- Difference between cumulative incidence based on proportion of persons at risk and incidence rate based on person-time
- Calculating person-time incidence rates
- Uses of person-time incidence rates
- Relation of prevalence to incidence
- Odds versus probability

Two assumptions in survival analysis
- Censoring is unrelated to survival (unrelated to the probability of experiencing the outcome)
- There are no temporal trends in the probability of the outcome

Long-Term Survival Data May Be Invalid Due to Temporal Trends
- Paper in current Lancet analyzes data from the National Cancer Institute's follow-up of diagnoses 1978-1998:
  - Overall survival, cohort method = 40%
  - Overall survival with period analysis, allowing for changes in survival over time = 51%
- Brenner, The Lancet, Oct 12, 2002

Cumulative incidence: Life table
- No exact times of events or censoring needed
- Assume events and censoring occurred uniformly during the fixed time intervals (uniformity assumption)
- Therefore assume that, on average, each censored person was at risk for half of the time period
- Subtract one-half of the subjects lost during the interval from the denominator at the beginning of the interval
- Calculations otherwise just like Kaplan-Meier

Life Table vs. Kaplan-Meier
Life Table:
- Time intervals of fixed length
- Time intervals usually the same (not required)
- Assume uniform timing of censoring
- Calculate probability of surviving the interval
- Cumulative incidence = (1 - product of interval survival probabilities)
Kaplan-Meier:
- Time intervals based on times of events
- Time intervals vary (not required)
- No assumption required about timing of censoring
- Calculate probability of surviving the interval
- Cumulative incidence = (1 - product of interval survival probabilities)

Life Table: Example in Text
- Szklo and Nieto use the same example of 10 observations to illustrate life table and KM
- The life table uniformity assumption is not valid in that small example
- Life table is more commonly used on large secondary data sets where exact failure times are not known
- With very large numbers the uniformity assumption is more likely to be valid

Life Table: Primary Biliary Cirrhosis Survival Data
. ltable time d
Interval   Total   Deaths   Lost   Survival   SE   [95% Conf. Int.]

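Calculating a Life Table
Columns: Int., Total, D, Lost, N At-risk, P Event, Survival (cumulative)
- Subtract ½ of those lost during the interval from the denominator: N at risk = Total - 7/2 = 180.5; P event = D / 180.5
- Repeat for the next interval: N at risk = Total - 9/2 = 154.5; P event = D / 154.5
- And so forth: N at risk = Total - 9/2 = 126.5; P event = D / 126.5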
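The life table steps can be sketched in a few lines of Python. This is only an illustration: the interval counts below are hypothetical (they are not the cirrhosis data, whose values are not shown here), and the function name `life_table` is an assumption of this sketch.

```python
def life_table(intervals):
    """intervals: list of (total_at_start, events, lost) for each fixed interval."""
    survival = 1.0
    rows = []
    for total, events, lost in intervals:
        # Uniformity assumption: those lost were at risk for half the interval on average.
        at_risk = total - lost / 2
        p_event = events / at_risk
        # Cumulative survival is the running product of interval survival probabilities.
        survival *= 1 - p_event
        rows.append((at_risk, p_event, survival))
    return rows

# Hypothetical counts for illustration only.
example = [(100, 10, 4), (86, 8, 6), (72, 6, 2)]
rows = life_table(example)
cumulative_incidence = 1 - rows[-1][2]

for at_risk, p_event, survival in rows:
    print(f"N at risk {at_risk:6.1f}  P(event) {p_event:.4f}  survival {survival:.4f}")
print(f"cumulative incidence = {cumulative_incidence:.4f}")
```

Each interval starts with the previous total minus that interval's events and losses, which is why the hypothetical totals run 100, 86, 72.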

Interval   Total   Deaths   Lost   Survival   SE   [95% Conf. Int.]
Time   Total   Fail   Lost   Survival Function   SE   [95% Conf. Int.]

Survival Curve from Life Table: Cirrhosis Data

The Three Elements in Measures of Disease Incidence
- E = an event = a disease diagnosis or death
- N = number of persons in the population in which the events are observed
- T = time period during which the events are observed

E    E/T    E/NT    E/N

Two Measures Described as Incidence in the Text
- The proportion of individuals who experience the event in a defined time period (E/N during some time T) = cumulative incidence
- The number of events divided by the amount of person-time observed (E/NT) = incidence rate or density (not a proportion)

Person-Time Incidence Rates
- The numerator is the same as for incidence based on proportion of persons = events (E)
- The denominator is the sum of the follow-up times for each individual
- The resulting ratio E/NT is not a proportion; it may be greater than 1, and its value depends on the unit of time used


Rates:
- year 1 = 3/7.083 = 42.4/100 person-years
- year 2 = 3/2.50 = 120/100 person-years
- both years = 6/9.583 = 62.6/100 person-years
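The three rates above are simply events divided by person-years, rescaled to 100 person-years. A minimal Python sketch reproduces the arithmetic; the function name `rate_per_100` is an assumption of this sketch, and the event counts and person-years are the ones quoted on the slide.

```python
def rate_per_100(events, person_years):
    """Person-time incidence rate expressed per 100 person-years."""
    return 100 * events / person_years

year1 = rate_per_100(3, 7.083)
year2 = rate_per_100(3, 2.50)
both_years = rate_per_100(6, 9.583)

print(f"year 1: {year1:.1f} per 100 person-years")
print(f"year 2: {year2:.1f} per 100 person-years")
print(f"both years: {both_years:.1f} per 100 person-years")
```

Note that the two-year rate is not the average of the yearly rates: it weights each year by the person-time actually observed in it.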

We have been calculating average rates; a rate is often defined as the instantaneous change in one measure with respect to a second measure, as the interval approaches 0
[Figure: population size plotted against time, illustrating the death rate]
In disease, the occurrence rate is often called a hazard or the force of morbidity (mortality)

Rates
- We are used to rates being change in a measure with respect to time, but time does not have to be involved
- Accidents per passenger-mile, for example, is often used in transportation
- Economics often uses rates in which time is not an element (eg, energy use per unit of gross national product)

Comparison of cumulative incidence and incidence rate (density)
- Kaplan-Meier cumulative incidence estimate for these data was 0.82 (ie, 82% of persons will experience the event in a two-year period)
- Two-year incidence density is 62.6 / 100 person-years, or 0.626 per person-year
- Not a proportion: if calculated per person-day, the rate would be 0.17 / 100 person-days

Incidence rate (density) value depends on the time units used
An incidence rate of 100 cases per 1 person-yr:
- 100 cases/person-year
- 10,000 cases/person-century
- 8.33 cases/person-month
- 1.92 cases/person-week
- 0.27 cases/person-day
Note: the time period during which the rate is measured can differ from the units used
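The conversions in this list follow from dividing the yearly rate by the number of smaller units in a year. A short Python sketch, assuming 12 months, 52 weeks and 365 days per year (the values that reproduce the slide's figures); the names `UNITS_PER_YEAR` and `convert_rate` are assumptions of this sketch.

```python
# Number of each time unit in one year (assumed: 12 months, 52 weeks, 365 days).
UNITS_PER_YEAR = {
    "person-century": 0.01,
    "person-year": 1,
    "person-month": 12,
    "person-week": 52,
    "person-day": 365,
}

def convert_rate(rate_per_person_year, unit):
    """Re-express a rate given per person-year in another person-time unit."""
    return rate_per_person_year / UNITS_PER_YEAR[unit]

converted = {unit: convert_rate(100, unit) for unit in UNITS_PER_YEAR}
for unit, value in converted.items():
    print(f"{value:,.2f} cases/{unit}")
```

The same underlying rate yields very different numbers, which is why a rate should always be reported with its person-time unit.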

Person-time incidence based on grouped vs. individual data
- Szklo and Nieto use "rate" when based on group data and "density" when based on individual data (a distinction not followed by most)
- Total person-time for grouped data is the time interval x the average population at risk during the interval
- Assumes uniform occurrence of events and of censoring during the interval (like the life table)

Calculating person-time incidence using grouped data
- Use the average number of persons at risk
- In the text example, start with 10 persons; 6 die and 3 are lost to follow-up
- Subtract 0.5 x (6 + 3) from 10 = 5.5 (uniformity assumption as in life tables)
- Total person-time is 5.5 x 2 years = 11 person-years
- 6 events, so rate = 6/11 = 0.545 = 54.5 per 100 person-years (compare to 62.6 when calculated using individual data)
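The grouped-data calculation can be written out as a small Python function. This is a sketch using the numbers from the text example (10 persons, 6 deaths, 3 lost, 2 years); the function name `grouped_rate` is an assumption.

```python
def grouped_rate(start_n, events, lost, interval_years):
    """Person-time rate from grouped data under the uniformity assumption."""
    # Events and losses are assumed to occur, on average, at mid-interval,
    # so each contributes half of the interval to the population at risk.
    avg_at_risk = start_n - 0.5 * (events + lost)
    person_years = avg_at_risk * interval_years
    return avg_at_risk, person_years, events / person_years

avg_at_risk, person_years, rate = grouped_rate(10, 6, 3, 2)
print(f"average at risk = {avg_at_risk}, person-years = {person_years}")
print(f"rate = {100 * rate:.1f} per 100 person-years")
```

The gap between this figure and the 62.6 per 100 person-years from individual data reflects how far the 10 observations depart from uniform timing of events and losses.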

Incidence from grouped data
- Most commonly used for large secondary data sets where precise information on occurrence of events and on persons leaving and entering the population is not available
  - eg, annual cancer mortality rates per 100,000 population (= per 100,000 person-years)
- If times of events and of censoring are available, would normally use individual-level data

Group data rates versus individual data rates
- Differ depending on how close events and losses are to occurring uniformly
  - If losses are perfectly uniform, they are the same
- Analogous to the life table assumption of uniform timing of losses versus the Kaplan-Meier use of individual data

Individual calculation: 2 deaths / 5 pers-yrs = 0.4 per pers-yr
Group data: average population = (4 + 1) / 2 = 2.5; rate = 2 / (2.5 x 2) = 0.4 per pers-yr

Rates based on group data
- Uniformity of events and losses is likely to be approximately true for large secondary data sets
- Rates using secondary data sets on free-living populations assume new members and losses balance out (= approx. stable)
- Important for the use of population reference rates (eg, expected mortality in U.S. population)

Calculating Rates in STATA
A few STATA survival analysis and rate commands:
- Declare data set survival data: . stset timevar, fail(failvar)
- . ltable timevar, graph    gives life table analysis & graph
- . strate    gives person-years rate
- . strate groupvar    gives rates within groups

Immediate Commands in STATA
- STATA has an option to use it like a calculator for various computations without using a data set. These are called immediate commands.
- Example, to calculate the confidence interval around a person-time rate: . cii #person-time units #events, poisson
- Eg, 6 events occur in 10 person-years of follow-up: . cii 10 6, poisson
- 95% CI = 0.220 to 1.306
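For readers without STATA, the same kind of interval can be approximated in plain Python. The sketch below assumes the interval is the exact Poisson confidence interval (limits found by solving the Poisson tail probabilities for alpha/2 on each side); the function names and the bisection approach are assumptions of this sketch, not a description of how STATA computes it internally.

```python
import math

def poisson_cdf(k, mu):
    """P(X <= k) for X ~ Poisson(mu)."""
    term = math.exp(-mu)
    total = term
    for i in range(1, k + 1):
        term *= mu / i
        total += term
    return total

def bisect(f, lo, hi, tol=1e-10):
    """Root of f on [lo, hi] by bisection; f(lo) and f(hi) must differ in sign."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if (f(mid) > 0) == (f_lo > 0):
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def exact_poisson_rate_ci(events, person_time, level=0.95):
    """Exact Poisson confidence limits for a rate of events per unit person-time."""
    alpha = 1 - level
    bracket = 10 * events + 50  # generous upper search bound for the mean count
    if events == 0:
        lower = 0.0
    else:
        # Lower limit: mean count at which P(X >= events) = alpha / 2.
        lower = bisect(lambda mu: 1 - poisson_cdf(events - 1, mu) - alpha / 2, 0.0, bracket)
    # Upper limit: mean count at which P(X <= events) = alpha / 2.
    upper = bisect(lambda mu: poisson_cdf(events, mu) - alpha / 2, 0.0, bracket)
    return lower / person_time, upper / person_time

ci_low, ci_high = exact_poisson_rate_ci(6, 10)
print(f"rate = {6 / 10:.3f}, 95% CI = {ci_low:.3f} to {ci_high:.3f}")
```

The limits are computed for the expected number of events and then divided by the person-time, so the interval is in the same units as the rate (here, per person-year).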

Assumption of Person-Time Incidence Estimation
- T time units of follow-up on N persons is the same as N time units on T persons
- Observing 2 deaths in 2 persons followed for 50 years gives the same incidence rate as 2 deaths in 100 persons followed for 1 year
- Assumption is not reasonable if sample sizes and follow-up times differ greatly

Assumption of Person-Time Incidence Estimation
- If looking at the relationship between exposure and outcome rate, one rate for a follow-up period implies that exposure does not have a cumulative effect on the probability of the event over time
- Clearly false for exposures with cumulative effects, like length of time smoking

Why use person-time rather than cumulative incidence?
- Rates using group data can be calculated in open populations from a variety of data sources where population sizes are estimated
- Incidence rates from a cohort study can be compared to standardized rates from the general population to obtain ratio measures called the standardized mortality ratio (SMR) or standardized incidence ratio (SIR)

Why use person-time rather than cumulative incidence?
- If E is a recurrent event, a rate may seem more natural
- For example, cumulative incidence of episodes of the common cold would have to be calculated separately for each episode (ie, proportion with 1st cold, proportion with 2nd cold given that you have had 1, etc.)

Calculating stratified person-time incidence rates in cohorts
- For persons followed in a cohort, some potential risk factors may be fixed but some may be variable
  - eg, ethnicity is fixed; occupational exposure to asbestos can change over time with the job
- Total person-time in an exposure category is one way to deal with risk factors that change over time
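One way to picture the accumulation of person-time by exposure category is to split each person's follow-up into segments and add each segment to the stratum that applied at the time. The Python sketch below uses entirely hypothetical follow-up segments and an assumed function name `stratified_rates`; it illustrates only the bookkeeping.

```python
from collections import defaultdict

def stratified_rates(segments):
    """segments: (exposure_category, years_in_category, event_at_end_of_segment)."""
    person_years = defaultdict(float)
    events = defaultdict(int)
    for category, years, event in segments:
        # Time and events are credited to the category in force during the segment,
        # so one person can contribute person-time to more than one stratum.
        person_years[category] += years
        events[category] += int(event)
    return {c: (events[c], person_years[c], events[c] / person_years[c])
            for c in person_years}

# Hypothetical segments for illustration only.
segments = [
    ("unexposed", 3.0, False), ("exposed", 2.0, True),    # person A changes job
    ("unexposed", 5.0, False),                            # person B
    ("exposed", 4.0, False), ("exposed", 1.5, True),      # persons C and D
    ("unexposed", 2.0, True),                             # person E
]
rates = stratified_rates(segments)
for category, (n_events, pt, rate) in rates.items():
    print(f"{category}: {n_events} events / {pt} person-years = {rate:.3f} per person-year")
```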

Relation of Prevalence and Incidence
- Prevalence is a function of incidence and duration of disease by the equation: point prevalence = incidence x duration x (1 - point prevalence) [P = I x D x (1 - P)]
- For many typically low-prevalence diseases, prevalence becomes approximately I x D, since (1 - P) is close to 1 if P is very low
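Rearranging P = I x D x (1 - P) gives P = (I x D) / (1 + I x D), which makes it easy to see how good the I x D approximation is. The Python sketch below uses hypothetical incidence and duration values; the function name `point_prevalence` is an assumption.

```python
def point_prevalence(incidence_rate, mean_duration):
    """Solve P = I * D * (1 - P) for P, giving P = I*D / (1 + I*D)."""
    i_times_d = incidence_rate * mean_duration
    return i_times_d / (1 + i_times_d)

# Hypothetical values: incidence per person-year, mean duration in years.
rare = point_prevalence(0.01, 2)      # I x D = 0.02
common = point_prevalence(0.1, 10)    # I x D = 1.0

print(f"rare disease:   exact P = {rare:.4f}, approximation I x D = {0.01 * 2:.4f}")
print(f"common disease: exact P = {common:.4f}, approximation I x D = {0.1 * 10:.4f}")
```

For the rare disease the two values nearly coincide; for the common one the approximation is badly off, because (1 - P) is no longer close to 1.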

Prevalence and Etiology
- Because prevalence depends on both incidence and duration of disease, it is not a good measure for etiological studies
- Cannot examine the determinants of occurrence alone when you have to account for determinants of duration (Rx, etc.)
- Etiologic study designs should avoid sampling prevalent cases of disease or prevalent controls

Summary Points
- Person-time incidence rate or density is not equivalent to cumulative incidence and is not a proportion
- Person-time incidence rate can be calculated with group or individual data
- Allows comparison with population reference rates from other data sources
- Allows accumulation of time at risk for different strata

Odds versus Probability
- Odds are based on probability; they express probability (p) as a ratio: odds = p / (1 - p)
  - odds is always > p (for 0 < p < 1) because p is divided by a number < 1
- For example, if probability of dying = 1/5, then odds of dying = (1/5) / (4/5) = 1/4
- Thinking of odds as 2 outcomes, the numerator is the # of times of one outcome and the denominator the # of times of the other
- P = odds / (1 + odds), so (1/4) / (1 + 1/4) = 1/5
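The two conversions can be checked with exact fractions in Python. This is a small sketch; the function names `odds_from_probability` and `probability_from_odds` are assumptions.

```python
from fractions import Fraction

def odds_from_probability(p):
    """odds = p / (1 - p)"""
    return p / (1 - p)

def probability_from_odds(odds):
    """p = odds / (1 + odds)"""
    return odds / (1 + odds)

p_dying = Fraction(1, 5)
odds_dying = odds_from_probability(p_dying)
back_to_p = probability_from_odds(odds_dying)

print(f"probability {p_dying} -> odds {odds_dying} -> probability {back_to_p}")
```

Using `Fraction` keeps the arithmetic exact, so the round trip returns the original probability with no rounding.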

Odds versus Probability
- Less intuitive than probability (probably wouldn’t say “my odds of dying are 1/4”)
- No less legitimate mathematically, just not so easily understood
- Used in epidemiology primarily because the log of the ratio of two odds is given by the coefficients in logistic regression equations