Cohort and longitudinal studies: statistics

Slides:



Advertisements
Similar presentations
Qualitative predictor variables
Advertisements

KRUSKAL-WALIS ANOVA BY RANK (Nonparametric test)
Measures of Impact Dublin June Measures of Impact You want to reduce deaths from road traffic accidents Most impact for least cost Cohort study.
Unit 14: Measures of Public Health Impact.
Understanding real research 3. Assessment of risk.
Epidemiology Practice. Epidemiology is the study of diseases in populations of humans or other animals, specifically how, when and where they occur, who.
Measure of disease frequency
Main Points to be Covered Cumulative incidence using life table method Difference between cumulative incidence based on proportion of persons at risk and.
BIOSTATISTICS 5.5 MEASURES OF FREQUENCY BIOSTATISTICS TERMINAL OBJECTIVE: 5.5 Prepare a Food Specific Attack Rate Table IAW PEF 5.5.
HaDPop Measuring Disease and Exposure in Populations (MD) &
COHORT STUDY DR. A.A.TRIVEDI (M.D., D.I.H.) ASSISTANT PROFESSOR
Understanding study designs through examples Manish Chaudhary MPH (BPKIHS)
Chapter 6: Incidence & Prevalence Chapter 3 Epidemiologic Measures
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Incidence and Prevalence
Cohort Study.
INTRODUCTION TO EPIDEMIOLO FOR POME 105. Lesson 3: R H THEKISO:SENIOR PAT TIME LECTURER INE OF PRESENTATION 1.Epidemiologic measures of association 2.Study.
Lecture 3: Measuring the Occurrence of Disease
Measuring disease and death frequency
Hypothesis Testing Field Epidemiology. Hypothesis Hypothesis testing is conducted in etiologic study designs such as the case-control or cohort as well.
Measures of Association
Assessing Survival: Cox Proportional Hazards Model
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Approaches to the measurement of excess risk 1. Ratio of RISKS 2. Difference in RISKS: –(risk in Exposed)-(risk in Non-Exposed) Risk in Exposed Risk in.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
S. Mazloomzadeh MD, PhD COHORT STUDIES Learning Objectives To develop an understanding of: - What is a cohort study? - What types of cohort studies are.
RATES AND RISK Daniel E. Ford, MD, MPH Johns Hopkins School of Medicine Introduction to Clinical Research July 12, 2010.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Epidemiology: Basic concepts and principles ENV
Measures of Disease Frequency, Effect and Impact Lecture by: Dr Amna Rehana Siddiqui Associate Professor Department of Family & Community Medicine September.
Standardization of Rates. Rates of Disease Are the basic measure of disease occurrence because they most clearly express probability or risk of disease.
Measures of Disease Frequency
Probability and odds Suppose we a frequency distribution for the variable “TB status” The probability of an individual having TB is frequencyRelative.
Basic Analytical Techniques Used in epidemiology and public health intelligence Justine Fitzpatrick.
Methods of quantifying disease Stuart Harris Public Health Intelligence Analyst Course – Day 3.
Chapter 2. **The frequency distribution is a table which displays how many people fall into each category of a variable such as age, income level, or.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Measures of Association (2) Attributable Risks and fractions October Epidemiology 511 W. A. Kukull.
Health Indicators.
Descriptive statistics (2)
Instructional Objectives:
Sample size calculation
Lecture notes on epidemiological studies for undergraduates
Epidemiologic Measures of Association
بسم الله الرحمن الرحيم COHORT STUDIES.
Showing Cause, Introduction to Study Design
Epidemiologic Measures Of Association
College of Applied Medical Sciences
Multiple logistic regression
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Prospective Cohort Study (Click or press Enter to follow animation)
INDIRECT STANDARDIZATION BY MBBSPPT.COM
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Measures of Disease Occurrence
Epidemiology MPH 531 Analytic Epidemiology Cohort Studies
Measurements of Risk & Association …
Evaluating Effect Measure Modification
Measures of risk and association
Epidemiological Measurements of health
Measures of Mortality Part 2
Mortality rate = No. of deaths * K
Presentation and interpretation of epidemiological data: objectives Raj Bhopal, Bruce and John Usher Professor of Public Health, Public Health Sciences.
RISK ASSESSMENT, Association and causation
Case-control studies: statistics
Risk Ratio A risk ratio, or relative risk, compares the risk of some health-related event such as disease or death in two groups. The two groups are typically.
Measures of Disease Frequency, Effect and Impact
Effect Modifiers.
Summary of Measures and Design
BMTRY 736 Foundation of Epidemiology Fall 2019
Presentation transcript:

Cohort and longitudinal studies: statistics Kath Bennett 25th June 2007

Outline Statistical methods Prevalence, Incidence Rates, standardised rates, case fatality rate Relative rates/risks Attributable risk and Attributable risk fraction Regression and beta coefficients

Measuring disease/drug frequency Prevalence is the proportion of a population that are cases at one time = no of existing cases with disease/on drug Total population in same time period e.g birth defect prevalence rate = no of births with given abnormality Total number of live births

Measuring disease/drug frequency Incidence is the rate at which new cases occur in a population during a specified period =Number of new cases/starting on therapy in period Population at risk in given period e.g. Bacteremia among OC users in study of 482 using OC, 27 developed bacteremia Incidence= (27/482)*100=5.6%

Inter-relation between incidence and prevalence Prevalence = incidence* average duration

Rates commonly used in epidemiology Crude rates For entire population eg. Total deaths/population Category specific (e.g. age-specific, gender-specific) Eg. Cancer rates by age-category, 35-44, 45-54 Age-adjusted or standardised rates Allows for appropriate comparison when differing populations being studies. Reduces distortions in comparison due to age differences in populations.

Rates Used to quantify the risk in a population. Numerator is number of events Denominator is Population at risk Person-years at risk or exposure Usually presented as per 1000 persons or 1000 person-years at risk.

Rates Provided the disease/event occurs randomly and independently, reasonable to assume follow Poisson distribution. Estimated standard error of rate SE (rate)=number of events / pop at risk Approx 95% CI for number events (E) EL=(1.96/2 - E)2 EU=(1.96/2 + (E+1))2 Approx 95% CI for rate Lower CI= EL/Pop, Upper CI= EU/Pop

Rates The number of deaths from CHD in Scottish men in 1995 was 1080, total population of men 612955. Rate of CHD deaths per 1000 (1080/612955)*1000 = 1.76 95% CI for number of deaths is EL=(1.96/2 - 1080)2 = 1016.55 EU=(1.96/2 + 1081)2 = 1146.4 95% CI for rates is 1016.55/612955 , 1146.4/612955 (1.66, 1.87)

Case fatality rate Usually expressed as the percentage of persons diagnosed as having a specified disease who die as a result of that illness within a given period. Sometimes 30-day or 1 year case-fatality rates quoted. The case-fatality rate is not the same as mortality rate. Is also referred to as fatality rate or case-fatality ratio.

Relative rate Relative rate is ratio of 2 rates i.e. (rate group 1)/(rate group 2) Also known as rate ratio Example, rate for CHD in men calculated as 1.76 per 1000 in 1995, and 0.483 per 1000 in women. Relative rate =1.76/0.483 =3.65

Mortality from lung cancer by 5-year age-bands

Standardised rates Direct method Requires a standard population to which age-specific rates are applied Multiply age-specific rates in pops A &B by standard population then compare Indirect method If age-specific rates for populations not possible Based on applying age-specific rates for standard population to population of interest to determine ‘expected’ Standardised Mortality Ratio (SMR) = Total Observed deaths / Total Expected

Direct standardisation – raw data

Direct standardisation

Indirect method

Indirect method Observed 1781 deaths SMR = 1781/2032.5 = 0.876 SMR>1 more deaths than expected SMR<1 less deaths than expected Sometimes multiplied by 100.

Examples of cohort studies Framingham study. Cohort of 5,209 men and women in 1948 between the ages of 30 and 62 from the town of Framingham, Massachusetts, to examine CVD development, returning every two years. British doctors study. Cohort of male doctors started in 1951 to examine smoking and related mortality (Doll and Hill). Million Women Study. Cohort of 1 million women asked about HRT use and followed up for cancer incidence.

Data from a cohort study are expressed as: Follow-up to see whether: All persons free of disease Persons who develop disease over time Persons who do not develop disease over time TOTAL Exposed to risk factor a b a+b Not exposed to risk factor c d c+d a+c b+d a+b+c+d Incidence in exposed (IE)=a/a+b Incidence in unexposed (IO)=c/c+d

Data from cohort studies are analyzed in terms of: 1. Relative risk (RR)= Incidence rate in the exposed group (IE)    Incidence rate in the non-exposed group (IO) Relative risks significantly higher than 1 imply that the factor under study is associated with an increased risk of disease Relative risks significantly lower than 1 imply that the factor is  associated with a decreased risk of disease. The magnitude of the relative risk indicates the strength of the association.

Attributable Risk Attributable risk implies that not all of the disease incidence is due to the exposure, as some nonexposed individuals may develop the disease. IO (Incidence in nonexposed group) =  "background incidence" IE (Incidence in exposed group) = ”background incidence”+ Incidence due to the exposure  Therefore, the incidence in the exposed group which is attributable to the exposure can be calculated by subtracting: Attributable risk (AR): AR = IE – IO

Attributable risk percent: The percentage of the total incidence in the exposed group which is attributable to the exposure can be calculated by: AR% = IE - IO x 100 IE Short cut also = (RR-1)/RR x 100

The Population is not all exposed How many cases per 1000 population are attributable to the exposure? Do we know the Incidence in the total population (It)? It =Ie x Pe+Io x (1-Pe) Pe is the Prevalence of the exposure Population Attributable Risk PAR=It-Io Short cut = AR X Pe

The Population What proportion of the risk in the total population is attributable? Population Attributable Risk %,PAR%, PAF The Denominator is the risk in the total population (It). The numerator is the “extra” risk (PAR) which is It -Io PAR% =(It -Io)/ It PAR% = Pe (RR-1)/[Pe (RR-1)+1]

Example: Cohort study of smoking and coronary heart disease Cohort of initially healthy people Develop CHD Don't Develop CHD Total Smokers 84 2916 3000 Non-smokers 87 4913 5000 Incidence in Smokers = 84/3000 = 28.0 per 1,000/year Incidence in Non-Smokers = 87/5000 = 17.4 per 1,000/year RR = Incidence in Smokers = 28.0/1000/yr = 1.6 Incidence in Non-Smokers 17.4/1000/yr

Example: Attributable risk AR (incidence in exposed group attributable to the exposure) = Incidence in Smokers - Incidence in Non-Smokers = 28.0 - 17.4 = 10.6/1000/year  AR% (% total incidence in exposed grp attributable to exposure) =Incidence in Smokers - Incidence in Non-Smokers x 100 Incidence in Smokers =(28.0 - 17.4) x 100 = 10.6 x 100  = 37.9% 28.0 28.0 37.9% of the morbidity from CHD among smokers may be attributable to smoking.

Example: Population AR (PAR) Population Attributable Risk PAR= It –IO = AR X Pe Suppose prevalence of smoking in the population is 40% We can calculate It =Ie*Pe+Io(1-Pe)= 21.6/1000 PAR = 21.6/1000 – 17.4/1000 =4.2/1000 Also = AR X Pe =(10.6/1000)X 0.4 =4.2/1000 Population Attributable Risk % PAR% =[(It-Io)/It] = (21.6-17.4)/21.6 = 19.4% Or PAR%=( Pe(RR-1) / [Pe(RR-1)+1] =0.4*(1.6-1)/[0.4*(1.6-1)+1]=19.4%

Methods to account for varying lengths of follow-up Since participants may enter or leave the study at various times due to death, emmigration or loss to follow-up, the time of observation is usually not uniform. This is accounted for by main methods: 1. Person - years of observation 2. Life - table method 3. Survival analysis, Cox proportional hazards

Regression

Is the linear relationship reasonable?

Regression In broad terms, regression can be thought of as a statistical model which is used to help us get the ‘best guess’. Formally, we assume that there is an underlying linear relationship between the variables, and our observations lie scattered about that line. The actual value of ‘Y’ is ‘scattered’ about the expected or predicted value of Y.

Linear Model Illustrated Line is ‘best’ fit in statistical sense. Scatter is called residual variability.

Regression equations The equation of a line can be written as E[Y] = a + bX. a = intercept (where line cuts Y axis) b = slope of the line (can be +ve or –ve) Software fits the ‘best guess’ for a and b. Regression Analysis: Leaving versus Mock The regression equation is Leaving = 7.75 + 0.817 Mock Predictor Coef SE T P Constant 7.749 3.122 2.48 0.023 Mock 0.81749 0.04284 19.08 0.000

More questions Is there an effect of gender? That is can one consider different lines for red and blue? Should these lines be parallel or not? What about ‘adjusting’ for other covariates such as age or ‘intelligence’? More than one variable = multiple linear regression