Case-control studies: statistics

Slides:



Advertisements
Similar presentations
1 Epidemiologic Measures of Association Saeed Akhtar, PhD Associate Professor, Epidemiology Division of Epidemiology and Biostatistics Aga Khan University,
Advertisements

Matching in Case-Control Designs EPID 712 Lecture 13 02/23/00 Megan O’Brien.
M2 Medical Epidemiology
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
KRUSKAL-WALIS ANOVA BY RANK (Nonparametric test)
Observational Designs Oncology Journal Club April 26, 2002.
Case-Control Studies (Retrospective Studies). What is a cohort?
KINE 4565: The epidemiology of injury prevention Case control and case crossover studies.
Chapter 19 Stratified 2-by-2 Tables
Chance, bias and confounding
Journal Club Alcohol and Health: Current Evidence March-April 2007.
EPI 809 / Spring 2008 Final Review EPI 809 / Spring 2008 Ch11 Regression and correlation  Linear regression Model, interpretation. Model, interpretation.
© Scott Evans, Ph.D. and Lynne Peeples, M.S.
Intermediate methods in observational epidemiology 2008 Confounding - I.
HaDPop Measuring Disease and Exposure in Populations (MD) &
Logistic Regression In logistic regression the outcome variable is binary, and the purpose of the analysis is to assess the effects of multiple explanatory.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Case Control Study Manish Chaudhary BPH, MPH
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
Stratification and Adjustment
INTRODUCTION TO EPIDEMIOLO FOR POME 105. Lesson 3: R H THEKISO:SENIOR PAT TIME LECTURER INE OF PRESENTATION 1.Epidemiologic measures of association 2.Study.
Logistic Regression. Outline Review of simple and multiple regressionReview of simple and multiple regression Simple Logistic RegressionSimple Logistic.
Measures of Association
R Programming Odds & Odds Ratios 1. Session 3 Overview 1.Odds 2.Odds Ratio (OR) 3.Confidence Intervals for OR’s 4.Inference based on OR’s 2.
Dr.Shaikh Shaffi Ahamed Ph.D., Dept. of Family & Community Medicine
Week 6: Model selection Overview Questions from last week Model selection in multivariable analysis -bivariate significance -interaction and confounding.
October 15. In Chapter 19: 19.1 Preventing Confounding 19.2 Simpson’s Paradox 19.3 Mantel-Haenszel Methods 19.4 Interaction.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Analytical epidemiology Disease frequency Study design: cohorts & case control Choice of a reference group Biases Alain Moren, 2006 Impact Causality Effect.
Leicester Warwick Medical School Health and Disease in Populations Case-Control Studies Paul Burton.
Case Control Study : Analysis. Odds and Probability.
Design and Analysis of Clinical Study 7. Analysis of Case-control Study Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
Chronic Obstructive Pulmonary Disease Steven Markowitz, Problem-Based Exercises for Environmental Epidemiology, Office of Global and Integrated Environmental.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
COHORT STUDY COHORT A group of people who share a common characteristic or experience within a defined period of time. e.g. age, occupation, exposure.
Matching. Objectives Discuss methods of matching Discuss advantages and disadvantages of matching Discuss applications of matching Confounding residual.
POPLHLTH 304 Regression (modelling) in Epidemiology Simon Thornley (Slides adapted from Assoc. Prof. Roger Marshall)
Probability and odds Suppose we a frequency distribution for the variable “TB status” The probability of an individual having TB is frequencyRelative.
Matched Case-Control Study Duanping Liao, MD, Ph.D Phone:
THE CHI-SQUARE TEST BACKGROUND AND NEED OF THE TEST Data collected in the field of medicine is often qualitative. --- For example, the presence or absence.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Contingency Tables.
Analysis of matched data Analysis of matched data.
Case control & cohort studies
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Epidemiological Study Designs And Measures Of Risks (1)
Measures of Association (2) Attributable Risks and fractions October Epidemiology 511 W. A. Kukull.
Chapter 9: Case Control Studies Objectives: -List advantages and disadvantages of case-control studies -Identify how selection and information bias can.
March 28 Analyses of binary outcomes 2 x 2 tables
Lecture notes on epidemiological studies for undergraduates
Matched Case-Control Study
Epidemiologic Measures of Association
Epidemiological Methods
Epidemiology 503 Confounding.
The binomial applied: absolute and relative risks, chi-square
Statistics 103 Monday, July 10, 2017.
Lecture 6: Introduction to effect modification (part 2)
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Saturday, August 06, 2016 Farrokh Alemi, PhD.
Kanguk Samsung Hospital, Sungkyunkwan University
Epidemiology MPH 531 Analytic Epidemiology Case control studies
Measurements of Risk & Association …
Evaluating Effect Measure Modification
Measures of risk and association
Cohort and longitudinal studies: statistics
Research Techniques Made Simple: Interpreting Measures of Association in Clinical Research Michelle Roberts PhD,1,2 Sepideh Ashrafzadeh,1,2 Maryam Asgari.
Chapter 18 Part C: Matched Pairs
Risk Ratio A risk ratio, or relative risk, compares the risk of some health-related event such as disease or death in two groups. The two groups are typically.
Effect Modifiers.
Presentation transcript:

Case-control studies: statistics Kath Bennett 16 April 2008

Outline Statistical methods Odds ratios in unmatched and matched studies Adjusted Odd ratio Mantel-Haenszel method Logistic regression Conditional logistic regression

Examples of case-control studies Doll and Hill. Lung cancer and smoking. INTERHEART study. Large international standardised case-control study on aetiology of CHD.

Data from a case-control study are expressed as: Past exposure to suspected factor Cases (with disease under study) Controls (without disease under study) Total Yes (a) 200 (c) 50 250 No (b)300 (d)450 750 500 1000

Data from unmatched case-control studies are analysed in terms of: Comparison of the frequency of exposure between the cases and controls Odds ratios (OR), i.e, the ratio of  the odds in favour of exposure among the cases compared to the odds in favour of exposure among the controls:  a / b  =  a d c / d b c For a rare disease, or short observation time (which makes the disease effectively rare), the odds-ratio (OR) approximates RR.

Odds ratio Using the example above: 40% (200/500) of the cases were exposed to the suspected risk factor vs. 10% (50/500) of the controls.  The odds ratio is calculated as:  200/300 = 200 x 450 = 6  50/450     300 x 50 This is interpreted as: the cases are 6 times more likely than the controls to have been exposed to the risk factor under study. 

Example: OR in unmatched case Age at First Full-Term Pregnancy and Breast Cancer Age(years) Cases Control <25 131 (35%) 600 (48%) >=25 243 (65%) 662 (52%) Total 374 (100%) 1262 (100%) More cases than controls had their first child after the age of 25 years: 65% vs. 52%. Odds ratio (OR): 243 / 131 =1.7 662 / 600

Odds Ratio in a Matched Case-Control Study (with matched pairs) History of Cigarette Smoking and Chronic Obstructive Pulmonary Disease (COPD) Cases with COPD Smoker Nonsmoker Total Controls (No COPD) Smoker 14 5 19 Nonsmoker 17 16 33 Total 31 21 52 Pairs: a. Case smoker, control smoker = 14 b. Case smoker, control nonsmoker =17 c. Case nonsmoker, control smoker =5 d. Case nonsmoker, control nonsmoker = 16

Odds Ratio in a Matched Case-Control Study (with matched pairs) Pairs: a. Case smoker, control smoker = 14 b. Case smoker, control nonsmoker = 17 c. Case nonsmoker, control smoker = 5 d. Case nonsmoker, control nonsmoker = 16 Only the discordant pairs contribute relevant information i.e. those where the case is a smoker and the control is a non-smoker (b) or visa-versa (c). The OR is the ratio of pairs in which the case is a smoker and the control is a non-smoker (b) to pairs in which the case is a non-smoker and the control is a smoker (c). OR=b/c. Therefore, the odds ratio based on the discordant pairs is 17/5 = 3.4

Odds ratio and 95% confidence limits An odds ratio >1 indicates a positive association between a factor and the disease. An odds ratio <1 indicates that the factor is protective. An odds ratio of 1.0 indicates that there is no association between the factor and the disease. Confidence limits are used to interpret the significance of relative risks and odds ratios. Relative risks and odds ratios are interpreted as statistically significant if confidence intervals do not include 1. OR = 7.0 (0.5 - 15.0) OR = 7.0 (3.0 - 12.0) The second odds ratio is statistically significant, the first one is not.

Adjusted ORs Mantel-Haenszel (stratified) method can be applied to deal with confounding. Assumption that the OR for association does not differ by level of confounding variable. Logistic regression for unmatched case-control studies, deal with confounding and interaction Conditional logistic regression for matched case-control studies.

Example: Mantel-Haenszel method Is smoking a risk factor for pulmonary carcinoma after adjustment for occupation and age? Cases (diseased) Controls (no disease) Occupation Age Non-smoker >=1 pack/day Housewives <45 45-54 55-64 65+ 2 5 6 11 3 7 24 49 42 1 White collar 4 18 23 Other occupation 10 12 19 15 M-H Adjusted OR=10.68 (95% CI 4.47, 39.26)

Logistic regression Binary outcome e.g. Case/control Logistic regression used to predict the likelihood of being a case vs control in relation to explanatory variables (risk factors). The resulting beta or regression coefficients are log(OR), therefore adjusted OR and 95% CI can be presented. Similar in many respects to ordinary regression methods.

Logistic Regression Logistic regression allows us to do 2x2 table analysis, and much more: Let y = 1 if MI, 0 if not Let x = 1 if new risk factor, 0 if no risk factor What is difference between an “exposed” and “unexposed” pair of individuals?

Logistic Regression What is the interpretation of β1? That was simplest case Logistic regression allows us much more freedom: x’s can be anything (continuous, binary, etc.) And we need to adjust for x2 = BMI, x3 = smoking, x4 = alcohol, x5 = years before diagnosis etc.. What is the interpretation of β1?

Interpretation of coefficients β1 is the log odds ratio comparing risk of MI for those with new risk factor vs no risk factor, adjusted for BMI, smoking, alcohol, and years since diagnosis. e β = 2.4 Individuals with new risk factor are at 2.4 times the risk of MI compared to those without risk factor, adjusting for BMI, smoking, alcohol, and years since diagnosis.

Conditional logistic regression Conditional logistic regression is similar to the usual logistic regression except it keeps track of which case was matched to which control The computer output and presentation of conditional logistic regression is similar to logistic regression

Example: Study of exposure to medication and risk of falls N=1,486 cases in nursing home with injurious fall matched to 1,486 matched controls Controls drawn from similar population matched on: Sex, Age, level of dependency, year and Duration of residence, chronic disorder

Example: results from conditional logistic regression OR 95% CI Inotrophic agents 0.69** 0.54-0.89 BBs 1.04 0.64-1.63 Anti-hypertensives 0.91 0.68-1.26 Anti-psychotic agents 1.31** 1.06-1.61 Anxiolytics/sedatives/hypnotics 1.35** 1.09-1.68

Interaction Interaction occurs between two risk factors when the effect of one risk factor upon disease is different at different levels of the second risk factor. Also know as ‘effect modification’. When there is no interaction the effects of each risk factor are constant across levels of other risk factors (homogeneous).