TWO-STAGE CASE-CONTROL STUDIES USING EXPOSURE ESTIMATES FROM A GEOGRAPHICAL INFORMATION SYSTEM Jonas Björk 1 & Ulf Strömberg 2 1 Competence Center for.

Slides:



Advertisements
Similar presentations
Sources and effects of bias in investigating links between adverse health outcomes and environmental hazards Frank Dunstan University of Wales College.
Advertisements

1 Analyzing HIV Prevalence Trends from Antenatal Clinic (ANC) Sentinel Surveillance Data and Serosurveillance Data from High Risk Groups* Ray Shiraishi.
M2 Medical Epidemiology
Uncertainty and confidence intervals Statistical estimation methods, Finse Friday , 12.45–14.05 Andreas Lindén.
Nicky Best, Chris Jackson, Sylvia Richardson Department of Epidemiology and Public Health Imperial College, London Studying.
1 Case-Control Study Design Two groups are selected, one of people with the disease (cases), and the other of people with the same general characteristics.
“Personality, Socioeconomic Status, and All-Cause Mortality in the United States” - Chapman BP et al. Journal Club 02/24/11.
Revisiting causal neighborhood effects on individual ischemic heart disease risk: a quasi-experimental analysis among Swedish siblings Juan Merlo In collaboration.
Estimation of Sample Size
FINAL REVIEW BIOST/EPI 536 December 14, Outline Before the midterm: Interpretation of model parameters (Cohort vs case-control studies) Hypothesis.
Intermediate methods in observational epidemiology 2008 Quality Assurance and Quality Control.
Measures of association
Basic Elements of Testing Hypothesis Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology Director, Data Coordinating Center College.
Bias in Epidemiology Wenjie Yang
BIOST 536 Lecture 3 1 Lecture 3 – Overview of study designs Prospective/retrospective  Prospective cohort study: Subjects followed; data collection in.
NACC National Alzheimer’s Coordinating Center Time Dependent Exposure in Case-Control Studies Roger Higdon, PhD Senior Biostatistician NACC, University.
Lecture 9: p-value functions and intro to Bayesian thinking Matthew Fox Advanced Epidemiology.
Hierarchical models for combining multiple data sources measured at individual and small area levels Chris Jackson With Nicky Best and Sylvia Richardson.
Study Design / Data: Case-Control, Descriptives Basic Medical Statistics Course: Module C October 2010 Wilma Heemsbergen
Measuring Associations Between Exposure and Outcomes.
Case control study Moderator : Chetna Maliye Presenter Reshma Sougaijam.
Evidence-Based Medicine 4 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Estimation of Various Population Parameters Point Estimation and Confidence Intervals Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology.
Design and Analysis of Clinical Study 8. Cross-sectional Study Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
1 Rob Woodruff Battelle Memorial Institute, Health & Analytics Cynthia Ferre Centers for Disease Control and Prevention Conditional.
October 15H.S.1 Causal inference Hein Stigum Presentation, data and programs at:
Amsterdam Rehabilitation Research Center | Reade Multiple regression analysis Analysis of confounding and effectmodification Martin van de Esch, PhD.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Literature searching & critical appraisal Chihaya Koriyama August 15, 2011 (Lecture 2)
A short introduction to epidemiology Chapter 2b: Conducting a case- control study Neil Pearce Centre for Public Health Research Massey University Wellington,
The binomial applied: absolute and relative risks, chi-square.
A short introduction to epidemiology Chapter 4: More complex study designs Neil Pearce Centre for Public Health Research Massey University Wellington,
An Introductory Lecture to Environmental Epidemiology Part 5. Ecological Studies. Mark S. Goldberg INRS-Institut Armand-Frappier, University of Quebec,
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Chapter 2 Nature of the evidence. Chapter overview Introduction What is epidemiology? Measuring physical activity and fitness in population studies Laboratory-based.
What is “collapsing”? (for epidemiologists) Picture a 2x2 tables from Intro Epi: (This is a collapsed table; there are no strata) DiseasedUndiseasedTotal.
Leicester Warwick Medical School Health and Disease in Populations Case-Control Studies Paul Burton.
Issues concerning the interpretation of statistical significance tests.
Measuring covariate data_Presentation (November 14, 2007) 1 Measuring covariate data in subsets of study populations: Design options Jean-François Boivin,
Case Control Study : Analysis. Odds and Probability.
A short introduction to epidemiology Chapter 9: Data analysis Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
Master’s Essay in Epidemiology I P9419 Methods Luisa N. Borrell, DDS, PhD October 25, 2004.
BC Jung A Brief Introduction to Epidemiology - XIII (Critiquing the Research: Statistical Considerations) Betty C. Jung, RN, MPH, CHES.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Authenticity of results of statistical research. The Normal Distribution n Mean = median = mode n Skew is zero n 68% of values fall between 1 SD n 95%
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Chronic Obstructive Pulmonary Disease Steven Markowitz, Problem-Based Exercises for Environmental Epidemiology, Office of Global and Integrated Environmental.
Instructor Resource Chapter 15 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
Clinical Epidemiology and Evidence-based Medicine Unit FKUI – RSCM
Arsenic and Nonmelanoma Skin Cancer in Slovakia Beate Pesch Environmental Health Research Institute, Germany.
Design of Clinical Research Studies ASAP Session by: Robert McCarter, ScD Dir. Biostatistics and Informatics, CNMC
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Tutorial I: Missing Value Analysis
Bayesian methods in epidemiological research JONAS BJÖRK, LUND UNIVERSITY. 5 FEBRUARY 2016.
A short introduction to epidemiology Chapter 6: Precision Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Contingency Tables.
Meta-analysis of observational studies Nicole Vogelzangs Department of Psychiatry & EMGO + institute.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Table 1. Methodological Evaluation of Observational Research (MORE) – observational studies of incidence or prevalence of chronic diseases Tatyana Shamliyan.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Associate Professor Dept. of Family & Community Medicine.
Epidemiology 503 Confounding.
Lecture 1: Fundamentals of epidemiologic study design and analysis
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Intermediate methods in observational epidemiology 2008
Effect Modifiers.
Presentation transcript:

TWO-STAGE CASE-CONTROL STUDIES USING EXPOSURE ESTIMATES FROM A GEOGRAPHICAL INFORMATION SYSTEM Jonas Björk 1 & Ulf Strömberg 2 1 Competence Center for Clinical Research 2 Occupational and Environmental Medicine Lund University Hospital

OUTLINE OF TALK Previous project: What have we done? (Jonas Björk) Ongoing project: What shall we do? (Ulf Strömberg)

Two-stage procedure for case- control studies 1 st stage Complete data obtained from registries Disease status General characteristics Group affiliation (e.g. occupation or residential area)  Group-level exposure X G 2 nd stage Individual exposure data for a subset of the 1 st stage sample

Exposure database  group-level exposure JEM = Job Exposure Matrix Occupational group  proportion exposed GIS Residential group (area)  average concentration of an air pollutant

JEM - proportion exposed Most data typically in groups with low X G

Linear Relation between Proportion Exposed and Relative Risk No confounding between/within groups Example: RR (exposed vs. unexposed) = 2.0 Proportion exposed X G Average RR 0%1.0 10%0.10 * =1.1 50% %2.0

Linear OR model: OR(X G ) = 1 + β X G X G = Exposure proportion OR for exposed vs. unexposed = OR(1) = 1 + β 1 OR(1) XGXG 0 1 Most data typically in groups with low X G

Confounding between groups General confounders (eg, gender and age) can normally be adjusted for Assuming no confounding within groups and no effect modification in any stratum s k : OR(X G ;s 1, s 2,...s k ) = (1 + β X G ) exp(Σγ k s k )

Combining 1 st and 2 nd stage data Assumption: 2 nd stage data missing at random condition on disease status and 1 st stage group affiliation For subjects with missing 2 nd stage data: Use 1 st stage data to calculate expected number of exposed/unexposed Expectation-maximization (EM) algorithm

EM-algorithm (Wacholder & Weinberg 1994) 1.Select a starting value, e.g. OR=1 2.E-step Among the non-participants, calculate expected number of exposed/unexposed case and controls in each group 3.M-step Maximize the likelihood for observed+expected cell frequencies using the chosen risk model for individual-level data (not necessarily linear)  New OR-estimate 4. Repeat 2. and 3. until convergence

E-step in our situation (Strömberg & Björk, submitted) m 0 controls with missing 2 nd stage data  m 0 * X G = expected number of exposed m 1 cases with missing 2 nd stage data  m 1 * X G * ÔR / [1+(ÔR-1)* X G ] ÔR = Current OR-estimate Complete the data in each group G:

Simulated case-control studies 400 cases, 1200 controls in the 1 st stage 2 nd stage participation 75% of the cases 25% of the controls Selective participation of 2 nd stage controls Corr(Participation, X G ) =0, > 0, < replications in each scenario True OR = 3

Simulations - Results Participation1 st stage data only ( ) 2 nd stage data only ( ) EM-method ( ) ORSDCoverageORSDCoverageORSDCoverage Corr(Part., X G )= % % % Corr(Part., X G )< % % % Corr(Part., X G )> % % % SD = Empirical standard deviation of the ln(OR) estimates Coverage = Coverage of 95% confidence intervals

Simulations - Conclusions Combining 1 st and 2 nd stage data, using the EM method can: 1. Improve precision 2. Remove bias from selective participation Method is sensitive to errors in the (1 st stage) external exposure data!

Simulations – Conclusions II EM-method is sensitive to 1.Violations of the MAR-assumption (condition on on disease status and 1 st stage group affiliation) 2. Errors in the (1 st stage) external exposure data

Ongoing methodological research project Focus on exposure estimates from a GIS

GIS data: NO2 (Scania)

Two-stage exposure assessment procedure X G = 4.8 X G = 10.1 X G = x i 1 st stage: X G represents mean exposure levels rather than proportion exposed x i 2 nd stage: x i is a continuous, rather than a dichotomous, exposure variable

Assume a linear relation between and x i and disease odds (cf. radon exposure and lung cancer [Weinberg et al., 1996]). xixi Odds For the ”only 1 st stage” subjects: no bias expected by using their X G :s (Berkson errors) provided MAR in each group – independent of disease status. EM method? Exposure variation in each group?

Two-stage exposure assessment procedure – related work Multilevel studies with applications to a study of air pollution [Navidi et al., 1994]: pooling exposure effect estimates based on individual-level and group-level models, respectively

Collecting data on confounders or effect modifiers at 2 nd stage X G = 4.8 X G = 10.1 X G = c i 1 st stage: X G = mean exposure levels c i 2 nd stage: c i is a covariate, e.g. smoking history

Data on confounders or effect modifiers at 2 nd stage – estimation of exposure effect Confounder adjustment based on logistic regression: pseudo-likelihood approach [Cain & Breslow, 1988] More general approach: EM method [Wacholder & Weinberg, 1994]

Design stage (“stage 0”) Group 1 Group 2 Group 3... Subjects? 1 st stage: How many geographical areas (groups)? ? ? 2 nd stage: Fractions of the 1 st stage cases and controls?

Design stage – related work Two-stage exposure assessment: power depends more strongly on the number of groups than on the number of subjects per group [Navidi et al., 1994]

References I Björk & Strömberg. Int J Epidemiol 2002;31: Strömberg & Björk. “Incorporating group- level exposure information in case-control studies with missing data on dichotomous exposures”. Submitted.

References II Cain & Breslow. Am J Epidemiol 1988;128: Navidi et al. Environ Health Perspect 1994;102(Suppl 8): Wacholder & Weinberg. Biometrics 1994;50: Weinberg et al. Epidemiology 1996;7:190-7.