Applied Epidemiologic Analysis Fall 2002 Applied Epidemiologic Analysis Patricia Cohen, Ph.D. Henian Chen, M.D., Ph. D. Teaching Assistants Julie KranickSylvia.


Patricia Cohen, Ph.D.
Henian Chen, M.D., Ph.D.
Teaching Assistants: Julie Kranick, Sylvia Taylor, Chelsea Morroni, Judith Weissman

Lecture 7
- Categorical analysis
- Conditional logistic regression
- Unconditional logistic regression
- Introduction to stratifiers

Objectives
- To understand the basic assumptions of analyses of case-control and cohort data
- To see how assumptions about the predictor variables differ between categorical analyses and some regression models
- To see the connection between stratified analyses and analyses incorporating all stratifiers as predictors

Categorical analysis: analyses of tables of frequencies
Assumptions / requirements:
- Adequate sample size in each table cell and in total
- Independence of outcomes: no contagion effects, and a single event per person
- For rates, homogeneity: the probability of the outcome is uniform for all time units in a stratum (e.g., it doesn't matter whether 6 people are observed for 10 years or 10 people are observed for 6 years)

Categorical analysis
- Does not assume that the distributions of exposure and other predictors are fixed.
- In contrast, ordinary regression analysis assumes that the distributions of the independent variables are fixed (selected or created by the researchers, rather than whatever distributions happen to characterize the sampled population).
- Ordinary or "unconditional" logistic regression also assumes that the independent variables are fixed.

Categorical analysis of incidence rates: a single group in comparison to some expected rate
- Incidence per time unit in the exposed group
- Incidence per time unit expected in the reference population for the same distribution of person-time (e.g., based on morbidity rates for equivalent age groups)

Categorical analysis: a single group in comparison to some expected rate
- Rate ratio = standardized morbidity ratio (SMR).
- Since the person-time distribution is constant, it cancels from the ratio, leaving the number of observed cases divided by the number of expected cases, E.
- Confidence limits on this rate ratio employ the Poisson distribution and maximum likelihood estimation. They should be adequate when E > 5.
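As an illustration of the observed-over-expected ratio and its Poisson-based limits, here is a minimal Python sketch. The counts are hypothetical, the function name is made up for this example, and the limits use Byar's approximation to the exact Poisson limits rather than whatever procedure the course software applied.

```python
import math

def smr_with_byar_ci(observed, expected, z=1.96):
    """Return (SMR, lower, upper); limits use Byar's approximation
    to the exact Poisson confidence limits for the observed count."""
    if observed <= 0 or expected <= 0:
        raise ValueError("observed and expected counts must be positive")
    smr = observed / expected
    # Approximate lower limit for the observed count
    lower_count = observed * (
        1 - 1 / (9 * observed) - z / (3 * math.sqrt(observed))
    ) ** 3
    # Approximate upper limit for the observed count
    a1 = observed + 1
    upper_count = a1 * (1 - 1 / (9 * a1) + z / (3 * math.sqrt(a1))) ** 3
    return smr, lower_count / expected, upper_count / expected

# Hypothetical data: 30 observed cases where 20 were expected.
smr, lower, upper = smr_with_byar_ci(observed=30, expected=20.0)
print(f"SMR = {smr:.2f}, approx. 95% CI = ({lower:.2f}, {upper:.2f})")
```

With an expected count this far above 5, the approximation should sit close to the exact Poisson limits; for very small counts an exact method would be the safer choice.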

Categorical analysis of 2 groups, exposed and unexposed
- Maximum likelihood estimates using the Poisson model are used to estimate rate ratios and rate differences, or risk ratios and risk differences.
- Hand calculation of these estimates is rare, partly because of the inclusion of multiple confounders and/or exposures in the models.

Selecting an analytic model for case-control (or cohort) data
- The ordinary least squares (OLS) method of analyzing dichotomous outcomes is problematic because a formal assumption of the model (homoscedasticity) is necessarily violated.
- Nevertheless, for case-control data with similar sample sizes in the two groups, conclusions from OLS and logistic regression may well be similar.

Ordinary Least Squares
- This model uses the "identity" function as its link function: a difference in the value of the predictor is (linearly) related to a difference in the value of the outcome.
- When the outcome is disease or non-disease, this is equivalent to a difference in the proportion with the disease (incidence or prevalence).
- For a binary exposure, B = the difference in proportions, or risk difference.

Logistic Regression Model
- The link function used by the logistic regression model (estimated by maximum likelihood methods) is the log odds, or logit.
- In this model, for a binary exposure, B = the difference in the log odds of the outcome (disease).
- It is equivalent to an exponential odds model, so taking the anti-log of B provides the odds ratio, which estimates the risk ratio when the outcome is rare.
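The contrast between the two link functions can be sketched with a single binary exposure. The counts below are hypothetical and not taken from the lecture. The OLS slope is computed directly from its covariance formula, and the logistic B uses the closed form log(ad/bc), which is the maximum likelihood estimate when the model contains only one binary predictor.

```python
import math

# Hypothetical cohort counts (not from the lecture).
exposed_cases, exposed_noncases = 30, 70
unexposed_cases, unexposed_noncases = 10, 90

# One (exposure, outcome) record per person.
records = (
    [(1, 1)] * exposed_cases + [(1, 0)] * exposed_noncases
    + [(0, 1)] * unexposed_cases + [(0, 0)] * unexposed_noncases
)
n = len(records)
mean_x = sum(x for x, _ in records) / n
mean_y = sum(y for _, y in records) / n

# Identity link: OLS slope B = cov(x, y) / var(x).
cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in records)
var_x = sum((x - mean_x) ** 2 for x, _ in records)
ols_b = cov_xy / var_x

risk_exposed = exposed_cases / (exposed_cases + exposed_noncases)
risk_unexposed = unexposed_cases / (unexposed_cases + unexposed_noncases)
risk_difference = risk_exposed - risk_unexposed
risk_ratio = risk_exposed / risk_unexposed

# Logit link: with one binary predictor, B = log(ad / bc).
odds_ratio = (exposed_cases * unexposed_noncases) / (
    exposed_noncases * unexposed_cases
)
logit_b = math.log(odds_ratio)

print(f"OLS B = {ols_b:.3f} (risk difference = {risk_difference:.3f})")
print(f"Logistic B = {logit_b:.3f}, exp(B) = {odds_ratio:.3f}")
print(f"Risk ratio = {risk_ratio:.3f}")
```

In this made-up table the outcome is common (20% overall), so the odds ratio is noticeably larger than the risk ratio; the two converge as the outcome becomes rarer.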

Other models
- Exponential risk models: a log-linear risk model (requires an estimate of risk in the source population)
- Probit model: assumes a normal distribution underlying the outcome; used in bioassay and economics
Note: These alternative models are designed to provide appropriate statistical tests, but do not necessarily match the actual biological mechanisms.

Stratification of case-control data
- A means of equating groups on the stratifiers
- Most often done on sex and age categories
Note: If the groups differ non-trivially in age, a mean age difference will remain within categories.

Stratifying variables: standardization
Standardization of rates or risks with regard to a stratifying variable.
Example:
- Control group = 40% male
- Case group = 60% male
We can standardize the case group to the control group by weighting every female case by 1.5 and every male case by 0.67, so that the sum of the weights still equals N in the case group.

Stratifying variables: standardization
Thus, for every 100 cases we have:
- 60 males * 0.67 (= 40)
- 40 females * 1.5 (= 60)
- Weighted N = 100
Alternatively, both the case and control groups could be weighted to equal male and female sizes.
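The weights in this example follow from dividing each sex's share in the control group by its share in the case group. A minimal sketch using the slide's percentages (the variable names are ours):

```python
# Per 100 cases, as in the slide: 60% male cases, 40% male controls.
case_counts = {"male": 60, "female": 40}
control_share = {"male": 0.40, "female": 0.60}

n_cases = sum(case_counts.values())

# Weight = target (control) share / observed (case) share.
weights = {
    sex: control_share[sex] / (case_counts[sex] / n_cases)
    for sex in case_counts
}
weighted_counts = {sex: case_counts[sex] * weights[sex] for sex in case_counts}
weighted_n = sum(weighted_counts.values())

for sex in case_counts:
    print(f"{sex}: weight = {weights[sex]:.2f}, "
          f"weighted count = {weighted_counts[sex]:.0f}")
print(f"Weighted N = {weighted_n:.0f}")
```

The weighted case group then has the same 40/60 sex split as the controls while keeping its original total.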

Weighting for rate or risk difference, or unconditional logistic regression
This weighting to produce equality on predictors can be done for hand calculation of rate or risk differences, or for computer analyses of data by conditional or unconditional logistic regression.
Note: This is only one reason for weighting observations. Another common reason is to take into account sampling strategies with unequal probabilities of inclusion. Such strategies often over-sample certain strata in order to improve the statistical power for analyses of subgroups.

Weighting
- It is useful to see this as analogous to what the analytic program does when inclusion of a predictor "equates" groups by removing the effects of confounders.
- Simple standardization assumes a uniform effect of exposure across strata: each stratum provides an estimate of the same quantity.
- Statistical tests of homogeneity are commonly used to decide whether this assumption is warranted.

Mantel-Haenszel Estimation
- Mantel-Haenszel estimation of uniform rate differences (using weights as described above, applied to person-time)
- Preferred when some strata have fewer than 10 cases
- Unbiased, unlike maximum likelihood estimates, but with a larger SE (much larger for the rate difference, not much larger for the rate ratio)
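A short sketch of Mantel-Haenszel pooling for person-time data, using the standard textbook formulas for the pooled rate ratio and rate difference. The strata, the function name, and the tuple layout are hypothetical choices made for this illustration.

```python
def mantel_haenszel_rates(strata):
    """Pool stratum-specific rates.

    strata: iterable of (exposed_cases, exposed_person_time,
                         unexposed_cases, unexposed_person_time).
    Returns (MH rate ratio, MH rate difference).
    """
    rr_num = rr_den = rd_num = rd_den = 0.0
    for a, t1, b, t0 in strata:
        total = t1 + t0
        # Rate ratio: sum(a * T0 / T) / sum(b * T1 / T)
        rr_num += a * t0 / total
        rr_den += b * t1 / total
        # Rate difference: weighted mean with weights T1 * T0 / T
        weight = t1 * t0 / total
        rd_num += weight * (a / t1 - b / t0)
        rd_den += weight
    return rr_num / rr_den, rd_num / rd_den

# Hypothetical strata, each with a stratum-specific rate ratio of 2.
strata = [
    (10, 1000.0, 5, 1000.0),
    (40, 2000.0, 10, 1000.0),
]
mh_rr, mh_rd = mantel_haenszel_rates(strata)
print(f"MH rate ratio = {mh_rr:.3f}")
print(f"MH rate difference = {mh_rd:.5f} per unit of person-time")
```

Because both hypothetical strata share the same rate ratio, the pooled ratio reproduces it; the stratum rate differences are not equal (0.005 and 0.010), so the pooled difference falls between them.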

First Study: Wine drinking and risk of non-Hodgkin's lymphoma among men in the United States: a population-based case-control study
Reference: Nathaniel C. Briggs, Robert S. Levine, Linda D. Bobo, William P. Haliburton, Edward A. Brann, and Charles H. Hennekens, American Journal of Epidemiology, 156, No. 5,

The problem: Lymphoma study
- Non-Hodgkin's lymphoma (NHL) is the fifth most common cancer in the United States, with etiology mostly unknown.
- Can exploration of protective factors help move toward etiological understanding?
- Specifically, will this study strengthen prior weak evidence of lower NHL risk in wine drinkers?

Population studied, study design, and sample size: Lymphoma study
- 960 male cases of NHL, born 1929-1953 and diagnosed 1984-1988 (without specific known risks such as HIV)
- 1717 male controls, recruited through random digit dialing and matched geographically

Measurement issues: Lymphoma
- Data were collected by interviews regarding lifetime habits.
Selection and inclusion of predictors in the analysis:
- All odds ratios (OR) are adjusted for age, race/ethnicity, cancer registry, smoking history, and education.
- Odds ratios for each alcoholic beverage type are adjusted for the other types.
- All odds ratios are in reference to nondrinkers.

The effect being estimated: Lymphoma
- Odds ratios of NHL associated with alcohol consumption, by type and quantity over the lifetime
Basic analysis to answer study questions:
- Logistic regression analysis
- Test for the significance of the trend (dose-response) in the OR as dose increases
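To make the idea of a dose-response trend test concrete, here is a sketch of the Cochran-Armitage test for trend in proportions. This is a simpler, unadjusted stand-in and not the adjusted logistic regression trend test the study used; the dose scores and counts are hypothetical and are not the study's data.

```python
import math

def cochran_armitage_trend(scores, cases, controls):
    """Return (z, two-sided p) for a linear trend in the proportion
    of cases across ordered dose categories."""
    totals = [a + b for a, b in zip(cases, controls)]
    n = sum(totals)
    p_bar = sum(cases) / n
    # Score-weighted sum of observed minus expected cases
    t_stat = sum(x * (a - m * p_bar)
                 for x, a, m in zip(scores, cases, totals))
    sum_mx2 = sum(m * x * x for x, m in zip(scores, totals))
    sum_mx = sum(m * x for x, m in zip(scores, totals))
    variance = p_bar * (1 - p_bar) * (sum_mx2 - sum_mx ** 2 / n)
    z = t_stat / math.sqrt(variance)
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal p
    return z, p_value

# Hypothetical counts: the share of cases falls as the dose score rises.
dose_scores = [0, 1, 2, 3]
case_counts_by_dose = [100, 80, 60, 40]
control_counts_by_dose = [100, 100, 100, 100]

z, p_value = cochran_armitage_trend(
    dose_scores, case_counts_by_dose, control_counts_by_dose
)
print(f"z = {z:.2f}, two-sided p = {p_value:.5f}")
```

A negative z indicates a falling proportion of cases with increasing dose, which is the direction a protective dose-response relationship would take.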


Conclusions: Lymphoma
"Among wine drinkers, there was a significant linear decrease in risk of NHL with increasing quantity of wine intake. A more than twofold decrease in risk was seen for consumption of one wine drink or more per day."
Note that the p for trend tests the dose-response aspect.


Conclusions: Lymphoma
- Early age of onset of drinking was associated with decreased risk of NHL, specifically for wine drinkers.
- The authors discussed biologic plausibility, probable effects of self-report, data limitations (biases would generally be expected to lower effects), and the age-sex limitations of the sample.

Second Study: Occupation and Adult Gliomas
Reference: Susan E. Carozza, Margaret Wrensch, Rei Miike, Beth Newman, Andrew F. Olshan, David A. Savitz, Michael Yost, and Marion Lee, American Journal of Epidemiology, 152, No. 9,

The problem: Gliomas
- Gliomas are the most common form of primary malignant brain tumor in adults.
- The etiology is largely unknown, but prior evidence implicates occupational exposures associated with certain chemically exposed industrial, agricultural, and blue-collar workers.

Population studied, study design, and sample size: Gliomas
492 incident cases in San Francisco bay area, age over
controls recruited through random digit dialing, matched by:
- 5-year age group
- gender
- ethnicity
(Note: 1/3 declined to participate. Controls were more educated because of participation bias.)

Measurement issues: Gliomas
- Because of the rapid death of cases, many proxy informants were needed to supply information. How might these interviews be biased?
- Are the controls likely to be adequate?
- Control variables: age ( vs 55+), gender, years of education, race

Analyses
Exposure measures:
- All jobs held at least 6 months in lifetime
- All jobs up to 10 years previously (assuming a 10-year latency)
Within each:
- ever employed
- employed < 10 years
- employed ≥ 10 years

Logic of study: Gliomas
- If real, the association should increase with longer exposure.
- Also, if real, the effect should be more apparent when the latency period is excluded.


Gliomas: Odds Ratios
- Virtually no odds ratios were statistically significantly different from 1.0.
- Nevertheless, several were discussed. Is this sensible?
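Whether an odds ratio differs significantly from 1.0 can be read from its confidence interval. The sketch below computes an unadjusted odds ratio with a Woolf (logit) 95% interval for one occupation; the counts are invented for illustration and are not taken from the gliomas paper, whose estimates were adjusted for the control variables.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """a, b = exposed cases, exposed controls;
    c, d = unexposed cases, unexposed controls.
    Returns (OR, lower, upper) using the Woolf (logit) interval."""
    estimate = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(estimate) - z * se_log)
    upper = math.exp(math.log(estimate) + z * se_log)
    return estimate, lower, upper

# Hypothetical counts for a rarely held occupation.
estimate, lower, upper = odds_ratio_ci(a=15, b=10, c=285, d=290)
excludes_one = not (lower <= 1.0 <= upper)

print(f"OR = {estimate:.2f}, 95% CI = ({lower:.2f}, {upper:.2f})")
print("CI excludes 1.0" if excludes_one else "CI includes 1.0")
```

With few exposed subjects the interval is wide: an odds ratio of about 1.5 here is compatible with anything from a modest protective effect to a more than threefold increase, which is one way to frame the question of whether discussing non-significant estimates is sensible.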