Conditional Logistic Regression for Matched Data HRP 261 02/25/04 reading: Agresti chapter 9.2.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

M2 Medical Epidemiology
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Logistic Regression.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Chance, bias and confounding
Lecture 17: Regression for Case-control Studies BMTRY 701 Biostatistical Methods II.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
FINAL REVIEW BIOST/EPI 536 December 14, Outline Before the midterm: Interpretation of model parameters (Cohort vs case-control studies) Hypothesis.
EPI 809 / Spring 2008 Final Review EPI 809 / Spring 2008 Ch11 Regression and correlation  Linear regression Model, interpretation. Model, interpretation.
Detecting Spatial Clustering in Matched Case-Control Studies Andrea Cook, MS Collaboration with: Dr. Yi Li November 4, 2004.
Analysis of matched data; plus, diagnostic testing.
BIOST 536 Lecture 3 1 Lecture 3 – Overview of study designs Prospective/retrospective  Prospective cohort study: Subjects followed; data collection in.
Lecture 5 Outline: Thu, Sept 18 Announcement: No office hours on Tuesday, Sept. 23rd after class. Extra office hour: Tuesday, Sept. 23rd from 12-1 p.m.
BIOST 536 Lecture 9 1 Lecture 9 – Prediction and Association example Low birth weight dataset Consider a prediction model for low birth weight (< 2500.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
EPI 809/Spring Multiple Logistic Regression.
Cumulative Geographic Residual Test Example: Taiwan Petrochemical Study Andrea Cook.
Categorical Data Analysis: Stratified Analyses, Matching, and Agreement Statistics Biostatistics March 2007 Carla Talarico.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Detecting Spatial Clustering in Matched Case-Control Studies Andrea Cook, MS Collaboration with: Dr. Yi Li December 2, 2004.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
BIOST 536 Lecture 4 1 Lecture 4 – Logistic regression: estimation and confounding Linear model.
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
The 2x2 table, RxCxK contingency tables, and pair-matched data July 27, 2004.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
Applied Epidemiologic Analysis Fall 2002 Applied Epidemiologic Analysis Patricia Cohen, Ph.D. Henian Chen, M.D., Ph. D. Teaching Assistants Julie KranickSylvia.
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
OKU 9 Chapter 15: ORTHOPAEDIC RESEARCH Brian E. Walczak.
Evidence-Based Medicine 4 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Bootstrap and Cross-Validation Bootstrap and Cross-Validation.
Lecture 8: Generalized Linear Models for Longitudinal Data.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Analysis of matched data HRP /02/04 Chapter 9 Agresti – read sections 9.1 and 9.2.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Matching (in case control studies) James Stuart, Fernando Simón EPIET Dublin, 2006.
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
Session 13: Correlation (Zar, Chapter 19). (1)Regression vs. correlation Regression: R 2 is the proportion that the model explains of the variability.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
1 Chapter 16 logistic Regression Analysis. 2 Content Logistic regression Conditional logistic regression Application.
More Contingency Tables & Paired Categorical Data Lecture 8.
© Department of Statistics 2012 STATS 330 Lecture 22: Slide 1 Stats 330: Lecture 22.
Instructor Resource Chapter 15 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
Logistic Regression and Odds Ratios Psych DeShon.
Nonparametric Statistics
Analysis of matched data Analysis of matched data.
AP Statistics Chapter 12 Section 1. TestConfidence IntervalFormulasAssumptions 1-sample z-test mean SRS Normal pop. Or large n (n>40) Know 1-sample t-test.
Nonparametric Statistics
Stats/Methods II JEOPARDY.
Lecture 18 Matched Case Control Studies
This Week Review of estimation and hypothesis testing
Lecture 1: Fundamentals of epidemiologic study design and analysis
Nonparametric Statistics
CHAPTER 26: Inference for Regression
6-1 Introduction To Empirical Models
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Categorical Data Analysis Review for Final
Presentation transcript:

Conditional Logistic Regression for Matched Data HRP /25/04 reading: Agresti chapter 9.2

Recall: Matching Matching can control for extraneous sources of variability and increase the power of a statistical test. Match M controls to each case based on potential confounders, such as age and gender.

Recall: Agresti example, diabetes and MI Match each MI case to an MI control based on age and gender. Ask about history of diabetes to find out if diabetes increases your risk for MI.

Diabetes No diabetes DiabetesNo Diabetes MI cases MI controls P(“favors” case/discordant pair) = =the probability of observing a case-control pair with only the control exposed =the probability of observing a case-control pair with only the case exposed

Diabetes No diabetes DiabetesNo Diabetes MI cases MI controls odds(“favors” case/discordant pair) =

Logistic Regression for Matched Pairs option 1: the logistic-normal model Mixed model; logit=  i +  x Where  i represents the “stratum effect” – (e.g. different odds of disease for different ages and genders) – Example of a “random effect” Allow  i ’s to follow a normal distribution with unknown mean and standard deviation Gives “marginal ML estimate of  ”

option 2: Conditional Logistic Regression The conditional likelihood is based on…. The conditional probability (for pair-matched data): or, prospectively: P(“favors” case/discordant pair) =

The Conditional Likelihood: each discordant stratum (rather than individual) gets 1 term in the likelihood Note: the marginal probability of disease may differ in each age-gender stratum, but we assume that the (multiplicative) increase in disease risk due to exposure is constant across strata.

Recall probability terms:

 The conditional likelihood= Each age-gender stratum has the same baseline odds of disease; but these baseline odds may differ across strata

Conditional Logistic Regression

Example: MI and diabetes

Conditional Logistic Regression

Could there be an association between exposure to ultrasound in utero and an increased risk of childhood malignancies? Previous studies have found no association, but they have had poor statistical power to detect an association. Swedish researchers performed a nationwide population based case-control study using prospectively assembled data on prenatal exposure to ultrasound. Example:Prenatal ultrasound examinations and risk of childhood leukemia: case-control study BMJ 2000;320:

535 cases: all children born and diagnosed as having myeloid leukemia between 1973 and 1989 in Swedish registers of birth, cancer, and causes of death. 535 matched controls: 1 control was randomly selected for each case from the Swedish Birth Registry, matched by sex and year and month of birth.

Ultrasound No ultrasound UltrasoundNo Ultrasound Leukemia cases Myeloid leukemia controls But this type of analysis is limited to single dichotomous exposure…

Used conditional logistic regression to look at dose-response with number of ultrasounds: Results: Reference OR = 1.0; no ultrasounds OR =.91 for 1-2 ultrasounds OR=.64 for >=3 ultrasounds Conclusion: no evidence of a positive association between prenatal ultrasound and childhood leukemia; even evidence of inverse association (which could be explained by reasons for frequent ultrasound)

Each term in the likelihood represents a stratum of 1+M individuals More complicated likelihood expression! See: 02/02/04 lecture Extension: 1:M matching

e15/Lecture15_223_2003.ppt e15/Lecture15_223_2003.ppt Available here: -SAS tips, explanations and code -SAS macro that generates automatic logit plots (under “Lecture 15” at: to check if predictor is linear in the logit. Conditional Logistic Regression in SAS: Please read Ray’s slides at:

M:N Matching Syntax The basic syntax is shown here. proc phreg data=BLAH; model WEIRD*IsOUTCOME(Censor_v)= PREDICTORS /ties=discrete; strata STRATA_VARS; run; Put the values in the IsOUTCOME variable here that are the controls. Typically this is just the value 0. This is the switch requesting a m:n CLR. This is the m:n matching variable. Courtesy: Ray Balise

Part II: Rater agreement: Cohen’s Kappa Agresti, Chapter 9.5

Cohen’s Kappa Actual agreement = sum of the proportions found on the diagonals. Cohen: Compare the actual agreement with the “chance agreement” (which depends on the marginals). Normalize by its maximum possible value.

Ex: student teacher ratings Rating by supervisor 2 Rating by supervisor 1 Authoritarian Democratic Permissive Totals Authoritarian Democratic Permissive Totals

Example: student teacher ratings Null hypothesis: Kappa=0 (no agreement beyond chance)

Example: student teacher ratings Null hypothesis: Kappa=0 (no agreement beyond chance) Interpretation: achieved 36.2% of maximum possible improvement over that expected by chance alone