1 Rob Woodruff Battelle Memorial Institute, Health & Analytics Cynthia Ferre Centers for Disease Control and Prevention Conditional.

Slides:



Advertisements
Similar presentations
Chapter 2 Describing Contingency Tables Reported by Liu Qi.
Advertisements

Statistical Analysis SC504/HS927 Spring Term 2008
Qualitative predictor variables
Brief introduction on Logistic Regression
Gologit2: Generalized Logistic Regression/ Partial Proportional Odds Models for Ordinal Dependent Variables Part 1: The gologit model & gologit2 program.
Logit & Probit Regression
Limited Dependent Variables
Nguyen Ngoc Anh Nguyen Ha Trang
Nicky Best and Chris Jackson With Sylvia Richardson Department of Epidemiology and Public Health Imperial College, London
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
Ordinal Logistic Regression
EPI 809/Spring Multiple Logistic Regression.
Nemours Biomedical Research Statistics April 23, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
BIOST 536 Lecture 12 1 Lecture 12 – Introduction to Matching.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Logistic regression for binary response variables.
Linear Regression and Correlation Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and the level of.
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Mixture Modeling Chongming Yang Research Support Center FHSS College.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
Presenting Statistical Aspects of Your Research Analysis of Factors Associated with Pre-term Births in North Carolina.
MODELS OF QUALITATIVE CHOICE by Bambang Juanda.  Models in which the dependent variable involves two ore more qualitative choices.  Valuable for the.
Occurrence and timing of events depend on Exposure to the risk of an event exposure Risk depends on exposure.
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
Evidence-Based Medicine 4 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
TWO-STAGE CASE-CONTROL STUDIES USING EXPOSURE ESTIMATES FROM A GEOGRAPHICAL INFORMATION SYSTEM Jonas Björk 1 & Ulf Strömberg 2 1 Competence Center for.
Assessing Survival: Cox Proportional Hazards Model
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
HSRP 734: Advanced Statistical Methods June 19, 2008.
01/20151 EPI 5344: Survival Analysis in Epidemiology Interpretation of Models March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive.
Multilevel Linear Models Field, Chapter 19. Why use multilevel models? Meeting the assumptions of the linear model – Homogeneity of regression coefficients.
Limited Dependent Variables Ciaran S. Phibbs May 30, 2012.
Applied Epidemiologic Analysis - P8400 Fall 2002
LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Contingency tables Brian Healy, PhD. Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomous t-test, Wilcoxon test ContinuousCategorical.
Logistic and Nonlinear Regression Logistic Regression - Dichotomous Response variable and numeric and/or categorical explanatory variable(s) –Goal: Model.
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Logistic Regression Applications Hu Lunchao. 2 Contents 1 1 What Is Logistic Regression? 2 2 Modeling Categorical Responses 3 3 Modeling Ordinal Variables.
Regression & Correlation. Review: Types of Variables & Steps in Analysis.
Lecture 18 Ordinal and Polytomous Logistic Regression BMTRY 701 Biostatistical Methods II.
Limited Dependent Variables Ciaran S. Phibbs. Limited Dependent Variables 0-1, small number of options, small counts, etc. 0-1, small number of options,
1 Follow the three R’s: Respect for self, Respect for others and Responsibility for all your actions.
A first order model with one binary and one quantitative predictor variable.
We’ll now look at the relationship between a survival variable Y and an explanatory variable X; e.g., Y could be remission time in a leukemia study and.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Logistic Regression Analysis Gerrit Rooks
Qualitative and Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Logistic regression (when you have a binary response variable)
POPLHLTH 304 Regression (modelling) in Epidemiology Simon Thornley (Slides adapted from Assoc. Prof. Roger Marshall)
1 Say good things, think good thoughts, and do good deeds.
Instrument design Essential concept behind the design Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics and Demography Faculty of Public.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Analysis of matched data Analysis of matched data.
Logistic Regression Logistic Regression - Binary Response variable and numeric and/or categorical explanatory variable(s) –Goal: Model the probability.
EHS Lecture 14: Linear and logistic regression, task-based assessment
Logistic Regression APKC – STATS AFAC (2016).
Limited Dependent Models
Notes on Logistic Regression
Basic Estimation Techniques
Generalized Linear Models
Generalized Linear Models (GLM) in R
Introduction to logistic regression a.k.a. Varbrul
SA3202 Statistical Methods for Social Sciences
Introduction to Logistic Regression
Presentation transcript:

1 Rob Woodruff Battelle Memorial Institute, Health & Analytics Cynthia Ferre Centers for Disease Control and Prevention Conditional Stereotype Logistic Regression A new estimation command

2 Overview What is it? - Stereotype Logistic Regression - Conditional on what? What‘s it good for? Syntax and Examples

3 Constrained Multinomial Logistic Regression Multinomial Model -Categorical Outcome Variable -Vector of Explanatory Variables -Related through the m logits:

4 Constrained Multinomial (continued) -The stereotype model imposes the constraints: Note: The phi’s are scalar quantities

5 It’s all about the phi’s Full multinomial has m(p+1) parameters Stereotype model has m-1 + m + p = 2m-1+p The phi parameters give a way to quantify ordinality of the outcome variable. If Then we have evidence of ordinal effect. Also allow tests of distinguishability of outcome categories

6 So what’s the condition? The multinomial and stereotype logistic regression models are implemented in Stata by mlogit and slogit Assume independence of observations, not true for matched case-control data For matched case control study, only independence of matched groups (strata, panels, clusters, etc) For 1:M matching, condition on stratum total for outcome variable and focus instead on conditional likelihood Do I have to? Why condition on this particular event?

7 Conditional vs. Unconditional Likelihood

8

9 CSTEREO cstereo command Basic syntax:. cstereo depvar indepvars [if] [in], group(varname) [options]

10 Example with Real Data: Preterm Birth and Vitamin D 1:2 (some 1:1) Pooled, Matched Case-Control Study of 2,583 Mothers in 870 matched groups A case defined as gestational age at delivery of <37 weeks outcome4=3 (<32 weeks), outcome4=2, (32-35 weeks), outcome4=1 (36 weeks) and outcome4=0 (control: 37+ weeks) Primary exposure variable of interest: Vitamin D levels, ohd25_total: blood serum concentration of (25)OHD in ng/ml Sample of other covariates measured: edu = 0/1 indicator of post-high school education vitamin = 0/1 indicator of vitamin use during pregnancy

11 Example Continued (nolog option):

12 Example Continued:

13 Interpretation of cstereo output: Estimated beta coefficient of ohd25_total = with 95% confidence interval ( , ) Odds ratio of being in <32 weeks gestational age compared to control is exp( ) = (0.965, 1.021) Now for odds ratios for the weeks and 36 week case categories, we need the products of the parameters: For standard errors, use Delta Method via nlcom

14 Interpretation continued: Exponentiating gives the odds ratio of being in the weeks case category compare to controls of with a 95% C.I. of (0.983, 1.004)

15 Constraints: Are the 36 week and weeks case categories distinguishable?

16 Constraint Output

17 Constraint Output The log-likelihood from the constrained model is compared to for the unconstrained stereotype model Difference of gives a chi2 value of on 1 degree of freedom P-value = 0.91 Unconstrained stereotype model does not fit significantly better than the constrained and the two case categories are indistinguishable

18 Relationship to Other Models for Ordered/Categorical Outcomes Constrained Multinomial Not as parsimonious as the proportional odds model (ologit) but not valid in outcome dependent sampling Adjacent category model is (basically) a constrained stereotype model. Also valid under outcome dependent sampling

19 Limitations Convergence Issues Currently only a one dimensional stereotype model Cannot currently force an ordering on the stereotype parameters Additional dependence structure

20 References: Ferre C, et al; Maternal 25-Hydroxyvitamin D Status and the Risk of Preterm Delivery: A Multi-Center Nested Case Control Study; preprint Mukherjee B, Liu I, Sinha S; Analysis of matched case- control data with multiple ordered disease states; Statistics in Medicine 2007 Ahn J et. al.; Missing Exposure Date in Stereotype Regression Model; Biometrics 2011 Andersen EB; Asymptotic Properties of Conditional Maximum- Likelihood Estimators; Journal of the Royal Statistical Society 1970 Liang KY, Stewart WF; Polychotomous Logistic Regression Methods for Matched Case-Control Studies with Multiple Case or Control Groups; American Journal of Epidemiology 1987 Scott AJ, Wild CJ; Fitting Regression Models to Case-Contro Data by Maximum Likelihood; Biometrika 1997 Anderson JA; Regression and Ordered Categorical Variable; Journal of the Royal Statistical Society 1984\ Greenland S; Alternative Models for Ordinal Logistic Regression; Statistics in Medicine 1994