Why use marginal model when I can use a multi-level model?

Slides:



Advertisements
Similar presentations
Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Advertisements

Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
CLINICAL QUESTION By: Resident Name. 25 y/o male w/ hx of ___, ____, and ____, who presented w/ _______ found to have ________ admitted for ______ and.
Lecture 11 (Chapter 9).
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Other Analysis of Variance Designs Chapter 15. Chapter Topics Basic Experimental Design Concepts  Defining Experimental Design  Controlling Nuisance.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Chapter 10 Simple Regression.

QUALITATIVE AND LIMITED DEPENDENT VARIABLE MODELS.
Journal Club Alcohol and Health: Current Evidence July-August 2006.
© 2003 Prentice-Hall, Inc.Chap 14-1 Basic Business Statistics (9 th Edition) Chapter 14 Introduction to Multiple Regression.
Clustered or Multilevel Data
FIN357 Li1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
Lecture 9: Marginal Logistic Regression Model and GEE (Chapter 8)
CHP400: Community Health Program - lI Research Methodology. Data analysis Hypothesis testing Statistical Inference test t-test and 22 Test of Significance.
1 Lecture 1 Introduction to Multi-level Models Course Website: All lecture materials extracted and.
Lecture 8: Generalized Linear Models for Longitudinal Data.
1 Module I: Statistical Background on Multi-level Models Francesca Dominici Scott L. Zeger Michael Griswold The Johns Hopkins University Bloomberg School.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
FIXED AND RANDOM EFFECTS IN HLM. Fixed effects produce constant impact on DV. Random effects produce variable impact on DV. F IXED VS RANDOM EFFECTS.
Statistical Analysis Of Population Prepared by, Sushruth Puttaswamy.
Randomized Assignment Difference-in-Differences
PO 141: INTRODUCTION TO PUBLIC POLICY Summer I (2015) Claire Leavitt Boston University.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Methods of Presenting and Interpreting Information Class 9.
Bootstrap and Model Validation
Statistics 200 Objectives:
Chapter 6: Sampling Distributions
Multiple Regression.
Effect Sizes.
Chapter 14 Introduction to Multiple Regression
STATISTICAL INFERENCE
Present: Disease Past: Exposure
Lezione di approfondimento su RDD (in inglese)
Statistical Core Didactic
Is “Dead” the same as “Missing”?
Applied Biostatistics: Lecture 2
Sec 9C – Logistic Regression and Propensity scores
Econometrics ITFD Week 8.
Randomized Block Design
Biostatistics Case Studies 2016
Relative Values.
The binomial applied: absolute and relative risks, chi-square
Chapter 5 Sampling Distributions
Elementary Statistics
BPK 304W Correlation.
Introduction to logistic regression a.k.a. Varbrul
Analyzing Intervention Studies
Chapter 18 Cross-Tabulated Counts
2 independent Groups Graziano & Raulin (1997).
I271B Quantitative Methods
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Multiple Regression.
Economics 20 - Prof. Anderson
Model Comparison: some basic concepts
Empirical Tools of Public Finance
I. Statistical Tests: Why do we use them? What do they involve?
What is Regression Analysis?
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
BY: Mohammed Hussien Feb 2019 A Seminar Presentation on Longitudinal data analysis Bahir Dar University School of Public Health Post Graduate Program.
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Day 2 Applications of Growth Curve Models June 28 & 29, 2018
Korelasi Parsial dan Pengontrolan Parsial Pertemuan 14
Chapter 7: Sampling Distributions
CCSSO National Conference on Student Assessment June 21, 2010
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Exercise 1 Use Transform  Compute variable to calculate weight lost by each person Calculate the overall mean weight lost Calculate the means and standard.
Risk Ratio A risk ratio, or relative risk, compares the risk of some health-related event such as disease or death in two groups. The two groups are typically.
Chapter 5: Sampling Distributions
Presentation transcript:

Why use marginal model when I can use a multi-level model? Public health problems: what is the impact of intervention/exposure on the population? Most translation into policy makes sense at the population level Clinicians may be more interested in subject specific or hospital unit level analyses What impact does a policy shift within the hospital have on patient outcomes or unit level outcomes?

Why use marginal model when I can use a multi-level model? Your study design may induce a correlation structure that you are not interested in Sampling individuals within neighborhoods or households Outcome: population mortality Marginal model allows you to adjust inferences for the correlation while focusing attention on the model for mortality Dose-response or growth-curve Here we are specifically interested in an individual trajectory And also having an estimate of how the individual trajectories vary across individuals is informative.

Additional Points: Marginal Model We focus attention on the population level associations in the data and we try to model these best we can (mean model) We acknowledge that there is correlation and adjust for this in our statistical inferences. These methods (GEE) are robust to misspecification of the correlation We are obtaining estimates of the target of interest and valid inferences even when we get the form of the correlation structure wrong.

Multi-level Models Suppose you have hospital level summaries of patient outcomes The fixed effect portion of your model suggests that these outcomes may differ by whether the hospital is teaching/non-teaching or urban/rural The hospital level random effect represents variability across hospitals in the summary measures of patient outcomes; this measure of variability may be of interest Additional interest lies in how large the hospital level variability is relative to a measure of total variability; what fraction of variability is attributable to hospital differences?

Additional considerations: Interpretations in the multi-level models can be tricky! Think about interpretation of gender in a random effects model: E(Y|gender,bi) = b0 + b1gender + bi Interpretation of b1: Among persons with similar unobserved latent effect bi, the difference in average Y if those same people had been males instead of females Imagine the counter-factual world….does it make sense?

Comparison of Estimates: Linear Model and Non-linear model A hypothetical cross-over trial N = 15 participants 2 periods treatment vs placebo Two outcomes of interest Continuous response: say alcohol consumption (Y) Binary response: say alcohol dependence (AD)

Linear model E(Y|Period,Treatment) = b0 + b1Period + b2Treatment Ordinary Least Squares GEE (Indep) GEE (Exchange) Random subject effect Intercept (b0) 15.2 (1.22) (1.16) (1.07) (1.13) Period (b1) 2.57 (1.38) (1.31) (1.01) (1.08) Treatment (b2) -0.43 SAME estimates . . . DIFFERENT standard errors . . .

Ordinary Least Squares Non-Linear model Log(Odds(AD|Period,Treatment)) = b0 + b1Period + b2Treatment Ordinary Least Squares GEE (Indep) GEE (Exchange) Random subject effect Intercept (b0) -1.14 (0.75) -1.11 (0.83) Period (b1) 0.79 0.76 (1.02) Treatment (b2) 1.82 1.80 (1.03) SAME estimates and standard errors Estimates and standard errors change (a little)

What happened in the GEE models? In non-linear models (binary, count, etc), the mean of the outcome is linked to the variance of outcome: X ~ Binomial, mean p, variance p(1-p) X ~ Poisson, mean λ, variance λ When we change the structure of the correlation/variance, we change the estimation of the mean too! The target of estimation is the same and our estimates are unbiased.

Why similarity between GEE and random effects here? No association in AD within person Little variability across persons Odds ratio of exposure across persons ~ 1 tab AD0 AD1 | 1 AD 0 AD | 0 1 | Total -----------+----------------------+---------- 0 | 1 7 | 8 1 | 5 2 | 7 Total | 6 9 | 15