1 Lecture 1 Introduction to Multi-level Models Course Website: All lecture materials extracted and.

Slides:



Advertisements
Similar presentations
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
Advertisements

Lecture 11 (Chapter 9).
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Lecture 6 (chapter 5) Revised on 2/22/2008. Parametric Models for Covariance Structure We consider the General Linear Model for correlated data, but assume.
Lecture 4 (Chapter 4). Linear Models for Correlated Data We aim to develop a general linear model framework for longitudinal data, in which the inference.
Repeated Measures, Part 3 May, 2009 Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, UCSF.
Analysis of Clustered and Longitudinal Data Module 3 Linear Mixed Models (LMMs) for Clustered Data – Two Level Part A 1 Biostat 512: Module 3A - Kathy.
2005 Hopkins Epi-Biostat Summer Institute1 Module I: Statistical Background on Multi-level Models Francesca Dominici Michael Griswold The Johns Hopkins.
Lecture 8 Relationships between Scale variables: Regression Analysis
Multiple Linear Regression Model

Econ Prof. Buckles1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 4. Further Issues.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Clustered or Multilevel Data
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 4. Further Issues.
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
Mixed models Various types of models and their relation
Foster Care Reunification: The use of hierarchical modeling to account for sibling and county correlation Emily Putnam-Hornstein, MSW Center for Social.
Logistic regression for binary response variables.
Analysis of Clustered and Longitudinal Data
Review of Lecture Two Linear Regression Normal Equation
9. Binary Dependent Variables 9.1 Homogeneous models –Logit, probit models –Inference –Tax preparers 9.2 Random effects models 9.3 Fixed effects models.
Regression and Correlation Methods Judy Zhong Ph.D.
Lecture 9: Marginal Logistic Regression Model and GEE (Chapter 8)
Simple Linear Regression
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Multilevel and Multi-method Designs Capturing the Effects of Organizational Context and Process in Implementation Research AHRQ 2007 Annual Conference:
Sampling and Nested Data in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
From GLM to HLM Working with Continuous Outcomes EPSY 5245 Michael C. Rodriguez.
T tests comparing two means t tests comparing two means.
Chapter 3: Generalized Linear Models 3.1 The Generalization 3.2 Logistic Regression Revisited 3.3 Poisson Regression 1.
Lecture 8: Generalized Linear Models for Longitudinal Data.
1 Module I: Statistical Background on Multi-level Models Francesca Dominici Scott L. Zeger Michael Griswold The Johns Hopkins University Bloomberg School.
 Is there a comparison? ◦ Are the groups really comparable?  Are the differences being reported real? ◦ Are they worth reporting? ◦ How much confidence.
Introduction Multilevel Analysis
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
HSRP 734: Advanced Statistical Methods June 19, 2008.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
1 GLM I: Introduction to Generalized Linear Models By Curtis Gary Dean Distinguished Professor of Actuarial Science Ball State University By Curtis Gary.
Term 4, 2006BIO656--Multilevel Models 1 PART 4 Non-linear models Logistic regression Other non-linear models Generalized Estimating Equations (GEE) Examples.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
The Choice Between Fixed and Random Effects Models: Some Considerations For Educational Research Clarke, Crawford, Steele and Vignoles and funding from.
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
Term 4, 2006BIO656--Multilevel Models 1 Part 2 Schematic of the alcohol model Marginal and conditional models Variance components Random Effects and Bayes.
Term 4, 2006BIO656--Multilevel Models Multi-Level Statistical Models If you did not receive the welcome from me, me at:
Introduction to logistic regression and Generalized Linear Models July 14, 2011 Introduction to Statistical Measurement and Modeling Karen Bandeen-Roche,
Lecture 10. Examples of count data –Number of panic attacks occurring during 6-month intervals after receiving treatment –Number of infant deaths per.
Data Analysis in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine October.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
BC Jung A Brief Introduction to Epidemiology - XIII (Critiquing the Research: Statistical Considerations) Betty C. Jung, RN, MPH, CHES.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Sampling and Nested Data in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
Logistic regression (when you have a binary response variable)
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
Week 7: General linear models Overview Questions from last week What are general linear models? Discussion of the 3 articles.
Instructor: R. Makoto 1richard makoto UZ Econ313 Lecture notes.
Methods of Presenting and Interpreting Information Class 9.
Chapter 14 Introduction to Multiple Regression
Applied Biostatistics: Lecture 2
Introduction to logistic regression a.k.a. Varbrul
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Why use marginal model when I can use a multi-level model?
From GLM to HLM Working with Continuous Outcomes
Presentation transcript:

1 Lecture 1 Introduction to Multi-level Models Course Website: All lecture materials extracted and further developed from the Multilevel Model course taught by Francesca Dominici:

2 Statistical Background on MLMs Main Ideas Accounting for Within-Cluster Associations Marginal & Conditional Models A Simple Example Key MLM components

3 The Main Idea…

4 Biological, psychological and social processes that influence health occur at many levels: –Cell –Organ –Person –Family –Neighborhood –City –Society An analysis of risk factors should consider: –Each of these levels –Their interactions Multi-level Models – Main Idea Health Outcome

5 Example: Alcohol Abuse 1.Cell: Neurochemistry 2.Organ: Ability to metabolize ethanol 3.Person: Genetic susceptibility to addiction 4.Family: Alcohol abuse in the home 5.Neighborhood: Availability of bars 6.Society: Regulations; organizations; social norms Level:

6 Example: Alcohol Abuse; Interactions between Levels 5Availability of bars and 6State laws about drunk driving 4Alcohol abuse in the family and 2Person’s ability to metabolize ethanol 3Genetic predisposition to addiction and 4Household environment 6State regulations about intoxication and 3Job requirements Level:

7 Notation: Population Neighborhood: i=1,…,I s State: s=1,…,S Family: j=1,…,J si Person: k=1,…,K sij Outcome: Y sijk Predictors: X sijk Person: sijk ( y 1223, x 1223 )

8 Notation (cont.)

9 Multi-level Models: Idea Predictor Variables Alcohol Abuse Response Person’s Income Family Income Percent poverty in neighborhood State support of the poor Level:

10 A Rose is a Rose is a… Multi-level model Random effects model Mixed model Random coefficient model Hierarchical model Meta-analysis (in some cases) Many names for similar models, analyses, and goals.

11 Digression on Statistical Models A statistical model is an approximation to reality There is not a “correct” model; –( forget the holy grail ) A model is a tool for asking a scientific question; –( screw-driver vs. sludge-hammer ) A useful model combines the data with prior information to address the question of interest. Many models are better than one.

12 Generalized Linear Models (GLMs) g(  ) =  0 +  1 *X 1 + … +  p *X p ModelResponse g(  )g(  ) DistributionCoef Interp Linear Continuous (ounces)  Gaussian Change in avg(Y) per unit change in X Logistic Binary (disease) logBinomialLog Odds Ratio Log- linear Count/Times to events log(  ) PoissonLog Relative Risk (  = E(Y|X) = mean )  (1-  )

13 Since: E(y|Age+1,Gender) =  0 +  1 (Age+1) +  2 Gender And: E(y|Age,Gender) =  0 +  1 Age +  2 Gender  E(y) =  1 Generalized Linear Models (GLMs) g(  ) =  0 +  1 *X 1 + … +  p *X p Gaussian – Linear :E(y) =  0 +  1 Age +  2 Gender Example: Age & Gender  1 = Change in Average Response per 1 unit increase in Age, Comparing people of the SAME GENDER. WHY?

14 Generalized Linear Models (GLMs) g(  ) =  0 +  1 *X 1 + … +  p *X p Binary – Logistic : log{odds(Y)} =  0 +  1 Age +  2 Gender Example: Age & Gender  1 = log-OR of “+ Response” for a 1 unit increase in Age, Comparing people of the SAME GENDER. WHY? Since: log{odds(y|Age+1,Gender)} =  0 +  1 (Age+1) +  2 Gender And: log{odds(y|Age,Gender)} =  0 +  1 Age +  2 Gender  log-Odds =  1 log-OR =  1

15 Generalized Linear Models (GLMs) g(  ) =  0 +  1 *X 1 + … +  p *X p Counts – Log-linear : log{E(Y)} =  0 +  1 Age +  2 Gender Example: Age & Gender  1 = log-RR for a 1 unit increase in Age, Comparing people of the SAME GENDER. WHY? Self-Check: Verify Tonight

16 D. Responses are independent B. All the key covariates are included in the model “Quiz”: Most Important Assumptions of Regression Analysis? A. Data follow normal distribution B. All the key covariates are included in the model C. Xs are fixed and known D. Responses are independent

17 Non-independent responses (Within-Cluster Correlation) Fact: two responses from the same family tend to be more like one another than two observations from different families Fact: two observations from the same neighborhood tend to be more like one another than two observations from different neighborhoods Why?

18 Why? (Family Wealth Example) GOD Grandparents Parents You Great-Grandparents You Parents Grandparents

19 Key Components of Multi-level Models Specification of predictor variables from multiple levels (Fixed Effects) –Variables to include –Key interactions Specification of correlation among responses from same clusters (Random Effects) Choices must be driven by scientific understanding, the research question and empirical evidence.

20 Correlated Data… (within-cluster associations)

21 Multi-level analyses Multi-level analyses of social/behavioral phenomena: an important idea Multi-level models involve predictors from multi-levels and their interactions They must account for associations among observations within clusters (levels) to make efficient and valid inferences.

22 Regression with Correlated Data Must take account of correlation to: Obtain valid inferences –standard errors –confidence intervals Make efficient inferences

23 Logistic Regression Example: Cross-over trial Response: 1-normal; 0- alcohol dependence Predictors: period (x 1 ); treatment group (x 2 ) Two observations per person (cluster) Parameter of interest: log odds ratio of alcohol dependence: placebo vs. treatment Mean Model: log{odds(AD)} =  0 +  1 Period +  2 Placebo

24 Results: estimate (standard error) Model Variable Ordinary Logistic Regression Account for correlation Intercept0.66 (0.32) 0.67 (0.29) Period-0.27 (0.38) (0.23) Placebo0.56 (0.38) 0.57 (0.23) Similar Estimates, WRONG Standard Errors (& Inferences) for OLR (  0 ) (  2 ) (  1 )

25 Simulated Data: Non-Clustered Cluster Number (Neighborhood) Alcohol Consumption (ml/day)

26 Simulated Data: Clustered Cluster Number (Neighborhood) Alcohol Consumption (ml/day)

27 Within-Cluster Correlation Correlation of two observations from same cluster = Non-Clustered = ( ) / 9.8 = 0 Clustered = ( ) / 9.8 = 0.67 Tot Var - Var Within Tot Var

28 Models for Clustered Data Models are tools for inference Choice of model determined by scientific question Scientific Target for inference? –Marginal mean: Average response across the population –Conditional mean: Given other responses in the cluster(s) Given unobserved random effects We will deal mainly with conditional models (but we’ll mention some important differences)

29 Marginal vs Conditional Models…

30 Marginal Models Focus is on the “mean model”: E(Y|X) Group comparisons are of main interest, i.e. neighborhoods with high alcohol use vs. neighborhoods with low alcohol use Within-cluster associations are accounted for to correct standard errors, but are not of main interest. log{ odds(AD) } =  0 +  1 Period +  2 Placebo

31 Marginal Model Interpretations log{ odds(AD) } =  0 +  1 Period +  2 Placebo = (-0.30)Period + (0.57)Placebo TRT Effect: (placebo vs. trt) OR = exp( 0.57 ) = 1.77, 95% CI (1.12, 2.80) Risk of Alcohol Dependence is almost twice as high on placebo, regardless of, (adjusting for), time period Since: log{odds(AD|Period, placebo)} =  0 +  1 Period +  2 And: log{odds(AD|Period, trt)} =  0 +  1 Period  log-Odds =  2 OR = exp(  2 ) WHY?

32 Random Effects Models Conditional on unobserved latent variables or “random effects” –Alcohol use within a family is related because family members share an unobserved “family effect”: common genes, diets, family culture and other unmeasured factors –Repeated observations within a neighborhood are correlated because neighbors share: common traditions, access to services, stress levels,… –log{ odds(AD) } = b i +  0 +  1 Period +  2 Placebo

33 Random Effects Model Interpretations Since: log{odds(AD i |Period, Placebo, b i ) )} =  0 +  1 Period +  2 + b i And: log{odds(AD i | Period, TRT, b i ) )} =  0 +  1 Period + b i  log-Odds =  2 OR = exp(  2 ) WHY? In order to make comparisons we must keep the subject-specific latent effect (b i ) the same. In a Cross-Over trial we have outcome data for each subject on both placebo & treatment In other study designs we may not.

34 Marginal vs. Random Effects Models For linear models, regression coefficients in random effects models and marginal models are identical: average of linear function = linear function of average For non-linear models, (logistic, log-linear,…) coefficients have different meanings/values, and address different questions -Marginal models -> population-average parameters -Random effects models -> cluster-specific parameters

35 Marginal -vs- Random Intercept Models; Cross-over Example Model VariableOrdinary Logistic Regression Marginal (GEE) Logistic Regression Random-Effect Logistic Regression Intercept0.66 (0.32) 0.67 (0.29) 2.2 (1.0) Period-0.27 (0.38) (0.23) (0.84) Placebo0.56 (0.38) 0.57 (0.23) 1.8 (0.93) Log OR (assoc.) (0.81) 5.0 (2.3)

36 Comparison of Marginal and Random Effect Logistic Regressions Regression coefficients in the random effects model are roughly 3.3 times as large –Marginal: population odds (prevalence with/prevalence without) of AD is exp(.57) = 1.8 greater for placebo than on active drug; population-average parameter –Random Effects: a person’s odds of AD is exp(1.8)= 6.0 times greater on placebo than on active drug; cluster-specific, here person-specific, parameter Which model is better? They ask different questions.

37 Refresher: Forests & Trees Multi-Level Models: –Explanatory variables from multiple levels i.e. person, family, n’bhd, state, … Interactions –Take account of correlation among responses from same clusters: i.e. observations on the same person, family,… Marginal: GEE, MMM Conditional: RE, GLMM Remainder of the course will focus on these.

38 Key Points “Multi-level” Models: –Have covariates from many levels and their interactions –Acknowledge correlation among observations from within a level (cluster) Random effect MLMs condition on unobserved “latent variables” to account for the correlation Assumptions about the latent variables determine the nature of the within cluster correlations Information can be borrowed across clusters (levels) to improve individual estimates

39 Examples of two-level data Studies of health services: assessment of quality of care are often obtained from patients that are clustered within hospitals. Patients are level 1 data and hospitals are level 2 data. In developmental toxicity studies: pregnant mice (dams) are assigned to increased doses of a chemical and examined for evidence of malformations (a binary response). Data collected in developmental toxicity studies are clustered. Observations on the fetuses (level 1 units) nested within dams/litters (level 2 data) The “level” signifies the position of a unit of observation within the hierarchy

40 Examples of three-level data Observations might be obtained in patients nested within clinics, that in turn, are nested within different regions of the country. Observations are obtained on children (level 1) nested within classrooms (level 2), nested within schools (level 3).

41 Why use marginal model when I can use a multi-level model? Public health problems: what is the impact of intervention/exposure on the population? –Most translation into policy makes sense at the population level Clinicians may be more interested in subject specific or hospital unit level analyses –What impact does a policy shift within the hospital have on patient outcomes or unit level outcomes?

42 Your study design may induce a correlation structure that you are not interested in –Sampling individuals within neighborhoods or households –Outcome: population mortality –Marginal model allows you to adjust inferences for the correlation while focusing attention on the model for mortality Dose-response or growth-curve –Here we are specifically interested in an individual trajectory –And also having an estimate of how the individual trajectories vary across individuals is informative. Why use marginal model when I can use a multi-level model?

43 Additional Points: Marginal Model We focus attention on the population level associations in the data and we try to model these best we can (mean model) We acknowledge that there is correlation and adjust for this in our statistical inferences. These methods (GEE) are robust to misspecification of the correlation We are obtaining estimates of the target of interest and valid inferences even when we get the form of the correlation structure wrong.

44 Multi-level Models Suppose you have hospital level summaries of patient outcomes –The fixed effect portion of your model suggests that these outcomes may differ by whether the hospital is teaching/non- teaching or urban/rural –The hospital level random effect represents variability across hospitals in the summary measures of patient outcomes; this measure of variability may be of interest –Additional interest lies in how large the hospital level variability is relative to a measure of total variability; what fraction of variability is attributable to hospital differences?

45 Additional considerations: Interpretations in the multi-level models can be tricky! Think about interpretation of gender in a random effects model: –E(Y|gender,bi) = b0 + b1gender + bi –Interpretation of b1: Among persons with similar unobserved latent effect bi, the difference in average Y if those same people had been males instead of females –Imagine the counter-factual world….does it make sense?

46 Comparison of Estimates: Linear Model and Non-linear model A hypothetical cross-over trial –N = 15 participants –2 periods –treatment vs placebo Two outcomes of interest –Continuous response: say alcohol consumption (Y) –Binary response: say alcohol dependence (AD)

47 Linear model E(Y|Period,Treatment) = b0 + b1Period + b2Treatment Ordinary Least Squares GEE (Indep) GEE (Exchange) Random subject effect Intercept (b0) 15.2 (1.22) 15.2 (1.16) 15.2 (1.07) 15.2 (1.13) Period (b1) 2.57 (1.38) 2.57 (1.31) 2.57 (1.01) 2.57 (1.08) Treatment (b2) (1.38) (1.31) (1.01) (1.08) SAME estimates... DIFFERENT standard errors...

48 Non-Linear model Log(Odds(AD|Period,Treatment)) = b0 + b1Period + b2Treatment Ordinary Logistic Regression GEE (Indep) GEE (Exchange) Random subject effect Intercept (b0) (0.75) (0.75) (0.83) (0.75) Period (b1) 0.79 (0.83) 0.79 (0.83) 0.76 (1.02) 0.79 (0.83) Treatment (b2) 1.82 (0.83) 1.82 (0.83) 1.80 (1.03) 1.82 (0.83) SAME estimates and standard errors Estimates and standard errors change (a little)

49 What happened in the GEE models? In non-linear models (binary, count, etc), the mean of the outcome is linked to the variance of outcome: –X ~ Binomial, mean p, variance p(1-p) –X ~ Poisson, mean λ, variance λ When we change the structure of the correlation/variance, we change the estimation of the mean too! The target of estimation is the same and our estimates are unbiased.

50 Why similarity between GEE and random effects here? No association in AD within person Little variability across persons Odds ratio of exposure across persons ~ 1 tab AD0 AD1 | 1 AD 0 AD | 0 1 | Total | 1 7 | 8 1 | 5 2 | Total | 6 9 | 15