1 Module I: Statistical Background on Multi-level Models Francesca Dominici Scott L. Zeger Michael Griswold The Johns Hopkins University Bloomberg School.

Slides:



Advertisements
Similar presentations
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
Advertisements

Econometric Modeling Through EViews and EXCEL
Lecture 11 (Chapter 9).
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Lecture 4 (Chapter 4). Linear Models for Correlated Data We aim to develop a general linear model framework for longitudinal data, in which the inference.
2005 Hopkins Epi-Biostat Summer Institute1 Module 2: Bayesian Hierarchical Models Francesca Dominici Michael Griswold The Johns Hopkins University Bloomberg.
2005 Hopkins Epi-Biostat Summer Institute1 Module I: Statistical Background on Multi-level Models Francesca Dominici Michael Griswold The Johns Hopkins.
Multiple Linear Regression Model
Chapter 10 Simple Regression.

QUALITATIVE AND LIMITED DEPENDENT VARIABLE MODELS.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Clustered or Multilevel Data
Mixed models Various types of models and their relation
Chapter 11 Multiple Regression.
Logistic regression for binary response variables.
Analysis of Clustered and Longitudinal Data
Review of Lecture Two Linear Regression Normal Equation
GEE and Generalized Linear Mixed Models
9. Binary Dependent Variables 9.1 Homogeneous models –Logit, probit models –Inference –Tax preparers 9.2 Random effects models 9.3 Fixed effects models.
3. Multiple Regression Analysis: Estimation -Although bivariate linear regressions are sometimes useful, they are often unrealistic -SLR.4, that all factors.
Lecture 9: Marginal Logistic Regression Model and GEE (Chapter 8)
Inference for regression - Simple linear regression
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 25 Categorical Explanatory Variables.
Simple Linear Regression
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
T tests comparing two means t tests comparing two means.
Estimation of Statistical Parameters
1 Lecture 1 Introduction to Multi-level Models Course Website: All lecture materials extracted and.
Lecture 8: Generalized Linear Models for Longitudinal Data.
Introduction Multilevel Analysis
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Factorial Survey Methods: and the use of HLM, HOLIT, HULIT, and HLIT Models R. L. Brown, Ph.D. University of Wisconsin-Madison
HSRP 734: Advanced Statistical Methods June 19, 2008.
Estimating a Population Proportion
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
1 GLM I: Introduction to Generalized Linear Models By Curtis Gary Dean Distinguished Professor of Actuarial Science Ball State University By Curtis Gary.
Term 4, 2006BIO656--Multilevel Models 1 PART 4 Non-linear models Logistic regression Other non-linear models Generalized Estimating Equations (GEE) Examples.
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
Term 4, 2006BIO656--Multilevel Models 1 Part 2 Schematic of the alcohol model Marginal and conditional models Variance components Random Effects and Bayes.
Term 4, 2006BIO656--Multilevel Models Multi-Level Statistical Models If you did not receive the welcome from me, me at:
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Introduction to logistic regression and Generalized Linear Models July 14, 2011 Introduction to Statistical Measurement and Modeling Karen Bandeen-Roche,
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Data Analysis in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine October.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
1 STA 617 – Chp12 Generalized Linear Mixed Models SAS for Model (12.3) with Matched Pairs from Table 12.1.
Statistical inference Statistical inference Its application for health science research Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics.
Logistic regression (when you have a binary response variable)
Logistic Regression and Odds Ratios Psych DeShon.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Instructor: R. Makoto 1richard makoto UZ Econ313 Lecture notes.
Methods of Presenting and Interpreting Information Class 9.
Multiple Regression.
Module 2: Bayesian Hierarchical Models
Introduction to logistic regression a.k.a. Varbrul
CONCEPTS OF ESTIMATION
Multiple Regression.
Why use marginal model when I can use a multi-level model?
From GLM to HLM Working with Continuous Outcomes
Simple Linear Regression
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

1 Module I: Statistical Background on Multi-level Models Francesca Dominici Scott L. Zeger Michael Griswold The Johns Hopkins University Bloomberg School of Public Health

2 Statistical Background on Multi-level Models Multi-level models -Main ideas -Conditional -Marginal -Contrasting Examples

3 A Rose is a Rose is a… Multi-level model Random effects model Mixed model Random coefficient model Hierarchical model

4 Biological, psychological and social processes that influence health occur at many levels: –Cell –Organ –Person –Family –Neighborhood –City –Society An analysis of risk factors should consider: –Each of these levels –Their interactions Multi-level Models – Main Idea Health Outcome

5 Example: Alcohol Abuse 1.Cell: Neurochemistry 2.Organ: Ability to metabolize ethanol 3.Person: Genetic susceptibility to addiction 4.Family: Alcohol abuse in the home 5.Neighborhood: Availability of bars 6.Society: Regulations; organizations; social norms Level:

6 Example: Alcohol Abuse; Interactions among Levels 5Availability of bars and 6State laws about drunk driving 4Alcohol abuse in the family and 2Person’s ability to metabolize ethanol 3Genetic predisposition to addiction and 4Household environment 6State regulations about intoxication and 3Job requirements Level:

7 Notation: Population Neighborhood: i=1,…,I s State: s=1,…,S Family: j=1,…,J si Person: k=1,…,K sij Outcome: Y sijk Predictors: X sijk Person: sijk ( y 1223, x 1223 )

8 Notation (cont.)

9 Multi-level Models: Idea Predictor Variables Alcohol Abuse Response Person’s Income Family Income Percent poverty in neighborhood State support of the poor s sij si sijk Level: X.n X.s X.f X.p Y sijk

10 Digression on Statistical Models A statistical model is an approximation to reality There is not a “correct” model; –( forget the holy grail ) A model is a tool for asking a scientific question; –( screw-driver vs. sludge-hammer ) A useful model combines the data with prior information to address the question of interest. Many models are better than one.

11 Generalized Linear Models (GLMs) g(  ) =  0 +  1 *X 1 + … +  p *X p ModelResponse g(  )g(  ) DistributionCoef Interp Linear Continuous (ounces)  Gaussian Change in avg(Y) per unit change in X Logistic Binary (disease) logBinomialLog Odds Ratio Log- linear Count/Times to events log(  ) PoissonLog Relative Risk where:  = E(Y|X) = mean  (1-  )

12 Since: E(y|Age+1,Gender) =  0 +  1 (Age+1) +  2 Gender And: E(y|Age,Gender) =  0 +  1 Age +  2 Gender  E(y) =  1 Generalized Linear Models (GLMs) g(  ) =  0 +  1 *X 1 + … +  p *X p Gaussian – Linear :E(y) =  0 +  1 Age +  2 Gender Example: Age & Gender  1 = Change in Average Response per 1 unit increase in Age, Comparing people of the SAME GENDER. WHY?

13 Generalized Linear Models (GLMs) g(  ) =  0 +  1 *X 1 + … +  p *X p Binary – Logistic : log{odds(Y)} =  0 +  1 Age +  2 Gender Example: Age & Gender  1 = log-OR of “+ Response” for a 1 unit increase in Age, Comparing people of the SAME GENDER. WHY? Since: log{odds(y|Age+1,Gender)} =  0 +  1 (Age+1) +  2 Gender And: log{odds(y|Age,Gender)} =  0 +  1 Age +  2 Gender  log-Odds =  1 log-OR =  1

14 Generalized Linear Models (GLMs) g(  ) =  0 +  1 *X 1 + … +  p *X p Counts – Log-linear : log{E(Y)} =  0 +  1 Age +  2 Gender Example: Age & Gender  1 = log-RR for a 1 unit increase in Age, Comparing people of the SAME GENDER. WHY? Verify for Yourself Tonight

15 D. Responses are independent B. All the key covariates are included in the model Most Important Assumptions of Regression Analysis? A. Data follow normal distribution B. All the key covariates are included in the model C. Xs are fixed and known D. Responses are independent

16 Within-Cluster Correlation Fact: two responses from the same family tend to be more like one another than two observations from different families Fact: two observations from the same neighborhood tend to be more like one another than two observations from different neighborhoods Why?

17 Why? (Family Wealth Example) GOD Grandparents Parents You Great-Grandparents You Parents Grandparents

18 Multi-level Models: Idea Predictor Variables Alcohol Abuse Response Person’s Income Family Income Percent poverty in neighborhood State support of the poor s sij si sijk Level: X.n X.s X.f X.p Y sijk Bars a.n si Drunk Driving Laws a.s s Unobserved random intercepts Genes a.f sij

19 Key Components of Multi-level Model Specification of predictor variables from multiple levels –Variables to include –Key interactions Specification of correlation among responses from same clusters Choices must be driven by the scientific question

20 Multi-level Shmulti-level Multi-level analysis of social/behavioral phenomena: an important idea Multi-level models involve predictors from multi-levels and their interactions They must account for correlation among observations within clusters (levels) to make efficient and valid inferences.

21 Key Idea for Regression with Correlated Data Must take account of correlation to: Obtain valid inferences –standard errors –confidence intervals –posteriors Make efficient inferences

22 Logistic Regression Example: Cross-over trial Ordinary logistic regression: Response: 1-normal; 0- alcohol dependence Predictors: period (x 1 ); treatment group (x 2 ) Two observations per person Parameter of interest: log odds ratio of dependence: treatment vs placebo Mean Model: log{odds(AD)} =  0 +  1 Period +  2 Trt

23 Results: estimate, (standard error) Model Variable Ordinary Logistic Regression Account for correlation Intercept 0.66 (0.32) 0.67 (0.29) Period (0.38) (0.23) Treatment 0.56 (0.38) 0.57 (0.23) Similar estimates, WRONG Standard Errors (& Inferences) for OLR (  0 ) (  2 ) (  1 )

24 Variance of Least Squares and ML Estimators of Slope –vs- First Lag Correlation Variance reported by LS True variance of LS Variance of mle Source: DHLZ 2002 (pg 19)

25 Simulated Data: Non-Clustered Cluster Number (Neighborhood) Alcohol Consumption (ml/day)

26 Simulated Data: Clustered Cluster Number (Neighborhood) Alcohol Consumption (ml/day)

27 Within-Cluster Correlation Correlation of two observations from same cluster = Non-Clustered = ( ) / 9.8 = 0 Clustered = ( ) / 9.8 = 0.67 Total Var – Within Var Total Var

28 Models for Clustered Data Models are tools for inference Choice of model determined by scientific question Scientific Target for inference? –Marginal mean: Average response across the population –Conditional mean: Given other responses in the cluster(s) Given unobserved random effects

29 Marginal Models Target – marginal mean or population-average response for different values of predictor variables Compare Groups Examples: –Mean alcohol consumption for Males vs Females –Rates of alcohol abuse for states with active addiction treatment programs vs inactive states Public health (a.k.a. population) questions ex. mean model: E(AlcDep) =  0 +  1 Gender

30 Marginal GLMS for Multi-level Data: Generalized Estimating Equations (GEE) Mean Model: (Ordinary GLM - linear, logistic,..) –Population-average parameters –e.g. log{ odds(AlcDep ij ) } =  0 +  1 Gender ij Solving GEE (DHLZ, 2002) gives nearly efficient and valid inferences about population-average parameters subject i in cluster j Association Model: (for observations in clusters) –e.g. log{ Odds Ratio(Y ij,Y kj ) } =  0 two different subjects (i & k) in cluster j

31 OLR vs GEE Cross-over Example Model VariableOrdinary Logistic Regression GEE Logistic Regression Intercept0.66 (0.32) 0.67 (0.29) Period-0.27 (0.38) (0.23) Treatment0.56 (0.38) 0.57 (0.23) log( OR ) (association) (0.81)

32 Marginal Model Interpretations log{ odds(AlcDep) } =  0 +  1 Period +  2 trt = (-0.30)Period + (0.57)trt TRT Effect: (placebo vs. trt) OR = exp( 0.57 ) = 1.77, 95% CI (1.12, 2.80) Risk of Alcohol Dependence is almost twice as high on placebo, regardless of, (adjusting for), time period Since: log{odds(AlcDep|Period, pl)} =  0 +  1 Period +  2 And: log{odds(AlcDep|Period, trt)} =  0 +  1 Period  log-Odds =  2 OR = exp(  2 ) WHY?

33 Conditional Models Conditional on other observations in cluster –Probability a person abuses alcohol as a function of the number of family members that do –A person’s average alcohol consumption as a function on the average in the neighborhood Use other responses from the cluster as predictors in regressions like additional covariates ex: E(AlcDep ij ) =  0 +  1 Gender ij +  2 AlcDep j

34 Conditional on Other Responses: - Usually a Bad Idea - Definition of “other responses in cluster” depends on size/nature of cluster –e.g. “number of other family members who do” 0 for a single person means something different that 0 in a family with 10 others The “risk factors” may affect the entire cluster; conditioning on the responses for the others will dilute the risk factor effect –Two eyes example ex: log{odds(Blind i,Left )} =  0 +  1 Sun +  2 Blind i,Right  0

35 Conditional Models Conditional on unobserved latent variables or “random effects” –Alcohol use within a family is related because family members share an unobserved “family effect”: common genes, diets, family culture and other unmeasured factors –Repeated observations within a neighborhood are correlated because neighbors share: common traditions, access to services, stress levels,…

36 Random Effects Models Latent (random) effects are unobserved –inferred from the correlation among residuals Random effects models describe the marginal mean and the source of correlation in one equation Assumptions about the latent variables determine the nature of the associations –ex: Random Intercept = Uniform Correlation ex: E(AlcDep ij | b j ) =  0 +  1 Gender ij + b j where: b j ~ N(0,  2 ) Cluster specific random effect

37 OLR vs R.E. Cross-over Example Model VariableOrdinary Logistic Regression Random Int. Logistic Regression Intercept0.66 (0.32) 2.2 (1.0) Period-0.27 (0.38) (0.84) Treatment0.56 (0.38) 1.8 (0.93) log(  ) (association) (2.3)

38 Conditional Model Interpretations log{ odds(AlcDep i | b i ) } =  0 +  1 Period +  2 trt + b i = (-1.0)Period + (1.8)trt + b i where: b i ~ N(0,5 2 ) TRT Effect: (placebo vs. trt) OR = exp( 1.8 ) = 6.05, 95% CI (0.94, 38.9) A Specific Subject’s Risk of Alcohol Dependence is 6 TIMES higher on placebo, regardless of, (adjusting for), time period i th subject’s latent propensity for Alcohol Dependence

39 Conditional Model Interpretations Since: log{odds(AlcDep i |Period, pl, b i ) )} =  0 +  1 Period +  2 + b i And: log{odds(AlcDep|Period, trt, b i ) )} =  0 +  1 Period + b i  log-Odds =  2 OR = exp(  2 ) WHY? In order to make comparisons we must keep the subject-specific latent effect (b i ) the same. In a Cross-Over trial we have outcome data for each subject on both placebo & treatment What about in a usual clinical trial / cohort study?

40 Marginal vs. Random Effects Models For linear models, regression coefficients in random effects models and marginal models are identical: average of linear function = linear function of average For non-linear models, (logistic, log-linear,…) coefficients have different meanings/values, and address different questions -Marginal models -> population-average parameters -Random effects models -> cluster-specific parameters

41 Marginal –vs- Random Intercept Model log{odds(Y i ) } =  0 +  1 *Gender VS. log{odds(Y i | u i ) } =  0 +  1 *Gender + u i cluster specific comparisons population prevalences Female Male Female Male Source: DHLZ 2002 (pg 135)

42 Marginal -vs- Random Intercept Models; Cross-over Example Model VariableOrdinary Logistic Regression Marginal (GEE) Logistic Regression Random- Effect Logistic Regression Intercept0.66 (0.32) 0.67 (0.29) 2.2 (1.0) Period-0.27 (0.38) (0.23) (0.84) Treatment0.56 (0.38) 0.57 (0.23) 1.8 (0.93) Log OR (assoc.) (0.81) 5.0 (2.3)

43 Comparison of Marginal and Random Effect Logistic Regressions Regression coefficients in the random effects model are roughly 3.3 times as large –Marginal: population odds (prevalence with/prevalence without) of AlcDep is exp(.57) = 1.8 greater for placebo than on active drug; population-average parameter –Random Effects: a person’s odds of AlcDep is exp(1.8)= 6.0 times greater on placebo than on active drug; cluster-specific, here person-specific, parameter Which model is better? They ask different questions.

44 Marginalized Multi-level Models Heagerty (1999, Biometrics); Heagerty and Zeger (2000, Statistical Science) Model: –marginal mean as a function of covariates –conditional mean given random effects as a function of marginal mean and cluster-specific random effects Random Effects allow flexible association models, but public health is usually concerned with population-averaged (marginal) questions.  MMM

45 Schematic of Marginal Random- effects Model

46 Model VariableOrdinary Logistic Regression GEE Logistic Regression MMM Logistic Regression Random Int. Logistic Regression Intercept0.66 (0.32) 0.67 (0.29) 0.65 (0.28) 2.2 (1.0) Period-0.27 (0.38) (0.23) (0.22) (0.84) Treatment0.56 (0.38) 0.57 (0.23) 0.58 (0.23) 1.8 (0.93) log(OR) (assoc.) (0.81) 5.44 (3.72) 5.0 (2.3) Marginal and Random Intercept Models Cross-over Example

47 Refresher: Forests & Trees Multi-Level Models: –Explanatory variables from multiple levels Family Neighborhood State –Interactions Must take account of correlation among responses from same clusters: –Marginal: GEE, MMM –Conditional: RE, GLMM

48 Illustration of Conditional Models and Marginal Multi-level Models; The British Social Attitudes Survey Binary Response: Y ijk = Levels (notation) –Year: k=1,…,4 ( ) –Subject: j=1,…,264 –District: i=1,…54 –Overall Sample: N = 1,056 Levels (conception) –1: time within person –2: persons within districts –3: districts 1 if favor abortion 0 if not

49 Covariates at Three Levels Level 1: time –Indicators of time Level 2: person –Class: upper working; lower working –Gender –Religion: protestant, catholic, other Level 3: district –Percentage protestant (derived)

50 Scientific Questions How does a person’s religion influence her probability of favoring abortion? How does the predominant religion in a person’s district influence her probability of favoring abortion? How does the rate of favoring abortion differ between protestants and otherwise similar catholics? How does the rate of favoring abortion differ between districts that are predominantly protestant versus other religions? Conditional model Marginal model

51 Conditional Multi-level Model Person and district random effects 1.Time: k 2.Person: j 3.District: i Levels:

52 Conditional Multi-level Model Results 3 2 1

53 Conditional Scientific Answers How does a person’s religion influence her probability of favoring abortion? How does the predominant religion in a person’s district influence her probability of favoring abortion? But Wait!…

54 Conditional Model Interpretations: Model 4 WHY? log{odds(Fav|Catholic,X,b 2,ij,b 3,ij ) )} =  0 +X  +  8 + b 2,0 + b C + b 3,0 log{odds(Fav|Protestant,X,b 2,ij,b 3,ij ) )} =  0 +X  + b 2,0 + b 3,0 OR  exp(  8 ) OR = exp(  8 + b C )  log-Odds =  8 + b C

55 Conditional Model Interpretations: Model 4 What happens if you simply report exp(  )?? log{odds(Fav|Catholic,X,b 2,ij,b 3,ij ) )} =  0 +X  +  8 + b 2,0 + b C + b 3,0 log{odds(Fav|Prot/Cath,X,b 2,ij,b 3,ij ) )} =  0 +X  + b 2,0 + b C + b 3,0 OR = exp(  8 )  log-Odds =  8 But there were NO subjects in the study who were simultaneously BOTH Catholic AND Protestant ( Similar for % protestant! )

56 Marginal Multi-level Model 1.Time: k 2.Person: j 3.District: i Levels: Mean Model: Person and district random effects Association Model: (Separate)

57 Marginal Multi-level Model Results 3 2 1

58 Marginal Scientific Answers How does the rate of favoring abortion differ between protestants and otherwise similar catholics? How does the rate of favoring abortion differ between districts that are predominantly protestant versus other religions?

59 Key Points “Multi-level” Models: –Have covariates from many levels and their interactions –Acknowledge correlation among observations from within a level (cluster) Conditional and Marginal Multi-level models have different targets; ask different questions When population-averaged parameters are the focus, use –GEE –Marginal Multi-level Models (Heagerty and Zeger, 2000)

60 Key Points (continued) When cluster-specific parameters are the focus, use random effects models that condition on unobserved latent variables that are assumed to be the source of correlation Warning: Model Carefully. Cluster-specific targets often involve extrapolations where there are no actual data for support –e.g. % protestant in neighborhood given a random neighborhood effect