Presentation is loading. Please wait.

Presentation is loading. Please wait.

Term 4, 2006BIO656--Multilevel Models1 140.656 Multi-Level Statistical Models If you did not receive the welcome from me, me at:

Similar presentations


Presentation on theme: "Term 4, 2006BIO656--Multilevel Models1 140.656 Multi-Level Statistical Models If you did not receive the welcome from me, me at:"— Presentation transcript:

1 Term 4, 2006BIO656--Multilevel Models1 140.656 Multi-Level Statistical Models If you did not receive the welcome email from me, email me at: (tlouis@jhsph.edu)tlouis@jhsph.edu

2 Term 4, 2006BIO656--Multilevel Models2 ROOM CHANGE, AGAIN! Starting Thursday, March 30 th and henceforth, lectures will be in W2030 Labs will still be in W2009

3 Term 4, 2006BIO656--Multilevel Models3

4 Term 4, 2006BIO656--Multilevel Models4 Prerequisites, resources and Grading

5 Term 4, 2006BIO656--Multilevel Models5 Learning Objectives

6 Term 4, 2006BIO656--Multilevel Models6 Content & Approach

7 Term 4, 2006BIO656--Multilevel Models7 Approach Lectures include basic illustrations and case studies, structuring an approach and interpreting results –Labs address computing and amplify on the foregoing My approach is formal, but not “mathematical” To understand MLMs, you need a very good understanding on single-level models –If you understand these, you are ready to multi-level!

8 Term 4, 2006BIO656--Multilevel Models8 Structure

9 Term 4, 2006BIO656--Multilevel Models9 RULES FOR HOMEWORK, MID-TERM AND PROJECT Homework Must be individually prepared, but you can get help Homework due dates should be honored. Turn in hard copy for grading The in-class, midterm Must be prepared absolutely independently During the exam, no advice or information can be obtained from others You can use your notes and reference materials The term project Must be individually prepared, but you can get help Must be electronically submitted

10 Term 4, 2006BIO656--Multilevel Models10 Handouts and the Web Virtually all course materials will be on the web Check frequently for updates I’ve provided hard copy of the general information sheet However, other lectures will be on the web in powerpoint format and won’t be handed out Download to your computer so you have an electronic version each part Print if you need hard copy, but do it 4 or 6 to a page to save paper More generally, try to “go electronic” printing sparingly

11 Term 4, 2006BIO656--Multilevel Models11 COMPUTING & DATA We will support WinBUGS, Stata We provide partial support for SAS, which should be used only by current SAS users; we aren’t teaching it from scratch Some homeworks require use of WinBUGS and another “traditional” program (STATA, SAS, R,...) We provide datasets, including some in the WinBUGS examples

12 Term 4, 2006BIO656--Multilevel Models12 WHY BUGS? Freeware! In MLMs, it’s important to see distributions – e.g., Skewness of sampling distribution of variance component estimates It’s important to incorporate all uncertainties in estimating random effects Note that WinBugs isn’t very data input friendly And, it’s difficult to produce P-values  

13 Term 4, 2006BIO656--Multilevel Models13 STATISTICAL MODELS A statistical model is an approximation Almost never is there a “correct” or “best” model, no holy grail A model is a tool for structuring a statistical approach and addressing a scientific question An effective model combines the data with prior information to address a question

14 Term 4, 2006BIO656--Multilevel Models14 MULTI-LEVEL MODELS Biological, physical, psycho/social processes that influence health occur at many levels: –Cell  Organ  Person  Family  Nhbd   City  Society ...  Solar system –Crew  Vessel  Fleet ... –Block  Block Group  Tract ... –Visit  Patient  Phy  Clinic  HMO ... Covariates can be at each level Many “units of analysis” More modern and flexible parlance and approach: “many variance components”

15 Term 4, 2006BIO656--Multilevel Models15 Example: Alcohol Abuse Cell: neurochemistry Organ: ability to metabolize ethanol Person: genetic susceptibility to addiction Family: alcohol abuse in the home Neighborhood: availability of bars Society: regulations; organizations; social norms

16 Term 4, 2006BIO656--Multilevel Models16 ALCOHOL ABUSE: ALCOHOL ABUSE: A multi-level, interaction model Interaction between existence of bars & state, drunk driving laws Alcohol abuse in a family & ability to metabolize ethanol Genetic predisposition to addiction & household environment State regulations about intoxication & job requirements

17 Term 4, 2006BIO656--Multilevel Models17 Many names for similar, but not identical models, analyses and goals Multi-Level Models Random effects models Mixed models Random coefficient models Hierarchical models Bayesian Models

18 Term 4, 2006BIO656--Multilevel Models18 We don’t need MLMs If your question is about slopes on regressors, you can run a standard regression and (usually) get valid slope estimates Y =  0 +  1 (areal monitor) +  2 (home monitor) +... Y =  0 +  1 (zipcode income) +  2 (personal income) +... logit(P) =...... Analysis can be followed by computing a “robust” SE to get valid inferences

19 Term 4, 2006BIO656--Multilevel Models19 We do need MLMs If your question is about variance components, you need to build the multi-level model Y ijkl =  0 +  1 X 1 +  2 X 2 +... +  ijkl Var(Y ijkl ) = Var(  ijkl ) = = V Hospital + V Clinic + V Physician + V Patient + V unexplained These variances depend on what Xs are in the model

20 Term 4, 2006BIO656--Multilevel Models20 We do need MLMs To create a broad class of correlation structures –Longitudinal correlations –Nested correlations To structure improving unit-level estimates (latent effects) and to make unit-level predictions

21 Term 4, 2006BIO656--Multilevel Models21 MLMs are effective in producing “working models” that incorporate stochastic realities Producing efficient population estimates Broadening the inference beyond “these units” Protecting against some types of informative missing data processes Producing correlation structures Generating “overdispersed” versions of standard models Structuring estimation of latent effects But, MLMs can be fragile and care is needed

22 Term 4, 2006BIO656--Multilevel Models22 MLMs are not and should not be A religion A truth The only way to model multi-level data!

23 Term 4, 2006BIO656--Multilevel Models23 Improving individual-level estimates Improving individual-level estimates Similar to the BUGS rat data Dependent variable (Y ij ) is weight for rat “i” at age X ij i = 1,..., I (=10); j = 1,..., J (=5) X ij = X j = (-14, -7, 0, 7, 14) = (8-22, 15-22, 22-22, 29-22 36-22) Y ij = b i0 + b i1 X j +  ij –As usual, the intercept depends on the centering Analyses  –Each rat has its own line  –All rats follow the same line: b i0 =  0, b i1 =  1 –A compromise between these two

24 Term 4, 2006BIO656--Multilevel Models24 Each rat has its own (LSE, MLE) line Each rat has its own (LSE, MLE) line (with the population line) Pop line

25 Term 4, 2006BIO656--Multilevel Models25 A multi-level model: A multi-level model: Each rat has its own line, but the lines come from the same distribution The b i0 are independent Normal(  0,  0 2 ) The b i1 are independent N(  1,  1 2 ) Overdispersion Sample variance of the OLS estimated intercepts: 345 = SE int 2 +  0 2 = 320 +  0 2   0 2 = 25,  0 = 5 Sample variance of the OLS estimated slopes 4.25 = SE slope 2 +  1 2 = 3.25 +  1 2   1 2 = 1.00,  1 = 1.00

26 Term 4, 2006BIO656--Multilevel Models26 A compromise: each rat has its own line, but the lines come from the same distribution Pop line

27 Term 4, 2006BIO656--Multilevel Models27 ONE-WAY RANDOM EFFECTS ANOVA

28 Term 4, 2006BIO656--Multilevel Models28 Simulated “Neighborhood Clustering” Random mean for each of 10 neighborhoods (J=10) b 1, b 2,..., b 10 (iid) N(10, 9) Random deviation from neighborhood mean for each of 10 persons in each neighborhood (n=10) Y ij = b j + e ij, e ij (iid) N(0, 4) Conditional Independence  Over-dispersion: Variance of each point is 13 (= 4 + 9) Correlation: Measurements within each cluster are correlated

29 Term 4, 2006BIO656--Multilevel Models29

30 Term 4, 2006BIO656--Multilevel Models30 Intra-class Correlation (ICC) Correlation of two observations in the same cluster: ICC = Var(Between)/ Var(Total) = 1 – Var(Within)/Var(Total) Estimated ICC: 0.67 = (9.8-3.2)/9.8 True ICC: 0.69 = 9/(9 + 4) = 9/13

31 Term 4, 2006BIO656--Multilevel Models31 V(b)

32 Term 4, 2006BIO656--Multilevel Models32

33 Term 4, 2006BIO656--Multilevel Models33

34 Term 4, 2006BIO656--Multilevel Models34

35 Term 4, 2006BIO656--Multilevel Models35

36 Term 4, 2006BIO656--Multilevel Models36

37 Term 4, 2006BIO656--Multilevel Models37 regression line Pop line 45 o line

38 Term 4, 2006BIO656--Multilevel Models38

39 Term 4, 2006BIO656--Multilevel Models39

40 Term 4, 2006BIO656--Multilevel Models40

41 Term 4, 2006BIO656--Multilevel Models41

42 Term 4, 2006BIO656--Multilevel Models42

43 Term 4, 2006BIO656--Multilevel Models43

44 Term 4, 2006BIO656--Multilevel Models44 WEIGHTED MEANS

45 Term 4, 2006BIO656--Multilevel Models45

46 Term 4, 2006BIO656--Multilevel Models46

47 Term 4, 2006BIO656--Multilevel Models47

48 Term 4, 2006BIO656--Multilevel Models48

49 Term 4, 2006BIO656--Multilevel Models49

50 Term 4, 2006BIO656--Multilevel Models50

51 Term 4, 2006BIO656--Multilevel Models51

52 Term 4, 2006BIO656--Multilevel Models52 INFERENCE SPACE (Sanders) The choice between fixed and random effects depends in part on the reference population (the inference space) –These studies or people – Studies or people like these –.........

53 Term 4, 2006BIO656--Multilevel Models53 Random Effects should replace “unit of analysis” Models contain Fixed-effects, Random effects (via Variance Components) and other correlation- inducers There are many “units” and so in effect no single set of units Random Effects induce unexplained (co)variance Some of the unexplained may be explicable by including additional covariates MLMs are one way to induce a structure and estimate the REs

54 Term 4, 2006BIO656--Multilevel Models54 PLEASE DO THIS If you did not receive the welcome email from me, email me at: (tlouis@jhsph.edu)tlouis@jhsph.edu

55 Term 4, 2006BIO656--Multilevel Models55 ROOM CHANGE, AGAIN! Starting Thursday, March 30 th and henceforth, lectures will be in W2030 Labs will still be in W2009

56 Term 4, 2006BIO656--Multilevel Models56 END OF PART I


Download ppt "Term 4, 2006BIO656--Multilevel Models1 140.656 Multi-Level Statistical Models If you did not receive the welcome from me, me at:"

Similar presentations


Ads by Google