Advanced Methods and Models in Behavioral Research – 2009/2010 AMMBR course design CONTENT METHOD Y is 0/1 conjoint analysis logistic regression multi-level.

Slides:



Advertisements
Similar presentations
Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Advertisements

Multilevel modelling short course
ASSUMPTION CHECKING In regression analysis with Stata
AMMBR - final stuff xtmixed (and xtreg) (checking for normality, random slopes)
Kin 304 Regression Linear Regression Least Sum of Squares
AMMBR from xtreg to xtmixed (+checking for normality, random slopes)
Toolkit + “show your skills” AMMBR from xtreg to xtmixed (+checking for normality, and random slopes, and cross-classified models, and then we are almost.
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Xtreg and xtmixed: recap We have the standard regression model (here with only one x): but think that the data are clustered, and that the intercept (c.
Advanced Methods and Models in Behavioral Research – 2014 Been there / done that: Stata Logistic regression (……) Conjoint analysis Coming up: Multi-level.
AMMBR from xtreg to xtmixed (+checking for normality, and random slopes, and cross-classified models, and then we are done in terms of theory)
Graphs in HLM. Model setup, Run the analysis before graphing Sector = 0 public school Sector = 1 private school.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
School of Veterinary Medicine and Science Multilevel modelling Chris Hudson.
Chapter 4 Multiple Regression.
Clustered or Multilevel Data
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Correlation and Regression Analysis
Finding help. Stata manuals You have all these as pdf! Check the folder /Stata12/docs.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
Basic Analysis of Variance and the General Linear Model Psy 420 Andrew Ainsworth.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
3nd meeting: Multilevel modeling: introducing level 1 (individual) and level 2 (contextual) variables + interactions Subjects for today:  Intra Class.
Objectives of Multiple Regression
Introduction to Linear Regression and Correlation Analysis
Chapter 13: Inference in Regression
Chapter 11 Simple Regression
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
Today’s program Herwart / Axel: Kiva intro (the Galak et al. paper) Follow-up questions Non-response (and respondent list) Multi-level models in Stata.
Typical paper follow-ups Paper is wrong (in the sense of a real mistake) There is an alternative explanation for the analytical results. You test that.
Advanced Methods and Models in Behavioral Research – 2010/2011 AMMBR course design CONTENT METHOD Y is 0/1 conjoint analysis logistic regression multi-level.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Hierarchical Linear Modeling (HLM): A Conceptual Introduction Jessaca Spybrook Educational Leadership, Research, and Technology.
Introduction Multilevel Analysis
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Dealing with data All variables ok? / getting acquainted Base model Final model(s) Assumption checking on final model(s) Conclusion(s) / Inference Better.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Multilevel Linear Models Field, Chapter 19. Why use multilevel models? Meeting the assumptions of the linear model – Homogeneity of regression coefficients.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
1st meeting: Multilevel modeling: introduction Subjects for today:  Basic statistics (testing)  The difference between regression analysis and multilevel.
Testing Hypotheses about Differences among Several Means.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Chapter 13 Multiple Regression
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
I271B The t distribution and the independent sample t-test.
Assumption checking in “normal” multiple regression with Stata.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Advanced MMBR Conjoint analysis (1). Advanced Methods and Models in Behavioral Research Conjoint analysis -> Multi-level models You have to understand:
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Advanced MMBR Conjoint analysis. Advanced Methods and Models in Behavioral Research Conjoint analysis -> Multi-level models You have to understand: -What.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Stats Methods at IC Lecture 3: Regression.
Module II Lecture 1: Multiple Regression
Chapter 14 Introduction to Multiple Regression
Linear Mixed Models in JMP Pro
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Slope of the regression line:
LESSON 24: INFERENCES USING REGRESSION
Presentation transcript:

Advanced Methods and Models in Behavioral Research – 2009/2010 AMMBR course design CONTENT METHOD Y is 0/1 conjoint analysis logistic regression multi-level methods our own survey

Advanced Methods and Models in Behavioral Research – 2009/2010 DATA COLLECTION

Advanced Methods and Models in Behavioral Research – 2009/2010 Data collection surveys not ready to roll – some technical issues unsolved please wait until you get the green light to send your invitations If you have not done so yet – send your list of respondents (it’s mandatory)

Advanced Methods and Models in Behavioral Research – 2009/2010 MULTI – LEVEL ANALYSIS

Advanced Methods and Models in Behavioral Research – 2009/2010 Multi-level models or... dealing with clustered data. One solution: the variance component model Bayesian hierarchical models mixed models (in SPSS) hierarchical linear models random effects models random coefficient models subject specific models variance component models variance heterogeneity models

Advanced Methods and Models in Behavioral Research – 2009/2010 Clustered data / multi-level models Pupils within schools (within regions within countries) Firms within regions (or sectors) Vignettes within persons [copy to blackboard]

Advanced Methods and Models in Behavioral Research – 2009/2010 Two issues with clustered data Your estimates will (in all likelihood) be too precise: you find effects that do not exist in the population [further explanation on blackboard] You will want to distinguish between effects within clusters and effects between clusters [see next two slides]

Advanced Methods and Models in Behavioral Research – 2009/2010 On individual vs aggregate data For instance: X = introvertX = age of McDonald’s employee Y = school resultsY = like the manager

Advanced Methods and Models in Behavioral Research – 2009/2010 Had we only known, that the data are clustered! So the effect of X within clusters can be different from the effect between clusters! Using the school example: lines represent schools. And within schools the effect of being introvert is positive!

Advanced Methods and Models in Behavioral Research – 2009/2010 MAIN MESSAGES Be able to recognize clustered data and deal with it appropriately (how you do that will follow) Distinguish two kinds of effects: those at the "micro-level" vs those at the aggregate level (and do not test a micro-hypothesis with aggregate data)

Advanced Methods and Models in Behavioral Research – 2009/2010 A toy example – two schools, two pupils Overall mean(0) Two schools each with two pupils. We first calculate the means. Overall mean= (3+2+(-1)+(-4))/4= exam score School 2School 1 (taken from Rasbash)

Advanced Methods and Models in Behavioral Research – 2009/2010 Now the variance Overall mean(0) exam score School 2School 1 The total variance is the sum of the squares of the departures of the observations around mean divided by the sample size (4) = ( )/4=7.5

Advanced Methods and Models in Behavioral Research – 2009/2010 The variance of the school means around the overall mean exam score School 2School 1 Overall mean(0) The variance of the school means around the overall mean= ( (-2.5) 2 )/2=6.25 (total variance was 7.5)

Advanced Methods and Models in Behavioral Research – 2009/2010 The variance of the pupils scores around their school’s mean exam score School 2School The variance of the pupils scores around their school’s mean= ((3-2.5) 2 + (2-2.5) 2 + (-1-(-2.5)) 2 + (-4-(-2.5)) 2 )/4 =1.25

Advanced Methods and Models in Behavioral Research – 2009/2010 -> So you can partition the variance in individual level and school level How much of the variability in pupil attainment is attributable to factors at the school and how much to factors at the pupil level? In terms of our toy example we can now say 6.25/7.5= 82% of the total variation of pupils attainment is attributable to school level factors 1.25/7.5= 18% of the total variation of pupils attainment is attributable to pupil level factors And this is important; we want to know how to explain (in this example) school attainment, and appararently the differences are at the school level more than the pupil level

Advanced Methods and Models in Behavioral Research – 2009/2010 Standard multiple regression won't do YD1D2D3D4D5id… ……………3 +2……………3 -3……………4 +4……………4 ………………… So: You can use all the data and just run a multiple regression, but then you disregard the clustering effect, which gives uncorrect confidence intervals You can aggregate within clusters, and then run a multiple regression on the aggregate data. Two problems: no individual level testing possible + you get less data points. So what can we do?

Advanced Methods and Models in Behavioral Research – 2009/2010 Multi-level models The usual multiple regression model assumes... with the subscript "i" defined at the case-level.... and the epsilons independently distributed with covariance matrix I. With clustered data, you know these assumptions are not met.

Advanced Methods and Models in Behavioral Research – 2009/2010 Solution 1: add dummy-variables per cluster So just multiple regression, but with as many dummy variables as you have clusters (minus 1)... where, in this example, there are j+1 clusters. IF the clustering is (largely) due to differences in the intercept between persons, this might work. BUT if there are only a handful of cases per person, this necessitates a huge number of extra variables

Advanced Methods and Models in Behavioral Research – 2009/2010 Solution 2: split your micro-level X-vars Say you have: then create: and add both as predictors (instead of x 1 ) Make sure that you understand what is happening here, and why it is of use.

Advanced Methods and Models in Behavioral Research – 2009/2010 Solution 3: the variance component model In the variance component model, we split the randomness in a "personal part" and a "rest part"

Advanced Methods and Models in Behavioral Research – 2009/2010 Now: how do you do this in Stata? [note to CS: use age and schooling as examples to split at restaurant level] relevant commands xtset and xtreg bys : egen = mean( ) gen dvarB = - convenience commands tab, gen()drop orderdes editsum

Advanced Methods and Models in Behavioral Research – 2009/2010 Up next How do we run the "Solution 1" and "Solution 2" analysis? Random intercept we now saw, but how about random slopes?

Advanced Methods and Models in Behavioral Research – 2009/2010 When you have multi-level data (2 levels) 1.If applicable: consider whether using separate dummies per group might help (use only when this does not create a lot of dummies) 2.Run an empty mixed model (i.e., just the constant included) in Stata. Look at the level on which most of the variance resides. 3.If applicable: divide micro-variables in "group mean" variables and "difference from group mean" variables. 4.Re-run your mixed model with these variables included (as you would a multiple regression analysis)

Advanced Methods and Models in Behavioral Research – 2009/2010 To Do Put your respondent list only if you have not done so yet. Check the material online (as of tomorrow morning) Check in Stata: how do I create: –dummies per cluster –the mean of a variable within a cluster –the deviation from the mean within a cluster Next time: bring your laptop. We’ll have a full session of practicing only