Factorial Survey Methods: and the use of HLM, HOLIT, HULIT, and HLIT Models R. L. Brown, Ph.D. University of Wisconsin-Madison

Slides:



Advertisements
Similar presentations
Assumptions underlying regression analysis
Advertisements

Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Qualitative predictor variables
StatisticalDesign&ModelsValidation. Introduction.
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/
Logistic Regression Psy 524 Ainsworth.
Longitudinal and Multilevel Methods for Models with Discrete Outcomes with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David.
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Linear regression models
Generalized Linear Models (GLM)
Introduction to Categorical Data Analysis
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Correlation and Simple Regression Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.

Clustered or Multilevel Data
Correlation. Two variables: Which test? X Y Contingency analysis t-test Logistic regression Correlation Regression.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Introduction to Regression Analysis, Chapter 13,
Multivariate Probability Distributions. Multivariate Random Variables In many settings, we are interested in 2 or more characteristics observed in experiments.
Lecture 5 Correlation and Regression
Introduction to Multilevel Modeling Using SPSS
1 Regression Models with Binary Response Regression: “Regression is a process in which we estimate one variable on the basis of one or more other variables.”
Regression and Correlation Methods Judy Zhong Ph.D.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Chapter 11 Simple Regression
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
Hierarchical Linear Modeling (HLM): A Conceptual Introduction Jessaca Spybrook Educational Leadership, Research, and Technology.
Introduction Multilevel Analysis
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Introduction to Multilevel Modeling Stephen R. Porter Associate Professor Dept. of Educational Leadership and Policy Studies Iowa State University Lagomarcino.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
Regression. Types of Linear Regression Model Ordinary Least Square Model (OLS) –Minimize the residuals about the regression linear –Most commonly used.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Discrete Choice Modeling William Greene Stern School of Business New York University.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 13 Multiple Regression
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Generalized Linear Models (GLMs) and Their Applications.
A first order model with one binary and one quantitative predictor variable.
Nonparametric Statistics
[Part 5] 1/43 Discrete Choice Modeling Ordered Choice Models Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Biostatistics Class 3 Probability Distributions 2/15/2000.
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L12.1 Lecture 12: Generalized Linear Models (GLM) What are they? When do.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
BINARY LOGISTIC REGRESSION
John Loucks St. Edward’s University . SLIDES . BY.
Generalized Linear Models
Simple Linear Regression - Introduction
Introduction to logistic regression a.k.a. Varbrul
Correlation and Regression
From GLM to HLM Working with Continuous Outcomes
Fixed, Random and Mixed effects
Presentation transcript:

Factorial Survey Methods: and the use of HLM, HOLIT, HULIT, and HLIT Models R. L. Brown, Ph.D. University of Wisconsin-Madison

Rossi PH, Anderson AB. The factorial survey approach: an introduction. In Rossi PH, Nock SL, eds. Measuring social judgments: the factorial survey approach. Beverly Hills: Sage, Classic Reference

The use of vignettes in social science research has a long history, including applications in psychological research since the 1950s.

Factorial Survey Methods are a technique for applying experimental design to survey research. Basic Definition

FS Methodology FS has been devised to help in the unravelling of complex choices FS offer a better approximation to “real” cases than traditional survey questions FS describe fictitious persons (families, organizations, etc.) whose relevant characteristics are described in sketches.

FS Methodology (continued) FS are constructed using experimental design by systematically varying factors or characteristics. FS may allow us to investigate hard to study issues due to simulation. FS major limitation – reality?

Factorial Surveys typically use designs with repeated measures 2 factor designs

A1A1 A1A1 A1A1 A2A2 A2A2 A2A2 B1B1 B2B2 B3B3 B1B1 B2B2 B3B3 S1S S2S S3S SkSk (A x B x S) Design Physician Trust A = Gender of physician, B = Race of physician

ABS (A x B x S) Design SiSi a1b1a1b2a1b3a2b1a2b2a2b3 Repeated Measures Factorial Designs produce hierarchical or multilevel data. Vignette- level Patient-level

Typically we want to study the relationships between variables both within and between levels. What’s the problem with using classic general linear modeling (GLM)? Statistical Analysis

Assumptions of the GLM Cases are independently sampled from a normal distribution (Vignettes) –Vignettes are hierarchical not randomly sampled The covariance between individual error terms is assumed to be zero –Since vignettes are judged by the same subject correlation between error terms is likely.

Responses may be categorical or unlimited continuous Unlimited continuous = HLM (Hierarchical linear modeling) Binary outcome = HLIT (Hierarchical Logit modeling) Ordered categorical outcome = HOLIT (Hierarchical ordered logit modeling) Unordered categorical outcome = HULIT (Hierarchical unordered logit modeling)

Example Brown, R., Brown, R. L., Rounds, L., Castelaz, C., and Papasouliotis, O. Physicians' decisions to prescribe Benzodiazepines for nervousness and insomnia. Journal of General Internal Medicine, 1997, 12, Brown, R., Brown, R. L., Edwards, J., and Nutz, J. Variation in a medical faculty's decisions to transfuse: Implications for modifying blood product utilization. Medical Care, 1992; 30(12),

(A x B x C x D x E x S) Design A = Gender – 2 levels B = Race – 2 levels C = Psychiatric Diagnosis – 3 levels D = Patient Health Status – 2 levels E = Patient’s Reported Alcohol Use – 3 levels

Factor (C=3) Factor (D=2) Factor (E=3)

Incomplete Factorial Design D1D2D3 HS1HS2HS1HS2HS1HS2 AU1AU2AU3AU1AU2AU3AU1AU2AU3 Male Female AA WW

The hierarchical logistic model poses two regression equations, one modeling the vignette effects within the respondents, and the other modeling respondent effects between respondents. First, for each respondent we model a separate within-respondent regression model: where  ij = the prescribing probability for vignette i by physician j, x ijp = the value of the vignette characteristics for vignette i and respondent j,  ip = the regression coefficients within respondent j. for i = 1, 2,..., k vignettes j = 1, 2,..., n respondents, p = 0, 1,..., p vignette variables.

Physician responses on the respondent level are subsequently predicted by the values of the corresponding vignette characteristics. Second, each of the regression coefficients from the above model may be represented as a between-respondents model: Where  im = the within-respondents regression coefficient for vignette characteristic m and physician respondent i, z ri = the values of the respondent characteristics for physician respondent i,  rm = the regression coefficient that describes the effects of respondent variables on the within- respondents relationships  im, u im = random errors.

The observed (0,1) prescribing response, at level 1, is y ij ~ Bin(1,  ij ) with Binomial variance  ij (1-  ij ). This assumption of binomial variation can be tests by fitting an ‘extra- binomial parameter’  2 e, so the vignette-level variance would be  2 e  ij (1-  ij ). Estimates close to 1.00 indicate appropriateness of the binomial assumption.