Ordinal Regression Analysis: Fitting the Proportional Odds Model Using Stata and SAS Xing Liu Neag School of Education University of Connecticut.

Slides:



Advertisements
Similar presentations
Dummy Dependent variable Models
Advertisements

Continued Psy 524 Ainsworth
Brief introduction on Logistic Regression
Simple Logistic Regression
[Part 1] 1/15 Discrete Choice Modeling Econometric Methodology Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
Limited Dependent Variables
HSRP 734: Advanced Statistical Methods July 24, 2008.
Trashball: A Logistic Regression Classroom Activity Christopher Morrell (Joint work with Richard Auer) Mathematics and Statistics Department Loyola University.
Overview of Logistics Regression and its SAS implementation
Multiple Logistic Regression RSQUARE, LACKFIT, SELECTION, and interactions.
The Generalized IV Estimator IV estimation with a single endogenous regressor and a single instrument can be naturally generalized. Suppose that there.
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
1 Module II Lecture 8: Categorical/Limited Dependent Variables and Logistic Regression Graduate School Quantitative Research Methods Gwilym Pryce.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
To accompany Quantitative Analysis for Management, 9e by Render/Stair/Hanna 4-1 © 2006 by Prentice Hall, Inc., Upper Saddle River, NJ Chapter 4 RegressionModels.
Instructor: K.C. Carriere
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
Ordinal Logistic Regression
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
GRA 6020 Multivariate Statistics Probit and Logit Models Ulf H. Olsson Professor of Statistics.
Generalized Linear Models
Ordinal Logistic Regression “Good, better, best; never let it rest till your good is better and your better is best” (Anonymous)
Psychometrics Timothy A. Steenbergh and Christopher J. Devers Indiana Wesleyan University.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
1 Regression Models with Binary Response Regression: “Regression is a process in which we estimate one variable on the basis of one or more other variables.”
Lecture 14-1 (Wooldridge Ch 17) Linear probability, Probit, and
1 G Lect 11W Logistic Regression Review Maximum Likelihood Estimates Probit Regression and Example Model Fit G Multiple Regression Week 11.
1 Rob Woodruff Battelle Memorial Institute, Health & Analytics Cynthia Ferre Centers for Disease Control and Prevention Conditional.
EE325 Introductory Econometrics1 Welcome to EE325 Introductory Econometrics Introduction Why study Econometrics? What is Econometrics? Methodology of Econometrics.
Teaching Microeconometrics using at Warsaw School of Economics Marcin OwczarczukMonika Książek.
1 Advanced Topics in Regression Quantile Regression Analysis of Causality Mediation Analysis Hierarchical Linear Modeling Compiled by Nick Evangelopoulos,
HSRP 734: Advanced Statistical Methods June 19, 2008.
Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Business Intelligence and Decision Modeling Week 11 Predictive Modeling (2) Logistic Regression.
Limited Dependent Variables Ciaran S. Phibbs May 30, 2012.
Chap 14-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics.
References A. Good Intros 1. Knoke, David and Peter J. Burke (1980), Log Linear Modeling, Sage. 2. Fienberg, Stephen E. (1980), The Analysis of Cross-Classified.
Nested Logit Model by Asif Khan Phd Graduate Seminar in advance Statistics Institute of Rural Development (IRE) Georg-August University Goettingen July.
1 GCRC Data Analysis with SPSS Workshop Session 5 Follow Up on FEV data Binary and Categorical Outcomes 2x2 tables 2xK tables JxK tables Logistic Regression.
Logistic Regression Applications Hu Lunchao. 2 Contents 1 1 What Is Logistic Regression? 2 2 Modeling Categorical Responses 3 3 Modeling Ordinal Variables.
Lecture 18 Ordinal and Polytomous Logistic Regression BMTRY 701 Biostatistical Methods II.
Limited Dependent Variables Ciaran S. Phibbs. Limited Dependent Variables 0-1, small number of options, small counts, etc. 0-1, small number of options,
More complex event history analysis. Start of Study End of Study 0 t1 0 = Unemployed; 1 = Working UNEMPLOYMENT AND RETURNING TO WORK STUDY Spell or Episode.
1 ASSESSING THE PERFORMANCE OF MEDICAL DIAGNOSTIC SYSTEMS: THE RECEIVER OPERATING CHARACTERISTIC (ROC) CURVE JOSEPH GEORGE CALDWELL, PH.D. 27 FEBRUARY.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Testing statistical significance of differences between coefficients Jane E. Miller, PhD The Chicago Guide to Writing about Multivariate Analysis, 2nd.
Logistic Regression Analysis Gerrit Rooks
Qualitative and Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Panel Data Analysis Using GAUSS 2 Kuan-Pin Lin Portland State University.
1 Introduction to Modeling Beyond the Basics (Chapter 7)
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Panel Data Analysis Using GAUSS 3 Kuan-Pin Lin Portland State University.
Nonlinear Logistic Regression of Susceptibility to Windthrow Seminar 7 Likelihood Methods in Forest Ecology October 9 th – 20 th, 2006.
The Probit Model Alexander Spermann University of Freiburg SS 2008.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
Non-Linear Dependent Variables Ciaran S. Phibbs November 17, 2010.
1 Stabilization activity in Italian IPOs by Dimitri Borienko and Stefano Lombardo: Discussion Carlo Drago, Paolo Santella Italian Society of Law and Economics.
The Probit Model Alexander Spermann University of Freiburg SoSe 2009
Using Multilevel Modeling in Institutional Research
Logistic Regression APKC – STATS AFAC (2016).
M.Sc. in Economics Econometrics Module I
Lecture 1 Basic Econometrics Rifai Afin SE, MSc.
Generalized Linear Models
Introduction to logistic regression a.k.a. Varbrul
Introductory Econometrics
ORDINAL REGRESSION MODELS
Generalized Ordered Logit Models Part II: Interpretation
Presentation transcript:

Ordinal Regression Analysis: Fitting the Proportional Odds Model Using Stata and SAS Xing Liu Neag School of Education University of Connecticut

Purpose Introduce Ordinal Logistic Regression Analysis Demonstrate the use of the proportional odds (PO) model using Stata (V. 9.0) Compare the results of the proportional odds model using both Stata OLOGIT and SAS LOGISTIC.

Why Ordinal Regression Analysis? Ordinal Dependent Variable  Teaching experience  SES (high, middle, low)  Degree of Agreement  Ability level (e.g. literacy, reading) Context is important

Why Using STATA and SAS? Both are powerful general statistics software packages Stata are more powerful in the analysis of binary logistic regression and ordinal logistic regression SAS has two options for Ordinal dependent variable  Ascending  Descending

Proportional Odds Model (1) One of several possible regression models for the analysis of ordinal data, and also the most common. Model predicts the ln(odds) of being in category j or beyond. Simplifying assumption  Effect of an IV assumed to be invariant across splits

Proportional Odds Model (2) Model predicts cumulative logits across K-1 response option categories. For K=6, (here, Y = 0 to 5) these cumulative logits can be used to make predictions for the K-1=five cumulative probabilities, given the collection of explanatory variables: Logit = ln(odds); Probability = odds / (1 + odds) logits  odds  estimated probability

A Latent-variable Model (1) It can be expressed as a latent variable model (Agresti, 2002; Greene, 2003; Long, 1997, Long & Freese, 2006; Powers & Xie, 2000; Wooldridge & Jeffrey, 2001) Assuming a latent variable, Y* exists, we can define Y* = xβ + ε Let Y* be divided by some cut points (thresholds): α1, α2, α3… αj, and α1<α2<α3…< αj. The observed child’s literacy proficiency level is the ordinal outcome, y, ranging from 0 to 5

A Latent-variable Model (2) We define:

A Latent-variable Model (3) We can compute probability of a child attaining each proficiency level.  P(y=0) = P (y* ≤α1) = P(xβ + ε ≤ α1) = F (α1- xβ);  P(y=1) = P (α1<y* ≤α2) = F (α2- xβ)- F (α1- xβ);  …  P(y=4) = P (α4<y* ≤α5) = F (α5- xβ)- F (α4- xβ);  P(y=5) = P (y* >α5) = 1- F (α5- xβ). We can also compute the cumulative probabilities using the form: P(Y≤j) = F (αj - xβ)

General Logistic Regression Model In a binary logistic regression model, we predict the log(odds) of success on a set of predictors. In Stata, the ordinal logistic regression model is expressed as: SAS uses a different ordinal logit model for estimating the parameters

Methodology Sample: ECLS-K Longitudinal Study (NCES)  n= 3365 from 225 schools  Outcome variable: proficiency in early reading  Eight explanatory variables The PO model with a single explanatory variable was fitted first Full-model with all eight explanatory variables The assumption of the PO models were tested Software packages: STATA and SAS

Results

Figure 1: Full-Model Analysis of Proportional Odds Using Stata

Figure 2: Brant Test of Parallel Regression (Proportional Odds) Assumption

Figure 3: Measure of Fit Statistics for Full- Model

Conclusions Both packages produce the same or similar results in model fit statistics and the test of the proportional odds assumption Stata produces more detailed information of PO assumption test, and fit statistics The estimated coefficients and cut points (thresholds) are the same in magnitude but may be reversed in sign Compared to Stata, SAS (ascending and descending) does not negate the signs before the logit coefficients in the equations

References Agresti, A. (2002). Categorical data analysis (2nd ed.). New York: John Wiley & Sons. Agresti, A. (1996). An introduction to categorical data analysis. New York: John Wiley & Sons. Allison, P.D. (1999). Logistic regression using the SAS system: Theory and application. Cary, NC: SAS Institute, Inc. Ananth, C. V. and Kleinbaum, D. G. (1997). Regression models for ordinal responses: A review of methods and applications. International Journal of Epidemiology 26, p Armstrong, B. B. & Sloan, M. (1989). Ordinal regression models for epidemiological data. American Journal of Epidemiology, 129(1), Bender, R. & Benner, A. (2000). Calculating ordinal regression models in SAS and S-Plus. Biometrical Journal, 42(6), Bender, R. & Grouven, U. (1998). Using binary logistic regression models for ordinal data with non-proportional odds. Journal of Clinical Epidemiology, 51(10), Brant (1990). Assessing proportionality in the proportional odds model for ordinal logistic regression. Biometrics, 46, Clogg, C.C., Shihadeh, E.S. (1994). Statistical models for ordinal variables. Thousand Oaks, CA: Sage. Greene, William H. (2003). Econometric analysis (5th ed.). Upper Saddle River, NJ: Prentice Hall. Hosmer, D. W. & Lemeshow, S. (2000). Applied logistic regression (2nd ed.). New York: John Wiley & Sons. Long, J. S. (1997). Regression models for categorical and limited dependent variables. Thousand Oaks, CA: Sage. Long, J. S. & Freese, J. (2006). Regression models for categorical dependent variables using Stata (2nd ed.). Texas: Stata Press. McCullagh, P. (1980). Regression models for ordinal data (with discussion). Journal of the Royal Statistical Society Ser. B, 42, McCullagh, P. & Nelder, J. A. (1989). Generalized linear models(2nd ed.). London: Chapman and Hall. Menard, S. (1995). Applied logistic regression analysis. Thousand Oaks, CA: Sage. O’Connell, A.A., (2000). Methods for modeling ordinal outcome variables. Measurement and Evaluation in Counseling and Development, 33(3), O’Connell, A. A. (2006). Logistic regression models for ordinal response variables. Thousand Oaks: SAGE. O’Connell, A.A., Liu, X., Zhao, J., & Goldstein, J. (2006, April). Model Diagnostics for proportional and partial proportional odds models. Paper presented at the 2006 Annual American Educational Research Association (AERA). San Francisco, CA. Powers D. A., & Xie, Y. (2000). Statistical models for categorical data analysis. San Diego, CA: Academic Press. Wooldridge, Jeffrey M. (2001). Econometric analysis of cross section and panel data. Cambridge, MA: The MIT Press.