Multinomial Logit Sociology 8811 Lecture 11 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

Brief introduction on Logistic Regression
Gologit2: Generalized Logistic Regression/ Partial Proportional Odds Models for Ordinal Dependent Variables Part 1: The gologit model & gologit2 program.
Logistic Regression Psy 524 Ainsworth.
Discrete Choice Modeling William Greene Stern School of Business IFS at UCL February 11-13, 2004
The Art of Model Building and Statistical Tests. 2 Outline The art of model building Using Software output The t-statistic The likelihood ratio test The.
Logit & Probit Regression
Limited Dependent Variables
Multinomial Logit Sociology 229: Advanced Regression
Introduction to Logistic Regression In Stata Maria T. Kaylen, Ph.D. Indiana Statistical Consulting Center WIM Spring 2014 April 11, 2014, 3:00-4:30pm.
Count Models Sociology 229: Advanced Regression Copyright © 2010 by Evan Schofer Do not copy or distribute without permission.
Gologit2: Generalized Logistic Regression/ Partial Proportional Odds Models for Ordinal Dependent Variables Richard Williams Department of Sociology University.
1 BINARY CHOICE MODELS: PROBIT ANALYSIS In the case of probit analysis, the sigmoid function F(Z) giving the probability is the cumulative standardized.
Multilevel Models 4 Sociology 8811, Class 26 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
Lecture 17: Regression for Case-control Studies BMTRY 701 Biostatistical Methods II.
1Prof. Dr. Rainer Stachuletz Limited Dependent Variables P(y = 1|x) = G(  0 + x  ) y* =  0 + x  + u, y = max(0,y*)
Binary Response Lecture 22 Lecture 22.
Multinomial Logistic Regression
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
An Introduction to Logistic Regression JohnWhitehead Department of Economics Appalachian State University.
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
In previous lecture, we dealt with the unboundedness problem of LPM using the logit model. In this lecture, we will consider another alternative, i.e.
Topic 3: Regression.
Event History Models Sociology 229: Advanced Regression Class 5
Lecture 14-2 Multinomial logit (Maddala Ch 12.2)
Event History Models 2 Sociology 229A: Event History Analysis Class 4 Copyright © 2008 by Evan Schofer Do not copy or distribute without permission.
Multiple Regression 2 Sociology 5811 Lecture 23 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Linear Regression 2 Sociology 5811 Lecture 21 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Logistic Regression Sociology 229: Advanced Regression
Logistic Regression- Dichotomous Dependent Variables March 21 & 23, 2011.
Methods Workshop (3/10/07) Topic: Event Count Models.
1 BINARY CHOICE MODELS: PROBIT ANALYSIS In the case of probit analysis, the sigmoid function is the cumulative standardized normal distribution.
Logistic Regression 2 Sociology 8811 Lecture 7 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
Returning to Consumption
Serial Correlation and the Housing price function Aka “Autocorrelation”
Multinomial Logit Sociology 8811 Lecture 10
What is the MPC?. Learning Objectives 1.Use linear regression to establish the relationship between two variables 2.Show that the line is the line of.
EHA: More On Plots and Interpreting Hazards Sociology 229A: Event History Analysis Class 9 Copyright © 2008 by Evan Schofer Do not copy or distribute without.
Multiple Regression 3 Sociology 5811 Lecture 24 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Count Models 1 Sociology 8811 Lecture 12
Limited Dependent Variables Ciaran S. Phibbs May 30, 2012.
Lecture Slide #1 Logistic Regression Analysis Estimation and Interpretation Hypothesis Tests Interpretation Reversing Logits: Probabilities –Averages.
Qualitative and Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Lecture 18 Ordinal and Polytomous Logistic Regression BMTRY 701 Biostatistical Methods II.
Limited Dependent Variables Ciaran S. Phibbs. Limited Dependent Variables 0-1, small number of options, small counts, etc. 0-1, small number of options,
Special topics. Importance of a variable Death penalty example. sum death bd- yv Variable | Obs Mean Std. Dev. Min Max
The dangers of an immediate use of model based methods The chronic bronchitis study: bronc: 0= no 1=yes poll: pollution level cig: cigarettes smokes per.
Qualitative and Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Logistic Regression 2 Sociology 8811 Lecture 7 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
Birthweight (gms) BPDNProp Total BPD (Bronchopulmonary Dysplasia) by birth weight Proportion.
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.
The Probit Model Alexander Spermann University of Freiburg SS 2008.
Logit Models Alexander Spermann, University of Freiburg, SS Logit Models.
BINARY LOGISTIC REGRESSION
EHS Lecture 14: Linear and logistic regression, task-based assessment
Logistic Regression APKC – STATS AFAC (2016).
Advanced Quantitative Techniques
MULTINOMIAL REGRESSION MODELS
Discussion: Week 4 Phillip Keung.
Advanced Quantitative Techniques
Lecture 18 Matched Case Control Studies
Event History Analysis 3
A Logit model of brand choice calibrated on scanner data
Introduction to Logistic Regression
Gologit2: Generalized Logistic Regression/ Partial Proportional Odds Models for Ordinal Dependent Variables Part 1: The gologit model & gologit2 program.
Problems with infinite solutions in logistic regression
Logistic Regression 4 Sociology 8811 Lecture 9
Count Models 2 Sociology 8811 Lecture 13
Presentation transcript:

Multinomial Logit Sociology 8811 Lecture 11 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission

Announcements Paper # 1 due March 8 Look for data NOW!!!

Multinomial Logistic Regression What if you want have a dependent variable with more than two outcomes? A “polytomous” outcome Multinomial Logit strategy: Contrast outcomes with a common “reference point” Similar to conducting a series of 2-outcome logit models comparing pairs of categories The “reference category” is like the reference group when using dummy variables in regression –It serves as the contrast point for all analyses

MLogit Example: Family Vacation Mode of Travel. Reference category = Train. mlogit mode income familysize Multinomial logistic regression Number of obs = 152 LR chi2(4) = Prob > chi2 = Log likelihood = Pseudo R2 = mode | Coef. Std. Err. z P>|z| [95% Conf. Interval] Bus | income | family size | _cons | Car | income | family size | _cons | (mode==Train is the base outcome) Large families less likely to take bus (vs. train) Note: It is hard to directly compare Car vs. Bus in this table

MLogit Example: Car vs. Bus vs. Train Mode of Travel. Reference category = Car. mlogit mode income familysize, base(3) Multinomial logistic regression Number of obs = 152 LR chi2(4) = Prob > chi2 = Log likelihood = Pseudo R2 = mode | Coef. Std. Err. z P>|z| [95% Conf. Interval] Train | income | family size | _cons | Bus | income | family size | _cons | (mode==Car is the base outcome) Here, the pattern is clearer: Wealthy & large families use cars

Predicted Probability Across X Vars Like logit, you can show how probabilies change across independent variables However, “adjust” command doesn’t work with mlogit So, manually compute mean of predicted probabilities –Note: Other variables will be left “as is” unless you set them manually before you use “predict”. mean predcar, over(familysize) Over | Mean predcar | 1 | | | | | | Probability of using car increases with family size Note: Values bounce around because other vars are not set to common value. Note 2: Again, scatter plots aid in summarizing such results

Stata Notes: mlogit Like logit, you can’t include variables that perfectly predict the outcome Note: Stata “logit” command gives a warning of this mlogit command doesn’t give a warning, but coefficient will have z-value of zero, p-value =1 Remove problematic variables if this occurs!

Hypothesis Tests Individual coefficients can be tested as usual Wald test/z-values provided for each variable However, adding a new variable to model actually yields more than one coefficient If you have 4 categories, you’ll get 3 coefficients LR tests are especially useful because you can test for improved fit across the whole model

LR Tests in Multinomial Logit Example: Does “familysize” improve model? Recall: It wasn’t always significant… maybe not! –Run full model, save results mlogit mode income familysize estimates store fullmodel –Run restricted model, save results mlogit mode income estimates store smallmodel –Compare: lrtest fullmodel smallmodel Likelihood-ratio test LR chi2(2) = 9.55 (Assumption: smallmodel nested in fullmodel) Prob > chi2 = Yes, model fit is significantly improved

Multinomial Logit Assumptions: IIA Multinomial logit is designed for outcomes that are not complexly interrelated Critical assumption: Independence of Irrelevant Alternatives (IIA) Odds of one outcome versus another should be independent of other alternatives –Problems often come up when dealing with individual choices… Multinomial logit is not appropriate if the assumption is violated.

Multinomial Logit Assumptions: IIA IIA Assumption Example: –Odds of voting for Gore vs. Bush should not change if Nader is added or removed from ballot If Nader is removed, those voters should choose Bush & Gore in similar pattern to rest of sample –Is IIA assumption likely met in election model? –NO! If Nader were removed, those voters would likely vote for Gore Removal of Nader would change odds ratio for Bush/Gore.

Multinomial Logit Assumptions: IIA IIA Example 2: Consumer Preferences –Options: coffee, Gatorade, Coke Might meet IIA assumption –Options: coffee, Gatorade, Coke, Pepsi Won’t meet IIA assumption. Coke & Pepsi are very similar – substitutable. Removal of Pepsi will drastically change odds ratios for coke vs. others.

Multinomial Logit Assumptions: IIA Solution: Choose categories carefully when doing multinomial logit! Long and Freese (2006), quoting Mcfadden: “Multinomial and conditional logit models should only be used in cases where the alternatives “can plausibly be assumed to be distinct and weighed independently in the eyes of the decisionmaker.” Categories should be “distinct alternatives”, not substitutes –Note: There are some formal tests for violation of IIA. But they don’t work well. Don’t use them. See Long and Freese (2006) p. 243

Multinomial Assumptions/Problems Aside from IIA, assumptions & problems of multinomial logit are similar to standard logit Sample size –You often want to estimate MANY coefficients, so watch out for small N Outliers Multicollinearity Model specification / omitted variable bias Etc.

Real World Multinomial Example Gerber (2000): Russian political views Prefer state control or Market reforms vs. uncertain Older Russians more likely to support state control of economy (vs. being uncertain) Younger Russians prefer market reform (vs. uncertain)

Multinomial Example 2 Example: McVeigh, Rory and Christian Smith “Who Protests in America: An Analysis of Three Political Alternatives – Inaction, Institutionalized Politics, or Protest.” Sociological Forum, 14, 4:

Other Logit-type Models Ordered logit: Appropriate for ordered categories Useful for non-interval measures Useful if there are too few categories to use OLS Conditional Logit Useful for “alternative specific” data –Ex: Data on characteristics of voters AND candidates Also: McFadden’s Choice Model –A variant to model choices Problems with IIA assumption Nested logit, Alternative specific multinomial probit And several others!