1/53: Topic 3.1 – Models for Ordered Choices Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William.

Slides:



Advertisements
Similar presentations
Econometrics I Professor William Greene Stern School of Business
Advertisements

Discrete Choice Modeling William Greene Stern School of Business New York University Lab Sessions.
Discrete Choice Modeling
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.
Part 24 – Statistical Tests:3 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.
[Part 1] 1/15 Discrete Choice Modeling Econometric Methodology Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
Discrete Choice Modeling William Greene Stern School of Business New York University Lab Sessions.
Empirical Methods for Microeconomic Applications William Greene Department of Economics Stern School of Business.
Discrete Choice Modeling William Greene Stern School of Business New York University Lab Sessions.
Clustered or Multilevel Data
Part 18: Ordered Outcomes [1/88] Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business.
1/62: Topic 2.3 – Panel Data Binary Choice Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA.
Part 7: Multiple Regression Analysis 7-1/54 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
8. Heterogeneity: Latent Class Models. Latent Classes A population contains a mixture of individuals of different types (classes) Common form of the.
Discrete Choice Modeling William Greene Stern School of Business New York University Lab Sessions.
Econometric Methodology. The Sample and Measurement Population Measurement Theory Characteristics Behavior Patterns Choices.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Empirical Methods for Microeconomic Applications William Greene Department of Economics Stern School of Business.
Introduction to plausible values National Research Coordinators Meeting Madrid, February 2010.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.
[Part 4] 1/43 Discrete Choice Modeling Bivariate & Multivariate Probit Discrete Choice Modeling William Greene Stern School of Business New York University.
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Discrete Choice Modeling William Greene Stern School of Business New York University.
1/69: Topic Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New.
Discrete Choice Modeling William Greene Stern School of Business New York University.
[Topic 9-Latent Class Models] 1/66 9. Heterogeneity: Latent Class Models.
1/62: Topic 2.3 – Panel Data Binary Choice Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Qualitative and Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Discrete Choice Modeling William Greene Stern School of Business New York University.
1/53: Topic 3.1 – Models for Ordered Choices Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William.
6. Ordered Choice Models. Ordered Choices Ordered Discrete Outcomes E.g.: Taste test, credit rating, course grade, preference scale Underlying random.
[Part 5] 1/43 Discrete Choice Modeling Ordered Choice Models Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
Discrete Choice Modeling William Greene Stern School of Business New York University.
1/26: Topic 2.2 – Nonlinear Panel Data Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William.
5. Extensions of Binary Choice Models
Microeconometric Modeling
Microeconometric Modeling
William Greene Stern School of Business New York University
Microeconometric Modeling
William Greene Stern School of Business New York University
Discrete Choice Modeling
Discrete Choice Modeling
Discrete Choice Modeling
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Econometric Analysis of Panel Data
Microeconometric Modeling
Discrete Choice Modeling
Econometric Analysis of Panel Data
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2019 William Greene Department of Economics Stern School.
Microeconometric Modeling
Empirical Methods for Microeconomic Applications
Presentation transcript:

1/53: Topic 3.1 – Models for Ordered Choices Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William Greene Stern School of Business New York University New York NY USA 3.1 Models for Ordered Choices

2/53: Topic 3.1 – Models for Ordered Choices Concepts Ordered Choice Subjective Well Being Health Satisfaction Random Utility Fit Measures Normalization Threshold Values (Cutpoints0 Differential Item Functioning Anchoring Vignette Panel Data Incidental Parameters Problem Attrition Bias Inverse Probability Weighting Transition Matrix Models Ordered Probit and Logit Generalized Ordered Probit Hierarchical Ordered Probit Vignettes Fixed and Random Effects OPM Dynamic Ordered Probit Sample Selection OPM

3/53: Topic 3.1 – Models for Ordered Choices Ordered Discrete Outcomes  E.g.: Taste test, credit rating, course grade, preference scale  Underlying random preferences: Existence of an underlying continuous preference scale Mapping to observed choices  Strength of preferences is reflected in the discrete outcome  Censoring and discrete measurement  The nature of ordered data

4/53: Topic 3.1 – Models for Ordered Choices Ordered Choices at IMDb

5/53: Topic 3.1 – Models for Ordered Choices

6/53: Topic 3.1 – Models for Ordered Choices Health Satisfaction (HSAT) Self administered survey: Health Care Satisfaction (0 – 10) Continuous Preference Scale

7/53: Topic 3.1 – Models for Ordered Choices Modeling Ordered Choices  Random Utility (allowing a panel data setting) U it =  +  ’x it +  it = a it +  it  Observe outcome j if utility is in region j  Probability of outcome = probability of cell Pr[Y it =j] = F(  j – a it ) - F(  j-1 – a it )

8/53: Topic 3.1 – Models for Ordered Choices Ordered Probability Model

9/53: Topic 3.1 – Models for Ordered Choices Combined Outcomes for Health Satisfaction

10/53: Topic 3.1 – Models for Ordered Choices Ordered Probabilities

11/53: Topic 3.1 – Models for Ordered Choices

12/53: Topic 3.1 – Models for Ordered Choices Coefficients

13/53: Topic 3.1 – Models for Ordered Choices Partial Effects in the Ordered Choice Model Assume the β k is positive. Assume that x k increases. β’x increases. μ j - β’x shifts to the left for all 5 cells. Prob[y=0] decreases Prob[y=1] decreases – the mass shifted out is larger than the mass shifted in. Prob[y=3] increases – same reason in reverse. Prob[y=4] must increase. When β k > 0, increase in x k decreases Prob[y=0] and increases Prob[y=J]. Intermediate cells are ambiguous, but there is only one sign change in the marginal effects from 0 to 1 to … to J

14/53: Topic 3.1 – Models for Ordered Choices Partial Effects of 8 Years of Education

15/53: Topic 3.1 – Models for Ordered Choices An Ordered Probability Model for Health Satisfaction | Ordered Probability Model | | Dependent variable HSAT | | Number of observations | | Underlying probabilities based on Normal | | Cell frequencies for outcomes | | Y Count Freq Y Count Freq Y Count Freq | | | | | | | | | |Variable | Coefficient | Standard Error |b/St.Er.|P[|Z|>z] | Mean of X| Index function for probability Constant FEMALE EDUC AGE HHNINC HHKIDS Threshold parameters for index Mu(1) Mu(2) Mu(3) Mu(4) Mu(5) Mu(6) Mu(7) Mu(8) Mu(9)

16/53: Topic 3.1 – Models for Ordered Choices Ordered Probability Partial Effects | Marginal effects for ordered probability model | | M.E.s for dummy variables are Pr[y|x=1]-Pr[y|x=0] | | Names for dummy variables are marked by *. | |Variable | Coefficient | Standard Error |b/St.Er.|P[|Z|>z] | Mean of X| These are the effects on Prob[Y=00] at means. *FEMALE EDUC D AGE D HHNINC *HHKIDS These are the effects on Prob[Y=01] at means. *FEMALE EDUC D AGE D HHNINC *HHKIDS repeated for all 11 outcomes These are the effects on Prob[Y=10] at means. *FEMALE EDUC AGE HHNINC *HHKIDS

17/53: Topic 3.1 – Models for Ordered Choices Ordered Probit Marginal Effects

18/53: Topic 3.1 – Models for Ordered Choices Analysis of Model Implications  Partial Effects  Fit Measures  Predicted Probabilities Averaged: They match sample proportions. By observation Segments of the sample Related to particular variables

19/53: Topic 3.1 – Models for Ordered Choices Predictions from the Model Related to Age

20/53: Topic 3.1 – Models for Ordered Choices Fit Measures  There is no single “dependent variable” to explain.  There is no sum of squares or other measure of “variation” to explain.  Predictions of the model relate to a set of J+1 probabilities, not a single variable.  How to explain fit? Based on the underlying regression Based on the likelihood function Based on prediction of the outcome variable

21/53: Topic 3.1 – Models for Ordered Choices Log Likelihood Based Fit Measures

22/53: Topic 3.1 – Models for Ordered Choices

23/53: Topic 3.1 – Models for Ordered Choices A Somewhat Better Fit

24/53: Topic 3.1 – Models for Ordered Choices Different Normalizations  NLOGIT Y = 0,1,…,J, U* = α + β’x + ε One overall constant term, α J-1 “cutpoints;” μ -1 = -∞, μ 0 = 0, μ 1,… μ J-1, μ J = + ∞  Stata Y = 1,…,J+1, U* = β’x + ε No overall constant, α=0 J “cutpoints;” μ 0 = -∞, μ 1,… μ J, μ J+1 = + ∞

25/53: Topic 3.1 – Models for Ordered Choices

26/53: Topic 3.1 – Models for Ordered Choices

27/53: Topic 3.1 – Models for Ordered Choices Generalizing the Ordered Probit with Heterogeneous Thresholds

28/53: Topic 3.1 – Models for Ordered Choices Hierarchical Ordered Probit

29/53: Topic 3.1 – Models for Ordered Choices Ordered Choice Model

30/53: Topic 3.1 – Models for Ordered Choices HOPit Model

31/53: Topic 3.1 – Models for Ordered Choices Differential Item Functioning

32/53: Topic 3.1 – Models for Ordered Choices A Vignette Random Effects Model

33/53: Topic 3.1 – Models for Ordered Choices Vignettes

34/53: Topic 3.1 – Models for Ordered Choices

35/53: Topic 3.1 – Models for Ordered Choices

36/53: Topic 3.1 – Models for Ordered Choices

37/53: Topic 3.1 – Models for Ordered Choices Panel Data  Fixed Effects The usual incidental parameters problem Practically feasible but methodologically ambiguous Partitioning Prob(y it > j|x it ) produces estimable binomial logit models. (Find a way to combine multiple estimates of the same β.  Random Effects Standard application Extension to random parameters – see above

38/53: Topic 3.1 – Models for Ordered Choices Incidental Parameters Problem Table 9.1 Monte Carlo Analysis of the Bias of the MLE in Fixed Effects Discrete Choice Models (Means of empirical sampling distributions, N = 1,000 individuals, R = 200 replications)

39/53: Topic 3.1 – Models for Ordered Choices A Study of Health Status in the Presence of Attrition

40/53: Topic 3.1 – Models for Ordered Choices Model for Self Assessed Health  British Household Panel Survey (BHPS) Waves 1-8, Self assessed health on 0,1,2,3,4 scale Sociological and demographic covariates Dynamics – inertia in reporting of top scale  Dynamic ordered probit model Balanced panel – analyze dynamics Unbalanced panel – examine attrition

41/53: Topic 3.1 – Models for Ordered Choices Dynamic Ordered Probit Model It would not be appropriate to include h i,t-1 itself in the model as this is a label, not a measure

42/53: Topic 3.1 – Models for Ordered Choices Random Effects Dynamic Ordered Probit Model

43/53: Topic 3.1 – Models for Ordered Choices Data

44/53: Topic 3.1 – Models for Ordered Choices Variable of Interest

45/53: Topic 3.1 – Models for Ordered Choices Dynamics

46/53: Topic 3.1 – Models for Ordered Choices Attrition

47/53: Topic 3.1 – Models for Ordered Choices Testing for Attrition Bias Three dummy variables added to full model with unbalanced panel suggest presence of attrition effects.

48/53: Topic 3.1 – Models for Ordered Choices Probability Weighting Estimators  A Patch for Attrition  (1) Fit a participation probit equation for each wave.  (2) Compute p(i,t) = predictions of participation for each individual in each period. Special assumptions needed to make this work  Ignore common effects and fit a weighted pooled log likelihood: Σ i Σ t [d it /p(i,t)]logLP it.

49/53: Topic 3.1 – Models for Ordered Choices Attrition Model with IP Weights Assumes (1) Prob(attrition|all data) = Prob(attrition|selected variables) (ignorability) (2) Attrition is an ‘absorbing state.’ No reentry. Obviously not true for the GSOEP data above. Can deal with point (2) by isolating a subsample of those present at wave 1 and the monotonically shrinking subsample as the waves progress.

50/53: Topic 3.1 – Models for Ordered Choices Estimated Partial Effects by Model

51/53: Topic 3.1 – Models for Ordered Choices Partial Effect for a Category These are 4 dummy variables for state in the previous period. Using first differences, the estimated for SAHEX means transition from EXCELLENT in the previous period to GOOD in the previous period, where GOOD is the omitted category. Likewise for the other 3 previous state variables. The margin from ‘POOR’ to ‘GOOD’ was not interesting in the paper. The better margin would have been from EXCELLENT to POOR, which would have (EX,POOR) change from (1,0) to (0,1).

52/53: Topic 3.1 – Models for Ordered Choices Ordered Choice Model Extensions

53/53: Topic 3.1 – Models for Ordered Choices Model Extensions  Multivariate Bivariate Multivariate  Inflation and Two Part Zero inflation Sample Selection Endogenous Latent Class

54/53: Topic 3.1 – Models for Ordered Choices Generalizing the Ordered Probit with Heterogeneous Thresholds

55/53: Topic 3.1 – Models for Ordered Choices Generalized Ordered Probit-1 Y=Grade (rank) Z=Sex, Race X=Experience, Education, Training, History, Marital Status, Age

56/53: Topic 3.1 – Models for Ordered Choices Generalized Ordered Probit-2

57/53: Topic 3.1 – Models for Ordered Choices A G.O.P Model |Variable | Coefficient | Standard Error |b/St.Er.|P[|Z|>z] | Mean of X| Index function for probability Constant AGE LOGINC EDUC MARRIED Estimates of t(j) in mu(j)=exp[t(j)+d*z] Theta(1) Theta(2) Theta(3) Theta(4) Threshold covariates mu(j)=exp[t(j)+d*z] FEMALE How do we interpret the result for FEMALE?

58/53: Topic 3.1 – Models for Ordered Choices Hierarchical Ordered Probit

59/53: Topic 3.1 – Models for Ordered Choices Ordered Choice Model

60/53: Topic 3.1 – Models for Ordered Choices HOPit Model

61/53: Topic 3.1 – Models for Ordered Choices A Sample Selection Model

62/53: Topic 3.1 – Models for Ordered Choices Zero Inflated Ordered Probit

63/53: Topic 3.1 – Models for Ordered Choices Teenage Smoking

64/53: Topic 3.1 – Models for Ordered Choices A Bivariate Latent Class Correlated Generalised Ordered Probit Model with an Application to Modelling Observed Obesity Levels William Greene Stern School of Business, New York University With Mark Harris, Bruce Hollingsworth, Pushkar Maitra Monash University Stern Economics Working Paper Forthcoming, Economics Letters, 2014

65/53: Topic 3.1 – Models for Ordered Choices Obesity  The International Obesity Taskforce ( calls obesity one of the most important medical and public health problems of our time.  Defined as a condition of excess body fat; associated with a large number of debilitating and life-threatening disorders  Health experts argue that given an individual’s height, their weight should lie within a certain range Most common measure = Body Mass Index (BMI): Weight (Kg)/height(Meters) 2  WHO guidelines: BMI < 18.5 are underweight 18.5 < BMI < 25 are normal 25 < BMI < 30 are overweight BMI > 30 are obese  Around 300 million people worldwide are obese, a figure likely to rise

66/53: Topic 3.1 – Models for Ordered Choices Models for BMI Simple Regression Approach Based on Actual BMI: BMI* = ′x + ,  ~ N[0, 2 ] No accommodation of heterogeneity Rigid measurement by the guidelines Interval Censored Regression Approach WT = 0 if BMI* < 25 Normal 1 if 25 < BMI* < 30 Overweight 2 if BMI* > 30 Obese Inadequate accommodation of heterogeneity Inflexible reliance on WHO classification

67/53: Topic 3.1 – Models for Ordered Choices An Ordered Probit Approach A Latent Regression Model for “True BMI” BMI* = ′x + ,  ~ N[0,σ 2 ], σ 2 = 1 “True BMI” = a proxy for weight is unobserved Observation Mechanism for Weight Type WT = 0 if BMI* < 0 Normal 1 if 0 < BMI* <  Overweight 2 if BMI* >  Obese

68/53: Topic 3.1 – Models for Ordered Choices A Basic Ordered Probit Model

69/53: Topic 3.1 – Models for Ordered Choices Latent Class Modeling  Irrespective of observed weight category, individuals can be thought of being in one of several ‘types’ or ‘classes. e.g. an obese individual may be so due to genetic reasons or due to lifestyle factors  These distinct sets of individuals likely to have differing reactions to various policy tools and/or characteristics  The observer does not know from the data which class an individual is in.  Suggests use of a latent class approach Growing use in explaining health outcomes (Deb and Trivedi, 2002, and Bago d’Uva, 2005)

70/53: Topic 3.1 – Models for Ordered Choices A Latent Class Model For modeling purposes, class membership is distributed with a discrete distribution, Prob(individual i is a member of class = c) =  ic =  c Prob(WT i = j | x i ) = Σ c Prob(WT i = j | x i,class = c)Prob(class = c).

71/53: Topic 3.1 – Models for Ordered Choices Probabilities in the Latent Class Model

72/53: Topic 3.1 – Models for Ordered Choices Class Assignment Class membership may relate to demographics such as age and sex.

73/53: Topic 3.1 – Models for Ordered Choices Generalized Ordered Probit – Latent Classes and Variable Thresholds

74/53: Topic 3.1 – Models for Ordered Choices Data  US National Health Interview Survey (2005); conducted by the National Centre for Health Statistics  Information on self-reported height and weight levels, BMI levels  Demographic information  Remove those underweight  Split sample (30,000+) by gender

75/53: Topic 3.1 – Models for Ordered Choices Model Components  x: determines observed weight levels within classes For observed weight levels we use lifestyle factors such as marital status and exercise levels  z: determines latent classes For latent class determination we use genetic proxies such as age, gender and ethnicity: the things we can’t change  w: determines position of boundary parameters within classes For the boundary parameters we have: weight-training intensity and age (BMI inappropriate for the aged?) pregnancy (small numbers and length of term unknown)

76/53: Topic 3.1 – Models for Ordered Choices Correlation Between Classes and Regression  Outcome Model (BMI*|class = c) =  c ′x +  c,  c ~ N[0,1] WT|class=c = 0 if BMI*|class = c < 0 1 if 0 < BMI*|class = c <  c 2 if BMI*|class = c >  c.  Threshold|class = c:  c = exp( c + γ c ′r)  Class Assignment c* = ′w + u, u ~ N[0,1]. c = 0 if c* < 0 1 if c* > 0.  Endogenous Class Assignment ( c,u) ~ N 2 [(0,0),(1, c,1)]