[Part 2] 1/86 Discrete Choice Modeling Binary Choice Models Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.

Slides:

Advertisements

Similar presentations

Tests of Static Asset Pricing Models

Advertisements

Econometrics I Professor William Greene Stern School of Business

Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.

Discrete Choice Modeling William Greene Stern School of Business New York University.

Part 17: Nonlinear Regression 17-1/26 Econometrics I Professor William Greene Stern School of Business Department of Economics.

Discrete Choice Modeling William Greene Stern School of Business IFS at UCL February 11-13, 2004

3. Binary Choice – Inference. Hypothesis Testing in Binary Choice Models.

12. Random Parameters Logit Models. Random Parameters Model Allow model parameters as well as constants to be random Allow multiple observations with.

[Part 1] 1/15 Discrete Choice Modeling Econometric Methodology Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.

Econometrics I Professor William Greene Stern School of Business

Part 15: Binary Choice [ 1/121] Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business.

1/62: Topic 2.3 – Panel Data Binary Choice Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA.

Part 21: Generalized Method of Moments 21-1/67 Econometrics I Professor William Greene Stern School of Business Department of Economics.

Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.

Discrete Choice Modeling William Greene Stern School of Business New York University Lab Sessions.

Discrete Choice Modeling William Greene Stern School of Business New York University.

2. Binary Choice Estimation. Modeling Binary Choice.

Econometric Methodology. The Sample and Measurement Population Measurement Theory Characteristics Behavior Patterns Choices.

Discrete Choice Modeling William Greene Stern School of Business New York University.

[Topic 6-Nonlinear Models] 1/87 Discrete Choice Modeling William Greene Stern School of Business New York University.

Discrete Choice Modeling William Greene Stern School of Business New York University.

Empirical Methods for Microeconomic Applications William Greene Department of Economics Stern School of Business.

Discrete Choice Modeling William Greene Stern School of Business New York University.

Discrete Choice Modeling William Greene Stern School of Business New York University.

Discrete Choice Modeling William Greene Stern School of Business New York University.

Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.

[Part 4] 1/43 Discrete Choice Modeling Bivariate & Multivariate Probit Discrete Choice Modeling William Greene Stern School of Business New York University.

Maximum Likelihood Estimation Methods of Economic Investigation Lecture 17.

Discrete Choice Modeling William Greene Stern School of Business New York University.

Discrete Choice Modeling William Greene Stern School of Business New York University.

Discrete Choice Modeling William Greene Stern School of Business New York University.

1/69: Topic Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New.

Discrete Choice Modeling William Greene Stern School of Business New York University.

1/62: Topic 2.3 – Panel Data Binary Choice Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA.

Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.

Discrete Choice Modeling William Greene Stern School of Business New York University.

M.Sc. in Economics Econometrics Module I Topic 4: Maximum Likelihood Estimation Carol Newman.

Discrete Choice Modeling William Greene Stern School of Business New York University.

[Part 5] 1/43 Discrete Choice Modeling Ordered Choice Models Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.

Discrete Choice Modeling William Greene Stern School of Business New York University.

The Probit Model Alexander Spermann University of Freiburg SS 2008.

Applied Econometrics William Greene Department of Economics Stern School of Business.

1/26: Topic 2.2 – Nonlinear Panel Data Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William.

5. Extensions of Binary Choice Models

The Probit Model Alexander Spermann University of Freiburg SoSe 2009

Microeconometric Modeling

Microeconometric Modeling

William Greene Stern School of Business New York University

William Greene Stern School of Business New York University

Econometric Analysis of Panel Data

Discrete Choice Modeling

Microeconometric Modeling

Microeconometric Modeling

Microeconometric Modeling

Econometric Analysis of Panel Data

Microeconometric Modeling

Microeconometric Modeling

Microeconometric Modeling

Microeconometric Modeling

Econometric Analysis of Panel Data

Microeconometric Modeling

Microeconometric Modeling

William Greene Stern School of Business New York University

Econometrics I Professor William Greene Stern School of Business

Microeconometric Modeling

Microeconometric Modeling

Microeconometric Modeling

Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2019 William Greene Department of Economics Stern School.

Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2019 William Greene Department of Economics Stern School.

Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2019 William Greene Department of Economics Stern School.

Presentation transcript:

[Part 2] 1/86 Discrete Choice Modeling Binary Choice Models Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction 1Summary 2Binary Choice 3Panel Data 4Bivariate Probit 5Ordered Choice 6Count Data 7Multinomial Choice 8Nested Logit 9Heterogeneity 10Latent Class 11Mixed Logit 12Stated Preference 13Hybrid Choice

[Part 2] 2/86 Discrete Choice Modeling Binary Choice Models A Random Utility Approach  Underlying Preference Scale, U*(choices)  Revelation of Preferences: U*(choices) < 0 Choice “0” U*(choices) > 0 Choice “1”

[Part 2] 3/86 Discrete Choice Modeling Binary Choice Models Binary Outcome: Visit Doctor

[Part 2] 4/86 Discrete Choice Modeling Binary Choice Models A Model for Binary Choice  Yes or No decision (Buy/NotBuy, Do/NotDo)  Example, choose to visit physician or not  Model: Net utility of visit at least once U visit =  +  1 Age +  2 Income +  Sex +  Choose to visit if net utility is positive Net utility = U visit – U not visit  Data: X = [1,age,income,sex] y = 1 if choose visit,  U visit > 0, 0 if not. Random Utility

[Part 2] 5/86 Discrete Choice Modeling Binary Choice Models Modeling the Binary Choice Net Utility = U visit – U not visit Normalize U not visit = 0 Net U visit =  +  1 Age +  2 Income +  3 Sex +  Chooses to visit: U visit > 0  +  1 Age +  2 Income +  3 Sex +  > 0  > -[  +  1 Age +  2 Income +  3 Sex ] Choosing Between the Two Alternatives

[Part 2] 6/86 Discrete Choice Modeling Binary Choice Models Probability Model for Choice Between Two Alternatives  > -[  +  1 Age +  2 Income +  3 Sex ] Probability is governed by , the random part of the utility function.

[Part 2] 7/86 Discrete Choice Modeling Binary Choice Models Application 27,326 Observations 1 to 7 years, panel 7,293 households observed We use the 1994 year, 3,337 household observations

[Part 2] 8/86 Discrete Choice Modeling Binary Choice Models Binary Choice Data

[Part 2] 9/86 Discrete Choice Modeling Binary Choice Models An Econometric Model  Choose to visit iff U visit > 0 U visit =  +  1 Age +  2 Income +  3 Sex +  U visit > 0   > -(  +  1 Age +  2 Income +  3 Sex)  <  +  1 Age +  2 Income +  3 Sex  Probability model: For any person observed by the analyst, Prob(visit) = Prob[  <  +  1 Age +  2 Income +  3 Sex ]  Note the relationship between the unobserved  and the outcome

[Part 2] 10/86 Discrete Choice Modeling Binary Choice Models  +  1 Age +  2 Income +  3 Sex

[Part 2] 11/86 Discrete Choice Modeling Binary Choice Models Modeling Approaches  Nonparametric – “relationship” Minimal Assumptions Minimal Conclusions  Semiparametric – “index function” Stronger assumptions Robust to model misspecification (heteroscedasticity) Still weak conclusions  Parametric – “Probability function and index” Strongest assumptions – complete specification Strongest conclusions Possibly less robust. (Not necessarily)  The linear probability “model”

[Part 2] 12/86 Discrete Choice Modeling Binary Choice Models Nonparametric Regressions P(Visit)=f(Income) P(Visit)=f(Age)

[Part 2] 13/86 Discrete Choice Modeling Binary Choice Models Klein and Spady Semiparametric No specific distribution assumed Note necessary normalizations. Coefficients are relative to FEMALE. Prob(y i = 1 | x i ) = G(  ’x) G is estimated by kernel methods

[Part 2] 14/86 Discrete Choice Modeling Binary Choice Models Fully Parametric  Index Function: U* = β’x + ε  Observation Mechanism: y = 1[U* > 0]  Distribution: ε ~ f(ε); Normal, Logistic, …  Maximum Likelihood Estimation: Max(β) logL = Σ i log Prob(Y i = y i |x i )

[Part 2] 15/86 Discrete Choice Modeling Binary Choice Models Parametric: Logit Model What do these mean?

[Part 2] 16/86 Discrete Choice Modeling Binary Choice Models Parametric Model Estimation  How to estimate ,  1,  2,  3 ? The technique of maximum likelihood Prob[y=1] = Prob[  > -(  +  1 Age +  2 Income +  3 Sex )] Prob[y=0] = 1 - Prob[y=1]  Requires a model for the probability

[Part 2] 17/86 Discrete Choice Modeling Binary Choice Models Completing the Model: F(  )  The distribution Normal: PROBIT, natural for behavior Logistic: LOGIT, allows “thicker tails” Gompertz: EXTREME VALUE, asymmetric Others…  Does it matter? Yes, large difference in estimates Not much, quantities of interest are more stable.

[Part 2] 18/86 Discrete Choice Modeling Binary Choice Models Estimated Binary Choice Models Log-L(0) = log likelihood for a model that has only a constant term. Ignore the t ratios for now.

[Part 2] 19/86 Discrete Choice Modeling Binary Choice Models  +  1 ( Age+1 ) +  2 ( Income ) +  3 Sex Effect on Predicted Probability of an Increase in Age (  1 is positive)

[Part 2] 20/86 Discrete Choice Modeling Binary Choice Models Partial Effects in Probability Models  Prob[Outcome] = some F(  +  1 Income…)  “Partial effect” =  F(  +  1 Income…) /  ”x” (derivative) Partial effects are derivatives Result varies with model  Logit:  F(  +  1 Income…) /  x = Prob * (1-Prob)    Probit:  F(  +  1 Income…)/  x = Normal density    Extreme Value:  F(  +  1 Income…)/  x = Prob * (-log Prob)   Scaling usually erases model differences

[Part 2] 21/86 Discrete Choice Modeling Binary Choice Models Estimated Partial Effects

[Part 2] 22/86 Discrete Choice Modeling Binary Choice Models Partial Effect for a Dummy Variable  Prob[y i = 1|x i,d i ] = F(  ’x i +  d i ) = conditional mean  Partial effect of d Prob[y i = 1|x i, d i =1] - Prob[y i = 1|x i, d i =0]  Probit:

[Part 2] 23/86 Discrete Choice Modeling Binary Choice Models Partial Effect – Dummy Variable

[Part 2] 24/86 Discrete Choice Modeling Binary Choice Models Computing Partial Effects  Compute at the data means? Simple Inference is well defined.  Average the individual effects More appropriate? Asymptotic standard errors are complicated.

[Part 2] 25/86 Discrete Choice Modeling Binary Choice Models Average Partial Effects

[Part 2] 26/86 Discrete Choice Modeling Binary Choice Models Average Partial Effects vs. Partial Effects at Data Means

[Part 2] 27/86 Discrete Choice Modeling Binary Choice Models Practicalities of Nonlinearities PROBIT; Lhs=doctor ; Rhs=one,age,agesq,income,female ; Partial effects $ PROBIT ; Lhs=doctor ; Rhs=one,age,age*age,income,female $ PARTIALS ; Effects : age $ The software does not know that agesq = age 2. The software knows that age * age is age 2.

[Part 2] 28/86 Discrete Choice Modeling Binary Choice Models Partial Effect for Nonlinear Terms

[Part 2] 29/86 Discrete Choice Modeling Binary Choice Models Average Partial Effect: Averaged over Sample Incomes and Genders for Specific Values of Age

[Part 2] 30/86 Discrete Choice Modeling Binary Choice Models The Linear Probability “Model”

[Part 2] 31/86 Discrete Choice Modeling Binary Choice Models The Dependent Variable equals zero for 98.9% of the observations. In the sample of 163,474 observations, the LHS variable equals 1 about 1,500 times.

[Part 2] 32/86 Discrete Choice Modeling Binary Choice Models 2SLS for a binary dependent variable.

[Part 2] 33/86 Discrete Choice Modeling Binary Choice Models

[Part 2] 34/86 Discrete Choice Modeling Binary Choice Models

[Part 2] 35/86 Discrete Choice Modeling Binary Choice Models 1.7% of the observations are > 20 DV = 1(DocVis > 20)

[Part 2] 36/86 Discrete Choice Modeling Binary Choice Models What does OLS Estimate? MLE Average Partial Effects OLS Coefficients

[Part 2] 37/86 Discrete Choice Modeling Binary Choice Models

[Part 2] 38/86 Discrete Choice Modeling Binary Choice Models Negative Predicted Probabilities

[Part 2] 39/86 Discrete Choice Modeling Binary Choice Models Measuring Fit

[Part 2] 40/86 Discrete Choice Modeling Binary Choice Models How Well Does the Model Fit?  There is no R squared. Least squares for linear models is computed to maximize R 2 There are no residuals or sums of squares in a binary choice model The model is not computed to optimize the fit of the model to the data  How can we measure the “fit” of the model to the data? “Fit measures” computed from the log likelihood  “Pseudo R squared” = 1 – logL/logL0  Also called the “likelihood ratio index”  Others… - these do not measure fit. Direct assessment of the effectiveness of the model at predicting the outcome

[Part 2] 41/86 Discrete Choice Modeling Binary Choice Models Log Likelihoods  logL = ∑ i log density (y i |x i,β)  For probabilities Density is a probability Log density is < 0 LogL is < 0  For other models, log density can be positive or negative. For linear regression, logL=-N/2(1+log2π+log(e’e/N)] Positive if s 2 <

[Part 2] 42/86 Discrete Choice Modeling Binary Choice Models Likelihood Ratio Index

[Part 2] 43/86 Discrete Choice Modeling Binary Choice Models The Likelihood Ratio Index  Bounded by 0 and 1-ε  Rises when the model is expanded  Values between 0 and 1 have no meaning  Can be strikingly low.  Should not be used to compare models Use logL Use information criteria to compare nonnested models

[Part 2] 44/86 Discrete Choice Modeling Binary Choice Models Fit Measures Based on LogL Binary Logit Model for Binary Choice Dependent variable DOCTOR Log likelihood function Full model LogL Restricted log likelihood Constant term only LogL0 Chi squared [ 5 d.f.] Significance level McFadden Pseudo R-squared – LogL/logL0 Estimation based on N = 3377, K = 6 Information Criteria: Normalization=1/N Normalized Unnormalized AIC LogL + 2K Fin.Smpl.AIC LogL + 2K + 2K(K+1)/(N-K-1) Bayes IC LogL + KlnN Hannan Quinn LogL + 2Kln(lnN) Variable| Coefficient Standard Error b/St.Er. P[|Z|>z] Mean of X |Characteristics in numerator of Prob[Y = 1] Constant| *** AGE| *** AGESQ|.00154*** INCOME| AGE_INC| FEMALE|.65366***

[Part 2] 45/86 Discrete Choice Modeling Binary Choice Models Fit Measures Based on Predictions  Computation Use the model to compute predicted probabilities Use the model and a rule to compute predicted y = 0 or 1  Fit measure compares predictions to actuals

[Part 2] 46/86 Discrete Choice Modeling Binary Choice Models Predicting the Outcome  Predicted probabilities P = F(a + b 1 Age + b 2 Income + b 3 Female+…)  Predicting outcomes Predict y=1 if P is “large” Use 0.5 for “large” (more likely than not) Generally, use  Count successes and failures

[Part 2] 47/86 Discrete Choice Modeling Binary Choice Models Cramer Fit Measure | Fit Measures Based on Model Predictions| | Efron =.04825| | Ben Akiva and Lerman =.57139| | Veall and Zimmerman =.08365| | Cramer =.04771|

[Part 2] 48/86 Discrete Choice Modeling Binary Choice Models Hypothesis Testing in Binary Choice Models

[Part 2] 49/86 Discrete Choice Modeling Binary Choice Models Covariance Matrix for the MLE

[Part 2] 50/86 Discrete Choice Modeling Binary Choice Models Simplifications

[Part 2] 51/86 Discrete Choice Modeling Binary Choice Models Robust Covariance Matrix

[Part 2] 52/86 Discrete Choice Modeling Binary Choice Models Robust Covariance Matrix for Logit Model Variable| Coefficient Standard Error b/St.Er. P[|Z|>z] Mean of X |Robust Standard Errors Constant| *** AGE| *** AGESQ|.00154*** INCOME| AGE_INC| FEMALE|.65366*** |Conventional Standard Errors Based on Second Derivatives Constant| *** AGE| *** AGESQ|.00154*** INCOME| AGE_INC| FEMALE|.65366***

[Part 2] 53/86 Discrete Choice Modeling Binary Choice Models Base Model for Hypothesis Tests Binary Logit Model for Binary Choice Dependent variable DOCTOR Log likelihood function Restricted log likelihood Chi squared [ 5 d.f.] Significance level McFadden Pseudo R-squared Estimation based on N = 3377, K = 6 Information Criteria: Normalization=1/N Normalized Unnormalized AIC Variable| Coefficient Standard Error b/St.Er. P[|Z|>z] Mean of X |Characteristics in numerator of Prob[Y = 1] Constant| *** AGE| *** AGESQ|.00154*** INCOME| AGE_INC| FEMALE|.65366*** H 0 : Age is not a significant determinant of Prob(Doctor = 1) H 0 : β 2 = β 3 = β 5 = 0

[Part 2] 54/86 Discrete Choice Modeling Binary Choice Models Likelihood Ratio Tests  Null hypothesis restricts the parameter vector  Alternative relaxes the restriction  Test statistic: Chi-squared = 2 (LogL|Unrestricted model – LogL|Restrictions) > 0 Degrees of freedom = number of restrictions

[Part 2] 55/86 Discrete Choice Modeling Binary Choice Models LR Test of H 0 RESTRICTED MODEL Binary Logit Model for Binary Choice Dependent variable DOCTOR Log likelihood function Restricted log likelihood Chi squared [ 2 d.f.] Significance level McFadden Pseudo R-squared Estimation based on N = 3377, K = 3 Information Criteria: Normalization=1/N Normalized Unnormalized AIC UNRESTRICTED MODEL Binary Logit Model for Binary Choice Dependent variable DOCTOR Log likelihood function Restricted log likelihood Chi squared [ 5 d.f.] Significance level McFadden Pseudo R-squared Estimation based on N = 3377, K = 6 Information Criteria: Normalization=1/N Normalized Unnormalized AIC Chi squared[3] = 2[ ( )] =

[Part 2] 56/86 Discrete Choice Modeling Binary Choice Models Wald Test  Unrestricted parameter vector is estimated  Discrepancy: q= Rb – m (or r(b,m) if nonlinear) is computed  Variance of discrepancy is estimated: Var[q] = R V R ’  Wald Statistic is q’[Var(q)] -1 q = q’[RVR’] -1 q

[Part 2] 57/86 Discrete Choice Modeling Binary Choice Models Chi squared[3] = Wald Test

[Part 2] 58/86 Discrete Choice Modeling Binary Choice Models Lagrange Multiplier Test  Restricted model is estimated  Derivatives of unrestricted model and variances of derivatives are computed at restricted estimates  Wald test of whether derivatives are zero tests the restrictions  Usually hard to compute – difficult to program the derivatives and their variances.

[Part 2] 59/86 Discrete Choice Modeling Binary Choice Models LM Test for a Logit Model  Compute b 0 (subject to restictions) (e.g., with zeros in appropriate positions.  Compute P i (b 0 ) for each observation.  Compute e i (b 0 ) = [y i – P i (b 0 )]  Compute g i (b 0 ) = x i e i using full x i vector  LM = [Σ i g i (b 0 )]’[Σ i g i (b 0 )g i (b 0 )] -1 [Σ i g i (b 0 )]

[Part 2] 60/86 Discrete Choice Modeling Binary Choice Models

[Part 2] 61/86 Discrete Choice Modeling Binary Choice Models

[Part 2] 62/86 Discrete Choice Modeling Binary Choice Models

[Part 2] 63/86 Discrete Choice Modeling Binary Choice Models

[Part 2] 64/86 Discrete Choice Modeling Binary Choice Models Inference About Partial Effects

[Part 2] 65/86 Discrete Choice Modeling Binary Choice Models Marginal Effects for Binary Choice

[Part 2] 66/86 Discrete Choice Modeling Binary Choice Models The Delta Method

[Part 2] 67/86 Discrete Choice Modeling Binary Choice Models Computing Effects  Compute at the data means? Simple Inference is well defined  Average the individual effects More appropriate? Asymptotic standard errors more complicated.  Is testing about marginal effects meaningful? f(b’x) must be > 0; b is highly significant How could f(b’x)*b equal zero?

[Part 2] 68/86 Discrete Choice Modeling Binary Choice Models APE vs. Partial Effects at the Mean

[Part 2] 69/86 Discrete Choice Modeling Binary Choice Models Method of Krinsky and Robb Estimate β by Maximum Likelihood with b Estimate asymptotic covariance matrix with V Draw R observations b(r) from the normal population N[b,V] b(r) = b + C*v(r), v(r) drawn from N[0,I] C = Cholesky matrix, V = CC’ Compute partial effects d(r) using b(r) Compute the sample variance of d(r),r=1,…,R Use the sample standard deviations of the R observations to estimate the sampling standard errors for the partial effects.

[Part 2] 70/86 Discrete Choice Modeling Binary Choice Models Krinsky and Robb Delta Method

[Part 2] 71/86 Discrete Choice Modeling Binary Choice Models Partial Effect for Nonlinear Terms

[Part 2] 72/86 Discrete Choice Modeling Binary Choice Models Average Partial Effect: Averaged over Sample Incomes and Genders for Specific Values of Age

[Part 2] 73/86 Discrete Choice Modeling Binary Choice Models Endogeneity

[Part 2] 74/86 Discrete Choice Modeling Binary Choice Models Endogenous RHS Variable  U* = β’x + θh + ε y = 1[U* > 0] E[ε|h] ≠ 0 (h is endogenous) Case 1: h is continuous Case 2: h is binary = a treatment effect  Approaches Parametric: Maximum Likelihood Semiparametric (not developed here):  GMM  Various approaches for case 2

[Part 2] 75/86 Discrete Choice Modeling Binary Choice Models Endogenous Continuous Variable U* = β’x + θh + ε y = 1[U* > 0]  h = α’z + u E[ε|h] ≠ 0  Cov[u, ε] ≠ 0 Additional Assumptions: (u,ε) ~ N[(0,0),(σ u 2, ρσ u, 1)] z = a valid set of exogenous variables, uncorrelated with (u, ε) Correlation = ρ. This is the source of the endogeneity This is not IV estimation. Z may be uncorrelated with X without problems.

[Part 2] 76/86 Discrete Choice Modeling Binary Choice Models Endogenous Income 0 = Not Healthy 1 = Healthy Healthy = 0 or 1 Age, Married, Kids, Gender, Income Determinants of Income (observed and unobserved) also determine health satisfaction. Income responds to Age, Age 2, Educ, Married, Kids, Gender

[Part 2] 77/86 Discrete Choice Modeling Binary Choice Models Estimation by ML (Control Function)

[Part 2] 78/86 Discrete Choice Modeling Binary Choice Models Two Approaches to ML

[Part 2] 79/86 Discrete Choice Modeling Binary Choice Models FIML Estimates Probit with Endogenous RHS Variable Dependent variable HEALTHY Log likelihood function Variable| Coefficient Standard Error b/St.Er. P[|Z|>z] Mean of X |Coefficients in Probit Equation for HEALTHY Constant| *** AGE| *** MARRIED| HHKIDS|.06932*** FEMALE| *** INCOME|.53778*** |Coefficients in Linear Regression for INCOME Constant| *** AGE|.02159*** AGESQ| *** D EDUC|.02064*** MARRIED|.07783*** HHKIDS| *** FEMALE|.00413** |Standard Deviation of Regression Disturbances Sigma(w)|.16445*** |Correlation Between Probit and Regression Disturbances Rho(e,w)|

[Part 2] 80/86 Discrete Choice Modeling Binary Choice Models Partial Effects: Scaled Coefficients

[Part 2] 81/86 Discrete Choice Modeling Binary Choice Models Partial Effects The scale factor is computed using the model coefficients, means of the variables and 35,000 draws from the standard normal population. θ =

[Part 2] 82/86 Discrete Choice Modeling Binary Choice Models Endogenous Binary Variable U* = β’x + θh + ε y = 1[U* > 0] h* = α’z + u h = 1[h* > 0] E[ε|h*] ≠ 0  Cov[u, ε] ≠ 0 Additional Assumptions: (u,ε) ~ N[(0,0),(σ u 2, ρσ u, 1)] z = a valid set of exogenous variables, uncorrelated with (u, ε) Correlation = ρ. This is the source of the endogeneity  This is not IV estimation. Z may be uncorrelated with X without problems.

[Part 2] 83/86 Discrete Choice Modeling Binary Choice Models Endogenous Binary Variable Doctor = F(age,age 2,income,female,Public)Public = F(age,educ,income,married,kids,female)

[Part 2] 84/86 Discrete Choice Modeling Binary Choice Models FIML Estimates FIML Estimates of Bivariate Probit Model Dependent variable DOCPUB Log likelihood function Estimation based on N = 27326, K = Variable| Coefficient Standard Error b/St.Er. P[|Z|>z] Mean of X |Index equation for DOCTOR Constant|.59049*** AGE| *** AGESQ|.00082*** D INCOME|.08883* FEMALE|.34583*** PUBLIC|.43533*** |Index equation for PUBLIC Constant| *** AGE| EDUC| *** INCOME| *** MARRIED| HHKIDS| *** FEMALE|.12139*** |Disturbance correlation RHO(1,2)| ***

[Part 2] 85/86 Discrete Choice Modeling Binary Choice Models Partial Effects

[Part 2] 86/86 Discrete Choice Modeling Binary Choice Models Identification Issues  Exclusions are not needed for estimation  Identification is, in principle, by “functional form”  Researchers usually have a variable in the treatment equation that is not in the main probit equation “to improve identification”  A fully simultaneous model y1 = f(x1,y2), y2 = f(x2,y1) Not identified even with exclusion restrictions (Model is “incoherent”)