3. Binary Choice – Inference. Hypothesis Testing in Binary Choice Models.

Slides:



Advertisements
Similar presentations
Tests of Static Asset Pricing Models
Advertisements

Econometrics I Professor William Greene Stern School of Business
Part 12: Asymptotics for the Regression Model 12-1/39 Econometrics I Professor William Greene Stern School of Business Department of Economics.
12. Random Parameters Logit Models. Random Parameters Model Allow model parameters as well as constants to be random Allow multiple observations with.
[Part 1] 1/15 Discrete Choice Modeling Econometric Methodology Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
Econometrics I Professor William Greene Stern School of Business
Part 15: Binary Choice [ 1/121] Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business.
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.
Discrete Choice Modeling William Greene Stern School of Business New York University Lab Sessions.
Discrete Choice Modeling William Greene Stern School of Business New York University.
2. Binary Choice Estimation. Modeling Binary Choice.
Review Guess the correlation. A.-2.0 B.-0.9 C.-0.1 D.0.1 E.0.9.
Econometric Methodology. The Sample and Measurement Population Measurement Theory Characteristics Behavior Patterns Choices.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Empirical Methods for Microeconomic Applications William Greene Department of Economics Stern School of Business.
Discrete Choice Modeling William Greene Stern School of Business New York University.
- Interfering factors in the comparison of two sample means using unpaired samples may inflate the pooled estimate of variance of test results. - It is.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Microeconometric Modeling William Greene Stern School of Business New York University.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
Maximum Likelihood Estimation Methods of Economic Investigation Lecture 17.
L. Liu PM Outreach, USyd.1 Survey Analysis. L. Liu PM Outreach, USyd.2 Types of research Descriptive Exploratory Evaluative.
Discrete Choice Modeling William Greene Stern School of Business New York University.
1/62: Topic 2.3 – Panel Data Binary Choice Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA.
Discrepancy between Data and Fit. Introduction What is Deviance? Deviance for Binary Responses and Proportions Deviance as measure of the goodness of.
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
[Topic 1-Regression] 1/37 1. Descriptive Tools, Regression, Panel Data.
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
Discrete Choice Modeling William Greene Stern School of Business New York University.
M.Sc. in Economics Econometrics Module I Topic 4: Maximum Likelihood Estimation Carol Newman.
Discrete Choice Modeling William Greene Stern School of Business New York University.
[Part 2] 1/86 Discrete Choice Modeling Binary Choice Models Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
Nonparametric Statistics
1/61: Topic 1.2 – Extensions of the Linear Regression Model Microeconometric Modeling William Greene Stern School of Business New York University New York.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
The Probit Model Alexander Spermann University of Freiburg SS 2008.
Applied Econometrics William Greene Department of Economics Stern School of Business.
1/26: Topic 2.2 – Nonlinear Panel Data Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William.
5. Extensions of Binary Choice Models
The Probit Model Alexander Spermann University of Freiburg SoSe 2009
Microeconometric Modeling
Nonparametric Statistics
Microeconometric Modeling
Econometric Analysis of Panel Data
Discrete Choice Modeling
Microeconometric Modeling
Discrete Choice Modeling
Microeconometric Modeling
Microeconometric Modeling
Nonparametric Statistics
Econometric Analysis of Panel Data
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Chengyuan Yin School of Mathematics
Tutorial 1: Misspecification
Econometric Analysis of Panel Data
Microeconometric Modeling
William Greene Stern School of Business New York University
Econometrics I Professor William Greene Stern School of Business
Microeconometric Modeling
Microeconometric Modeling
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2019 William Greene Department of Economics Stern School.
Statistical Inference for the Mean: t-test
Econometrics I Professor William Greene Stern School of Business
Microeconometric Modeling
Presentation transcript:

3. Binary Choice – Inference

Hypothesis Testing in Binary Choice Models

Hypothesis Tests Restrictions: Linear or nonlinear functions of the model parameters Structural ‘change’: Constancy of parameters Specification Tests: Model specification: distribution Heteroscedasticity: Generally parametric

Hypothesis Testing There is no F statistic Comparisons of Likelihood Functions: Likelihood Ratio Tests Distance Measures: Wald Statistics Lagrange Multiplier Tests

Requires an Estimator of the Covariance Matrix for b

Robust Covariance Matrix(?)

The Robust Matrix is Not Robust To: Heteroscedasticity Correlation across observations Omitted heterogeneity Omitted variables (even if orthogonal) Wrong distribution assumed Wrong functional form for index function In all cases, the estimator is inconsistent so a “robust” covariance matrix is pointless. (In general, it is merely harmless.)

Estimated Robust Covariance Matrix for Logit Model Variable| Coefficient Standard Error b/St.Er. P[|Z|>z] Mean of X |Robust Standard Errors Constant| *** AGE| *** AGESQ|.00154*** INCOME| AGE_INC| FEMALE|.65366*** |Conventional Standard Errors Based on Second Derivatives Constant| *** AGE| *** AGESQ|.00154*** INCOME| AGE_INC| FEMALE|.65366***

Testing: Base Model Binary Logit Model for Binary Choice Dependent variable DOCTOR Log likelihood function Restricted log likelihood Chi squared [ 5 d.f.] Significance level McFadden Pseudo R-squared Estimation based on N = 3377, K = Variable| Coefficient Standard Error b/St.Er. P[|Z|>z] Mean of X Constant| *** AGE| *** AGESQ|.00154*** INCOME| AGE_INC| FEMALE|.65366*** H 0 : Age is not a significant determinant of Prob(Doctor = 1) H 0 : β 2 = β 3 = β 5 = 0

Likelihood Ratio Tests Null hypothesis restricts the parameter vector Alternative releases the restriction Test statistic: Chi-squared = 2 (LogL|Unrestricted model – LogL|Restrictions) > 0 Degrees of freedom = number of restrictions

LR Test of H 0 RESTRICTED MODEL Binary Logit Model for Binary Choice Dependent variable DOCTOR Log likelihood function Restricted log likelihood Chi squared [ 2 d.f.] Significance level McFadden Pseudo R-squared Estimation based on N = 3377, K = 3 UNRESTRICTED MODEL Binary Logit Model for Binary Choice Dependent variable DOCTOR Log likelihood function Restricted log likelihood Chi squared [ 5 d.f.] Significance level McFadden Pseudo R-squared Estimation based on N = 3377, K = 6 Chi squared[3] = 2[ ( )] =

Wald Test Unrestricted parameter vector is estimated Discrepancy: q= Rb – m Variance of discrepancy is estimated: Var[q] = RVR’ Wald Statistic is q’[Var(q)] -1 q = q’[RVR’] -1 q

Carrying Out a Wald Test Chi squared[3] = b0b0 V0V0 R Rb 0 - m RV 0 R Wald

Lagrange Multiplier Test Restricted model is estimated Derivatives of unrestricted model and variances of derivatives are computed at restricted estimates Wald test of whether derivatives are zero tests the restrictions Usually hard to compute – difficult to program the derivatives and their variances.

LM Test for a Logit Model Compute b 0 (subject to restictions) (e.g., with zeros in appropriate positions. Compute P i (b 0 ) for each observation using restricted estimator in the full model. Compute e i (b 0 ) = [y i – P i (b 0 )] Compute g i (b 0 ) = x i e i using full x i vector LM = [Σ i g i (b 0 )][Σ i g i (b 0 )g i (b 0 )] -1 [Σ i g i (b 0 )]

Test Results Matrix LM has 1 rows and 1 columns | | Wald Chi squared[3] = LR Chi squared[3] = 2[ ( )] = Matrix DERIV has 6 rows and 1 columns | D-05 zero from FOC 2| | D+06 4| D-06 zero from FOC 5| | D-05 zero from FOC

A Test of Structural Stability In the original application, separate models were fit for men and women. We seek a counterpart to the Chow test for linear models. Use a likelihood ratio test.

Testing Structural Stability Fit the same model in each subsample Unrestricted log likelihood is the sum of the subsample log likelihoods: LogL1 Pool the subsamples, fit the model to the pooled sample Restricted log likelihood is that from the pooled sample: LogL0 Chi-squared = 2*(LogL1 – LogL0) Degrees of freedom = (#Groups - 1)*model size.

Structural Change (Over Groups) Test Dependent variable DOCTOR Pooled Log likelihood function Variable| Coefficient Standard Error b/St.Er. P[|Z|>z] Mean of X Constant| *** AGE| *** AGESQ|.00139*** INCOME| AGE_INC| Male Log likelihood function Constant| * AGE| *** AGESQ|.00165*** INCOME| AGE_INC| Female Log likelihood function Constant| *** AGE| ** AGESQ|.00143*** INCOME| AGE_INC| Chi squared[5] = 2[ ( ) – ( ] =

Vuong Test for Nonnested Models Test of Logit (Model A) vs. Probit (Model B)? | Listed Calculator Results | VUONGTST=

Inference About Partial Effects

Partial Effects for Binary Choice

The Delta Method

Computing Effects Compute at the data means? Simple Inference is well defined Average the individual effects More appropriate? Asymptotic standard errors a bit more complicated.

APE vs. Partial Effects at the Mean

Partial Effect for Nonlinear Terms

Average Partial Effect: Averaged over Sample Incomes and Genders for Specific Values of Age

Krinsky and Robb Estimate β by Maximum Likelihood with b Estimate asymptotic covariance matrix with V Draw R observations b(r) from the normal population N[b,V] b(r) = b + C*v(r), v(r) drawn from N[0,I] C = Cholesky matrix, V = CC’ Compute partial effects d(r) using b(r) Compute the sample variance of d(r),r=1,…,R Use the sample standard deviations of the R observations to estimate the sampling standard errors for the partial effects.

Krinsky and Robb vs. Delta Method