Microeconometric Modeling

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Brief introduction on Logistic Regression
Part 12: Asymptotics for the Regression Model 12-1/39 Econometrics I Professor William Greene Stern School of Business Department of Economics.
3. Binary Choice – Inference. Hypothesis Testing in Binary Choice Models.
Part 7: Estimating the Variance of b 7-1/53 Econometrics I Professor William Greene Stern School of Business Department of Economics.
Uncertainty and confidence intervals Statistical estimation methods, Finse Friday , 12.45–14.05 Andreas Lindén.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
The General Linear Model. The Simple Linear Model Linear Regression.
A Short Introduction to Curve Fitting and Regression by Brad Morantz
[Part 3] 1/49 Stochastic FrontierModels Stochastic Frontier Model Stochastic Frontier Models William Greene Stern School of Business New York University.
1 Chapter 2 Simple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Topics in Microeconometrics Professor William Greene Stern School of Business, New York University at Curtin Business School Curtin University Perth July.
OUTLIER, HETEROSKEDASTICITY,AND NORMALITY
Generalized Regression Model Based on Greene’s Note 15 (Chapter 8)
Parametric Inference.
Chapter 11 Multiple Regression.
Economics Prof. Buckles
Generalized Linear Models
Efficiency Measurement William Greene Stern School of Business New York University.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Model Inference and Averaging
Random Sampling, Point Estimation and Maximum Likelihood.
Efficiency of Public Spending in Developing Countries: A Stochastic Frontier Approach William Greene Stern School of Business World Bank, May 23, 2005.
Efficiency Measurement William Greene Stern School of Business New York University.
A statistical model Μ is a set of distributions (or regression functions), e.g., all uni-modal, smooth distributions. Μ is called a parametric model if.
Microeconometric Modeling William Greene Stern School of Business New York University.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Part 9: Hypothesis Testing /29 Econometrics I Professor William Greene Stern School of Business Department of Economics.
Efficiency Measurement William Greene Stern School of Business New York University.
Efficiency Measurement William Greene Stern School of Business New York University.
Frontier Models and Efficiency Measurement Lab Session 1 William Greene Stern School of Business New York University 0Introduction 1Efficiency Measurement.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Efficiency Measurement William Greene Stern School of Business New York University.
1 Lecture 16: Point Estimation Concepts and Methods Devore, Ch
Frontier Models and Efficiency Measurement Lab Session 4: Panel Data William Greene Stern School of Business New York University 0Introduction 1Efficiency.
Efficiency Measurement William Greene Stern School of Business New York University.
Frontier Models and Efficiency Measurement Lab Session 2: Stochastic Frontier William Greene Stern School of Business New York University 0Introduction.
Econometrics in Health Economics Discrete Choice Modeling and Frontier Modeling and Efficiency Estimation Professor William Greene Stern School of Business.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.
[Topic 1-Regression] 1/37 1. Descriptive Tools, Regression, Panel Data.
M.Sc. in Economics Econometrics Module I Topic 4: Maximum Likelihood Estimation Carol Newman.
Efficiency Measurement William Greene Stern School of Business New York University.
[Part 1] 1/18 Stochastic FrontierModels Efficiency Measurement Stochastic Frontier Models William Greene Stern School of Business New York University 0Introduction.
Lecture 1: Basic Statistical Tools. A random variable (RV) = outcome (realization) not a set value, but rather drawn from some probability distribution.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 4.
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Stochastic Error Functions I: Another Composed Error Lecture X.
1/61: Topic 1.2 – Extensions of the Linear Regression Model Microeconometric Modeling William Greene Stern School of Business New York University New York.
1/26: Topic 2.2 – Nonlinear Panel Data Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William.
Lecturer: Ing. Martina Hanová, PhD. Business Modeling.
Estimating standard error using bootstrap
Microeconometric Modeling
Stochastic Frontier Models
Efficiency Measurement
Stochastic Frontier Models
Microeconometric Modeling
Econometrics I Professor William Greene Stern School of Business
Stochastic Frontier Models
Econometrics I Professor William Greene Stern School of Business
Econometrics I Professor William Greene Stern School of Business
Microeconometric Modeling
Microeconometric Modeling
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2019 William Greene Department of Economics Stern School.
Microeconometric Modeling
Econometrics I Professor William Greene Stern School of Business
Presentation transcript:

Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA 1.2 Extensions of the Linear Regression Model

Concepts Models Multiple Imputation Robust Covariance Matrices Bootstrap Maximum Likelihood Method of Moments Estimating Individual Outcomes Linear Regression Model Quantile Regression Stochastic Frontier

Multiple Imputation for Missing Data

Imputed Covariance Matrix

Implementation SAS, Stata: Create full data sets with imputed values inserted. M = 5 is the familiar standard number of imputed data sets. Data are replicated and redistributed SAS: Standard procedure and code distributed. Stata: Elaborate imputation equations, M=5 NLOGIT Create an internal map of the missing values and a set of engines for filling missing values Loop through imputed data sets during estimation. M may be arbitrary – memory usage and data storage are independent of M. Data may be replicated

Regression with Conventional Standard Errors

Robust Covariance Matrices Robust standard errors, not estimates Robust to: Heteroscedasticty Not robust to: (all considered later) Correlation across observations Individual unobserved heterogeneity Incorrect model specification ‘Robust inference’ means hypothesis tests and confidence intervals using robust covariance matrices

A Robust Covariance Matrix Uncorrected

Bootstrap Estimation of the Asymptotic Variance of an Estimator Known form of asymptotic variance: Compute from known results Unknown form, known generalities about properties: Use bootstrapping Root N consistency Sampling conditions amenable to central limit theorems Compute by resampling mechanism within the sample.

Bootstrapping Algorithm 1. Estimate parameters using full sample:  b 2. Repeat R times: Draw n observations from the n, with replacement Estimate  with b(r). 3. Estimate variance with V = (1/R)r [b(r) - b][b(r) - b]’ (Some use mean of replications instead of b. Advocated (without motivation) by original designers of the method.)

Application: Correlation between Age and Education

Bootstrapped Regression

Bootstrap Replications

Bootstrapped Confidence Intervals Estimate Norm()=(12 + 22 + 32 + 42)1/2

Quantile Regression Q(y|x,) = x,  = quantile Estimated by linear programming Q(y|x,.50) = x, .50  median regression Median regression estimated by LAD (estimates same parameters as mean regression if symmetric conditional distribution) Why use quantile (median) regression? Semiparametric Robust to some extensions (heteroscedasticity?) Complete characterization of conditional distribution

Estimated Variance for Quantile Regression

Quantile Regressions  = .25  = .50  = .75

OLS vs. Least Absolute Deviations

Coefficient on MALE dummy variable in quantile regressions

A Production Function Model with Inefficiency The Stochastic Frontier Model

Inefficiency in Production

Cost Inefficiency y* = f(x)  C* = g(y*,w) (Samuelson – Shephard duality results) Cost inefficiency: If y < f(x), then C must be greater than g(y,w). Implies the idea of a cost frontier. lnC = lng(y,w) + u, u > 0.

Corrected Ordinary Least Squares

COLS Cost Frontier

Stochastic Frontier Models Motivation: Factors not under control of the firm Measurement error Differential rates of adoption of technology Frontier is randomly placed by the whole collection of stochastic elements which might enter the model outside the control of the firm. Aigner, Lovell, Schmidt (1977), Meeusen, van den Broeck (1977), Battese, Corra (1977)

The Stochastic Frontier Model ui > 0, but vi may take any value. A symmetric distribution, such as the normal distribution, is usually assumed for vi. Thus, the stochastic frontier is +’xi+vi and, as before, ui represents the inefficiency.

Least Squares Estimation Average inefficiency is embodied in the third moment of the disturbance εi = vi - ui. So long as E[vi - ui] is constant, the OLS estimates of the slope parameters of the frontier function are unbiased and consistent. (The constant term estimates α-E[ui]. The average inefficiency present in the distribution is reflected in the asymmetry of the distribution, which can be estimated using the OLS residuals:

Application to Spanish Dairy Farms N = 247 farms, T = 6 years (1993-1998) Input Units Mean Std. Dev. Minimum Maximum Milk Milk production (liters) 131,108 92,539 14,110 727,281 Cows # of milking cows 2.12 11.27 4.5 82.3 Labor # man-equivalent units 1.67 0.55 1.0 4.0 Land Hectares of land devoted to pasture and crops. 12.99 6.17 2.0 45.1 Feed Total amount of feedstuffs fed to dairy cows (tons) 57,941 47,981 3,924.14 376,732

Example: Dairy Farms

The Normal-Half Normal Model

Skew Normal Variable

Estimation: Least Squares/MoM OLS estimator of β is consistent E[ui] = (2/π)1/2σu, so OLS constant estimates α+ (2/π)1/2σu Second and third moments of OLS residuals estimate Method of Moments: Use [a,b,m2,m3] to estimate [,,u, v]

Standard Form: The Skew Normal Distribution

Log Likelihood Function Waldman (1982) result on skewness of OLS residuals: If the OLS residuals are positively skewed, rather than negative, then OLS maximizes the log likelihood, and there is no evidence of inefficiency in the data.

Airlines Data – 256 Observations

Least Squares Regression

Alternative Models: Half Normal and Exponential

Normal-Exponential Likelihood Other Models Many other parametric models Semiparametric and nonparametric – the recent outer reaches of the theoretical literature Other variations including heterogeneity in the frontier function and in the distribution of inefficiency

A Test for Inefficiency? Base test on u = 0 <=>  = 0 Standard test procedures Likelihood ratio Wald Lagrange Multiplier Nonstandard testing situation: Variance = 0 on the boundary of the parameter space Standard chi squared distribution does not apply.

Estimating ui No direct estimate of ui Data permit estimation of yi – β’xi. Can this be used? εi = yi – β’xi = vi – ui Indirect estimate of ui, using E[ui|vi – ui] This is E[ui|yi, xi] vi – ui is estimable with ei = yi – b’xi.

Fundamental Tool - JLMS We can insert our maximum likelihood estimates of all parameters. Note: This estimates E[u|vi – ui], not ui.

Application: Electricity Generation

Estimated Translog Production Frontiers

Inefficiency Estimates

Estimated Inefficiency Distribution

Estimated Efficiency

A Semiparametric Approach Y = g(x,z) + v - u [Normal-Half Normal] (1) Locally linear nonparametric regression estimates g(x,z) (2) Use residuals from nonparametric regression to estimate variance parameters using MLE (3) Use estimated variance parameters and residuals to estimate technical efficiency.

Airlines Application

Efficiency Distributions

Nonparametric Methods - DEA

DEA is done using linear programming

Methodological Problems with DEA Measurement error Outliers Specification errors The overall problem with the deterministic frontier approach

DEA and SFA: Same Answer? Christensen and Greene data N=123 minus 6 tiny firms X = capital, labor, fuel Y = millions of KWH Cobb-Douglas Production Function vs. DEA

Comparing the Two Methods.