Panel Stochastic Frontier Models with Endogeneity in Stata

Slides:



Advertisements
Similar presentations
3SLS 3SLS is the combination of 2SLS and SUR.
Advertisements

Cointegration and Error Correction Models
Functional Form and Dynamic Models
Multiple Regression.
F-tests continued.
Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Panel Data Models Prepared by Vera Tabakova, East Carolina University.
Chapter 3: Editing and Debugging SAS Programs. Some useful tips of using Program Editor Add line number: In the Command Box, type num, enter. Save SAS.
Topics in Microeconometrics Professor William Greene Stern School of Business, New York University at Curtin Business School Curtin University Perth July.
Cross section and panel method
Chapter 3 Simple Regression. What is in this Chapter? This chapter starts with a linear regression model with one explanatory variable, and states the.
Econ 140 Lecture 131 Multiple Regression Models Lecture 13.
Chapter 5 Heteroskedasticity. What is in this Chapter? How do we detect this problem What are the consequences of this problem? What are the solutions?
The Role of Financial System in Economic Growth Presented By: Saumil Nihalani.
1 Innovation and Employment: Evidence from Italian Microdata Mariacristina Piva and Marco Vivarelli Università Cattolica S.Cuore - Piacenza.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Suwimon Vongsingthong Department of Information Technology and Management, Krirk University, Thailand
Efficiency Measurement William Greene Stern School of Business New York University.
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
Ordinary Least Squares Estimation: A Primer Projectseminar Migration and the Labour Market, Meeting May 24, 2012 The linear regression model 1. A brief.
1 REGRESSION ANALYSIS WITH PANEL DATA: INTRODUCTION A panel data set, or longitudinal data set, is one where there are repeated observations on the same.
De la Economía Agraria a la Economía Rural y Agroalimentaria TECHNICAL EFFICIENCY AND PRODUCTIVITY ANALYSIS OF SPANISH CITRUS FARMS Fatima Lambarraa, Teresa.
The challenge of a mixed-mode design survey and new IT tools application: the case of the Italian Structure Earning Surveys Fabiana Rocci Stefania Cardinleschi.
Ifo Institute for Economic Research at the University of Munich Employment Effects of Innovation at the Firm Level Stefan Lachenmaier *, Horst Rottmann.
1 Econometrics (NA1031) Lecture 1 Introduction. 2 ”How much” type questions oBy how much a unit change in income affects consumption? oBy how much should.
14-1 Qualitative Variable - Example Frequently we wish to use nominal-scale variables—such as gender, whether the home has a swimming pool, or whether.
Review Section on Instrumental Variables Economics 1018 Abby Williamson and Hongyi Li October 11, 2006.
Efficiency Measurement William Greene Stern School of Business New York University.
[Part 1] 1/18 Stochastic FrontierModels Efficiency Measurement Stochastic Frontier Models William Greene Stern School of Business New York University 0Introduction.
A Framework to Predict the Quality of Answers with Non-Textual Features Jiwoon Jeon, W. Bruce Croft(University of Massachusetts-Amherst) Joon Ho Lee (Soongsil.
FOREIGN DIRECT INVESTMENT AND PRODUCTIVITY SPILLOVERS: Firm Level Evidence from Chilean industrial sector. Leopoldo LabordaDaniel Sotelsek University of.
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
Comparison of Estimation Methods for Agricultural Productivity Yu Sheng ABARES the Superlative vs. the Quantity- based Index Approach August 2015.
Demand Elasticities of residential electricity demand in South Korea, Yejin Keum* (Depart of Economics, Chungbuk National University)
F-tests continued.
Econ 326 Prof. Mariana Carrera Lab Session X [DATE]
Correlation and Linear Regression
Vera Tabakova, East Carolina University
The Multiple Regression Model
Chapter 4 Basic Estimation Techniques
Econ 326 Lecture 19.
Vera Tabakova, East Carolina University
Analyze ICD-10 Diagnosis Codes with Stata
15.5 The Hausman test For the random effects estimator to be unbiased in large samples, the effects must be uncorrelated with the explanatory variables.
Chow test.
Author: Konstantinos Drakos Journal: Economica
Chapter 11: Simple Linear Regression
Parts of an Academic Paper
Econometric methods of analysis and forecasting of financial markets
CJT 765: Structural Equation Modeling
Efficiency of Agro-processing SMEs in Cameroon
The Impact of Public Health Spending on the Rate of Gonorrhea: Evidence from California Counties Given the time constraint for your presentation, (i) don’t.
Instrumental Variables and Two Stage Least Squares
Instrumental Variables and Two Stage Least Squares
Efficiency Measurement
Migration and the Labour Market
BMC Health Service Research 2015 By Gang Nathan Dong PERFORMING WELL IN FINANCIAL MANAGMGMENG AND QUALITY OF CARE.
Life cycle patterns, farm performance and structural change: an empirical research Steven Van Passel I’m working for the policy research centre for sustainable.
1/18/2019 ST3131, Lecture 1.
Simultaneous equation models Prepared by Nir Kamal Dahal(Statistics)
An Introduction to Econometrics
Instrumental Variables and Two Stage Least Squares
Simple Linear Regression and Correlation
Product moment correlation
M. Kezunovic (P.I.) S. S. Luo D. Ristanovic Texas A&M University
SEM: Step by Step In AMOS and Mplus.
Innovation and Employment: Evidence from Italian Microdata
Model and Hypothesis Table Explanation of Variables
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Panel Stochastic Frontier Models with Endogeneity in Stata Mustafa U. Karakaplan

Introduction I introduce xtsfkk as a new Stata command for fitting panel stochastic frontier models with endogeneity. The advantage of xtsfkk is that it can control for the endogenous variables in the frontier and/or the inefficiency term in a longitudinal setting. xtsfkk performs better than standard panel frontier estimators such as xtfrontier that overlook endogeneity by design.

About SFM It has been more than 40 years since Aigner, Lovell, and Schmidt (1977) and Meeusen and van den Broeck (1977) introduced stochastic frontier models (SFM). These models are composed of a deterministic part identifying the frontier, a stochastic part for the two-sided error term, and a one-sided inefficiency error term identifying the distance from the stochastic frontier. These models can be used to study production, cost, revenue, profit, or other goals within various industries such as accounting, advertising, banks, education, financial markets, environment, hospitals, hotels, labor markets, military, police, real estate, sports, transportation, and utilities.

SFM in Stata Stata provides the frontier command to estimate the parameters of a stochastic frontier model. Belotti, et al. (2013) introduce a Stata command called sfcross that mirrors Stata’s frontier command with additional functionality, options, and models. The standard xtfrontier command of Stata and sfpanel command of Belotti, et al. (2013) estimate panel stochastic frontier models However, the literature on stochastic frontier models, various Stata commands, and the estimator options in other general-purpose statistical software packages does not offer a way to control for endogeneity that can exist in these models.

Endogeneity in SFM If the determinants of the frontier or the inefficiency term are correlated with the two-sided error term of the model, then the outcomes of the standard estimators would be contaminated by endogeneity. Kutlu (2010) and Karakaplan and Kutlu (2017a) developed a model to handle endogeneity due to the determinants of the frontier or the inefficiency term, or both. Karakaplan (2017) released a new Stata command called sfkk to analyze empirical stochastic frontier models with endogeneity.

Panel SFM with Endogeneity in Stata One shortcoming of Karakaplan and Kutlu (2017a) and the sfkk command of Karakaplan (2017) is that their model was designed to be cross-sectional. Karakaplan and Kutlu (2017b) came up with a stochastic frontier estimator that would resolve endogeneity issues in a panel setting. The standard xtfrontier command of Stata and sfpanel command of Belotti, et al. (2013) estimate panel stochastic frontier models but cannot handle the endogeneity issues identified by Karakaplan and Kutlu (2017b). Therefore, in this paper, I introduce a new command, xtsfkk, for estimating panel stochastic frontier models with endogeneity in Stata.

About the Command Using Stata’s maximum likelihood estimator tools (the ml commands), and the guidance provided by Gould, Pitblado, and Poi (2010), I programmed the xtsfkk command that can calculate the estimator and efficiency equations presented in Karakaplan and Kutlu (2017b). Three files are included in the xtsfkk command package: xtsfkk.ado file for the main estimation syntax that users can access by running xtsfkk in Stata, xtsfkk_ml.ado file for the evaluator subroutines that xtsfkk calls for behind the scenes, and xtsfkk.sthlp file for helping users when help xtsfkk is run in Stata. Syntax: xtsfkk depvar [indepvars] [if] [in] [weight] [, options]

About the Command Options Users can run help xtsfkk in Stata for the full-length documentation of the xtsfkk syntax, options, stored results, and other details. Some standard options of xtsfkk are: production cost endogenous(endovarlist) instruments(ivarlist) uhet(uvarlist) whet(wvarlist)

Some Other Important Options Some other important options of xtsfkk: efficiency(effvar[, replace]) generates the production or cost efficiency variable once the estimation is completed, and displays its summary statistics in detail. test provides a method to test the endogeneity in the model. This option tests the joint significance of the components of the eta term, and reports the findings after displaying the regression results. For more information about test see Karakaplan and Kutlu (2017b). save option saves the current state of the estimation in real-time while the estimation is running. load option loads a previously saved state of an estimation and continues from where it was left.

Examples Users can run help xtsfkk in Stata for several examples about the usage of the xtsfkk command. Panel Stochastic Cost Frontier Model with Endogeneity: Panel data include 85 individuals and a total of 300 observations between 2011 and 2015. This unbalanced dataset is for illustrative purposes and does not characterize a certain sector. Cost (y) is modeled as a function of two frontier variables (x1 and z1) Cost inefficiency is modeled as a function of a variable (z2). Two instrumental variables (iv1 and iv2) are used to handle the potential endogeneity of two variables (z1 and z2) in the model.

Estimation The header option displays the model fully. The compare option displays the results from exogenous model. The nicely option displays the results in a single table using estout command of Jann (2005). . use http://www.mukarakaplan.com/files/xtsfkkcost.dta, clear . xtset id t panel variable: id (unbalanced) time variable: t, 2011 to 2015, but with gaps delta: 1 unit  . xtsfkk y x1 z1, cost u(z2) en(z1 z2) i(iv1 iv2) header compare nicely   ENDOGENOUS PANEL STOCHASTIC COST FRONTIER MODEL (Model EN) Dependent Variable: y Frontier Variables: Constant x1 z1  U Variables: Constant z2  W Variable: Constant Endogenous Variables: z1 z2 Added Instruments: iv1 iv2  Exogenous Variables: iv1 iv2 x1  Panel Variable: id Time Variable: t

Estimation Results Model EX Model EN ---------------------------------------------------------------- Dep.var: y Constant 0.391** (0.129) 0.295* (0.136) x1 0.136* (0.068) 0.494*** (0.092) z1 0.963*** (0.047) 0.746*** (0.097) Dep.var: ln(sigma²_u) Constant -0.544* (0.251) -0.945*** (0.215) z2 1.190*** (0.068) 1.131*** (0.063) Dep.var: ln(sigma²_v) Constant -1.503*** (0.097) Dep.var: ln(sigma²_w) Constant -1.918*** (0.094) eta1 (z1) 0.421*** (0.109) eta2 (z2) 0.568*** (0.055) eta Endogeneity Test X2=138.68 p=0.000 Observations 300 300 Log Likelihood -302.44 -782.62 Mean Cost Efficiency 0.3625 0.4838 Median Cost Efficiency 0.3341 0.4976 Notes: Standard errors are in parentheses. Asterisks indicate significance at the 0.1% (***), 1% (**) and 5% (*) levels. Model EX is the model that ignores endogeneity, and Model EN is the model that handles endogeneity. Individual eta terms for z1 and z2 are both statistically significant at the 0.1% level and the eta endogeneity test result rejects the null hypothesis at 0.1% level, which indicates that a correction for endogeneity in the model is necessary. The coefficients of z1 and z2 in Model EX are positive and statistically significant. In Model EN, these coefficients are significant and positive but smaller. Mean cost efficiency in Model EX is 0.3625, while in Model EN, the same statistic is 0.4838. So, individuals in this data are more cost efficient than how they would seem in Model EX that overlooks endogeneity.

Conclusion xtsfkk is a new Stata command to estimate endogenous panel stochastic frontier models. xtsfkk can handle endogenous variables in the frontier and/or the inefficiency, and studies and examples show that xtsfkk estimates outperform the standard frontier estimates that ignore endogeneity. xtsfkk provides many options that can become handy for researchers from many different fields such as agriculture, airline, banking, education, energy, and health. For updates, comments, suggestions, or questions about xtsfkk: mukarakaplan@yahoo.com www.mukarakaplan.com