Lecture 12 (Ch16) Simultaneous Equations Models (SEMs)

Presentation transcript:

Research Method: Lecture 12 (Ch16) Simultaneous Equations Models (SEMs)

Introduction. We have learned two “sources” of endogeneity: 1. Omitted variables. 2. Errors in variables. In this handout, we will learn another source of endogeneity: simultaneity.

In econometrics, “endogeneity” usually means that an explanatory variable is correlated with the error term. In simultaneous equation models, endogeneity means that the observed variable is determined by the equilibrium. For example, an observed quantity is determined by the equilibrium between demand and supply. When a variable is endogenous in ‘simultaneous equation’ sense, it is usually endogenous in econometric sense (i.e., correlated with the error term). We will see this soon.

The nature of simultaneous equations. Consider the following model describing the equilibrium quantity of labor (in hours) in the agricultural sector of a country. Labor supply: $h_s = \alpha_1 w + \beta_1 z_1 + u_1$. Labor demand: $h_d = \alpha_2 w + \beta_2 z_2 + u_2$. Here $h_s$ is the hours of labor supplied and $h_d$ is the hours of labor demanded. These quantities depend on the wage rate, $w$, and on other factors, $z_1$ and $z_2$.

$z_1$ would be the wage rate of the manufacturing sector. If the manufacturing wage increases, people would move to the manufacturing sector, reducing hours worked in the agricultural sector. Since $z_1$ appears in the labor supply equation, it is called the observed supply shifter, and $u_1$ is called the unobserved supply shifter. $z_2$ would be the agricultural land area. The more land is available, the greater the demand for labor. $z_2$ is the observed demand shifter, and $u_2$ is the unobserved demand shifter.

Demand and supply describe entirely different relationships. The observed labor quantity and wage rate are determined by the equilibrium between these two equations. The equilibrium condition is $h_s = h_d$.

Suppose you have country level data. Then, for each country, we observe only the equilibrium hours of labor and the wage rate. Supply: $h_i = \alpha_1 w_i + \beta_1 z_{i1} + u_{i1}$. Demand: $h_i = \alpha_2 w_i + \beta_2 z_{i2} + u_{i2}$, where $i$ is the country subscript. These two equations constitute a simultaneous equations model (SEM). They are called the structural equations, and $\alpha_1, \beta_1, \alpha_2, \beta_2$ are called the structural parameters.

In SEM framework, hi and wi are endogenous variables because they are determined by the equilibrium between the two equations. In the same way, zi1 and zi2 are exogenous variables because they are determined outside of the model. u1 and u2 are called the structural errors. One more important point: Without z1 or z2, there is no way to distinguish whether one equation is demand or supply.

Simultaneous equation bias. Consider the following simultaneous equation model.
$y_1 = \alpha_1 y_2 + \beta_1 z_1 + u_1$ (1)
$y_2 = \alpha_2 y_1 + \beta_2 z_2 + u_2$ (2)
In this model, $y_1$ and $y_2$ are endogenous variables since they are determined by the equilibrium between the two equations. $z_1$ and $z_2$ are exogenous variables.

Since z1 and z2 are determined outside of the model, we assume that z1 and z2 are uncorrelated with both of the structural errors. Thus, by definition, the exogenous variables in SEM are exogenous in the ‘econometric sense’ as well. In addition, the two structural errors, u1 and u2, are assumed to be uncorrelated with each other.

Now solve equations (1) and (2) for $y_1$ and $y_2$ (this requires $\alpha_1 \alpha_2 \neq 1$). You get the following reduced form equations.
$y_1 = \pi_{11} z_1 + \pi_{12} z_2 + v_1$
$y_2 = \pi_{21} z_1 + \pi_{22} z_2 + v_2$
where
$\pi_{11} = \beta_1/(1-\alpha_1\alpha_2)$, $\pi_{12} = \alpha_1\beta_2/(1-\alpha_1\alpha_2)$, $v_1 = (u_1 + \alpha_1 u_2)/(1-\alpha_1\alpha_2)$,
$\pi_{21} = \alpha_2\beta_1/(1-\alpha_2\alpha_1)$, $\pi_{22} = \beta_2/(1-\alpha_2\alpha_1)$, $v_2 = (\alpha_2 u_1 + u_2)/(1-\alpha_2\alpha_1)$.
The $\pi$ coefficients are called the reduced form parameters.
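
As a quick numerical check of the reduced form algebra, the sketch below solves equations (1) and (2) directly as a linear system and compares the answer with the reduced form formulas. The parameter values and the evaluation point are hypothetical, picked only for illustration; any values with α1·α2 ≠ 1 should behave the same way.

```python
import numpy as np

# Hypothetical structural parameters (illustration only; need alpha1*alpha2 != 1).
alpha1, beta1 = 0.5, 1.0
alpha2, beta2 = -0.8, 2.0


def solve_structural(z1, z2, u1, u2):
    """Solve equations (1) and (2) jointly for (y1, y2)."""
    # Move the endogenous variables to the left-hand side: A @ [y1, y2] = b.
    A = np.array([[1.0, -alpha1],
                  [-alpha2, 1.0]])
    b = np.array([beta1 * z1 + u1, beta2 * z2 + u2])
    return np.linalg.solve(A, b)


def reduced_form(z1, z2, u1, u2):
    """Evaluate y1 and y2 from the reduced form parameters and errors."""
    d = 1.0 - alpha1 * alpha2
    pi11, pi12 = beta1 / d, alpha1 * beta2 / d
    pi21, pi22 = alpha2 * beta1 / d, beta2 / d
    v1 = (u1 + alpha1 * u2) / d
    v2 = (alpha2 * u1 + u2) / d
    return np.array([pi11 * z1 + pi12 * z2 + v1,
                     pi21 * z1 + pi22 * z2 + v2])


# An arbitrary (hypothetical) draw of exogenous variables and errors.
point = dict(z1=1.5, z2=-0.7, u1=0.3, u2=-1.1)
y_structural = solve_structural(**point)
y_reduced = reduced_form(**point)
print(y_structural, y_reduced)
```

Both routes should return the same pair (y1, y2), which is all the reduced form claims: it is the structural system solved for the endogenous variables.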

You can check that v1 and v2 are uncorrelated with z1 and z2. Therefore, you can estimate the reduced form parameters by OLS (just apply OLS separately to each equation).

However, you cannot estimate the structural equations with OLS. For example, consider the first structural equation, $y_1 = \alpha_1 y_2 + \beta_1 z_1 + u_1$. Substituting the reduced form for $y_2$, and using the assumptions that $z_1$, $z_2$ and $u_2$ are uncorrelated with $u_1$, we get $\mathrm{Cov}(y_2, u_1) = \frac{\alpha_2}{1-\alpha_2\alpha_1} E(u_1^2) = \frac{\alpha_2}{1-\alpha_2\alpha_1} \sigma_1^2 \neq 0$. Thus, $y_2$ is correlated with $u_1$ (assuming that $\alpha_2 \neq 0$). In other words, $y_2$ is endogenous in the ‘econometric sense’.

Thus, endogenous variables in SEM are usually endogenous in the ‘econometric sense’ as well, and you cannot apply OLS to the structural equations. The expression $\mathrm{Cov}(y_2, u_1) = \frac{\alpha_2}{1-\alpha_2\alpha_1} \sigma_1^2$ can be used to predict the direction of the bias. If it is positive, the OLS estimate of $\alpha_1$ will be biased upward. If it is negative, it will be biased downward. The formula does not carry over to more general models, but we can use it as a guide to check the direction of the bias.
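
The direction-of-bias rule can be illustrated with a small simulation. This is only a sketch under assumed values: the structural parameters, the standard normal distributions, and the sample size are all hypothetical choices, and the data are generated from the reduced form of equations (1) and (2).

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000

# Hypothetical structural parameters (illustration only).
alpha1, beta1 = -0.5, 1.0
alpha2, beta2 = 0.8, 1.0

# Exogenous variables and structural errors, all independent with variance 1.
z1, z2 = rng.normal(size=n), rng.normal(size=n)
u1, u2 = rng.normal(size=n), rng.normal(size=n)

# Generate the endogenous variables from the reduced form.
d = 1.0 - alpha1 * alpha2
y1 = (beta1 * z1 + alpha1 * beta2 * z2 + u1 + alpha1 * u2) / d
y2 = (alpha2 * beta1 * z1 + beta2 * z2 + alpha2 * u1 + u2) / d

# OLS on the first structural equation (no intercept: all variables have mean zero).
X = np.column_stack([y2, z1])
coef, *_ = np.linalg.lstsq(X, y1, rcond=None)
alpha1_ols = coef[0]

# Sample covariance between y2 and u1 versus the formula alpha2 * sigma1^2 / (1 - alpha1*alpha2).
cov_y2_u1 = np.cov(y2, u1)[0, 1]
theory_cov = alpha2 * 1.0 / d

print("true alpha1:", alpha1, "OLS estimate:", alpha1_ols)
print("Cov(y2, u1):", cov_y2_u1, "formula:", theory_cov)
```

With these assumed values the covariance is positive, so the OLS estimate of α1 should come out above the true value, in line with the rule stated above.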

An example Suppose that you are interested in estimating the effect of police size on the city murder rate. Notice that the ‘supply’ of murder would be a function of police size. But the ‘demand’ for police is a function of murder rates.

Thus, the observed murder rate and the police size are determined simultaneously by the following model.
$\text{Murder} = \alpha_1 \text{Police} + \beta_{10} + \beta_1 (\text{Income per capita}) + u_1$ (3)
$\text{Police} = \alpha_2 \text{Murder} + \beta_{20} + \beta_2 (\text{other vars}) + u_2$ (4)
All the variables are city-level variables. (Murder) is the number of murders per capita. (Police) is the number of police officers per capita. We are interested in estimating the effect of police on the murder rate, that is, equation (3).

However, since murder rate and police force are determined simultaneously, (police) is endogenous in equation (3). Thus OLS estimate for α1 is biased. Question: What would be the direction of the bias?

Identifying and estimating a structural equation: the two-equation case. When we learned OLS, a parameter was said to be identified when the explanatory variable is not correlated with the error. In the 2SLS chapter, we learned how to identify a parameter (i.e., eliminate the bias) by applying the IV method. In SEM, the term ‘identification’ is used slightly differently.

Consider the following model describing supply and demand. Supply: $q = \alpha_1 p + \beta_1 z_1 + u_1$. Demand: $q = \alpha_2 p + u_2$. Note that the supply curve has an observed supply shifter $z_1$, but the demand curve has no observed demand shifter. Given data on $q$, $p$ and $z_1$, which equation can be estimated? That is, which is an identified equation?

[Figure: several supply curves, whose location differs depending on the value of z1, crossing a single demand curve; the intersections are the observed data points.] Notice that the data points trace out the demand curve. Thus, it is the demand equation that can be estimated.

Because there is an observed supply shifter z1 that is not contained in the demand equation, we can identify the demand equation. It is the presence of an exogenous variable in the supply equation that allows us to estimate the demand equation. In SEM, ‘identification’ refers to which equation can be estimated.
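
The identification argument can be sketched numerically. In the simulation below, every numerical value (slopes, error distributions, sample size) is a hypothetical assumption. The supply shifter z1 moves the supply curve along a fixed demand curve, so the ratio Cov(z1, q)/Cov(z1, p), which is the simple IV estimator with z1 as the instrument, should recover the demand slope, while a plain OLS regression of q on p should not.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100_000

# Hypothetical structural parameters (illustration only).
alpha1, beta1 = 1.0, 1.0   # supply: q = alpha1*p + beta1*z1 + u1
alpha2 = -1.0              # demand: q = alpha2*p + u2

z1 = rng.normal(size=n)    # observed supply shifter
u1 = rng.normal(size=n)    # unobserved supply shifter
u2 = rng.normal(size=n)    # unobserved demand shifter

# Equilibrium: set supply equal to demand and solve for price, then quantity.
p = (u2 - u1 - beta1 * z1) / (alpha1 - alpha2)
q = alpha2 * p + u2

# OLS slope of q on p mixes the two curves.
alpha2_ols = np.cov(p, q)[0, 1] / np.var(p, ddof=1)

# IV slope using z1: only supply shifts are used, so the demand curve is traced out.
alpha2_iv = np.cov(z1, q)[0, 1] / np.cov(z1, p)[0, 1]

print("true demand slope:", alpha2)
print("OLS slope:", alpha2_ols)
print("IV slope using z1:", alpha2_iv)
```

No analogous estimator is available for the supply slope in this setup, because nothing observed shifts demand while leaving supply fixed.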

Now turn to a more general case. (z11~z1k) and (z21~ ) may contain the same variables, but may contain different variables as well. When one equation contains exogenous variables not contained in the other equation, this means that we have imposed exclusion restrictions.

The condition for identification: The first equation is identified if and only if the second equation contains at least one exogenous variable (with a nonzero coefficient) that is excluded from the first equation.

The above condition has two components. First, at least one exogenous variable should be excluded from the first equation (order condition). Second, at least one excluded variable should have a nonzero coefficient in the second equation (rank condition). The identification condition for the second equation is just a mirror image of this statement.

Example Labor supply of married working women. Labor supply equation: Wage offer equation: In the model, hours and lwage are endogenous variables. All other variables are exogenous. (Thus, we are ignoring the endogeneity of educ arising from omitted ability.)

Suppose that you are interested in estimating the first equation. Since exp and exp2 are excluded from the first equation, the order condition is satisfied for the first equation. The rank condition is that, at least one of exp and exp2 has a non zero coefficient in the second equation. Assuming that the rank condition is satisfied, the first equation is identified. In a similar way, you can see that the second equation is also identified.

Estimating SEM using 2SLS. Once we have determined that an equation is identified, we can estimate it by two stage least squares (2SLS).

Consider the labor supply equation example again. You are interested in estimating the first equation. Suppose that the first equation is identified (both the order and rank conditions are satisfied). lwage is correlated with u1. Thus, OLS cannot be used.

However, exp and exp2 can be used as instruments for lwage in the first equation. Why? First, exp and exp2 are uncorrelated with u1 by assumption of the model (instrument exogeneity is satisfied). Second, exp and exp2 are correlated with lwage by the rank condition (instrument relevance is satisfied).

In general, you can use the excluded exogenous variables as the instruments.
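
A minimal sketch of this procedure, written out as two explicit OLS stages on simulated data from the two-equation model (1) and (2). The parameter values, distributions, and the helper name `ols` are hypothetical, introduced only for illustration. Here z2 is the exogenous variable excluded from equation (1), so it serves as the instrument for y2.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100_000

# Hypothetical structural parameters (illustration only).
alpha1, beta1 = -0.5, 1.0
alpha2, beta2 = 0.8, 1.0

z1, z2 = rng.normal(size=n), rng.normal(size=n)
u1, u2 = rng.normal(size=n), rng.normal(size=n)

# Endogenous variables generated from the reduced form of (1) and (2).
d = 1.0 - alpha1 * alpha2
y1 = (beta1 * z1 + alpha1 * beta2 * z2 + u1 + alpha1 * u2) / d
y2 = (alpha2 * beta1 * z1 + beta2 * z2 + alpha2 * u1 + u2) / d


def ols(X, y):
    """Least squares coefficients (no intercept; the simulated data have mean zero)."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef


# First stage: regress y2 on ALL exogenous variables (included z1, excluded z2).
Z = np.column_stack([z1, z2])
y2_hat = Z @ ols(Z, y2)

# Second stage: replace y2 with its fitted value in equation (1).
alpha1_2sls, beta1_2sls = ols(np.column_stack([y2_hat, z1]), y1)

# Plain OLS on equation (1), for comparison.
alpha1_ols = ols(np.column_stack([y2, z1]), y1)[0]

print("true alpha1:", alpha1)
print("OLS:", alpha1_ols, "2SLS:", alpha1_2sls)
```

Running the second stage by hand gives the 2SLS point estimates, but the standard errors reported by a naive second-stage regression are not the correct 2SLS standard errors; in practice a dedicated IV routine would be used for inference.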

Exercise Consider the following simultaneous equation model. Q1: Which equation(s) is/are identified? Q2: Estimate the identified equation(s).

Answer [Table: OLS and 2SLS estimates.]

Note on the terminology. In the previous slides, the exogenous variables excluded from the equation were called the instruments. In SEM (and in the usual IV method too), people often refer to all the exogenous variables (regardless of whether they are included or excluded) as the instruments. The instruments that are excluded from the equation are specifically called the ‘excluded instruments’.

Simultaneous equations models with panel data. Consider the following SEM. The notation is a short hand notation for . The same for . Due to the fixed effect term and , z-variables are correlated with the composite error terms. Therefore, the excluded exogenous variables cannot be used as instruments unless we do something.

To apply 2SLS, we should first (i) first-difference or (ii) time-demean the equations. First-differenced version Time demeaned (fixed effect) version

Then the first-differenced or time-demeaned z-variables are not correlated with the error term. Thus we can apply the 2SLS method. The estimation procedure is the same. First, determine which equation is identified. Then, use the excluded exogenous variables as the instruments in the 2SLS method.
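
The sketch below illustrates why differencing matters, under loudly stated assumptions: the panel equations themselves did not survive in this transcript, so the model here is a hypothetical two-period version of equations (1) and (2) with an additive unit fixed effect in each equation, and with z-variables deliberately built to be correlated with the fixed effect of the first equation. All parameter values and helper names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(2)
n_units, n_periods = 20_000, 2

# Hypothetical structural parameters (illustration only).
alpha1, beta1 = -0.5, 1.0
alpha2, beta2 = 0.8, 1.0

# Unit fixed effects; both z-variables are constructed to be correlated with a1.
a1 = rng.normal(size=(n_units, 1))
a2 = rng.normal(size=(n_units, 1))
z1 = a1 + rng.normal(size=(n_units, n_periods))
z2 = a1 + rng.normal(size=(n_units, n_periods))

# Composite errors: fixed effect plus idiosyncratic error.
e1 = a1 + rng.normal(size=(n_units, n_periods))
e2 = a2 + rng.normal(size=(n_units, n_periods))

# Endogenous variables from the reduced form, with composite errors.
d = 1.0 - alpha1 * alpha2
y1 = (beta1 * z1 + alpha1 * beta2 * z2 + e1 + alpha1 * e2) / d
y2 = (alpha2 * beta1 * z1 + beta2 * z2 + alpha2 * e1 + e2) / d


def ols(X, y):
    """Least squares coefficients (no intercept; the simulated data have mean zero)."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef


def two_sls_alpha1(y1, y2, z1, z2):
    """2SLS coefficient on y2 in the first equation, with z2 as the excluded instrument."""
    Z = np.column_stack([z1, z2])
    y2_hat = Z @ ols(Z, y2)
    return ols(np.column_stack([y2_hat, z1]), y1)[0]


def first_diff(x):
    """Period 2 minus period 1 for each unit; this removes the fixed effect."""
    return x[:, 1] - x[:, 0]


# 2SLS on pooled levels: the instruments are correlated with the fixed effect.
alpha1_levels = two_sls_alpha1(y1.ravel(), y2.ravel(), z1.ravel(), z2.ravel())

# 2SLS on first differences: the fixed effect has been differenced away.
alpha1_fd = two_sls_alpha1(first_diff(y1), first_diff(y2),
                           first_diff(z1), first_diff(z2))

print("true alpha1:", alpha1)
print("2SLS in levels:", alpha1_levels, "2SLS in first differences:", alpha1_fd)
```

In this assumed setup the levels estimate should be noticeably off, while the first-differenced estimate should be close to the true value. Time-demeaning would play the same role as differencing here.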

An application: the effect of prison population on the violent crime rate (Levitt 1996). This paper addresses the following question: To what extent would an increase in the prison population decrease violent crime?

Consider the following model. (Crime): the number of violent crimes per capita. (Prison) prison population per capita. : intercepts (different at each year: just include year dummies.) z1: police per capita, log of income per capita, unemployment rate, proportions of black and those living in metropolitan areas, and age distributions.

First-difference the equation to eliminate the fixed effect ai. Even after eliminating the fixed effect, there is still simultaneous equation bias, because the prison population is determined by the crime rate as well.

The simultaneity can be expressed in the SEM framework as: (Exogenous vars) in equation (7) could contain . However, in order to identify the crime equation (6), (exogenous vars) should contain variables that are not included in the crime equation. What could such a variable be?

Levitt (1996) used overcrowding litigation as the excluded instruments. In the US, prisoners’ rights groups have filed lawsuits to mitigate the overcrowding of prisons. When a lawsuit is successful, the court orders the prisons to reduce overcrowding. This usually takes the form of population caps.

Thus, overcrowding litigation, if victories are achieved, will affect the change in the prison population. At the same time, it is reasonable to assume that overcrowding litigation affects the crime rate only through the prison population. Thus, the model is now: Whether the final decision about the overcrowding litigation is reached.

The results Simple first differenced model. The coefficient would be biased.

First-difference plus 2SLS to eliminate the simultaneous equation bias.

The results of the overidentifying restriction test and endogeneity test.

The first stage regression Overcrowding litigation reduces the prison population growth.