1 Research Method Lecture 11-1 (Ch15) Instrumental Variables Estimation and Two Stage Least Square ©

Slides:



Advertisements
Similar presentations
Multiple Regression.
Advertisements

Instrumental Variables Estimation and Two Stage Least Square
Economics 20 - Prof. Anderson
Multiple Regression Analysis
The Simple Regression Model
There are at least three generally recognized sources of endogeneity. (1) Model misspecification or Omitted Variables. (2) Measurement Error.
Lecture 8 (Ch14) Advanced Panel Data Method
3.3 Omitted Variable Bias -When a valid variable is excluded, we UNDERSPECIFY THE MODEL and OLS estimates are biased -Consider the true population model:
Specification Error II
Instrumental Variables Estimation and Two Stage Least Square
Lecture 12 (Ch16) Simultaneous Equations Models (SEMs)
Lecture 4 Econ 488. Ordinary Least Squares (OLS) Objective of OLS  Minimize the sum of squared residuals: where Remember that OLS is not the only possible.
The Generalized IV Estimator IV estimation with a single endogenous regressor and a single instrument can be naturally generalized. Suppose that there.
Lecture 3 Cameron Kaplan
Cross section and panel method
Economics Prof. Buckles1 Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 6. Heteroskedasticity.
Prof. Dr. Rainer Stachuletz
Econ Prof. Buckles1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
Specific to General Modelling The traditional approach to econometrics modelling was as follows: 1.Start with an equation based on economic theory. 2.Estimate.
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 6. Heteroskedasticity.
Chapter 4 Multiple Regression.
The Fundamental Problem of Causal Inference Alexander Tabarrok January 2007.
FIN357 Li1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
Multicollinearity Omitted Variables Bias is a problem when the omitted variable is an explanator of Y and correlated with X1 Including the omitted variable.
Met 2651 Instrument variabler (sider: 539,543, 544,545,549,550,551,559,560,561,564) Ulf H. Olsson Professor of Statistics.
Lecture 24: Thurs., April 8th
Chapter 11 Multiple Regression.
Economics 20 - Prof. Anderson
Topic4 Ordinary Least Squares. Suppose that X is a non-random variable Y is a random variable that is affected by X in a linear fashion and by the random.
Lecture 1 (Ch1, Ch2) Simple linear regression
Lecture 2 (Ch3) Multiple linear regression
FIN357 Li1 The Simple Regression Model y =  0 +  1 x + u.
1Prof. Dr. Rainer Stachuletz Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
Economics Prof. Buckles
1 In a second variation, we shall consider the model shown above. x is the rate of growth of productivity, assumed to be exogenous. w is now hypothesized.
12 Autocorrelation Serial Correlation exists when errors are correlated across periods -One source of serial correlation is misspecification of the model.
1 PROXY VARIABLES Suppose that a variable Y is hypothesized to depend on a set of explanatory variables X 2,..., X k as shown above, and suppose that for.
1 Research Method Lecture 6 (Ch7) Multiple regression with qualitative variables ©
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u.
1 Javier Aparicio División de Estudios Políticos, CIDE Primavera Regresión.
Endogeneity in Econometrics: Instrumental Variable Estimation
Chapter 8: Simple Linear Regression Yang Zhenlin.
1 Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
5. Consistency We cannot always achieve unbiasedness of estimators. -For example, σhat is not an unbiased estimator of σ -It is only consistent -Where.
ECON 3039 Labor Economics By Elliott Fan Economics, NTU Elliott Fan: Labor 2015 Fall Lecture 91.
8-1 MGMG 522 : Session #8 Heteroskedasticity (Ch. 10)
INSTRUMENTAL VARIABLES 1 Suppose that you have a model in which Y is determined by X but you have reason to believe that Assumption B.7 is invalid and.
10-1 MGMG 522 : Session #10 Simultaneous Equations (Ch. 14 & the Appendix 14.6)
Economics 20 - Prof. Anderson1 Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
The Instrumental Variables Estimator The instrumental variables (IV) estimator is an alternative to Ordinary Least Squares (OLS) which generates consistent.
Heteroscedasticity Heteroscedasticity is present if the variance of the error term is not a constant. This is most commonly a problem when dealing with.
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
Esman M. Nyamongo Central Bank of Kenya
ECON 4009 Labor Economics 2017 Fall By Elliott Fan Economics, NTU
Instrumental Variable (IV) Regression
Simultaneous equation system
STOCHASTIC REGRESSORS AND THE METHOD OF INSTRUMENTAL VARIABLES
Instrumental Variables and Two Stage Least Squares
Pooling Cross Sections across Time: Simple Panel Data Methods
Serial Correlation and Heteroskedasticity in Time Series Regressions
Instrumental Variables and Two Stage Least Squares
Migration and the Labour Market
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Instrumental Variables and Two Stage Least Squares
Linear Panel Data Models
Instrumental Variables Estimation and Two Stage Least Squares
Instrumental Variables
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Presentation transcript:

1 Research Method Lecture 11-1 (Ch15) Instrumental Variables Estimation and Two Stage Least Square ©

Motivation One explanatory variable case 4Consider the following regression. 2 4Since ability is not observed, we can only run the following regression. 4Since ability is correlated with educ, educ is endogenous (i.e, correlated with u). Thus, will be biased.

4We learned two methods to eliminate the bias. (1)Plug in the proxy variable for ability, such as IQ. (2)Use panel data method (either the fixed effect or the first differenced model). 4Instrumental variable method is another method to eliminate the bias. 3

Instrumental variable method: One explanatory variable case. 4Consider the following model. 4Suppose that x is endogenous, that is cov(x, u)≠0. 4Further, suppose that you have another variable, z, which satisfies the following conditions. Cov(z,u)=0 (instrument exogeneity) …....(1) Cov(z,x)≠0 (instrument relevance)………(2) 4If the above conditions are satisfied, we call z an instrumental variable. 4

4There are two ways to intuitively understand these conditions. 1.Instrumental variable is a variable that is not correlated with the omitted variable, but is correlated with the endogenous explanatory variable. 2.Instrumental variable is a variable that affects y only through x. 5

4The condition Cov(z,u)=0 involves unobserved u. Therefore, we cannot test this condition. (When you have extra instrumental variables, you can test this. This will be discussed later). 4The condition Cov(z,x)≠0 is easy to test. Just runt the following OLS, x=π 0 +π 1 z+v then test H 0 :π 1 =0 6

Instrumental variable estimation: One explanatory variable-one instrument case 4Now, consider 4Then we have Cov(z,y)= Cov(z,β 0 +β 1 x+u) So we have, Cov(z,y)= β 1 Cov(z,x)+Cov(z,u) Since Cov(z,u)=0, we have 7

 By replacing Cov(z,y) and Cov(z,x) with their sample covariances, we have the instrumental variable estimator of β 1 which is given by  You can easily show that is a consistent estimator of β 1. 8

Statistical inference with IV: Homoskedasticity case 4Homoskedasticity assumption in the case of IV regression is stated in terms of z. E(u 2 |z)=σ 2  It can be shown that the asymptotic variance of is given by: where is the variance of x, and is the correlation between x and z. 9

 Now, the estimator of var( ) is obtained by replacing σ 2,, and with their sample estimates.  Sample estimator of σ 2 is obtained in the following way. First, obtain the IV estimates for β 0 and β 1, then compute The estimator for σ 2 is then computed as 10

4The sample estimator for is given as: 4Finally, sample estimator for can be most easily obtained in the following way. First, regress x on z. Then the R-squared from this regression equals the square of the sample correlation. Let call this R 2 x,z. (Off course, you can compute the sample correlation and raise it by power 2. You will get the same result). 11

 Then, the estimator for the variance of is given by: 4You can show that this is a consistent estimator of the asymptotic variance given by (5). 12

Note: R-squared in IV regression 4The R-squared for IV regression is computed as R 2 =1-SSR/SST Where SSR is the sum of the squared IV residuals. (The IV residual is given by (6)). Unlike in the case of OLS, SSR can be greater than SST. Thus, R 2 can be negative. In IV regression, R 2 does not have a natural interpretation. 13

Finding the instrumental variable 4The most difficult part of the instrumental variable estimation is to find suitable instrumental variables. 4Consider the following regression 4Then, you have to find z that is correlated with educ, but not correlated with abil. What can be z? 14

4Consider the father’s education. Perhaps a person whose father is highly educated tends to take more education as well. So the father’s education is likely correlated with educ. 4But, for father’s education to be an instrument, this should not be correlated with the unobserved ability. A highly educated father may nurture his child better, so father’s education may be correlated with the unobserved ability. If this is the case, father’s education is not a good instrument. 4Nonetheless, many studies have used father’s and mother’s education as instruments. 15

Exercises 1. Run the following regression using OLS, using MROZ.dat 2. Using the father’s education as an instrument for edu, estimate the same model using IV regression. Also check if father’s education is correlated with educ. 16

17 OLS IV regression

18 Check if father’s education is correlated with educ.

An application Angrist and Krueger (1991), “Does Compulsory School Attendance Affect Schooling and Earning?” They used the quarter of the birth dummy as an instrument for education to estimate the effect of education on wage. 19

4In the US, the compulsory schooling law requires students to remain in school until their 16th birthday. 4At the same time, schools usually requires Children to be 6 years old on January 1 st to be admitted to school. Therefore, children who were born in the first quarter were older than children who were born in the last quarter when they were first admitted to schools (6.45 v.s years). 20

4This also means that children who were born in the first quarter of the year has shorter schooling when they reach the legal drop out age. So, children who were born in the first quarter can legally drop out of school with less education than children who were born in other quarters. 4If some people want to take as little education as possible but are constrained by the compulsory schooling law, the quarter of birth should affect the education attainment. 21

4At the same time, the quarter of birth is unlikely to be correlated with the unobserved ability. 4Therefore, the dummy variable indicating if a person was born in the first quarter of the year is a good instrument for education. 22

23 Those born in the first quarter of the year tend to have lower education attainment

24 This is the IV regression