Lecture 20 Preview: Omitted Variables and the Instrumental Variables Estimation Procedure The Ordinary Least Squares Estimation Procedure, Omitted Explanatory.

Slides:



Advertisements
Similar presentations
Multiple Regression Analysis
Advertisements

The Simple Regression Model
CHAPTER 8 MULTIPLE REGRESSION ANALYSIS: THE PROBLEM OF INFERENCE
3.3 Omitted Variable Bias -When a valid variable is excluded, we UNDERSPECIFY THE MODEL and OLS estimates are biased -Consider the true population model:
Specifying an Econometric Equation and Specification Error
Lecture 9 Preview: One-Tailed Tests, Two-Tailed Tests, and Logarithms A One-Tailed Hypothesis Test: The Downward Sloping Demand Curve A Two-Tailed Hypothesis.
Specification Error II
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
MEASUREMENT ERROR 1 In this sequence we will investigate the consequences of measurement errors in the variables in a regression model. To keep the analysis.
Econ Prof. Buckles1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
Chapter 9 Simultaneous Equations Models. What is in this Chapter? In Chapter 4 we mentioned that one of the assumptions in the basic regression model.
Met 2651 Instrument variabler (sider: 539,543, 544,545,549,550,551,559,560,561,564) Ulf H. Olsson Professor of Statistics.
Econ 140 Lecture 191 Heteroskedasticity Lecture 19.
Topic 3: Regression.
Lecture 2 (Ch3) Multiple linear regression
FIN357 Li1 The Simple Regression Model y =  0 +  1 x + u.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 In a second variation, we shall consider the model shown above. x is the rate of growth of productivity, assumed to be exogenous. w is now hypothesized.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
Ordinary Least Squares
What does it mean? The variance of the error term is not constant
Specification Error I.
Chapter 10 Hetero- skedasticity Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
LECTURE 2. GENERALIZED LINEAR ECONOMETRIC MODEL AND METHODS OF ITS CONSTRUCTION.
Pure Serial Correlation
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Lecture 3 Preview: Interval Estimates and the Central Limit Theorem Review Populations, Samples, Estimation Procedures, and the Estimate’s Probability.
Managerial Economics Demand Estimation & Forecasting.
1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u.
Properties of OLS How Reliable is OLS?. Learning Objectives 1.Review of the idea that the OLS estimator is a random variable 2.How do we judge the quality.
1Spring 02 Problems in Regression Analysis Heteroscedasticity Violation of the constancy of the variance of the errors. Cross-sectional data Serial Correlation.
Lecture 7 Preview: Estimating the Variance of an Estimate’s Probability Distribution Review: Ordinary Least Squares (OLS) Estimation Procedure Importance.
3.4 The Components of the OLS Variances: Multicollinearity We see in (3.51) that the variance of B j hat depends on three factors: σ 2, SST j and R j 2.
Lecture 11 Preview: Hypothesis Testing and the Wald Test Wald Test Let Statistical Software Do the Work Testing the Significance of the “Entire” Model.
Chapter 2 Ordinary Least Squares Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
Chapter 4 The Classical Model Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
Lecture 12 Preview: Model Specification and Model Development Model Specification: Ramsey REgression Specification Error Test (RESET) RESET Logic Model.
Lecture 14: Omitted Explanatory Variables, Multicollinearity, and Irrelevant Explanatory Variables Omitted Explanatory Variables: “Too Few” Explanatory.
Lecture 6 Preview: Ordinary Least Squares Estimation Procedure  The Properties Clint’s Assignment: Assess the Effect of Studying on Quiz Scores General.
1 Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
5. Consistency We cannot always achieve unbiasedness of estimators. -For example, σhat is not an unbiased estimator of σ -It is only consistent -Where.
8-1 MGMG 522 : Session #8 Heteroskedasticity (Ch. 10)
INSTRUMENTAL VARIABLES 1 Suppose that you have a model in which Y is determined by X but you have reason to believe that Assumption B.7 is invalid and.
10-1 MGMG 522 : Session #10 Simultaneous Equations (Ch. 14 & the Appendix 14.6)
7-1 MGMG 522 : Session #7 Serial Correlation (Ch. 9)
The Instrumental Variables Estimator The instrumental variables (IV) estimator is an alternative to Ordinary Least Squares (OLS) which generates consistent.
Regression Overview. Definition The simple linear regression model is given by the linear equation where is the y-intercept for the population data, is.
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
Multiple Regression Analysis: Estimation
Revisit Omitted Explanatory Variable Bias
Lecture 21 Preview: Panel Data
Lecture 9 Preview: One-Tailed Tests, Two-Tailed Tests, and Logarithms
Lecture 8 Preview: Interval Estimates and Hypothesis Testing
Lecture 18 Preview: Explanatory Variable/Error Term Independence Premise, Consistency, and Instrumental Variables Review Regression Model Standard Ordinary.
Simultaneous equation system
STOCHASTIC REGRESSORS AND THE METHOD OF INSTRUMENTAL VARIABLES
Lecture 23 Preview: Simultaneous Equations – Identification
Pure Serial Correlation
Lecture 17 Preview: Autocorrelation (Serial Correlation)
Basic Estimation Techniques
Review: Explanatory Variable/Error Term Correlation and Bias
Chapter 6: MULTIPLE REGRESSION ANALYSIS
Lecture 22 Preview: Simultaneous Equation Models – Introduction
Lecture 16 Preview: Heteroskedasticity
Undergraduated Econometrics
Simple Linear Regression
Best Fitting Line Clint’s Assignment Simple Regression Model
Instrumental Variables Estimation and Two Stage Least Squares
Regression Part II.
Presentation transcript:

Lecture 20 Preview: Omitted Variables and the Instrumental Variables Estimation Procedure The Ordinary Least Squares Estimation Procedure, Omitted Explanatory Variable Bias, and Consistency Revisit Omitted Explanatory Variable Bias Review of Our Previous Explanation of Omitted Explanatory Variable Bias Omitted Explanatory Variable Bias and the Explanatory Variable/Error Term Independence Premise Instrumental Variable (IV) Estimation Procedure: A Two Regression Procedure Omitted Explanatory Variables Example: 2008 Presidential Election Justifying the Instrumental variable (IV) Estimation Procedure The Mechanics The Two “Good” Instrument Conditions Instrument Variable (IV) Application: 2008 Presidential Election The Mechanics The Two “Good” Instrument Conditions Revisited

x1 t Up  y t Up  Direct Effect  x1 > 0  x2 > 0  x1 and x2 positively correlated  Typically, x2 t Up  y t Up  Proxy Effect The direct effect is what we want the coefficient estimate to capture, the influence that x1 t has on y t. Review: Omitting an explanatory variable from a regression will bias the OLS estimation procedure whenever two conditions are met. Bias results if the omitted explanatory variable: influences the dependent variable; is correlated with an included explanatory variable. When these two conditions are met, the coefficient estimate of the included variable is a composite of two effects; the estimate of the included variable reflects the influence that the Direct Effect: the effect that the included explanatory variable actually has on the dependent variable. This is the effect we want regression analysis to isolate. Proxy Effect: the effect that the omitted explanatory variable has on the dependent variable because the included variable is acting as a proxy for the omitted explanatory variable. Revisit Omitted Variables Phenomenon x1 t and x2 t are positively correlated. To illustrate the effect of omitted variables, assume that  x1 > 0 and  x2 > 0 Model: y t =  Const +  x1 x1 t +  x2 x2 t + e t Question: What happens when we omit the explanatory variable x2 from the regression? Proxy effect causes upward bias. We will now see how we can approach the omitted variable phenomenon in a different way. Effect of omitting the explanatory variable x2: Intuitive Approach

Review: The Ordinary Least Squares (OLS) Estimation Procedure, Correlation between an Explanatory Variable and the Error Term, and Bias Explanatory Variable and Error Term Are Positively Correlated  Estimated Equation More Steeply Sloped Than Actual Equation  OLS Estimation Procedure for Coefficient Value Is Biased Up Explanatory Variable and Error Term Are Negatively Correlated  Estimated Equation Less Steeply Sloped Than Actual Equation  OLS Estimation Procedure for Coefficient Value Is Biased Down Esty = b Const + b x x

Effect of omitting the explanatory variable x2: Error term approach x1 t Up  t =  x2 x2 t + e t and  x2 > 0  x1 and x2 positively correlated  Typically, x2 t Up   t Up x1 t and x2 t are positively correlated.  x1 > 0 and  x2 > 0 y t =  Const +  x1 x1 t +  x2 x2 t + e t =  Const +  x1 x1 t + (  x2 x2 t + e t ) =  Const +  x1 x1 t +  t when x2 t is omitted, the error term becomes:  t =  x2 x2 t + e t  OLS estimation procedure for the value of x1’s coefficient is biased upward x1 t is positively correlated with  t Question: Several lectures ago, we showed that the ordinary least squares (OLS) estimation procedure for the included coefficient’s value is biased, but might the ordinary least squares (OLS) estimation procedure be consistent?

Review: Unbiased and Consistent Estimation Procedures Unbiased: Small Sample Property. The estimation procedure does not systematically underestimate or overestimate the actual value. Consistent: Large Sample Property. Both the mean and variance of the estimate’s probability distribution are important for consistency: Formally, the mean of the estimate’s probability distribution equals the actual value. Mean[Est] = Actual Value When the estimate’s probability distribution is symmetric, the chances that the estimate is greater than the actual value equal the chances that it is less. Mean of the estimate’s probability distribution: The estimation procedure is unbiased: Mean[Est] = Actual Value The estimation procedure is biased, but the magnitude of the bias diminishes as the sample size becomes larger. or Variance of the estimate’s probability distribution: The variance diminishes as the sample size becomes larger Formally, as the sample size approaches infinity the variance approaches 0: As Sample Size   : Variance[Est]  0 Formally, as the sample size approaches infinity the mean approaches the actual value: As Sample Size   : Mean[Est]  Actual Value Unbiasedness is called a small sample property because it does not depend on the sample size. Unbiasedness depends only of the mean of the estimate’s probability distribution. Either

 Lab 20.1 Question: Might the ordinary least squares (OLS) estimation procedure be consistent? EstimationCorrActualSampleActualMean ofMagnitudeVariance of ProcedureX1&X2Coef2SizeCoefCoef1 Estsof BiasCoef1 Ests OLS OLS OLS  4.4  2.4  6.6  4.4  2.4  3.2  4.4  2.4  2.1 Question: Is the OLS estimation procedure unbiased?No. Question: Is the OLS estimation procedure consistent?No. Summary: When an explanatory variable is omitted that influences the dependent variable; is correlated with an included explanatory variable. the ordinary least squares (OLS) estimation procedure for the value of the included variable’s coefficient is: biased. not consistent.

The Instrumental Variable (IV) Estimation Procedure y t =  Const +  x x t +  t When x t and  t are correlated  x t is a “problem” explanatory variable “Problem” Explanatory Variable: x t is the “problem” explanatory variable: The explanatory variable, x t, is correlated with the error term,  t. Consequently, the explanatory variable/error term independence premise is violated. The ordinary least squares (OLS) estimation procedure for the coefficient value is biased. Question: Why is there a problem?

Addressing the “Problem” Explanatory Variable Using Instrumental Variables Choose an Instrument: A “good” instrument, z t, must meet two conditions. The instrument, z t, must be: Correlated with the “problem” explanatory variable, x t. Instrumental Variables (IV) Regression 1: Use the instrument, z t, to provide an “estimate” of the problem explanatory variable, x t. Dependent Variable: “Problem” explanatory variable, x t. Explanatory Variable: Instrument, z t. Instrumental Variables (IV) Regression 2:In the original model, replace the “problem” explanatory variable, x t, with its surrogate, Estx t, the estimate of the “problem” explanatory variable provided by the instrument, z t, from IV Regression 1. Dependent Variable: Original dependent variable, y t. Explanatory Variable: Estimate of the “problem” explanatory variable based on the results from IV Regression 1, Estx t. Uncorrelated with the error term,  t. Estimate of the problem explanatory variable: Estx t = a Const + a z z t where a Const and a z are the estimates of the constant and coefficient in this regression, IV Regression 1.

Justifying the Instrument Conditions Instrument/”Problem” Explanatory Variable Correlation: The instrument, z t, must be correlated with the “problem” explanatory variable, x t. Estx t = a Const + a z z t We are using the instrument to create a surrogate for the “problem” explanatory variable: Use the instrument, z t, to provide an estimate of the “problem” explanatory variable, x t. The estimate, Estx t, will be a “good” surrogate only if the instrument is correlated with the problem explanatory variable. This will occur only if the instrument, z t, is correlated with the “problem” explanatory variable, x t. Dependent Variable: “Problem” Explanatory Variable, x t. Explanatory Variable: Instrument, z t Focus on Instrumental Variables (IV) Regression 1:

Original Model: Replace the “problem” explanatory variable with its surrogate To avoid violating the explanatory variable/error term independence premise z t and  t must be independent. Estx t = a Const + a z z t y t =  Const +  x x t +  t y t =  Const +  x Estx t +  t  Estx t and  t must be independent Instrument/Error Term Independence: The instrument, z t, must be independent of the error term,  t. Focus on Instrumental Variables (IV) Regression 2 In the original model, replace the “problem” explanatory variable, x t, with its surrogate, Estx t, the estimate of the “problem” explanatory variable provided by the instrument, z t, from IV Regression 1.

Instrumental Variable Example: 2008 Presidential Election AdvDeg t Percent adults who have advanced degrees in state t Coll t Percent adults who graduated from college in state t HS t Percent adults who graduated from high school in state t PopDen t Population density of state t (persons per square mile) VoteDemPartyTwo t Percent of the vote received in 2008 received by the Democratic party in state t based on the two major parties (percent)  EViews Population Density Theory: States with high population densities have large urban areas which are more likely to vote for the Democratic candidate, Obama. “Liberalness” Theory: Since the Democratic party is more liberal than the Republican party, a high “liberalness” value would increase the vote of the Democratic candidate, Obama. Model: VoteDemPartyTwo t =  Const +  PopDen PopDen t +  Lib Liberal + e t  PopDen > 0  Lib > 0 But we have no data for a state’s “liberalness”; hence, we have no choice – we must omit it. Ordinary Least Squares (OLS) Dependent Variable: VoteDemPartyTwo Explanatory Variable(s):EstimateSEt-StatisticProb PopDen Const Number of Observations51

Model: VoteDemPartyTwo t =  Const +  PopDen PopDen t +  Lib Liberal + e t Question: Might there be a serious econometric problem when Liberal is omitted? =  Const +  PopDen PopDen t + (  Lib Liberal + e t ) =  Const +  PopDen PopDen t +  t where  t =  Lib Liberal + e t PopDen t Up  t =  Lib Liberal + e t and  Lib > 0 PopDen t and Liberal t positively correlated Typically, Liberal t Up  t Up  OLS estimation procedure for PopDen’s coefficient is biased PopDen t is positively correlated with  t PopDen t is the “problem” explanatory variable. Summary: When Liberal is omitted PopDen is a “problem” explanatory variable. It is correlated with the error term,  t. The explanatory variable/error term independence premise is violated.

Applying the Instrumental Variable (IV) Approach Choose an Instrument: We shall choose Coll t as our instrument. Instrumental Variables (IV) Regression 1  EViews uncorrelated with the error term,  t =  Lib Liberal t + e t. Consequently, we believe that the instrument, Coll t, is uncorrelated with the omitted variable, Liberal t. By doing so, we believe that it satisfies the two “good” instrument conditions; that is, we believe that the percent of college graduates, Coll t, is: positively correlated with the “problem” explanatory variable, PopDen t Dependent variable: “Problem” explanatory variable, PopDen Explanatory variable: Instrument, Coll. Estimated Equation: EstPopDen =  3, Coll Ordinary Least Squares (OLS) Dependent Variable: PopDen Explanatory Variable(s):EstimateSEt-StatisticProb Coll Const   Number of Observations51

Ordinary Least Squares (OLS) Dependent Variable: VoteDemPartyTwo Explanatory Variable(s):EstimateSEt-StatisticProb EstPopDen Const Number of Observations51 Instrumental Variables (IV) Regression 2 Dependent Variable: Original dependent variable, VoteDemPartyTwo. Explanatory Variable: Estimate of the “problem” explanatory variable based on the results from Regression 1, EstPopDen. EstPopDen =  3, Coll  EViews Estimated Equation: VoteDemPartyTwo = EstPopDen

Ordinary Least Squares (OLS) Dependent Variable: PopDen Explanatory Variable(s):EstimateSEt-StatisticProb Coll Const   Number of Observations51 Correlation Matrix CollPopDen Coll PopDen Good Instrument Conditions Revisited Instrument/”Problem” Explanatory Variable Correlation: The instrument, Coll t, must be correlated with the “problem” explanatory variable, PopDen t. Their correlation coefficient equals.60 The coefficient estimate is significant at the 1 percent level. Consequently, it is not unreasonable to judge that the instrument meets the first condition. Instrumental Variables (IV) Regression 1

Instrument/Error Term Independence: The instrument, Coll t, and the error term,  t, must be independent. VoteDemPartyTwo t =  Const +  PopDen EstPopDen t +  t Question: Are EstPopDen t and  t independent? EstPopDen =  3, Coll  t =  Lib Liberal + e t Answer: Only if Coll t and Liberal t are independent. The explanatory variable/error term independence premise will be satisfied only if the instrument, Coll t, and the omitted variable, Liberal t, are independent. Unfortunately, there is no way to confirm this empirically with our data. This can be the “Achilles heel” of the instrumental variable (IV) estimation procedure.

Justifying the Instrumental Variable (IV) Approach: A Simulation Model: y t =  Const +  x1 x1 t +  x2 x2 t + e t Defaults The Only X1 button is selected. Hence, second explanatory variable, x2, is omitted. y t =  Const +  x1 x1 t + (  x2 x2 t + e t ) =  Const +  x1 x1 t +  t where  t =  x2 x2 t + e t The actual coefficient for the omitted explanatory variable, x2 t, equals 4. In the CorrX1&Z list.50 is selected. Hence, the included explanatory variable and the instrument are correlated. In the CorrX2&Z list.00 is selected. Hence, the instrument and the omitted variable are uncorrelated; consequently, the instrument and the error term are independent. In the CorrX1&X2 list.60 is selected. Hence, the included and omitted explanatory variables are correlated. Corr X1&Z Corr X2&Z Corr X1&X

IV IV IV IV IV IV  1.82 .18  32.8  1.89 .11  13.7  1.95 .05  9.2 Question: Is the IV estimation procedure: Unbiased?Consistent?No.Yes.  1.97 .03  3.9 Question: Is the instrument better?Yes. Both the magnitude of the bias and the variance of the estimates is less.  2.67 .67  31.1  2.70 .70  13.8  2.73 .73  8.7 Question: Is the IV estimation procedure: Unbiased?No.Consistent?No. EstimationCorrelation CoefficientsSampleActualMean ofMagnitudeVar of ProcedureX1&ZX2&ZX1&X2SizeCoef1Coef1 Estsof BiasCoef1 Ests IV EstimationCorrelation CoefficientsSampleActualMean ofMagnitudeVar of ProcedureX1&ZX2&ZX1&X2SizeCoef1Coef1 Estsof BiasCoef1 Ests Instrument/Error Term Independence: Suppose that the instrument is correlated with the error term? Instrument/”Problem” Explanatory Variable Correlation: Suppose that the instrument is more highly correlated with the “problem” explanatory variable:  Lab 20.2  Lab 20.3  Lab 20.2