Error Component Models Methods of Economic Investigation Lecture 8 1.

Slides:



Advertisements
Similar presentations
Questions From Yesterday
Advertisements

Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
Economics 20 - Prof. Anderson1 Panel Data Methods y it = x it k x itk + u it.
Economics 20 - Prof. Anderson
Multiple Regression Analysis
Review of Identifying Causal Effects Methods of Economic Investigation Lecture 13.
Methods of Economic Investigation Lecture 2
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Lecture 8 (Ch14) Advanced Panel Data Method
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Omitted Variable Bias Methods of Economic Investigation Lecture 7 1.
Econ 140 Lecture 151 Multiple Regression Applications Lecture 15.
Lecture 4 Econ 488. Ordinary Least Squares (OLS) Objective of OLS  Minimize the sum of squared residuals: where Remember that OLS is not the only possible.
8.4 Weighted Least Squares Estimation Before the existence of heteroskedasticity-robust statistics, one needed to know the form of heteroskedasticity -Het.
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
CHAPTER 3 ECONOMETRICS x x x x x Chapter 2: Estimating the parameters of a linear regression model. Y i = b 1 + b 2 X i + e i Using OLS Chapter 3: Testing.
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 6. Heteroskedasticity.
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Econ Prof. Buckles1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 6. Heteroskedasticity.
FIN357 Li1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
Econ 140 Lecture 181 Multiple Regression Applications III Lecture 18.
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
Multiple Regression Analysis
1Prof. Dr. Rainer Stachuletz Fixed Effects Estimation When there is an observed fixed effect, an alternative to first differences is fixed effects estimation.
FIN357 Li1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
Statistical Analysis SC504/HS927 Spring Term 2008 Session 7: Week 23: 7 th March 2008 Complex independent variables and regression diagnostics.
Economics 20 - Prof. Anderson1 Fixed Effects Estimation When there is an observed fixed effect, an alternative to first differences is fixed effects estimation.
Economics Prof. Buckles
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
1Prof. Dr. Rainer Stachuletz Panel Data Methods y it =  0 +  1 x it  k x itk + u it.
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
3. Multiple Regression Analysis: Estimation -Although bivariate linear regressions are sometimes useful, they are often unrealistic -SLR.4, that all factors.
Specification Error I.
Introduction 1 Panel Data Analysis. And now for… Panel Data! Panel data has both a time series and cross- section component Observe same (eg) people over.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u.
Public Policy Analysis ECON 3386 Anant Nyshadham.
Application 3: Estimating the Effect of Education on Earnings Methods of Economic Investigation Lecture 9 1.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
Lecture 7: What is Regression Analysis? BUEC 333 Summer 2009 Simon Woodcock.
Application 2: Minnesota Domestic Violence Experiment Methods of Economic Investigation Lecture 6.
Copyright © 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin The Two-Variable Model: Hypothesis Testing chapter seven.
1 Javier Aparicio División de Estudios Políticos, CIDE Primavera Regresión.
Instrumental Variables: Introduction Methods of Economic Investigation Lecture 14.
1 Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 1. Estimation.
8-1 MGMG 522 : Session #8 Heteroskedasticity (Ch. 10)
1 Regression Review Population Vs. Sample Regression Line Residual and Standard Error of Regression Interpretation of intercept & slope T-test, F-test.
Quantitative Methods. Bivariate Regression (OLS) We’ll start with OLS regression. Stands for  Ordinary Least Squares Regression. Relatively basic multivariate.
R. Kass/W03 P416 Lecture 5 l Suppose we are trying to measure the true value of some quantity (x T ). u We make repeated measurements of this quantity.
4-1 MGMG 522 : Session #4 Choosing the Independent Variables and a Functional Form (Ch. 6 & 7)
Experimental Evaluations Methods of Economic Investigation Lecture 4.
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
Multiple Regression Analysis: Estimation
Econometrics ITFD Week 8.
Difference-in-Differences
PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
Multiple Regression Analysis
Advanced Panel Data Methods
Chapter 6: MULTIPLE REGRESSION ANALYSIS
The Regression Model Suppose we wish to estimate the parameters of the following relationship: A common method is to choose parameters to minimise the.
Economics 20 - Prof. Anderson
Instrumental Variables
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Violations of Assumptions In Least Squares Regression
Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.
Violations of Assumptions In Least Squares Regression
Advanced Panel Data Methods
Presentation transcript:

Error Component Models Methods of Economic Investigation Lecture 8 1

Today’s Lecture  Review of Omitted Variables Bias  Error component models Fixed Effects Random Effects 2

Omitted Variable Bias Formula  True Relationship Y = α + βT +γX+ε  We estimate  We get a bias from the omitted variable 3

A specific form of this…  Suppose we had to estimate  Where the error term can be decomposed into: Individual specific factor Going to set this to zero for ease of exposition Group specific factor Random error term uncorrelated with everything else 4

What are the problems with this?  Errors have cross-correlation across terms OLS is not efficient Still get consistent estimates, but may be harder to do inference  If X is correlated with γ or δ and we cannot observe this factor, then our OLS estimates will be biased 5

Unobserved effects are not correlated with X’s  This can happen if the group effect is ‘random’ Maybe different geographic areas, not correlated with X Different within-group correlations, violate OLS assumptions that errors uncorrelated with each other  Don’t have to worry about OVB  Still have problems with standard error OLS not efficient Can’t do inference with our estimates 6

OLS standard errors  Usually estimate  Standard way to estimate variance of OLS estimates not valid—won’t necessarily be the same across j’s: 7

Robust Standard Errors  Estimate the residuals:  Then estimate standard errors  Consistent, but may be able to do better… 8

Exploiting the Random Effects Structure  We have more information than we’re using in the robust standard errors  The composite error means there is correlation within groups j but not between groups  Define where 9

FGLS Estimates—2 steps  Estimate OLS and get  Get weighting matrix  Can now adjust the regression for the known correlation 10

FGLS Variance Estimate  With a variance  If our model is correctly specified, this will simplify to: 11

 Simplify, let  So we need to estimate:  When might this happen? Individuals are part of some “group” and that group has a unique relationship to the outcomes (relative to other groups) Looking in a time period—that time period is related to the outcome differently than other time periods Unobserved effects are correlated with X’s 12

What does a group specific term do? X Y 13

How might we fix this?  If we could observe multiple individuals in a group j, then we could difference out the group effect  So for example: look at individuals 1 & 2, both of whom are in group 1 14

Differencing out Group Effects  Difference between individuals 1 and 2, gives us the following estimate:  How do we interpret this? β tells us how a change in the difference in X’s between people 1 and 2, changes the difference in outcomes between people 1 and two Often don’t really care about difference between 2 people… 15

Average Group Effect  Take the sample average outcome for each group  Crucial assumption: average group effect is the group effect  This means: NO within group variation 16

Estimating Deviation from the Average Group Effect  Difference individual and group average equations:  Define ~ terms as deviation from means so we estimate:  So we can estimate this with our usual OLS (more on the standard errors in a few minutes…) 17

How do ‘fixed effects’ help?  Define δ j =1 for group j and zero for all groups  Then we estimate:  And we will get an estimate for β 1, our effect of X on Y, and δ our group effect 18

Estimation  To see how, imagine we have two groups: men and women  If we estimated two equations then its:  We won’t recover β 0 separate from the δ’s constant, c m constant, c f 19

Multiple Equation Estimation  Let men be the base group, then for men we estimate:  What does this give us? CEF: E[Y| X, j=m] = δ m + β*E[X | j=m] δ m is the intercept for men or E[Y |X=0, j=m] If X is de-meaned, then this is the average effect for men at the mean of the sample 20

Why estimate a single equation?  Single equation easy to estimate: run a regression for each group separately BUT Hard to do inference on size of group effect Lose some power if, conditional on group—the effect of various X’s is the same  For example: Men and women on average have different starting wages Conditional on starting level, each year of additional experience, increases wages by some fixed % 21

Single equation estimation  Now estimate for everyone:  For men this estimates  For women Constant: E[Y |j=m, X=0] Additional “female” effect 22

How to interpret your “fixed effect”?  The additional effect of women, relative to men  Do we care about the fixed effects? Sometimes they have an interpretation e.g. controlling for all observable factors, is there a different level of wages for men and women Sometimes they are a just differencing out a bunch of things, and don’t have an interpretation, e.g. repeated observations on an individual, difference out an individual effect— not much interpretation for idiosyncratic individual effect 23

What did we learn today  When have unobserved group effects can be two issues: Uncorrelated with X’s: OLS not efficient, can fix this with GLS Correlated with X’s: OVB, can include “fixed effects”  Fixed effects, within-group differences, and deviation from means differences can all remove bias from unobserved group effect 24

Next Class  Application: The effect of Schooling on wages Ability Bias Fixing this with “twins” and “siblings” models 25