When should you use fixed effects estimation rather than random effects estimation, or vice versa? FIXED EFFECTS OR RANDOM EFFECTS? 1 NLSY 1980–1996 Dependent.

Slides:



Advertisements
Similar presentations
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: a Monte Carlo experiment Original citation: Dougherty, C. (2012) EC220.
Advertisements

1 THE DISTURBANCE TERM IN LOGARITHMIC MODELS Thus far, nothing has been said about the disturbance term in nonlinear regression models.
EC220 - Introduction to econometrics (chapter 7)
1 XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: testing a hypothesis relating to a regression coefficient Original citation:
HETEROSCEDASTICITY-CONSISTENT STANDARD ERRORS 1 Heteroscedasticity causes OLS standard errors to be biased is finite samples. However it can be demonstrated.
1 PROBABILITY DISTRIBUTION EXAMPLE: X IS THE SUM OF TWO DICE red This sequence provides an example of a discrete random variable. Suppose that you.
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
MEASUREMENT ERROR 1 In this sequence we will investigate the consequences of measurement errors in the variables in a regression model. To keep the analysis.
1 ASSUMPTIONS FOR MODEL C: REGRESSIONS WITH TIME SERIES DATA Assumptions C.1, C.3, C.4, C.5, and C.8, and the consequences of their violations are the.
EC220 - Introduction to econometrics (chapter 9)
00  sd  0 –sd  0 –1.96sd  0 +sd 2.5% CONFIDENCE INTERVALS probability density function of X null hypothesis H 0 :  =  0 In the sequence.
EXPECTED VALUE OF A RANDOM VARIABLE 1 The expected value of a random variable, also known as its population mean, is the weighted average of its possible.
TESTING A HYPOTHESIS RELATING TO THE POPULATION MEAN 1 This sequence describes the testing of a hypothesis at the 5% and 1% significance levels. It also.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: confidence intervals Original citation: Dougherty, C. (2012) EC220.
EC220 - Introduction to econometrics (review chapter)
TESTING A HYPOTHESIS RELATING TO A REGRESSION COEFFICIENT This sequence describes the testing of a hypotheses relating to regression coefficients. It is.
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
SLOPE DUMMY VARIABLES 1 The scatter diagram shows the data for the 74 schools in Shanghai and the cost functions derived from a regression of COST on N.
1 In the previous sequence, we were performing what are described as two-sided t tests. These are appropriate when we have no information about the alternative.
Cross-sectional:Observations on individuals, households, enterprises, countries, etc at one moment in time (Chapters 1–10, Models A and B). 1 During this.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: maximum likelihood estimation of regression coefficients Original citation:
1 In a second variation, we shall consider the model shown above. x is the rate of growth of productivity, assumed to be exogenous. w is now hypothesized.
1 This sequence shows why OLS is likely to yield inconsistent estimates in models composed of two or more simultaneous relationships. SIMULTANEOUS EQUATIONS.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
1 UNBIASEDNESS AND EFFICIENCY Much of the analysis in this course will be concerned with three properties of estimators: unbiasedness, efficiency, and.
FIXED EFFECTS REGRESSIONS: WITHIN-GROUPS METHOD The two main approaches to the fitting of models using panel data are known, for reasons that will be explained.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: sampling and estimators Original citation: Dougherty, C. (2012)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 12) Slideshow: autocorrelation, partial adjustment, and adaptive expectations Original.
THE DUMMY VARIABLE TRAP 1 Suppose that you have a regression model with Y depending on a set of ordinary variables X 2,..., X k and a qualitative variable.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 8) Slideshow: measurement error Original citation: Dougherty, C. (2012) EC220 - Introduction.
THE FIXED AND RANDOM COMPONENTS OF A RANDOM VARIABLE 1 In this short sequence we shall decompose a random variable X into its fixed and random components.
1 General model with lagged variables Static model AR(1) model Model with lagged dependent variable Methodologically, in developing a regression specification.
Confidence intervals were treated at length in the Review chapter and their application to regression analysis presents no problems. We will not repeat.
CONSEQUENCES OF AUTOCORRELATION
ALTERNATIVE EXPRESSION FOR POPULATION VARIANCE 1 This sequence derives an alternative expression for the population variance of a random variable. It provides.
CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE
1 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN The diagram summarizes the procedure for performing a 5% significance test on the slope coefficient.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 7) Slideshow: weighted least squares and logarithmic regressions Original citation:
ASYMPTOTIC AND FINITE-SAMPLE DISTRIBUTIONS OF THE IV ESTIMATOR
EC220 - Introduction to econometrics (chapter 8)
MULTIPLE RESTRICTIONS AND ZERO RESTRICTIONS
F TEST OF GOODNESS OF FIT FOR THE WHOLE EQUATION 1 This sequence describes two F tests of goodness of fit in a multiple regression model. The first relates.
MULTIPLE REGRESSION WITH TWO EXPLANATORY VARIABLES: EXAMPLE 1 This sequence provides a geometrical interpretation of a multiple regression model with two.
TYPE II ERROR AND THE POWER OF A TEST A Type I error occurs when the null hypothesis is rejected when it is in fact true. A Type II error occurs when the.
A.1The model is linear in parameters and correctly specified. PROPERTIES OF THE MULTIPLE REGRESSION COEFFICIENTS 1 Moving from the simple to the multiple.
1 REGRESSION ANALYSIS WITH PANEL DATA: INTRODUCTION A panel data set, or longitudinal data set, is one where there are repeated observations on the same.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 9) Slideshow: instrumental variable estimation: variation Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: multiple restrictions and zero restrictions Original citation: Dougherty,
1 We will now look at the properties of the OLS regression estimators with the assumptions of Model B. We will do this within the context of the simple.
1 Y SIMPLE REGRESSION MODEL Suppose that a variable Y is a linear function of another variable X, with unknown parameters  1 and  2 that we wish to estimate.
1 We will continue with a variation on the basic model. We will now hypothesize that p is a function of m, the rate of growth of the money supply, as well.
Definition of, the expected value of a function of X : 1 EXPECTED VALUE OF A FUNCTION OF A RANDOM VARIABLE To find the expected value of a function of.
4 In our case, the starting point should be the model with all the lagged variables. DYNAMIC MODEL SPECIFICATION General model with lagged variables Static.
HETEROSCEDASTICITY 1 This sequence relates to Assumption A.4 of the regression model assumptions and introduces the topic of heteroscedasticity. This relates.
INSTRUMENTAL VARIABLES 1 Suppose that you have a model in which Y is determined by X but you have reason to believe that Assumption B.7 is invalid and.
1 INSTRUMENTAL VARIABLE ESTIMATION OF SIMULTANEOUS EQUATIONS In the previous sequence it was asserted that the reduced form equations have two important.
1 HETEROSCEDASTICITY: WEIGHTED AND LOGARITHMIC REGRESSIONS This sequence presents two methods for dealing with the problem of heteroscedasticity. We will.
1 ESTIMATORS OF VARIANCE, COVARIANCE, AND CORRELATION We have seen that the variance of a random variable X is given by the expression above. Variance.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: confidence intervals Original citation: Dougherty, C. (2012) EC220 -
1 We will illustrate the heteroscedasticity theory with a Monte Carlo simulation. HETEROSCEDASTICITY: MONTE CARLO ILLUSTRATION 1 standard deviation of.
WHITE TEST FOR HETEROSCEDASTICITY 1 The White test for heteroscedasticity looks for evidence of an association between the variance of the disturbance.
FOOTNOTE: THE COCHRANE–ORCUTT ITERATIVE PROCESS 1 We saw in the previous sequence that AR(1) autocorrelation could be eliminated by a simple manipulation.
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Presentation transcript:

When should you use fixed effects estimation rather than random effects estimation, or vice versa? FIXED EFFECTS OR RANDOM EFFECTS? 1 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

In principle random effects is more attractive because observed characteristics that remain constant for each individual are retained in the regression model. In fixed effects estimation, they have to be dropped. FIXED EFFECTS OR RANDOM EFFECTS? 2 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

Also with random effects estimation we do not lose n degrees of freedom, as is the case with fixed effects. FIXED EFFECTS OR RANDOM EFFECTS? 3 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

However if either of the preconditions for using random effects is violated, we should use fixed effects instead. FIXED EFFECTS OR RANDOM EFFECTS? 4 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

One precondition is that the observations can be described as being drawn randomly from a given population. This is a reasonable assumption in the case of the NLSY because it was designed to be a random sample. FIXED EFFECTS OR RANDOM EFFECTS? 5 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

By contrast, it would not be a reasonable assumption if the units of observation in the panel data set were countries and the sample consisted of those countries that are members of the Organization for Economic Cooperation and Development (OECD). FIXED EFFECTS OR RANDOM EFFECTS? 6 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

These countries certainly cannot be considered to represent a random sample of the 200- odd sovereign states in the world. FIXED EFFECTS OR RANDOM EFFECTS? 7 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

The other precondition is that the unobserved effect be distributed independently of the X j variables. How can we tell if this is the case? FIXED EFFECTS OR RANDOM EFFECTS? 8 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

The standard procedure is yet another implementation of the Durbin–Wu–Hausman test used to help us choose between OLS and IV estimation in models where there is suspected measurement error or simultaneous equations endogeneity. FIXED EFFECTS OR RANDOM EFFECTS? 9 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

The null hypothesis is that the  i are distributed independently of the X j. FIXED EFFECTS OR RANDOM EFFECTS? 10 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

If this is correct, both random effects and fixed effects are consistent, but fixed effects will be inefficient because, looking at it in its LSDV form, it involves estimating an unnecessary set of dummy variable coefficients. FIXED EFFECTS OR RANDOM EFFECTS? 11 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

If the null hypothesis is false, the random effects estimates will be subject to unobserved heterogeneity bias and will therefore differ systematically from the fixed effects estimates. FIXED EFFECTS OR RANDOM EFFECTS? 12 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

As in its other applications, the DWH test determines whether the estimates of the coefficients, taken as a group, are significantly different in the two regressions. FIXED EFFECTS OR RANDOM EFFECTS? 13 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

If any variables are dropped in the fixed effects regression, they are excluded from the test. Under the null hypothesis the test statistic has a chi-squared distribution. FIXED EFFECTS OR RANDOM EFFECTS? 14 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

In principle this should have degrees of freedom equal to the number of slope coefficients being compared, but for technical reasons that require matrix algebra for an explanation, the actual number may be lower. FIXED EFFECTS OR RANDOM EFFECTS? 15 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

A regression application that implements the test, such as Stata, should determine the actual number of degrees of freedom. FIXED EFFECTS OR RANDOM EFFECTS? 16 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

We will perform the test for the marriage-effect-on-earnings example. The fixed effects estimates, using the within-groups approach, of the coefficients of married men and soon- to-be married men are and 0.045, respectively. FIXED EFFECTS OR RANDOM EFFECTS? 17 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

The corresponding random effects estimates are considerably higher, and 0.060, inviting the suspicion that they may be inflated by unobserved heterogeneity. FIXED EFFECTS OR RANDOM EFFECTS? 18 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

The DWH test involves the comparison of 13 coefficients (those of MARRIED, SOONMARR, and 11 controls). The test statistic is FIXED EFFECTS OR RANDOM EFFECTS? 19 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

With 13 degrees of freedom the critical value of chi-squared at the 0.1 percent level is 34.5, so we definitely conclude that we should be using fixed effects estimation. FIXED EFFECTS OR RANDOM EFFECTS? 20 NLSY 1980–1996 Dependent variable logarithm of hourly earnings OLS Fixed effects Random effects Married –0.134– (0.007)(0.012)(0.010) Soon-to-be – –0.075 married(0.009)(0.010)(0.008)(0.009)(0.007) Single–––0.106 – –0.134 (0.012)(0.010) R DWH test – – – n 20,343 20,343 20,343 20,343 20,343

Suppose that the DWH test indicates that we can use random effects rather than fixed effects. FIXED EFFECTS OR RANDOM EFFECTS? 21 Random effects estimation

We should then consider whether there are any unobserved effects at all. It is just possible that the model has been so well specified that the disturbance term u consists of only the purely random component  it and there is no individual-specific  i term. FIXED EFFECTS OR RANDOM EFFECTS? 22 Random effects estimation

In this situation we should use pooled OLS, with two advantages. There is a gain in efficiency because we are not attempting to allow for non-existent within-groups autocorrelation. FIXED EFFECTS OR RANDOM EFFECTS? 23 Random effects estimation OLS

In addition we will be able to take advantage of the finite-sample properties of OLS, instead of having to rely on the asymptotic properties of random effects. FIXED EFFECTS OR RANDOM EFFECTS? 24 Random effects estimation OLS

Various tests have been developed to detect the presence of random effects. FIXED EFFECTS OR RANDOM EFFECTS? 25 Random effects estimation OLS

The most common, implemented in some regression applications, is the Breusch–Pagan Lagrange multiplier test, the test statistic having a chi-squared distribution with one degree of freedom under the null hypothesis of no random effects. FIXED EFFECTS OR RANDOM EFFECTS? 26 Random effects estimation OLS

In the case of the marriage effect example the statistic is very high indeed, 20,007, but in this case it is meaningless because are not able to use random effects estimation. FIXED EFFECTS OR RANDOM EFFECTS? 27 Random effects estimation OLS

A note on the random effects and fixed effects terminology. It is generally agreed that random effects/fixed effects terminology can be misleading, but that it is too late to change it now. FIXED EFFECTS OR RANDOM EFFECTS? 28 Random effects estimation OLS

It is natural to think that random effects estimation should be used when the unobserved effect can be characterized as being drawn randomly from a given population and that fixed effects should be used when the unobserved effect is non-random. FIXED EFFECTS OR RANDOM EFFECTS? 29 Random effects estimation OLS

The second part of that statement is correct. However the first part is correct only if the unobserved effect is distributed independently of the X j variables. If it is not, fixed effects should be used instead to avoid the problem of unobserved heterogeneity bias. FIXED EFFECTS OR RANDOM EFFECTS? 30 Random effects estimation OLS

The diagram summarizes the decision-making procedure. Make sure that you understand the reasons for making the choices. FIXED EFFECTS OR RANDOM EFFECTS? 31 Can the observations be described as being a random sample from a given population? Use fixed effectsPerform both fixed effects and random effects regressions. Does a DWH test indicate significant differences in the coefficients? Provisionally choose random effects. Does a test indicate the presence of random effects? Use fixed effects Use random effects Use pooled OLS YesNo Yes No YesNo

Copyright Christopher Dougherty These slideshows may be downloaded by anyone, anywhere for personal use. Subject to respect for copyright and, where appropriate, attribution, they may be used as a resource for teaching an econometrics course. There is no need to refer to the author. The content of this slideshow comes from Section 14.3 of C. Dougherty, Introduction to Econometrics, fourth edition 2011, Oxford University Press. Additional (free) resources for both students and instructors may be downloaded from the OUP Online Resource Centre Individuals studying econometrics on their own and who feel that they might benefit from participation in a formal course should consider the London School of Economics summer school course EC212 Introduction to Econometrics or the University of London International Programmes distance learning course 20 Elements of Econometrics