Heteroskedasticity Adapted from Vera Tabakova’s notes ECON 4551 Econometrics II Memorial University of Newfoundland.

Slides:



Advertisements
Similar presentations
Heteroskedasticity Hill et al Chapter 11. Predicting food expenditure Are we likely to be better at predicting food expenditure at: –low incomes; –high.
Advertisements

Applied Econometrics Second edition
Panel Data Models Prepared by Vera Tabakova, East Carolina University.
The Simple Regression Model
Endogenous Regressors and Instrumental Variables Estimation Adapted from Vera Tabakova, East Carolina University.
ECON 4551 Econometrics II Memorial University of Newfoundland
Prediction, Goodness-of-Fit, and Modeling Issues ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Heteroskedasticity Prepared by Vera Tabakova, East Carolina University.
Heteroskedasticity The Problem:
8. Heteroskedasticity We have already seen that homoskedasticity exists when the error term’s variance, conditional on all x variables, is constant: Homoskedasticity.
8.4 Weighted Least Squares Estimation Before the existence of heteroskedasticity-robust statistics, one needed to know the form of heteroskedasticity -Het.
Module II Lecture 6: Heteroscedasticity: Violation of Assumption 3
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
The Simple Linear Regression Model: Specification and Estimation
HETEROSKEDASTICITY Chapter 8.
Economics Prof. Buckles1 Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 6. Heteroskedasticity.
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 6. Heteroskedasticity.
Chapter 5 Heteroskedasticity. What is in this Chapter? How do we detect this problem What are the consequences of this problem? What are the solutions?
Review.
Economics Prof. Buckles
Chapter 12 Section 1 Inference for Linear Regression.
ECON 7710, Heteroskedasticity What is heteroskedasticity? What are the consequences? How is heteroskedasticity identified? How is heteroskedasticity.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Returning to Consumption
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Quantitative Methods Heteroskedasticity.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
12.1 Heteroskedasticity: Remedies Normality Assumption.
Chapter 5 Heteroskedasticity.
Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Heteroskedasticity ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Prediction, Goodness-of-Fit, and Modeling Issues Prepared by Vera Tabakova, East Carolina University.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
1 Javier Aparicio División de Estudios Políticos, CIDE Primavera Regresión.
The Simple Linear Regression Model: Specification and Estimation ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s.
EC 532 Advanced Econometrics Lecture 1 : Heteroscedasticity Prof. Burak Saltoglu.
Principles of Econometrics, 4t h EditionPage 1 Chapter 8: Heteroskedasticity Chapter 8 Heteroskedasticity Walter R. Paczkowski Rutgers University.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Dynamic Models, Autocorrelation and Forecasting ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Endogenous Regressors and Instrumental Variables Estimation Adapted from Vera Tabakova, East Carolina University.
8-1 MGMG 522 : Session #8 Heteroskedasticity (Ch. 10)
Chap 8 Heteroskedasticity
1 Heteroskedasticity. 2 The Nature of Heteroskedasticity  Heteroskedasticity is a systematic pattern in the errors where the variances of the errors.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
11.1 Heteroskedasticity: Nature and Detection Aims and Learning Objectives By the end of this session students should be able to: Explain the nature.
Heteroskedasticity ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Heteroscedasticity Chapter 8
Vera Tabakova, East Carolina University
Linear Regression with One Regression
Vera Tabakova, East Carolina University
Dynamic Models, Autocorrelation and Forecasting
The Simple Linear Regression Model: Specification and Estimation
Kakhramon Yusupov June 15th, :30pm – 3:00pm Session 3
REGRESSION DIAGNOSTIC II: HETEROSCEDASTICITY
Prediction, Goodness-of-Fit, and Modeling Issues
Review of Statistical Inference
Interval Estimation and Hypothesis Testing
Tutorial 1: Misspecification
Heteroskedasticity.
Heteroskedasticity.
Financial Econometrics Fin. 505
Presentation transcript:

Heteroskedasticity Adapted from Vera Tabakova’s notes ECON 4551 Econometrics II Memorial University of Newfoundland

 8.1 The Nature of Heteroskedasticity  8.2 Using the Least Squares Estimator  8.3 The Generalized Least Squares Estimator  8.4 Detecting Heteroskedasticity

Figure 8.1 Heteroskedastic Errors

Food expenditure example:

Figure 8.2 Least Squares Estimated Expenditure Function and Observed Data Points

The existence of heteroskedasticity implies:  The least squares estimator is still a linear and unbiased estimator, but it is no longer best. There is another estimator with a smaller variance.  The standard errors usually computed for the least squares estimator are incorrect. Confidence intervals and hypothesis tests that use these standard errors may be misleading.

We can use a robust estimator: GRETL offers several options…check the defaults

The existence of heteroskedasticity implies:  Why not use robust estimation all the time?  Well, that is a good idea for large samples but for small samples, homoskedasticity plus normality guarantees that the t ratios are distributed as t  But robust estimates do not guarantee that, so our inference could be misleading!  If you have a small sample, check whether there is homoskedasticity or not!

To obtain the best linear unbiased estimator for a model with heteroskedasticity of the type specified in equation (8.11): 1. Calculate the transformed variables given in (8.13). 2. Use least squares to estimate the transformed model given in (8.14).

The generalized least squares estimator is as a weighted least squares estimator. Minimizing the sum of squared transformed errors that is given by: When is small, the data contain more information about the regression function and the observations are weighted heavily. When is large, the data contain less information and the observations are weighted lightly.

Food example again, where was the problem coming from? regress food_exp income [aweight = 1/income]

The steps for obtaining a feasible generalized least squares estimator for are:  1.Estimate (8.25) by least squares and compute the squares of the least squares residuals.  2.Estimate by applying least squares to the equation

3.Compute variance estimates. 4.Compute the transformed observations defined by (8.23), including if. 5.Apply least squares to (8.24), or to an extended version of (8.24) if.

For our food expenditure example (GRETL: #Estimating the skedasticity function and GLS ols y const x genr lnsighat = log($uhat*$uhat) genr z = log(x) #Obtain prediction of variance: ols lnsighat const z genr predsighat = exp($yhat) #generate weights; genr w = 1/predsighat wls w y const x

For our food expenditure example (STATA): gen z = log(income) regress food_exp income predict ehat, residual gen lnehat2 = log(ehat*ehat) regress lnehat2 z * * Feasible GLS * predict sig2, xb gen wt = exp(sig2) regress food_exp income [aweight = 1/wt]

Using our wage data (cps2.dta): ???

Feasible generalized least squares: 1.Obtain estimated and by applying least squares separately to the metropolitan and rural observations Apply least squares to the transformed model

* * Rural subsample regression * regress wage educ exper if metro == 0 scalar rmse_r = e(rmse) scalar df_r = e(df_r) * * Urban subsample regression * regress wage educ exper if metro == 1 scalar rmse_m = e(rmse) scalar df_m = e(df_r) * * Groupwise heteroskedastic regression using FGLS * gen rural = 1 - metro gen wt=(rmse_r^2*rural) + (rmse_m^2*metro) regress wage educ exper metro [aweight = 1/wt] STATA Commands:

#Wage Example open "c:\Program Files\gretl\data\poe\cps2.gdt" ols wage const educ exper metro # Use only metro observations smpl metro --dummy ols wage const educ exper scalar stdm = $sigma #Restore the full sample smpl full GRETL Commands:

#Create a dummy variable for rural genr rural = 1-metro #Restrict sample to rural observations smpl rural --dummy ols wage const educ exper scalar stdr = $sigma #Restore the full sample smpl full GRETL Commands:

 #Generate standard deviations for each metro and rural obs  genr wm = metro*stdm  genr wr = rural*stdr  #Make the weights (reciprocal)  #Remember, Gretl's wls needs these to be variances so you'll need to square them  genr w = 1/(wm + wr)^2  #Weighted least squares  wls w wage const educ exper metro Principles of Econometrics, 3rd Edition

Remark: To implement the generalized least squares estimators described in this Section for three alternative heteroskedastic specifications, an assumption about the form of the heteroskedasticity is required. Using least squares with White standard errors avoids the need to make an assumption about the form of heteroskedasticity, but does not realize the potential efficiency gains from generalized least squares.

8.4.1Residual Plots  Estimate the model using least squares and plot the least squares residuals.  With more than one explanatory variable, plot the least squares residuals against each explanatory variable, or against, to see if those residuals vary in a systematic way relative to the specified variable.

8.4.2The Goldfeld-Quandt Test

STATA: * * Goldfeld Quandt test * scalar GQ = rmse_m^2/rmse_r^2 scalar crit = invFtail(df_m,df_r,.05) scalar pvalue = Ftail(df_m,df_r,GQ) scalar list GQ pvalue crit GRETL: #Goldfeld Quandt statistic scalar fstatistic = stdm^2/stdr^2

8.4.2The Goldfeld-Quandt Test More generally, the test can be based Simply on a continuous variable Split the sample in halves (usually omitting some from the middle) after ordering them according to the suspected variable (income in our food example)

8.4.2The Goldfeld-Quandt Test For the food expenditure data You should now be able to obtain this test statistic And check whether it exceeds the critical value Remember that you can probably use the one-tail version of this test Why?

8.4.3Testing the Variance Function For the mean: For the variance, in general: For example::

8.4.3Testing the Variance Function

S is the number of variables used

 This is a large sample test  It is a Lagrange Multiplier (LM) test, which are based on an auxiliary regression  In this case named after Breusch and Pagan  Here (and in the textbook) we saw a test statistic based on a linear function of the squared residual, but the good thing about this test is that this form can be used to test for any form of heteroskedasticity Principles of Econometrics, 3rd Edition

8.4.3a The White Test Since we may not know which variables explain heteroskedasticity…

8.4.3b Testing the Food Expenditure Example whitetst Or estat imtest, white Breusch-Pagan test White test STATA: GRETL: ols y const x modtest --breusch-pagan modtest –white

Slide 8-48 Principles of Econometrics, 3rd Edition  Breusch-Pagan test  generalized least squares  Goldfeld-Quandt test  heteroskedastic partition  heteroskedasticity  heteroskedasticity-consistent standard errors  homoskedasticity  Lagrange multiplier test  mean function  residual plot  transformed model  variance function  weighted least squares  White test

Slide 8-49 Principles of Econometrics, 3rd Edition

Slide 8-50 Principles of Econometrics, 3rd Edition (8A.1)

Slide 8-51 Principles of Econometrics, 3rd Edition

Slide 8-52 Principles of Econometrics, 3rd Edition (8A.2)

Slide 8-53 Principles of Econometrics, 3rd Edition (8A.3)

Slide 8-54 Principles of Econometrics, 3rd Edition (8B.2) (8B.1)

Slide 8-55 Principles of Econometrics, 3rd Edition (8B.4) (8B.3) (8B.5)

Slide 8-56 Principles of Econometrics, 3rd Edition (8B.6) (8B.7)

Slide 8-57 Principles of Econometrics, 3rd Edition (8B.8)