Econ 140 Lecture 191 Heteroskedasticity Lecture 19.

Slides:



Advertisements
Similar presentations
Autocorrelation and Heteroskedasticity
Advertisements

Heteroskedasticity Hill et al Chapter 11. Predicting food expenditure Are we likely to be better at predicting food expenditure at: –low incomes; –high.
Applied Econometrics Second edition
Heteroskedasticity Lecture 17 Lecture 17.
Autocorrelation Lecture 20 Lecture 20.
Multivariate Regression
The Simple Regression Model
Homoscedasticity equal error variance. One of the assumption of OLS regression is that error terms have a constant variance across all value so f independent.
Regression Analysis Simple Regression. y = mx + b y = a + bx.
Econ 140 Lecture 81 Classical Regression II Lecture 8.
Heteroskedasticity Prepared by Vera Tabakova, East Carolina University.
Multicollinearity Multicollinearity - violation of the assumption that no independent variable is a perfect linear function of one or more other independent.
Objectives (BPS chapter 24)
8. Heteroskedasticity We have already seen that homoskedasticity exists when the error term’s variance, conditional on all x variables, is constant: Homoskedasticity.
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Classical Regression III
Econ 140 Lecture 121 Prediction and Fit Lecture 12.
2.5 Variances of the OLS Estimators
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 6. Heteroskedasticity.
Econ 140 Lecture 131 Multiple Regression Models Lecture 13.
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 6. Heteroskedasticity.
Chapter 5 Heteroskedasticity. What is in this Chapter? How do we detect this problem What are the consequences of this problem? What are the solutions?
Econ 140 Lecture 71 Classical Regression Lecture 7.
Multiple Regression Models
Econ 140 Lecture 181 Multiple Regression Applications III Lecture 18.
Multiple Regression Applications
T-test.
Chapter 11 Multiple Regression.
Inference about a Mean Part II
Topic 3: Regression.
Review.
Economics Prof. Buckles
Autocorrelation Lecture 18 Lecture 18.
Multiple Linear Regression Analysis
ECON 7710, Heteroskedasticity What is heteroskedasticity? What are the consequences? How is heteroskedasticity identified? How is heteroskedasticity.
Inference for regression - Simple linear regression
Hypothesis Testing in Linear Regression Analysis
Returning to Consumption
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
What does it mean? The variance of the error term is not constant
Specification Error I.
Chapter 10 Hetero- skedasticity Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
12.1 Heteroskedasticity: Remedies Normality Assumption.
Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
2.4 Units of Measurement and Functional Form -Two important econometric issues are: 1) Changing measurement -When does scaling variables have an effect.
VI. Regression Analysis A. Simple Linear Regression 1. Scatter Plots Regression analysis is best taught via an example. Pencil lead is a ceramic material.
1 Javier Aparicio División de Estudios Políticos, CIDE Primavera Regresión.
EC 532 Advanced Econometrics Lecture 1 : Heteroscedasticity Prof. Burak Saltoglu.
Principles of Econometrics, 4t h EditionPage 1 Chapter 8: Heteroskedasticity Chapter 8 Heteroskedasticity Walter R. Paczkowski Rutgers University.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
1 Heteroskedasticity. 2 The Nature of Heteroskedasticity  Heteroskedasticity is a systematic pattern in the errors where the variances of the errors.
Quantitative Methods. Bivariate Regression (OLS) We’ll start with OLS regression. Stands for  Ordinary Least Squares Regression. Relatively basic multivariate.
11.1 Heteroskedasticity: Nature and Detection Aims and Learning Objectives By the end of this session students should be able to: Explain the nature.
4-1 MGMG 522 : Session #4 Choosing the Independent Variables and a Functional Form (Ch. 6 & 7)
Ch. 2: The Simple Regression Model
Kakhramon Yusupov June 15th, :30pm – 3:00pm Session 3
REGRESSION DIAGNOSTIC II: HETEROSCEDASTICITY
Fundamentals of regression analysis 2
Ch. 2: The Simple Regression Model
Autocorrelation.
The Regression Model Suppose we wish to estimate the parameters of the following relationship: A common method is to choose parameters to minimise the.
REGRESSION DIAGNOSTIC I: MULTICOLLINEARITY
Simple Linear Regression
Heteroskedasticity.
Chapter 7: The Normality Assumption and Inference with OLS
The Simple Regression Model
Autocorrelation.
Heteroskedasticity.
Presentation transcript:

Econ 140 Lecture 191 Heteroskedasticity Lecture 19

Econ 140 Lecture 192 Today’s plan How to test for it: graphs, Park and Glejser tests What we can do if we find heteroskedasticity How to estimate in the presence of heteroskedasticity

Econ 140 Lecture 193 Palm Beach County revisited How far is Palm Beach an outlier? –Can the outlier be explained by heteroskedasticity? –If so, what are the consequences? Heteroskedasticity will affect the variance of the regression line –It will consequently affect the variance of the estimated coefficients L19.XLS provides an example of how to work through a problem like this using Excel

Econ 140 Lecture 194 Palm Beach County revisited (2) Palm Beach is a good example to use since there are scale effects in the data –The voting pattern shows that the voting behavior and number of registered voters are related to the population in each county As the county gets larger, voting patterns may diverge from what would be assumed given the number of registered voters –Note from the graph: as we move away from the origin, the difference between registered Reform voters and Reform votes cast increases –We’ll hypothesize that this will have an affect on heteroskedasticity

Econ 140 Lecture 195 Notation Heteroskedasticity is observed as cross-section variability in the data –data across units at point in time In our notation, heteroskedasticity is: E(e i 2 )   2 We can also write: E(e i 2 ) =  i 2 –This means that we expect variable variance: the variance changes with each unit of observation

Econ 140 Lecture 196 Consequences When heteroskedasticity is present: 1) OLS estimator is still linear 2) OLS estimator is still unbiased 3) OLS estimator is not efficient - the minimum variance property no longer holds 4) Estimates of the variances are biased 5) is not an unbiased estimator of  YX 2 6) We can’t trust the confidence intervals or hypothesis tests (t-tests & F-tests): we may draw the wrong conclusions

Econ 140 Lecture 197 Consequences (2) When BLUE holds and there is homoskedasticity, the first- order condition gives: With heteroskedasticity, we have: If we substitute the equation for c i to both equations, we find: where

Econ 140 Lecture 198 Cases With homoskedasticity: around each point, the variance around the regression line is constant With heteroskedasticity: around each point, the variance around the regression line varies with each value of the independent variable

Econ 140 Lecture 199 Detecting heteroskedasticity There are three ways of detecting heteroskedastiticy: 1) Graphically 2) Park Test 3) Glejser Test

Econ 140 Lecture 1910 Graphical detection We can see that the errors vary with the unit of observation With homoskedasticity we find that for E(e i, X) = 0 : The errors are independent of the independent variables With heteroskedasticity we can get a variety of patterns The errors show a systematic relationship with the independent variables Note: you can use either e or e 2 on the y-axis

Econ 140 Lecture 1911 Graphical detection (3) Using the Palm Beach example (L19.xls), the estimated regression equation was: The errors of this equation,can be graphed against the number of registered Reform party voters, (the independent variable) –Graph shows that the errors increasing with the number of registered reform voters While the graphs may be convincing, we also want to use a test to confirm this. We have two:

Econ 140 Lecture 1912 Park Test Here’s the procedure: 1) Run regression Y i = a + bX i + e i despite the heteroskedasticity problem (it can also be multivariate) 2) Obtain residuals (e i ), square them (e i 2 ), and take their logs (ln e i 2 ) 3) Run a spurious regression: 4) Do a hypothesis test on with H 0 : g 1 = 0 5) Look at the results of the hypothesis test: reject the null: you have heteroskedasticity fail to reject the null: homoskedasticity, or which is a constant

Econ 140 Lecture 1913 Glejser Test When we use the Glejser, we’re looking for a scaling effect The procedure: 1) Run the regression (it can also be multivariate) 2) Collect e i terms 3) Take the absolute value of the errors 4) Regress |e i | against independent variable(s) you can run different kinds of regressions:

Econ 140 Lecture 1914 Glejser Test (2) 4) [continued] If heteroskedasticity takes one of these forms, this will suggest an appropriate transformation of the model The null hypothesis is still H 0 : g 1 = 0 since we’re testing for a relationship between the errors and the independent variables We reach the same conclusions as in the Park Test

Econ 140 Lecture 1915 A cautionary note The errors in the Park Test (v i ) and the Glejser Test (u i ) might also be heteroskedastic. –If this is the case, we cannot trust the hypothesis test H 0 : g 1 = 0 or the t-test If we find heteroskedastic disturbances in the data, what can we do? –Estimate the model Y i = a + bX i + e i using weighted least squares –We’ll look at two examples of weighted least squares: one where we know the true variance, and one where we don’t

Econ 140 Lecture 1916 Correction with known  i 2 Given that the true variance is known and our model is: Y i = a + bX i + e i Consider the following transformation of the model: –In the transformed model, let –So the expected value of the error squared is:

Econ 140 Lecture 1917 Correction with known  i 2 (2) Given that there is heteroskedasticity, E(e i 2 ) =  i 2 –thus: In this simplistic example, we re-weighted model by the constant  i What this example shows: when the variance is known, we must transform our model to obtain a homoskedastic error term.

Econ 140 Lecture 1918 Correction with unknown  i 2 Given an unknown variance, we need to state the ad-hoc but plausible assumptions with our variance  i 2 (how the errors vary with the independent variable) For example: we can assert that E(e i 2 ) =  2 X i Remember: Glejser Test allows us to choose a relationship between the errors and the independent variable

Econ 140 Lecture 1919 Correction with unknown  i 2 (2) In this example you would transform the estimating equation by dividing through by to get: Letting: –The expected value of this error squared is:

Econ 140 Lecture 1920 Correction with unknown  i 2 (3) Recalling an earlier assumption, we find: When we don’t know the true variance we re-scale the estimating equation by the independent variable

Econ 140 Lecture 1921 Returning to Palm Beach On L19.xls we have presidential election data by county in Florida –To get a correct estimating equation, we can run a regression without Palm Beach if we think it’s an outlier. –Then we can see if we can obtain a prediction for the number of reform votes cast in Palm Beach –We can perform a Glejser Test for the regression excluding Palm Beach –We run a regression of the absolute value of the errors (|e i |)against registered Reform voters (X i )

Econ 140 Lecture 1922 Returning to Palm Beach (2) The t-test rejects the null –this indicates the presence of heteroskedasticity We can re-scale the model in different ways or introduce a new independent variable (such as the total number of registered voters by county) Keep transforming the model and running the Glejser Test –When we fail to reject the null: there is no longer heteroskedasticity in the model

Econ 140 Lecture 1923 Summary Even with re-weighted equations, we might still have heteroskedastic errors –so we have to rerun the Glejser Test until we cannot reject the null If we cannot reject the null, we may have to rethink our model transformation –if we suspect a scale effect, we may want to introduce new scaling variables Variables from the re-scaled equation are comparable with the coefficients from the original model