Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.

Slides:



Advertisements
Similar presentations
EC220 - Introduction to econometrics (chapter 14)
Advertisements

Christopher Dougherty EC220 - Introduction to econometrics (chapter 9) Slideshow: two-stage least squares Original citation: Dougherty, C. (2012) EC220.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 12) Slideshow: consequences of autocorrelation Original citation: Dougherty, C. (2012)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 11) Slideshow: model c assumptions Original citation: Dougherty, C. (2012) EC220 -
EC220 - Introduction to econometrics (chapter 8)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 8) Slideshow: model b: properties of the regression coefficients Original citation:
EC220 - Introduction to econometrics (chapter 3)
EC220 - Introduction to econometrics (review chapter)
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: asymptotic properties of estimators: the use of simulation Original.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: a Monte Carlo experiment Original citation: Dougherty, C. (2012) EC220.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: introduction to maximum likelihood estimation Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 11) Slideshow: adaptive expectations Original citation: Dougherty, C. (2012) EC220.
EC220 - Introduction to econometrics (chapter 7)
1 XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: asymptotic properties of estimators: plims and consistency Original.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: testing a hypothesis relating to a regression coefficient Original citation:
EC220 - Introduction to econometrics (chapter 7)
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
MEASUREMENT ERROR 1 In this sequence we will investigate the consequences of measurement errors in the variables in a regression model. To keep the analysis.
1 ASSUMPTIONS FOR MODEL C: REGRESSIONS WITH TIME SERIES DATA Assumptions C.1, C.3, C.4, C.5, and C.8, and the consequences of their violations are the.
EC220 - Introduction to econometrics (chapter 2)
EC220 - Introduction to econometrics (chapter 9)
00  sd  0 –sd  0 –1.96sd  0 +sd 2.5% CONFIDENCE INTERVALS probability density function of X null hypothesis H 0 :  =  0 In the sequence.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: expected value of a function of a random variable Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: confidence intervals Original citation: Dougherty, C. (2012) EC220.
EC220 - Introduction to econometrics (review chapter)
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: continuous random variables Original citation: Dougherty, C. (2012)
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: precision of the multiple regression coefficients Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: semilogarithmic models Original citation: Dougherty, C. (2012) EC220.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: maximum likelihood estimation of regression coefficients Original citation:
EC220 - Introduction to econometrics (chapter 12)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: Chow test Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: the normal distribution Original citation: Dougherty, C. (2012)
1 In a second variation, we shall consider the model shown above. x is the rate of growth of productivity, assumed to be exogenous. w is now hypothesized.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: dummy variable classification with two categories Original citation:
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
EC220 - Introduction to econometrics (review chapter)
1 UNBIASEDNESS AND EFFICIENCY Much of the analysis in this course will be concerned with three properties of estimators: unbiasedness, efficiency, and.
FIXED EFFECTS REGRESSIONS: WITHIN-GROUPS METHOD The two main approaches to the fitting of models using panel data are known, for reasons that will be explained.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: sampling and estimators Original citation: Dougherty, C. (2012)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: the effects of changing the reference category Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 12) Slideshow: autocorrelation, partial adjustment, and adaptive expectations Original.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: conflicts between unbiasedness and minimum variance Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 8) Slideshow: measurement error Original citation: Dougherty, C. (2012) EC220 - Introduction.
THE FIXED AND RANDOM COMPONENTS OF A RANDOM VARIABLE 1 In this short sequence we shall decompose a random variable X into its fixed and random components.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 11) Slideshow: Friedman Original citation: Dougherty, C. (2012) EC220 - Introduction.
CONSEQUENCES OF AUTOCORRELATION
Christopher Dougherty EC220 - Introduction to econometrics (chapter 7) Slideshow: weighted least squares and logarithmic regressions Original citation:
EC220 - Introduction to econometrics (chapter 8)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 12) Slideshow: footnote: the Cochrane-Orcutt iterative process Original citation: Dougherty,
Simple regression model: Y =  1 +  2 X + u 1 We have seen that the regression coefficients b 1 and b 2 are random variables. They provide point estimates.
A.1The model is linear in parameters and correctly specified. PROPERTIES OF THE MULTIPLE REGRESSION COEFFICIENTS 1 Moving from the simple to the multiple.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 9) Slideshow: instrumental variable estimation: variation Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: multiple restrictions and zero restrictions Original citation: Dougherty,
1 We will now look at the properties of the OLS regression estimators with the assumptions of Model B. We will do this within the context of the simple.
1 We will continue with a variation on the basic model. We will now hypothesize that p is a function of m, the rate of growth of the money supply, as well.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: alternative expression for population variance Original citation:
1 ASYMPTOTIC PROPERTIES OF ESTIMATORS: THE USE OF SIMULATION In practice we deal with finite samples, not infinite ones. So why should we be interested.
Definition of, the expected value of a function of X : 1 EXPECTED VALUE OF A FUNCTION OF A RANDOM VARIABLE To find the expected value of a function of.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: exercise 2.11 Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 ESTIMATORS OF VARIANCE, COVARIANCE, AND CORRELATION We have seen that the variance of a random variable X is given by the expression above. Variance.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: confidence intervals Original citation: Dougherty, C. (2012) EC220 -
1 REPARAMETERIZATION OF A MODEL AND t TEST OF A LINEAR RESTRICTION Linear restrictions can also be tested using a t test. This involves the reparameterization.
1 We will illustrate the heteroscedasticity theory with a Monte Carlo simulation. HETEROSCEDASTICITY: MONTE CARLO ILLUSTRATION 1 standard deviation of.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: independence of two random variables Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: simple regression model Original citation: Dougherty, C. (2012) EC220.
FOOTNOTE: THE COCHRANE–ORCUTT ITERATIVE PROCESS 1 We saw in the previous sequence that AR(1) autocorrelation could be eliminated by a simple manipulation.
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Presentation transcript:

Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction to econometrics (chapter 3). [Teaching Resource] © 2012 The Author This version available at: Available in LSE Learning Resources Online: May 2012 This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 License. This license allows the user to remix, tweak, and build upon the work even for commercial purposes, as long as the user credits the author and licenses their new creations under the identical terms

1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence, we discuss the properties of such predictions.

2 PREDICTION Suppose that, given a sample of n observations, we have fitted a pricing model with k – 1 characteristics, as shown.

3 PREDICTION Suppose now that one encounters a new variety of the good with characteristics {X 2 *, X 3 *,..., X k * }. Given the sample regression result, it is natural to predict that the price of the new variety should be given by the third equation.

4 PREDICTION What can one say about the properties of this prediction? First, it is natural to ask whether it is fair, in the sense of not systematically overestimating or underestimating the actual price. Second, we will be concerned about the likely accuracy of the prediction.

5 PREDICTION We will start by supposing that the good has only one relevant characteristic and that we have fitted the simple regression model shown. Hence, given a new variety of the good with characteristic {X * }, the model gives us the predicted price.

6 PREDICTION We will define the prediction error of the model, PE, as the difference between the actual price and the predicted price.

7 PREDICTION We will assume that the model applies to the new good and therefore the actual price is generated as shown, where u* is the value of the disturbance term for the new good.

8 PREDICTION Then the prediction error is as shown.

9 PREDICTION We take expectations.

10 PREDICTION  1 and  2 are assumed to be fixed parameters, so they are not affected by taking expectations. Likewise, X* is assumed to be a fixed quantity and unaffected by taking expectations. However, u*, b 1 and b 2 are random variables.

11 PREDICTION E(u*) = 0 because u* is randomly drawn from the distribution for u, which we have assumed as zero population mean. Under the usual OLS assumptions, b 1 will be an unbiased estimator of  1 and b 2 an unbiased estimator of  2.

12 PREDICTION Hence the expectation of the prediction error is zero. The result generalizes easily to the case where there are multiple characteristics and the new good embodies a new combination of them.

13 PREDICTION The population variance of the prediction error is given by the expression shown. Unsurprisingly, this implies that, the further is the value of from the sample mean, the larger will be the population variance of the prediction error.

14 PREDICTION The population variance of the prediction error is given by the expression shown. Unsurprisingly, this implies that, the further is the value of from the sample mean, the larger will be the population variance of the prediction error.

15 PREDICTION It also implies, again unsurprisingly, that, the larger is the sample, the smaller will be the population variance of the prediction error, with a lower limit of  u 2.

16 PREDICTION Provided that the regression model assumptions are valid, b 1 and b 2 will tend to their true values as the sample becomes large, so the only source of error in the prediction will be u*, and by definition this has population variance  u 2.

17 PREDICTION The standard error of the prediction error is calculated using the square root of the expression for the population variance, replacing the variance of u with the estimate obtained when fitting the model in the sample period.

18 PREDICTION Hence we are able to construct a confidence interval for a prediction. t crit is the critical level of t, given the significance level selected and the number of degrees of freedom, and s.e. is the standard error of the prediction.

19 PREDICTION The confidence interval has been drawn as a function of X *. As we noted from the mathematical expression, it becomes wider, the greater the distance from X * to the sample mean.

20 PREDICTION With multiple explanatory variables, the expression for the prediction variance becomes complex. One point to note is that multicollinearity may not have an adverse effect on prediction precision, even though the estimates of the coefficients have large variances.

21 PREDICTION Suppose X 2 and X 3 are positively correlated, and that  2 and  3 are both positive. Then it can be shown that cov(b 2, b 3 ) < 0. So if b 2 is an overestimate, b 3 is likely to compensate by being an underestimate, and (b 2 X 2 * + b 3 X 3 * ) may be a relatively good estimate of (  2 X 2 * +  3 X 3 * ). Similarly, for other combinations. For simplicity, suppose that there are two explanatory variables, that both have positive true coefficients, and that they are positively correlated, the model being as shown, and that we are predicting the value of Y *, given values X 2 * and X 3 *.

22 PREDICTION Then if the effect of X 2 is overestimated, so that b 2 >  2, the effect of X 3 will almost certainly be underestimated, with b 3 <  3. As a consequence, the effects of the errors may to some extent cancel out, with the result that the linear combination may be close to (  2 X 2 * +  3 X 3 * ). Suppose X 2 and X 3 are positively correlated, and that  2 and  3 are both positive. Then it can be shown that cov(b 2, b 3 ) < 0. So if b 2 is an overestimate, b 3 is likely to compensate by being an underestimate, and (b 2 X 2 * + b 3 X 3 * ) may be a relatively good estimate of (  2 X 2 * +  3 X 3 * ). Similarly, for other combinations.

23 PREDICTION This will be illustrated with a simulation, with the model and data shown. We fit the model and make the prediction Y * = b 1 + b 2 X 2 * + b 3 X 3 *. Simulation

24 PREDICTION Since X 2 and X 3 are virtually identical, this may be approximated as Y * = b 1 + (b 2 + b 3 )X 2 *. Thus the predictive accuracy depends on how close (b 2 + b 3 ) is to (b 2 + b 3 ), that is, to 5. Simulation

25 PREDICTION Simulation The figure shows the distributions of b 2 and b 3 for 10 million samples. Their distributions have relatively wide variances around their true values, as should be expected, given the multicollinearity. The actual standard deviations of their distributions is standard deviations 0.45 standard deviation 0.04

26 PREDICTION The figure also shows the distribution of their sum. As anticipated, it is distributed around 5, but with a much lower standard deviation, 0.04, despite the multicollinearity affecting the point estimates of the individual coefficients. Simulation standard deviations 0.45 standard deviation 0.04

Copyright Christopher Dougherty These slideshows may be downloaded by anyone, anywhere for personal use. Subject to respect for copyright and, where appropriate, attribution, they may be used as a resource for teaching an econometrics course. There is no need to refer to the author. The content of this slideshow comes from Section 3.6 of C. Dougherty, Introduction to Econometrics, fourth edition 2011, Oxford University Press. Additional (free) resources for both students and instructors may be downloaded from the OUP Online Resource Centre Individuals studying econometrics on their own and who feel that they might benefit from participation in a formal course should consider the London School of Economics summer school course EC212 Introduction to Econometrics or the University of London International Programmes distance learning course 20 Elements of Econometrics