Spurious regressions with variables possessing deterministic trends Regresson Regress Y t on X t and t SPURIOUS REGRESSIONS 1 To motivate the discussion.

Slides:



Advertisements
Similar presentations
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: a Monte Carlo experiment Original citation: Dougherty, C. (2012) EC220.
Advertisements

1 THE DISTURBANCE TERM IN LOGARITHMIC MODELS Thus far, nothing has been said about the disturbance term in nonlinear regression models.
EC220 - Introduction to econometrics (chapter 7)
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: asymptotic properties of estimators: plims and consistency Original.
Augmented Dickey-Fuller Test Equation Dependent Variable: D(LGDPI) Method: Least Squares Sample (adjusted): Included observations: 44 after adjustments.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 13) Slideshow: tests of nonstationarity: example and further complications Original.
============================================================ Dependent Variable: LGHOUS Method: Least Squares Sample: Included observations:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: testing a hypothesis relating to a regression coefficient Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: interactive explanatory variables Original citation: Dougherty, C. (2012)
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
FITTING MODELS WITH NONSTATIONARY TIME SERIES 1 Detrending Early macroeconomic models tended to produce poor forecasts, despite having excellent sample-period.
MEASUREMENT ERROR 1 In this sequence we will investigate the consequences of measurement errors in the variables in a regression model. To keep the analysis.
ASYMPTOTIC PROPERTIES OF ESTIMATORS: PLIMS AND CONSISTENCY
1 TIME SERIES MODELS: STATIC MODELS AND MODELS WITH LAGS In this sequence we will make an initial exploration of the determinants of aggregate consumer.
1 ASSUMPTIONS FOR MODEL C: REGRESSIONS WITH TIME SERIES DATA Assumptions C.1, C.3, C.4, C.5, and C.8, and the consequences of their violations are the.
EC220 - Introduction to econometrics (chapter 2)
Stationary process NONSTATIONARY PROCESSES 1 In the last sequence, the process shown at the top was shown to be stationary. The expected value and variance.
EC220 - Introduction to econometrics (chapter 9)
1 We will now consider the distributional properties of OLS estimators in models with a lagged dependent variable. We will do so for the simplest such.
TESTING A HYPOTHESIS RELATING TO THE POPULATION MEAN 1 This sequence describes the testing of a hypothesis at the 5% and 1% significance levels. It also.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 13) Slideshow: tests of nonstationarity: untrended data Original citation: Dougherty,
EC220 - Introduction to econometrics (review chapter)
TESTING A HYPOTHESIS RELATING TO A REGRESSION COEFFICIENT This sequence describes the testing of a hypotheses relating to regression coefficients. It is.
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 THE CENTRAL LIMIT THEOREM If a random variable X has a normal distribution, its sample mean X will also have a normal distribution. This fact is useful.
1 In the previous sequence, we were performing what are described as two-sided t tests. These are appropriate when we have no information about the alternative.
EC220 - Introduction to econometrics (chapter 12)
TOBIT ANALYSIS Sometimes the dependent variable in a regression model is subject to a lower limit or an upper limit, or both. Suppose that in the absence.
1 In a second variation, we shall consider the model shown above. x is the rate of growth of productivity, assumed to be exogenous. w is now hypothesized.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
1 UNBIASEDNESS AND EFFICIENCY Much of the analysis in this course will be concerned with three properties of estimators: unbiasedness, efficiency, and.
DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES This sequence explains how to extend the dummy variable technique to handle a qualitative explanatory.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 12) Slideshow: autocorrelation, partial adjustment, and adaptive expectations Original.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: Tobit models Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 INTERACTIVE EXPLANATORY VARIABLES The model shown above is linear in parameters and it may be fitted using straightforward OLS, provided that the regression.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 8) Slideshow: measurement error Original citation: Dougherty, C. (2012) EC220 - Introduction.
CONSEQUENCES OF AUTOCORRELATION
1 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN The diagram summarizes the procedure for performing a 5% significance test on the slope coefficient.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 7) Slideshow: weighted least squares and logarithmic regressions Original citation:
ASYMPTOTIC AND FINITE-SAMPLE DISTRIBUTIONS OF THE IV ESTIMATOR
EC220 - Introduction to econometrics (chapter 8)
MULTIPLE RESTRICTIONS AND ZERO RESTRICTIONS
Christopher Dougherty EC220 - Introduction to econometrics (chapter 12) Slideshow: footnote: the Cochrane-Orcutt iterative process Original citation: Dougherty,
TYPE II ERROR AND THE POWER OF A TEST A Type I error occurs when the null hypothesis is rejected when it is in fact true. A Type II error occurs when the.
Simple regression model: Y =  1 +  2 X + u 1 We have seen that the regression coefficients b 1 and b 2 are random variables. They provide point estimates.
A.1The model is linear in parameters and correctly specified. PROPERTIES OF THE MULTIPLE REGRESSION COEFFICIENTS 1 Moving from the simple to the multiple.
SPURIOUS REGRESSIONS 1 In a famous Monte Carlo experiment, Granger and Newbold fitted the model Y t =  1 +  2 X t + u t where Y t and X t were independently-generated.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: multiple restrictions and zero restrictions Original citation: Dougherty,
PARTIAL ADJUSTMENT 1 The idea behind the partial adjustment model is that, while a dependent variable Y may be related to an explanatory variable X, there.
TESTS OF NONSTATIONARITY: UNTRENDED DATA 1 In the previous slideshow we considered the conditions under which the process shown at the top of the slide.
AUTOCORRELATION 1 Assumption C.5 states that the values of the disturbance term in the observations in the sample are generated independently of each other.
1 We will now look at the properties of the OLS regression estimators with the assumptions of Model B. We will do this within the context of the simple.
1 We will continue with a variation on the basic model. We will now hypothesize that p is a function of m, the rate of growth of the money supply, as well.
1 ASYMPTOTIC PROPERTIES OF ESTIMATORS: THE USE OF SIMULATION In practice we deal with finite samples, not infinite ones. So why should we be interested.
EC208 – Introductory Econometrics. Topic: Spurious/Nonsense Regressions (as part of chapter on Dynamic Models)
4 In our case, the starting point should be the model with all the lagged variables. DYNAMIC MODEL SPECIFICATION General model with lagged variables Static.
INSTRUMENTAL VARIABLES 1 Suppose that you have a model in which Y is determined by X but you have reason to believe that Assumption B.7 is invalid and.
1 INSTRUMENTAL VARIABLE ESTIMATION OF SIMULTANEOUS EQUATIONS In the previous sequence it was asserted that the reduced form equations have two important.
1 HETEROSCEDASTICITY: WEIGHTED AND LOGARITHMIC REGRESSIONS This sequence presents two methods for dealing with the problem of heteroscedasticity. We will.
1 ESTIMATORS OF VARIANCE, COVARIANCE, AND CORRELATION We have seen that the variance of a random variable X is given by the expression above. Variance.
F TESTS RELATING TO GROUPS OF EXPLANATORY VARIABLES 1 We now come to more general F tests of goodness of fit. This is a test of the joint explanatory power.
1 We will illustrate the heteroscedasticity theory with a Monte Carlo simulation. HETEROSCEDASTICITY: MONTE CARLO ILLUSTRATION 1 standard deviation of.
FOOTNOTE: THE COCHRANE–ORCUTT ITERATIVE PROCESS 1 We saw in the previous sequence that AR(1) autocorrelation could be eliminated by a simple manipulation.
VARIABLE MISSPECIFICATION I: OMISSION OF A RELEVANT VARIABLE In this sequence and the next we will investigate the consequences of misspecifying the regression.
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Presentation transcript:

Spurious regressions with variables possessing deterministic trends Regresson Regress Y t on X t and t SPURIOUS REGRESSIONS 1 To motivate the discussion that will consume the rest of this chapter, we now turn to the problem of spurious regressions. There are actually two literatures on spurious regressions.

Spurious regressions with variables possessing deterministic trends Regresson Regress Y t on X t and t SPURIOUS REGRESSIONS 2 The first, which dates back to the beginning of econometrics as a formal discipline, concerns regressions involving variables with deterministic trends. The other involves regressions with random walks.

Spurious regressions with variables possessing deterministic trends Regresson Regress Y t on X t and t SPURIOUS REGRESSIONS 3 Suppose you have two unrelated variables Y t and X t, both generated by deterministic trends as shown above. What happens if Y t is regressed on X t ?

Spurious regressions with variables possessing deterministic trends Regresson Regress Y t on X t and t SPURIOUS REGRESSIONS 4 The answer is obvious. The common dependence on t will cause the variables to be correlated, and hence a regression of one on the other will yield ‘significant’ coefficients, provided that the sample is large enough.

Spurious regressions with variables possessing deterministic trends Regresson Regress Y t on X t and t SPURIOUS REGRESSIONS 5 This problem was recognised very early in the history of econometrics. A popular response was to detrend the variables before performing the regression. Y t and X t would separately be regressed on time and the residuals saved, say as Y t and X t. ~~

Spurious regressions with variables possessing deterministic trends Regresson Regress Y t on X t and t SPURIOUS REGRESSIONS 6 Eventually it was pointed out (Frisch and Waugh, 1933) that one did not actually need to go to the trouble of detrending Y t and X t before performing the regression.

Spurious regressions with variables possessing deterministic trends Regresson Regress Y t on X t and t SPURIOUS REGRESSIONS 7 Exactly the same results would be obtained by including a time trend in a multiple regression of (unadjusted) Y t on (unadjusted) X t. (This was the origin of the Frisch–Waugh– Lovell theorem discussed in Section 3.2. Lovell (1963) generalized the result.)

Spurious regressions with variables possessing deterministic trends Regresson Regress Y t on X t and t SPURIOUS REGRESSIONS 8 The inclusion of a time trend in a regression of Y t on X t makes it relatively difficult to obtain significant results. Since it possesses a trend, the series for X t will be correlated with t, and hence the problem of multicollinearity inevitably arises to some extent.

Spurious regressions with variables possessing deterministic trends Regresson Regress Y t on X t and t SPURIOUS REGRESSIONS 9 However, performing the regression with the detrended versions of Y t and X t does not lead to any better results because they must be exactly the same.

Spurious regressions with variables that are random walks SPURIOUS REGRESSIONS 10 The problem of spurious regressions caused by deterministic trends is easy to understand and easy to treat. Much more dramatic, and ultimately important, were the findings of a celebrated simulation undertaken by Granger and Newbold (1974).

Spurious regressions with variables that are random walks SPURIOUS REGRESSIONS 11 They found that, if they generated two independent random walks and regressed one on the other, the slope coefficient was likely to have a significant t statistic, despite the fact that the series were unrelated.

Spurious regressions with variables that are random walks SPURIOUS REGRESSIONS 12 They generated 100 pairs of random walks, with sample size 50 time periods. In 77, the slope coefficient was significant at the 5 percent level.

Spurious regressions with variables that are random walks SPURIOUS REGRESSIONS 13 This means that there were 77 instances of Type I error in the 100 regressions. With a 5 percent significance test, there ought to have been about 5. The incidence of Type I error when performing a 1 percent significance test was not much lower.

Spurious regressions with variables that are random walks SPURIOUS REGRESSIONS 14 The study made an immediate impact because there was a growing consensus that many economic time series are plausibly characterized as being random walks, possibly with drift.

Spurious regressions with variables that are random walks SPURIOUS REGRESSIONS 15 Drift would of course aggravate the problem of spurious association. The findings of Granger and Newbold are all the more remarkable because their series were pure random walks.

16 Here is a typical example of what they found. The series in the figure are independent random walks. Obviously, a regression of one on the other ought not to yield significant results, except as a matter of Type I error. SPURIOUS REGRESSIONS Independent random walks

============================================================= Dependent Variable: Y Method: Least Squares Sample: Included observations: 100 ============================================================= Variable Coefficient Std. Error t-Statistic Prob. ============================================================= C X ============================================================= R-squared Mean dependent var Adjusted R-squared S.D. dependent var S.E. of regression Akaike info criteri Sum squared resid Schwarz criterion Log likelihood F-statistic Durbin-Watson stat Prob(F-statistic) ============================================================= 17 Here is the output obtained when the upper random walk in the figure is regressed on the lower one. SPURIOUS REGRESSIONS Regression of one random walk on another

============================================================= Dependent Variable: Y Method: Least Squares Sample: Included observations: 100 ============================================================= Variable Coefficient Std. Error t-Statistic Prob. ============================================================= C X ============================================================= R-squared Mean dependent var Adjusted R-squared S.D. dependent var S.E. of regression Akaike info criteri Sum squared resid Schwarz criterion Log likelihood F-statistic Durbin-Watson stat Prob(F-statistic) ============================================================= 18 The true slope coefficient is 0, because Y was generated independently of X. And yet the estimate of the slope coefficient appears to be significantly different from 0 at the 1% level and above. SPURIOUS REGRESSIONS Regression of one random walk on another

19 We will now undertake a proper simulation. Our model will be that Y t is related to X t and we will regress Y t on X t where Y t and X t are generated as independent random walks. SPURIOUS REGRESSIONS Regression of one random walk on another

20 We will examine the distribution of b 2 and see what happens when we test the null hypothesis H 0 :  2 = 0, knowing that it is true because the series have been generated independently. SPURIOUS REGRESSIONS Regression of one random walk on another

21 For comparison, we will also regress Y t on X t where Y t and X t are generated as independent white noise processes. In both cases we generate 10 million samples for four sample sizes: 25, 50, 100, and 200. SPURIOUS REGRESSIONS Regression of one random walk on another

22 The figure shows the distribution of b 2 for the regression where Y t and X t were white noise processes drawn independently from a normal distribution with zero mean and unit variance. SPURIOUS REGRESSIONS Distribution of b 2 Y t, X t both iid N(0,1): illustration of standard theory b 2 is √T consistent

23 According to theory, the distribution of b 2 should have mean zero and standard deviation inversely proportional to √T, where T is the sample size. SPURIOUS REGRESSIONS Y t, X t both iid N(0,1): illustration of standard theory b 2 is √T consistent Distribution of b 2

24 The height of the distribution should therefore be proportional to √T since the total area must remain constant at unity. SPURIOUS REGRESSIONS Y t, X t both iid N(0,1): illustration of standard theory b 2 is √T consistent Distribution of b 2

25 Thus one would expect the distribution for 100 to be twice as tall as that for 25, and that for 200 to be twice at tall as that for 50. This is easy to check visually. The figure confirms the theory. SPURIOUS REGRESSIONS Y t, X t both iid N(0,1): illustration of standard theory b 2 is √T consistent Distribution of b 2

26 Standard theory tells us that the distribution of the t statistic should have mean zero irrespective of the sample size and that it should converge to a normal distribution with zero mean and unit variance as the sample size becomes large. SPURIOUS REGRESSIONS Y t, X t both iid N(0,1): illustration of standard theory t distribution converges to a normal distribution Distribution of t statistic for b 2

27 The figure confirms this. It shows only the distributions for T = 25 and T = 200. Those for T = 50 and T = 100 are not shown because they are virtually coincidental with the other two. SPURIOUS REGRESSIONS Y t, X t both iid N(0,1): illustration of standard theory t distribution converges to a normal distribution Distribution of t statistic for b 2

28 The distribution for T = 200 is the one with the very slightly higher mode and the thinner tails. It has converged on the normal distribution. SPURIOUS REGRESSIONS Y t, X t both iid N(0,1): illustration of standard theory t distribution converges to a normal distribution Distribution of t statistic for b 2

29 The distribution for T = 25 is very similar. The body of the distribution has also almost converged on the normal distribution. It is only in the tails, which are marginally fatter than those for a normal distribution, that the distribution has not fully converged. SPURIOUS REGRESSIONS Y t, X t both iid N(0,1): illustration of standard theory t distribution converges to a normal distribution Distribution of t statistic for b 2

30 We will now present the results of a parallel simulation for the random walk regressions. This figure demonstrates a truly remarkable result. If you regress one random walk on another, the slope coefficient does not tend to zero, no matter how large is the same size. SPURIOUS REGRESSIONS T = 25, 50, 100, 200 Y t, X t independent random walks b 2 is inconsistent. Its distribution quickly converges on a limiting distribution and fails to collapse. Distribution of b 2

31 It tends to a limiting distribution. This is a complete break from what we have seen so far. It is the first time that we have come across an estimator that is inconsistent because its distribution fails to collapse to a spike. SPURIOUS REGRESSIONS Y t, X t independent random walks T = 25, 50, 100, 200 b 2 is inconsistent. Its distribution quickly converges on a limiting distribution and fails to collapse. Distribution of b 2

32 The two requirements of a consistent estimator are: 1.Its distribution should collapse to a spike. 2.The spike should be at the true value of the parameter being estimated. SPURIOUS REGRESSIONS Y t, X t independent random walks T = 25, 50, 100, 200 b 2 is inconsistent. Its distribution quickly converges on a limiting distribution and fails to collapse. Distribution of b 2

33 In previous encounters with inconsistent estimators, for example, OLS estimators in the presence of measurement error or simultaneity, the distributions did collapse to a spike, but the spike was at the wrong place. SPURIOUS REGRESSIONS Y t, X t independent random walks T = 25, 50, 100, 200 b 2 is inconsistent. Its distribution quickly converges on a limiting distribution and fails to collapse. Distribution of b 2

34 In this case, there is no bias, so if the distribution had collapsed, the estimator would have been consistent. But it refuses to collapse. SPURIOUS REGRESSIONS Y t, X t independent random walks T = 25, 50, 100, 200 b 2 is inconsistent. Its distribution quickly converges on a limiting distribution and fails to collapse. Distribution of b 2

35 This f igure shows the distributions of b 2 for T = 25, 50, 100, and 200. They are virtually identical. For a sample size of 25, the distribution has already virtually reached its limiting distribution. To see the process of convergence, one must look at even smaller sample sizes. SPURIOUS REGRESSIONS Y t, X t independent random walks T = 25, 50, 100, 200 b 2 is inconsistent. Its distribution quickly converges on a limiting distribution and fails to collapse. Distribution of b 2

36 In this figure T = 5, 10, 15, and 20. The variance of the distribution falls when one increases the sample size from 5 to 10, and it falls further when one increases the sample size to 15. There is hardly any further change when one increases the sample size to 20 because the distribution is already close to its limit. SPURIOUS REGRESSIONS Y t, X t independent random walks T = 5, 15, 10, 20 b 2 is inconsistent. Its distribution quickly converges on a limiting distribution and fails to collapse. Distribution of b 2

37 This figure gives the distributions of the standard errors, as printed out in the regression results, for T equal to 25, 50, and 100, and 200. SPURIOUS REGRESSIONS Y t, X t independent random walks Distribution of standard error of b 2 True DGP for Y t :

38 These distributions ought to be almost identical, given that the standard error provides an estimate of the standard deviation, and we have just seen that the distributions of b 2 for these sample sizes are almost identical. SPURIOUS REGRESSIONS Y t, X t independent random walks Distribution of standard error of b 2 True DGP for Y t :

39 However, the figure suggests that the standard deviation decreases with the sample size, in the usual way. This is incorrect, and the reason for the aberration is that the model is misspecified. SPURIOUS REGRESSIONS Y t, X t independent random walks Distribution of standard error of b 2 True DGP for Y t :

40 Since Y t is a random walk, the true data generating process (DGP) is Y t = Y t–1 +  Yt, where  Yt is a white noise process. When we regress Y t on X t, we commit two errors. SPURIOUS REGRESSIONS Y t, X t independent random walks Distribution of standard error of b 2 True DGP for Y t :

41 One is that we have introduced an irrelevant explanatory variable X t. We know that, in general, introducing irrelevant variables does not lead to serious harm. SPURIOUS REGRESSIONS Y t, X t independent random walks Distribution of standard error of b 2 True DGP for Y t :

42 The more serious error is that, since the true DGP for Y t is Y t = Y t–1 +  Yt, we have omitted Y t–1. The effects of omitted variable misspecification are complex, but in this case we can come up with an intuitive explanation of what is happening. SPURIOUS REGRESSIONS Y t, X t independent random walks True DGP for Y t : Distribution of standard error of b 2

43 By omitting Y t–1, we have forced it into the disturbance term. Since Y t is a random walk, this implies that u t is also a random walk. SPURIOUS REGRESSIONS Y t, X t independent random walks Distribution of standard error of b 2

44 This invalidates the OLS procedure for calculating standard errors, which assumes that u t is generated randomly from one observation to the next (no autocorrelation). SPURIOUS REGRESSIONS Y t, X t independent random walks Distribution of standard error of b 2

45 It follows that the t statistic, as printed in the regression results, does not conform to its theoretical distribution. The apparent reduction in the reported standard error as T increases causes the distribution of the reported t statistic to widen as T increases. SPURIOUS REGRESSIONS Y t, X t independent random walks Distribution of t statistic for b 2

46 The t distribution ought to converge on the standardized normal distribution, as in white noise regressions. SPURIOUS REGRESSIONS Y t, X t both iid N(0,1): illustration of standard theory t distribution converges to a normal distribution Distribution of t statistic for b 2

47 Instead, it widens, as shown in the present figure. This accounts for the very high incidence of Type I errors found by Granger and Newbold. SPURIOUS REGRESSIONS Y t, X t independent random walks Distribution of t statistic for b 2

48 Before drawing the implications of these findings, we will consider one further example. Suppose that, instead of being random walks, Y t and X t are (independent) AR(1) processes with coefficient 0.95 as shown,  Yt and  Xt being independent white noise processes. SPURIOUS REGRESSIONS Distribution of b 2 Y t, X t independent stationary autoregressive processes

49 What happens if one regresses Y t on X t ? The figure shows the distribution of b 2 for sample sizes 25, 50, 100, and 200. The distribution does contract as the sample size increases, and if we continued to increase the sample size, it will progressively contract to a spike at zero. SPURIOUS REGRESSIONS Distribution of b 2 Y t, X t independent stationary autoregressive processes

50 So, in contrast to the regressions with random walks, b 2 is a consistent estimator. SPURIOUS REGRESSIONS Distribution of b 2 Y t, X t independent stationary autoregressive processes

51 But the rate of reduction of the standard deviation (and hence the increase in height) of the distribution is less than the √T that we would expect from standard theory and that we saw in the figure for the white noise regressions. This is especially noticeable for small sample sizes. SPURIOUS REGRESSIONS Distribution of b 2 Y t, X t independent stationary autoregressive processes

52 Here is the figure for the white noise regressions again. The standard deviation is inversely proportional to √T, and so the height is proportional to √T. You can see that the height for T = 100 is twice that for T = 25, and the height for T = 200 is twice that for T = 50. SPURIOUS REGRESSIONS Distribution of b 2 Y t, X t both iid N(0,1): illustration of standard theory white noise regressions, for comparison

53 The substandard rates of reduction of the standard deviation and increase in height of the distribution is especially noticeable for small sample sizes. SPURIOUS REGRESSIONS Distribution of b 2 Y t, X t independent stationary autoregressive processes

54 As with the random walk case, the standard errors are computed incorrectly and the t statistics are not valid. SPURIOUS REGRESSIONS Distribution of b 2 Y t, X t independent stationary autoregressive processes

55 When we regress Y t on X t instead of Y t–1, we have included the redundant variable X t, which does not matter, and have omitted Y t–1, which does. Again, Y t–1 is forced into u t and so u t is an autocorrelated process with  = SPURIOUS REGRESSIONS Distribution of b 2 Y t, X t independent stationary autoregressive processes

56 The figure shows the t distribution for the different sample sizes. It is nonstandard, although not as distorted as that for the random walks. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of t statistic for b 2

57 Of course, the autocorrelation should be revealed by a formal test. This is the case. For this purpose, it is sufficient to use the d statistic for the Durbin–Watson test, despite its imperfections. The figure shows the distribution of this statistic for the four sample sizes. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

58 Even with T as small as 25, it indicates autocorrelation significant at the 1 percent level in most of the 10 million samples. For larger samples, it indicates significant autocorrelation almost all of the time. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

59 Once aware of the problem, we know how to treat it. We eliminate the autocorrelation by fitting the specification shown. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

60 If we do this, we will find that the coefficients of X t and X t–1 are both zero and that the coefficient of Y t–1 is 0.95, subject of course to sampling error and the finite-sample bias caused by the presence of the lagged dependent variable in the specification. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

61 The fitted model then reveals the truth, that Y t depends only on Y t–1 with coefficient 0.95, and that it does not depend on X t at all. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

62 Could we not treat the random walk case where, under the null hypothesis, the disturbance term itself is a random walk, as an extreme case of autocorrelation with  = 1? SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

63 In some ways, we can. As with the  = 0.95 case, the Durbin–Watson d statistic almost invariably indicates severe positive autocorrelation. The distributions of the d statistic for the various sample sizes are similar to those in the figure, shifted even further to the left. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

64 What happens in this case if we fit the revised specification that eliminates the autocorrelation? SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

65 The results are much the same as for the case  = We will discover that  = 1, that  2 = 0, and the relationship simplifies to the definition of the random walk that is the true relationship for Y t. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

66 This is not quite the full story because, as we will see in the next section, there are further surprises when we come to test the (true) null hypothesis that the coefficient of Y t–1 is equal to 1. However, it is a reasonable first approximation. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

67 The AR(1) regressions demonstrate that the problem of excess Type I errors arising from nonstandard t distributions found by Granger and Newbold did not depend on the nonstationarity of their processes. It occurs with any AR(1) process, being more evident, the more closely the slope coefficient approximates unity. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

68 The lesson to be learnt from the Granger–Newbold simulations is the one that motivated their article and was articulated by them at the time. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

69 This was that there is a substantial risk of apparently obtaining significant results, where no relationship exists, if you ignore evidence of autocorrelation, as was once quite common in applied work. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

70 The fact that Granger and Newbold used nonstationary processes, in the form of random walks, to make their point is not important. They could have made it just as well by using independent stationary AR(1) processes. But the effect is the more dramatic, the greater the value of , and most obvious when one goes to the extreme value of 1, the random walk. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

71 The objective of the Granger–Newbold simulations was to highlight the dangers of ignoring evidence of autocorrelation in time series models. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

72 However, their experiments also contributed to a growing awareness that standard theory did not necessarily apply to the properties of regression estimators, and associated statistical inference, for regressions using nonstationary time series data. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

73 These concerns were reinforced by the fact that the use of nonstationary variables in model-building was very common. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

74 Macroeconomic analysis, in particular, routinely used variables such as aggregate income and aggregate consumption that invariably possessed time trends when measured in levels and were unquestionably nonstationary. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

75 The only issue was whether they were better characterized as random walks with drift or as subject to deterministic trends. This stimulated a literature, now enormous and still growing, concerning the determination of the conditions under which such regressions with nonstationary time series could yield consistent estimates with valid statistical inference. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

76 The first step was to develop a means for identifying whether a time series is, or is not, nonstationary in the first place, and if it is nonstationary, whether or not it possesses a trend, and if it does, whether the trend is deterministic or is the outcome of a random walk with drift. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

77 These issues will be particularly important when we come to cointegration and explore whether there exist genuine relationships among variables characterized as nonstationary processes. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

78 Historically there have been two approaches to testing for nonstationarity, one using graphical techniques, the other using formal statistical tests. The following slideshows will consider each in turn. SPURIOUS REGRESSIONS Y t, X t independent stationary autoregressive processes Distribution of the Durbin–Watson statistic

Copyright Christopher Dougherty These slideshows may be downloaded by anyone, anywhere for personal use. Subject to respect for copyright and, where appropriate, attribution, they may be used as a resource for teaching an econometrics course. There is no need to refer to the author. The content of this slideshow comes from Section 13.2 of C. Dougherty, Introduction to Econometrics, fourth edition 2011, Oxford University Press. Additional (free) resources for both students and instructors may be downloaded from the OUP Online Resource Centre Individuals studying econometrics on their own and who feel that they might benefit from participation in a formal course should consider the London School of Economics summer school course EC212 Introduction to Econometrics or the University of London International Programmes distance learning course 20 Elements of Econometrics