1 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN The diagram summarizes the procedure for performing a 5% significance test on the slope coefficient.

Slides:



Advertisements
Similar presentations
CHOW TEST AND DUMMY VARIABLE GROUP TEST
Advertisements

Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: slope dummy variables Original citation: Dougherty, C. (2012) EC220 -
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: a Monte Carlo experiment Original citation: Dougherty, C. (2012) EC220.
1 THE DISTURBANCE TERM IN LOGARITHMIC MODELS Thus far, nothing has been said about the disturbance term in nonlinear regression models.
EC220 - Introduction to econometrics (chapter 7)
1 XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: testing a hypothesis relating to a regression coefficient Original citation:
1 THE NORMAL DISTRIBUTION In the analysis so far, we have discussed the mean and the variance of a distribution of a random variable, but we have not said.
HETEROSCEDASTICITY-CONSISTENT STANDARD ERRORS 1 Heteroscedasticity causes OLS standard errors to be biased is finite samples. However it can be demonstrated.
1 PROBABILITY DISTRIBUTION EXAMPLE: X IS THE SUM OF TWO DICE red This sequence provides an example of a discrete random variable. Suppose that you.
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
MEASUREMENT ERROR 1 In this sequence we will investigate the consequences of measurement errors in the variables in a regression model. To keep the analysis.
1 ASSUMPTIONS FOR MODEL C: REGRESSIONS WITH TIME SERIES DATA Assumptions C.1, C.3, C.4, C.5, and C.8, and the consequences of their violations are the.
00  sd  0 –sd  0 –1.96sd  0 +sd 2.5% CONFIDENCE INTERVALS probability density function of X null hypothesis H 0 :  =  0 In the sequence.
EXPECTED VALUE OF A RANDOM VARIABLE 1 The expected value of a random variable, also known as its population mean, is the weighted average of its possible.
1 We will now consider the distributional properties of OLS estimators in models with a lagged dependent variable. We will do so for the simplest such.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: testing a hypothesis relating to a regression coefficient (2010/2011.
TESTING A HYPOTHESIS RELATING TO THE POPULATION MEAN 1 This sequence describes the testing of a hypothesis at the 5% and 1% significance levels. It also.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: confidence intervals Original citation: Dougherty, C. (2012) EC220.
EC220 - Introduction to econometrics (review chapter)
TESTING A HYPOTHESIS RELATING TO A REGRESSION COEFFICIENT This sequence describes the testing of a hypotheses relating to regression coefficients. It is.
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
SLOPE DUMMY VARIABLES 1 The scatter diagram shows the data for the 74 schools in Shanghai and the cost functions derived from a regression of COST on N.
1 In the previous sequence, we were performing what are described as two-sided t tests. These are appropriate when we have no information about the alternative.
DERIVING LINEAR REGRESSION COEFFICIENTS
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: Chow test Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
1 UNBIASEDNESS AND EFFICIENCY Much of the analysis in this course will be concerned with three properties of estimators: unbiasedness, efficiency, and.
FIXED EFFECTS REGRESSIONS: WITHIN-GROUPS METHOD The two main approaches to the fitting of models using panel data are known, for reasons that will be explained.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: sampling and estimators Original citation: Dougherty, C. (2012)
DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES This sequence explains how to extend the dummy variable technique to handle a qualitative explanatory.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: conflicts between unbiasedness and minimum variance Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 8) Slideshow: measurement error Original citation: Dougherty, C. (2012) EC220 - Introduction.
THE FIXED AND RANDOM COMPONENTS OF A RANDOM VARIABLE 1 In this short sequence we shall decompose a random variable X into its fixed and random components.
Confidence intervals were treated at length in the Review chapter and their application to regression analysis presents no problems. We will not repeat.
CONSEQUENCES OF AUTOCORRELATION
ALTERNATIVE EXPRESSION FOR POPULATION VARIANCE 1 This sequence derives an alternative expression for the population variance of a random variable. It provides.
CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE
ASYMPTOTIC AND FINITE-SAMPLE DISTRIBUTIONS OF THE IV ESTIMATOR
MULTIPLE RESTRICTIONS AND ZERO RESTRICTIONS
F TEST OF GOODNESS OF FIT FOR THE WHOLE EQUATION 1 This sequence describes two F tests of goodness of fit in a multiple regression model. The first relates.
TYPE II ERROR AND THE POWER OF A TEST A Type I error occurs when the null hypothesis is rejected when it is in fact true. A Type II error occurs when the.
Simple regression model: Y =  1 +  2 X + u 1 We have seen that the regression coefficients b 1 and b 2 are random variables. They provide point estimates.
A.1The model is linear in parameters and correctly specified. PROPERTIES OF THE MULTIPLE REGRESSION COEFFICIENTS 1 Moving from the simple to the multiple.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: multiple restrictions and zero restrictions Original citation: Dougherty,
1 We will continue with a variation on the basic model. We will now hypothesize that p is a function of m, the rate of growth of the money supply, as well.
COST 11 DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES 1 This sequence explains how you can include qualitative explanatory variables in your regression.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: alternative expression for population variance Original citation:
1 11 00 This sequence explains the logic behind a one-sided t test. ONE-SIDED t TESTS  0 +sd  0 –sd  0 –2sd  1 +2sd 2.5% null hypothesis:H 0 :
1 ASYMPTOTIC PROPERTIES OF ESTIMATORS: THE USE OF SIMULATION In practice we deal with finite samples, not infinite ones. So why should we be interested.
Definition of, the expected value of a function of X : 1 EXPECTED VALUE OF A FUNCTION OF A RANDOM VARIABLE To find the expected value of a function of.
4 In our case, the starting point should be the model with all the lagged variables. DYNAMIC MODEL SPECIFICATION General model with lagged variables Static.
INSTRUMENTAL VARIABLES 1 Suppose that you have a model in which Y is determined by X but you have reason to believe that Assumption B.7 is invalid and.
1 INSTRUMENTAL VARIABLE ESTIMATION OF SIMULTANEOUS EQUATIONS In the previous sequence it was asserted that the reduced form equations have two important.
1 HETEROSCEDASTICITY: WEIGHTED AND LOGARITHMIC REGRESSIONS This sequence presents two methods for dealing with the problem of heteroscedasticity. We will.
1 ESTIMATORS OF VARIANCE, COVARIANCE, AND CORRELATION We have seen that the variance of a random variable X is given by the expression above. Variance.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: confidence intervals Original citation: Dougherty, C. (2012) EC220 -
1 REPARAMETERIZATION OF A MODEL AND t TEST OF A LINEAR RESTRICTION Linear restrictions can also be tested using a t test. This involves the reparameterization.
F TESTS RELATING TO GROUPS OF EXPLANATORY VARIABLES 1 We now come to more general F tests of goodness of fit. This is a test of the joint explanatory power.
1 We will illustrate the heteroscedasticity theory with a Monte Carlo simulation. HETEROSCEDASTICITY: MONTE CARLO ILLUSTRATION 1 standard deviation of.
FOOTNOTE: THE COCHRANE–ORCUTT ITERATIVE PROCESS 1 We saw in the previous sequence that AR(1) autocorrelation could be eliminated by a simple manipulation.
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Presentation transcript:

1 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN The diagram summarizes the procedure for performing a 5% significance test on the slope coefficient of a regression under the assumption that we know its standard deviation. discrepancy between hypothetical value and sample estimate, in terms of s.d.: 5% significance test: reject H 0 :  =  0 if z > 1.96 or z < –1.96 s.d. of X known

2 This is a very unrealistic assumption. We usually have to estimate it with the standard error, and we use this in the test statistic instead of the standard deviation. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN discrepancy between hypothetical value and sample estimate, in terms of s.d.: 5% significance test: reject H 0 :  =  0 if z > 1.96 or z < –1.96 discrepancy between hypothetical value and sample estimate, in terms of s.e.: s.d. of X known s.d. of X not known

3 Because we have replaced the standard deviation in its denominator with the standard error, the test statistic has a t distribution instead of a normal distribution. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN discrepancy between hypothetical value and sample estimate, in terms of s.d.: 5% significance test: reject H 0 :  =  0 if z > 1.96 or z < –1.96 discrepancy between hypothetical value and sample estimate, in terms of s.e.: s.d. of X known s.d. of X not known

4 Accordingly, we refer to the test statistic as a t statistic. In other respects the test procedure is much the same. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN discrepancy between hypothetical value and sample estimate, in terms of s.d.: 5% significance test: reject H 0 :  =  0 if z > 1.96 or z < –1.96 discrepancy between hypothetical value and sample estimate, in terms of s.e.: 5% significance test: reject H 0 :  =  0 if t > t crit or t < –t crit s.d. of X known s.d. of X not known

5 We look up the critical value of t and if the t statistic is greater than it, positive or negative, we reject the null hypothesis. If it is not, we do not. discrepancy between hypothetical value and sample estimate, in terms of s.d.: 5% significance test: reject H 0 :  =  0 if z > 1.96 or z < –1.96 discrepancy between hypothetical value and sample estimate, in terms of s.e.: 5% significance test: reject H 0 :  =  0 if t > t crit or t < –t crit t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN s.d. of X known s.d. of X not known

6 Here is a graph of a normal distribution with zero mean and unit variance t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN normal

7 A graph of a t distribution with 10 degrees of freedom (this term will be defined in a moment) has been added. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN normal t, 10 d.f.

8 When the number of degrees of freedom is large, the t distribution looks very much like a normal distribution (and as the number increases, it converges on one). t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN normal t, 10 d.f.

9 Even when the number of degrees of freedom is small, as in this case, the distributions are very similar. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN normal t, 10 d.f.

10 Here is another t distribution, this time with only 5 degrees of freedom. It is still very similar to a normal distribution. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN normal t, 10 d.f. t, 5 d.f.

11 So why do we make such a fuss about referring to the t distribution rather than the normal distribution? Would it really matter if we always used 1.96 for the 5% test and 2.58 for the 1% test? t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN normal t, 10 d.f. t, 5 d.f.

12 The answer is that it does make a difference. Although the distributions are generally quite similar, the t distribution has longer tails than the normal distribution, the difference being the greater, the smaller the number of degrees of freedom. normal t, 10 d.f. t, 5 d.f. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN

13 As a consequence, the probability of obtaining a high test statistic on a pure chance basis is greater with a t distribution than with a normal distribution. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN normal t, 10 d.f. t, 5 d.f.

14 This means that the rejection regions have to start more standard deviations away from zero for a t distribution than for a normal distribution. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN normal t, 10 d.f. t, 5 d.f.

15 The 2.5% tail of a normal distribution starts 1.96 standard deviations from its mean. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN –1.96 normal t, 10 d.f. t, 5 d.f.

16 The 2.5% tail of a t distribution with 10 degrees of freedom starts 2.33 standard deviations from its mean. –2.33 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN normal t, 10 d.f. t, 5 d.f.

17 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN normal t, 10 d.f. t, 5 d.f. That for a t distribution with 5 degrees of freedom starts 2.57 standard deviations from its mean. –2.57

18 For this reason we need to refer to a table of critical values of t when performing significance tests on the coefficients of a regression equation. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN t distribution: critical values of t Degrees of Two-sided test 10% 5% 2% 1% 0.2% 0.1% freedom One-sided test 5% 2.5% 1% 0.5% 0.1% 0.05% ………………… …………………

19 At the top of the table are listed possible significance levels for a test. For the time being we will be performing two-sided tests, so ignore the line for one-sided tests. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN t distribution: critical values of t Degrees of Two-sided test 10% 5% 2% 1% 0.2% 0.1% freedom One-sided test 5% 2.5% 1% 0.5% 0.1% 0.05% ………………… …………………

20 Hence if we are performing a (two-sided) 5% significance test, we should use the column thus indicated in the table. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN t distribution: critical values of t Degrees of Two-sided test 10% 5% 2% 1% 0.2% 0.1% freedom One-sided test 5% 2.5% 1% 0.5% 0.1% 0.05% ………………… …………………

21 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN t distribution: critical values of t Degrees of Two-sided test 10% 5% 2% 1% 0.2% 0.1% freedom One-sided test 5% 2.5% 1% 0.5% 0.1% 0.05% ………………… ………………… The left hand vertical column lists degrees of freedom. When estimating the population mean using the sample mean, the number of degrees of freedom is defined to be the number of observations minus 1. When estimating the population mean using the sample mean, the number of degrees of freedom = n – 1.

22 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN t distribution: critical values of t Degrees of Two-sided test 10% 5% 2% 1% 0.2% 0.1% freedom One-sided test 5% 2.5% 1% 0.5% 0.1% 0.05% ………………… ………………… Thus, if there are 20 observations in the sample, as in the sales tax example we will discuss in a moment, the number of degrees of freedom would be 19 and the critical value of t for a 5% test would be n = 20 d.f. = 19

23 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN t distribution: critical values of t Degrees of Two-sided test 10% 5% 2% 1% 0.2% 0.1% freedom One-sided test 5% 2.5% 1% 0.5% 0.1% 0.05% ………………… ………………… Note that as the number of degrees of freedom becomes large, the critical value converges on 1.96, the critical value for the normal distribution. This is because the t distribution converges on the normal distribution.

24 Hence, referring back to the summary of the test procedure, t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN discrepancy between hypothetical value and sample estimate, in terms of s.d.: 5% significance test: reject H 0 :  =  0 if z > 1.96 or z < –1.96 discrepancy between hypothetical value and sample estimate, in terms of s.e.: s.d. of X known s.d. of X not known 5% significance test: reject H 0 :  =  0 if t > t crit or t < –t crit

25 when n = 20 and so degrees of freedom = 19, we should reject the null hypothesis at the 5% significance level if the absolute value of t is greater than t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN discrepancy between hypothetical value and sample estimate, in terms of s.d.: 5% significance test: reject H 0 :  =  0 if z > 1.96 or z < –1.96 discrepancy between hypothetical value and sample estimate, in terms of s.e.: s.d. of X known s.d. of X not known 5% significance test: reject H 0 :  =  0 if t > or t < –2.093

26 If instead we wished to perform a 1% significance test, we would use the column indicated above. Note that as the number of degrees of freedom becomes large, the critical value converges to 2.58, the critical value for the normal distribution. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN t distribution: critical values of t Degrees of Two-sided test 10% 5% 2% 1% 0.2% 0.1% freedom One-sided test 5% 2.5% 1% 0.5% 0.1% 0.05% ………………… …………………

27 For a sample of 20 observations, the critical value of t at the 1% level is t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN t distribution: critical values of t Degrees of Two-sided test 10% 5% 2% 1% 0.2% 0.1% freedom One-sided test 5% 2.5% 1% 0.5% 0.1% 0.05% ………………… …………………

28 So we should use this figure in the test procedure for a 1% test. t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN discrepancy between hypothetical value and sample estimate, in terms of s.d.: 5% significance test: reject H 0 :  =  0 if z > 1.96 or z < –1.96 discrepancy between hypothetical value and sample estimate, in terms of s.e.: s.d. of X known s.d. of X not known 1% significance test: reject H 0 :  =  0 if t > or t < –2.861

We now consider an example of a t test. A certain city abolishes its local sales tax on consumer expenditure. We wish to determine whether the abolition had a significant effect on sales. 29 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN Example: effect of abolition of sales tax on household expenditure Effect on household expenditure:

30 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN We take as our null hypothesis that there was no effect: H 0 :  = 0. Example: effect of abolition of sales tax on household expenditure Null hypothesis: Effect on household expenditure: Alternative hypothesis:

31 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN A survey of 20 households shows that, in the following month, mean household expenditure increased by $160 and the standard error of the increase was $60. Example: effect of abolition of sales tax on household expenditure Null hypothesis: Effect on household expenditure: Sample size:n = 20 Mean increase in expenditure: $160 Standard error of increase: Alternative hypothesis: $60

32 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN The test statistic is Example: effect of abolition of sales tax on household expenditure Test statistic: Null hypothesis: Effect on household expenditure: Sample size:n = 20 Mean increase in expenditure: $160 Standard error of increase: Alternative hypothesis: $60

33 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN The critical values of t with 19 degrees of freedom are 2.09 at the 5 percent significance level and 2.86 at the 1 percent level Example: effect of abolition of sales tax on household expenditure Test statistic: Null hypothesis: Critical values of t : Effect on household expenditure: Sample size:n = 20 Mean increase in expenditure: $160 Standard error of increase: Alternative hypothesis: $60

34 Example: effect of abolition of sales tax on household expenditure t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN Hence we reject the null hypothesis of no effect at the 5 percent level but not at the 1 percent level. This is a case where we should mention both tests. Conclusion: Test statistic: Null hypothesis: Critical values of t : Reject H 0 at 5% level. Do not reject H 0 at 1% level. Effect on household expenditure: Sample size:n = 20 Mean increase in expenditure: $160 Standard error of increase: Alternative hypothesis: $60

Copyright Christopher Dougherty These slideshows may be downloaded by anyone, anywhere for personal use. Subject to respect for copyright and, where appropriate, attribution, they may be used as a resource for teaching an econometrics course. There is no need to refer to the author. The content of this slideshow comes from Section R.11 of C. Dougherty, Introduction to Econometrics, fourth edition 2011, Oxford University Press. Additional (free) resources for both students and instructors may be downloaded from the OUP Online Resource Centre Individuals studying econometrics on their own who feel that they might benefit from participation in a formal course should consider the London School of Economics summer school course EC212 Introduction to Econometrics or the University of London International Programmes distance learning course EC2020 Elements of Econometrics