Cross-sectional:Observations on individuals, households, enterprises, countries, etc at one moment in time (Chapters 1–10, Models A and B). 1 During this.

Slides:



Advertisements
Similar presentations
1 Although they are biased in finite samples if Part (2) of Assumption C.7 is violated, OLS estimators are consistent if Part (1) is valid. We will demonstrate.
Advertisements

Christopher Dougherty EC220 - Introduction to econometrics (chapter 11) Slideshow: model c assumptions Original citation: Dougherty, C. (2012) EC220 -
EC220 - Introduction to econometrics (chapter 8)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 8) Slideshow: model b: properties of the regression coefficients Original citation:
EC220 - Introduction to econometrics (chapter 3)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: a Monte Carlo experiment Original citation: Dougherty, C. (2012) EC220.
1 THE DISTURBANCE TERM IN LOGARITHMIC MODELS Thus far, nothing has been said about the disturbance term in nonlinear regression models.
1 XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: types of regression model and assumptions for a model a Original citation:
1 THE NORMAL DISTRIBUTION In the analysis so far, we have discussed the mean and the variance of a distribution of a random variable, but we have not said.
1 PROBABILITY DISTRIBUTION EXAMPLE: X IS THE SUM OF TWO DICE red This sequence provides an example of a discrete random variable. Suppose that you.
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
MEASUREMENT ERROR 1 In this sequence we will investigate the consequences of measurement errors in the variables in a regression model. To keep the analysis.
1 ASSUMPTIONS FOR MODEL C: REGRESSIONS WITH TIME SERIES DATA Assumptions C.1, C.3, C.4, C.5, and C.8, and the consequences of their violations are the.
EC220 - Introduction to econometrics (chapter 2)
Stationary process NONSTATIONARY PROCESSES 1 In the last sequence, the process shown at the top was shown to be stationary. The expected value and variance.
EC220 - Introduction to econometrics (chapter 9)
EXPECTED VALUE OF A RANDOM VARIABLE 1 The expected value of a random variable, also known as its population mean, is the weighted average of its possible.
TESTING A HYPOTHESIS RELATING TO THE POPULATION MEAN 1 This sequence describes the testing of a hypothesis at the 5% and 1% significance levels. It also.
TESTING A HYPOTHESIS RELATING TO A REGRESSION COEFFICIENT This sequence describes the testing of a hypotheses relating to regression coefficients. It is.
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 THE CENTRAL LIMIT THEOREM If a random variable X has a normal distribution, its sample mean X will also have a normal distribution. This fact is useful.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: maximum likelihood estimation of regression coefficients Original citation:
EC220 - Introduction to econometrics (chapter 12)
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: the normal distribution Original citation: Dougherty, C. (2012)
1 In a second variation, we shall consider the model shown above. x is the rate of growth of productivity, assumed to be exogenous. w is now hypothesized.
1 This sequence shows why OLS is likely to yield inconsistent estimates in models composed of two or more simultaneous relationships. SIMULTANEOUS EQUATIONS.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
EC220 - Introduction to econometrics (review chapter)
1 UNBIASEDNESS AND EFFICIENCY Much of the analysis in this course will be concerned with three properties of estimators: unbiasedness, efficiency, and.
FIXED EFFECTS REGRESSIONS: WITHIN-GROUPS METHOD The two main approaches to the fitting of models using panel data are known, for reasons that will be explained.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: sampling and estimators Original citation: Dougherty, C. (2012)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 12) Slideshow: autocorrelation, partial adjustment, and adaptive expectations Original.
THE DUMMY VARIABLE TRAP 1 Suppose that you have a regression model with Y depending on a set of ordinary variables X 2,..., X k and a qualitative variable.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 8) Slideshow: measurement error Original citation: Dougherty, C. (2012) EC220 - Introduction.
THE FIXED AND RANDOM COMPONENTS OF A RANDOM VARIABLE 1 In this short sequence we shall decompose a random variable X into its fixed and random components.
CONSEQUENCES OF AUTOCORRELATION
ALTERNATIVE EXPRESSION FOR POPULATION VARIANCE 1 This sequence derives an alternative expression for the population variance of a random variable. It provides.
CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE
1 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN The diagram summarizes the procedure for performing a 5% significance test on the slope coefficient.
ASYMPTOTIC AND FINITE-SAMPLE DISTRIBUTIONS OF THE IV ESTIMATOR
EC220 - Introduction to econometrics (chapter 8)
MULTIPLE RESTRICTIONS AND ZERO RESTRICTIONS
Christopher Dougherty EC220 - Introduction to econometrics (chapter 12) Slideshow: footnote: the Cochrane-Orcutt iterative process Original citation: Dougherty,
Simple regression model: Y =  1 +  2 X + u 1 We have seen that the regression coefficients b 1 and b 2 are random variables. They provide point estimates.
A.1The model is linear in parameters and correctly specified. PROPERTIES OF THE MULTIPLE REGRESSION COEFFICIENTS 1 Moving from the simple to the multiple.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 9) Slideshow: instrumental variable estimation: variation Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: multiple restrictions and zero restrictions Original citation: Dougherty,
AUTOCORRELATION 1 Assumption C.5 states that the values of the disturbance term in the observations in the sample are generated independently of each other.
1 We will now look at the properties of the OLS regression estimators with the assumptions of Model B. We will do this within the context of the simple.
1 COVARIANCE, COVARIANCE AND VARIANCE RULES, AND CORRELATION Covariance The covariance of two random variables X and Y, often written  XY, is defined.
1 Y SIMPLE REGRESSION MODEL Suppose that a variable Y is a linear function of another variable X, with unknown parameters  1 and  2 that we wish to estimate.
1 We will continue with a variation on the basic model. We will now hypothesize that p is a function of m, the rate of growth of the money supply, as well.
Definition of, the expected value of a function of X : 1 EXPECTED VALUE OF A FUNCTION OF A RANDOM VARIABLE To find the expected value of a function of.
4 In our case, the starting point should be the model with all the lagged variables. DYNAMIC MODEL SPECIFICATION General model with lagged variables Static.
HETEROSCEDASTICITY 1 This sequence relates to Assumption A.4 of the regression model assumptions and introduces the topic of heteroscedasticity. This relates.
INSTRUMENTAL VARIABLES 1 Suppose that you have a model in which Y is determined by X but you have reason to believe that Assumption B.7 is invalid and.
1 INSTRUMENTAL VARIABLE ESTIMATION OF SIMULTANEOUS EQUATIONS In the previous sequence it was asserted that the reduced form equations have two important.
1 HETEROSCEDASTICITY: WEIGHTED AND LOGARITHMIC REGRESSIONS This sequence presents two methods for dealing with the problem of heteroscedasticity. We will.
1 ESTIMATORS OF VARIANCE, COVARIANCE, AND CORRELATION We have seen that the variance of a random variable X is given by the expression above. Variance.
1 We will illustrate the heteroscedasticity theory with a Monte Carlo simulation. HETEROSCEDASTICITY: MONTE CARLO ILLUSTRATION 1 standard deviation of.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: simple regression model Original citation: Dougherty, C. (2012) EC220.
FOOTNOTE: THE COCHRANE–ORCUTT ITERATIVE PROCESS 1 We saw in the previous sequence that AR(1) autocorrelation could be eliminated by a simple manipulation.
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Presentation transcript:

Cross-sectional:Observations on individuals, households, enterprises, countries, etc at one moment in time (Chapters 1–10, Models A and B). 1 During this course we will work with the three types of data described above. TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A TYPES OF DATA Time series:Observations on income, consumption, interest rates, etc over a number of time periods (years, quarters, months, …) (Chapters 11–13, Model C). Panel data:Observations on the same cross-section of individuals, households, etc over a number of time periods (Chapter 14, Model B).

Different regression models are appropriate for different types of data. We will consider three types of regression model, as shown above. 2 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A TYPES OF MODEL Model A:Cross-sectional data with nonstochastic regressors. Their values in the observations in a sample do not have stochastic (random) components. Model C:Time series data. The values of the regressors may exhibit persistence over time. Regressions with time series data potentially involve complex technical issues that are best avoided initially. Model B:Cross-sectional data with stochastic regressors. The values of the regressors are drawn randomly and independently from defined populations.

We will start with Model A. We will do this purely for analytical convenience. It enables us to conduct the discussion of regression analysis within the relatively straightforward framework of what is known as the Classical Linear Regression Model. 3 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A TYPES OF MODEL Model A:Cross-sectional data with nonstochastic regressors. Their values in the observations in a sample do not have stochastic (random) components. Model C:Time series data. The values of the regressors may exhibit persistence over time. Regressions with time series data potentially involve complex technical issues that are best avoided initially. Model B:Cross-sectional data with stochastic regressors. The values of the regressors are drawn randomly and independently from defined populations.

We will replace it in Chapter 8 by the weaker and more realistic assumption, appropriate for regressions with cross-sectional data, that the observations on the regressors are randomly drawn from defined populations. 4 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A TYPES OF MODEL Model A:Cross-sectional data with nonstochastic regressors. Their values in the observations in a sample do not have stochastic (random) components. Model C:Time series data. The values of the regressors may exhibit persistence over time. Regressions with time series data potentially involve complex technical issues that are best avoided initially. Model B:Cross-sectional data with stochastic regressors. The values of the regressors are drawn randomly and independently from defined populations.

A.1The model is linear in parameters and correctly specified. ‘Linear in parameters’ means that each term on the right side includes a  as a simple factor and there is no built-in relationship among the  s. We will defer a discussion of issues relating to linearity and nonlinearity to Chapter 4. 5 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A ASSUMPTIONS FOR MODEL A For example: Examples of models that are not linear in parameters:

There must be some variation in the regressor in the sample. Otherwise it cannot account for any of the variation in Y. 6 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A ASSUMPTIONS FOR MODEL A A.2There is some variation in the regressor in the sample.

If we tried to regress Y on X, when X is constant, we would find that we would not be able to compute the regression coefficients. Both the numerator and the denominator of the expression for b 2 would be equal to zero. We would not be able to obtain b 1 either. 7 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A If for all i, A.2There is some variation in the regressor in the sample. ASSUMPTIONS FOR MODEL A

8 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A We assume that the expected value of the disturbance term in any observation should be zero. Sometimes the disturbance term will be positive, sometimes negative, but it should not have a systematic tendency in either direction. A.3The disturbance term has zero expectation ASSUMPTIONS FOR MODEL A for all i

9 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A Actually, if an intercept is included in the regression equation, it is usually reasonable to assume that this condition is satisfied automatically. The role of the intercept is to pick up any systematic but constant tendency in Y not accounted for by the regressor(s). ASSUMPTIONS FOR MODEL A A.3The disturbance term has zero expectation for all i

10 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A Suppose that the disturbance term had a nonzero population mean. ASSUMPTIONS FOR MODEL A A.3The disturbance term has zero expectation for all i Suppose

11 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A Define a new random variable v i = u i –  u. ASSUMPTIONS FOR MODEL A A.3The disturbance term has zero expectation for all i Define Suppose

12 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A Then we can rewrite the model as shown. v i becomes the new disturbance term and the intercept has absorbed the constant  u. ASSUMPTIONS FOR MODEL A A.3The disturbance term has zero expectation for all i Define Suppose where

13 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A The disturbance term in the revised model now satisfies Assumption A.3. ASSUMPTIONS FOR MODEL A A.3The disturbance term has zero expectation for all i Define Suppose where Then

The price that we pay is that the interpretation of the intercept has changed. It has absorbed the nonzero component of the disturbance term in addition to whatever had previously been responsible for it. 14 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A ASSUMPTIONS FOR MODEL A A.3The disturbance term has zero expectation for all i Define Suppose where Then

15 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A This is usually acceptable because the role of the constant is usually to pick up any systematic tendency in Y not accounted for by the regressor(s). for all i Define Suppose ASSUMPTIONS FOR MODEL A A.3The disturbance term has zero expectation where Then

We assume that the disturbance term is homoscedastic, meaning that its value in each observation is drawn from a distribution with constant population variance. 16 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A for all i A.4The disturbance term is homoscedastic ASSUMPTIONS FOR MODEL A

In the language of the section on sampling and estimators in the Review chapter, this is a ‘beforehand’ concept, where we are thinking about the potential behavior of the disturbance term before the sample is actually generated. 17 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A ASSUMPTIONS FOR MODEL A A.4The disturbance term is homoscedastic for all i

Once we have generated the sample, the disturbance term will turn out to be greater in some observations, and smaller in others, but there should not be any reason for it to be more erratic in some observations than in others. 18 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A ASSUMPTIONS FOR MODEL A A.4The disturbance term is homoscedastic for all i

Since E(u i ) = 0, by Assumption A.3, the population variance of u i is equal to E(u i 2 ), so the condition can also be written as shown. 19 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A for all i ASSUMPTIONS FOR MODEL A A.4The disturbance term is homoscedastic for all i

If Assumption A.4 is not satisfied, the OLS regression coefficients will be inefficient, and you should be able to obtain more reliable results by using a modification of the regression technique. This will be discussed in Chapter TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A ASSUMPTIONS FOR MODEL A A.4The disturbance term is homoscedastic for all i

We assume that the disturbance term is not subject to autocorrelation, meaning that there should be no systematic association between its values in any two observations. 21 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A A.5The values of the disturbance term have independent distributions ASSUMPTIONS FOR MODEL A u i is distributed independently of u j for all j ≠ i

For example, just because the disturbance term is large and positive in one observation, there should be no tendency for it to be large and positive in the next (or large and negative, for that matter, or small and positive, or small and negative). 22 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A ASSUMPTIONS FOR MODEL A A.5The values of the disturbance term have independent distributions u i is distributed independently of u j for all j ≠ i

The assumption implies that the population covariance between u i and u j is zero. Note that the population means of u i and u j are both zero, by virtue of Assumption A.3, and that E(u i u j ) can be decomposed as E(u i )E(u j ) if u i and u j are generated independently – see the Review. 23 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A u i is distributed independently of u j for all j ≠ i ASSUMPTIONS FOR MODEL A A.5The values of the disturbance term have independent distributions

If this assumption is not satisfied, OLS will again give inefficient estimates. Chapter 12 discusses the problems that arise and ways of getting around them. Violations of this assumption are in any case rare with cross-sectional data. 24 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A ASSUMPTIONS FOR MODEL A A.5The values of the disturbance term have independent distributions u i is distributed independently of u j for all j ≠ i

We usually assume that the disturbance term has a normal distribution. The justification for the assumption depends on the Central Limit theorem. 25 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A A.6The disturbance term has a normal distribution ASSUMPTIONS FOR MODEL A

26 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A In essence, the CLT states that, if a random variable is the composite result of the effects of a large number of other random variables, it will have an approximately normal distribution even if its components do not, provided that none of them is dominant. ASSUMPTIONS FOR MODEL A A.6The disturbance term has a normal distribution

27 TYPES OF REGRESSION MODEL AND ASSUMPTIONS FOR MODEL A The disturbance term u is composed of a number of factors not appearing explicitly in the regression equation so, even if we know nothing about the distribution of these factors, we are usually entitled to assume that the disturbance term is normally distributed. ASSUMPTIONS FOR MODEL A A.6The disturbance term has a normal distribution

Copyright Christopher Dougherty These slideshows may be downloaded by anyone, anywhere for personal use. Subject to respect for copyright and, where appropriate, attribution, they may be used as a resource for teaching an econometrics course. There is no need to refer to the author. The content of this slideshow comes from Section 2.2 of C. Dougherty, Introduction to Econometrics, fourth edition 2011, Oxford University Press. Additional (free) resources for both students and instructors may be downloaded from the OUP Online Resource Centre Individuals studying econometrics on their own who feel that they might benefit from participation in a formal course should consider the London School of Economics summer school course EC212 Introduction to Econometrics or the University of London International Programmes distance learning course ec2020 Elements of Econometrics