1 XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential.

Slides:



Advertisements
Similar presentations
EXPECTED VALUE RULES 1. This sequence states the rules for manipulating expected values. First, the additive rule. The expected value of the sum of two.
Advertisements

1 MAXIMUM LIKELIHOOD ESTIMATION OF REGRESSION COEFFICIENTS X Y XiXi 11  1  +  2 X i Y =  1  +  2 X We will now apply the maximum likelihood principle.
1 This very short sequence presents an important definition, that of the independence of two random variables. Two random variables X and Y are said to.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: a Monte Carlo experiment Original citation: Dougherty, C. (2012) EC220.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: introduction to maximum likelihood estimation Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 11) Slideshow: adaptive expectations Original citation: Dougherty, C. (2012) EC220.
1 THE DISTURBANCE TERM IN LOGARITHMIC MODELS Thus far, nothing has been said about the disturbance term in nonlinear regression models.
EC220 - Introduction to econometrics (chapter 7)
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: asymptotic properties of estimators: plims and consistency Original.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 13) Slideshow: stationary processes Original citation: Dougherty, C. (2012) EC220 -
1 THE NORMAL DISTRIBUTION In the analysis so far, we have discussed the mean and the variance of a distribution of a random variable, but we have not said.
1 PROBABILITY DISTRIBUTION EXAMPLE: X IS THE SUM OF TWO DICE red This sequence provides an example of a discrete random variable. Suppose that you.
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
MEASUREMENT ERROR 1 In this sequence we will investigate the consequences of measurement errors in the variables in a regression model. To keep the analysis.
ASYMPTOTIC PROPERTIES OF ESTIMATORS: PLIMS AND CONSISTENCY
1 ASSUMPTIONS FOR MODEL C: REGRESSIONS WITH TIME SERIES DATA Assumptions C.1, C.3, C.4, C.5, and C.8, and the consequences of their violations are the.
00  sd  0 –sd  0 –1.96sd  0 +sd 2.5% CONFIDENCE INTERVALS probability density function of X null hypothesis H 0 :  =  0 In the sequence.
EXPECTED VALUE OF A RANDOM VARIABLE 1 The expected value of a random variable, also known as its population mean, is the weighted average of its possible.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: expected value of a function of a random variable Original citation:
TESTING A HYPOTHESIS RELATING TO THE POPULATION MEAN 1 This sequence describes the testing of a hypothesis at the 5% and 1% significance levels. It also.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: confidence intervals Original citation: Dougherty, C. (2012) EC220.
EC220 - Introduction to econometrics (review chapter)
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: continuous random variables Original citation: Dougherty, C. (2012)
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 In the previous sequence, we were performing what are described as two-sided t tests. These are appropriate when we have no information about the alternative.
Cross-sectional:Observations on individuals, households, enterprises, countries, etc at one moment in time (Chapters 1–10, Models A and B). 1 During this.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: maximum likelihood estimation of regression coefficients Original citation:
DERIVING LINEAR REGRESSION COEFFICIENTS
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: the normal distribution Original citation: Dougherty, C. (2012)
1 In a second variation, we shall consider the model shown above. x is the rate of growth of productivity, assumed to be exogenous. w is now hypothesized.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
EC220 - Introduction to econometrics (review chapter)
1 UNBIASEDNESS AND EFFICIENCY Much of the analysis in this course will be concerned with three properties of estimators: unbiasedness, efficiency, and.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: sampling and estimators Original citation: Dougherty, C. (2012)
THE DUMMY VARIABLE TRAP 1 Suppose that you have a regression model with Y depending on a set of ordinary variables X 2,..., X k and a qualitative variable.
1 CONTINUOUS RANDOM VARIABLES A discrete random variable is one that can take only a finite set of values. The sum of the numbers when two dice are thrown.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: conflicts between unbiasedness and minimum variance Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 8) Slideshow: measurement error Original citation: Dougherty, C. (2012) EC220 - Introduction.
THE FIXED AND RANDOM COMPONENTS OF A RANDOM VARIABLE 1 In this short sequence we shall decompose a random variable X into its fixed and random components.
CONSEQUENCES OF AUTOCORRELATION
ALTERNATIVE EXPRESSION FOR POPULATION VARIANCE 1 This sequence derives an alternative expression for the population variance of a random variable. It provides.
CONFLICTS BETWEEN UNBIASEDNESS AND MINIMUM VARIANCE
1 t TEST OF A HYPOTHESIS RELATING TO A POPULATION MEAN The diagram summarizes the procedure for performing a 5% significance test on the slope coefficient.
MULTIPLE RESTRICTIONS AND ZERO RESTRICTIONS
TYPE II ERROR AND THE POWER OF A TEST A Type I error occurs when the null hypothesis is rejected when it is in fact true. A Type II error occurs when the.
A.1The model is linear in parameters and correctly specified. PROPERTIES OF THE MULTIPLE REGRESSION COEFFICIENTS 1 Moving from the simple to the multiple.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 9) Slideshow: instrumental variable estimation: variation Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: multiple restrictions and zero restrictions Original citation: Dougherty,
1 We will now look at the properties of the OLS regression estimators with the assumptions of Model B. We will do this within the context of the simple.
1 COVARIANCE, COVARIANCE AND VARIANCE RULES, AND CORRELATION Covariance The covariance of two random variables X and Y, often written  XY, is defined.
1 Y SIMPLE REGRESSION MODEL Suppose that a variable Y is a linear function of another variable X, with unknown parameters  1 and  2 that we wish to estimate.
1 We will continue with a variation on the basic model. We will now hypothesize that p is a function of m, the rate of growth of the money supply, as well.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: alternative expression for population variance Original citation:
1 ASYMPTOTIC PROPERTIES OF ESTIMATORS: THE USE OF SIMULATION In practice we deal with finite samples, not infinite ones. So why should we be interested.
Definition of, the expected value of a function of X : 1 EXPECTED VALUE OF A FUNCTION OF A RANDOM VARIABLE To find the expected value of a function of.
HETEROSCEDASTICITY 1 This sequence relates to Assumption A.4 of the regression model assumptions and introduces the topic of heteroscedasticity. This relates.
INSTRUMENTAL VARIABLES 1 Suppose that you have a model in which Y is determined by X but you have reason to believe that Assumption B.7 is invalid and.
1 ESTIMATORS OF VARIANCE, COVARIANCE, AND CORRELATION We have seen that the variance of a random variable X is given by the expression above. Variance.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: confidence intervals Original citation: Dougherty, C. (2012) EC220 -
1 We will illustrate the heteroscedasticity theory with a Monte Carlo simulation. HETEROSCEDASTICITY: MONTE CARLO ILLUSTRATION 1 standard deviation of.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: independence of two random variables Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: simple regression model Original citation: Dougherty, C. (2012) EC220.
FOOTNOTE: THE COCHRANE–ORCUTT ITERATIVE PROCESS 1 We saw in the previous sequence that AR(1) autocorrelation could be eliminated by a simple manipulation.
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Presentation transcript:

1 XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn Suppose we have a random variable X and we wish to estimate its unknown population mean  X. Our first step is to take a sample of n observations {X 1, …, X n }. THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE

2 Before we take the sample, while we are still at the planning stage, the X i are random quantities. We know that they will be generated randomly from the distribution for X, but we do not know their values in advance. THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn

3 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE So now we are thinking about random variables on two levels: the random variable X, and the sample observations drawn randomly from its distribution. XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn

4 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density XX X2X2 XX XnXn Actual sample of n observations x 1, x 2,..., x n : realization Once we have taken the sample we will have a set of numbers {x 1, …, x n }. This is called by statisticians a realization. The lower case is to emphasize that these are particular numbers, not variables. x1x1 x2x2 xnxn

5 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE We base our plan on the potential distributions. Having generated a sample of n observations {X 1, …, X n }, we plan to use them with a mathematical formula to estimate the unknown population mean  X. XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn

XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn 6 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE This mathematical formula is known as an estimator. In this context, the standard (but not only) estimator is the sample mean. An estimator is a random variable because it depends on the random quantities {X 1, …, X n }. Estimator:

XX X1X1 XX X Random variable X with unknown population mean  X function of X probability density XX X2X2 XX XnXn Actual sample of n observations x 1, x 2,..., x n : realization x1x1 x2x2 xnxn 7 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE Estimate: The actual number that we obtain, given the realization {x 1, …, x n }, is known as our estimate.

8 probability density function of X XX X XX X probability density function of X We will see why these distinctions are useful and important in a comparison of the distributions of X and X. We will start by showing that X has the same mean as X. THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE

9 We start by replacing X by its definition and then using expected value rule 2 to take 1/n out of the expression as a common factor. XX X1X1 Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn

10 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE Next we use expected value rule 1 to replace the expectation of a sum with a sum of expectations. XX X1X1 Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn

11 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE Now we come to the bit that requires thought. Start with X 1. When we are still at the planning stage, before we draw a particular sample, X 1 is a random variable and we do not know what its value will be. XX X1X1 Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn

12 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE All we know is that it will be generated randomly from the distribution of X. The expected value of X 1, as a beforehand concept, will therefore be  X. The same is true for all the other sample components, thinking about them beforehand. Hence we write this line. XX X1X1 Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn

13 Thus we have shown that the mean of the distribution of X is  X. THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE XX X1X1 Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn

14 We will next demonstrate that the variance of the distribution of X is smaller than that of X, as depicted in the diagram. THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE probability density function of X XX X XX X probability density function of X

XX X1X1 Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn 19 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE variance 15 We start by replacing X by its definition and then using variance rule 2 to take 1/n out of the expression as a common factor.

16 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE Next we use variance rule 1 to replace the variance of a sum with a sum of variances. In principle there are many covariance terms as well, but they are zero if we assume that the sample values are generated independently. XX X1X1 Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn variance

17 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE Now we come to the bit that requires thought. Start with X 1. When we are still at the planning stage, we do not know what the value of X 1 will be. XX X1X1 Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn variance

18 THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE All we know is that it will be generated randomly from the distribution of X. The variance of X 1, as a beforehand concept, will therefore be  X. The same is true for all the other sample components, thinking about them beforehand. Hence we write this line. 2 XX X1X1 Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn variance

19 Thus we have demonstrated that the variance of the sample mean is equal to the variance of X divided by n, a result with which you will be familiar from your statistics course. THE DOUBLE STRUCTURE OF A SAMPLED RANDOM VARIABLE XX X1X1 Sample of n observations X 1, X 2,..., X n : potential distributions XX X2X2 XX XnXn variance

Copyright Christopher Dougherty 2012 These slideshows may be downloaded by anyone, anywhere for personal use. Subject to respect for copyright and, where appropriate, attribution, they may be used as a resource for teaching an econometrics course. There is no need to refer to the author. The content of this slideshow comes from Section R.5 of C. Dougherty, Introduction to Econometrics, fourth edition 2011, Oxford University Press. Additional (free) resources for both students and instructors may be downloaded from the OUP Online Resource Centre Individuals studying econometrics on their own who feel that they might benefit from participation in a formal course should consider the London School of Economics summer school course EC212 Introduction to Econometrics or the University of London International Programmes distance learning course EC2020 Elements of Econometrics