Christopher Dougherty EC220 - Introduction to econometrics (revision lectures 2011) Slideshow: dummy variables Original citation: Dougherty, C. (2011)

Slides:



Advertisements
Similar presentations
EC220 - Introduction to econometrics (chapter 11)
Advertisements

Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: exercise r.13 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: exercise r.4 Original citation: Dougherty, C. (2012) EC220 - Introduction.
EC220 - Introduction to econometrics (chapter 3)
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: exercise r.22 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: exercise r.19 Original citation: Dougherty, C. (2012) EC220 - Introduction.
EC220 - Introduction to econometrics (chapter 5)
CHOW TEST AND DUMMY VARIABLE GROUP TEST
EC220 - Introduction to econometrics (chapter 5)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.7 Original citation: Dougherty, C. (2012) EC220 - Introduction.
EC220 - Introduction to econometrics (chapter 4)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: exercise 6.7 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: slope dummy variables Original citation: Dougherty, C. (2012) EC220 -
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: a Monte Carlo experiment Original citation: Dougherty, C. (2012) EC220.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: introduction to maximum likelihood estimation Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 11) Slideshow: adaptive expectations Original citation: Dougherty, C. (2012) EC220.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 13) Slideshow: tests of nonstationarity: introduction Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: testing a hypothesis relating to a regression coefficient Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 7) Slideshow: exercise 7.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: interactive explanatory variables Original citation: Dougherty, C. (2012)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: exercise 3.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES Example: the cost of running a school depends on the number of pupils, but it also depends on whether.
EC220 - Introduction to econometrics (chapter 2)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: exercise 2.22 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: expected value of a function of a random variable Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: variable misspecification iii: consequences for diagnostics Original.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: confidence intervals Original citation: Dougherty, C. (2012) EC220.
EC220 - Introduction to econometrics (chapter 1)
EC220 - Introduction to econometrics (review chapter)
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: continuous random variables Original citation: Dougherty, C. (2012)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
SLOPE DUMMY VARIABLES 1 The scatter diagram shows the data for the 74 schools in Shanghai and the cost functions derived from a regression of COST on N.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: exercise 5.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: semilogarithmic models Original citation: Dougherty, C. (2012) EC220.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: maximum likelihood estimation of regression coefficients Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: Chow test Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: the normal distribution Original citation: Dougherty, C. (2012)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: dummy variable classification with two categories Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: two sets of dummy variables Original citation: Dougherty, C. (2012) EC220.
EC220 - Introduction to econometrics (review chapter)
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: sampling and estimators Original citation: Dougherty, C. (2012)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: the effects of changing the reference category Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: dummy classification with more than two categories Original citation:
DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES This sequence explains how to extend the dummy variable technique to handle a qualitative explanatory.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 12) Slideshow: autocorrelation, partial adjustment, and adaptive expectations Original.
1 INTERACTIVE EXPLANATORY VARIABLES The model shown above is linear in parameters and it may be fitted using straightforward OLS, provided that the regression.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: conflicts between unbiasedness and minimum variance Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 8) Slideshow: measurement error Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 TWO SETS OF DUMMY VARIABLES The explanatory variables in a regression model may include multiple sets of dummy variables. This sequence provides an example.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 11) Slideshow: Friedman Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 7) Slideshow: weighted least squares and logarithmic regressions Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.9 Original citation: Dougherty, C. (2012) EC220 - Introduction.
EC220 - Introduction to econometrics (chapter 8)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 12) Slideshow: footnote: the Cochrane-Orcutt iterative process Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 9) Slideshow: instrumental variable estimation: variation Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (revision lectures 2011) Slideshow: autocorrelation Original citation: Dougherty, C. (2011)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: exercise 2.24 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: multiple restrictions and zero restrictions Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: exercise 5.2 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Chapter 5: Dummy Variables. DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES 1 We’ll now examine how you can include qualitative explanatory variables.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: exercise 4.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
COST 11 DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES 1 This sequence explains how you can include qualitative explanatory variables in your regression.
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: alternative expression for population variance Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: exercise 6.13 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: exercise 2.11 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 2) Slideshow: confidence intervals Original citation: Dougherty, C. (2012) EC220 -
Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: independence of two random variables Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: simple regression model Original citation: Dougherty, C. (2012) EC220.
Introduction to Econometrics, 5th edition
Presentation transcript:

Christopher Dougherty EC220 - Introduction to econometrics (revision lectures 2011) Slideshow: dummy variables Original citation: Dougherty, C. (2011) EC220 - Introduction to econometrics (revision lectures 2011). [Teaching Resource] © 2011 The Author This version available at: Available in LSE Learning Resources Online: May 2012 This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 License. This license allows the user to remix, tweak, and build upon the work even for commercial purposes, as long as the user credits the author and licenses their new creations under the identical terms

DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES Example: the cost of running a school depends on the number of pupils, but it also depends on whether the school is an occupational school. Dummy variables always have two values, 0 or 1. If OCC is equal to 0, the cost function becomes that for regular schools. If OCC is equal to 1, the cost function becomes that for occupational schools. 11  11 1+1+ Combined equationCOST =  1 +  OCC +  2 N + u OCC = 0 Regular schoolCOST =  1 +  2 N + u OCC = 1 Occupational schoolCOST =  1 +  +  2 N + u

COST =  1  +  T TECH +  W WORKER +  V VOC +  2 N + u General SchoolCOST =  1  +  2 N + u (TECH = WORKER = VOC = 0) Technical SchoolCOST = (  1  +  T ) +  2 N + u (TECH = 1; WORKER = VOC = 0) Skilled Workers’ SchoolCOST = (  1  +  W ) +  2 N + u (WORKER = 1; TECH = VOC = 0) Vocational SchoolCOST = (  1  +  V ) +  2 N + u (VOC = 1; TECH = WORKER = 0) DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES Now the qualitative variable has four categories. The standard procedure is to choose one category as the reference category and to define dummy variables for each of the others. Note: you must leave out the reference category, otherwise your model will be perfectly collinear! 16

COST N 1+T1+T 1+W1+W 1+V1+V 11 Workers’ Vocational WW VV TT DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES The diagram illustrates the model graphically. The  coefficients are the extra overhead costs of running technical, skilled workers’, and vocational schools, relative to the overhead cost of general schools. 17 Technical General

COST N 1+T1+T 1+W1+W 1+V1+V 11 Workers’ Vocational WW VV TT DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES We chose general academic schools as the reference (omitted) category and defined dummy variables for the other categories. This means that we can only compare other schools to general schools, and not to each other. 17 Technical General

COST N 1+T1+T 1+W1+W 1+V1+V 11 Workers’ Vocational WW VV TT DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES However, suppose that we were interested in testing whether the overhead costs of skilled workers’ schools were different from those of the other types of school. How could we do this? It is simplest to re-run the regression making skilled workers’ schools the reference category. 17 Technical General

7 TWO SETS OF DUMMY VARIABLES The explanatory variables in a regression model may include multiple sets of dummy variables. Now you need to think about every combination, and the reference category is the one in which all dummy variables are zero. COST =  1  +   OCC +   RES +  2 N + u Regular, nonresidentialCOST =  1  +  2 N + u (OCC = RES = 0) Regular, residentialCOST = (  1  +   ) +  2 N + u (OCC = 0; RES = 1) Occupational, nonresidentialCOST = (  1  +   ) +  2 N + u (OCC = 1; RES = 0) Occupational, residentialCOST = (  1  +  +   ) +  2 N + u (OCC = RES = 1)

7 TWO SETS OF DUMMY VARIABLES In the case of a non-residential occupational school, RES is 0 and OCC is 1, so the overhead cost increases by . If the school is both occupational and residential, it increases by (  +  ). COST =  1  +   OCC +   RES +  2 N + u Regular, nonresidentialCOST =  1  +  2 N + u (OCC = RES = 0) Regular, residentialCOST = (  1  +   ) +  2 N + u (OCC = 0; RES = 1) Occupational, nonresidentialCOST = (  1  +   ) +  2 N + u (OCC = 1; RES = 0) Occupational, residentialCOST = (  1  +  +   ) +  2 N + u (OCC = RES = 1)

COST N  1 +  +  1+1+ 1+1+ 11 Occupational, residential Regular, nonresidential    +  8  Occupational, nonresidential Regular, residential TWO SETS OF DUMMY VARIABLES The diagram illustrates the model graphically. Note that the effects of the different components of the model are assumed to be separate and additive in this specification. In particular, we are assuming that the extra overhead cost of a residential school is the same for regular and occupational schools: there is no interaction effect.

SLOPE DUMMY VARIABLES 2 The specification of the model incorporates the assumption that the marginal cost per student is the same for occupational and regular schools. Hence the cost functions have the same slope: the same coefficient on N. This is a restriction we have placed on the model.

SLOPE DUMMY VARIABLES 3 This is not a realistic assumption. Occupational schools incur expenditure on training materials that is related to the number of students. Also, the staff-student ratio has to be higher in occupational schools.

SLOPE DUMMY VARIABLES 5 Looking at the scatter diagram, you can see that the cost function for the occupational schools should be steeper, and that for the regular schools should be flatter. The two lines should have different slopes.

SLOPE DUMMY VARIABLES We will relax the assumption of the same marginal cost by introducing what is known as a slope dummy variable. This is NOCC, defined as the product of N and OCC. For example, in the case of an occupational school, OCC is equal to 1 and NOCC is equal to N. The equation simplifies as shown. 8 COST =  1  +   OCC +  2 N + NOCC + u Regular schoolCOST =  1  +  2 N + u (OCC = NOCC = 0) Occupational schoolCOST = (  1  +   ) + (  2  + N + u (OCC = 1; NOCC = N)

COST N  1 +  11 Occupational Regular  SLOPE DUMMY VARIABLES The diagram illustrates the model graphically. 10

7 INTERACTING DUMMY VARIABLES If we interact dummy variables, we get new dummy variables, but we must interpret carefully. The reference category is obtained by setting all dummies equal to zero. Then write down the earnings function for each subgroup separately to make the effects of various coefficients clear. LGEARN =  1  +  2 S +   F +  W + FW + u Black maleLGEARN =  1  +  2 S + u (F = W = 0) White maleLGEARN =  1  +  2 S +  W + u (F = 0; W = 1) Black femaleLGEARN =  1  +  2 S +   F + u (F = 1; W = 0) White femaleLGEARN =  1  +  2 S +   F +  W + FW + u (F = W = 1)

Copyright Christopher Dougherty 2000–2006. This slideshow may be freely copied for personal use