Two-stage least squares 1. D1 S1 2 P Q D1 D2D2 S1 S2 Increase in income Increase in costs 3.

Slides:



Advertisements
Similar presentations
Dummy Variables and Interactions. Dummy Variables What is the the relationship between the % of non-Swiss residents (IV) and discretionary social spending.
Advertisements

Qualitative predictor variables
Mis-measured Treatment Effects Fall 2009 Bill Evans.
1 IV/2SLS models Vietnam era service Defined as Estimated 8.7 million served during era 3.4 million were in SE Asia 2.6 million served.
Sociology 601 Class 24: November 19, 2009 (partial) Review –regression results for spurious & intervening effects –care with sample sizes for comparing.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.16 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: interactive explanatory variables Original citation: Dougherty, C. (2012)
Heteroskedasticity The Problem:
HETEROSCEDASTICITY-CONSISTENT STANDARD ERRORS 1 Heteroscedasticity causes OLS standard errors to be biased is finite samples. However it can be demonstrated.
Lecture 9 Today: Ch. 3: Multiple Regression Analysis Example with two independent variables Frisch-Waugh-Lovell theorem.
EC220 - Introduction to econometrics (chapter 7)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: exercise 3.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Sociology 601 Class 19: November 3, 2008 Review of correlation and standardized coefficients Statistical inference for the slope (9.5) Violations of Model.
Valuation 4: Econometrics Why econometrics? What are the tasks? Specification and estimation Hypotheses testing Example study.
Sociology 601 Class 21: November 10, 2009 Review –formulas for b and se(b) –stata regression commands & output Violations of Model Assumptions, and their.
1 IV/2SLS models. 2 Z i =1 1 =0.57 Z i =0 0 =0.80.
Sociology 601 Class 25: November 24, 2009 Homework 9 Review –dummy variable example from ASR (finish) –regression results for dummy variables Quadratic.
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
1 Multiple Regression EPP 245/298 Statistical Analysis of Laboratory Data.
Introduction to Regression Analysis Straight lines, fitted values, residual values, sums of squares, relation to the analysis of variance.
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
Regression Discontinuity Design 1. Motivating example Many districts have summer school to help kids improve outcomes between grades –Enrichment, or –Assist.
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
Regression Discontinuity Design 1. 2 Z Pr(X i =1 | z) 0 1 Z0Z0 Fuzzy Design Sharp Design.
1 Michigan.do. 2. * construct new variables;. gen mi=state==26;. * michigan dummy;. gen hike=month>=33;. * treatment period dummy;. gen treatment=hike*mi;
Sociology 601 Class 23: November 17, 2009 Homework #8 Review –spurious, intervening, & interactions effects –stata regression commands & output F-tests.
A trial of incentives to attend adult literacy classes Carole Torgerson, Greg Brooks, Jeremy Miles, David Torgerson Classes randomised to incentive or.
Sociology 601 Class 26: December 1, 2009 (partial) Review –curvilinear regression results –cubic polynomial Interaction effects –example: earnings on married.
TESTING A HYPOTHESIS RELATING TO A REGRESSION COEFFICIENT This sequence describes the testing of a hypotheses relating to regression coefficients. It is.
SLOPE DUMMY VARIABLES 1 The scatter diagram shows the data for the 74 schools in Shanghai and the cost functions derived from a regression of COST on N.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: precision of the multiple regression coefficients Original citation:
TOBIT ANALYSIS Sometimes the dependent variable in a regression model is subject to a lower limit or an upper limit, or both. Suppose that in the absence.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: the effects of changing the reference category Original citation: Dougherty,
DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES This sequence explains how to extend the dummy variable technique to handle a qualitative explanatory.
1 INTERACTIVE EXPLANATORY VARIABLES The model shown above is linear in parameters and it may be fitted using straightforward OLS, provided that the regression.
1 TWO SETS OF DUMMY VARIABLES The explanatory variables in a regression model may include multiple sets of dummy variables. This sequence provides an example.
Confidence intervals were treated at length in the Review chapter and their application to regression analysis presents no problems. We will not repeat.
Country Gini IndexCountryGini IndexCountryGini IndexCountryGini Index Albania28.2Georgia40.4Mozambique39.6Turkey38 Algeria35.3Germany28.3Nepal47.2Turkmenistan40.8.
How do Lawyers Set fees?. Learning Objectives 1.Model i.e. “Story” or question 2.Multiple regression review 3.Omitted variables (our first failure of.
Addressing Alternative Explanations: Multiple Regression
What is the MPC?. Learning Objectives 1.Use linear regression to establish the relationship between two variables 2.Show that the line is the line of.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Simple regression model: Y =  1 +  2 X + u 1 We have seen that the regression coefficients b 1 and b 2 are random variables. They provide point estimates.
. reg LGEARN S WEIGHT85 Source | SS df MS Number of obs = F( 2, 537) = Model |
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: exercise 5.2 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Chapter 5: Dummy Variables. DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES 1 We’ll now examine how you can include qualitative explanatory variables.
1 IV/2SLS models. 2 Z i =1 1 =0.57 Z i =0 0 =0.80.
(1)Combine the correlated variables. 1 In this sequence, we look at four possible indirect methods for alleviating a problem of multicollinearity. POSSIBLE.
COST 11 DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES 1 This sequence explains how you can include qualitative explanatory variables in your regression.
Lecture 5. Linear Models for Correlated Data: Inference.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: exercise 6.13 Original citation: Dougherty, C. (2012) EC220 - Introduction.
STAT E100 Section Week 12- Regression. Course Review - Project due Dec 17 th, your TA. - Exam 2 make-up is Dec 5 th, practice tests have been updated.
RAMSEY’S RESET TEST OF FUNCTIONAL MISSPECIFICATION 1 Ramsey’s RESET test of functional misspecification is intended to provide a simple indicator of evidence.
1 CHANGES IN THE UNITS OF MEASUREMENT Suppose that the units of measurement of Y or X are changed. How will this affect the regression results? Intuitively,
SEMILOGARITHMIC MODELS 1 This sequence introduces the semilogarithmic model and shows how it may be applied to an earnings function. The dependent variable.
GRAPHING A RELATIONSHIP IN A MULTIPLE REGRESSION MODEL The output above shows the result of regressing EARNINGS, hourly earnings in dollars, on S, years.
1 BINARY CHOICE MODELS: LINEAR PROBABILITY MODEL Economists are often interested in the factors behind the decision-making of individuals or enterprises,
1 REPARAMETERIZATION OF A MODEL AND t TEST OF A LINEAR RESTRICTION Linear restrictions can also be tested using a t test. This involves the reparameterization.
1 In the Monte Carlo experiment in the previous sequence we used the rate of unemployment, U, as an instrument for w in the price inflation equation. SIMULTANEOUS.
F TESTS RELATING TO GROUPS OF EXPLANATORY VARIABLES 1 We now come to more general F tests of goodness of fit. This is a test of the joint explanatory power.
WHITE TEST FOR HETEROSCEDASTICITY 1 The White test for heteroscedasticity looks for evidence of an association between the variance of the disturbance.
1 COMPARING LINEAR AND LOGARITHMIC SPECIFICATIONS When alternative specifications of a regression model have the same dependent variable, R 2 can be used.
QM222 Class 9 Section A1 Coefficient statistics
QM222 Class 8 Section A1 Using categorical data in regression
The slope, explained variance, residuals
QM222 Class 15 Section D1 Review for test Multicollinearity
Swedish Institute for Social Research (SOFI)
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Presentation transcript:

Two-stage least squares 1

D1 S1 2

P Q D1 D2D2 S1 S2 Increase in income Increase in costs 3

D1 S2 D2 D3 S3 S1 4

Births in 1997 Variable Smoked during pregnancyDid not smoke Birthweight of child (grams) Mom Unmarried51.2%29.7% Mom a teen16.2%12.7% Received inadequate prenatal care 17.2%11.5% Black12.6%17.6% Mom is hispanic4.2%15.5% Mom < HS degree*47.9%30.1% Mom HS degree *24.1%10.6% 5

6 Z i =1 1 =0.57 Z i =0 0 =0.80

7  0 =3186  1 =3278 Sample Sizes

8 Data set used in previous 2 slides bweight.dta Three variables –treat (1=yes, 0=no) –smoke (1=yes, 0=no) –bweight (birth weight in grams)

9. * how many people are treated. tab treat dummy | variable, | =1 if | treated, =0 | otherwise | Freq. Percent Cum | | Total |

10 Two-sample t test with equal variances Group | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval] | | combined | diff | diff = mean(0) - mean(1) t = Ho: diff = 0 degrees of freedom = 865

11. ttest bweight, by(treat) Two-sample t test with equal variances Group | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval] | | combined | diff | diff = mean(0) - mean(1) t = Ho: diff = 0 degrees of freedom = 865

12

13 * 1st stage. reg smoke treat Source | SS df MS Number of obs = F( 1, 865) = Model | Prob > F = Residual | R-squared = Adj R-squared = Total | Root MSE = smoke | Coef. Std. Err. t P>|t| [95% Conf. Interval] treat | _cons |

14. * reduced form. reg bweight treat Source | SS df MS Number of obs = F( 1, 865) = 7.37 Model | Prob > F = Residual | R-squared = Adj R-squared = Total | Root MSE = bweight | Coef. Std. Err. t P>|t| [95% Conf. Interval] treat | _cons |

15. reg bweight smoke (treat) Instrumental variables (2SLS) regression Source | SS df MS Number of obs = F( 1, 865) = 6.65 Model | Prob > F = Residual | R-squared = Adj R-squared =. Total | Root MSE = bweight | Coef. Std. Err. t P>|t| [95% Conf. Interval] smoke | _cons |

Angrist: Vietnam Draft Lottery 16

17 Vietnam era service Defined as Estimated 8.7 million served during era 3.4 million were in SE Asia 2.6 million served in Vietnam 1.6 million saw combat 203K wounded in action, 153K hospitalized 58,000 deaths n%20war%20casualty.htm#t7

1980 Men, Cohorts VariableNon-veteransVeterans In labor force93.2%95.9% Unemployed5.0%4.7% Labor earnings$15,155$15,875 Nonwhite16.8%12.3% < HS degree21.5%8.8% HS degree49.4%67.8% College degree28.9%23.3% Married72.1%75.5% 18

OLS Estimates Impact of Viet Vet Status Independent Variable Labor EarningsUnemployed Age510 (6.5) (0.0001) Non-white-3446 (67)0.029 (0.0014) < high school-9449 (74)0.078 (0.0015) High school-4800 (57) (0.0011) Viet vet523 (53) (0.0010) Mean of outcome $15,3724.9% R2R

20 Vietnam Era Draft 1 st part of war, operated liked WWII and Korean War At age 18 men report to local draft boards Could receive deferment for variety of reasons (kids, attending school) If available for service, pre-induction physical and tests Military needs determined those drafted

21 Everyone drafted went to the Army Local draft boards filled army. Priorities –Delinquents, volunteers, non-vol –For non-vol., determined by age College enrollment powerful way to avoid service –Men w. college degree 1/3 less likely to serve

22 Draft Lottery Proposed by Nixon Passed in Nov 1969, 1 st lottery Dec 1, st lottery for men age on 1/1/70 –Men born Randomly assigned number 1-365, Draft Lottery number (DLN) Military estimates needs, sets threshold T If DLN<=T, drafted

23 If volunteer, could get better assignment Thresholds for service DraftYear of BirthThreshold Draft suspended in 1973

24

25 Model Sample, men from birth cohorts Y i = earnings X i = Vietnam military service (1=yes, 0=no) Z i = draft eligible, that is DLN <=T (1=yes, 0=no)

26 Put this all together Model of interest Y i = β 0 + x i β 1 + ε i First stage x i = θ 0 + z i θ 1 + μ i θ 1 =(dx/dz)

27 1 st stage Because Z is dichotomous (1 and 0), this makes it easy = θ 1 (change in military service from having a low DLN)

28 Reduced form y i = π o + z i π 1 + v i π 1 = dy/dz=(dy/dx)(dx/dz)

29 Intention to treat y i = γ o + z i γ 1 + v i  1 -  0 = π 1 (difference in earnings for those drafted and those not)

30 Divide reduced form by 1 st stage π 1 /θ 1 = (dy/dx)(dx/dz)/(dx/dz) = dy/dx Recall the equation of interest y i = β 0 + x i β 1 + ε i The units of measure are β 1 = dy/dx So the ratio π 1 /θ 1 is an estimate of β 1

31 β 1 = dy/dx β 1 = [  1 -  0 ]/[ ]

32 1 o

33 Graph of  1 -  0

34  1 -  0 in numbers

35 Β iv = (  1 -  0 )/( ) = /0.159 = $ CPI 78 = 65.2 CPI 81 = /90.9 = * = $2199

36 Although DLN is random, what are some ways that a low DLN could DIRECTLY change wages

37

38

39 Angrist and Evans: The impact of children on labor supply

40 Introduction 2 key labor market trends in the past 40 years –Rising labor force participation of women –Falling fertility These two fact are intimately linked, but how? –Are women working more because they are having less children –Are women having less children because they are working more

41

42

43

44 Note that between 1970 and 1990 –Mean children ever born has fallen by 33%, from 1.78 to 1.18 –% worked last year increased by 32%, from 60 to 79% Hundreds have studies have attempted to address these questions Lots of persistent relationships, but what have we measured?

45 Women with children are not randomly assigned Who is most likely to have large families? –Lower educated –Those with lower wages –Certain minority groups –Certain religious groups –Those who want more children

46 Problem is, many of these same groups are also those most likely to be out of the labor force Of the lower labor supply women among women with young children, how much is due to the kids, how much is attributable to some of these other factors?

47 Preferences for sex mix Among married couples who desire 2+ kids –66% wives and 75% of husbands prefer mix Of women with 2 boys and desiring a 3 rd, 85% would prefer a girl Of women with 2 girls and desiring a 3 rd, 84% would prefer a boy

48

49

50

51 “The desire for a son is the father of many daughters”

52 The sex composition is only impacting 6 percent of women So the change in labor supply should be for this group only, So, if we divide by 0.06, we get /0.06 = Having a 3 rd child will reduce labor supply by 13.3 percentage points

53 Exactly identified model With 1 instrument Over-identified Model with 2 instruments

54

55

56

57

58 Exactly Identified Model

59

60

61 Over-identified Model

62 Test the coefficients on twoboys and twogirls are the same test the coefficient on twobots and twogirls are both equal to zero

63