Download presentation
Presentation is loading. Please wait.
1
Econ 326 Lecture 19
2
Today Please turn in Problem Set 9
We will discuss the paper you read for today “Traffic Congestion and Infant Health: Evidence from EZ Pass”
3
April Fools! Problem Set 9 and “Traffic Congestion” reading are not due until next Wednesday. Today: Go over some hard parts on Exam 2. Final paper Schedule for the rest of the semester. Chapter 9
4
Exam 2 Not yet graded. Feedback?
5
Final Project Announcement: McMyler Prize
Starting this spring, the Economics department will award the McMyler Prize to the current student who wrote the best Econometrics paper over the past two completed semesters (e.g. Spring 2014, Fall 2015). $500 cash prize If you are graduating this year, you won’t be eligible, unfortunately, since the winner will be chosen next Spring.
6
Final Project By now, you should have your data in Stata format and bring it with you to every lab session. If you feel you are behind, it is even more important to come and work on your data with assistance from myself and the TAs. The next 2 Stata labs will focus on problem-solving, tips for developing your empirical specifications, and tools to create tables displaying your results. The last 2 Stata labs will be devoted to student presentations (5 min. each, all required to attend.)
7
Phases of your final paper
Handout: Format and Content Guidelines I am returning your Proposal with comments, as you can revise and include parts of it in your final paper (in Sections 1, 2, and 4, for example). The main purpose of Phase 2 was for you to learn about the related literature and figure out where your project fits into it. A shortened version of your literature review can be included in Section 2. (Summarize what is known about your topic, and describe the key approach/findings of each paper in 2 sentences, at most 3.) Phase 3 (due April 20) will form the basis of sections III and V. Phase 4: Final Paper due on May 6 (in place of a final exam)
8
More on Specification and Data Issues
Chapter 9 Wooldridge: Introductory Econometrics: A Modern Approach, 5e
9
Multiple Regression Analysis: Specification and Data Issues
Tests for functional form misspecification One can always test whether explanatory should appear as squares or higher order terms by testing whether such terms can be excluded Another option is to use a general specification test such as RESET Regression specification error test (RESET) The idea of RESET is to include squares and possibly higher order fitted values in the regression. Test for the exclusion of these terms. If they cannot be exluded, this is evidence for omitted higher order terms and interactions, i.e. for misspecification of functional form.
10
RESET test 1. Regress Y on the X’s according to your model.
2. Create the fitted values 𝑌 as a new variable (yhat) 3. Create squared and cubed versions of this variable, e.g. yhat2 and yhat3 4. Regress Y on the X’s along with yhat2 and yhat3. Q. How can we test for the joint significance of yhat2 and yhat3? A. Step 4 is your unrestricted model, Step 1 is your restricted model. Do a basic F-test with q=2.
11
Multiple Regression Analysis: Specification and Data Issues
Example: Housing price equation The small p-value means your F-statistic is above the critical value at the 5% level, and you reject the null hypothesis that 𝑝𝑟𝑖𝑐𝑒 2 and 𝑝𝑟𝑖𝑐𝑒 3 can be excluded. Discussion Since the predicted values are linear in the X variables, 𝑝𝑟𝑖𝑐𝑒 2 and 𝑝𝑟𝑖𝑐𝑒 3 are nonlinear functions of the X variables. RESET provides little guidance as to where misspecification comes from, but it‘s a hint that you should try another specification. Add 𝑝𝑟𝑖𝑐𝑒 2 and 𝑝𝑟𝑖𝑐𝑒 3 to get the unrestricted results. Evidence for misspecification
12
Multiple Regression Analysis: Specification and Data Issues
What do you do when you know there’s an important omitted variable that you cannot observe? Example: Natural ability, when you are trying to study the effect of education on earnings. Idea: find a proxy variable for ability which is able to control for ability differences between individuals so that the coefficients of the other variables will not be biased. A possible proxy for ability is the IQ score or some other test score.
13
Multiple Regression Analysis: Specification and Data Issues
Using proxy variables for unobserved explanatory variables Example: Omitted ability in a wage equation General approach to using proxy variables Replace by proxy In general, the estimates for the returns to education and experience will be biased because one has omit the unobservable ability variable. Omitted variable, e.g. ability Regression of the omitted variable on its proxy
14
Multiple Regression Analysis: Specification and Data Issues
Assumptions necessary for the proxy variable method to work The proxy is “ just a proxy“ for the omitted variable, it does not belong into the population regression, i.e. it is uncorrelated with its error The proxy variable is a “ good“ proxy for the omitted variable, i.e. using other variables in addition will not help to predict the omitted variable If the error and the proxy were correlated, the proxy would actually have to be included in the population regression function Otherwise x1 and x2 would have to be included in the regression for the omitted variable
15
Multiple Regression Analysis: Specification and Data Issues
Under these assumptions, the proxy variable method works: Discussion of the proxy assumptions in the wage example Assumption 1: Should be fullfilled as IQ score is not a direct wage determinant; what matters is how able the person proves at work Assumption 2: Most of the variation in ability should be explainable by variation in IQ score, leaving only a small rest to educ and exper In this regression model, the error term is uncorrelated with all explanatory variables. As a consequence, all coefficients will be correctly estimated using OLS. The coefficents for the explanatory variables x1 and x2 will be correctly identified. The coefficient for the proxy va-riable may also be of interest (it is a multiple of the coefficient of the omitted variable).
16
Multiple Regression Analysis: Specification and Data Issues
As expected, the measured return to education decreases if IQ is included as a proxy for unobserved ability. The coefficient for the proxy suggests that ability differences between indivi-duals are important (e.g points IQ score are associated with a wage increase of 5.4 percentage points). Even if IQ score imperfectly soaks up the variation caused by ability, inclu-ding it will at least reduce the bias in the measured return to education. No significant interaction effect bet-ween ability and education.
17
Multiple Regression Analysis: Specification and Data Issues
Using lagged dependent variables as proxy variables In many cases, omitted unobserved factors may be proxied by the value of the dependent variable from an earlier time period Example: City crime rates Including the past crime rate will at least partly control for the many omitted factors that also determine the crime rate in a given year Another way to interpret this equation is that one compares cities which had the same crime rate last year; this avoids comparing cities that differ very much in unobserved crime factors
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.