Econometrics - Lecture 3 Regression Models: Interpretation and Comparison.

Slides:



Advertisements
Similar presentations
Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Advertisements

Econometric Modeling Through EViews and EXCEL
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Managerial Economics in a Global Economy
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Econ 488 Lecture 5 – Hypothesis Testing Cameron Kaplan.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Instrumental Variables Estimation and Two Stage Least Square
Econ 140 Lecture 151 Multiple Regression Applications Lecture 15.
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
1 Module II Lecture 3: Misspecification: Non-linearities Graduate School Quantitative Research Methods Gwilym Pryce.
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
Chapter 12 Simple Regression
Econ 140 Lecture 131 Multiple Regression Models Lecture 13.
Chapter 4 Multiple Regression.
Regression Analysis Regression used to estimate relationship between dependent variable (Y) and one or more independent variables (X). Our theory states.
Multiple Regression Models
Econ 140 Lecture 181 Multiple Regression Applications III Lecture 18.
Econ 140 Lecture 171 Multiple Regression Applications II &III Lecture 17.
Multiple Regression Applications
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
Economics 310 Lecture 7 Testing Linear Restrictions.
Chapter 11 Multiple Regression.
Further Inference in the Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Topic 3: Regression.
Empirical Estimation Review EconS 451: Lecture # 8 Describe in general terms what we are attempting to solve with empirical estimation. Understand why.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Linear Regression Models Powerful modeling technique Tease out relationships between “independent” variables and 1 “dependent” variable Models not perfect…need.
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Multiple Linear Regression Analysis
Hypothesis Testing in Linear Regression Analysis
Regression Method.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
Chapter 12 Multiple Regression and Model Building.
How do Lawyers Set fees?. Learning Objectives 1.Model i.e. “Story” or question 2.Multiple regression review 3.Omitted variables (our first failure of.
Specification Error I.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Further Inference in the Multiple Regression Model
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Chapter 13 Multiple Regression
2010, ECON Hypothesis Testing 1: Single Coefficient Review of hypothesis testing Testing single coefficient Interval estimation Objectives.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Chap 6 Further Inference in the Multiple Regression Model
11 Chapter 5 The Research Process – Hypothesis Development – (Stage 4 in Research Process) © 2009 John Wiley & Sons Ltd.
MEASURES OF GOODNESS OF FIT The sum of the squares of the actual values of Y (TSS: total sum of squares) could be decomposed into the sum of the squares.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Econometrics - Lecture 2 Introduction to Linear Regression – Part 2.
4-1 MGMG 522 : Session #4 Choosing the Independent Variables and a Functional Form (Ch. 6 & 7)
Lecturer: Ing. Martina Hanová, PhD. Business Modeling.
Econometrics - Lecture 3 Regression Models: Interpretation and Comparison.
Hackl, Advanced Econometrics
Chapter 15 Multiple Regression Model Building
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Further Inference in the Multiple Regression Model
Further Inference in the Multiple Regression Model
Multiple Regression Analysis: Further Issues
Our theory states Y=f(X) Regression is used to test theory.
Tutorial 1: Misspecification
Chapter 7: The Normality Assumption and Inference with OLS
Seminar in Economics Econ. 470
Chapter 9 Dummy Variables Undergraduated Econometrics Page 1
Presentation transcript:

Econometrics - Lecture 3 Regression Models: Interpretation and Comparison

Contents The Linear Model: Interpretation Selection of Regressors Specification of the Functional Form Oct 30, 2015 Hackl, Econometrics, Lecture 3 2

Economic Models Describe economic relationships (not only a set of observations), have an economic interpretation Linear regression model: y i =  1 +  2 x i2 + … +  K x iK +  i = x i ’  + ε i Variables Y, X 2, …, X K : observable Observations: y i, x i2, …, x iK, i = 1, …, N Error term ε i (disturbance term) contains all influences that are not included explicitly in the model; unobservable Assumption (A1), i.e., E{ε i | X} = 0 or E{ε i | x i } = 0, gives E{y i | x i } = x i ‘β the model describes the expected value of y i given x i (conditional expectation) Oct 30, 2015 Hackl, Econometrics, Lecture 3 3

Example: Wage Equation Wage equation (Verbeek’s dataset “wages1”) wage i = β 1 + β 2 male i + β 3 school i + β 4 exper i + ε i Answers questions like:  Expected wage p.h. of a female with 12 years of education and 10 years of experience Wage equation fitted to all 3294 observations wage i = *male i *school i *exper i  Expected wage p.h. of a female with 12 years of education and 10 years of experience: 5.50 USD Oct 30, 2015 Hackl, Econometrics, Lecture 3 4

Regression Coefficients Linear regression model: y i =  1 +  2 x i2 + … +  K x iK +  i = x i ’  + ε i Coefficient  k measures the change of Y if X k changes by one unit for ∆x k = 1 For continuous regressors Marginal effect of changing X k on Y Ceteris paribus condition: measuring the effect of a change of Y due to a change ∆x k = 1 by  k implies  knowledge which other X i, i ǂ k, are in the model  that all other X i, i ǂ k, remain unchanged Oct 30, 2015 Hackl, Econometrics, Lecture 3 5

Example: Coefficients of Wage Equation Wage equation wage i = β 1 + β 2 male i + β 3 school i + β 4 exper i + ε i β 3 measures the impact of one additional year at school upon a person’s wage, keeping gender and years of experience fixed Wage equation fitted to all 3294 observations wage i = *male i *school i *exper i One extra year at school, e.g., at the university, results in an increase of 64 cents; a 4-year study results in an increase of 2.56 USD of the wage p.h. This is true for otherwise (gender, experience) identical people Oct 30, 2015 Hackl, Econometrics, Lecture 3 6

Regression Coefficients, cont’d The marginal effect of a changing regressor may be depending on other variables Examples Wage equation: wage i = β 1 + β 2 male i + β 3 age i + β 4 age i 2 + ε i the impact of changing age depends on age: Wage equation may contain β 3 age i + β 4 age i male i : marginal effect of age depends upon gender Oct 30, 2015 Hackl, Econometrics, Lecture 3 7

Elasticities Elasticity: measures the relative change in the dependent variable Y due to a relative change in X k For a linear regression, the elasticity of Y with respect to X k is For a loglinear model with (log x i )’ = (1, log x i2,…, log x ik ) log y i = (log x i )’ β + ε i elasticities are the coefficients β (see slide 10) Oct 30, 2015 Hackl, Econometrics, Lecture 3 8

Example: Wage Elasticity Wage equation, fitted to all 3294 observations: log(wage i ) = male i log(exper i ) The coefficient of log(exper i ) measures the elasticity of wages with respect to experience: 100% more time of experience results in an increase of wage by 0.19 or a 19% higher wage 10% more time of experience results in a 1.9% higher wage Oct 30, 2015 Hackl, Econometrics, Lecture 3 9

Elasticities, continues slide 8 This follows from and Oct 30, 2015 Hackl, Econometrics, Lecture 3 10

Semi-Elasticities Semi-elasticity: measures the relative change in the dependent variable Y due to an (absolute) one-unit-change in X k Linear regression for log y i = x i ’ β + ε i the elasticity of Y with respect to X k is β k measures the relative change in Y due to a change in X k by one unit β k is called semi-elasticity of Y with respect to X k Oct 30, 2015 Hackl, Econometrics, Lecture 3 11

Example: Wage Differential Wage equation, fitted to all 3294 observations: log(wage i ) = male i log(exper i ) The semi-elasticity of the wages with respect to gender, i.e., the relative wage differential between males and females, is the coefficient of male i : 0.20 or 20% The wage differential between males (male i =1) and females is obtained from wage f = exp{ log(exper i )} and wage m = wage f exp{0.20} = 1.22 wage f ; the wage differential is 0.22 or 22%, i.e., approximately the coefficient ) ____________________ 1) For small x, exp{x} =  k x k /k! ≈ 1+x Oct 30, 2015 Hackl, Econometrics, Lecture 3 12

Contents The Linear Model: Interpretation Selection of Regressors Specification of the Functional Form Oct 30, 2015 Hackl, Econometrics, Lecture 3 13

Selection of Regressors Specification errors: Omission of a relevant variable Inclusion of an irrelevant variable Questions: What are the consequences of a specification error? How to avoid specification errors? How to detect an erroneous specification? Oct 30, 2015 Hackl, Econometrics, Lecture 3 14

Example: Income and Consumption Hackl, Econometrics, Lecture 3 15 PCR: Private Consumption, real, in bn. EUROs PYR: Household's Dispos- able Income, real, in bn. EUROs 1970:1-2003:4 Basis: 1995 Source: AWM-Database Oct 30, 2015

Income and Consumption Hackl, Econometrics, Lecture 3 16 PCR: Private Consumption, real, in bn. EUROs PYR: Household's Dispos- able Income, real, in bn. EUROs 1970:1-2003:4 Basis: 1995 Source: AWM-Database Oct 30, 2015

Income and Consumption: Growth Rates Hackl, Econometrics, Lecture 3 17 PCR_D4: Private Consump- tion, real, yearly growth rate PYR_D4: Household’s Dis- posable Income, real, yearly growth rate 1970:1-2003:4 Basis: 1995 Source: AWM-Database Oct 30, 2015

Consumption Function C: Private Consumption, real, yearly growth rate (PCR_D4) Y: Household’s Disposable Income, real, yearly growth rate (PYR_D4) T: Trend (T i = i/1000) Consumption function with trend T i = i/1000: Hackl, Econometrics, Lecture 3 18Oct 30, 2015

Consumption Function, cont’d Hackl, Econometrics, Lecture 3 19 OLS estimated consumption function: Output from GRETL Dependent variable : PCR_D4 coefficient std. error t-ratio p-value const 0, , ,649 1,76e-014 *** PYR_D4 0, , ,69 4,94e-034 *** T-0, , ,629 0,0004 *** Mean dependent var 0, S.D. dependent var 0, Sum squared resid 0, S.E. of regression 0, R- squared 0, Adjusted R-squared 0, F(2, 129) 188,8830 P-value (F) 4,71e-39 Log-likelihood 455,9302 Akaike criterion -905,8603 Schwarz criterion -897,2119 Hannan-Quinn-902,3460 rho 0, Durbin-Watson 0, Oct 30, 2015

Consequences Consequences of specification errors: Omission of a relevant variable Inclusion of a irrelevant variable Oct 30, 2015 Hackl, Econometrics, Lecture 3 20

Misspecification: Two Models Two models: y i = x i ‘β + z i ’γ + ε i (A) y i = x i ‘β + v i (B) with J-vector z i Oct 30, 2015 Hackl, Econometrics, Lecture 3 21

Misspecification: Omitted Regressor Specified model is (B), but true model is (A) y i = x i ‘β + z i ’γ + ε i (A) y i = x i ‘β + v i (B) OLS estimates b B of β from (B) can be written with y i from (A): If (A) is the true model but (B) is specified, i.e., J relevant regressors z i are omitted, b B is biased by Omitted variable bias No bias if (a) γ = 0 or if (b) variables in x i and z i are orthogonal Oct 30, 2015 Hackl, Econometrics, Lecture 3 22

Misspecification: Irrelevant Regressor Specified model is (A), but true model is (B): y i = x i ‘β + z i ’γ + ε i (A) y i = x i ‘β + v i (B) If (B) is the true model but (A) is specified, i.e., the model contains irrelevant regressors z i The OLS estimates b A are unbiased have higher variances and standard errors than the OLS estimate b B obtained from fitting model (B) Oct 30, 2015 Hackl, Econometrics, Lecture 3 23

Specification Search General-to-specific modeling: 1.List all potential regressors, based on, e.g.,  economic theory  empirical research  availability of data 2.Specify the most general model: include all potential regressors 3.Iteratively, test which variables have to be dropped, re-estimate 4.Stop if no more variable has to be dropped The procedure is known as the LSE (London School of Economics) method Oct 30, 2015 Hackl, Econometrics, Lecture 3 24

Specification Search, cont’d Alternative procedures Specific-to-general modeling: start with a small model and add variables as long as they contribute to explaining Y Stepwise regression Specification search can be subsumed under data mining Oct 30, 2015 Hackl, Econometrics, Lecture 3 25

Practice of Specification Search Applied research Starts with a – in terms of economic theory – plausible specification Tests whether imposed restrictions are correct  Tests for omitted regressors  Tests for autocorrelation of residuals  Tests for heteroskedasticity Tests whether further restrictions need to be imposed  Tests for irrelevant regressors Obstacles for good specification Complexity of economic theory Limited availability of data Oct 30, 2015 Hackl, Econometrics, Lecture 3 26

Regressor Selection Criteria Criteria for adding and deleting regressors t-statistic, F-statistic Adjusted R 2 Information Criteria: penalty for increasing number of regressors Akaike’s Information Criterion Schwarz’s Bayesian Information Criterion model with smaller BIC (or AIC) is preferred The corresponding probabilities for type I and type II errors can hardly be assessed Oct 30, 2015 Hackl, Econometrics, Lecture 3 27

Wages: Which Regressors? Are school and exper relevant regressors in wage i = β 1 + β 2 male i + β 3 school i + β 4 exper i + ε i or shall they be omitted? t-test: p-values are 4.62E-80 (school) and 1.59E-7 (exper) F-test: F = [( )/2]/[( )/(3294-4)] = , with p-value 2.68E-79 adj R 2 : for the wider model, much higher than AIC: the wider model (AIC = ) is preferable; for the smaller model: AIC = BIC: the wider model (BIC = ) is preferable; for the smaller model: BIC = All criteria suggest the wider model Oct 30, 2015 Hackl, Econometrics, Lecture 3 28

Wages, cont’d OLS estimated smaller wage equation (Table 2.1, Verbeek) with AIC = , BIC = Oct 30, 2015 Hackl, Econometrics, Lecture 3 29

Wages, cont’d OLS estimated wider wage equation (Table 2.2, Verbeek) with AIC = , BIC = Oct 30, 2015 Hackl, Econometrics, Lecture 3 30

Oct 30, 2015 Hackl, Econometrics, Lecture 3 31 The AIC Criterion Various versions in literature Verbeek, also Greene: Akaike‘s original formula is with the log-likelihood function GRETL:

Nested Models: Comparison Model (B), y i = x i ‘β + v i, see slide 21, is nested in model y i = x i ‘β + z i ’γ + ε i (A) (A) is extended by J additional regressors z i Do the J added regressors contribute to explaining Y? F-test (t-test when J = 1) for testing H 0 : coefficients of added regressors are zero R B 2 and R A 2 are the R 2 of the models without (B) and with (A) the J additional regressors, respectively Comparison of adjusted R 2 : adj R A 2 > adj R B 2 equivalent to F > 1 Information Criteria: choose the model with the smaller value of the information criterion Oct 30, 2015 Hackl, Econometrics, Lecture 3 32

Comparison of Non-nested Models Non-nested models: y i = x i ’β + ε i (A) y i = z i ’γ + v i (B) at least one component in z i that is not in x i Non-nested or encompassing F-test: compares by F-tests artificially nested models y i = x i ’β + z 2i ’δ B + ε* i with z 2i : regressors from z i not in x i y i = z i ’γ + x 2i ’δ A + v* i with x 2i : regressors from x i not in z i  Test validity of model A by testing H 0 : δ B = 0  Analogously, test validity of model B by testing H 0 : δ A = 0  Possible results: A or B is valid, both models are valid, none is valid Other procedures: J-test, PE-test (see below) Oct 30, 2015 Hackl, Econometrics, Lecture 3 33

Wages: Which Model? Which of the models is adequate? log(wage i ) = male i school i (A) adj R 2 = 0.121, BIC = , log(wage i ) = age i (B) adj R 2 = 0.069, BIC = Artificially nested model log(wage i ) = = male i school i age i Test of model validity  model A: t-test for age, p-value 5.79E-15; model A is not adequate  model B: F-test for male and school: model B is not adequate Oct 30, 2015 Hackl, Econometrics, Lecture 3 34

J-Test: Comparison of Non- nested Models Non-nested models: (A) y i = x i ’β + ε i, (B) y i = z i ’γ + v i with components of z i that are not in x i Combined model y i = (1 - δ) x i ’β + δ z i ’γ + u i with 0 < δ < 1; δ indicates model adequacy Transformed model y i = x i ’β* + δz i ’c + u i = x i ’β* + δŷ iB + u* i with OLS estimate c for γ and predicted values ŷ iB = z i ’c obtained from fitting model B; β* = (1-δ)β J-test for validity of model A by testing H 0 : δ = 0 Less computational effort than the encompassing F-test Oct 30, 2015 Hackl, Econometrics, Lecture 3 35

Wages: Which Model? Which of the models is adequate? log(wage i ) = male i school i (A) adj R 2 = 0.121, BIC = , log(wage i ) = age i (B) adj R 2 = 0.069, BIC = Test the validity of model B by means of the J-test Extend the model B to log(wage i ) = age i ŷ iA with values ŷ iA predicted for log(wage i ) from model A Test of model validity: t-test for coefficient of ŷ iA, t = 15.96, p-value 2.65E-55 Model B is not a valid model Oct 30, 2015 Hackl, Econometrics, Lecture 3 36

Linear vs. Loglinear Model Choice between linear and loglinear functional form y i = x i ’β + ε i (A) log y i = (log x i )’β + v i (B) In terms of economic interpretation: Are effects additive or multiplicative? Log-transformation stabilizes variance, particularly if the dependent variable has a skewed distribution (wages, income, production, firm size, sales,…) Loglinear models are easily interpretable in terms of elasticities Oct 30, 2015 Hackl, Econometrics, Lecture 3 37

PE-Test: Linear vs. Loglinear Model Choice between linear and loglinear functional form Estimate both models y i = x i ’β + ε i (A) log y i = (log x i )’β + v i (B) calculate the fitted values ŷ (from model A) and log ӱ (from B) Test δ LIN = 0 in y i = x i ’β + δ LIN (log ŷ i – log ӱ i ) + u i not rejecting δ LIN = 0 favors the model A Test δ LOG = 0 in log y i = (log x i )’β + δ LOG (ŷ i – exp{log ӱ i }) + u i not rejecting δ LOG = 0 favors the model B Both null hypotheses are rejected: find a more adequate model Oct 30, 2015 Hackl, Econometrics, Lecture 3 38

Wages: Which Model? Test of validity of models by means of the PE-test The fitted models are (with l_x for log(x)) wage i = male i school i (A) l_wage i = male i l_school i (B) x_f: predicted value of x: d_log = log(wage_f) – l_wage_f, d_lin = wage_f – exp(l_wage_f) Test of model validity, model A: wage i = male i school i – d_log i with p-value for d_log; validity in doubt Test of model validity, model B: l_wage i = male i l_school i d_lin i with p-value for d_lin; model B to be preferred Oct 30, 2015 Hackl, Econometrics, Lecture 3 39

The PE-Test Choice between linear and loglinear functional form The auxiliary regressions are estimated for testing purposes If the linear model is not rejected: accept the linear model If the loglinear model is not rejected: accept the loglinear model If both are rejected, neither model is appropriate, a more general model should be considered In case of the Individual Wages example:  Linear model (A): t-statistic is – 4.731, p-value 0.013: the model is rejected  Loglinear model (B): t-statistic is 0.171, p-value : the model is not rejected Oct 30, 2015 Hackl, Econometrics, Lecture 3 40

Contents The Linear Model: Interpretation Selection of Regressors Specification of the Functional Form Oct 30, 2015 Hackl, Econometrics, Lecture 3 41

Non-linear Functional Forms Model specification y i = g(x i, β) + ε i substitution of g(x i, β) for x i ’β: allows for two types on non-linearity g(x i, β) non-linear in regressors (but linear in parameters)  Powers of regressors, e.g., g(x i, β) = β 1 + β 2 age i + β 3 age i 2  Interactions of regressors OLS technique still works; t-test, F-test for specification check g(x i, β) non-linear in regression coefficients, e.g.,  g(x i, β) = β 1 x i1 β2 x i2 β3 logarithmic transformation: log g(x i, β) = log β 1 + β 2 log x i1 + β 3 log x i2  g(x i, β) = β 1 + β 2 x i β3 non-linear least squares estimation, numerical procedures Various specification test procedures, e.g., RESET test, Chow test Oct 30, 2015 Hackl, Econometrics, Lecture 3 42

Individual Wages: Effect of Gender and Education Effect of gender may be depending of education level Separate models for males and females Interaction terms between dummies for education level and male Example: Belgian Household Panel, 1994 (“bwages”, N=1472) Five education levels Model for log(wage) with education dummies Model with interaction terms between education dummies and gender dummy F-statistic for interaction terms: F(5, 1460) = {( )/5}/{( )/( )} = 2.74 with a p-value of Oct 30, 2015 Hackl, Econometrics, Lecture 3 43

Wages: Model with Education Dummies Model with education dummies: Verbeek, Table 3.11 Oct 30, 2015 Hackl, Econometrics, Lecture 3 44

Wages: Model with Gender Interactions Wage equation with interactions educ*male Oct 30, 2015 Hackl, Econometrics, Lecture 3 45

RESET Test Test of the linear model E{y i |x i }= x i ’β against misspecification of the functional form: Null hypothesis: linear model is correct functional form Test of H 0 : RESET test (Regression Equation Specification Error Test) Test idea: powers of various degrees of ŷ i, the fitted values from the linear model, e.g., ŷ i ², ŷ i ³,..., do not improve model fit under H 0 Test procedure: linear model extended by adding ŷ i ², ŷ i ³,... F- (or t-) test to decide whether powers of fitted values like ŷ i ², ŷ i ³,... contribute as additional regressors to explaining Y Maximal power Q of fitted values: typical choice is Q = 2 or Q = 3 Oct 30, 2015 Hackl, Econometrics, Lecture 3 46

RESET Test: The Idea Test procedure: linear model extended by adding ŷ i ², ŷ i ³,... ŷ i ² is a function of squares (and interactions) of the regressor variables; analogously for ŷ i ³,... If the F-test indicates that the additional regressor ŷ i ² contributes to explaining Y: the linear relation is not adequate, another functional form is more appropriate Oct 30, 2015 Hackl, Econometrics, Lecture 3 47

Wages: RESET Test The fitted models are (with l_x for log(x)) wage i = male i school i (A) l_wage i = male i l_school i (B) Test of specification of the functional form with Q = 3 Model A: Test statistic: F(2, 3288) = 10.23, p-value = 3.723e-005 Model B: Test statistic: F(2, 3288) = 4.52, p-value = For both models the adequacy of the functional form is in doubt Oct 30, 2015 Hackl, Econometrics, Lecture 3 48

Structural Break: Chow Test In time-series context, coefficients of a model may change due to a major policy change, e.g., the oil price shock Modeling a process with structural break E{y i |x i }= x i ’β + g i x i ’ γ with dummy variable g i =0 before the break, g i =1 after the break Regressors x i, coefficients β before, β+γ after the break Null hypothesis: no structural break, γ=0 Test procedure: fitting the extended model, F- (or t-) test of γ=0 with S r (S u ): sum of squared residuals of the (un)restricted model Chow test for structural break or structural change Oct 30, 2015 Hackl, Econometrics, Lecture 3 49

Chow Test: The Practice Test procedure is performed in the following steps Fit the restricted model: S r Fit the extended model: S u Calculate F and the p-value from the F-distribution with K and N- 2K d.f. Needs knowledge of break point Oct 30, 2015 Hackl, Econometrics, Lecture 3 50

Your Homework 1.Use the data set “bwages” of Verbeek for the following analyses: a)Estimate the model where the log hourly wages (lnwage) are explained by lnwage, male, and educ; interpret the results. b)Repeat exercise a) using dummy variables for the education levels, e.g., d1 for educ = 1, instead of the variable educ; compare the models from exercises a) and b) by using (i) the non-nested F-test and (ii) the J-test; interpret the results. c)Use the PE-test to decide whether the model in a) (where log hourly wages lnwage are explained) or the same model but with levels wage of hourly wages as explained variable is to be preferred; interpret the result. d)Repeat exercise b) with the interaction male*exper as additional regressor; interpret the result. Oct 30, 2015 Hackl, Econometrics, Lecture 3 51

Your Homework, cont’d 2.OLS is used to estimate β from y i = x i ‘β + ε i, but a relevant regressor z i is neglected: Show that the estimate b is biased, and derive an expression for the bias. 3.The model for a process with structural break is specified as y i = x i ’β + g i x i ’ γ + ε i, or y = Z  ε in matrix form, with dummy variable g i =0 before the break, g i =1 after the break. Write down the design matrix Z of the model. Oct 30, 2015 Hackl, Econometrics, Lecture 3 52