Multiple Regression Analysis: Further Issues

Slides:



Advertisements
Similar presentations
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Advertisements

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Chapter 13 Multiple Regression
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
Lecture 23: Tues., Dec. 2 Today: Thursday:
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Chapter 12 Multiple Regression
Econ 140 Lecture 131 Multiple Regression Models Lecture 13.
Multiple Regression Models
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 2. Inference.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
EC Prof. Buckles1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 2. Inference.
Slides 13c: Causal Models and Regression Analysis
Objectives of Multiple Regression
Correlation and Linear Regression
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
Statistics and Econometrics for Business II Fall 2014 Instructor: Maksym Obrizan Lecture notes III # 2. Advanced topics in OLS regression # 3. Working.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
11 Chapter 5 The Research Process – Hypothesis Development – (Stage 4 in Research Process) © 2009 John Wiley & Sons Ltd.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Multiple Regression Analysis: Inference
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
Chapter 6: Multiple Regression – Additional Topics
Heteroscedasticity Chapter 8
Chapter 15 Multiple Regression Model Building
Chapter 14 Introduction to Multiple Regression
Chapter 4 Basic Estimation Techniques
Econ 326 Lecture 19.
Regression Analysis AGEC 784.
Ch. 2: The Simple Regression Model
Regression Analysis: Statistical Inference
Multiple Regression Analysis: Further Issues
26134 Business Statistics Week 5 Tutorial
Multiple Regression Analysis: Further Issues
Correlation and Simple Linear Regression
Basic Estimation Techniques
The Simple Linear Regression Model: Specification and Estimation
Further Inference in the Multiple Regression Model
Multiple Regression Analysis: OLS Asymptotics
More on Specification and Data Issues
Multiple Regression and Model Building
Chapter 6: Multiple Regression – Additional Topics
Ch. 2: The Simple Regression Model
Further Inference in the Multiple Regression Model
Basic Estimation Techniques
I271B Quantitative Methods
CHAPTER 29: Multiple Regression*
Chapter 6: Multiple Regression – Additional Topics
Prepared by Lee Revere and John Large
Multiple Regression Models
Multiple Regression Analysis: Further Issues
More Multiple Regression
Chapter 6: Multiple Regression – Additional Topics
Tutorial 1: Misspecification
Chapter 7: The Normality Assumption and Inference with OLS
Seminar in Economics Econ. 470
Product moment correlation
Multiple Regression Analysis: Further Issues
The Simple Regression Model
Tutorial 6 SEG rd Oct..
Statistics 350 Lecture 18.
More on Specification and Data Issues
3.2. SIMPLE LINEAR REGRESSION
Presentation transcript:

Multiple Regression Analysis: Further Issues Chapter 6 Wooldridge: Introductory Econometrics: A Modern Approach, 5e

Announcements Feedback on project proposals Via Piazza and your partner’s worksheet SOURCE program applications due tomorrow. Mid-term course evaluations today. Due after Spring break: Next homework, includes paper “Do Lenders Favor Politically Connected Firms?”

Example: Do larger firms spend more on Research & Development? 32 firms observed rdintens is R&D spending as percent of sales. Mean = 3 sales measured in millions. Note the one very high outlier. Research Question: Does having higher sales enable a firm to spend more on R&D?

Predicted values (in red) based on a SLR of rdintens on sales. Note: the coefficient on sales is not statistically significant. We cannot reject the null hypothesis that βsales equals zero. reg rdintens sales predict pred_rdintens_linear twoway (scatter rdintens sales) (scatter pred_rdintens_linear sales)

Now we try the quadratic specification: What’s different? How do we test for the joint significance of sales and salessq? Recall: F-test for overall significance of a regression (since sales and salessq are the only explanatory variables in this model!*). This F stat is reported by Stata automatically. *If we also had other explanatory variables, then we would do an F test where the restricted model removed sales and salessq from the regression.

Predictions from the quadratic specification (green) vs Predictions from the quadratic specification (green) vs. predictions from the linear specification (red):

Multiple Regression Analysis: Further Issues More on goodness-of-fit and selection of regressors General remarks on R-squared: In economics, the main goal of regression analysis is to credibly identify causal effects. A high R-squared does not imply that there is a causal interpretation A low R-squared does not preclude precise estimation of partial effects However, when trying to identify the best functional form, it can be helpful to compare measures of goodness-of-fit. E.g. Comparing a model with log(sales) vs. sales as the explanatory variable. reg rdintens sales ---> reg rdintens lsales --->

Multiple Regression Analysis: Further Issues Question: What if we want to compare the model with log(sales) as regressor against a model with a quadratic functional form (sales and sales-squared as regressors) ? Why might we not want to use R-squared? Because R-squared always increases when we add more regressors. It wouldn‘t really be a “ fair“ comparison, since the quadratic regression has more regressors Instead, we use a measure called “Adjusted R-squared“ that takes into account how many regressors are in the model.

Multiple Regression Analysis: Further Issues Correct for degrees of freedom in nominator and denominator Adjusted R-squared (cont.) A better estimate taking into account degrees of freedom would be The adjusted R-squared imposes a penalty for adding new regressors The adjusted R-squared increases if, and only if, the t-statistic of a newly added regressor is greater than one in absolute value How to calculate adjusted R-squared, from R-squared. The adjusted R-squared may even get negative

Multiple Regression Analysis: Further Issues Using adjusted R-squared to choose between nonnested models Models are nonnested if neither model is a special case of the other A comparison between the R-squared of both models would be unfair to the first model because the first model contains fewer parameters In this example, even after adjusting for the difference in degrees of freedom, the quadratic model is still preferred

Multiple Regression Analysis: Further Issues Comparing models with different dependent variables CAUTION: R-squared or adjusted R-squared must not be used to compare models which differ in their definition of the dependent var. Example: CEO compensation and firm performance There is much less variation in log(salary) that needs to be explained than in salary

Multiple Regression Analysis: Further Issues Predicting y when log(y) is the dependent variable Under the additional assumption that is independent of : Prediction for y

Multiple Regression Analysis: Further Issues Comparing R-squared of a logged and an unlogged specification These are the R-squareds for the predictions of the unlogged salary variable (although the second regression is originally for logged salaries). Both R-squareds can now be directly compared.