Multiple Regression Analysis: Further Issues


More on functional form: using logarithmic functional forms

Advantages of taking logs:
- Convenient percentage/elasticity interpretation of coefficients
- Slope coefficients of logged variables are invariant to rescalings
- Taking logs often eliminates or mitigates problems with outliers
- Taking logs often helps to secure normality and homoskedasticity

Limitations:
- Variables measured in units such as years should not be logged
- Variables measured in percentage points should also not be logged
- Logs must not be used if variables take on zero or negative values
- It is hard to reverse the log operation when constructing predictions
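The rescaling invariance of logged-variable slopes can be checked numerically. The sketch below uses hypothetical simulated data and plain NumPy least squares; all names and numbers are illustrative.

```python
import numpy as np

# Hypothetical simulated data: y depends on log(x) with slope 0.5.
rng = np.random.default_rng(0)
x = rng.uniform(1, 10, 500)
y = 2.0 + 0.5 * np.log(x) + rng.normal(0, 0.1, 500)

def ols_slope(logx, y):
    # Slope from a regression of y on an intercept and logx.
    X = np.column_stack([np.ones_like(logx), logx])
    return np.linalg.lstsq(X, y, rcond=None)[0][1]

# Rescaling x (say, changing its units by a factor of 1000) only shifts
# the intercept, because log(1000*x) = log(1000) + log(x).
b_original = ols_slope(np.log(x), y)
b_rescaled = ols_slope(np.log(1000 * x), y)
print(np.isclose(b_original, b_rescaled))  # True: slope unchanged
```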

Using quadratic functional forms

Example: a wage equation with a quadratic in experience, with estimated coefficients .298 on exper and -.0061 on exper squared. The marginal effect of experience is then .298 - 2(.0061)exper, a concave experience profile: the first year of experience increases the wage by some $.30, the second year by .298 - 2(.0061)(1), or about $.29, and so on.

Wage maximum with respect to work experience

The turnaround point is at .298/[2(.0061)], i.e. about 24.4 years of experience. Does this mean the return to experience becomes negative after 24.4 years? Not necessarily: it depends on how many observations in the sample lie to the right of the turnaround point. In the given example, these are about 28% of the observations, so there may be a specification problem (e.g. omitted variables).
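With the coefficients quoted above (.298 on experience, -.0061 on its square), the marginal effect and the turnaround point can be reproduced directly:

```python
# Quadratic experience profile from the slide: coefficient .298 on exper
# and -.0061 on exper squared.
b1, b2 = 0.298, -0.0061

def marginal_effect(exper):
    # d(wage)/d(exper) = b1 + 2*b2*exper
    return b1 + 2 * b2 * exper

print(round(marginal_effect(0), 3))  # 0.298: first year adds about $.30
print(round(marginal_effect(1), 4))  # 0.2858: second year adds about $.29
turnaround = -b1 / (2 * b2)          # point where the profile peaks
print(round(turnaround, 1))          # 24.4 years
```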

Example: effects of pollution on housing prices

The log of the housing price is regressed on the amount of nitrogen oxide in the air, the distance from employment centers, the average student/teacher ratio, and a quadratic in the number of rooms. The estimated coefficient on rooms is negative while that on its square is positive. Does this mean that, at a low number of rooms, more rooms are associated with lower prices?

Calculation of the turnaround point

With a negative coefficient on rooms and a positive coefficient on rooms squared, the fitted relationship is U-shaped, and the turnaround point lies at minus the coefficient on rooms divided by twice the coefficient on rooms squared. The area to the left of the turnaround point can be ignored, as it concerns only 1% of the observations. To the right of it, the effect of an additional room is positive and increasing: the estimated percentage increase in price from raising rooms from 5 to 6 is smaller than that from raising rooms from 6 to 7.

Other possibilities: higher-order polynomials.

Models with interaction terms

An interaction term, such as the product of square footage and the number of bedrooms, lets the effect of the number of bedrooms depend on the level of square footage. Interaction effects complicate the interpretation of parameters: the coefficient on the number of bedrooms alone then measures its effect for a square footage of zero.

Reparametrization of interaction effects

Replace the raw interaction x1*x2 by the centered interaction (x1 - mu1)(x2 - mu2), where mu1 and mu2 are the population means (in practice they may be replaced by sample means). The coefficient on x2 then measures the effect of x2 if all other variables take on their mean values.

Advantages of reparametrization:
- Easy interpretation of all parameters
- Standard errors for partial effects at the mean values are available
- If necessary, the interaction may be centered at other interesting values
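A minimal numerical sketch of the reparametrization, on hypothetical simulated data (x1 and x2 stand in for, say, square footage and bedrooms; all numbers are made up):

```python
import numpy as np

# Hypothetical simulated data with a true interaction effect.
rng = np.random.default_rng(1)
n = 1000
x1 = rng.normal(5, 1, n)
x2 = rng.normal(3, 1, n)
y = 1 + 0.5 * x1 + 0.8 * x2 + 0.3 * x1 * x2 + rng.normal(0, 1, n)

def ols(cols, y):
    X = np.column_stack([np.ones(len(y))] + cols)
    return np.linalg.lstsq(X, y, rcond=None)[0]

# Raw parametrization: the coefficient on x2 is its effect at x1 = 0.
b_raw = ols([x1, x2, x1 * x2], y)

# Centered interaction: the coefficient on x2 is its effect at x1 = mean(x1).
m1, m2 = x1.mean(), x2.mean()
b_cen = ols([x1, x2, (x1 - m1) * (x2 - m2)], y)

# The two fits are equivalent; the x2 coefficients are linked by
# b2_centered = b2_raw + b3_raw * mean(x1).
print(np.isclose(b_cen[2], b_raw[2] + b_raw[3] * m1))  # True
```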

Average partial effects

In models with quadratics, interactions, and other nonlinear functional forms, the partial effect depends on the values of one or more explanatory variables. The average partial effect (APE) is a summary measure describing the relationship between the dependent variable and each explanatory variable: after computing the partial effect and plugging in the estimated parameters, average the partial effects for each unit across the sample.
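For a quadratic model the APE is just the sample average of the unit-level derivatives. The sketch below reuses the wage-equation coefficients from earlier; the sample values of x are hypothetical:

```python
import numpy as np

# APE in a model with a quadratic: the partial effect b1 + 2*b2*x varies
# across units, so average it over the sample (hypothetical x values;
# b1, b2 taken from the wage-equation example).
b1, b2 = 0.298, -0.0061
x = np.array([2.0, 5.0, 10.0, 20.0, 30.0])
partial_effects = b1 + 2 * b2 * x
ape = partial_effects.mean()
print(round(ape, 4))  # 0.1345
```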

More on goodness-of-fit and selection of regressors

General remarks on R-squared:
- A high R-squared does not imply that there is a causal interpretation
- A low R-squared does not preclude precise estimation of partial effects

What is the ordinary R-squared supposed to measure? R^2 = 1 - SSR/SST is an estimate of the population R-squared, 1 - Var(u)/Var(y), the fraction of the variation of y in the population that is explained by the explanatory variables.

Adjusted R-squared (cont.)

The ordinary R-squared estimates Var(u)/Var(y) by SSR/SST, which ignores the correct degrees of freedom of numerator and denominator. A better estimate that takes the degrees of freedom into account is the adjusted R-squared:

  adjusted R^2 = 1 - [SSR/(n-k-1)] / [SST/(n-1)] = 1 - (1 - R^2)(n-1)/(n-k-1)

The adjusted R-squared imposes a penalty for adding new regressors: it increases if, and only if, the t-statistic of a newly added regressor is greater than one in absolute value. Unlike the ordinary R-squared, the adjusted R-squared may even become negative.
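The formula is easy to evaluate directly; the numbers below are made up purely to illustrate the penalty for extra regressors:

```python
def adjusted_r2(r2, n, k):
    # 1 - (1 - R^2)(n - 1)/(n - k - 1), with k slope coefficients.
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

# Hypothetical numbers: piling on weak regressors can push the adjusted
# R-squared below zero even though the ordinary R-squared rises.
print(round(adjusted_r2(0.10, 30, 1), 4))  # 0.0679
print(round(adjusted_r2(0.11, 30, 8), 4))  # -0.229
```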

Using adjusted R-squared to choose between nonnested models

Models are nonnested if neither model is a special case of the other. A comparison between the ordinary R-squareds of the two models would be unfair to the first model because it contains fewer parameters; the adjusted R-squared accounts for this. In the given example, even after adjusting for the difference in degrees of freedom, the quadratic model is preferred.

Comparing models with different dependent variables

R-squared or adjusted R-squared must not be used to compare models that differ in the definition of the dependent variable. Example: CEO compensation and firm performance. A regression for log(salary) mechanically tends to show a higher R-squared than one for salary, simply because there is much less variation in log(salary) that needs to be explained than in salary.

Controlling for too many factors in regression analysis

In some cases, certain variables should not be held fixed:
- In a regression of traffic fatalities on state beer taxes (and other factors), one should not directly control for beer consumption, since the tax is supposed to reduce fatalities precisely by reducing consumption.
- In a regression of family health expenditures on pesticide usage among farmers, one should not control for doctor visits, since doctor visits are a channel through which pesticide usage affects expenditures.

Different regressions may serve different purposes: in a regression of house prices on house characteristics, one would include price assessments only if the purpose of the regression is to study their validity; otherwise one would not include them.

Adding regressors to reduce the error variance

Adding regressors may exacerbate multicollinearity problems; on the other hand, adding regressors reduces the error variance. Variables that are uncorrelated with the other regressors are ideal additions, because they reduce the error variance without increasing multicollinearity; however, such uncorrelated variables may be hard to find. Example: including individual characteristics in a regression of beer consumption on beer prices leads to more precise estimates of the price elasticity.
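A simulation sketch of this point (all names and numbers hypothetical): adding a regressor z that is uncorrelated with x but explains y shrinks the standard error of x's coefficient.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 5000
x = rng.normal(size=n)  # e.g. a price variable, hypothetical
z = rng.normal(size=n)  # individual characteristic, uncorrelated with x
y = 1.0 - 0.5 * x + 2.0 * z + rng.normal(size=n)

def slope_se(cols, y):
    # OLS standard error of the coefficient on the first regressor in cols.
    X = np.column_stack([np.ones(len(y))] + cols)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    sigma2 = resid @ resid / (len(y) - X.shape[1])
    return np.sqrt(sigma2 * np.linalg.inv(X.T @ X)[1, 1])

se_without_z = slope_se([x], y)
se_with_z = slope_se([x, z], y)
print(se_with_z < se_without_z)  # True: more precise with z included
```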

Predicting y when log(y) is the dependent variable

Simply exponentiating the predicted value of log(y) systematically underestimates the expected value of y. Under the additional assumption that the error u is independent of the explanatory variables, E(y|x) = a0 * exp(b0 + b1 x1 + ... + bk xk) with a0 = E[exp(u)] > 1, so the prediction for y is obtained by multiplying exp(predicted log(y)) by an estimate of a0, for example the sample average of the exponentiated residuals.
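A sketch of this correction on hypothetical simulated data, using the sample average of the exponentiated residuals as the estimate of E[exp(u)]:

```python
import numpy as np

# Hypothetical data: log(y) linear in x with normal errors.
rng = np.random.default_rng(2)
n = 2000
x = rng.uniform(0, 2, n)
y = np.exp(1.0 + 0.8 * x + rng.normal(0, 0.5, n))

X = np.column_stack([np.ones(n), x])
beta = np.linalg.lstsq(X, np.log(y), rcond=None)[0]
logy_hat = X @ beta

# Scale factor: sample average of exp(residual), estimating E[exp(u)].
alpha_hat = np.mean(np.exp(np.log(y) - logy_hat))

y_naive = np.exp(logy_hat)         # systematically too small
y_corrected = alpha_hat * y_naive  # corrected prediction
print(alpha_hat > 1)  # True: exponentiating alone underpredicts
```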

Comparing the R-squareds of a logged and an unlogged specification

Compute, for both models, predictions of the unlogged salary variable (for the log model, by transforming the predicted log(salary) back into a prediction of salary) and take the squared correlation between actual salary and each set of predictions. Although the second regression is originally for logged salaries, both R-squareds then refer to predictions of the same variable and can be directly compared.
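The comparable R-squared is just a squared sample correlation; a minimal helper with a toy check (all numbers hypothetical):

```python
import numpy as np

def r2_from_predictions(y, y_hat):
    # Squared sample correlation between the actual dependent variable
    # and a model's prediction of it; comparable across models even when
    # one was estimated with log(y) as the dependent variable.
    return np.corrcoef(y, y_hat)[0, 1] ** 2

# Toy check: a prediction that is a perfect linear function of y.
y = np.array([1.0, 2.0, 3.0, 4.0])
print(round(r2_from_predictions(y, 2 * y + 5), 6))  # 1.0
```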