Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.

Slides:



Advertisements
Similar presentations
Inferential Statistics and t - tests
Advertisements

Multivariate Regression
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Estimating a Population Proportion
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Specifying an Econometric Equation and Specification Error
Qualitative Variables and
4.3 Confidence Intervals -Using our CLM assumptions, we can construct CONFIDENCE INTERVALS or CONFIDENCE INTERVAL ESTIMATES of the form: -Given a significance.
The Use and Interpretation of the Constant Term
Choosing a Functional Form
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 7: Demand Estimation and Forecasting.
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
Feb 21, 2006Lecture 6Slide #1 Adjusted R 2, Residuals, and Review Adjusted R 2 Residual Analysis Stata Regression Output revisited –The Overall Model –Analyzing.
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Econ 140 Lecture 131 Multiple Regression Models Lecture 13.
Chapter 4 Multiple Regression.
Multiple Regression Models
Further Inference in the Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Lecture 23 Multiple Regression (Sections )
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Simple Linear Regression Analysis
Multiple Linear Regression A method for analyzing the effects of several predictor variables concurrently. - Simultaneously - Stepwise Minimizing the squared.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Simple Linear Regression
Multiple Linear Regression Analysis
Active Learning Lecture Slides
Copyright © 2011 Pearson Education, Inc. Multiple Regression Chapter 23.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Nonlinear Regression Functions
Chapter 13: Inference in Regression
Correlation and Linear Regression
Hypothesis Testing in Linear Regression Analysis
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
4.2 One Sided Tests -Before we construct a rule for rejecting H 0, we need to pick an ALTERNATE HYPOTHESIS -an example of a ONE SIDED ALTERNATIVE would.
2-1 MGMG 522 : Session #2 Learning to Use Regression Analysis & The Classical Model (Ch. 3 & 4)
Using the Margins Command to Estimate and Interpret Adjusted Predictions and Marginal Effects Richard Williams
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
Lecture 22 Dustin Lueker.  The sample mean of the difference scores is an estimator for the difference between the population means  We can now use.
Specification Error I.
Chapter 10 Hetero- skedasticity Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Welcome to Econ 420 Applied Regression Analysis Study Guide Week Six.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
Review of Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Regression Analysis A statistical procedure used to find relations among a set of variables.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
Discussion of time series and panel models
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
3-1 MGMG 522 : Session #3 Hypothesis Testing (Ch. 5)
7.4 DV’s and Groups Often it is desirous to know if two different groups follow the same or different regression functions -One way to test this is to.
Chapter 4 The Classical Model Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
Chap 6 Further Inference in the Multiple Regression Model
11 Chapter 5 The Research Process – Hypothesis Development – (Stage 4 in Research Process) © 2009 John Wiley & Sons Ltd.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Example x y We wish to check for a non zero correlation.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Multiple Regression SECTIONS 10.1, 10.3 Multiple explanatory variables (10.1,
Review: Stages in Research Process Formulate Problem Determine Research Design Determine Data Collection Method Design Data Collection Forms Design Sample.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Regression Analysis: A statistical procedure used to find relations among a set of variables B. Klinkenberg G
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
4-1 MGMG 522 : Session #4 Choosing the Independent Variables and a Functional Form (Ch. 6 & 7)
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Elementary Statistics
Seminar in Economics Econ. 470
Chapter 13 Additional Topics in Regression Analysis
Presentation transcript:

Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education. Chapter 8 Model Selection in Multiple Linear Regression Analysis

8-2 Learning Objectives Understand the problem presented by omitted variable bias Understand the problem presented by including an irrelevant variable Understand the problem presented by missing data Understand the problem presented by outliers Perform the RESET test for the inclusion of higher-order polynomials

8-3 Learning Objectives Perform the Davidson-MacKinnon test for choosing among non-nested alternatives Consider how to implement the “eye test” to judge the estimated sample regression function Consider what it means for a p-value to be just above a given cutoff

8-4

8-5 Understand the Problem Presented by Omitted Variable Bias Omitted Variable Bias is the bias in coefficient estimates when a variable is omitted from the model and that variable is also related to one or more independent variables. Omitted variable bias results in OLS estimates being on average wrong and incorrect hypothesis test and confidence intervals.

8-6 Understand the Problem Presented by Including an Irrelevant Variable Including an Irrelevant Variable is when a variable is included in the regression model even though it is not related to the dependent variable. Including an irrelevant variable does not cause the coefficient estimates to be biased but it may result in larger standard errors (which might result in more variables being estimated as statistically insignificant).

8-7 What is the Lesser of Two Evils: Omitted Variable Bias or Including an Irrelevant Variable? Because omitting a relevant variable results in biased estimates while including an irrelevant variable does not, it is more desirable to include an irrelevant variable. However, it would be best to have a correctly specified model without either an omitted variable or an irrelevant variable. A correctly specified model should be created by considering relevant economic theory and by looking at what others have done in similar studies.

8-8 Understand the Problem Presented by Missing Data When collecting data sometime data are missing for some of the observations. Solutions: (1)If there is no systematic reason that the data are missing, we can delete those observations and estimate the model for the observations with the non-missing data. (2)Create a new dummy variable, which is equal to 1 if the data are missing and 0 if they aren’t for that observation (and set the value of the missing observations to 0)

8-9 Empirical Example: We are interested in explaining the teen fertility rate (births per 1,000 women ages 15-19) using expenditure per student in primary school (% of GDP per capita) in Of the 252 Countries the World Bank collects data on, 225 of them have data on teen fertility and only 90 of the countries have data on education expenditure. C.ZS/countries

8-10 Empirical Example: Running the Regression with the Missing Observations Deleted The interpretation of the expenditure variable coefficient is that, “ on average, as expenditure per student as a percent of GDP per capita goes up by $1, the teen fertility rate drops by 2.29 per 1000.” This results is statistically significant at the 1% level.

8-11 Empirical Example: Running the Regression with the Missing Observations Set to 0 and a Dummy Variable Added for the Missing Observations The coefficient with the missing observations included is almost identical to the coefficient estimate on the previous slide suggesting that the missing observations don’t affect the results. However, the dummy variable for the missing observations is statistically significant at the 1% level and it suggests that, on average, the teen fertility rate is lower than those countries without education expenditure missing.

8-12 Understand the Problem Presented by Outliers Outliers can significantly affect the calculated slope coefficients. It is not acceptable to simply drop outliers unless you can determine their presence is due to data entry error. One possible way to control for outliers is to put a dummy variable in for dependent and independent variable outliers.

8-13 Empirical Example: Total Medals won in the Olympics vs. GDP per Capita Potential Outliers

8-14 Empirical Example: Regression Results without Controlling for the Outliers The coefficient on GDP per Capita means on average, if GDP per capita increases by $1000 then the number of Olympic medals goes up by.15 of a medal. This coefficient is statistically significant at the 10% and it is almost significant at the 5% level. 8-14

8-15 Empirical Example: Regression Results without Controlling for the Outliers The coefficient on GDP per Capita has increased from.15 to.21. The medal outlier coefficient says that, “on average the three medal outliers have more medals relative to not being an outlier.” This GDP outlier coefficient says that, “on average the two GDPoutliers have 20 fewer medals relative to not being an outlier.” Both of these coefficients are statistically significant at the 5% level.

8-16 Perform the Reset Test for the Inclusion of Higher-Order Polynomials

8-17 Perform the Davidson MacKinnon Test for Choosing Among Non-Nested Alternatives

8-18 Consider How to Implement the “Eye Test” to Judge the Sample Regression Function To perform the “eye test” you should take a step back from your results and analyze whether the estimates and your conclusions seem reasonable.

8-19 Examples of where the “Eye Test” Should have been used (1)One student tried to analyze what determined super bowl wins using only one year of data (so that there was only one winner) (2)Another student …