Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.

Slides:



Advertisements
Similar presentations
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Advertisements

6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Ch11 Curve Fitting Dr. Deshi Ye
Qualitative Variables and
Chapter 13 Multiple Regression
Chapter 13 Additional Topics in Regression Analysis
Statistics for Managers Using Microsoft® Excel 5th Edition
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 12 Multiple Regression
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Multivariate Data Analysis Chapter 4 – Multiple Regression.
January 6, morning session 1 Statistics Micro Mini Multiple Regression January 5-9, 2008 Beth Ayers.
Chap 15-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Pertemua 19 Regresi Linier
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Business Statistics: A Decision-Making Approach 8 th Edition Chapter 15 Multiple.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Chapter 15: Model Building
Correlation and Regression Analysis
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Statistics for Managers Using Microsoft Excel 3rd Edition
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Chapter 8 Forecasting with Multiple Regression
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
© 2003 Prentice-Hall, Inc.Chap 11-1 Business Statistics: A First Course (3 rd Edition) Chapter 11 Multiple Regression.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
© 2004 Prentice-Hall, Inc.Chap 15-1 Basic Business Statistics (9 th Edition) Chapter 15 Multiple Regression Model Building.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
12a - 1 © 2000 Prentice-Hall, Inc. Statistics Multiple Regression and Model Building Chapter 12 part I.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Chap 14-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 14 Additional Topics in Regression Analysis Statistics for Business.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Lecture 4 Introduction to Multiple Regression
Lecture 3 Introduction to Multiple Regression Business and Economic Forecasting.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Chap 13-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 13 Multiple Regression and.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Chapter 13 Simple Linear Regression
Chapter 15 Multiple Regression Model Building
Chapter 14 Introduction to Multiple Regression
Chapter 15 Multiple Regression and Model Building
Inference for Least Squares Lines
Correlation, Bivariate Regression, and Multiple Regression
Statistics for Managers using Microsoft Excel 3rd Edition
Chapter 9 Multiple Linear Regression
Multiple Regression Analysis and Model Building
Chapter 12: Regression Diagnostics
Business Statistics, 4e by Ken Black
Chapter 15 Multiple Regression Analysis and Model Building
CHAPTER 29: Multiple Regression*
Multiple Regression Models
Statistics for Business and Economics
Product moment correlation
Regression Forecasting and Model Building
Chapter 13 Additional Topics in Regression Analysis
Business Statistics, 4e by Ken Black
Model Adequacy Checking
Presentation transcript:

Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building

Yandell – Econ 216 Chap 15-2 Chapter Goals After completing this chapter, you should be able to: understand model building using multiple regression analysis use variable transformations to model nonlinear relationships recognize potential problems in multiple regression analysis and take the steps to correct the problems Apply techniques to obtain a best-fit regression equation

Yandell – Econ 216 Chap 15-3 The relationship between the dependent variable and an independent variable may not be linear Useful when scatter diagram indicates non- linear relationship Example: Quadratic model The second independent variable is the square of the first variable Nonlinear Relationships

Yandell – Econ 216 Chap 15-4 Polynomial Regression Model where: β 0 = Population regression constant β i = Population regression coefficient for variable X j : j = 1, 2, …k p = Order of the polynomial  i = Model error If p = 2 the model is a quadratic model: General form:

Yandell – Econ 216 Chap 15-5 Linear fit does not give random residuals Linear vs. Nonlinear Fit Nonlinear fit gives random residuals x residuals x y x y x

Yandell – Econ 216 Chap 15-6 Quadratic Regression Model Quadratic models may be considered when scatter diagram takes on the following shapes: x1x1 y x1x1 x1x1 yyy β 1 < 0β 1 > 0β 1 < 0β 1 > 0 β 1 = the coefficient of the linear term β 2 = the coefficient of the squared term x1x1 β 2 > 0 β 2 < 0

Yandell – Econ 216 Chap 15-7 Testing for Significance: Quadratic Model Test for Overall Relationship F test statistic = Testing the Quadratic Effect Compare quadratic model with the linear model Hypotheses (No 2 nd order polynomial term) (2 nd order polynomial term is needed) H 0 : β 2 = 0 H 1 : β 2  0

Yandell – Econ 216 Chap 15-8 Higher Order Models Y X If p = 3 the model is a cubic form:

Yandell – Econ 216 Chap 15-9 Multicollinearity Multicollinearity: High correlation exists between two independent variables This means the two variables contribute redundant information to the multiple regression model

Yandell – Econ 216 Chap Multicollinearity Including two highly correlated independent variables can adversely affect the regression results No new information provided Can lead to unstable coefficients (large standard error and low t-values) Coefficient signs may not match prior expectations (continued)

Yandell – Econ 216 Chap Some Indications of Severe Multicollinearity Incorrect signs on the coefficients Large change in the value of a previous coefficient when a new variable is added to the model A previously significant variable becomes insignificant when a new independent variable is added The estimate of the standard deviation of the model increases when a variable is added to the model

Yandell – Econ 216 Chap Detect Collinearity (Variance Inflationary Factor) VIF j is used to measure collinearity: If VIF j > 5, X j is highly correlated with the other explanatory variables r 2 j is the coefficient of determination when the j th independent variable is regressed against the remaining k – 1 independent variables

Yandell – Econ 216 Chap Detect Collinearity in PHStat Output for the pie sales example: Since there are only two explanatory variables, only one VIF is reported VIF is < 5 There is no evidence of collinearity between Price and Advertising Regression Analysis Price and all other X Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations15 VIF PHStat / regression / multiple regression … Check the “variance inflationary factor (VIF)” box

Yandell – Econ 216 Chap Model Building Goal is to develop a model with the best set of independent variables Easier to interpret if unimportant variables are removed Lower probability of collinearity Stepwise regression procedure Provide evaluation of alternative models as variables are added Best-subset approach Try all combinations and select the best using the highest adjusted r 2 and lowest standard error

Yandell – Econ 216 Chap Idea: develop the least squares regression equation in steps, either through forward selection, backward elimination, or through standard stepwise regression The coefficient of partial determination is the measure of the marginal contribution of each independent variable, given that other independent variables are in the model Stepwise Regression

Yandell – Econ 216 Chap Best Subsets Regression Idea: estimate all possible regression equations using all possible combinations of independent variables Choose the best fit by looking for the highest adjusted r 2 and lowest standard error Stepwise regression and best subsets regression can be performed using PHStat

Yandell – Econ 216 Chap Aptness of the Model Diagnostic checks on the model include verifying the assumptions of multiple regression: Each X i is linearly related to Y Errors have constant variance Errors are independent Error are normally distributed Errors (or Residuals) are given by

Yandell – Econ 216 Chap Residual Analysis Non-constant variance Constant variance xx residuals Not IndependentIndependent x residuals x

Yandell – Econ 216 Chap The Normality Assumption Errors are assumed to be normally distributed Standardized residuals can be calculated by computer Examine a histogram or a normal probability plot of the standardized residuals to check for normality

Yandell – Econ 216 Chap Model Building Flowchart Choose X 1,X 2,…X k Run Regression to find VIFs Remove Variable with Highest VIF Any VIF>5? Run Subsets Regression to Obtain “best” models in terms of C p Do Complete Analysis Add Curvilinear Term and/or Transform Variables as Indicated Perform Predictions No More than One? Remove this X Yes No Yes

Yandell – Econ 216 Chap Chapter Summary Described nonlinear regression models Described multicollinearity Discussed model building Stepwise regression Best subsets regression Examined residual plots to check model assumptions