Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 24 Building Regression Models.

Slides:



Advertisements
Similar presentations
Copyright © 2011 Pearson Education, Inc. Curved Patterns Chapter 20.
Advertisements

Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 20 Curved Patterns.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Inference for Regression
Copyright © 2010, 2007, 2004 Pearson Education, Inc. *Chapter 29 Multiple Regression.
Class 17: Tuesday, Nov. 9 Another example of interpreting multiple regression coefficients Steps in multiple regression analysis and example analysis Omitted.
Statistics for Managers Using Microsoft® Excel 5th Edition
Statistics for Managers Using Microsoft® Excel 5th Edition
January 6, morning session 1 Statistics Micro Mini Multiple Regression January 5-9, 2008 Beth Ayers.
Lecture 24: Thurs., April 8th
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Slide Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Correlation and Regression Analysis
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Objectives of Multiple Regression
Active Learning Lecture Slides
Copyright © 2011 Pearson Education, Inc. Multiple Regression Chapter 23.
Inference for regression - Simple linear regression
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 25 Categorical Explanatory Variables.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
September In Chapter 14: 14.1 Data 14.2 Scatterplots 14.3 Correlation 14.4 Regression.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2011 Pearson Education, Inc. Regression Diagnostics Chapter 22.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 22 Regression Diagnostics.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 23 Multiple Regression.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 27 Time Series.
Copyright © 2011 Pearson Education, Inc. Analysis of Variance Chapter 26.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 15: Correlation and Regression Part 2: Hypothesis Testing and Aspects of a Relationship.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 18 Inference for Counts.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
Copyright © 2011 Pearson Education, Inc. The Simple Regression Model Chapter 21.
Slide 8- 1 Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Stat 112 Notes 16 Today: –Outliers and influential points in multiple regression (Chapter 6.7)
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Lecture 4 Introduction to Multiple Regression
The Statistical Imagination Chapter 15. Correlation and Regression Part 2: Hypothesis Testing and Aspects of a Relationship.
Copyright © 2011 Pearson Education, Inc. Time Series Chapter 27.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 21 The Simple Regression Model.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8- 1.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Copyright © 2011 Pearson Education, Inc. Regression Diagnostics Chapter 22.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Active Learning Lecture Slides For use with Classroom Response Systems Chapter 23 Multiple Regression.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 26 Analysis of Variance.
BPS - 5th Ed. Chapter 231 Inference for Regression.
HW 23 Key. 24:41 Promotion. These data describe promotional spending by a pharmaceutical company for a cholesterol-lowering drug. The data cover 39 consecutive.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Chapter 15 Multiple Regression Model Building
Regression Analysis.
Lecture Slides Elementary Statistics Thirteenth Edition
CHAPTER 29: Multiple Regression*
Basic Practice of Statistics - 3rd Edition Inference for Regression
Regression Forecasting and Model Building
Chapter 13 Additional Topics in Regression Analysis
Presentation transcript:

Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 24 Building Regression Models

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables What explanatory variables belong in a regression model for stock returns?  Initial model motivated by theory such as CAPM  Seek additional variables that improve fit and produce better predictions  The process is typically complicated by correlated explanatory variables (i.e., collinearity)

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables The Initial Model  Build a model that describes returns on Sony stock  CAPM provides a theoretical starting point: use % change for the whole stock market as an explanatory variable

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables The Initial Model – Scatterplot Association appears linear, two outliers identified.

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables The Initial Model – Timeplot of Residuals Locates outliers in time (Dec and Apr. 2003). No evidence of dependence.

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables The Initial Model – Regression Results

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables The Initial Model – Residual Plot Aside from the two outliers, residuals have similar variances.

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables The Initial Model – Check Normality Aside from the two outliers, residuals are nearly normal.

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables The Initial Model – Proceed to Inference  Estimates are consistent with CAPM.  The estimated intercept is not significantly different from zero with a p-value of  The estimated slope is highly significant with a p- value less than

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables Identifying Other Variables  Research in finance suggests other variables, should be added to the initial model.  Three of these variables are: percentage change in the DJIA (Dow % Change) and differences in performance between small and large companies (Small-Big) and between growth and value stocks (High-Low).

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables Correlation Matrix

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables Scatterplot Matrix

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables Identifying Other Variables  The correlation matrix indicates that percentage changes in the DJIA and in the whole market index are highly correlated.  The scatterplot matrix indicates that the associations between the response and these variables appear linear.

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables Adding Explanatory Variables  The data consist of 204 observations with four candidate explanatory variables.  Begin model building by including all four variables in the multiple regression model.

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables MRM with All Four Explanatory Variables

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables Residual Plot: Residuals vs. Fitted Values Outliers are still present; however, this and other residual plots show the conditions for MRM are satisfied.

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables MRM with All Four Explanatory Variables  The F-statistic is with p-value < ; this multiple regression equation explains statistically significant variation in percentage changes in the value of Sony stock.  Based on the t-statistics, only the variable Small- Big improves a regression that contains all of the other explanatory variables.

Copyright © 2014, 2011 Pearson Education, Inc Identifying Explanatory Variables MRM with All Four Explanatory Variables  Adding other explanatory variables to the initial model alters the slope for Market % Change.  This once important variable barely contributes statistically significant variation to the multiple regression.

Copyright © 2014, 2011 Pearson Education, Inc Collinearity Marginal and Partial Slopes  There is a high correlation between Market % Change and Dow % Change (r = 0.91).  This collinearity produces imprecise estimates of the partial slopes.  It explains the difference between the marginal and partial slopes for Market % Change.

Copyright © 2014, 2011 Pearson Education, Inc Collinearity Variance Inflation Factor (VIF)  Variance inflation factor: quantifies the amount of unique variation in each explanatory variable and measures the effect of collinearity.  The VIF for is

Copyright © 2014, 2011 Pearson Education, Inc Collinearity Results for Sony Stock Value Example

Copyright © 2014, 2011 Pearson Education, Inc Collinearity Results for Sony Stock Value Example  Is High-Low not statistically significant because it is redundant or simply unrelated to the response?  Because it has a VIF near 1, collinearity has little effect on this variable (not redundant).  Generally, VIF > 5 or 10 suggests redundancy.

Copyright © 2014, 2011 Pearson Education, Inc Collinearity Signs of Collinearity  R 2 increases less than we’d expect.  Slopes of correlated explanatory variables in the model change dramatically.  The F-statistic is more impressive than individual t-statistics.

Copyright © 2014, 2011 Pearson Education, Inc Collinearity Signs of Collinearity (Continued)  Standard errors for partial slopes are larger than those for marginal slopes.  Variance inflation factors increase.

Copyright © 2014, 2011 Pearson Education, Inc Collinearity Remedies for Collinearity  Remove redundant explanatory variables.  Re-express explanatory variables (e.g., use the average of Market % Change and Dow % Change as an explanatory variable).  Do nothing if the explanatory variables are significant with sensible estimates.

Copyright © 2014, 2011 Pearson Education, Inc Removing Explanatory Variables Issues  After adding several explanatory variables to a model, some of those added and some of those originally present may not be statistically significant.  Remove those variables for which both statistics and substance indicate removal (e.g., remove Dow % Change rather than Market % Change).

Copyright © 2014, 2011 Pearson Education, Inc. 27 4M Example 24.1: MARKET SEGMENTATION Motivation Within which magazine should a manufacturer of a new mobile phone advertise? One has an older audience. They collect consumer ratings on the new phone design along with consumers’ ages and reported incomes.

Copyright © 2014, 2011 Pearson Education, Inc. 28 4M Example 24.1: MARKET SEGMENTATION Method Use multiple regression with ratings as the response and age and income as the explanatory variables. Examine the correlation matrix and scatterplot matrix.

Copyright © 2014, 2011 Pearson Education, Inc. 29 4M Example 24.1: MARKET SEGMENTATION Method There is a high correlation between age and income that implies collinearity.

Copyright © 2014, 2011 Pearson Education, Inc. 30 4M Example 24.1: MARKET SEGMENTATION Method Association is linear with no outliers.

Copyright © 2014, 2011 Pearson Education, Inc. 31 4M Example 24.1: MARKET SEGMENTATION Mechanics – Estimation Results

Copyright © 2014, 2011 Pearson Education, Inc. 32 4M Example 24.1: MARKET SEGMENTATION Mechanics – Examine Plots MRM conditions are satisfied.

Copyright © 2014, 2011 Pearson Education, Inc. 33 4M Example 24.1: MARKET SEGMENTATION Mechanics The F-statistic has a p-value of < The model explains statistically significant variation in the ratings. Although collinear, both predictors (age and income) are statistically significant.

Copyright © 2014, 2011 Pearson Education, Inc. 34 4M Example 24.1: MARKET SEGMENTATION Message The manufacturer should advertise in the magazine with younger subscribers. Based on the 95% confidence interval for the slope of Age, a younger affluent audience assigns, on average, ratings that are 1 to 2 points higher (out of 10) than the older, affluent audience. Age changes sign when adjusted for differences in income. Substantively, this makes sense because younger customers with money find the new design attractive.

Copyright © 2014, 2011 Pearson Education, Inc. 35 4M Example 24.2: RETAIL PROFITS Motivation A chain of pharmacies is looking to expand into a new community. It has data for 110 cities on the following variables: income, disposable income, birth rate, social security recipients, cardiovascular deaths and percentage of local population aged 65 or more.

Copyright © 2014, 2011 Pearson Education, Inc. 36 4M Example 24.2: RETAIL PROFITS Method Use multiple regression. The response variable is profit. Examine the correlation matrix and the scatterplot matrix.

Copyright © 2014, 2011 Pearson Education, Inc. 37 4M Example 24.2: RETAIL PROFITS Method Several high correlations are present (shaded in table) and indicate the presence of collinearity.

Copyright © 2014, 2011 Pearson Education, Inc. 38 4M Example 24.2: RETAIL PROFITS Method This partial scatterplot matrix identifies communities that are distinct from others. Linearity and no lurking variables conditions are met.

Copyright © 2014, 2011 Pearson Education, Inc. 39 4M Example 24.2: RETAIL PROFITS Mechanics – Estimation Results

Copyright © 2014, 2011 Pearson Education, Inc. 40 4M Example 24.2: RETAIL PROFITS Mechanics – Examine Plots These and other plots (not shown here) indicate that all MRM conditions are satisfied.

Copyright © 2014, 2011 Pearson Education, Inc. 41 4M Example 24.2: RETAIL PROFITS Mechanics The F-statistic indicates that this collection of explanatory variables explains statistically significant variation in profits. The VIF’s indicate some explanatory variables are redundant and should be removed (one at a time) from the model.

Copyright © 2014, 2011 Pearson Education, Inc. 42 4M Example 24.2: RETAIL PROFITS Mechanics – Simplified Model This multiple regression separates the effects of birth rates from age (and income). It reveals that cities with higher birth rates produce higher profits when compared to cities with lower birth rates but comparable income and local population above 65.

Copyright © 2014, 2011 Pearson Education, Inc. 43 4M Example 24.2: RETAIL PROFITS Message Three characteristics of the local community affect estimated profits: disposable income, age and birth rates. Increases in each of these lead to higher profits. The data show that the pharmacy chain will have to trade off these characteristics in selecting a site for expansion.

Copyright © 2014, 2011 Pearson Education, Inc. 44 Best Practices  Become familiar with the substantive problem that needs to be solved.  Begin a regression analysis by looking at plots.  Use the F-statistic for the overall model and a t-statistic for each explanatory variable.  Learn to recognize the presence of collinearity.  Don’t fear collinearity – understand it.

Copyright © 2014, 2011 Pearson Education, Inc. 45 Pitfalls  Do not remove explanatory variables at the first sign of collinearity.  Don’t remove several explanatory variables from your model at once.