12-1. 12-2 Chapter Twelve Multiple Regression and Model Building McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.

Slides:



Advertisements
Similar presentations
Ecole Nationale Vétérinaire de Toulouse Linear Regression
Advertisements

Part 3 Probabilistic Decision Models
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
Chapter 1 Image Slides Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Describing Data: Measures of Dispersion
Variation, uncertainties and models Marian Scott School of Mathematics and Statistics, University of Glasgow June 2012.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Assumptions underlying regression analysis
Simple Linear Regression 1. review of least squares procedure 2
Chapter 4: Basic Estimation Techniques
Chi-Square and Analysis of Variance (ANOVA)
Chapter 10 Estimating Means and Proportions
Multiple Regression. Introduction In this chapter, we extend the simple linear regression model. Any number of independent variables is now allowed. We.
Lecture Unit Multiple Regression.
9: Examining Relationships in Quantitative Research ESSENTIALS OF MARKETING RESEARCH Hair/Wolfinbarger/Ortinau/Bush.
McGraw-Hill/Irwin © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 11 Measuring Item Interactions.
Module 17: Two-Sample t-tests, with equal variances for the two populations This module describes one of the most utilized statistical tests, the.
25 seconds left…...
Statistical Inferences Based on Two Samples
© The McGraw-Hill Companies, Inc., Chapter 12 Chi-Square.
Chapter Thirteen The One-Way Analysis of Variance.
Chapter 8 Estimation Understandable Statistics Ninth Edition
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.
Experimental Design and Analysis of Variance
COMPLETE f o u r t h e d i t i o n BUSINESS STATISTICS Aczel Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc., Using Statistics The Simple.
Simple Linear Regression Analysis
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Business Statistics, 4e by Ken Black
Correlation and Linear Regression
Week 13 November Three Mini-Lectures QMM 510 Fall 2014.
Multiple Regression and Model Building
Adapted by Peter Au, George Brown College McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited.
More on understanding variance inflation factors (VIFk)
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Objectives (BPS chapter 24)
Simple Linear Regression
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Simple Linear Regression Analysis
Simple Linear Regression Analysis
Introduction to Linear Regression and Correlation Analysis
1 1 Slide © 2003 Thomson/South-Western Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
12a - 1 © 2000 Prentice-Hall, Inc. Statistics Multiple Regression and Model Building Chapter 12 part I.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Introduction to Linear Regression
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
14- 1 Chapter Fourteen McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter 13 Multiple Regression
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Chapter 14 Introduction to Multiple Regression
Chapter 20 Linear and Multiple Regression
Chapter 15 Multiple Regression and Model Building
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
Multiple Regression and Model Building
Chapter 13 Simple Linear Regression
Inference for Regression Lines
Multiple Regression Chapter 14.
Chapter Fourteen McGraw-Hill/Irwin
Essentials of Statistics for Business and Economics (8e)
Chapter Thirteen McGraw-Hill/Irwin
Chapter 13 Simple Linear Regression
Presentation transcript:

12-1

12-2 Chapter Twelve Multiple Regression and Model Building McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.

12-3 Multiple Regression 12.1The Linear Regression Model 12.2The Least Squares Estimates and Prediction 12.3The Mean Squared Error and the Standard Error 12.4Model Utility: R 2, Adjusted R 2, and the F Test 12.5Testing the Significance of an Independent Variable 12.6Confidence Intervals and Prediction Intervals 12.7Dummy Variables 12.8Model Building and the Effects of Multicollinearity 12.9 Residual Analysis in Multiple Regression

The Linear Regression Model The linear regression model relating y to x 1, x 2, …, x k is is the mean value of the dependent variable y when the values of the independent variables are x 1, x 2, …, x k.  are the regression parameters relating the mean value of y to x 1, x 2, …, x k.  is an error term that describes the effects on y of all factors other than the independent variables x 1, x 2, …, x k. where

12-5 Example: The Linear Regression Model Example 12.1: The Fuel Consumption Case

12-6 The Linear Regression Model Illustrated Example 12.1: The Fuel Consumption Case

12-7 The Regression Model Assumptions Assumptions about the model error terms,  ’s Mean Zero The mean of the error terms is equal to 0. Constant Variance The variance of the error terms   is, the same for every combination values of x 1, x 2, …, x k. Normality The error terms follow a normal distribution for every combination values of x 1, x 2, …, x k. Independence The values of the error terms are statistically independent of each other. Model

Least Squares Estimates and Prediction Estimation/Prediction Equation: b 1, b 2, …, b k are the least squares point estimates of the parameters  1,  2, …,  k. x 01, x 02, …, x 0k are specified values of the independent predictor variables x 1, x 2, …, x k. is the point estimate of the mean value of the dependent variable when the values of the independent variables are x 01, x 02, …, x 0k. It is also the point prediction of an individual value of the dependent variable when the values of the independent variables are x 01, x 02, …, x 0k.

12-9 Example: Least Squares Estimation Example 12.3: The Fuel Consumption Case Minitab Output FuelCons = Temp Chill Predictor Coef StDev T P Constant Temp Chill S = R-Sq = 97.4% R-Sq(adj) = 96.3% Analysis of Variance Source DF SS MS F P Regression Residual Error Total Predicted Values (Temp = 40, Chill = 10) Fit StDev Fit 95.0% CI 95.0% PI ( 9.895, ) ( 9.293, )

12-10 Example: Point Predictions and Residuals Example 12.3: The Fuel Consumption Case

Mean Square Error and Standard Error Mean Square Error, point estimate of residual variance   Standard Error, point estimate of residual standard deviation  Example 12.3 The Fuel Consumption Case Sum of Squared Errors Analysis of Variance Source DF SS MS F P Regression Residual Error Total

Model Utility: Multiple Coefficient of Determination, R² The multiple coefficient of determination R 2 is R 2 is the proportion of the total variation in y explained by the linear regression model

Model Utility: Adjusted R 2 The adjusted multiple coefficient of determination is Fuel Consumption Case: S = R-Sq = 97.4% R-Sq(adj) = 96.3% Analysis of Variance Source DF SS MS F P Regression Residual Error Total

Model Utility: F Test for Linear Regression Model To testH 0 :   =   = …=   = 0 versus H a : At least one of the  ,  , …,  k is not equal to 0 Test Statistic: Reject H 0 in favor of H a if: F(model) > F   or p-value <  F  is based on k numerator and n-(k+1) denominator degrees of freedom.

12-15 Example: F Test for Linear Regression Test Statistic: Example 12.5 The Fuel Consumption Case Minitab Output Reject H 0 at  level of significance, since F  is based on 2 numerator and 5 denominator degrees of freedom. F-test at  = 0.05 level of significance Analysis of Variance Source DF SS MS F P Regression Residual Error Total

Testing Significance of the Independent Variable Test Statistic If the regression assumptions hold, we can reject H 0 :  j = 0 at the  level of significance (probability of Type I error equal to  ) if and only if the appropriate rejection point condition holds or, equivalently, if the corresponding p-value is less than . t , t  /2 and p-values are based on n – (k+1) degrees of freedom. AlternativeReject H 0 if:p-Value 100(1-  )% Confidence Interval for  j

12-17 Example: Testing and Estimation for  s Example 12.6: The Fuel Consumption Case Minitab Output Predictor Coef StDev T P Constant Temp Chill t , t  /2 and p-values are based on 5 degrees of freedom. Chill is significant at the  = 0.05 level, but not at  = 0.01 Test Interval

Confidence and Prediction Intervals t  is based on n-(k+1) degrees of freedom Prediction: 100(1 -  )% confidence interval for the mean value of y If the regression assumptions hold, 100(1 -  )% prediction interval for an individual value of y (Distance value requires matrix algebra)

12-19 Example: Confidence and Prediction Intervals Example 12.9 The Fuel Consumption CaseMinitab Output FuelCons = Temp Chill Predicted Values (Temp = 40, Chill = 10) Fit StDev Fit 95.0% CI 95.0% PI (9.895, ) (9.293,11.374) 95% Confidence Interval95% Prediction Interval

Dummy Variables Example The Electronics World Case Location Dummy Variable

12-21 Example: Regression with a Dummy Variable Example 12.11: The Electronics World Case Minitab Output Sales = Households DM Predictor Coef StDev T P Constant Househol DM S = R-Sq = 98.3% R-Sq(adj) = 97.8% Analysis of Variance Source DF SS MS F P Regression Residual Error Total

Model Building and the Effects of Multicollinearity Example: The Sale Territory Performance Case

12-23 Correlation Matrix Example: The Sale Territory Performance Case

12-24 Multicollinearity Multicollinearity refers to the condition where the independent variables (or predictors) in a model are dependent, related, or correlated with each other. Effects Hinders ability to use b j s, t statistics, and p-values to assess the relative importance of predictors. Does not hinder ability to predict the dependent (or response) variable. Detection Scatter Plot Matrix Correlation Matrix Variance Inflation Factors (VIF)

Residual Analysis in Multiple Regression For an observed value of y i, the residual is If the regression assumptions hold, the residuals should look like a random sample from a normal distribution with mean 0 and variance  2. Residual Plots Residuals versus each independent variable Residuals versus predicted y’s Residuals in time order (if the response is a time series) Histogram of residuals Normal plot of the residuals

12-26 Multiple Regression Summary: 12.1 The Linear Regression Model 12.2 The Least Squares Estimates and Prediction 12.3 The Mean Squared Error and the Standard Error 12.4 Model Utility: R2, Adjusted R2, and the F Test 12.5 Testing the Significance of an Independent Variable 12.6 Confidence Intervals and Prediction Intervals 12.7 Dummy Variables 12.8 Model Building and the Effects of Multicollinearity 12.9 Residual Analysis in Multiple Regression