Lecture 14 Multiple Regression Model

Slides:



Advertisements
Similar presentations
Korelasi Diri (Auto Correlation) Pertemuan 15 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Advertisements

Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Forecasting Using the Simple Linear Regression Model and Correlation
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
© 2001 Prentice-Hall, Inc.Chap 13-1 BA 201 Lecture 21 Autocorrelation and Inferences about the Slope.
Multiple Regression [ Cross-Sectional Data ]
Chapter 13 Multiple Regression
Regresi dan Analisis Varians Pertemuan 21 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter 14 Introduction to Multiple Regression
Korelasi Ganda Dan Penambahan Peubah Pertemuan 13 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter 12 Simple Regression
Interaksi Dalam Regresi (Lanjutan) Pertemuan 25 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Regresi dan Rancangan Faktorial Pertemuan 23 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter 12 Multiple Regression
© 2000 Prentice-Hall, Inc. Chap Multiple Regression Models.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
© 2003 Prentice-Hall, Inc.Chap 14-1 Basic Business Statistics (9 th Edition) Chapter 14 Introduction to Multiple Regression.
1 Pertemuan 13 Uji Koefisien Korelasi dan Regresi Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 13-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter Topics Types of Regression Models
Statistics for Business and Economics Chapter 11 Multiple Regression and Model Building.
Simple Linear Regression Analysis
Multiple Regression and Correlation Analysis
© 2004 Prentice-Hall, Inc.Chap 14-1 Basic Business Statistics (9 th Edition) Chapter 14 Introduction to Multiple Regression.
Linear Regression Example Data
Ch. 14: The Multiple Regression Model building
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Simple Linear Regression. Chapter Topics Types of Regression Models Determining the Simple Linear Regression Equation Measures of Variation Assumptions.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Chapter 7 Forecasting with Simple Regression
Statistics for Managers Using Microsoft Excel 3rd Edition
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
© 2001 Prentice-Hall, Inc.Chap 14-1 BA 201 Lecture 23 Correlation Analysis And Introduction to Multiple Regression (Data)Data.
Chapter 8 Forecasting with Multiple Regression
© Stevenson, McGraw Hill, Assoc. Prof. Sami Fethi, EMU, All Right Reserved. Regression Analysis; Chapter4 MGMT 405, POM, 2010/11. Lec Notes Chapter.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Purpose of Regression Analysis Regression analysis is used primarily to model causality and provide prediction –Predicts the value of a dependent (response)
© 2003 Prentice-Hall, Inc.Chap 11-1 Business Statistics: A First Course (3 rd Edition) Chapter 11 Multiple Regression.
© 2004 Prentice-Hall, Inc.Chap 15-1 Basic Business Statistics (9 th Edition) Chapter 15 Multiple Regression Model Building.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Chapter 14 Introduction to Multiple Regression
12a - 1 © 2000 Prentice-Hall, Inc. Statistics Multiple Regression and Model Building Chapter 12 part I.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Chap 14-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Lecture 4 Introduction to Multiple Regression
Lecture 10: Correlation and Regression Model.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Chap 13-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 13 Multiple Regression and.
Statistics for Managers Using Microsoft® Excel 5th Edition
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
© 2000 Prentice-Hall, Inc. Chap Chapter 10 Multiple Regression Models Business Statistics A First Course (2nd Edition)
Chapter 14 Introduction to Multiple Regression
Lecture 24 Multiple Regression Model And Residual Analysis
Chapter 15 Multiple Regression and Model Building
Statistics for Managers using Microsoft Excel 3rd Edition
Linear Regression Using Excel
Multiple Regression Analysis and Model Building
Pemeriksaan Sisa dan Data Berpengaruh Pertemuan 17
Korelasi Parsial dan Pengontrolan Parsial Pertemuan 14
Presentation transcript:

Lecture 14 Multiple Regression Model BA 201 Lecture 14 Multiple Regression Model

Topics Developing the Multiple Linear Regression Inferences on Population Regression Coefficients Pitfalls in Multiple Regression and Ethical Issues

The Multiple Regression Model Relationship between 1 dependent & 2 or more independent variables is a linear function Population Y-intercept Population slopes Random Error Residual Dependent (Response) variable for sample Independent (Explanatory) variables for sample model

Simple Linear Regression Model Revisited Y X Observed Value

Population Multiple Regression Model Bivariate model (2 Independent Variables: X1 and X2)

Sample Multiple Regression Model Bivariate model Sample Regression Plane

Multiple Linear Regression Equation Too complicated by hand! Ouch!

Multiple Regression Model: Example (0F) Develop a model for estimating heating oil used for a single family home in the month of January based on average temperature and amount of insulation in inches.

Multiple Regression in PHStat PHStat | Regression | Multiple Regression … EXCEL spreadsheet for the heating oil example.

Sample Multiple Regression Equation: Example Excel Output For each degree increase in temperature, the estimated average amount of heating oil used is decreased by 5.437 gallons, holding insulation constant. For each increase in one inch of insulation, the estimated average use of heating oil is decreased by 20.012 gallons, holding temperature constant.

Interpretation of Estimated Coefficients Slope (bi) Estimated that the average value of Y changes by bi for each 1 unit increase in Xi holding all other variables constant (ceterus paribus) Example: If b1 = -2, then fuel oil usage (Y) is expected to decrease by an estimated 2 gallons for each 1 degree increase in temperature (X1) given the inches of insulation (X2) Y-Intercept (b0) The estimated average value of Y when all Xi = 0

Simple and Multiple Regression Compared Coefficients in a simple regression pick up the impact of that variable plus the impacts of other variables that are correlated with it and the dependent variable but are excluded from the model. Coefficients in a multiple regression net out the impacts of other variables in the equation. Hence they are called the net regression coefficients. They still pick up the effects of other variables that excluded form the model but are correlated with the included variables and the dependent variable.

Simple and Multiple Regression Compared:Example Two simple regressions: Multiple Regression:

Simple and Multiple Regression Compared: Excel Output

Simple and Multiple Regression Compared: Excel Output =

Venn Diagrams and Explanatory Power of a Simple Regression Variations in Oil explained by the error term Variations in Temp not used in explaining variation in Oil Oil Variations in Oil explained by Temp or variations in Temp used in explaining variation in Oil Temp

Venn Diagrams and Explanatory Power of a Simple Regression (continued) Oil Temp

Venn Diagrams and Explanatory Power of a Multiple Regression Variation NOT explained by Temp nor Insulation Overlapping variation in both Temp and Insulation are used in explaining the variation in Oil but NOT in the estimation of nor Oil Temp Insulation

Coefficient of Multiple Determination Proportion of Total Variation in Y Explained by All X Variables Taken Together Never Decreases When a New X Variable is Added to Model Disadvantage When Comparing Models

Venn Diagrams and Explanatory Power of Regression Oil Temp Insulation

Adjusted Coefficient of Multiple Determination Proportion of Variation in Y Explained by All X Variables adjusted for the Number of X Variables Used and the Sample Size Penalize Excessive Use of Independent Variables Smaller than Useful in Comparing among Models Could Decrease If an Insignificant New X Variable Is Added to the Model

Coefficient of Multiple Determination Excel Output Adjusted r2 reflects the number of explanatory variables and sample size is smaller than r2

Interpretation of Coefficient of Multiple Determination 96.56% of the total variation in heating oil can be explained by different temperature and the variation in the amount of insulation 95.99% of the total fluctuation in heating oil can be explained by different temperature and the variation in the amount of insulation after adjusting for the number of explanatory variables and sample size

Example: Adjusted r2 Can Decrease Adjusted r 2 decreases when k increases from 2 to 3

Using The Model to Make Predictions Predict the amount of heating oil used for a home if the average temperature is 300 and the insulation is 6 inches. The predicted heating oil used is 278.97 gallons

Predictions in PHStat PHStat | Regression | Multiple Regression … Check the “Confidence and Prediction Interval Estimate” box EXCEL spreadsheet for the heating oil example.

Another Example The Excel spreadsheet that contains the multiple regression result of regressing Mid-term scores on quiz scores and attendance score

Residual Plots Residuals Vs Residuals Vs Time May need to transform Y variable May need to transform variable May need to transform variable Residuals Vs Time May have autocorrelation

Residual Plots: Example Maybe some non-linear relationship No Discernable Pattern

Testing for Overall Significance Shows if there is a Linear Relationship between all of the X Variables Together and Y Shows if Y Depends Linearly on all of the X Variables Together as a Group Use F Test Statistic Hypotheses: H0: 1 = 2 = … = k = 0 (No linear relationship) H1: At least one i  0 ( At least one independent variable affects Y ) The Null Hypothesis is a Very Strong Statement Almost Always Reject the Null Hypothesis

Testing for Overall Significance (continued) Test Statistic: where F has k numerator and (n-k-1) denominator degrees of freedom

Test for Overall Significance Excel Output: Heating Oil Example p value k = 2, the number of explanatory variables n - 1

Test for Overall Significance Example Solution H0: 1 = 2 = … = k = 0 H1: At least one i  0  = .05 df = 2 and 12 Critical Value(s): Test Statistic: Decision: Conclusion:  F 168.47 (Excel Output) Reject at  = 0.05  = 0.05 There is evidence that at least one independent variable affects Y F 3.89

Test for Significance: Individual Variables Shows if There is a Linear Relationship Between the Variable Xi and Y while Holding the Effects of other X’s Fixed Show if Y Depends Linearly on a Single Xi Individually while Holding the Effects of other X’s Fixed Use t Test Statistic Hypotheses: H0: i = 0 (No linear relationship) H1: i  0 (Linear relationship between Xi and Y)

t Test Statistic Excel Output: Example t Test Statistic for X1 (Temperature) t Test Statistic for X2 (Insulation)

t Test : Example Solution Does temperature have a significant effect on monthly consumption of heating oil? Test at  = 0.05. H0: 1 = 0 H1: 1  0 df = 12 Critical Value(s): Test Statistic: Decision: Conclusion: t Test Statistic = -16.1699 Reject H0 at  = 0.05 Reject H Reject H .025 .025 There is evidence of a significant effect of temperature on oil consumption. b1 t -2.1788 2.1788

Confidence Interval Estimate for the Slope Provide the 95% confidence interval for the population slope 1 (the effect of temperature on oil consumption). -6.169  1  -4.704 The estimated average consumption of oil is reduced by between 4.7 gallons to 6.17 gallons per each increase of 10 F.

Additional Pitfalls and Ethical Issues Fail to Understand that Interpretation of the Estimated Regression Coefficients are Performed Holding All Other Independent Variables Constant Fail to Evaluate Residual Plots for Each Independent Variable

Summary Developed the Multiple Regression Model Addressed Testing the Significance of the Multiple Regression Model Discussed Inferences on Population Regression Coefficients Addressed Pitfalls in Multiple Regression and Ethical Issues