Multiple Linear Regression Analysis

Slides:



Advertisements
Similar presentations
Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Advertisements

Chapter 11: Forecasting Models
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Chapter Thirteen McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. Linear Regression and Correlation.
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Chapter 13 Multiple Regression
Chapter 12 Simple Regression
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Gordon Stringer, UCCS1 Regression Analysis Gordon Stringer.
Lecture 24 Multiple Regression (Sections )
Lecture 23 Multiple Regression (Sections )
Linear Regression Example Data
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
Spreadsheet Modeling & Decision Analysis A Practical Introduction to Management Science 5 th edition Cliff T. Ragsdale.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Linear Trend Lines Y t = b 0 + b 1 X t Where Y t is the dependent variable being forecasted X t is the independent variable being used to explain Y. In.
© 2004 Prentice-Hall, Inc.Chap 15-1 Basic Business Statistics (9 th Edition) Chapter 15 Multiple Regression Model Building.
STA302/ week 111 Multicollinearity Multicollinearity occurs when explanatory variables are highly correlated, in which case, it is difficult or impossible.
Linear Trend Lines = b 0 + b 1 X t Where is the dependent variable being forecasted X t is the independent variable being used to explain Y. In Linear.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Introduction to Linear Regression
Applied Quantitative Analysis and Practices LECTURE#22 By Dr. Osman Sadiq Paracha.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Multiple Linear Regression. Purpose To analyze the relationship between a single dependent variable and several independent variables.
Regression. Population Covariance and Correlation.
Examining Relationships in Quantitative Research
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
Linear Trend Lines = b 0 + b 1 X t Where is the dependent variable being forecasted X t is the independent variable being used to explain Y. In Linear.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Correlation and Regression. Section 9.1  Correlation is a relationship between 2 variables.  Data is often represented by ordered pairs (x, y) and.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
 For each given scatter diagram, determine whether there is a relationship between the variables. STAND UP!!!
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
© 2001 Prentice-Hall, Inc.Chap 13-1 BA 201 Lecture 18 Introduction to Simple Linear Regression (Data)Data.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
BPA CSUB Prof. Yong Choi. Midwest Distribution 1. Create scatter plot Find out whether there is a linear relationship pattern or not Easy and simple using.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Chapter 15 Multiple Regression Model Building
Chapter 14 Introduction to Multiple Regression
Chapter 15 Multiple Regression and Model Building
Linear Regression.
Analysis of Variance in Matrix form
Ch12.1 Simple Linear Regression
Multiple Regression Analysis and Model Building
Regression Analysis.
Chapter 12: Regression Diagnostics
Chapter 11 Simple Regression
The Least-Squares Regression Line
Simple Linear Regression
BA 275 Quantitative Business Methods
Multiple Regression A curvilinear relationship between one variable and the values of two or more other independent variables. Y = intercept + (slope1.
Correlation and Regression
BA 275 Quantitative Business Methods
Introduction to Probability and Statistics Thirteenth Edition
Multiple Linear Regression
Essentials of Statistics for Business and Economics (8e)
Multicollinearity Multicollinearity occurs when explanatory variables are highly correlated, in which case, it is difficult or impossible to measure their.
Presentation transcript:

Multiple Linear Regression Analysis BSAD 30 Dave Novak Fall 2018 Source: Ragsdale, 2018 Spreadsheet Modeling and Decision Analysis 8th edition © 2017 Cengage Learning

Overview Last class we considered the relationship between one independent variable and one dependent variable Referred to as “simple” linear regression Today, we consider the relationship between more than one independent variable (X’s) and a single dependent variable (Y) Referred to as “multiple” linear regression Example

Multiple regression When more than one independent variable can be used to explain variance in Y 𝑌 i = b0 + b1X1i + b2X2i + …. + bnXni Where it is assumed all bi’s are independent

Multiple regression Assume that you want to develop a model to predict the market value (or price) of houses in your town / city We have access to the following data Source: Ragsdale, 2018 Spreadsheet Modeling and Decision Analysis 8th

Multiple regression We want to predict housing price (in thousands of $) using some combination of data we have on square footage, the garage size, and number of bedrooms (3 possible X values) Even with three independent variables, we can create many different regression models A model with the most X’s is often not the “best” model

Multiple regression Start by looking at scatter plots and correlation Having access to many different independent variables does not necessarily mean that they all should be part of a regression model Rule of thumb for linear regression: KEEP IT AS SIMPLE AS POSSIBLE Start by looking at scatter plots and correlation

Multiple regression Source: Ragsdale, 2018 Spreadsheet Modeling and Decision Analysis 8th

Multiple regression Look at correlation coefficient (r) for each X / Y combination

Multiple regression Given results from scatter and correlation, start with three separate simple linear regression models and compare results 𝑌 i = b0 + b1X1i (X1 = square footage) 𝑌 i = b0 + b2X2i (X2 = garage size) 𝑌 i = b0 + b3X3i (X3 = # bedrooms)

X1 – square footage

X2 – garage size

X3 – # bedrooms

Summary comparison X1 has highest R2 ,Adj R2, and lowest Std. error Reasonable to start with X1 and build off that variable

Combinations of two variables 𝑌 i = b0 + b1X1i + b2X2i (square footage + garage) 𝑌 i = b0 + b1X1i + b3X3i (square footage + # bedrooms)

X1 + X2 (Sq ft + Garage)

X1 + X3 (Sq ft + Bedrooms)

Change in b0 and b1 𝑌 i = b0 + b1X1i (b0 = 109.5, b1 = 56.394) Notice values of b0 and b1 have changed from the model where we just had X1 𝑌 i = b0 + b1X1i (b0 = 109.5, b1 = 56.394) 𝑌 i = b0 + b1X1i + b2X2i (b0 = 127.68, b1 = 38.576) 𝑌 i = b0 + b1X1i + b3X3i (b0 = 108.31, b1 = 44.313)

Summary comparison Where X1 = sq. ft., X2 = garage, X3 = # bedrooms

Multicollinearity Not surprising adding X3 (# of bedrooms) to regression model with X1 (total square footage) did not improve model Both variables represent similar things – a measure of house size (sq ft) These variables appear to be highly correlated

Combination of all three 𝑌 i = b0 + b1X1i + b2X2i + b3X3i (square footage + garage + # bedrooms) As it is not a time consuming undertaking to test all three variables, we also want to examine the FULL (all independent variables) model

X1 + X2 + X3

Summary comparison

Best fit How do we choose? The two variable model with X1 and X2 has highest adj R2 and lowest std. error of all models Making the model more complex by adding all three variables doesn’t add anything to predictive power We also know that X3 is highly correlated with X1 (X1 and X3 not necessarily independent)

Making predictions Use our selected model to estimate average selling price of house with 2,100 sq ft and a 2-car garage Y 𝑖 =127.64+38.576 X 1𝑖 +12.875 X 2𝑖

Making predictions 95% prediction interval for the actual selling price:

Problem In-class example problem