Regression Model Building - Diagnostics

Slides:



Advertisements
Similar presentations
3.3 Hypothesis Testing in Multiple Linear Regression
Advertisements

12-1 Multiple Linear Regression Models Introduction Many applications of regression analysis involve situations in which there are more than.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
12 Multiple Linear Regression CHAPTER OUTLINE
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Linear regression models
Psychology 202b Advanced Psychological Statistics, II February 1, 2011.
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Predictive Analysis in Marketing Research
Multiple Linear Regression
Quantitative Business Analysis for Decision Making Simple Linear Regression.
CHAPTER 3 Describing Relationships
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
Multiple Linear Regression Response Variable: Y Explanatory Variables: X 1,...,X k Model (Extension of Simple Regression): E(Y) =  +  1 X 1 +  +  k.
Objectives of Multiple Regression
Descriptive Methods in Regression and Correlation
Chapter 11 Simple Regression
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
2 Multicollinearity Presented by: Shahram Arsang Isfahan University of Medical Sciences April 2014.
Multiple Regression I KNNL – Chapter 6. Models with Multiple Predictors Most Practical Problems have more than one potential predictor variable Goal is.
Detecting and reducing multicollinearity. Detecting multicollinearity.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Anaregweek11 Regression diagnostics. Regression Diagnostics Partial regression plots Studentized deleted residuals Hat matrix diagonals Dffits, Cook’s.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
Simple Linear Regression (SLR)
Simple Linear Regression (OLS). Types of Correlation Positive correlationNegative correlationNo correlation.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Lab 4 Multiple Linear Regression. Meaning  An extension of simple linear regression  It models the mean of a response variable as a linear function.
DATA ANALYSIS AND MODEL BUILDING LECTURE 9 Prof. Roland Craigwell Department of Economics University of the West Indies Cave Hill Campus and Rebecca Gookool.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Chapter 20 Linear and Multiple Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Regression Diagnostics
Chapter 12: Regression Diagnostics
Slides by JOHN LOUCKS St. Edward’s University.
Multiple Regression II
AP Stats: 3.3 Least-Squares Regression Line
Linear Regression.
Multiple Regression II
Chapter 3 Multiple Linear Regression
No notecard for this quiz!!
Multiple Regression Models
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Solution 9 1. a) From the matrix plot, 1) The assumption about linearity seems ok; 2).The assumption about measurement errors can not be checked at this.
Regression Model Building - Diagnostics
Regression Model Building - Diagnostics
Multiple Linear Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Regression Forecasting and Model Building
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Diagnostics and Remedial Measures
Linear Regression and Correlation
CHAPTER 3 Describing Relationships
Multicollinearity Multicollinearity occurs when explanatory variables are highly correlated, in which case, it is difficult or impossible to measure their.
Diagnostics and Remedial Measures
CHAPTER 3 Describing Relationships
Presentation transcript:

Regression Model Building - Diagnostics KNNL – Chapter 10

Model Adequacy for Predictors – Added Variable Plot Graphical way to determine partial relation between response and a given predictor, after controlling for other predictors – shows form of relation between new X and Y May not be helpful when other predictor(s) enter model with polynomial or interaction terms that are not controlled for Algorithm (assume plot for X3, given X1, X2): Fit regression of Y on X1,X2, obtain residuals = ei(Y|X1,X2) Fit regression of X3 on X1,X2, obtain residuals = ei(X3|X1,X2) Plot ei(Y|X1,X2) (vertical axis) versus ei(X3|X1,X2) (horizontal axis) Slope of the regression through the origin of ei(Y|X1,X2) on ei(X3|X1,X2) is the partial regression coefficient for X3

Outlying Y Observations – Studentized Residuals

Outlying Y Observations – Studentized Deleted Residuals

Outlying X-Cases – Hat Matrix Leverage Values Cases with X-levels close to the “center” of the sampled X-levels will have small leverages. Cases with “extreme” levels have large leverages, and have the potential to “pull” the regression equation toward their observed Y-values. Large leverage values are > 2p/n (2 times larger than the mean) New cases with leverage values larger than those in original dataset are extrapolations

Identifying Influential Cases I – Fitted Values

Influential Cases II – Regression Coefficients

Multicollinearity - Variance Inflation Factors Problems when predictor variables are correlated among themselves Regression Coefficients of predictors change, depending on what other predictors are included Extra Sums of Squares of predictors change, depending on what other predictors are included Standard Errors of Regression Coefficients increase when predictors are highly correlated Individual Regression Coefficients are not significant, although the overall model is Width of Confidence Intervals for Regression Coefficients increases when predictors are highly correlated Point Estimates of Regression Coefficients arewrong sign (+/-)

Variance Inflation Factor