Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.

Slides:



Advertisements
Similar presentations
Multiple Regression and Model Building
Advertisements

Managerial Economics in a Global Economy
BA 275 Quantitative Business Methods
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Chapter 13 Multiple Regression
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Chapter 13 Additional Topics in Regression Analysis
Multiple Linear Regression Model
Statistics for Managers Using Microsoft® Excel 5th Edition
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 12 Multiple Regression
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Lecture 24 Multiple Regression (Sections )
Chapter 11 Multiple Regression.
Simple Linear Regression Analysis
Topic 3: Regression.
Multiple Regression and Correlation Analysis
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Business Statistics - QBM117 Statistical inference for regression.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Simple Linear Regression Analysis
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Correlation & Regression
Objectives of Multiple Regression
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Inference for regression - Simple linear regression
Regression Method.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
© 2004 Prentice-Hall, Inc.Chap 15-1 Basic Business Statistics (9 th Edition) Chapter 15 Multiple Regression Model Building.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Time Series Forecasting Chapter 16.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Time Series Forecasting Chapter 13.
Chap 14-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 14 Additional Topics in Regression Analysis Statistics for Business.
Examining Relationships in Quantitative Research
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Autocorrelation in Time Series KNNL – Chapter 12.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.
Chapter 13 Multiple Regression
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Examining Relationships in Quantitative Research
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 17 Simple Linear Regression and Correlation.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Chapter 15 Multiple Regression Model Building
Multiple Regression and Model Building
I271b Quantitative Methods
Chapter 13 Additional Topics in Regression Analysis
Presentation transcript:

Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15

15-2 Model Building and Model Diagnostics 15.1The Quadratic Regression Model 15.2Interaction 15.3Logistic Regression 15.4Model Building, and the Effects of Multicollinearity 15.5Improving the Regression Model I: Diagnosing and Using Information about Outlying and Influential Observations

15-3 Model Building and Model Diagnostics 15.6Improving the Regression Model II: Transforming the Dependent and Independent Variables 15.7Improving the Regression Model III: The Durbin-Watson Test and Dealing with Autocorrelation

The Quadratic Regression Model One useful form of linear regression is the quadratic regression model Assume we have n observations of x and y The quadratic regression model relating y to x is y = β 0 + β 1 x + β 2 x 2 +  1. β 0 + β 1 x + β 2 x 2 is the mean value of the dependent variable y when the value of the independent variable is x 2. β 0, β 1 and β 2 are unknown regression parameters relating the mean value of y to x 3.  is an error term that describes the effects on y of all factors other than x and x 2 LO 1: Model quadratic relationships by using the quadratic regression model.

15-5 More Variables We have only looked at the simple case where we have y and x That gave us the quadratic regression model y = β 0 + β 1 x + β 2 x 2 +  However, we are not limited to just two terms The following would also be a valid quadratic regression model y = β 0 + β 1 x 1 + β 2 x β 3 x 2 + β 4 x 3 +  LO1

Interaction Multiple regression models often contain interaction variables These are variables that are formed by multiplying two independent variables together For example, x 1 ·x 2 In this case, the x 1 ·x 2 variable would appear in the model along with both x 1 and x 2 We use interaction variables when the relationship between the mean value of y and one of the independent variables is dependent on the value of another independent variable LO 2: Detect and model interaction between two independent variables.

Logistic Regression Logistic regression and least squares regression are very similar Both produce prediction equations The y variable is what makes logistic regression different With least squares regression, the y variable is a quantitative variable With logistic regression, it is usually a dummy 0/1 variable With large data sets, y variable may be the probability of a set of observations having a dummy variable value of one LO 3: Use a logistic model to estimate probabilities and odds ratios.

15-8 General Logistic Regression Model p(x 1,x 2,…x k ) is the probability that the event under consideration will occur when the values of the independent variable are x 1,x 2,…x k The odds of the event occurring are p(x 1,x 2,…x k )/(1-p(x 1,x 2,…x k )) The probability that the event will occur divided by the probability it will not occur LO3

Model Building and the Effects of Multicollinearity Multicollinearity is the condition where the independent variables are dependent, related or correlated with each other Effects Hinders ability to use t statistics and p-values to assess the relative importance of predictors Does not hinder ability to predict the dependent (or response) variable Detection Scatter plot matrix Correlation matrix Variance inflation factors (VIF) LO 4: Describe and measure multicollinearity.

15-10 Comparing Regression Models on R 2, s, Adjusted R 2, and Prediction Interval Multicollinearity causes problems evaluating the p- values of the model Therefore, we need to evaluate more than the additional importance of each independent variable We also need to evaluate how the variables work together One way to do this is to determine if the overall model gives a high R 2 and adjusted R 2, a small s, and short prediction intervals LO 5: Use various model comparison criteria to identify one or more appropriate regression models.

15-11 C Statistic Another quantity for comparing regression models is called the C statistic Also known as C P statistic First, calculate mean square error for the model containing all p potential independent variables Denoted s 2 p Next, calculate SSE for a reduced model with k independent variables Calculate C as LO5

Diagnosing and Using Information About Outlying and Influential Observations Observation 1: Outlying with respect to y value Observation 2: Outlying with respect to x value Observation 3: Outlying with respect to x value and y value not consistent with regression relationship (Influential) LO 6: Use diagnostic measures to detect outlying and influential observations.

Transforming the Dependent and Independent Variables A possible remedy for violations of the constant variance, correct functional form and normality assumptions is to transform the dependent variable Possible transformations include Square root Quartic root Logarithmic The appropriate transformation will depend on the specific problem with the original data set LO 7: Use data transformations to help remedy violations of the regression assumptions.

The Durbin-Watson Test and Dealing with Autocorrelation One type of autocorrelation is called first- order autocorrelation This is when the error term in time period t (  t ) is related to the error term in time period t-1 (  t-1 ) The Durbin-Watson statistic checks for first- order autocorrelation LO 8: Use the Durbin– Watson test to detect autocorrelated error terms.