Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.

Slides:



Advertisements
Similar presentations
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Advertisements

6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Guide to Using Minitab For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 6th Ed. Chapter 14: Multiple Regression.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Statistics for Managers Using Microsoft® Excel 5th Edition
BA 555 Practical Business Analysis
Statistics for Managers Using Microsoft® Excel 5th Edition
1 BA 275 Quantitative Business Methods Residual Analysis Multiple Linear Regression Adjusted R-squared Prediction Dummy Variables Agenda.
Multiple Regression Involves the use of more than one independent variable. Multivariate analysis involves more than one dependent variable - OMS 633 Adding.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
January 6, morning session 1 Statistics Micro Mini Multiple Regression January 5-9, 2008 Beth Ayers.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Class 6: Tuesday, Sep. 28 Section 2.4. Checking the assumptions of the simple linear regression model: –Residual plots –Normal quantile plots Outliers.
Lecture 6: Multiple Regression
Gordon Stringer, UCCS1 Regression Analysis Gordon Stringer.
Lecture 24: Thurs., April 8th
Predictive Analysis in Marketing Research
Multiple Linear Regression
1 4. Multiple Regression I ECON 251 Research Methods.
Regression Diagnostics Checking Assumptions and Data.
Chapter 15: Model Building
Correlation and Regression Analysis
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Simple Linear Regression Analysis
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Correlation & Regression
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Variable selection and model building Part II. Statement of situation A common situation is that there is a large set of candidate predictor variables.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
LOGO Chapter 4 Multiple Regression Analysis Devilia Sari - Natalia.
Soc 3306a Lecture 9: Multivariate 2 More on Multiple Regression: Building a Model and Interpreting Coefficients.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Anaregweek11 Regression diagnostics. Regression Diagnostics Partial regression plots Studentized deleted residuals Hat matrix diagonals Dffits, Cook’s.
Review of Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Dr. C. Ertuna1 Issues Regarding Regression Models (Lesson - 06/C)
Slide 1 DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Lecture 2: Review of Multiple Regression (Ch. 4-5)
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Diagnostics – Part II Using statistical tests to check to see if the assumptions we made about the model are realistic.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
1 MGT 511: Hypothesis Testing and Regression Lecture 8: Framework for Multiple Regression Analysis K. Sudhir Yale SOM-EMBA.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Applied Quantitative Analysis and Practices LECTURE#30 By Dr. Osman Sadiq Paracha.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Regression Analysis1. 2 INTRODUCTION TO EMPIRICAL MODELS LEAST SQUARES ESTIMATION OF THE PARAMETERS PROPERTIES OF THE LEAST SQUARES ESTIMATORS AND ESTIMATION.
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 14 l Time Series: Understanding Changes over Time.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Multiple Regression Numeric Response variable (y) p Numeric predictor variables (p < n) Model: Y =  0 +  1 x 1 +  +  p x p +  Partial Regression.
Regression Analysis Part A Basic Linear Regression Analysis and Estimation of Parameters Read Chapters 3, 4 and 5 of Forecasting and Time Series, An Applied.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Regression Analysis AGEC 784.
Regression Diagnostics
Multiple Regression and Model Building
Chapter 12: Regression Diagnostics
Multivariate Analysis Lec 4
Multiple Regression Models
Three Measures of Influence
Regression Forecasting and Model Building
Chapter 13 Additional Topics in Regression Analysis
Presentation transcript:

Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues from data collected from last 7 years. Goal: Find an equation (model) that explains variation in Y with a smaller set of predictors that are all related to Y but not too related to each other (multicollinearity). Predict next 4 quarters of revenues. Your dependent variable will be revenues or seasonally adjusted revenues depending upon whether your data has pronounced seasonality.

Forecasting Revenue: An Example of Regression Model Building Hold out sample for validation process later. Do not use last two quarters of data (2 observations) until after you have done the validation process. Starting Point: Examine multicollinearity and poor predictors by checking correlations with a correlation matrix and by generating VIF values. Include revenue (or SA revenue). Eliminate variables with very low correlation (.9) independent variables. This allows you some choice in which to choose variables that have better forecasts available or that you believe should be most related to revenues in theory. You can swap out highly correlated variables later if you run into validation issues.

Variance Inflation Factors Variance Inflation Factor (VIF) – Measure of how highly correlated each independent variable is with the other predictors in the model. Used to identify Multicollinearity. Values larger than 10 for a predictor imply large inflation of standard errors of regression coefficients due to this variable being in model. Inflated standard errors lead to insignificant t- statistics for regression coefficients and wider confidence intervals

Forecasting Revenue: An Example of Regression Model Building Run a multiple regression to look at VIF values (and D-W values) – Delete one of the variables from those with VIF > 10. Choose the one that has the highest VIF or another variable with high VIF that may not have forecasts available. There is some flexibility in this step. Repeat until all VIF are smaller than 10. This will result in a reduced set of variables to use in finding an equation using All Possible Regressions.

Forecasting Revenue: An Example of Regression Model Building Best Model Process using all the data (no holdouts). Use MegaStat All Possible Regressions to find an equation that has all significant (p-value <,.05) variables and has a small standard error (large adjusted R-squared). If you have the C p Statistic it summarizes each possible model, where “best” model can be selected based on the statistic. Ideally you s elect the model with the fewest predictors that has C p  p and has p-values <.05 for all variables.

No Good Model? Low R2? Outliers? Validation problems (especially low D-W)? If you had “modes” in your data (time periods where the revenue trend was clearly different) you can try adding dummy variables to handle special situations rather than delete the data Ex: SWA pre-post merger dummy –0 = pre-merger, 1=post-merger

Validating Your Model Validation with holdout sample. Forecast last two known quarters with 95% prediction intervals. Do the actual values fall within the lower and upper prediction limits implying that the predictions seem reasonable? If so use all quarters and redo the equation using the same variables and forecast next 4 quarters. Check the assumptions for the validation model. If not, try using an alternative model from the all possible regressions options or see if there is a reason that the last two known quarters are different in some way. Look at the quarterly reports and see if they might suggest use of a dummy variable. Redo the validation process.

Regression Diagnostics Model Assumptions: Residual plots or other diagnostics can be used to check the assumptions -- Plot of Residuals versus each variable should be random cloud U-shaped (or rainbow)  Nonlinear relationship -- Plot of Residuals versus predicted should be random cloud Wedge shaped  Non-constant (increasing) variability -- Residuals should be mound-shaped (normal). Use skewness/kurtosis or a normal probability plot to check. -- Plot of Residuals versus Time order (Time series data) should be random cloud. If D-W < 1.3, residuals are not independent. Cook’s D is a check for influential observations that may have large impacts on the equation. Check data for accuracy for high Cook’s D.

Detecting Influential Observations Studentized Residuals – Residuals divided by their estimated standard errors. Observations in dark blue are considered outliers from the equation. Leverage Values – Measure of how far an observation is from the others in terms of the levels of the independent variables (not the dependent variable). Observations in dark blue are considered to be outliers in the X values. Cook’s D – Measure of aggregate impact of each observation on the group of regression coefficients, as well as the group of fitted values. Values larger than 1 are considered highly influential. Influential observations may suggest quarters to research to see if something special happened that may suggest a dummy variable.

The Final Forecasts When you have validated (holdouts, assumption) the best model possible discuss the forecasts. Look at their prediction intervals, re-seasonalize forecasts that used desesonalized data. Do the forecasts make sense? You may have actual Q3 revenues to compare your forecast with. Superimpose your forecasts on a time series plot of revenues and ensure that the forecasts seem reasonable. Document all your data and forecast sources. Write a report that documents all aspects of the forecasting process.