Comparing Time Series, Neural Nets and Probability Models for New Product Trial Forecasting Eugene Brusilovskiy Ka Lok Lee These slides are based on the.

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Chapter 10 Learning Objectives
QUALITY CONTROL TOOLS FOR PROCESS IMPROVEMENT
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Copyright © 2002 Pearson Education, Inc. Slide 1.
Cost volume profit analysis
Introductory Mathematics & Statistics for Business
Part 3 Probabilistic Decision Models
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Decision Support and Artificial Intelligence Jack G. Zheng July 11 th 2005 MIS Chapter 4.
Sales Forecasting using Dynamic Bayesian Networks Steve Djajasaputra SNN Nijmegen The Netherlands.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
SMA 6304 / MIT / MIT Manufacturing Systems Lecture 11: Forecasting Lecturer: Prof. Duane S. Boning Copyright 2003 © Duane S. Boning. 1.
BeefBasis.com A Web Based Cattle Basis Analysis & Forecasting Tool Dr. Kevin Dhuyvetter and Dr. James Mintert Department of Agricultural Economics Kansas.
Summary of Convergence Tests for Series and Solved Problems
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
0 - 0.
ALGEBRAIC EXPRESSIONS
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULTIPLYING MONOMIALS TIMES POLYNOMIALS (DISTRIBUTIVE PROPERTY)
ADDING INTEGERS 1. POS. + POS. = POS. 2. NEG. + NEG. = NEG. 3. POS. + NEG. OR NEG. + POS. SUBTRACT TAKE SIGN OF BIGGER ABSOLUTE VALUE.
MULTIPLICATION EQUATIONS 1. SOLVE FOR X 3. WHAT EVER YOU DO TO ONE SIDE YOU HAVE TO DO TO THE OTHER 2. DIVIDE BY THE NUMBER IN FRONT OF THE VARIABLE.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Year 6 mental test 10 second questions Numbers and number system Numbers and the number system, fractions, decimals, proportion & probability.
Around the World AdditionSubtraction MultiplicationDivision AdditionSubtraction MultiplicationDivision.
ZMQS ZMQS
Chapter 7 Sampling and Sampling Distributions
BT Wholesale October Creating your own telephone network WHOLESALE CALLS LINE ASSOCIATED.
Chapter 9. Time Series From Business Intelligence Book by Vercellis Lei Chen, for COMP
1 The tiling algorithm Learning in feedforward layered networks: the tiling algorithm writed by Marc M é zard and Jean-Pierre Nadal.
Module 4. Forecasting MGS3100.
LOGO Regression Analysis Lecturer: Dr. Bo Yuan
ABC Technology Project
Chi-Square and Analysis of Variance (ANOVA)
Chapter 15 Integrated Services Digital Network ISDN Services History Subscriber Access Layers BISDN WCB/McGraw-Hill The McGraw-Hill Companies, Inc., 1998.
© Charles van Marrewijk, An Introduction to Geographical Economics Brakman, Garretsen, and Van Marrewijk.
© Charles van Marrewijk, An Introduction to Geographical Economics Brakman, Garretsen, and Van Marrewijk.
© Charles van Marrewijk, An Introduction to Geographical Economics Brakman, Garretsen, and Van Marrewijk.
Chapter 5 Microsoft Excel 2007 Window
Squares and Square Root WALK. Solve each problem REVIEW:
Insert Date HereSlide 1 Using Derivative and Integral Information in the Statistical Analysis of Computer Models Gemma Stephenson March 2007.
Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.
This, that, these, those Number your paper from 1-10.
9: Examining Relationships in Quantitative Research ESSENTIALS OF MARKETING RESEARCH Hair/Wolfinbarger/Ortinau/Bush.
Addition 1’s to 20.
25 seconds left…...
Princess Nora University Artificial Intelligence Artificial Neural Network (ANN) 1.
Test B, 100 Subtraction Facts
Determining How Costs Behave
Week 1.
We will resume in: 25 Minutes.
© The McGraw-Hill Companies, Inc., Chapter 12 Chi-Square.
Chapter 15: Quantitatve Methods in Health Care Management Yasar A. Ozcan 1 Chapter 15. Simulation.
1 Unit 1 Kinematics Chapter 1 Day
Spreadsheet Modeling & Decision Analysis
1 PART 1 ILLUSTRATION OF DOCUMENTS  Brief introduction to the documents contained in the envelope  Detailed clarification of the documents content.
Simple Linear Regression Analysis
Chapter 13 Preparing The Systems Proposal Systems Analysis and Design Kendall and Kendall Fifth Edition.
McGraw-Hill©The McGraw-Hill Companies, Inc., 2001 Chapter 16 Integrated Services Digital Network (ISDN)
1 McGill University Department of Civil Engineering and Applied Mechanics Montreal, Quebec, Canada.
Consumer Behavior Prediction using Parametric and Nonparametric Methods Elena Eneva CALD Masters Presentation 19 August 2002 Advisors: Alan Montgomery,
1 Introduction to Neural Networks And Their Applications.
Presentation transcript:

Comparing Time Series, Neural Nets and Probability Models for New Product Trial Forecasting Eugene Brusilovskiy Ka Lok Lee These slides are based on the authors presentation at the 4 th Annual Hawaii International Conference on Statistics, Mathematics, and Related Fields

2 Problem Introduction Goal: To predict future sales using sales information from an introductory period Product: A new (unnamed) soft beverage that was introduced to a test market Data: We have 52 weeks of sales data, which we split into training (first 39 weeks) and validation (last 13 weeks) datasets –We build the models using the training dataset and then examine how well the models predict sales in the last 13 weeks The methods employed here apply to predicting the sales of any newly introduced consumer good

3 Prediction Methods Used Time Series –Most common technique, available in almost every statistics software Neural Nets –Extensive data-mining tool (requires expensive software) Probability Modeling –Not always available in standard statistical packages, may be coded in Excel

4 Training Data – Cumulative Sales for the First 39 Weeks T = 39

5 Time Series A time-series (TS) model accounts for patterns in the past movements of a variable and uses that information to predict its future movements. In a sense a time-series model is just a sophisticated method of extrapolation (Pindyck and Rubinfeld, 1998).

6 Time Series Autoregressive Moving Average Model: ARMA(1,1) – generally recognized to be a good approximation for many observed time series or

7 Neural Networks A Neural Network (NN) is an information processing paradigm inspired by the way the brain processes information (Stergiou and Siganos, 1996). MLP (The Multi-Layer Perceptron) is used here

8 Neural Networks A Neural Network consists of neuron layers of 3 types: –Input layer –Hidden layer –Output layer We use two models with different MLP architectures: a model with one hidden layer and a model with a skip layer

9 Neural Networks (contd) AND Given the rule on the left, we deduce the pattern on the right:

10 Neural Networks Structure of Neural Net Models:

11 Neural Networks Neural Networks are especially useful for problems where –Prediction is more important than explanation –There are lots of training data –No mathematical formula that relates inputs to outputs is known Source: SAS Enterprise Miner Reference Help. Neural Network Node: Reference

12 Probability Modeling Probability models: –Are representations of individual buying behavior –Provide structural insight into the ways in which consumers make purchase decisions (Massy el at.,1970) Specific assumptions of purchase process and latent propensity (Bayesian flavor) Explicit consideration of unobserved heterogeneity

13 Probability Modeling Individual purchase time or time-to-trial is modeled by Diffusion Model. Exponential-Gamma (EG), also known as the Pareto distribution (Hardie et al., 2003) Time to trial ~ Exponential (λ) λ~ Gamma (r, α)

14 Probability Modeling After solving the integral, the cumulative probability function becomes: F(t) = LL = Estimation uses Excel Solver

15

16 Results Exp. Gamma Neural Nets Time Series Mean Absolute Percentage Error (MAPE) 2.7%9.0%5.5% Where T is the total number of time periods (weeks). Here, t=1 is the first validation week (week 40) All three models do a relatively good job predicting future sales, but Exponential Gamma is the best

17 New Product Sales – Results T=39

18 Time Series - Results Captures jumps in the training data Implies no additional sales (the product is dead), extreme case of forecast

19 Neural Nets - Results Can sometimes be over-responsive to jumps in training data

20 Probability Model - Results Overall, the best method Furthermore, allows the analyst to make statements about the consumers in the market

21 Next Steps Include covariates Different training periods Perform comparative analysis for other areas of forecasting –Customer Lifetime Value

22 References Hardie B. G.S., Zeithammer R., and Fader P. (2003), Forecasting New Product Trial in a Controlled Test Market Environment, Journal of Forecasting, 22: Massy, W.F., Montgomery, D.B. and Morrison, D.G. (1970), Stochastic Models of Buying Behavior, The M.I.T. Press, 464 pp. Pindyck, R.S. and Rubinfeld D.L. (1998), Econometric Models and Economic Forecasts, Irwin/McGraw-Hill. SAS Enterprise Miner Reference Help. Article: Neural Network Node: Reference Stergiou, C., & Siganos, D. (1996), Introduction to Neural Networks. Available online at rt.html rt.html