John G. Zhang, Ph.D. Harper College

Slides:



Advertisements
Similar presentations
FINANCIAL TIME-SERIES ECONOMETRICS SUN LIJIAN Feb 23,2001.
Advertisements

Marian Scott SAGES, March 2009
Autocorrelation Functions and ARIMA Modelling
Time-series analysis.
DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Exam 1 review: Quizzes 1-6.
Model Building For ARIMA time series
Use of Business Tendency Survey Results for Forecasting Industry Production in Slovakia Use of Business Tendency Survey Results for Forecasting Industry.
Part II – TIME SERIES ANALYSIS C5 ARIMA (Box-Jenkins) Models
Slide 1 DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Lecture 7: Box-Jenkins Models – Part II (Ch. 9) Material.
Applied Business Forecasting and Planning
Time Series Building 1. Model Identification
R. Werner Solar Terrestrial Influences Institute - BAS Time Series Analysis by means of inference statistical methods.
Economics 310 Lecture 25 Univariate Time-Series Methods of Economic Forecasting Single-equation regression models Simultaneous-equation regression models.
How should these data be modelled?. Identification step: Look at the SAC and SPAC Looks like an AR(1)- process. (Spikes are clearly decreasing in SAC.
Non-Seasonal Box-Jenkins Models
13 Introduction toTime-Series Analysis. What is in this Chapter? This chapter discusses –the basic time-series models: autoregressive (AR) and moving.
BABS 502 Lecture 9 ARIMA Forecasting II March 23, 2009.
Modeling Cycles By ARMA
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
BABS 502 Lecture 8 ARIMA Forecasting II March 16 and 21, 2011.
Forecasting & Time Series Minggu 6. Learning Objectives Understand the three categories of forecasting techniques available. Become aware of the four.
ARIMA Forecasting Lecture 7 and 8 - March 14-16, 2011
Financial Econometrics
Non-Seasonal Box-Jenkins Models
Copyright 2013 John Wiley & Sons, Inc. Chapter 8 Supplement Forecasting.
Introduction to Time Series Regression and Forecasting
L7: ARIMA1 Lecture 7: ARIMA Model Process The following topics will be covered: Properties of Stock Returns AR model MA model ARMA Non-Stationary Process.
Time series analysis - lecture 1 Time series analysis Analysis of data for which the temporal order of the observations is important Two major objectives:
© 2003 Prentice-Hall, Inc.Chap 12-1 Business Statistics: A First Course (3 rd Edition) Chapter 12 Time-Series Forecasting.
BOX JENKINS METHODOLOGY
Box Jenkins or Arima Forecasting. H:\My Documents\classes\eco346\Lectures\chap ter 7\Autoregressive Models.docH:\My Documents\classes\eco346\Lectures\chap.
© 2002 Prentice-Hall, Inc.Chap 13-1 Statistics for Managers using Microsoft Excel 3 rd Edition Chapter 13 Time Series Analysis.
AR- MA- och ARMA-.
The Forecast Process Dr. Mohammed Alahmed
TIME SERIES by H.V.S. DE SILVA DEPARTMENT OF MATHEMATICS
The Box-Jenkins Methodology for ARIMA Models
Slide 1 DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Lecture 8: Estimation & Diagnostic Checking in Box-Jenkins.
#1 EC 485: Time Series Analysis in a Nut Shell. #2 Data Preparation: 1)Plot data and examine for stationarity 2)Examine ACF for stationarity 3)If not.
Tutorial for solution of Assignment week 39 “A. Time series without seasonal variation Use the data in the file 'dollar.txt'. “
Applied Forecasting ST3010 Michaelmas term 2015 Prof. Rozenn Dahyot Room 128 Lloyd Institute School of Computer Science and Statistics Trinity College.
It’s About Time Mark Otto U. S. Fish and Wildlife Service.
© 2000 Prentice-Hall, Inc. Chap The Least Squares Linear Trend Model Year Coded X Sales
Autocorrelation, Box Jenkins or ARIMA Forecasting.
Big Data at Home Depot KSU – Big Data Survey Course Steve Einbender Advanced Analytics Architect.
© 1999 Prentice-Hall, Inc. Chap Chapter Topics Component Factors of the Time-Series Model Smoothing of Data Series  Moving Averages  Exponential.
Time Series Basics Fin250f: Lecture 8.1 Spring 2010 Reading: Brooks, chapter
MULTIVARIATE TIME SERIES & FORECASTING 1. 2 : autocovariance function of the individual time series.
Auto Regressive, Integrated, Moving Average Box-Jenkins models A stationary times series can be modelled on basis of the serial correlations in it. A non-stationary.
Case 2 Review Brad Barker, Benjamin Milroy, Matt Sonnycalb, Kristofer Still, Chandhrika Venkataraman Time Series - February 2013.
1 Chapter 5 : Volatility Models Similar to linear regression analysis, many time series exhibit a non-constant variance (heteroscedasticity). In a regression.
1 BABS 502 Moving Averages, Decomposition and Exponential Smoothing Revised March 14, 2010.
The Box-Jenkins (ARIMA) Methodology
Seasonal ARIMA FPP Chapter 8.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 14 l Time Series: Understanding Changes over Time.
Forecasting is the art and science of predicting future events.
Ch16: Time Series 24 Nov 2011 BUSI275 Dr. Sean Ho HW8 due tonight Please download: 22-TheFed.xls 22-TheFed.xls.
Time series /Applied Forecasting 7005 Hilary term 2016 Prof. Rozenn Dahyot Room 128 Lloyd Institute School of Computer Science and Statistics Trinity College.
Economics 173 Business Statistics Lecture 28 © Fall 2001, Professor J. Petry
Review of Unit Root Testing D. A. Dickey North Carolina State University (Previously presented at Purdue Econ Dept.)
Lecture 8 ARIMA Forecasting II
Chapter 6: Autoregressive Integrated Moving Average (ARIMA) Models
Statistics for Managers using Microsoft Excel 3rd Edition
Applied Econometric Time-Series Data Analysis
Stock Prediction with ARIMA
Module 3 Forecasting a Single Variable from its own History, continued
Chapter 8 Supplement Forecasting.
Time Series introduction in R - Iñaki Puigdollers
BOX JENKINS (ARIMA) METHODOLOGY
Chap 7: Seasonal ARIMA Models
Presentation transcript:

John G. Zhang, Ph.D. Harper College jzhang@harpercollege.edu Looking Ahead of the Curve: an ARIMA Modeling Approach to Enrollment Forecasting John G. Zhang, Ph.D. Harper College jzhang@harpercollege.edu

Topics Why forecast How to forecast Why ARIMA What is ARIMA How to ARIMA How ARIMA did Discussion 47th AIR Annual Forum

Why Forecast Queries and Reports: what was Dashboard: what is Forecasts: what will be Forecast for enrollment: more valuable for resources planning 47th AIR Annual Forum

How to forecast Naïve forecast: random walk, moving average Exponential smoothing Markov chain Regression ARIMA Others Combining methods 47th AIR Annual Forum

Why ARIMA Naïve forecast: best guess if no patterns Exponential Smoothing: usually designed for one-step ahead forecast Markov chain: see reference Regression: frequently violates the assumption of uncorrelated errors ARIMA: worked well, more later Others: see reference Combining Methods: non-directional 47th AIR Annual Forum

What is ARIMA AutoRegressive Integrated Moving Average Generally, the model is given by 47th AIR Annual Forum

where Xt is a time series value at time t, 0 is a constant, B is a backshift or lag operator, i is a number of lags or spans,  is an error term at time t,  and θ are AR and MA parameters, and p, d, and q are the orders of AR, I, MA 47th AIR Annual Forum

If p = 1, 1 = 1, d = 0, θ1= 0, random walk: (1 - B)(Xt – θ0) = t if p = 1, d = 0, q = 1, ARMA(1, 1): (1 - 1B)(Xt – θ0) = (1 - θ1B) t If p = 1, d = 0, θ1 = 0, AR(1) model: (1 - 1B)(Xt – θ0) = t If p = 1, 1 = 1, d = 0, θ1= 0, random walk: (1 - B)(Xt – θ0) = t If 1 = 0, d = 0, θ1 = 0, constant: (Xt – θ0) = t 47th AIR Annual Forum

How to ARIMA Box and Jenkins (1976) notation: (p d q)(p d q)s Four stages: Identification Estimation Validation Forecasting 47th AIR Annual Forum

How to ARIMA SPSS Trends module: version 12 worked well version 13 and 14: algorithms changed same data, same program, different forecast SAS ETS module: ARIMA procedure more flexible forecast consistant automation possible thanks to macros 47th AIR Annual Forum

Identification Series Plot Autocorrelation plot Dickey-Fuller test of unit root hypothesis AR models to compare the log likelihood values for a series and its transformed series 47th AIR Annual Forum

Identification Degree of differencing Order of AR Order of MA Seasonality if any 47th AIR Annual Forum

Estimation Q statistics Goodness-of-fit criteria: variance estimate Akaike information criterion Schwartz Bayesian criterion Significance of parameters Residuals analysis Mean Absolute Percent Error 47th AIR Annual Forum

Data Time series data Date variable: year, quarter, month, week, day, hour, minute, second Enrollment data: FTE, headcount, seatcount Data points Nature of the series determines the forecast 47th AIR Annual Forum

Patterns of Data Trend: steady increase or decrease in the values of a times series Cycle: long-term patterns of rising and falling data Seasonality: regular change in the data values that occurs at the same time in a given period 47th AIR Annual Forum

FTE 47th AIR Annual Forum

FTE Pattern Trendy: FTE increasing from 1998 to 2006, suggesting non-stationary and differencing necessary Seasonal: higher in the Fall and Spring and lower in the Summer each and every year, implying a seasonal factor present as part of the model building process 47th AIR Annual Forum

Autocorrelations and Partial Autocorrelations (ACF and PACF) Lag Correlation -1 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 1 0 1.00000 | |********************| 1 0.64901 | . |************* | 2 0.29267 | . |****** | 3 -.06855 | . *| . | 4 -.42111 | ********| . | 5 -.42944 | *********| . | 6 -.43520 | *********| . | 7 -.40880 | ********| . | 8 -.38067 | ********| . | 9 -.06784 | . *| . | 10 0.25681 | . |***** . | 11 0.55983 | . |*********** | 12 0.85774 | . |***************** | 13 0.55625 | . |*********** | 14 0.24975 | . |***** . | 15 -.06186 | . *| . | 16 -.36715 | . *******| . | 17 -.37708 | . ********| . | 18 -.38454 | . ********| . | 19 -.36197 | . *******| . | 20 -.33780 | . *******| . | 21 -.07144 | . *| . | 22 0.20576 | . |**** . | 23 0.46222 | . |********* . | PACF Lag Correlation -1 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 1 1 0.64901 | . |************* | 2 -0.22210 | ****| . | 3 -0.28449 | ******| . | 4 -0.37073 | *******| . | 5 0.18006 | . |**** | 6 -0.26468 | *****| . | 7 -0.29117 | ******| . | 8 -0.45581 | *********| . | 9 0.72564 | . |*************** | 10 0.06626 | . |* . | 11 0.26005 | . |***** | 12 0.18460 | . |**** | 13 -0.22575 | *****| . | 14 0.14806 | . |***. | 15 0.10247 | . |** . | 16 0.16423 | . |***. | 17 -0.18254 | ****| . | 18 0.15059 | . |***. | 19 -0.04279 | . *| . | 20 0.11045 | . |** . | 21 -0.18268 | ****| . | 22 0.08106 | . |** . | 23 -0.06703 | . *| . | 47th AIR Annual Forum

Q Statistics Autocorrelation Check of Residuals To Chi- Pr > Lag Square DF ChiSq --------------------Autocorrelations-------------------- 6 385.69 6 <.0001 0.937 0.874 0.808 0.743 0.727 0.711 12 777.02 12 <.0001 0.709 0.707 0.752 0.799 0.833 0.866 18 1107.12 18 <.0001 0.811 0.755 0.697 0.640 0.624 0.608 24 1436.47 24 <.0001 0.605 0.603 0.640 0.679 0.706 0.732 Q Statistics show autocorrelations among various lags highly statistically significant Autocorrelations were very high Further actions needed 47th AIR Annual Forum

FTE Forecast 47th AIR Annual Forum

How ARIMA Did Accuracy: what matters most 2-period ahead: 0.74% (FTE) 0.50% (HC) 6-period ahead: 1.43% (FTE) 1.65% (HC) 10-period ahead: 1.40% (FTE) 2.52%(HC) Forecast error bigger into distant future Eleanor S. Fox (2005) 1.2% (4) 4.1% (8) NCES (2003) 1.9% (2) 3.6% (6) 47th AIR Annual Forum

Discussion Theoretically factors includable along with the time series itself like in regression Unemployment rate Consumer Price Index (CPI) High school student population District population Tuition Forecasts used for forecasting? 47th AIR Annual Forum

Discussion Stationarity and homogeneity Scarcity and spuriousness Seasonality and outliers Raw or cooked data Data mining and stepwise Fit and accuracy Additive or multiplicative (subset/factored) 47th AIR Annual Forum

Discussion Science and art Objective and Subjective Quantitative and qualitative Over-differencing and over-fitting Parsimony and uncertainty Simple or complex 47th AIR Annual Forum