Deirdre Syms, George Xia Macomb Community College November 5, 2015

Slides:



Advertisements
Similar presentations
Model Building For ARIMA time series
Advertisements

Part II – TIME SERIES ANALYSIS C5 ARIMA (Box-Jenkins) Models
Unit Roots & Forecasting
Slide 1 DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Lecture 7: Box-Jenkins Models – Part II (Ch. 9) Material.
STAT 497 APPLIED TIME SERIES ANALYSIS
Business Forecasting Chapter 10 The Box–Jenkins Method of Forecasting.
Bivariate Regression Analysis
Time Series Analysis using SAS prepared by John Fahey (former Load Forecaster at NSPI) and Voytek Grus (former Sales and Revenue Forecaster at BC Gas Inc.)
Qualitative Forecasting Methods
Software Quality Control Methods. Introduction Quality control methods have received a world wide surge of interest within the past couple of decades.
ARIMA Forecasting Lecture 7 and 8 - March 14-16, 2011
Modern methods The classical approach: MethodProsCons Time series regression Easy to implement Fairly easy to interpret Covariates may be added (normalization)
Chapter 2 Data Patterns and Choice of Forecasting Techniques
BOX JENKINS METHODOLOGY
Chapter 15 Forecasting Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
MS Production and Service Systems Operations Forecasting
Time Series “The Art of Forecasting”. What Is Forecasting? Process of predicting a future event Underlying basis of all business decisions –Production.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 24 Statistical Inference: Conclusion.
Regression Method.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Forecasting Professor Ahmadi.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 27 Time Series.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Autoregressive Integrated Moving Average (ARIMA) Popularly known as the Box-Jenkins methodology.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
Lesson 4 -Part A Forecasting Quantitative Approaches to Forecasting Components of a Time Series Measures of Forecast Accuracy Smoothing Methods Trend Projection.
John G. Zhang, Ph.D. Harper College
1 1 Slide Forecasting Professor Ahmadi. 2 2 Slide Learning Objectives n Understand when to use various types of forecasting models and the time horizon.
This material is approved for public release. Distribution is limited by the Software Engineering Institute to attendees. Sponsored by the U.S. Department.
Big Data at Home Depot KSU – Big Data Survey Course Steve Einbender Advanced Analytics Architect.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Forecasting Parameters of a firm (input, output and products)
Autocorrelation in Regression Analysis
1 Chapter 5 : Volatility Models Similar to linear regression analysis, many time series exhibit a non-constant variance (heteroscedasticity). In a regression.
The Box-Jenkins (ARIMA) Methodology
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 14 l Time Series: Understanding Changes over Time.
Time Series Analysis PART II. Econometric Forecasting Forecasting is an important part of econometric analysis, for some people probably the most important.
Subodh Kant. Auto-Regressive Integrated Moving Average Also known as Box-Jenkins methodology A type of linear model Capable of representing stationary.
Welcome to MM305 Unit 5 Seminar Dr. Bob Forecasting.
Yandell – Econ 216 Chap 16-1 Chapter 16 Time-Series Forecasting.
Energy Consumption Forecast Using JMP® Pro 11 Time Series Analysis
Why Model? Make predictions or forecasts where we don’t have data.
Linear Regression.
Forecasting Methods Dr. T. T. Kachwala.
Financial Econometrics Lecture Notes 2
Ch8 Time Series Modeling
Chapter 6: Autoregressive Integrated Moving Average (ARIMA) Models
“The Art of Forecasting”
Estimating with PROBE II
CHAPTER 16 ECONOMIC FORECASTING Damodar Gujarati
Module 3 Forecasting a Single Variable from its own History, continued
Unit Root & Augmented Dickey-Fuller (ADF) Test
Elements of a statistical test Statistical null hypotheses
Principal Component Analysis
The loss function, the normal equation,
Regression Assumptions
Mathematical Foundations of BME Reza Shadmehr
Psych 231: Research Methods in Psychology
Gathering and Organizing Data
Inferential Statistics
Chapter 13 Additional Topics in Regression Analysis
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
One-Factor Experiments
“Measures of Trend” Dr. A. PHILIP AROKIADOSS Chapter 1 Time Series
Lecturer Dr. Veronika Alhanaqtah
Load forecasting Prepared by N.CHATHRU.
Regression Assumptions
Chap 7: Seasonal ARIMA Models
Presentation transcript:

Deirdre Syms, George Xia Macomb Community College November 5, 2015 Using Multivariate Time Series Forecasting Models for Enrollment Predictions Deirdre Syms, George Xia Macomb Community College November 5, 2015

Bring the Case into Statistical Analysis and Forecasting A very interesting Macomb Community degree credit side student enrollment pattern over decades matches the local economic situation: When the local economy turns better, the enrollment figure went down When the local economy turns worse, the enrollment figure went up As a typical suburban two-year college, we provide quality education with lower costs to our students, thus we assume local economic data such as unemployment rates are affect our students’ enrollment behaviors. We/I do a lot of student enrollment reports at MCC, including each semester’s internal enrollment report, each middle fall’s IPEDS enrollment report, etc. Almost all these reports focus on the data facts and descriptive statistics. They are not related to inferential statistical analysis. Today, there are a lot of statistical analysis and forecasting tools for us to use in this case if we want to dig more useful information from our enrollment data, especially predicting the near future student enrollment. The tools may includes the recent versions of SAS/ETS and SPSS.

Pre-Data Analysis Considerations Before every inferential statistical analysis with modeling building, we have to examine if our statistical model(s) chosen is appropriate to our case. For example, in the most common analysis using ordinary two-sample t-test, each observation of data has to meet with three assumptions: independent each other, statistically identical variation, and normally distributed. To qualify a series of data to use statistical software such as SAS/ETS, the time series data need assumptions such as: Data points gathered sequentially through equal time periods Future value depends upon the present value and the present value depends on the past observations, the series length is 30 or more Stochastically stationary or can be mathematically converted into stationary

SAS/ETS Proc ARIMA Data Analysis –Univariate Time Series – Model 1 Proc ARIMA, what does ARIMA mean? It means: A procedure of Auto Regressive Integrative Moving Average There are three stages to proc ARIMA: Identification stage: to determine the degree of difference, d; also can use “nlag” to set up number of lags for autocorrelations are shown Estimation and diagnostic checking stage: to determine degree of auto regression, p; and degree of moving average, q; Forecasting stage: set up number of periods need to be predicted Thus ARIMA has three important parameters: ARIMA (p, d, q). In our case, after back and forth checking, we picked p = 1, nlag = 8 d = 2, and q = 0, i.e., we have ARIMA (1, 2, 0) model, and we set up our forecast periods as 3.

SAS Code for ARIMA Procedure – Univariate Time Series – Model 1 title1 'Macomb Community College Enrollment Study'; title2 'Fall 1982 - Fall 2015 Study'; title3 'ARIMA Method'; data a(keep=Year Fall UnempRt); set sasuser.enrlhsty2015; *if year > 2012 then delete; proc arima data=a; identify var=fall(2) nlag=8; run; estimate p=1; forecast lead=3 out=results id=year;

SAS Proc STATESPACE – Multi-Valuate Time Series – Model 2 We found that there is another time series that has similar behavior as our student enrollment series, that is the local unemployment rate series. SAS/ETS Proc STATESPACE is appropriate for jointly forecasting two more related time series having dynamic interactives. Our case fits the model well. From our ARIMA (1,2,0) model, we know that we chose difference d = 2, so we also set up the second series d = 2 The state space procedure approaches to modeling a multivariate stationary time series automatically (Akaike 1976). Therefore, we don’t have to set up p and q parameters for it.

SAS Code for STATESPACE – Multivariate Time Series – Model 2 title1 'Macomb Community College Enrollment Study'; title2 'Fall 1982 - Fall 2015 Study'; title3 'Multivariate Time Series Method'; data a1; set a; *if year > 2012 then delete; proc statespace data=a1 out=out2 lead=3 noprint; var Fall(2) UnempRt(2); id Year; proc print data=out2 id Year; run;

Actual vs Forecasting by Model 2(1)

Actual vs Forecasting by Model 2(2)

Forecasting Test Result for Fall Model Building Test -- Fall Year Fall MCC Method I - ARIMA   Method II - MultiVTS 3-Year Actual Enrolled Forecasted Difference 2012 24,160 24,702 542 24,443 283 2013 23,725 24,392 667 24,061 336 2014 23,370 24,881 1,511 24,240 870 2-Year 23,852 127 23,775 50 23,801 431 23,788 418 1-Year 23,041 (329) 23,637 267

Forecasting Result for Fall Forecasting Result -- Fall Year As of Today Method I - ARIMA   Method II - MultiVTS 2015 22,352 23,041 689 23,105 753 2016 23,356 22,902 2017 23,212 22,409

Forecasting Test Result for Winter Model Building Test -- Winter Year Winter MCC Method I - ARIMA   Method II - MultiVTS 3-Year Actual Enrolled Forecasted Difference 2013 23,986 24,163 177 23,245 (741) 2014 23,690 23,671 (19) 23,290 (400) 2015 22,168 24,045 1,877 23,009 841 2-Year 23,353 (337) 22,814 (876) 23,356 1,188 22,902 734 1-Year 22,846 678 22,472 304

Forecasting Result of Winter Forecasting Result -- Winter Year Actual Enrolled Method I - ARIMA   Method II - MultiVTS 2016 21,643 21,003 2017 21,585 21,052 2018 21,303 20,363

Research Notes Two types of time series models we mentioned, a univariate one and a multivariate one, we can compare and judge which one is better. The other model such as using SAS Proc PDL (polynomial distributed lag) also can be used. Since enrollment forecasts are difficult to make in periods of irregular patterns when turning points are unexpected. Therefore, subjective forecasting should not be ruled out. Actually, if we integrate our administrative experience and quantitative analysis, the forecasting may more accurate. Some factors are important, such as marketing effort, improve our program and curriculum set up, teaching reputation, etc., however, these factors are very hard to quantified and hard to bring in. Also, comparing with most of four-year colleges and universities, we charge lower tuition and fees, a small increase of tuition and fees will not affect enrollment much.

References SAS Institute (1993), SAS/ETS User’s Guide V6 second Edition SAS Inc. Abraham, B. and Ledolter, J. (1983). Statistical Method for Forecasting Chen, C. (2008) Meharry Medical College, An Integrated Enrollment Forecast Model, IR Applications V. 15 Xia, Z. 2001, Macomb Community College Student Enrollment Model Building with Trend Analysis and Forecasting