Frédéric Picard and Steve Matthews

Slides:



Advertisements
Similar presentations
Survey of Electronic Commerce and Technology: Past, Present and Future Challenges Jason Raymond Third International Conference on Establishment Surveys.
Advertisements

Migration of a large survey onto a micro-economic platform Val Cox April 2014.
Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.
Forecasting the Demand Those who do not remember the past are condemned to repeat it George Santayana ( ) a Spanish philosopher, essayist, poet.
Slide 1 DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Lecture 7: Box-Jenkins Models – Part II (Ch. 9) Material.
The Many Ways of Improving the Industrial Coding for Statistics Canada’s Business Register Yanick Beaucage ICES III June 2007.
Workshop on Energy Statistics, China September 2012 Electricity and Heat Statistics 1.
United Nations Statistics Division Scope and Role of Quarterly National Accounts Training Workshop on the Compilation of Quarterly National Accounts for.
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
ForecastingOMS 335 Welcome to Forecasting Summer Semester 2002 Introduction.
United Nations Statistics Division
Administrative Data at Statistics Canada – Current Uses and the Way Forward 27 th Voorburg Group Meeting Warsaw, Poland André Loranger October 4, 2012.
Forecasting and Statistical Process Control MBA Statistics COURSE #5.
 Overview of SPSS  Interface  Getting Started  Managing Data  Descriptive Statistics  Basic Analysis  Additional Resources.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
1 The availability, timeliness and quality of rapid estimates UNCTAD experience Henri Laurencin INTERNATIONAL SEMINAR ON TIMELINESS, METHODOLOGY AND COMPARABILITY.
The Future of Administrative Data ICES III End Panel Discussion Don Royce Statistics Canada June 2007.
Collecting Electronic Data From the Carriers: the Key to Success in the Canadian Trucking Commodity Origin and Destination Survey François Gagnon and Krista.
1 Appendix B: A Primer of Time Series Forecasting Models B.1 A Primer of Time Series Forecasting Models.
Listen to Your Meters! Pradeep Murthy SunTec Business Solutions.
Computing tasks associated with Time Series Processing Extract from a presentation by Fortier and Quenneville Statistics Canada -TSRAC BSMD Seminar --
Lyne Guertin Census Data Processing and Estimation Section Social Survey Methods Division Methodology Branch, Statistics Canada UNECE April 28-30, 2014.
V Bandi and R Lahdelma 1 Forecasting. V Bandi and R Lahdelma 2 Forecasting? Decision-making deals with future problems -Thus data describing future must.
R. Ty Jones Director of Institutional Research Columbia Basin College PNAIRP Annual Conference Portland, Oregon November 7, 2012 R. Ty Jones Director of.
Beijing, October 19, th International Roundtable on Business Survey Frames Co-ordinating role of the Business Register in Economic Statistics Results.
Processing Methodology of Tax Data at Statistics Canada Authors: François Brisebois, Richard Laroche and Rossana Manriquez (Statistics Canada) Presenter:
G W. Yan 1 Multi-Model Fusion for Robust Time-Series Forecasting Weizhong Yan Industrial Artificial Intelligence Lab GE Global Research Center.
Administrative Data at Statistics Canada – Current Uses and the Way Forward Wesley Yung and Peter Lys, Statistics Canada.
Simon Compton Methodology Directorate Office for National Statistics
Chapter 9 Forecasting Copyright 2015 Health Administration Press.
Copyright © 2013, SAS Institute Inc. All rights reserved. SAS SOLUTION DE JOUR: SAS OTTAWA PLATFORM USER SOCIETY INTRODUCTION TO SAS FORECAST SERVER 26.
Energy Consumption Forecast Using JMP® Pro 11 Time Series Analysis
GS/PPAL Research Methods and Information Systems
Regression Testing with its types
The treatment of uncertainty in the results
Forecasting Methods Dr. T. T. Kachwala.
Algorithmic complexity: Speed of algorithms
The user as data detective
Discussion 2 1/13/2014.
Carsten Boldsen Hansen Economic Statistics Section, UNECE
Shohreh Mirzaei Yeganeh United Nations Industrial Development
Lecture 8 ARIMA Forecasting II
HLM with Educational Large-Scale Assessment Data: Restrictions on Inferences due to Limited Sample Sizes Sabine Meinck International Association.
Evolving Data Processing in the Statistics Centre – Abu Dhabi
Henri Luomaranta, Statistics Finland
SDMX: Enabling World Bank to automate data ingestion
Étienne Saint-Pierre and Serge Godbout, Statistics Canada
An Active Collection using Intermediate Estimates to Manage Follow-Up of Non-Response and Measurement Errors Jeannine Claveau, Serge Godbout and Claude.
Macroeconomic heatmap taking the temperature of the Estonian economy
Survey phases, survey errors and quality control system
Sample surveys versus business register evaluations:
Survey phases, survey errors and quality control system
Global Assessment on Tendency Surveys
Assessing Quality of Paradata to Better Understand the Data Collection Process for CAPI Social Surveys François Laflamme Milana Karaganis European Conference.
Forecasting Qualitative Analysis Quantitative Analysis.
Quarterly National Accounts - Orientation
The System of National Accounts and Policy Development
Algorithmic complexity: Speed of algorithms
Regression testing Tor Stållhane.
Overview Historical review of the FGV’s Brazilian Manufacturing Survey
Pradeep Murthy SunTec Business Solutions
Regression Forecasting and Model Building
Algorithmic complexity: Speed of algorithms
ANALYSIS OF POSSIBILITY TO USE TAX AUTHORITY DATA IN STS. RESULTS
Lecturer Dr. Veronika Alhanaqtah
Sampling and estimation
TYPES of seasonal adjustment processes in banco de españa
Étienne Saint-Pierre, Statistics Canada
A handbook on validation methodology. Metrics.
Time Series Forecasting with SAS Forecast Server at NPPC
Presentation transcript:

Frédéric Picard and Steve Matthews Use of the SAS High-Performance Forecasting Software to Detect Break in Time Series Frédéric Picard and Steve Matthews SAS OPUS Ottawa, Ontario November 26, 2015

Outline Context SAS High Performance Forecasting Software Exploration of a Few Options Conclusion

Context Most Time Series at Statistics Canada are Surveys Repeated Over Time Annual Surveys Sub-Annual Surveys (Monthlies and Quarterlies) Several Domains

Methodological Changes in Time Series Try to minimize changes to surveys, but sometimes necessary: changing requirements changing population characteristics (cell phones, e-questionnaire) maintain or gain efficiency (sampling error) Artificial change could be misinterpreted as meaningful prefer to revise past to remove effect Impact is best estimated by conducting parallel run (costly)

Use of Forecasts for Time Series Break Detection Select a model based on past data Use that model to “forecast” the current period along with a 95% prediction interval. Looks if the survey estimate for the current period falls within the interval. If the survey estimate is outside of the interval then it is an indication that there is a break.

Statistical Forecasting : Annual Example Salaries and Wages, Industry domain ARIMA (1,1,0)

Challenges Related to the Volume Tens of thousands of survey estimates Model selection and estimation Manual model selection is a difficult and time consuming process Automated model selection can sometimes fail Inclusion or not of an auxiliary variable to improve forecast.

SAS High Performance Forecasting Features Several Options Automated model selection Fast Robust GUI SAS code (proc HPFdiagnose, HPFengine…) GUI can generate the corresponding SAS code

Several Options Model selection criteria (MAPE, RMSE,…), In-Sample vs Out-of-Sample Inclusion or not of an auxiliary variable (force or let HPF decide) Transform the data or not Several models available ESM, ARIMAX, UCM, IDM Outlier detections

Robust If ARIMAX models fail, HPF will try simpler models such as ESM. It has intermittent demand models for data with a lot of zeroes HPF will (almost) always provide a forecast and a confidence interval if the syntax and file formats are correct.

For our Project Used GUI to explore the different options and their impact on the models Looked at the generated SAS code We use the SAS code for the production Easier to manage datasets A little bit more flexible Easy to reuse with other datasets or when data is updated

Proc HPFdiagnose and HPFengine Helpful to determine the best model: Does the series have a trend? Is the series seasonal? Is the auxiliary variable a good predictor of the variable of interest? Should we transform the data using the log?

Forecasting a Seasonal Monthly Series Energy Consumption, Province*Industry domain ARIMA (0,1,0)(2,0,0)

Forecasting a non-seasonal Monthly Series Energy Consumption, Province*Industry domain ESM with trend

Usefulness of an auxiliary variable Sometimes, we have an auxiliary variable It has the potential of increasing the precision Usually, in order to be useful It has to be available for the period that we want to forecast the variable of interest It has to be a good predictor of the variable of interest.

Statistical Forecasting: Annual Example Total Revenue, Province*Industry domain ARIMA (0,1,0)

Statistical Forecasting: Annual Example Total Revenue, Province*Industry domain ARIMA(1,1,0) +X(1)

Roles to Variables

Corresponding Code proc hpfdiagnose data=… criterion=mape holdout=0; transform type=auto ; id date_SAS interval=YEAR; forecast Revenue ; input TaxableRevenue / REQUIRED=MAYBE(POSITIVE) ; arimax identifyorder=both; esm; run;

Transform Data or Not

Corresponding Code proc hpfdiagnose data=… criterion=mape holdout=0; transform type=auto ; id date_SAS interval=YEAR; forecast Revenue ; input TaxableRevenue / REQUIRED=MAYBE(POSITIVE) ; arimax identifyorder=both; esm; run;

Selection Criterion

Selection Criterion

Corresponding Code proc hpfdiagnose data=… criterion=mape holdout=0; transform type=auto ; id date_SAS interval=YEAR; forecast Revenue ; input TaxableRevenue / REQUIRED=MAYBE(POSITIVE) ; arimax identifyorder=both; esm; run;

Conclusion HPF was helpful to us to help manage the large number of forecast HPF allowed us to use complex models (with auxiliary variables) The GUI was helpful to explore different options HPF is highly automated but we have the option to intervene

Time Series Research and Analysis Centre Thank you! For more information,  Pour plus d’information, please contact: veuillez contacter : Frederic Picard Time Series Research and Analysis Centre Statistics Canada frederic.picard@canada.ca