Analysis of day-ahead electricity data Zita Marossy & Márk Szenes (ColBud) MANMADE workshop January 21, 2008.

Slides:



Advertisements
Similar presentations
Cointegration and Error Correction Models
Advertisements

Prediction from Quasi-Random Time Series Lorenza Saitta Dipartimento di Informatica Università del Piemonte Orientale Alessandria, Italy.
Use of Business Tendency Survey Results for Forecasting Industry Production in Slovakia Use of Business Tendency Survey Results for Forecasting Industry.
Part II – TIME SERIES ANALYSIS C5 ARIMA (Box-Jenkins) Models
Unit Roots & Forecasting
Slide 1 DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Lecture 7: Box-Jenkins Models – Part II (Ch. 9) Material.
Time Series Analysis Materials for this lecture Lecture 5 Lags.XLS Lecture 5 Stationarity.XLS Read Chapter 15 pages Read Chapter 16 Section 15.
Time Series Building 1. Model Identification
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
R. Werner Solar Terrestrial Influences Institute - BAS Time Series Analysis by means of inference statistical methods.
Chen Chu South China University of Technology. 1. Self-Similar process and Multi-fractal process There are 3 different definitions for self-similar process.
The Garch model and their Applications to the VaR
1 Pulp Price Analysis Prepared by Professor Martin L. Puterman for BABS 502 Sauder School of Business March 24, 2004.
Dr. Yukun Bao School of Management, HUST Business Forecasting: Experiments and Case Studies.
Forecasting using simple models
Analyzing stochastic time series Tutorial
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
Some Important Time Series Processes Fin250f: Lecture 8.3 Spring 2010.
Understanding and Predicting Host Load Peter A. Dinda Carnegie Mellon University
ARIMA Forecasting Lecture 7 and 8 - March 14-16, 2011
Short-Term Load Forecasting In Electricity Market N. M. Pindoriya Ph. D. Student (EE) Acknowledge: Dr. S. N. Singh ( EE ) Dr. S. K. Singh ( IIM-L )
1 4. Multiple Regression I ECON 251 Research Methods.
L7: ARIMA1 Lecture 7: ARIMA Model Process The following topics will be covered: Properties of Stock Returns AR model MA model ARMA Non-Stationary Process.
Time series analysis - lecture 1 Time series analysis Analysis of data for which the temporal order of the observations is important Two major objectives:
2. Forecasting. Forecasting  Using past data to help us determine what we think will happen in the future  Things typically forecasted Demand for products.
Slides 13b: Time-Series Models; Measuring Forecast Error
Chapter 2 Data Patterns and Choice of Forecasting Techniques
G. Peter Zhang Neurocomputing 50 (2003) 159–175 link Time series forecasting using a hybrid ARIMA and neural network model Presented by Trent Goughnour.
BOX JENKINS METHODOLOGY
CHAPTER 3 FORECASTING.
Stream-Based Electricity Load Forecast Authors: Joao Gama Pedro Pereira Rodrigues Presented by: Viktor Botev.
1 FARIMA(p,d,q) Model and Application n FARIMA Models -- fractional autoregressive integrated moving average n Generating FARIMA Processes n Traffic Modeling.
#1 EC 485: Time Series Analysis in a Nut Shell. #2 Data Preparation: 1)Plot data and examine for stationarity 2)Examine ACF for stationarity 3)If not.
Tutorial for solution of Assignment week 39 “A. Time series without seasonal variation Use the data in the file 'dollar.txt'. “
Data Mining: Neural Network Applications by Louise Francis CAS Annual Meeting, Nov 11, 2002 Francis Analytics and Actuarial Data Mining, Inc.
It’s About Time Mark Otto U. S. Fish and Wildlife Service.
1 4. Empirical distributions & prediction of returns 4.1 Prices and returns Price (P) ACF decays slowly. Log price: p = log(P) Single-period (rate of)
John G. Zhang, Ph.D. Harper College
The behaviour of day-ahead electricity prices Analysis of spot electricity prices using statistical, econometric, and econophysical methods Zita MAROSSY.
Big Data at Home Depot KSU – Big Data Survey Course Steve Einbender Advanced Analytics Architect.
FAT TAILS REFERENCES CONCLUSIONS SHANNON ENTROPY AND ADJUSTMENT OF PARAMETERS AN ADAPTIVE STOCHASTIC MODEL FOR RETURNS An adaptive stochastic model is.
Auto Regressive, Integrated, Moving Average Box-Jenkins models A stationary times series can be modelled on basis of the serial correlations in it. A non-stationary.
Time Series Analysis Lecture 11
1 Chapter 5 : Volatility Models Similar to linear regression analysis, many time series exhibit a non-constant variance (heteroscedasticity). In a regression.
1 BABS 502 Moving Averages, Decomposition and Exponential Smoothing Revised March 14, 2010.
Review and Summary Box-Jenkins models Stationary Time series AR(p), MA(q), ARMA(p,q)
MGS3100_03.ppt/Feb 11, 2016/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Time Series Forecasting Feb 11, 2016.
Ch16: Time Series 24 Nov 2011 BUSI275 Dr. Sean Ho HW8 due tonight Please download: 22-TheFed.xls 22-TheFed.xls.
Discussion of Mandelbrot Themes: Alpha (Tail Index) and Scaling (H) Prepared by Sheri Markose, Amadeo Alentorn and Vikentia Provizionatou WEHIA 2005 Implications.
Data Mining: Neural Network Applications by Louise Francis CAS Convention, Nov 13, 2001 Francis Analytics and Actuarial Data Mining, Inc.
Introduction to stochastic processes
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
MODELLING SPOT PRICE OF ELECTRICITY IN NSW Hilary Green with Nino Kordzakhia and Ruben Thoplan 15th International Conference Computing in Economics.
Economics 173 Business Statistics Lecture 28 © Fall 2001, Professor J. Petry
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
Ian Newcombe CO 2 LEVEL RISE OVER 26 YEARS. DATASET Quarterly Mauna Loa, HI CO 2 Record Quarterly US gasoline sales Quarterly US car and light truck sales.
Time Series Forecasting Trends and Seasons and Time Series Models PBS Chapters 13.1 and 13.2 © 2009 W.H. Freeman and Company.
Lecture 9 Forecasting. Introduction to Forecasting * * * * * * * * o o o o o o o o Model 1Model 2 Which model performs better? There are many forecasting.
Energy Consumption Forecast Using JMP® Pro 11 Time Series Analysis
Operations Management Contemporary Concepts and Cases
Case study 4: Multiplicative Seasonal ARIMA Model
Time Series Analysis and Its Applications
Chapter 6: Autoregressive Integrated Moving Average (ARIMA) Models
Meat sales forecasting (Panvita Group)
A Weighted Moving Average Process for Forecasting “Economics and Environment” By Chris P. Tsokos.
Forecasting.
Chapter 8 Supplement Forecasting.
Load forecasting Prepared by N.CHATHRU.
BOX JENKINS (ARIMA) METHODOLOGY
Chap 7: Seasonal ARIMA Models
Presentation transcript:

Analysis of day-ahead electricity data Zita Marossy & Márk Szenes (ColBud) MANMADE workshop January 21, 2008

Topics Stylized facts of electricity price data  Modeling variable: price  Autocorrelation structure  Persistence  Price distribution  Seasonality Time series modeling  Neural network  SETAR

Main results Persistence analysis  Underlying variable: price, not price change  Results: H = (0.8) Price distribution  Generalized extreme value distribution vs. Lévy distribution Design of a seasonal filter  Filtering the intra-weekly seasonality Performance evaluation of an ANN model  Reasonable for short-run forecasts SETAR model for determining price spikes Data: EEX, hourly day-ahead prices

Autocorrelation structure Seasonality Effect of intra-weekly seasonality is strong AC decays slowly

Modeling prices, not price changes 1. The price process has no unit root, there is no need to differentiate the time series 2. Electricity can not be stored: ‘return’ has no direct meaning 3. By differencing we cause spurious patterns in ACF:

Persistence analysis Calculating the Hurst exponent of prices Without differencing the time series Hurst exponent – classical usage (with differencing the time series first):  > 0.5 persistent process High ‘return’ shock followed by high ‘return’  = 0.5 random walk ‘Return’ is white noise  < 0.5 antipersistent process (mean reversion) Hurst exponent – without differencing  > 0.5 persistent process High price followed by high price = Are high prices persistent?  = 0.5 white noise  < 0.5 antipersistent process

Hurst exponent: estimation results MethodEstimated Hurst exponent Aggregated variance0.872 Differenced aggregated variance0.702 Aggregated absolute values/means0.924 Fractal dimension (Higuchi)0.967 Residuals of regression (Peng)0.811 R/S0.835 Periodogram0.891 Modified periodogram0.770 Wavelet0.839

Price distribution Two estimated distributions:  Lévy  Generalized extreme value

Comparison Kolmogorov test: Test statistic:  Lévy:  GEV: Mean of absolute differences:  Lévy: 8.07*10 -4  GEV: 7.18*10 -4

Seasonality Seasonality:  intradaily  Weekly Spectral decomposition  Periodogram of prices  Periodogram of ACF Filtering  Median or average week  Differencing  Moving average technique

Need for new seasonal filter The type of distribution changes

Suggested filter ‘GEV filter’ 1. Separately estimate a GEV distribution for each hour and day i: F1(i) 2. Transform the prices: F 2 -1 F 1,i (x) F 2 : lognormal cdf (parameters: entire distribution) 3. Model the prices of filtered data 4. Forecast 5. Transform the forecasts back into GEV

Empirical results Figures: periodogram of  ACF (orig prices)  ACF (filtered data) Intraweekly filtering  successful

Estimated GEV parameters

Distributions with high scale param

Conclusion Different hours of week behave differently There are a few hours with fatter tails These are more sensitive to price spikes We can model fat tails and forecasting separately

Performance evaluation of an ANN Short term price forecasting (few hours to days) ANN: simple but flexible tool Architecture: standard feedforward type Layers: 168 – 15 – 1 Input: historical data Training set: 42 days Prediction horizon: from 1 hour to 1 week

Performance evaluation of an ANN Measuring error by MAPE Testing against naive method Averaged over 50 runs: 50 consecutive weeks from Nov to Nov Results:  NN performs well in day-ahead forecasting  But it fails to compete with naive method in wider time horizon Improvements:  Exogenous variables

TAR (Threshold AR) SETAR Aim:  Identifying the limit (C) between high and low prices 2 state SETAR model  On daily price  Threshold: 44.26