Josefina López Herrera Advisors: François Cellier Gabriela Cembrano IOC - UA - IRI Time Series Prediction Using Inductive Reasoning Techniques.

Slides:



Advertisements
Similar presentations
FINANCIAL TIME-SERIES ECONOMETRICS SUN LIJIAN Feb 23,2001.
Advertisements

Yinyin Yuan and Chang-Tsun Li Computer Science Department

Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
Use of Kalman filters in time and frequency analysis John Davis 1st May 2011.
AGC DSP AGC DSP Professor A G Constantinides©1 Modern Spectral Estimation Modern Spectral Estimation is based on a priori assumptions on the manner, the.
Maximum Covariance Analysis Canonical Correlation Analysis.
Ch11 Curve Fitting Dr. Deshi Ye
Objectives (BPS chapter 24)
Mixed Structural and Behavioral Models for Predicting the Future Behavior of some Aspects of the Macroeconomy Mukund Moorthy 2nd February 1999.
Jump to first page Fuzzy Inductive Reasoning Predicting U.S. Food Demand in the 20th Century: A New Look at System Dynamics Jeffrey T. LaFrance, Professor.
Start Presentation © Prof. Dr. François E. Cellier Internal Seminar: Groups Arbenz/Cellier/Gander/Hinterberger May 29, 2007 Making Predictions in an Uncertain.
A Generalized Model for Financial Time Series Representation and Prediction Author: Depei Bao Presenter: Liao Shu Acknowledgement: Some figures in this.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
PROVIDING DISTRIBUTED FORECASTS OF PRECIPITATION USING A STATISTICAL NOWCAST SCHEME Neil I. Fox and Chris K. Wikle University of Missouri- Columbia.
UNIVERSITAT POLITÈCNICA DE CATALUNYA Programa de doctorat: Tecnologies avançades de la producció Qualitative Modelling of Complex Systems by Means of Fuzzy.
Mutual Information Mathematical Biology Seminar
Chapter 12 Simple Regression
BA 555 Practical Business Analysis
Simple Linear Regression
Enhanced Equal Frequency Partition Method for the Identification of a Water Demand System T. Escobet R.M. Huber A. Nebot F.E. Cellier Dept ESAII IRI Dept.
Development of Empirical Models From Process Data
Particle Filters for Mobile Robot Localization 11/24/2006 Aliakbar Gorji Roborics Instructor: Dr. Shiri Amirkabir University of Technology.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Estimation and the Kalman Filter David Johnson. The Mean of a Discrete Distribution “I have more legs than average”
Forecasting McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
The Analysis of Variance
Josefina López Herrera Institut d’Informàtica i Robòtica Industrial Universitat Politècnica de Catalunya Edifici Nexus Gran Capità 2-4 Barcelona 08034,
Today Concepts underlying inferential statistics
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Start Presentation January 18, 2014 Inductive Modeling In this presentation, we shall study general techniques for identifying complex non-linear models.
Statistical Methods for long-range forecast By Syunji Takahashi Climate Prediction Division JMA.
Lecture II-2: Probability Review
Introduction to Regression Analysis, Chapter 13,
Adaptive Signal Processing
1 Prediction of Software Reliability Using Neural Network and Fuzzy Logic Professor David Rine Seminar Notes.
Traffic modeling and Prediction ----Linear Models
ECON 6012 Cost Benefit Analysis Memorial University of Newfoundland
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Demand Management and Forecasting
Computational Stochastic Optimization: Bridging communities October 25, 2012 Warren Powell CASTLE Laboratory Princeton University
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Intro. ANN & Fuzzy Systems Lecture 26 Modeling (1): Time Series Prediction.
Applications of Neural Networks in Time-Series Analysis Adam Maus Computer Science Department Mentor: Doctor Sprott Physics Department.
Managerial Economics Demand Estimation & Forecasting.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
BioSS reading group Adam Butler, 21 June 2006 Allen & Stott (2003) Estimating signal amplitudes in optimal fingerprinting, part I: theory. Climate dynamics,
Start PresentationSeptember 21, 2009 Mixed Quantitative and Qualitative Simulation in Modelica Prof. Dr. François E. Cellier Department of Computer Science.
Correlation & Regression Analysis
Chapter 8: Simple Linear Regression Yang Zhenlin.
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
Hybrid Load Forecasting Method With Analysis of Temperature Sensitivities Authors: Kyung-Bin Song, Seong-Kwan Ha, Jung-Wook Park, Dong-Jin Kweon, Kyu-Ho.
McGraw-Hill/Irwin Copyright © 2008 by The McGraw-Hill Companies, Inc. All rights reserved. Demand Management and Forecasting CHAPTER 10.
Visual-FIR: A new platform for modeling and prediction of dynamical Systems Antoni Escobet 1 Àngela Nebot 2 François E. Cellier 3 1 Dept. ESAII Universitat.
Review and Summary Box-Jenkins models Stationary Time series AR(p), MA(q), ARMA(p,q)
The Box-Jenkins (ARIMA) Methodology
CHEE825 Fall 2005J. McLellan1 Nonlinear Empirical Models.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
CPH Dr. Charnigo Chap. 11 Notes Figure 11.2 provides a diagram which shows, at a glance, what a neural network does. Inputs X 1, X 2,.., X P are.
Does the brain compute confidence estimates about decisions?
Chapter 4 Basic Estimation Techniques
Why Stochastic Hydrology ?
Chapter 11: Simple Linear Regression
Dr. Unnikrishnan P.C. Professor, EEE
Facultad de Ingeniería, Centro de Cálculo
Presentation transcript:

Josefina López Herrera Advisors: François Cellier Gabriela Cembrano IOC - UA - IRI Time Series Prediction Using Inductive Reasoning Techniques

Table of Contents Contributions principales. Antecedents. Time Series Analysis Techniques. Fuzzy Inductive Reasoning (FIR) for Time Series Analysis. Time Series Characteristics. Conclusions and Future Research.

Contributions Evaluation of Prediction Error. Confidence Measures for Prediction in FIR. Dynamic Mask Allocation. Estimation of Horizon of Predictability. Applications: –Early Warning Using Smart Sensors. –Signal Predictive Control Using FIR

Antecedents George Klir at the State University of New York Uyttenhove 1978, Klir 1985 François Cellier at the University of Arizona Cellier and Yandell 1987, D. Li and Cellier 1990,Cellier 1991,Cellier et al. 1996, Cellier et al Rafael Huber and Gabriela Cembrano at the IRI Institute (UPC-CSIC)

PhD. Dissertations UPC-UA Angela Nebot Castells (1994) Qualitative Modeling and Simulation of Biomedical Systems using FIR Francisco Múgica (1995) Diseño Sistemático de Controladores Difusos Usando Razonamiento Inductivo Alvaro de Albornoz Bueno (1996) Inductive Reasoning and Reconstruction Analysis: Two Complementary Tools for Qualitative Fault Monitoring of Large-Scale Systems

Linear Models Linear Models Non-Linear Models Non-Linear Models Fuzzy Logic Fuzzy Logic Pattern-Based Approaches Pattern-Based Approaches Time Series Analysis Techniques Time Series Analysis Techniques FIR

Stationarity will be assumed. Prefiltering of data may be necessary. Probabilistic Reasoning. Ljung 1999, Brockwell and David 1991, 1996,Box Jenkins Stochastic Time Series. Linear Models Linear Models

Parametric Models, Learning Techniques At least Quasi-stationary Deterministic Elements State Space Models (Casdagli and Eubank 1992) Neural Networks (Weigend and Gershenfeld 1994) Hybrid Models (Delgado 1998, Telecom 1994) Non-Linear Models Non-Linear Models

Non-parametric Models, Synthesized Techniques At least Quasi-stationary, Deterministic Elements Fuzzy Neural Networks (Jang 1997) FIR (López et al. 1996) Mixed Models : Burr 1998, Takagi and Sugeno 1991 Fuzzy Logic Fuzzy Logic

Fuzzification: Conversion to qualitative variables (Fuzzy Recoding) Qualitative Modeling:Find the best qualitative relationship between inputs and outputs (Fuzzy Modeling) Qualitative Simulation: Forecasting of future qualitative outputs (Fuzzy Simulation) Defuzzification : Conversion to quantitative variables (Regeneration) FIR

Qualitative Modeling

Qualitative Simulation Behavior Matrix Raw Data Matrix Optimal Matrix 1 ? Input Pattern Distance Computation Euclidean d j Output Forecast Computation f i =F(W*5-NN-out) Forecast Value 5-Nearest Neighbors Matched Input Pattern Class Side Member

Time Series Forecasting In univariate time series, only a single variable has been observed, the future values of which are to be predicted on the basis of their own past. In this case, the mask candidate matrix has n-rows and one column. In order to decide the depth of the mask, the autocorrelation function is used.

Characteristics of Time Series B -Barcelona water demand time series V -Van-der- Pol oscillator time series L- chaotic intensity pulsation of a single-mode far infrared NH 3 laser beam Weigend and Gershenfeld 1994

Water Demand Prediction Data Daily Demand in Barcelona. Jan Nov The process is quasi-stationary, and its variance is roughly constant.

Water Demand Prediction The water demand on any given day is strongly correlated with the demand seven days earlier. Autocorrelation function of daily demand series.

Water Demand Prediction The result of prediction was:

Prediction Error

Qualitative Simulation with FIR prediction for time using k steps real data predicted data

Comparison of FIR with other Methodologies for the Barcelona Water Demand Time Series without intervention analysis *) with intervention analysis Related Investigation

Comparison of FIR with other Methodologies

Confidence Measures Confidence Measures Crisp Fuzzy Logic Fuzzy Logic Proximity Similarity

Sources of Uncertainty in Predictions Dispersion among neighbors in input space. Uncertainty related to quantity of measurements. Dispersion among neighbors in output space. Uncertainty related to quality of measurements.

Proximity Measure This measure is related to establishing the distance between the testing input state and the training input states of its five nearest neighbors in the experience data base and to establishing distance measures between the output states of the five nearest neighbors among themselves.

Similarity Measure (Dubois and Pradé 1980). A=B then S 1 (A,B) = 1.0 A disjoint B then S 1 (A,B) = 0.0 This measure is defined without the explicit use of a distance function, the similarity measure presented is based on intersection, union and cardinality.

Similarity Measure The similarity of the i th m-input of the j th nearest neighbor to the testing m-input based on intersection can be defined as follows: The overall similarity of the j th neighbor is defined as the average similarity of all its m- inputs in the input space: where q i are normalized values in the range from 0 to 1.

Similarity Measure The similarity of the j th neighbor to the estimated testing m-output based on intersection can be defined as follows: A confidence value based on similarity measures can thus be defined :

FIR Confidence Measures for NH 3 Time Series Deterministic process Similarity and Proximity

FIR Confidence Measures for Barcelona Time Series Stochastic Process with deterministic elements. The relationship between the prediction error and the confidence measures is less evident. The two are positively correlated.

Evaluation of Confidence Measures The similarity measure is more sensitive to the prediction error because the similarity measure preserves the qualitative difference between a new input state and its neighbors in the experience data base. The confidence measures are indicators of how well the series may be fitted by an autoregressive or deterministic model.

FIR Mask #1 FIR Mask #2 Mask Selector FIR Mask #n Switch Selector c1c1 c2c2 ynyn y1y1 y2y2 cncn Best mask y TsTs y i predicted output using mask m i c i estimated confidence Dynamic Mask Allocation in Fuzzy Inductive Reasoning (DMAFIR)

Optimal and Suboptimal Mask for Barcelona Time Series

Dynamic Mask Allocation Applied to Barcelona Time Series

Prediction and Simulation FIR Predictions use different masks to predict future values n-steps into the future, avoiding the use of already predicted (contamined) data in the predictions. FIR Simulations use the optimal mask of the single step prediction recursively, minimizing the distance of extrapolation at the expense of recursively using already contamined data.

Qualitative Prediction 1-step prediction Mask candidate matrix Optimal Mask 2-step prediction 3-step prediction

Simulation and Prediction Without dynamic mask allocation for Barcelona time series. Comparison of FIR qualitative simulation and prediction with dynamic mask allocation for Barcelona time series.

DMAFIR Algorithm to Predict Time Series with Multiple Regimes The behavioral patterns change between segments. Van-der-Pol oscillator series is introduced. This oscillator is described by the following second-order differential equation : By choosing the outputs of the two integrators as two state variables: The following state-space model is obtained: Output Time Series

DMAFIR Algorithm to Predict Time Series with Multiple Regimes * the input/output behaviors will be different because of the different training data used by the two models

Prediction Errors for Van-der-Pol Series FIR during the prediction looks for five good neighbors, it only encounters four that are truly pertinent. The values along the diagonal are smallest and the values in the two remaining corners are largest.

One-day Predictions of the Van-der-Pol Multiple Regimes Series. A time series was constructed in which the variable  assumes a value of 1.5 during one segment, followed by a value of 2.5 during the second time segment, followed by 3.5. The multiple regimes series consists of 553 samples.

Prediction Errors for Multiple Regimes Van-der-Pol Series The model obtained for  = 1.5 cannot predict the higher peaks of the second and third time segment very well. The DMAFIR error demostrates that this new technique can indeed be successfully applied to the problem of predicting time series that operate in multiple regimes.

Variable Structure System Prediction with DMAFIR A time-varying system exhibits an entire spectrum of different behavioral patterns. To demonstrate DMAFIR’s ability of dealing with time-varying systems, the Van-der-Pol oscillator is used. A series was generated, in which changes its value continuously in the range from 1.0 to 3.5. The time series contains 953 records sampled using a sampling interval of 0.05.

One-day Predictions of the Van-der-Pol Time-varying Series Using DMAFIR with the Similarity Confidence Measure Predictions Errors for Time-varying Van-der-Pol Series.

Predicting the Predictability Horizon The errors are likely to accumulate during iterative predictions of future values of a time series. It is thus of much interest to the user of such a tool to be able to assess the quality of predictions made not only locally, but as a function of time. When the predictions depend on previously predicted data points these are by themselves associated with a degree of uncertainty already. In the first step of a multiple-step prediction, the predicted value depends entirely on measurement data. The local error can be indirectly estimated using the proximity or similarity measure. Either measure can easily be extended to become an estimator of accumulated confidence

Water Demand of the City of Barcelona Multiple Step simulation using FIR

Conclusions The prediction made by CIR (Causal Inductive Reasoning) were not significantly better. The confidence measure of FIR are an indirect prediction error estimate. A new formula to assess the error of predictions of a univariate time series, the FIR filters out what it considers to be a noise. FIR provides the model automatically, not requires a significant development effort as well as knowledge about the nature of the process form wich the series was derived. The confidence measures provide at least a statistical estimate for the quality of the prediction. Several suboptimal mask are used to make, in parallel forecast of the same time series. Each of the forecast is accompanied by an estimate of its quality. In each step, the one forecast is kept as the true forecast to be reported back to the user that shows the highest confidence value. A set of formulae has been devised to estimate the effects of data contamination on the accunulated confidence over multiple prediction steps. The FIR is a robust methodology, after López et al. 96 some UPC groups use FIR like Prediction Module in an Optimation Tool for Water Distribution Networks, Quevedo et al

Publications Cellier, F. And J. López (1995). Causal Inductive Reasoning. A new paradigm for data-driven qualitative simulation of continuous-time dynamical systems. Systems Analysis Modelling Simulation 18(1), pp Cellier F., J. López, A. Nebot, G. Cembrano (1996), Means for estimating the forecasting error in Fuzzy Inductive Reasoning, ESM´96:European Simulation Multiconference, Budapest, Hungary, June 2-6, pp López J., G. Cembrano, F, Cellier (1996), Time series prediction using Fuzzy Inductive Reasoning, ESM´96:European Simulation Multiconference, Budapest, Hungary, June 2-6, pp Cellier F., J. López, A. Nebot, G. Cembrano (1998), Confidence measures in Fuzzy Inductive Reasoning, International Journal of General Systems, in print. López J., F. Cellier (1999), Improving the Forecasting Capability of Fuzzy Inductive Reasoning by Means of Dynamic Mask Allocation, ESM´99:European Simulation Multiconference, in print. López J., F. Cellier, G. Cembrano, L. Ljung, (1999), Estimating the horizon of predictability in time series predictions using inductive modeling tools, International Journal of General Systems, submitted for publication.

Future Research Use of time-series predictors in the design of smart sensors with look-ahead capabilities. If a sensor with look-ahead capability can anticipate the crossing of a critical threshold, it may issue an early warning that might enable the plant operator to do something about the problem before it ever occurs. (Appendix A) The design of signal predictive controllers that make use of smart sensors of the class introduced in Appendix A, to improve the control performance of feedback control systems.