Using Probabilistic Finite Automata to Simulate Hourly series of GLOBAL RADIATION. Mora-Lopez M. Sidrach-de-Cardona Shah Jayesh Valentino Crespi CS-594.

Slides:



Advertisements
Similar presentations
Estimating the detector coverage in a negative selection algorithm Zhou Ji St. Jude Childrens Research Hospital Dipankar Dasgupta The University of Memphis.
Advertisements

CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Probabilistic models Haixu Tang School of Informatics.
1 Detection and Analysis of Impulse Point Sequences on Correlated Disturbance Phone G. Filaretov, A. Avshalumov Moscow Power Engineering Institute, Moscow.
Random walk Presented by Changqing Li Mathematics Probability Statistics.
STAT 497 APPLIED TIME SERIES ANALYSIS
Chubaka Producciones Presenta :.
1 Alberto Montanari University of Bologna Simulation of synthetic series through stochastic processes.
1 Statistical Tests of Returns to Scale Using DEA Rajiv D. Banker Hsihui Chang Shih-Chi Chang.
Hypothesis testing Week 10 Lecture 2.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
PSY 307 – Statistics for the Behavioral Sciences
2012 JANUARY Sun Mon Tue Wed Thu Fri Sat
Extremes ● An extreme value is an unusually large – or small – magnitude. ● Extreme value analysis (EVA) has as objective to quantify the stochastic behavior.
Simulation Modeling and Analysis
Statistical Methods Chichang Jou Tamkang University.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Causal-State Splitting Reconstruction Ziba Rostamian CS 590 – Winter 2008.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
5-3 Inference on the Means of Two Populations, Variances Unknown
12th EMS Annual Meeting & 9th European Conference on Applied Climatology (ECAC), Łódź, September 2012 Rajmund Przybylak, Aleksandra Pospieszyńska.
Application of reliability prediction model adapted for the analysis of the ERP system Frane Urem, Krešimir Fertalj, Željko Mikulić College of Šibenik,
School Year Calendar You can print this template to use it as a wall calendar, or you can copy the page for any month to add it to your own presentation.
School Year Calendar You can print this template to use it as a wall calendar, or you can copy the page for any month to add it to your own presentation.
2007 Monthly Calendar You can print this template to use it as a wall calendar, or you can copy the page for any month to add it to your own presentation.
You can print this template to use it as a wall calendar, or you can copy the page for any month to add it to your own presentation. If you’d like to change.
You can print this template to use it as a wall calendar, or you can copy the page for any month to add it to your own presentation. If you’d like to change.
Downscaling in time. Aim is to make a probabilistic description of weather for next season –How often is it likely to rain, when is the rainy season likely.
Southern Taiwan University Department of Electrical engineering
Statistical Analysis of Solar Geomagnetic Storm Occurrences By: Seth Sivak.
Chapter 10. Sampling Strategy for Building Decision Trees from Very Large Databases Comprising Many Continuous Attributes Jean-Hugues Chauchat and Ricco.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
ELEMENTS OF MATHEMATICAL STATISTICS. GENERAL NOTIONS Mathematical statistics is a set of methods that are used to collect, analyze group and interprete.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
WORD JUMBLE. Months of the year Word in jumbled form e r r f b u y a Word in jumbled form e r r f b u y a february Click for the answer Next Question.
WEATHER BY: JENNIFER FAUTH KINDERGARTEN.
An Index of Data Size to Extract Decomposable Structures in LAD Hirotaka Ono Mutsunori Yagiura Toshihide Ibaraki (Kyoto University)
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
Tests of Random Number Generators
Lecture V Probability theory. Lecture questions Classical definition of probability Frequency probability Discrete variable and probability distribution.
1 CS 552/652 Speech Recognition with Hidden Markov Models Winter 2011 Oregon Health & Science University Center for Spoken Language Understanding John-Paul.
Chitu, E. Topor, E. Paltineanu, C. R. Dumitru, I. M. Sumedrea, D. Chitu, V. Ionita, A. D. Filipescu, L. Research Institute for Fruit Growing Pitesti -
Logic and Vocabulary of Hypothesis Tests Chapter 13.
An Index of Data Size to Extract Decomposable Structures in LAD Hirotaka Ono Mutsunori Yagiura Toshihide Ibaraki (Kyoto Univ.)
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
URBDP 591 I Lecture 4: Research Question Objectives How do we define a research question? What is a testable hypothesis? How do we test an hypothesis?
Learning Sequence Motifs Using Expectation Maximization (EM) and Gibbs Sampling BMI/CS 776 Mark Craven
2011 Calendar Important Dates/Events/Homework. SunSatFriThursWedTuesMon January
Surveying II. Lecture 1.. Types of errors There are several types of error that can occur, with different characteristics. Mistakes Such as miscounting.
WEPS Climate Data (Cligen and Windgen). Climate Data ▸ Climate Generation by Stochastic Process A stochastic process is one involving a randomly determined.
July 2007 SundayMondayTuesdayWednesdayThursdayFridaySaturday
More about tests and intervals CHAPTER 21. Do not state your claim as the null hypothesis, instead make what you’re trying to prove the alternative. The.
Analyzing Redistribution Matrix with Wavelet
McDonald’s Kalender 2009.
McDonald’s Kalender 2009.
Year 2 Autumn Term Week 12 Lesson 1
Statistical inference
McDonald’s Kalender 2009.
Decision Trees By Cole Daily CSCI 446.
McDonald’s calendar 2007.
Year 2 Autumn Term Week 12 Lesson 1
Computer Simulation Techniques Generating Pseudo-Random Numbers
McDonald’s calendar 2007.
Production Month Sun Hours K Monthly Kwh Tou Peak Value After Kwh
2015 January February March April May June July August September
Presentation transcript:

Using Probabilistic Finite Automata to Simulate Hourly series of GLOBAL RADIATION. Mora-Lopez M. Sidrach-de-Cardona Shah Jayesh Valentino Crespi CS-594

Overview Abstract. Introduction. Data Set. Probabilistic Finite Automata. Global Radiation. Generalization of Model. Result. Conclusion. Questions, Comments ??????

Abstract Model to generate synthetic series of hourly exposure of the Global Radiation. Based on Subclass of Probabilistic Finite Automata (PFA) for Variable-length Marcov Process. Check “variable memory” of Cloudiness…………

Introduction Traditionally, analysis based on stochastic process theory. Should eliminate negative values appears in series. PFA - Mathematical model developed in Artificial Intelligent and Machine Learning. Why Machine Learning Model ? Useful for studying system in which goal concept presents Probabilistic behavior.

Global Radiation alance/index.htmlhttp://earthguide.ucsd.edu/earthguide/diagrams/energyb alance/index.html

Data Set Data of hourly exposure series of global radiation were recorded over several years at 9 Spanish metrological station.

Data Set Weather characteristics of locations are very different. Moderate Atlantic Climate (Oviedo). Continental Continental Climate (Madrid, Tortosa). Costal Mediterranean Climate both in winter and summer (Malaga, Mallorca).

Probabilistic Finite Automata. First application used for universal data compression. Used for, Analysis of biological sequences, for DNA and proteins. Analysis of natural languages, handwriting and speech.

Probabilistic Finite Automata.

Building PFA for Hourly Global Radiation Hourly Clearness index, K h = G h / G h,0 Where, G h Hourly global radiation. G h,0 Extraterrestrial hourly global radiation

Building PFA for Hourly Global Radiation Why Constructed “Artificial”? Data from different days linked together. Last observation of each day is followed by the first observation of the following day.

Building PFA for Hourly Global Radiation Numbers of hours ? Each series (Month) is constant and equal for all locations. 10January, February, November, December. 12March, April, September, October. 14May, Jun, July, August. To Discretize the continuous values of clearness index we have only 8 different discrete values.

Building PFA for Hourly Global Radiation Relationship between Values of Clearness index and symbol Of alphabet. Don’t having uniform interval. Lower and upper intervals, frequency of values is less than other intervals.

Algorithm 1.Compute the series of discrete values. 2.Initialize the PDF with a node, with label null sequence. 3.The set PSS – Possible Subsequence Set – is initialized with all sequences of order 1. each element in this set corresponds to a sequence of discrete values. Take o =1 as the initial value of the order – that is, the size if subsequence to consider. 4.If there are elements of order o is PSS, pick any of these elements, Y. Using all discrete sequences in the series, compute the frequency of Y. if 4a and 4b are true, then go to 5, else go to 6. 4athe frequency of this sequence is greater than the threshold frequency. 4bfor same, the probability of occurrence of the subsequence is not equal to the probability of the subsequence final(Y)xp, that is

Algorithm (not equal: when the ratio between the probabilities is significantly greater than one; for instance, greater than 1.2). 5.Do 5aAdd to the PFA a node, labeled Y, and compute its corresponding probabilities vector. 5bFor each amplified sequence, Y xp ; if the probability of this amplified sequence is greater than the threshold probability, then include it in PSS. 6.Remove the analyzed subsequence, Y, from PSS. 7.If there are no elements of order o in PSS, add 1 to the value of o. if o<= N and there are elements of length o is PSS, then go to 4, else stop.

Generating new series of Hourly Global Radiation. Generate a synthetic series. Tested on null hypothesis that the series have same mean and variance, with significance level This series, selected as proxy for recorded one, else generate another synthetic series until we find a synthetic series which rejected the null hypothesis. In all cases, less than 10 synthetic series had to be generated.

Generating new series of Hourly Global Radiation. For each selected synthetic series, compare cumulative probability distribution function (cpdf) with cpdf of the recorded series. Comparison is based on the Kolmogorov-Smirnov two-sample test-statistic, which focus of the absolute value of the maximum difference between two empirical distribution function.

Generalization of the Simulated Model To generate a new series of hourly clearness index uses, input data as Mean monthly value of the daily clearness index Cpdf of the recorded month. Most of metrological stations, these values are not available and only mean monthly values of the daily global radiation are usually recorded. One of the Aim of Paper: To characterize the observed relationship between the recorded data and parameter used for the proposed model.

Generalization of the Simulated Model

Relationship between these two parameters have computed the correlation coefficient between them and proves to be Concluded Mean monthly daily clearness index which is available can be used in model instead of the mean monthly hourly clearness index.

Result List of Test 1.Both series have the same mean and variance have been tested. (with significant level 0.05). 2.Cpdf of the recorded and simulated series have be compared with Kolmogorov-Smirnov two-sample test statistic with a bootstrap P-value.

Result It is observed that 97.8 % of the month it is same.

Result

Conclusion Model only use monthly mean value of global radiation and generate following. Constructed PFA. Proposed standard cpdf. Generate new series of hourly global radiation similar to real one. Conclude that probabilistic Finite Automata can be used to characterize and predict new series of hourly global solar radiation series.

No Questions ??????????? Thank you……