P á l Rakonczai, L á szl ó Varga, Andr á s Zempl é ni Copula fitting to time-dependent data, with applications to wind speed maxima Eötvös Loránd University.

Slides:



Advertisements
Similar presentations
Copula Representation of Joint Risk Driver Distribution
Advertisements

Probabilistic Analysis of Hydrological Loads to Optimize the Design of Flood Control Systems B. Klein, M. Pahlow, Y. Hundecha, C. Gattke and A. Schumann.
INTRODUCTION TO COPULAS
Structural reliability analysis with probability- boxes Hao Zhang School of Civil Engineering, University of Sydney, NSW 2006, Australia Michael Beer Institute.
Pattern Recognition and Machine Learning
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Pair-copula constructions of multiple dependence Workshop on ''Copulae: Theory and Practice'' Weierstrass Institute for Applied Analysis and.
Master thesis presentation Joanna Gatz TU Delft 29 of July 2007 Properties and Applications of the T copula.
Models for construction of multivariate dependence Workshop on Copulae and Multivariate Probability distributions in Finance – Theory, Applications,
Chap 8: Estimation of parameters & Fitting of Probability Distributions Section 6.1: INTRODUCTION Unknown parameter(s) values must be estimated before.
Copula approach to modeling of ARMA and GARCH models residuals Anna Petričková FSTA 2012, Liptovský Ján
Simulating Exchangeable Multivariate Archimedean Copulas and its Applications Authors: Florence Wu Emiliano A. Valdez Michael Sherris.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Parameter Estimation for Dependent Risks: Experiments with Bivariate Copula Models Authors: Florence Wu Michael Sherris Date: 11 November 2005.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Statistical inference
Machine Learning CMPT 726 Simon Fraser University
„EXTREME-VALUE ANALYSIS: FOCUSING ON THE FIT AND THE CONDITIONS, WITH HYDROLOGICAL APPLICATIONS” Dávid Bozsó, Pál Rakonczai, András Zempléni Eötvös Loránd.
Archimedean Copulas Theodore Charitos MSc. Student CROSS.
A) Transformation method (for continuous distributions) U(0,1) : uniform distribution f(x) : arbitrary distribution f(x) dx = U(0,1)(u) du When inverse.
Wang Yao Department of Statistics Rutgers University Mentor: Professor Regina Y. Liu DIMACS -- July 17, 2008 Extreme Value Theory.
Probability theory 2008 Outline of lecture 5 The multivariate normal distribution  Characterizing properties of the univariate normal distribution  Different.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
Higher-order Confidence Intervals for Stochastic Programming using Bootstrapping Cosmin G. Petra Joint work with Mihai Anitescu Mathematics and Computer.
Copula functions Advanced Methods of Risk Management Umberto Cherubini.
Correlations and Copulas 1. Measures of Dependence 2 The risk can be split into two parts: the individual risks and the dependence structure between them.
Simulation Output Analysis
CASA June 2006 BRATISLAVA Mária Bohdalová Faculty of Management, Comenius University Bratislava Oľga Nánásiová Faculty of Civil.
Statistics for Engineer Week II and Week III: Random Variables and Probability Distribution.
A statistical model Μ is a set of distributions (or regression functions), e.g., all uni-modal, smooth distributions. Μ is called a parametric model if.
An Empirical Likelihood Ratio Based Goodness-of-Fit Test for Two-parameter Weibull Distributions Presented by: Ms. Ratchadaporn Meksena Student ID:
CS433: Modeling and Simulation Dr. Anis Koubâa Al-Imam Mohammad bin Saud University 15 October 2010 Lecture 05: Statistical Analysis Tools.
Random Numbers and Simulation  Generating truly random numbers is not possible Programs have been developed to generate pseudo-random numbers Programs.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Continuous Variables Write message update equation as an expectation: Proposal distribution W t (x t ) for each node Samples define a random discretization.
Lévy copulas: Basic ideas and a new estimation method J L van Velsen, EC Modelling, ABN Amro TopQuants, November 2013.
Clustering and Testing in High- Dimensional Data M. Radavičius, G. Jakimauskas, J. Sušinskas (Institute of Mathematics and Informatics, Vilnius, Lithuania)
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Additional Topics in Prediction Methodology. Introduction Predictive distribution for random variable Y 0 is meant to capture all the information about.
Sampling and estimation Petter Mostad
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
Chapters 6 & 7 Overview Created by Erin Hodgess, Houston, Texas.
New approaches in extreme-value modeling A.Zempléni, A. Beke, V. Csiszár (Eötvös Loránd University, Budapest) Flood Risk Workshop,
New Information Technologies in Learning Statistics M. Mihova, Ž. Popeska Institute of Informatics Faculty of Natural Sciences and Mathematics, Macedonia.
Incorporating Uncertainties into Economic Forecasts: an Application to Forecasting Economic Activity in Croatia Dario Rukelj Ministry of Finance of the.
STA347 - week 91 Random Vectors and Matrices A random vector is a vector whose elements are random variables. The collective behavior of a p x 1 random.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
1 Probability and Statistics Confidence Intervals.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Gil McVean, Department of Statistics Thursday February 12 th 2009 Monte Carlo simulation.
Latvijas Aktuāru Asociācija AN INTRODUCTION TO COPULAS Gaida Pettere Professor, Dr. Math. Riga Technical University, Chairmen of Latvian Actuarial Association.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Bootstrapping James G. Anderson, Ph.D. Purdue University.
1 Lecture Plan Modelling Profit Distribution from Wind Production (Excel Case: Danish Wind Production and Spot Prices) Reasons for copula.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
A major Hungarian project for flood risk assessment A.Zempléni (Eötvös Loránd University, Budapest, visiting the TU Munich as a DAAD grantee) Technical.
Estimating standard error using bootstrap
Standard Errors Beside reporting a value of a point estimate we should consider some indication of its precision. For this we usually quote standard error.
Probability Theory and Parameter Estimation I
Ch3: Model Building through Regression
Special Topics In Scientific Computing
Predictive distributions
CONCEPTS OF ESTIMATION
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
Bootstrap - Example Suppose we have an estimator of a parameter and we want to express its accuracy by its standard error but its sampling distribution.
The Empirical FT. What is the large sample distribution of the EFT?
Pattern Recognition and Machine Learning
Presentation transcript:

P á l Rakonczai, L á szl ó Varga, Andr á s Zempl é ni Copula fitting to time-dependent data, with applications to wind speed maxima Eötvös Loránd University Faculty of Science Institute of Mathematics Department of Probability Theory and Statistics

Outline 1.Copulae 2.Goodness-of-fit tests 3.Bootstrap methods 4.Serial dependence 5.Applications to wind speed maxima 2

1. Copulae C is a copula, if it is a d-dimensional random vector with marginals ~ Unif [0,1] Existence (Sklar’s Theorem): to any d-dimensional random variable X with c.d.f. H and marginals F i (i=1,...,d) there exists a copula C : H( x 1, …, x d ) = C ( F 1 (x 1 ), …, F d (x d ) ) Uniqueness: if F i are continuous (i=1,...,d) Separation of the marginal model and the dependence 3

Elliptical Copulae – copulae of elliptical distributions – Gaussian: X ~ N n (0,Σ) where Φ: c.d.f. of N(0,1) – Student’s t: X ~ where t v : c.d.f. of Student’s t distribution with v degrees of freedom 1. Copulae – Examples 4

Archimedean Copulae Copula generator function: ϕ is continuous, strictly decreasing and ϕ (1)=0. d-variate Archimedean copula: – Gumbel: where – Clayton: where 1. Copulae - Examples 5

6

2. Goodness-of-fit tests in one dimension 1.Estimation of the model parameter 2.Goodness-of fit test: a)Cramér-von Mises tests: F n : empirical c.d.f. F: c.d.f. Φ : weight function  Anderson-Darling: b)Critical value – simulation: 1)Simulate a sample from the copula model C θ under H 0 2)Re-estimate by ML-method 3)Calculate the test statistics Repetition and estimation of p values 7

2. Goodness-of-fit tests in more dimensions Probability integral transformation (PIT) – mapping into the d-dimensional unit cube: ~H ~C, for i=1,...,n Kendall’s transform: (K function) Advantage: one-dimensional – Example: Archimedean copulas: where 8

Empirical version: where Kendall’s process:  favorable asymptotic properties Cramér-von Mises type statistic:  where Φ : weight function 9 2. Goodness-of-fit tests in more dimensions

3. Serial dependence Let X 1, X 2,..., X n be univariate stationary observations; EX i = μ, Var(X i )= σ 2. If X 1, X 2,..., X n are i.i.d., then Serial dependence → higher variance Effective sample size (n e ): where : estimated variance ← bootstrap 10

4. Bootstrap methods - Bootstrap intro Efron (1979) Let X 1, X 2,... be i.i.d. random variables with (unknown) common distribution F –X n ={X 1,..., X n } random sample – T n =t n ( X n ; F) random variable of interest, it’s distribution: G n Goal: approximation of the distribution G n Bootstrap method: – For given X n, we draw a simple random sample of size m (usually m ≈ n) – Common distribution of ’s: – – Repetition 11

4. Bootstrap methods - CBB Nonparametric bootstrap (sample size: n) – Block bootstrap Circular block bootstrap (CBB) 1.Let 2.For some m, let i 1, i 2..., i m be a uniform sample from the set {1, 2,..., n} 3.For block size b, construct n’=m·b (n’ ≈ n) pseudo-data: for j=1,...,b 4.Functional of interest, e.g. bootstrap sample mean: 12

4. Bootstrap methods – Block-length selection D.N.Politis-H. White (2004): automatic block-length selection Minimalize: where and g(.): spectral density function R(.): autocovariance function Optimal block size: Estimation of G and D 13

5. Applications to wind speed maxima Sample: n = 2591 observations of weekly wind speed maxima for 5 German towns Automatic block-length selection results: meteorologically no sense 14

15 5. Applications to wind speed maxima Method: 1.Fitting AR(1) modell to the data:, Z t ~Extreme value distr. 2.Calculation of the theoretical from AR(1) parameters: 3. b* optimal block size: where the simulated variance of the mean first crosses the theoretical value

Bootstrap simulation results b* = Applications to wind speed maxima

Bootstrap simulation results 5. Applications to wind speed maxima 17

Bremerhaven & Fehmarn Applications to wind speed maxima

Bremerhaven & Schleswig Applications to wind speed maxima

Fehmarn & Schleswig Applications to wind speed maxima

Prediction regions (Bremerhaven & Fehmarn) Applications to wind speed maxima Wind speed (m/s) Pred. regions: % lower(5%) bounds upper(95%) bounds block=1 lower bound block=7 lower bound block=30 lower bound block=1 upper bound block=7 upper bound block=30 upper bound

Final remarks Conclusions Copula choice is important Serial dependence largely influences the critical values of GoF tests Block size does not have a major impact on the estimated prediction region Future work Multivariate effective sample size Parametric bootstrap Acknowledgement We are grateful to the Doctoral School of Mathematics of ELTE for supporting L. Varga’s participation at SMTDA Conference.

Thank you for the attention 23

References P. Rakonczai, A. Zempléni: Copulas and goodness of fit tests. Recent advances in stochastic modeling and data analysis, World Scientific, pp , S.N. Lahiri: Resampling Methods for Dependent Data. Springer, D.N.Politis, H.White: Automatic Block-Length Selection for the Dependent Bootstrap. Econometric Reviews, Vol. 23, pp , P. Embrechts, F. Lindskog, A. McNeil: Modelling Dependence with Copulas and Applications to Risk Management. Department of Mathematics, ETHZ, Zürich, L.Kish: Survey Sampling, J. Wiley,