Jrothenb – 1 Joerg Rothenbuehler. jrothenb – 2 The distribution of the Maximum:

Slides:



Advertisements
Similar presentations
Introduction to modelling extremes
Advertisements

Introduction to modelling extremes Marian Scott (with thanks to Clive Anderson, Trevor Hoey) NERC August 2009.
Autocorrelation Functions and ARIMA Modelling
Class 6: Hypothesis testing and confidence intervals
1 McGill University Department of Civil Engineering and Applied Mechanics Montreal, Quebec, Canada.
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
Introduction to Algorithmic Trading Strategies Lecture 8 Risk Management Haksun Li
Time Series Building 1. Model Identification
1 CLUSTER ANALYSIS OF EUROPEAN DAILY TEMPERATURE SERIES: AN EXTREME VALUE APPROACH Andrés M. Alonso Departamento de Estadística Universidad Carlos II de.
The Age of Turbulence, Credit Derivatives Style Hans NE Byström Lund University.
STAT 497 APPLIED TIME SERIES ANALYSIS
Quantile Estimation for Heavy-Tailed Data 23/03/2000 J. Beirlant G. Matthys
Model Fitting Jean-Yves Le Boudec 0. Contents 1 Virus Infection Data We would like to capture the growth of infected hosts (explanatory model) An exponential.
Confidence Intervals Mon, March 22 nd. Point & Interval Estimates  Point estimate – use sample to estimate exact statistic to represent pop parameter.
Extremes ● An extreme value is an unusually large – or small – magnitude. ● Extreme value analysis (EVA) has as objective to quantify the stochastic behavior.
On the Constancy of Internet Path Properties Yin Zhang, Nick Duffield AT&T Labs Vern Paxson, Scott Shenker ACIRI Internet Measurement Workshop 2001 Presented.
Climate Change and Extreme Wave Heights in the North Atlantic Peter Challenor, Werenfrid Wimmer and Ian Ashton Southampton Oceanography Centre.
A Nonstationary Poisson View of Internet Traffic T. Karagiannis, M. Molle, M. Faloutsos University of California, Riverside A. Broido University of California,
Extremal cluster characteristics of a regime switching model, with hydrological applications Péter Elek, Krisztina Vasas and András Zempléni Eötvös Loránd.
Saturday May 02 PST 4 PM. Saturday May 02 PST 10:00 PM.
AP Statistics Section 10.2 A CI for Population Mean When is Unknown.
Eidgenössisches Departement des Innern EDI Bundesamt für Meteorologie und Klimatologie MeteoSchweiz Automating Peak-over- Threshold with the Information.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Measuring market risk:
Flood Frequency Analysis
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Ekaterina Ovchinnikova Finance Academy under the Government of the Russian Federation Loss distribution approach to assessing operational risk State-of-the.
Extreme Value Analysis What is extreme value analysis?  Different statistical distributions that are used to more accurately describe the extremes of.
Chapter 4 – Modeling Basic Operations and Inputs  Structural modeling: what we’ve done so far ◦ Logical aspects – entities, resources, paths, etc. 
© Copyright McGraw-Hill CHAPTER 3 Data Description.
February 3, 2010 Extreme offshore wave statistics in the North Sea.
SPC for Real-World Processes William A. Levinson, P.E. Intersil Corporation Mountaintop, PA.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Going to Extremes: A parametric study on Peak-Over-Threshold and other methods Wiebke Langreder Jørgen Højstrup Suzlon Energy A/S.
10.1: Confidence Intervals – The Basics. Introduction Is caffeine dependence real? What proportion of college students engage in binge drinking? How do.
Extreme Value Techniques Paul Gates - Lane Clark & Peacock James Orr - TSUNAMI GIRO Conference, 15 October 1999.
Market Risk VaR: Historical Simulation Approach N. Gershun.
Extreme values and risk Adam Butler Biomathematics & Statistics Scotland CCTC meeting, September 2007.
Extreme Value Theory: Part II Sample (N=1000) from a Normal Distribution N(0,1) and fitted curve.
A Survey of Statistical Methods for Climate Extremes Chris Ferro Climate Analysis Group Department of Meteorology University of Reading, UK 9th International.
Concepts and Applications of Kriging
For information contact H. C. Koons 30 October Preliminary Analysis of ABFM Data WSR 11 x 11-km Average Harry Koons 30 October.
Confidence Interval Estimation For statistical inference in decision making:
Esri UC 2014 | Technical Workshop | Concepts and Applications of Kriging Eric Krause Konstantin Krivoruchko.
Sampling and estimation Petter Mostad
Identification of Extreme Climate by Extreme Value Theory Approach
New approaches in extreme-value modeling A.Zempléni, A. Beke, V. Csiszár (Eötvös Loránd University, Budapest) Flood Risk Workshop,
Probability distributions
Extreme Value Analysis
Physics 490 – Project Lab Lab B2: Radioactive Decay, Counting Statistics, and the Geiger Tube Jack Young Richard Breazeale Ryan Phelan.
Discussion of Mandelbrot Themes: Alpha (Tail Index) and Scaling (H) Prepared by Sheri Markose, Amadeo Alentorn and Vikentia Provizionatou WEHIA 2005 Implications.
Analysis of Financial Data Spring 2012 Lecture: Introduction Priyantha Wijayatunga Department of Statistics, Umeå University
1 Lecture Plan Modelling Profit Distribution from Wind Production (Excel Case: Danish Wind Production and Spot Prices) Reasons for copula.
MODELLING SPOT PRICE OF ELECTRICITY IN NSW Hilary Green with Nino Kordzakhia and Ruben Thoplan 15th International Conference Computing in Economics.
A major Hungarian project for flood risk assessment A.Zempléni (Eötvös Loránd University, Budapest, visiting the TU Munich as a DAAD grantee) Technical.
Chapter 3 Lect 6 Forecasting. Seasonality – Repetition at Fixed Intervals Seasonal variations –Regularly repeating movements in series values that can.
Fewer permutations, more accurate P-values Theo A. Knijnenburg 1,*, Lodewyk F. A. Wessels 2, Marcel J. T. Reinders 3 and Ilya Shmulevich 1 1Institute for.
Application of Extreme Value Theory (EVT) in River Morphology
Parameter, Statistic and Random Samples
Modeling and Simulation CS 313
Probability and Statistics for Computer Scientists Second Edition, By: Michael Baron Chapter 8: Introduction to Statistics CIS Computational Probability.
Extreme Value Theory for High-Frequency Financial Data
Modeling and Simulation CS 313
Diagnostics and Transformation for SLR
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Warmup To check the accuracy of a scale, a weight is weighed repeatedly. The scale readings are normally distributed with a standard deviation of
Identification of Extreme Climate by Extreme Value Theory Approach
Diagnostics and Transformation for SLR
Introductory Statistics
Presentation transcript:

jrothenb – 1 Joerg Rothenbuehler

jrothenb – 2 The distribution of the Maximum:

jrothenb – 3 Fisher-Tippett Theorem

jrothenb – 4 The Extreme Value Distributions

jrothenb – 5 The three EVD can be represented by a single three parameter distribution, called the GENERALIZED EVD (GEV): Generalized Extreme Value Distribution (GEV)

jrothenb – 6 The function of the parameters

jrothenb – 7 Excesses over high thresholds

jrothenb – 8 Generalized Pareto Distribution (GPD)

jrothenb – 9 Properties of GPD

jrothenb – 10 The Empirical Mean Excess Function The empirical mean excess function of a GPD with

jrothenb – 11 Modeling Extreme Events:  The number of exceedances of a high threshold follows a Poisson process (iid exp. distributed interarrival times)  Excesses over a high threshold can be modeled by a GPD  An appropriate value of the high threshold can be found by plotting the empirical mean excess function.  The distribution of the maximum of a Poisson number of iid excesses over a high threshold is a GEV with the same shape parameter as the corresponding GPD.

jrothenb – 12 Extremal Index of a Stationary Time Series  The extremal index measures the dependence of the data in the tails.  can be interpreted as the average cluster size in the tails: High values appear in clusters of size  means there is no clustering in the tails.  If the data does not show strong long range dependence, but has extremal index, its maxima has distribution, where H is the GEV of iid data with the same marginal distribution.  GPD analysis may not be appropriate for data with

jrothenb – 13 The Data: Surveyor Project  One way delays of probe packets during one week  Packets sent according to a Poisson process with a rate of 2/sec  Packet is time-stamped to measure delay  If delay >10 sec, packet assumed lost, discarded  Saturday and Sunday excluded for analysis  More details:

jrothenb – 14 Time-Series Plot Colorado-Harvard Monday 12:00am - Friday 8:00 pm

jrothenb – 15 ACF and Ex. Index Estimation

jrothenb – 16 Empirical Mean Excess Function

jrothenb – 17 Estimation of Shape Parameter as a function the used threshold using GPD

jrothenb – 18 Result of the GPD Fit

jrothenb – 19 Fit of a GPD-Distr. for Colorado-Harvard  threshold =  Quantile of threshold =  Number of exceedances = 500  Parameter estimates and Standard Errors xi beta

jrothenb – 20 Estimations based on GPD Fit p quantile sfall empirical quantile

jrothenb – 21 Quantile estimation as a function of the threshold Empirical quantile % Estimate

jrothenb – 22 Fitting a GEV to block wise maxima Block 1 Block 2 Block 3 Block 4 Block 5

jrothenb – 23 GEV-Fit Results for different Block sizes  Block size = 7200 : 108 Blocks xi sigma mu Estimation Std. Error  Block size = : 54 Blocks xi sigma mu Estimation Std. Error

jrothenb – 24 High Level Estimation Level exceeded during 1 of 50 hours Block size LowerEstimateUpper 1h h h Level exceeded during 1 of 100 hours Block size LowerEstimateUpper 1h h h

jrothenb – 25 Does GPD always work? The Army-Lab. – Univ. of. Virginia dataset Time Series Plot ACF Plot, Lags:5-1000PACF Plot, Lags: ACF Plot, Lags:1-1000

jrothenb – 26 What goes wrong beyond the LRD: Empirical Mean Excess Function Shape Parameter

jrothenb – 27 Non-Stationarity: Harvard to Army- Lab. Time Series Plot: Monday 12 am – Friday 8 pm

jrothenb – 28 Pick a few hours per day! Mean Excess Plot 11am – 4pm Mon - Fri Empirical Tail Distr. Shape Parameter Estimation

jrothenb – 29 Single Outlier: Virginia - Harvard Empirical Tail Distr. ACF, Lag Estimation of Extremal Index Monday 12am – Friday 8pm

jrothenb – 30 The effect of the outlier on GEV  Fit Without outlier: Block size = Blocks xi sigma mu x1=  Fit With outlier Block size = Blocks xi sigma mu x1=

jrothenb – 31 The effect of the single outlier on GPD: Analysis with outlier Analysis without outlier

jrothenb – 32 Conclusions:  The GPD is a model that can be fitted to the tails of a distribution. The quality of the fit can be checked with various methods. From the model, we can gain quantile estimates at the edge of or outside the data range. However, a good fit is often not possible.  The GEV provides a model for the distribution of block wise Maxima. Its use is supported by EVT for stationary time series without strong LRD, while GPD is only supported in the iid case. The quality of fit can be checked with similar tools as in the GPD model. Certain problems remain, and reliable quantile estimates are not available.

jrothenb – 33 Acknowledgements: Applied Research Group at Telcordia: – E. van den Berg – K. Krishnan – J. Jerkins – A. Neidhardt – Y. Chandramouli Cornell University: – Prof. G. Samorodnitsky