Quantile Estimation for Heavy-Tailed Data 23/03/2000 J. Beirlant G. Matthys

Slides:



Advertisements
Similar presentations
Introduction to modelling extremes
Advertisements

Introduction to modelling extremes Marian Scott (with thanks to Clive Anderson, Trevor Hoey) NERC August 2009.
SJS SDI_21 Design of Statistical Investigations Stephen Senn 2 Background Stats.
COMM 472: Quantitative Analysis of Financial Decisions
Uncertainty and confidence intervals Statistical estimation methods, Finse Friday , 12.45–14.05 Andreas Lindén.
1 12. Principles of Parameter Estimation The purpose of this lecture is to illustrate the usefulness of the various concepts introduced and studied in.
USING DECISION SUPPORT SYSTEM TECHNIQUE FOR HYDROLOGICAL RISK ASSESSMENT CASE OF OUED MEKERRA IN THE WESTERN OF ALGERIA M. A. Yahiaoui Université de Bechar.
Introduction to Algorithmic Trading Strategies Lecture 8 Risk Management Haksun Li
Chap 8: Estimation of parameters & Fitting of Probability Distributions Section 6.1: INTRODUCTION Unknown parameter(s) values must be estimated before.
The General Linear Model. The Simple Linear Model Linear Regression.
Primbs, MS&E 345, Spring The Analysis of Volatility.
Model Fitting Jean-Yves Le Boudec 0. Contents 1 Virus Infection Data We would like to capture the growth of infected hosts (explanatory model) An exponential.
Non-Normal Distributions
1 Engineering Computation Part 6. 2 Probability density function.
Climate Change and Extreme Wave Heights in the North Atlantic Peter Challenor, Werenfrid Wimmer and Ian Ashton Southampton Oceanography Centre.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Probability By Zhichun Li.
Statistical inference form observational data Parameter estimation: Method of moments Use the data you have to calculate first and second moment To fit.
Statistical Inference and Regression Analysis: GB Professor William Greene Stern School of Business IOMS Department Department of Economics.
Maximum-Likelihood estimation Consider as usual a random sample x = x 1, …, x n from a distribution with p.d.f. f (x;  ) (and c.d.f. F(x;  ) ) The maximum.
KYIV SCHOOL OF ECONOMICS Financial Econometrics (2nd part): Introduction to Financial Time Series May 2011 Instructor: Maksym Obrizan Lecture notes III.
Useful Statistical Distributions for Econometrics Econometrics is usually concerned with the estimation of equations of the form: The normal distribution.
Volatility Chapter 9 Risk Management and Financial Institutions 2e, Chapter 9, Copyright © John C. Hull
July 3, Department of Computer and Information Science (IDA) Linköpings universitet, Sweden Minimal sufficient statistic.
Lecture 7 1 Statistics Statistics: 1. Model 2. Estimation 3. Hypothesis test.
Market Risk VaR: Historical Simulation Approach
Measuring market risk:
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Functions of Random Variables. Methods for determining the distribution of functions of Random Variables 1.Distribution function method 2.Moment generating.
Moment Generating Functions
Traffic Modeling.
Use of moment generating functions 1.Using the moment generating functions of X, Y, Z, …determine the moment generating function of W = h(X, Y, Z, …).
Ch5. Probability Densities II Dr. Deshi Ye
Chapter 7 Point Estimation
1 Lecture 16: Point Estimation Concepts and Methods Devore, Ch
Market Risk VaR: Historical Simulation Approach N. Gershun.
1 A non-Parametric Measure of Expected Shortfall (ES) By Kostas Giannopoulos UAE University.
Extreme values and risk Adam Butler Biomathematics & Statistics Scotland CCTC meeting, September 2007.
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology Principles of Parameter Estimation.
Extreme Value Theory: Part II Sample (N=1000) from a Normal Distribution N(0,1) and fitted curve.
Week 21 Stochastic Process - Introduction Stochastic processes are processes that proceed randomly in time. Rather than consider fixed random variables.
For information contact H. C. Koons 30 October Preliminary Analysis of ABFM Data WSR 11 x 11-km Average Harry Koons 30 October.
Input Modeling for Simulation Chapter 3 “As far as the laws of mathematics refer to reality, they are not certain, and as far as they are certain, they.
Estimation Method of Moments (MM) Methods of Moment estimation is a general method where equations for estimating parameters are found by equating population.
Sampling and estimation Petter Mostad
Identification of Extreme Climate by Extreme Value Theory Approach
S TOCHASTIC M ODELS L ECTURE 4 P ART II B ROWNIAN M OTIONS Nan Chen MSc Program in Financial Engineering The Chinese University of Hong Kong (Shenzhen)
Extreme Value Analysis
1 Optimizing Decisions over the Long-term in the Presence of Uncertain Response Edward Kambour.
Machine Learning 5. Parametric Methods.
Lecture 3: MLE, Bayes Learning, and Maximum Entropy
Stochastic Excess-of-Loss Pricing within a Financial Framework CAS 2005 Reinsurance Seminar Doris Schirmacher Ernesto Schirmacher Neeza Thandi.
Hydrological Forecasting. Introduction: How to use knowledge to predict from existing data, what will happen in future?. This is a fundamental problem.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
1 Ka-fu Wong University of Hong Kong A Brief Review of Probability, Statistics, and Regression for Forecasting.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Stochastic Process - Introduction
Hypotheses and test procedures
12. Principles of Parameter Estimation
Parameter Estimation 主講人:虞台文.
Planning a Simulation Study
Market Risk VaR: Historical Simulation Approach
Volatility Chapter 10.
Estimating the tail index from incomplete data
Simple Linear Regression
12. Principles of Parameter Estimation
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Presentation transcript:

Quantile Estimation for Heavy-Tailed Data 23/03/2000 J. Beirlant G. Matthys

Contents MODEL SPECIFICATION EXAMPLE : S&P500’s daily %returns EXTREME QUANTILE ESTIMATION : review of 3 classes of methods Blocks method Peaks-over-treshold (POT) method Q(uantile)-based methods IMPROVING Q-BASED METHODS : ML-estimator SIMULATION RESULTS CONCLUSIONS Introduction & Notation

Notation Sample X 1, X 2,…,X n from Order statistics : Pareto-type / heavy-tailed model –Tail decays essentially as a power function : with l slowly varying, i.e. –Terminology :tail index  extreme value index (EVI) –Equivalently : for some other slowly varying function l *

Examples of heavy-tailed distributions : –Pareto : –Student’s t with degrees of freedom : –Loggamma : –Fréchet : !Not: normal, exponential, lognormal, Weibull !

Heavy-tailed distributions in finance log-returns and exchange rates mostly do exhibit heavy tails (often for log-return series) often one is interested in a certain high level, which will be exceeded with only a (very) small probability search for an extreme quantile of the distribution Example : 16/10/1987: Risk manager wants to assess the risks his investment is exposed to and investigates some “worst case scenarios” estimate %fall of the S&P500 index that will happen only once every days ( 40 years) on average

Formally: Consider daily %falls as random variables X 1, X 2,…, X n,… and denote their stationary distribution (assuming it exists) with F then we look for quantile x of F that satisfies : F(x) = 1 - 1/ Problem of estimating extreme quantiles x p with tail probability p  (0,1), where p will be usually very small : 3 classes of methods : method of block maxima POT-method Q(uantile)-based methods

Peaks Over Treshold (POT) Method Limit law for excesses over a high treshold (Pickands— Balkema—de Haan, 1974, 1975): If F is a heavy-tailed distribution with EVI  >0 and excess distribution then where G , denotes the Generalised Pareto Distribution: (GPD)

Thus, for large tresholds u, for  >0 and some  > 0. Description of the POT method: 1 given a sample X 1, X 2,…, X n, select a high treshold u let N u be the number of exceedances denote the excesses for j=1,…,N u 2 fit a GPD G , to the excesses Y 1,…,Y n, e.g. with MLE parameter estimates

3 as estimate F(x) for x > u with where F n is the empirical distribution function; F n (u) = 1-N u /n 4 obtain quantile estimates by inverting (*) Note: in general for each different choice of treshold other parameter and quantile estimates ! (*)

MLE’s for GPD-fit of the excesses above treshold u = 1.5%: (thus ) 95%CI: (4.9, 10) compare the fitted excess distribution with the empirical one:

Q(uantile)-based Methods Hill estimator Pickands estimator Moment estimator MLE for exponential regression model

Hill estimator Heavy-tailed distribution: distribution of log(X): Quantile function: Order statistics of X 1, X 2,…,X n : is a consistent estimator for plot

slope estimator for k large, denominator 1 Hill estimator (1975) for each k (number of order statistics): different estimator –k smallbias small, variance large –k largevariance small, bias large bias-variance trade-off to select ‘optimal’ k

estimating a quantile Q(1-p) with Hill estimator: extrapolate along fitted line on Pareto quantile plot again different estimator for each k bias-variance trade-off needed to select ‘optimal’ k (Weissman, 1978)

Pickands estimator EVI estimator (1975): derived quantile estimator:

Moment estimator EVI estimator (Dekkers—Einmahl—de Haan, 1989): where derived quantile estimator:

k Q-based estimators for EVI of S&P500's %falls

k Q-based estimators for quantile of S&P500's %falls

ML Method improving Q-based methods (Beirlant, Dierckx, Goegebeur & Matthys, Extremes, 1999) Hill estimator : for 0 < k < n, consider log-spacings Assumption on second-order slow variation : for x   with  < 0 and b(x)  0 as x  

exponential regression model for log-spacings : with f 1,k, f 2,k,…,f k,k i.i.d. exponential (1) with uniform (0,1) order statistics with f 1, f 2,…,f k i.i.d. exponential (1)

Burr(1,1,2), k = 200, n = 500

k Q-based estimators for EVI of S&P500's %falls

quantile estimation : try to preserve stability –first idea, by analogy to Weissman estimator : –improvement, including estimated information on l* :

k Q-based estimators for quantile of S&P500's %falls

k Loggamma(1,2), p =

k Loggamma(2,2), p =

k Student's t2, p =

k Student's t4, p =

k Burr(1,1,1), p =

k Burr(1,0.5,2), p =

k Arch(1), p = 0.001, gamma = 0.466

k Arch(1), p = 0.001, gamma = 0.094

k Garch(1,1), p = 0.001, gamma = 0.419

k Garch(1,1), p = 0.001, gamma = 0.096

How to detect heavy-tailedness Extension of the Q-based ML method to all classes of distributions (not only heavy-tailed ones) Further investigation of asymptotic properties of the Q-based ML estimators + construction of CI’s for (financial) time series (dependent data) Related Problems and Questions