Extreme values and risk Adam Butler Biomathematics & Statistics Scotland CCTC meeting, September 2007.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Introduction to modelling extremes
Introduction to modelling extremes Marian Scott (with thanks to Clive Anderson, Trevor Hoey) NERC August 2009.
Probability models- the Normal especially.
Estimation of Means and Proportions
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Econ. & Mat. Enrique Navarrete Palisade Risk Conference
USING DECISION SUPPORT SYSTEM TECHNIQUE FOR HYDROLOGICAL RISK ASSESSMENT CASE OF OUED MEKERRA IN THE WESTERN OF ALGERIA M. A. Yahiaoui Université de Bechar.
Attributes Data Binomial and Poisson Data. Discrete Data All data comes in Discrete form. For Measurement data, in principle, it is on a continuous scale,
Chap 8: Estimation of parameters & Fitting of Probability Distributions Section 6.1: INTRODUCTION Unknown parameter(s) values must be estimated before.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Bayesian inference Gil McVean, Department of Statistics Monday 17 th November 2008.
Extremes ● An extreme value is an unusually large – or small – magnitude. ● Extreme value analysis (EVA) has as objective to quantify the stochastic behavior.
Climate Change and Extreme Wave Heights in the North Atlantic Peter Challenor, Werenfrid Wimmer and Ian Ashton Southampton Oceanography Centre.
WFM 5201: Data Management and Statistical Analysis
Maximum Likelihood We have studied the OLS estimator. It only applies under certain assumptions In particular,  ~ N(0, 2 ) But what if the sampling distribution.
Statistics and Probability Theory Prof. Dr. Michael Havbro Faber
Extreme Value Analysis, August 15-19, Bayesian analysis of extremes in hydrology A powerful tool for knowledge integration and uncertainties assessment.
Slide 1 Statistics Workshop Tutorial 4 Probability Probability Distributions.
Slide 1 Statistics Workshop Tutorial 7 Discrete Random Variables Binomial Distributions.
Role and Place of Statistical Data Analysis and very simple applications Simplified diagram of a scientific research When you know the system: Estimation.
Maximum likelihood (ML)
Standard error of estimate & Confidence interval.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Stress testing and Extreme Value Theory By A V Vedpuriswar September 12, 2009.
Common Probability Distributions in Finance. The Normal Distribution The normal distribution is a continuous, bell-shaped distribution that is completely.
Extreme values Adam Butler Biomathematics & Statistics Scotland Seminar at MLURI, January 2008.
Bayesian Spatial Modeling of Extreme Precipitation Return Levels Daniel COOLEY, Douglas NYCHKA, and Philippe NAVEAU (2007, JASA)
Extreme Value Analysis What is extreme value analysis?  Different statistical distributions that are used to more accurately describe the extremes of.
Statistical Decision Theory
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Copyright ©2011 Nelson Education Limited Large-Sample Tests of Hypotheses CHAPTER 9.
Physics 270 – Experimental Physics. Standard Deviation of the Mean (Standard Error) When we report the average value of n measurements, the uncertainty.
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Introduction to Probability and Statistics Thirteenth Edition Chapter 9 Large-Sample Tests of Hypotheses.
STATSPUNE 1 Are our dams over-designed? S.A.ParanjpeA.P.Gore.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Maximum Likelihood Estimation Methods of Economic Investigation Lecture 17.
1 A non-Parametric Measure of Expected Shortfall (ES) By Kostas Giannopoulos UAE University.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
INTRODUCTORY STUDY : WATER INDICATORS AND STATISTICAL ANALYSIS OF THE HYDROLOGICAL DATA EAST OF GUADIANA RIVER by Nikolas Kotsovinos,P. Angelidis, V. Hrissanthou,
Extreme Value Theory: Part II Sample (N=1000) from a Normal Distribution N(0,1) and fitted curve.
Issues concerning the interpretation of statistical significance tests.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
Risk Analysis & Modelling Lecture 10: Extreme Value Theory.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Introduction Suppose that a pharmaceutical company is concerned that the mean potency  of an antibiotic meet the minimum government potency standards.
Identification of Extreme Climate by Extreme Value Theory Approach
Chapter 9 Sampling Distributions Sir Naseer Shahzada.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
New approaches in extreme-value modeling A.Zempléni, A. Beke, V. Csiszár (Eötvös Loránd University, Budapest) Flood Risk Workshop,
Extreme Value Analysis
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Hydrological Forecasting. Introduction: How to use knowledge to predict from existing data, what will happen in future?. This is a fundamental problem.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
Application of Extreme Value Theory (EVT) in River Morphology
Lecture Slides Elementary Statistics Eleventh Edition
Extreme Value Theory for High-Frequency Financial Data
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Lecture Slides Elementary Statistics Twelfth Edition
Lecture Slides Elementary Statistics Twelfth Edition
Environmental Statistics
Identification of Extreme Climate by Extreme Value Theory Approach
Lecture Slides Essentials of Statistics 5th Edition
Lecture Slides Essentials of Statistics 5th Edition
Presentation transcript:

Extreme values and risk Adam Butler Biomathematics & Statistics Scotland CCTC meeting, September 2007

Extreme values and risk Extreme value theory (EVT) is a branch of statistics concerned with the frequency & size of rare events EVT methods are widely used in finance, hydrology & engineering, usually for risk assessment, but are not yet widely used in the biological sciences

Extreme values and risk Risk assessment: What is the probability we will have more than 100mm of rain on a given day? Risk management: I need to build a flood defense, and I want the probability that it fails on any particular day to be less than 1-in How high should it be?

Extreme values and risk What is the chance of getting a log daily return of less than –0.1? (i.e. a drop in value of 9% or more since the previous day)

Extreme values and risk Common features We are interested in a process that can be quantified, and for which we have some data …and we want to use these data to say something about the probability that a rare or extreme event will occur

Extreme values and risk We will usually be interested in events that are beyond the range of the data i.e. we want to extrapolate Extrapolation is rarely advisable, but it is sometimes unavoidable, especially when doing risk assessment The standard approach would be to assume that the data come from, for example, a normal distribution… jk

Extreme values and risk P(X < –0.1) 

Extreme values and risk …but: The extreme values don’t play much of a role when we estimate the parameters, so the model that we end up fitting might not describe the extreme values at all well…

Empirical: P(X < –0.05)  Normal: P(X < –0.05)  Extreme values and risk

…and, worse still, extrapolations beyond the range of the data often differ radically between models that provide a very similar fit to the bulk of the data…

Extreme values and risk Cauchy: P(X < –0.1)  0.02 Normal: P(X < –0.1) 

Extreme values and risk

In EVT we adopt the principle that we should only make use of the most extreme data that we have observed  we throw away almost all of the data

Extreme values and risk Threshold exceedances

Extreme values and risk

We consider exceedances of a high threshold EVT tells us that a good statistical model for exceedances, x, is the Generalised Pareto Distribution (GPD), P(x) = 1 – [1 +  (x /  )] -1/  (x > 0)  = “scale parameter”  = “shape parameter”

Extreme values and risk GPD: impact of the scale parameter  = 1  = 2  = 3  = 0  = “scale parameter”  = “shape parameter”

Extreme values and risk GPD: impact of the shape parameter  = 0  = 1  = -0.5  = 1  = “scale parameter”  = “shape parameter”

Extreme values and risk Threshold = u = 25mm  and  estimated by maximum likelihood to be 7.70 and respectively P(X > 100) estimated to be (once in a 131 years)

Extreme values and risk …but why is the GPD the “right model” to use? In theory: for almost any random variable X, the exceedances of a high threshold u will tend towards the GPD model as u tends towards infinity In practice: we use a threshold that is high but still finite: we rely on the fact that if this level is sufficiently high then the asymptotic result will still be approximately true

Extreme value methods “Parameter stability plot”

Extreme values and risk Other extreme value models A related approach involves analysing the maximum values per day, per month or per year (block maxima) EVT suggests that a good model to use in this case is the GEV (Generalised Extreme Value) ;

Extreme values and risk Advantages Robust Relies on weak assumptions Avoids bias Theoretically sound Justified by asymptotic theory Quick & relatively easy to use Honest …about the uncertainties involved in making statements about very rare events Disadvantages Inefficient Most of the data are thrown away …we may over-estimate uncertainty …relies on having a large sample size Asymptotics The theory only holds exactly for infinitely extreme events Difficult to extend to multivariate case Data quality Sensitive to errors in extreme data

Extreme values and risk Practicalities Basic course: Software: routines in… R, Genstat, S-plus, Matlab Extremes toolkit: Recommended book: Coles (2001) An introduction to statistical modeling of extreme values. Springer. Contact me: