Chris Ferro Climate Analysis Group Department of Meteorology University of Reading Extremes in a Varied Climate 1.Significance of distributional changes.

Slides:



Advertisements
Similar presentations
Introduction to modelling extremes
Advertisements

Review bootstrap and permutation
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Hydrologic Statistics
Beyond Null Hypothesis Testing Supplementary Statistical Techniques.
Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss Improving COSMO-LEPS forecasts of extreme events with.
Analysis of Extremes in Climate Science Francis Zwiers Climate Research Division, Environment Canada. Photo: F. Zwiers.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Chapter 7 Sampling and Sampling Distributions
Climate Change and Extreme Wave Heights in the North Atlantic Peter Challenor, Werenfrid Wimmer and Ian Ashton Southampton Oceanography Centre.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Experimental Evaluation
5-3 Inference on the Means of Two Populations, Variances Unknown
Tests of Hypothesis [Motivational Example]. It is claimed that the average grade of all 12 year old children in a country in a particular aptitude test.
Statistical Inference: Which Statistical Test To Use? Pınar Ay, MD, MPH Marmara University School of Medicine Department of Public Health
2015 AprilUNIVERSITY OF HAIFA, DEPARTMENT OF STATISTICS, SEMINAR FOR M.A 1 Hastie, Tibshirani and Friedman.The Elements of Statistical Learning (2nd edition,
Statistical Challenges in Climatology Chris Ferro Climate Analysis Group Department of Meteorology University of Reading ‘Climate is what we expect, weather.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
The Scientific Method Formulation of an H ypothesis P lanning an experiment to objectively test the hypothesis Careful observation and collection of D.
Experimental Evaluation of Learning Algorithms Part 1.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Confidence intervals and hypothesis testing Petter Mostad
Estimating  0 Estimating the proportion of true null hypotheses with the method of moments By Jose M Muino.
Extreme values and risk Adam Butler Biomathematics & Statistics Scotland CCTC meeting, September 2007.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
Extreme Value Theory: Part II Sample (N=1000) from a Normal Distribution N(0,1) and fitted curve.
A Survey of Statistical Methods for Climate Extremes Chris Ferro Climate Analysis Group Department of Meteorology University of Reading, UK 9th International.
CHEMISTRY ANALYTICAL CHEMISTRY Fall Lecture 6.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Machine Learning Chapter 5. Evaluating Hypotheses
Statistical Analyses of Extremes from a Regional Climate Model Chris Ferro Climate Analysis Group Department of Meteorology University of Reading Royal.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Methods for point patterns. Methods consider first-order effects (e.g., changes in mean values [intensity] over space) or second-order effects (e.g.,
New approaches in extreme-value modeling A.Zempléni, A. Beke, V. Csiszár (Eötvös Loránd University, Budapest) Flood Risk Workshop,
WCRP Extremes Workshop Sept 2010 Detecting human influence on extreme daily temperature at regional scales Photo: F. Zwiers (Long-tailed Jaeger)
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
ES 07 These slides can be found at optimized for Windows)
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
© Crown copyright 2007 Global changes in extreme daily temperature since 1950 using non-stationary extreme value analysis Simon Brown, John Caesar and.
Diagnostic verification and extremes: 1 st Breakout Discussed the need for toolkit to build beyond current capabilities (e.g., NCEP) Identified (and began.
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Climate change, hydrodynamical models & extreme sea levels Adam Butler Janet Heffernan Jonathan Tawn Lancaster University Department of Mathematics &
Lec. 19 – Hypothesis Testing: The Null and Types of Error.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Chapter 6 Sampling and Sampling Distributions
Chapter Nine Hypothesis Testing.
Application of the Bootstrap Estimating a Population Mean
Significance Test for the Difference of Two Proportions
Dr.MUSTAQUE AHMED MBBS,MD(COMMUNITY MEDICINE), FELLOWSHIP IN HIV/AIDS
Flood Frequency Analysis
When we free ourselves of desire,
Hydrologic Statistics
Simple Linear Regression
Chapter Outline Inferences About the Difference Between Two Population Means: s 1 and s 2 Known.
Presentation transcript:

Chris Ferro Climate Analysis Group Department of Meteorology University of Reading Extremes in a Varied Climate 1.Significance of distributional changes 2.Extreme-value analysis of gridded data

Significance of Changes Compare daily data at single grid point in Assume X 1, …, X n have same distribution Assume Y 1, …, Y n have same distribution

Quantiles x p is the p-quantile of the control if An estimate of x p is whereare the order statistics.

Daily T min at Wengen (DJF) Dots mark the 1, 5, 10, 25, 50, 75, 90, 95 and 99% quantiles

Confidence Intervals Quantify the uncertainty due to finite samples. A (1 – α)-confidence interval for is (L p, U p ) if

Resampling Replicating the experiment would reveal sampling variation of the estimate. Mimic replication by resampling from data. Must preserve any dependence –time (e.g. pre-whitening, blocking) –space (e.g. pairing)

Bootstrapping 1 For b = 1, …, B (large) resample {X 1 *, …, X n * } and {Y 1 *, …, Y n * } compute Then

Daily T min at Wengen (DJF) 90% confidence intervals

Descriptive Hypotheses 1.No change: Y  X d p = 0 for all p reject unless 2.Location change: Y  X + m for some m d p = m for some m and all p reject unless 3.Location-scale change: Y  sX + m 4.Scale change (if natural origin): Y  sX

Simultaneous Intervals Simultaneous (1 – α)-confidence intervals for d p 1, …, d p m are (L p 1, U p 1 ), …, (L p m, U p m ) if Wider than pointwise intervals, e.g.

Bootstrapping 2 For each p bootstrap set with k chosen to give correct confidence level: estimate level by proportion of the B sets with at least one point outside the intervals.

Daily T min at Wengen (DJF) 90% pointwise intervals 90% simultaneous intervals

Simplest hypothesis not rejected T min (DJF)T max (JJA)

Other issues Interpretation tricky when distributions change within samples: adjust for trends Bootstrapping quantiles is difficult: more sophisticated bootstrap methods Field significance Computational cost

Conclusions Quantiles describe entire distribution Confidence intervals quantify uncertainty Bootstrapping can account for dependence

Extreme-value Analysis Summarise extremes at each grid point Estimate return levels and other quantities Framework for quantifying uncertainty Summarise model output Validate and compare models Downscale model output

Annual Maxima Let Y s,t be the largest daily precipitation value at grid point s in year t. s = 1, …, grid points t = 1, …, 31 years Assume Y s,1, …, Y s,31 are independent and have the same distribution.

GEV Distribution Probability theory suggests the generalised extreme-value distribution for annual maxima: with parameters

Return Levels z s (m) is the m-year return level at s if i.e. exceeded once every m years on average.

Fitting the GEV Findto maximise likelihood: Estimate has variance Estimate is obtained from

Results 1 – Parameters μσγ

Results 1 – Return Levels z(100)% standard error

Pooling Grid Points Oftenif r is close to s. Assumeif r is a neighbour (r ~ s). Maximise Using 9 × 31 observations increases precision.

Standard Errors Taylor expansion yields Independence of grid points implies H = V, leaving For dependence, estimate V using variance of

Results 2 – Parameters μσγ Original Pooled

Results 2 – Return Levels z(100)% standard error Original Pooled

Local Variations Potential bias iffor r ~ s. Reduce bias by modelling, e.g. Should exploit physical knowledge.

Results 3 – Parameters μ Original Pooled II σγ

Results 2 – Parameters μσγ Original Pooled

Results 3 – Return Levels Original z(100)% standard error Pooled II

Results 2 – Return Levels z(100)% standard error Original Pooled

Conclusions Pooling can clarify extremal behaviour increase precision introduce bias Careful modelling can reduce bias

Future Directions Diagnostics for adequacy of GEV model Threshold exceedances; k-largest maxima Size and shape of neighbourhoods Less weight on more distant grid points More accurate standard error estimates Model any changes through time Clustering of extremes in space and time

References Davison & Hinkley (1997) Bootstrap Methods and their Application. Cambridge University Press. Davison & Ramesh (2000) Local likelihood smoothing of sample extremes. J. Royal Statistical Soc. B, 62, 191–208. Smith (1990) Regional estimation from spatially dependent data. Wilks (1997) Resampling hypothesis tests for autocorrelated fields. J. Climate, 10, 65 – 82.

Commentary Annual maximum at grid point s is GEV. 1.Estimateusing data at s. 2.Estimate using data in neighbourhood of s. Bias can occur if 3.Allow for local variation in parameters.