A Superior Alternative to the Modified Heidke Skill Score for Verification of Categorical Versions of CPC Outlooks Bob Livezey Climate Services Division/OCWWS/NWS.

Slides:



Advertisements
Similar presentations
Robin Hogan Ewan OConnor University of Reading, UK What is the half-life of a cloud forecast?
Advertisements

Measuring the performance of climate predictions Chris Ferro, Tom Fricker, David Stephenson Mathematics Research Institute University of Exeter, UK IMA.
Exeter, February The impenetrable hedge: some thoughts on propriety, equitability and consistency Ian Jolliffe Forecast verification Hedging Propriety.
1 Forecast verification - did we get it right? Ian Jolliffe Universities of Reading, Southampton, Kent, Aberdeen
Guidance of the WMO Commission for CIimatology on verification of operational seasonal forecasts Ernesto Rodríguez Camino AEMET (Thanks to S. Mason, C.
14 May 2001QPF Verification Workshop Verification of Probability Forecasts at Points WMO QPF Verification Workshop Prague, Czech Republic May 2001.
Confidence Intervals This chapter presents the beginning of inferential statistics. We introduce methods for estimating values of these important population.
Daria Kluver Independent Study From Statistical Methods in the Atmospheric Sciences By Daniel Wilks.
Forecasting Demand for Services
CHAPTER 2 Basic Descriptive Statistics: Percentages, Ratios and rates, Tables, Charts and Graphs.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
U.S Drought Highlights Since October 2004 Douglas Le Comte NOAA/CPC 30 th Annual Climate Diagnostics and Prediction Workshop October 24-28, 2005.
Probabilistic forecasts of (severe) thunderstorms for the purpose of issuing a weather alarm Maurice Schmeits, Kees Kok, Daan Vogelezang and Rudolf van.
Introduction to Seasonal Climate Prediction Liqiang Sun International Research Institute for Climate and Society (IRI)
© Crown copyright Met Office Operational OpenRoad verification Presented by Robert Coulson.
Products and Services to Support Decision Makers: Forecast Skill, L3MTO, and Beyond 2007 Committee for Environmental Services, Operations, and Research.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
4IWVM - Tutorial Session - June 2009 Verification of categorical predictands Anna Ghelli ECMWF.
Sampling Uncertainty in Verification Measures for Binary Deterministic Forecasts Ian Jolliffe and David Stephenson 1EMS September Sampling uncertainty.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Chapter 11 Goodness of Fit Test (section 11.2)
7.4 – Sampling Distribution Statistic: a numerical descriptive measure of a sample Parameter: a numerical descriptive measure of a population.
Heidke Skill Score (for deterministic categorical forecasts) Heidke score = Example: Suppose for OND 1997, rainfall forecasts are made for 15 stations.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
Latest results in verification over Poland Katarzyna Starosta, Joanna Linkowska Institute of Meteorology and Water Management, Warsaw 9th COSMO General.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
World Meteorological Organization Working together in weather, climate and water Enhanced User and Forecaster Oriented TAF Quality Assessment CAeM-XIV.
Regional Verification of CPC’s Seasonal Outlooks Michael Halpert & Kenneth Pelman Climate Prediction Center.
La Niña and The U. S. Winter Outlook Mike Halpert, Deputy Director Climate Prediction Center / NCEP December, 2011.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
L3MTO Transfer to CPC Preliminary Meeting February 7, 2008.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Chapter 13 Inference for Counts: Chi-Square Tests © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Chapter Outline Goodness of Fit test Test of Independence.
1 Chapter 10. Section 10.1 and 10.2 Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
NOAA / WSWC Workshop on Advancing Seasonal Prediction for Water Resources Jon Gottschalck and Dave DeWitt NWS / NCEP / Climate Prediction Center October.
Chapter 3: Organizing Data. Raw data is useless to us unless we can meaningfully organize and summarize it (descriptive statistics). Organization techniques.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Bob Livezey NWS Climate Services Seminar February 13, 2013.
U. Damrath, COSMO GM, Athens 2007 Verification of numerical QPF in DWD using radar data - and some traditional verification results for surface weather.
New NWS Western Region Local Climate Products 1 Marina Timofeyeva, 2 Andrea Bair and 3 David Unger 1 UCAR/NWS/NOAA 2 WR HQ/NWS/NOAA 3 CPC/NCEP/NWS/NOAA.
Verification of ensemble systems Chiara Marsigli ARPA-SIMC.
Improvements in Skill of CPC Outlooks Ed O’Lenic and Ken Pelman, NOAA-NWS-Climate Prediction Center 33rd Climate Diagnostics and Prediction Workshop, October.
Furthermore… References Katz, R.W. and A.H. Murphy (eds), 1997: Economic Value of Weather and Climate Forecasts. Cambridge University Press, Cambridge.
Alan F. Hamlet Andy Wood Dennis P. Lettenmaier JISAO Center for Science in the Earth System Climate Impacts Group and the Department.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
© Crown copyright Met Office Verifying modelled currents using a threshold exceedance approach Dr Ray Mahdon An exploration of the Gerrity Skill Score.
1 Recent Reduction in the Area of “EC” in CPC Outlooks and its Relationship to Value Ed O’Lenic, Kenneth Pelman, and David Unger NOAA-NWS-Climate Prediction.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
©2006 Thomson/South-Western 1 Chapter 12 – Analysis of Categorical Data Slides prepared by Jeff Heyl Lincoln University ©2006 Thomson/South-Western Concise.
I. Sanchez, M. Amodei and J. Stein Météo-France DPREVI/COMPAS
Bob Livezey and Marina Timofeyeva
Intensity-scale verification technique
Verifying and interpreting ensemble products
COSMO Priority Project ”Quantitative Precipitation Forecasts”
Zhong Liu1,2, Dana Ostrenga1,3, William Teng1,4, and Steven Kempler1
Country Presentation SASCOF September 2016 Maldives
Binary Forecasts and Observations
Bob Livezey and Marina Timofeyeva
Simultaneous Inferences and Other Regression Topics
Seasonal Degree Day Outlooks
Chi Square Two-way Tables
Validation-Based Decision Making
COSMO-LEPS Verification
Overview and Chi-Square
Forecast system development activities
the performance of weather forecasts
Inferences from Matched Pairs
Presentation transcript:

A Superior Alternative to the Modified Heidke Skill Score for Verification of Categorical Versions of CPC Outlooks Bob Livezey Climate Services Division/OCWWS/NWS 28 th Climate Diagnostics and Prediction Workshop Reno, October 20, 2003

Outline 1.Introduction 2. Contingency Tables & Notation 3.Common Scores & Score Attributes 4.Gandin & Murphy Equitable Scores 5.Gerrity Scores 6.Recommendations

Contingency Tables and Notation p ij: Joint relative frequencies p i: : Observed relative frequencies q i : Forecast relative frequencies p i * : Prescribed relative observed frequencies (climatology) Table 4.2. Contingency table giving pij in percent (total sample size n=788) for U.S. mean temperature forecasts for June through August

Some Simple Categorical Skill Scores: Heidke, CPC Heidke, and Pierce

Score InequitableHSSPSSGS FMA JJA Skill scores for U.S. mean temperature forecasts in three categories for February through April and June through August

Desirable Attributes of Scores Equitable; –Equitable without dependence on the forecast distribution; Rewards for correct forecasts inversely proportional to their event frequencies; Penalties for incorrect forecasts directly proportional to their event frequencies; –Penalties for incorrect ordinal forecasts with equal event frequencies proportional to degree of miss; Consistent with an underlying linear association and insensitive to type or number of categories used. Note: 2 &3 imply that all information in the contingency table is taken into account.

Figure 4.1

Gandin and Murphy Equitable Scores

Gerrity Scores

Figure 4.4

Recommendations CPC use the Gerrity score for ordinal multi- categorical verification –Forecast history is digitized so skill history can be constructed –Clueless audience remains clueless –Score now equitably accounts for all facets of forecast performance CPC use actual frequencies CPC routinely determine confidence limits of scores Reference Jolliffe and Stephenson (2003; Wiley)