Tom Hamill and Michael Scheuerer ESRL, Physical Sciences Division

Slides:



Advertisements
Similar presentations
Chapter 13 – Weather Analysis and Forecasting
Advertisements

ECMWF long range forecast systems
SNPP VIIRS green vegetation fraction products and application in numerical weather prediction Zhangyan Jiang 1,2, Weizhong Zheng 3,4, Junchang Ju 1,2,
Report of the Q2 Short Range QPF Discussion Group Jon Ahlquist Curtis Marshall John McGinley - lead Dan Petersen D. J. Seo Jean Vieux.
Statistical post-processing using reforecasts to improve medium- range renewable energy forecasts Tom Hamill and Jeff Whitaker NOAA Earth System Research.
Downscaling ensembles using forecast analogs Jeff Whitaker and Tom Hamill
Details for Today: DATE:3 rd February 2005 BY:Mark Cresswell FOLLOWED BY:Assignment 2 briefing Evaluation of Model Performance 69EG3137 – Impacts & Models.
Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss Improving COSMO-LEPS forecasts of extreme events with.
Improving Excessive Rainfall Forecasts at HPC by using the “Neighborhood - Spatial Density“ Approach to High Res Models Michael Eckert, David Novak, and.
Instituting Reforecasting at NCEP/EMC Tom Hamill (ESRL) Yuejian Zhu (EMC) Tom Workoff (WPC) Kathryn Gilbert (MDL) Mike Charles (CPC) Hank Herr (OHD) Trevor.
Improving Probabilistic Ensemble Forecasts of Convection through the Application of QPF-POP Relationships Christopher J. Schaffer 1 William A. Gallus Jr.
MOS Developed by and Run at the NWS Meteorological Development Lab (MDL) Full range of products available at:
Hydrometeorological Prediction Center HPC Medium Range Grid Improvements Mike Schichtel, Chris Bailey, Keith Brill, and David Novak.
New Local Climate Outlook Products from the NWS Andrea Bair NOAA/NWS Western Region Headquarters Climate Services Program Manager.
A statistical downscaling procedure for improving multi-model ensemble probabilistic precipitation forecasts Tom Hamill ESRL Physical Sciences Division.
MOS Performance MOS significantly improves on the skill of model output. National Weather Service verification statistics have shown a narrowing gap between.
Introduction to Numerical Weather Prediction and Ensemble Weather Forecasting Tom Hamill NOAA-CIRES Climate Diagnostics Center Boulder, Colorado USA.
¿How sensitive are probabilistic precipitation forecasts to the choice of the ensemble generation method and the calibration algorithm? Juan J. Ruiz 1,2.
Ensemble Post-Processing and it’s Potential Benefits for the Operational Forecaster Michael Erickson and Brian A. Colle School of Marine and Atmospheric.
Chapter 13 – Weather Analysis and Forecasting. The National Weather Service The National Weather Service (NWS) is responsible for forecasts several times.
Improving Medium-Range Ensemble-Based QPF over the Western United States Trevor Alcott and Jon Rutz NOAA/NWS WR-STID Jim Steenburgh University of Utah.
NCPP – needs, process components, structure of scientific climate impacts study approach, etc.
1 Ensemble Reforecasts Presented By: Yuejian Zhu (NWS/NCEP) Contributors:
Forecasting and Numerical Weather Prediction (NWP) NOWcasting Description of atmospheric models Specific Models Types of variables and how to determine.
ESA DA Projects Progress Meeting 2University of Reading Advanced Data Assimilation Methods WP2.1 Perform (ensemble) experiments to quantify model errors.
Rank Histograms – measuring the reliability of an ensemble forecast You cannot verify an ensemble forecast with a single.
ISDA 2014, Feb 24 – 28, Munich 1 Impact of ensemble perturbations provided by convective-scale ensemble data assimilation in the COSMO-DE model Florian.
Exploring sample size issues for 6-10 day forecasts using ECMWF’s reforecast data set Model: 2005 version of ECMWF model; T255 resolution. Initial Conditions:
Intraseasonal TC prediction in the southern hemisphere Matthew Wheeler and John McBride Centre for Australia Weather and Climate Research A partnership.
How can LAMEPS * help you to make a better forecast for extreme weather Henrik Feddersen, DMI * LAMEPS =Limited-Area Model Ensemble Prediction.
Preliminary Results of Global Climate Simulations With a High- Resolution Atmospheric Model P. B. Duffy, B. Govindasamy, J. Milovich, K. Taylor, S. Thompson,
A Comparison of the Northern American Regional Reanalysis (NARR) to an Ensemble of Analyses Including CFSR Wesley Ebisuzaki 1, Fedor Mesinger 2, Li Zhang.
The Application of Reforecasts at WPC Thomas Workoff 1, Wallace Hogsett, Faye Barthold 2 and David Novak Weather Prediction Center 1 Systems Research Group,
June 19, 2007 GRIDDED MOS STARTS WITH POINT (STATION) MOS STARTS WITH POINT (STATION) MOS –Essentially the same MOS that is in text bulletins –Number and.
VERIFICATION OF NDFD GRIDDED FORECASTS IN THE WESTERN UNITED STATES John Horel 1, David Myrick 1, Bradley Colman 2, Mark Jackson 3 1 NOAA Cooperative Institute.
Improving Ensemble QPF in NMC Dr. Dai Kan National Meteorological Center of China (NMC) International Training Course for Weather Forecasters 11/1, 2012,
Celeste Saulo and Juan Ruiz CIMA (CONICET/UBA) – DCAO (FCEN –UBA)
Notes on reforecasting and the computational capacity needed for future SREF systems Tom Hamill NOAA Earth System Research Lab presentation for 2009 National.
1 Agenda Topic: National Blend Presented By: Kathryn Gilbert (NWS/NCEP) Team Leads: Dave Myrick, David Ruth (NWS/OSTI/MDL), Dave Novak (NCEP/WPC), Jeff.
1 Using reforecasts for probabilistic forecast calibration Tom Hamill & Jeff Whitaker NOAA Earth System Research Lab, Boulder, CO NOAA.
Ensembles. Ensemble Methods l Construct a set of classifiers from training data l Predict class label of previously unseen records by aggregating predictions.
1 An overview of the use of reforecasts for improving probabilistic weather forecasts Tom Hamill NOAA / ESRL, Physical Sciences Div.
A recommended reforecast configuration for the NCEP GEFS Tom Hamill NOAA/ESRL Physical Sciences Division (303)
Model Post Processing. Model Output Can Usually Be Improved with Post Processing Can remove systematic bias Can produce probabilistic information from.
. Outline  Evaluation of different model-error schemes in the WRF mesoscale ensemble: stochastic, multi-physics and combinations thereof  Where is.
APPLICATION OF NUMERICAL MODELS IN THE FORECAST PROCESS - FROM NATIONAL CENTERS TO THE LOCAL WFO David W. Reynolds National Weather Service WFO San Francisco.
18 September 2009: On the value of reforecasts for the TIGGE database 1/27 On the value of reforecasts for the TIGGE database Renate Hagedorn European.
Statistical Post Processing - Using Reforecast to Improve GEFS Forecast Yuejian Zhu Hong Guan and Bo Cui ECM/NCEP/NWS Dec. 3 rd 2013 Acknowledgements:
Federal Department of Home Affairs FDHA Federal Office of Meteorology and Climatology MeteoSwiss A more reliable COSMO-LEPS F. Fundel, A. Walser, M. A.
Exploring Multi-Model Ensemble Performance in Extratropical Cyclones over Eastern North America and the Western Atlantic Ocean Nathan Korfe and Brian A.
Common verification methods for ensemble forecasts
2015 Production Suite Review: Report from NHC 2015 Production Suite Review: Report from NHC Eric S. Blake, Richard J. Pasch, Andrew Penny NCEP Production.
DOWNSCALING GLOBAL MEDIUM RANGE METEOROLOGICAL PREDICTIONS FOR FLOOD PREDICTION Nathalie Voisin, Andy W. Wood, Dennis P. Lettenmaier University of Washington,
VERIFICATION OF A DOWNSCALING SEQUENCE APPLIED TO MEDIUM RANGE METEOROLOGICAL PREDICTIONS FOR GLOBAL FLOOD PREDICTION Nathalie Voisin, Andy W. Wood and.
Probabilistic Forecasts Based on “Reforecasts” Tom Hamill and Jeff Whitaker and
Kathryn Gilbert, Project Manager David Myrick, Deputy Project Manager Wednesday, February 18, 2015 Co-Authors: Jeff Craven, Dave Novak, Tom Hamill, Jim.
NOAA Northeast Regional Climate Center Dr. Lee Tryhorn NOAA Climate Literacy Workshop April 2010 NOAA Northeast Regional Climate.
Translating Advances in Numerical Weather Prediction into Official NWS Forecasts David P. Ruth Meteorological Development Laboratory Symposium on the 50.
National Oceanic and Atmospheric Administration’s National Weather Service Colorado Basin River Forecast Center Salt Lake City, Utah 11 The Hydrologic.
Improving Numerical Weather Prediction Using Analog Ensemble Presentation by: Mehdi Shahriari Advisor: Guido Cervone.
A Guide to Tropical Cyclone Guidance
Verifying and interpreting ensemble products
Precipitation Products Statistical Techniques
Nathalie Voisin, Andy W. Wood and Dennis P. Lettenmaier
MOS Developed by and Run at the NWS Meteorological Development Lab (MDL) Full range of products available at:
Post Processing.
The Importance of Reforecasts at CPC
University of Washington Center for Science in the Earth System
Christoph Gebhardt, Zied Ben Bouallègue, Michael Buchhold
Short Range Ensemble Prediction System Verification over Greece
Presentation transcript:

The science behind multi-model precipitation probability in the “National Blend of Models” Tom Hamill and Michael Scheuerer ESRL, Physical Sciences Division tom.hamill@noaa.gov; michael.scheuerer@noaa.gov (303) 497-3060; (303) 497-4281

“National Blend of Models” project Improve NWS National Digital Forecast Database (NDFD) fields using multi-model ensemble and multiple deterministic forecasts. Reduce need for forecaster manual intervention, free up forecasters for more decision support. Eventually “Blend” will cover: all NDFD variables (e.g., Tmax, Tmin, T, Td, wind speed, direction, gust, sky cover, precip amount, snow amount, wave height) on 2.5-km grid. all US regions (CONUS, AK, HI, PR, Guam) May expand to PQPF in the future PSD’s role: improve multi-model precipitation, and more specifically here, POP. Asked to focus on +3 to +8 day period initially.

NDFD POP product example One’s eye is drawn to major changes in POP at WFO boundaries. Can we make POP so skillful and reliable that forecasters will need to modify centralized guidance less frequently?

Challenge: for NWS “National Blend” project, little training data exists, or it’s hard to get (and archive). GEFS: will change soon to new version, obviating the existing 00Z cycle reforecasts. ECMWF: ~20-year, 1x weekly reforecasts, but have not been made available to us (yet). CMC: ~ 5 years of reforecasts based on ERA-Interim with near-surface adjustment; not available for this project (yet). No reforecasts for control/deterministic of various centers. This limits the range of post-processing methodologies we feel comfortable trying.

What might we reasonably do to post-process multi-model ensemble guidance? Multi-model reliability, Day +5, > 1 mm Previously (Hamill, July 2012 MWR) multi-model ensembles for POP (July-Oct 2010) averaged to 1 degree grid verified against 1-degree CCPA. This procedure produced reliable and skillful POP forecasts. Models were ECMWF, UK Met Office, CMC, and NCEP. 1-degree forecasts are not high-enough resolution for Nat’l Blend; desire on 2.5-km grid.

Why we don’t expect >10-km global ensemble forecasts to be reliable when verified against < 10-km analyses. Problem 1: there is coherent climatological detail, especially in the western US at scales < 10 km, below the model-resolvable scales.

Problem 2: sub-grid variability large scale precip amount 1 2 3 4 5 east-west grid box # Here we have a precipitation forecast along a segment of a latitude circle.

Problem 2: sub-grid variability large scale precip amount fine scale 1 2 3 4 5 east-west grid box # Here’s one possible fine-scale analysis that is consistent with the large scale.

Problem 2: sub-grid variability large scale precip amount fine scale 1 2 3 4 5 east-west grid box # But here’s yet another. There are infinitely many fine-scale vectors that will spatially average to the correct large-scale vector. So, when simulating a high-resolution ensemble, one should account for sub-gridscale variability in a realistic manner.

Proposed downscaling methodology Ensemble member’s forecast Precipitation amount 1 model grid point Repeat this process for every member at every grid point: Consider coarse resolution ensemble member forecast at a given grid point.

Proposed downscaling methodology Matching coarse- resolution precip. analysis Ensemble member’s forecast Precipitation amount Precipitation amount 1 model grid point 1 model grid point Repeat this process for every member at every grid point: Consider coarse resolution ensemble member forecast at a given grid point. Find a past coarse-resolution analysis close in value to this forecast.

Proposed downscaling methodology Matching coarse- and fine-resolution precip. analyses Ensemble member’s forecast Precipitation amount Precipitation amount 1 model grid point 1 model grid point Repeat this process for every member at every grid point: Consider coarse resolution ensemble member forecast at a given grid point. Find a past coarse-resolution analysis close in value to this forecast. Extract the fine-resolution analysis at the same day and location.

Proposed downscaling methodology Matching coarse- and fine-resolution precip. analyses Ensemble member’s forecast Precipitation amount Precipitation amount 1 model grid point 1 model grid point Repeat this process for every member at every grid point: Consider coarse resolution ensemble member forecast at a given grid point. Find a past coarse-resolution analysis close in value to this forecast. Extract the fine-resolution analysis at the same day and location. Define the downscaling vector.

Proposed downscaling methodology Matching coarse- and fine-resolution precip. analyses Downscaling vector + coarse-resolution forecast. Ensemble member’s forecast Precipitation amount Precipitation amount Precipitation amount 1 model grid point 1 model grid point 1 model grid point Repeat this process for every member at every grid point: Consider coarse resolution ensemble member forecast at a given grid point. Find a past coarse-resolution analysis close in value to this forecast. Extract the fine-resolution analysis at the same day and location. Define the downscaling vector. Add vector to the coarse-res. forecast to define the downscaled forecast.

A different problem: systematic biases such as over-forecasting of drizzle. Some regions, such as in N Nevada, models tend to produce precipitation unrealistically, day after day. Such systematic errors may affect POP reliability.

CDF-based bias correction, or “quantile mapping” A challenge is that if the CDFs are generated from small samples (with CDFs computed separately for each grid point) may be noisy if computed using a small training sample size.

Potential remedy for small sample size: use supplemental data locations Idea is to supplement training data at each grid point using n extra grid points that have similar observed climatology and similar forecast-observed relationships, as determined from GEFS reforecasts. In this plot, big symbol is where we’re training, smaller symbol where we’re supplementing training data.

Smoothing the POP forecasts Savitzky-Golay (S-G) filter used. For more details, see: http://research.microsoft.com/en-us/um/people/jckrumm/SavGol/SavGol.htm http://www.wire.tu-bs.de/OLDWEB/mameyer/cmr/savgol.pdf (good tutorial) Numerical Recipes text. I coded up the S-G filter with a window size of 9 and using 3rd-order polynomial fitting.

Savitzky-Golay filter designed to preserve amplitudes, unlike more common nxn block smoothers from Press et al. Numerical Recipes text

Multi-model POP with statistical downscaling (but w/o Savitzky-Golay)

Multi-model POP with statistical downscaling (but WITH Savitzky-Golay) In this implementation, more smoothing is applied in less varied terrain of eastern US than in more varied terrain of western US.

Strategy for POP forecasting implemented here. POPs via multi-model ensemble, but with two modifications: Adjust each member with a CDF-based bias correction. Apply statistical downscaling as described previously. Data used in subsequent demo: 108-120 h coarse res. precip. forecasts, ensembles and deterministic for Jan 2014 from ECMWF, NCEP GEFS, UK Met, CMC. CDF bias correction trained using Nov-Dec 2013 108-120 h forecasts Long time series of both hi-res. and coarse-res. CCPA precip. analyses (2002-2013) for statistical downscaling. Technique: (1) CDF bias correct the forecasts using the past 60 days of forecasts and observations. (2) Statistically downscale each member. (3) Compute probability from ensemble relative frequency. (4) Apply Savitzky-Golay smoothing to mimimize noise from sampling error.

POP Reliability and skill, before & after The BSS numbers look rather high, even for the raw ensembles (next slide). I think this is because: no big “dropouts” this month? (b) forecasts were very different than the long-term climatology, so skill relative to climatology was larger (the month was abnormally dry in many locations).

Reliability and skill of raw ensembles Over this sample, the multi-model raw ensemble skill wasn’t better than that of the best forecast, from ECMWF. ECMWF NCEP CMC UK Met Office

Case study

Benefits of CDF correction may not generalize to higher precipitation amounts Applying the same methodology to > 10 mm 12 h-1 forecasts, we see little improvement as a result of statistical downscaling or CDF-based bias corrections. It’s likely that 60 days, even with supplemental locations, is not enough samples to properly adjust > 10 mm forecasts (reforecasts and/or other post-processing approach needed)

Thinking ahead: How to improve PQPF for rare events Next-Generation Global Prediction System priorities include: Development of reanalysis/reforecast data sets to provide large training samples needed to improve probabilistic forecasts of rare events like heavy precipitation. Development and deployment of advanced post-processing methods. Where are we with these developments?

Reanalysis / reforecast: where are we? What’s important is to have reforecasts be statistically consistent with real-time forecasts. To do this, probably need reanalysis using same model, same data assimilation procedure as used for real-time GFS analysis. Existing reanalysis is now ~ 7 years old, T382 3D-Var; new analyses soon to be T1534 4D-En-Var. So we need new reanalysis. ESRL/PSD funded to begin reanalysis work, expect funding to continue it in FY2016 with EMC, CPC. Reforecast generation is straightforward (if computationally expensive) once reanalysis is generated.

Advanced post-processing methods: where are we? My PSD colleague Michael Scheuerer has Scheuerer and Hamill, 2015 MWR, conditionally accepted.

Gamma distribution’s shift, location, and scale parameters set with ensemble data This produces predictive distributions with a variety of shapes depending on the ensemble mean, spread, and other predictors Ref: ibid.

Brier Skill Scores Rank analog (no supplemental locations) At the higher thresholds, the CSGD forecasts significantly outperform the rank analog. No data from supplemental training locations used in either method here. Censored, shifted gamma distributions

Reliability diagrams, +108 to +120 h Rank analog, no supplemental locations Censored, shifted gamma distribution fitting

Improved high-end sharpness In this case (one where the raw ensemble guidance was quite good), one can see that the CGSD approach produced much higher probabilities of heavy precipitation than the analog.

How skillful is CSGD when using smaller training sample sizes? There is some drop-off in skill, shown here, and reliability (not shown) when decreasing samples to 3- or 1 year of training data from the original 2002-2014 training data. Use of supplemental locations to increase the sample size does have a positive benefit. Further tests and comparisons underway.

Conclusion / next steps It appears to be possible to add skill, improve reliability of multi-model ensemble POP with statistical downscaling,CDF quantile mapping, and judicious smoothing. Improvements to probabilities for higher event thresholds more marginal with this method, probably because CDFs used in bias correction are noisy with small sample size at higher quantiles. Our group is working on more advanced methods, including censored, shifted Gamma distributions, which show promise for post-processing an individual model. This method is being evaluated in real time by forecasters. Pending evaluation, will migrate this technology to downscale using high-resolution Stage-IV precipitation analyses.

Supplementary slides

Methodology for weighting smoothed vs. original ensemble probabilities.

Terrain-height variations

Weight applied adjusted ensemble probabilities Weight applied to CDF bias-corrected and downscaled multi-model ensemble probabilities