Configuring, storing, and serving the next-generation global reforecast Tom Hamill NOAA Earth System Research Lab 1 NOAA Earth System Research Laboratory.

Slides:



Advertisements
Similar presentations
Water in the Atmosphere
Advertisements

C2 NWS Snow Model. C2 Snow Model Terms  SWE - Snow water equivalent  AESC - Areal extent of snow cover  Heat Deficit - Energy required to bring the.
A primer on ensemble weather prediction and the use of probabilistic forecasts Tom Hamill NOAA Earth System Research Laboratory Physical Sciences Division.
Statistical post-processing using reforecasts to improve medium- range renewable energy forecasts Tom Hamill and Jeff Whitaker NOAA Earth System Research.
Methods: TOPS Generated 1km monthly Tave, Precip, VPD and solar radiation surfaces from downscaled WCRP CMIP3 scenarios (Maurer et al., 2007) Input ParameterUnited.
Alan Robock Department of Environmental Sciences Rutgers University, New Brunswick, New Jersey USA
Instituting Reforecasting at NCEP/EMC Tom Hamill (ESRL) Yuejian Zhu (EMC) Tom Workoff (WPC) Kathryn Gilbert (MDL) Mike Charles (CPC) Hank Herr (OHD) Trevor.
Photosynthetically-active radiation (spectral portion, CI) h h h h h h h h.
Atmospheric Analysis Lecture 3.
The NCEP operational Climate Forecast System : configuration, products, and plan for the future Hua-Lu Pan Environmental Modeling Center NCEP.
Rapid Update Cycle Model William Sachman and Steven Earle ESC452 - Spring 2006.
Lecture ERS 482/682 (Fall 2002) Snow hydrology ERS 482/682 Small Watershed Hydrology.
Outline Background, climatology & variability Role of snow in the global climate system Indicators of climate change Future projections & implications.
Extreme Precipitation
HWRF Model Sensitivity to Non-hydrostatic Effects Hurricane Diagnostics and Verification Workshop May 4, 2009 Katherine S. Maclay Colorado State University.
Introduction to Numerical Weather Prediction and Ensemble Weather Forecasting Tom Hamill NOAA-CIRES Climate Diagnostics Center Boulder, Colorado USA.
ERS 482/682 Small Watershed Hydrology
Evaporative heat flux (Q e ) 51% of the heat input into the ocean is used for evaporation. Evaporation starts when the air over the ocean is unsaturated.
June 23, 2011 Kevin Werner NWS Colorado Basin River Forecast Center 1 NOAA / CBRFC Water forecasts and data in support of western water management.
Colorado Basin River Forecast Center Water Supply Forecasting Method Michelle Stokes Hydrologist in Charge Colorado Basin River Forecast Center April 28,
Soil Physics 2010 Outline Announcements Measuring evaporation.
CPC’s U.S. Seasonal Drought Outlook & Future Plans April 20, 2010 Brad Pugh, CPC.
MDSS Challenges, Research, and Managing User Expectations - Weather Issues - Bill Mahoney & Kevin Petty National Center for Atmospheric Research (NCAR)
Augmenting Hydro-MET Data Demands of Impact Assessment Models Team: IWMI (Charlotte, Solomon) Cornell (Tamo, Dan, Zach) BDU (Seifu, Esayas)
GLOBAL PATTERNS OF THE CLIMATIC ELEMENTS: (1) SOLAR ENERGY (Linked to solar insolation & R, net radiation)
1 Ensemble Reforecasts Presented By: Yuejian Zhu (NWS/NCEP) Contributors:
1 Global Ensemble Strategy Presented By: Yuejian Zhu (NWS/NCEP) Contributors: EMC staff.
Evaporation Slides prepared by Daene C. McKinney and Venkatesh Merwade
Notes on reforecasting and the computational capacity needed for future SREF systems Tom Hamill NOAA Earth System Research Lab presentation for 2009 National.
Distinct properties of snow
Part I The hydrologic cycle.  This is also called the water cycle  It is the recycling of the water between the oceans, land and atmosphere  There.
These notes are provided to help you pay attention IN class. If I notice poor attendance, fewer notes will begin to appear on these pages Snow Measuring.
NCA-LDAS Meeting, Sept 23, 2014 NCA-LDAS: An Integrated Terrestrial Water Analysis System for the National Climate Assessment “Water Indicators” Hiroko.
The Atmosphere: An Introduction to Meteorology, 12th
Water in the Atmosphere Evaporation Condensation and Cloud Formation.
Dataset Development within the Surface Processes Group David I. Berry and Elizabeth C. Kent.
NCEP Production Suite Review: Land-Hydrology at NCEP
Geneva 2-3 December 2011 Proposal for a sub-seasonal research data set.
Notes on reforecasting and the computational capacity needed for future SREF systems Tom Hamill NOAA Earth System Research Lab presentation for 2009 National.
1 Using reforecasts for probabilistic forecast calibration Tom Hamill & Jeff Whitaker NOAA Earth System Research Lab, Boulder, CO NOAA.
1 Takuya KOMORI 1 Kiyotomi SATO 1, Hitoshi YONEHARA 1 and Tetsuo NAKAZAWA 2 1: Numerical Prediction Division, Japan Meteorological Agency 2: Typhoon Research.
1 An overview of the use of reforecasts for improving probabilistic weather forecasts Tom Hamill NOAA / ESRL, Physical Sciences Div.
Lecture 8 Evapotranspiration (1) Evaporation Processes General Comments Physical Characteristics Free Water Surface (the simplest case) Approaches to Evaporation.
Modern Era Retrospective-analysis for Research and Applications: Introduction to NASA’s Modern Era Retrospective-analysis for Research and Applications:
Additional data sources and model structure: help or hindrance? Olga Semenova State Hydrological Institute, St. Petersburg, Russia Pedro Restrepo Office.
Insights from CMC BAMS, June Short Range The SPC Short-Range Ensemble Forecast (SREF) is constructed by post-processing all 21 members of the NCEP.
Statistical Post Processing - Using Reforecast to Improve GEFS Forecast Yuejian Zhu Hong Guan and Bo Cui ECM/NCEP/NWS Dec. 3 rd 2013 Acknowledgements:
CTB computer resources / CFSRR project Hua-Lu Pan.
Lecture 8: Precipitation and Evaporation Reading: Applied Hydrology Sec on evaporation and evapotranspiration.
NCEP Vision: First Choice – First Alert – Preferred Partner NOAA Hydrometeorological TestBed at the NCEP Hydrometeorological Prediction Center (HPC) Faye.
OEAS 604: Introduction to Physical Oceanography Surface heat balance and flux Chapters 2,3 – Knauss Chapter 5 – Talley et al. 1.
Welcome to the PRECIS training workshop
Atmospheric Moisture.
CMEMS Mediterranean MFC. Update on specific development for CMEMS Med-MFC V2 Analysis and Forecast Product 1.Data Assimilation: Grid point EOFs ( * )
1 DATA and TAPE OUTLAY FOR CFSRR Suru Saha and Patrick Tripp, CFSRR Project, EMC/NCEP 2008 Reanalysis: 1238 Tapes (867 TB) 2009 Reforecast: 3651 Tapes.
Evaporation What is evaporation? How is evaporation measured?
The Water Cycle. Think About It: Why is there humidity? There is moisture in the air. Why is there moisture in the air? It evaporates from lakes and oceans.
DOWNSCALING GLOBAL MEDIUM RANGE METEOROLOGICAL PREDICTIONS FOR FLOOD PREDICTION Nathalie Voisin, Andy W. Wood, Dennis P. Lettenmaier University of Washington,
Evaporation What is evaporation? How is evaporation measured? How is evaporation estimated? Reading for today: Applied Hydrology Sections 3.5 and 3.6 Reading.
ENVI 412 Hydrologic Losses and Radar Measurement Dr. Philip B. Bedient Rice University.
Data services: Archiving of UERRA data in MARS and dissemination Richard Mládek, Manuel Fuentes, Shahram Najm, Sebastien Villaume, Enrico Fucile (ECMWF)
Accessing EURO4M and UERRA data in ECMWF MARS archive Richard Mládek, Manuel Fuentes (ECMWF)
Lecture 8 Evapotranspiration (1)
Potential Evapotranspiration (PET)
Nathalie Voisin, Andy W. Wood and Dennis P. Lettenmaier
Watershed Hydrology NREM 691 Week 3 Ali Fares, Ph.D.
Energy Budgets Some parts of the earth receive a lot of solar energy (surplus), some receive less (deficit). In order to transfer this energy around, to.
Regional Drought and Crop Yield Information System (RDCYIS)
TFMM – Trends Available Meteorological Forcing
The Water Cycle.
Presentation transcript:

Configuring, storing, and serving the next-generation global reforecast Tom Hamill NOAA Earth System Research Lab 1 NOAA Earth System Research Laboratory

Outline Review of DOE grant & constraints Principles Anticipated users Proposed configuration Storage benchmarks Lists of proposed fields to archive. 2

What we have been granted by DOE, and what’s expected of us 14.5 M CPU hours (CPUh) on “Franklin” supercomputer at Lawrence Berkeley lab, all to be used before June – some of those cycles (10%) needed for generating initial conditions, too. We are expected to make this data set available to the community by mid-2011 and to generate some experimental forecasts for the renewable energy sector (streamflow, solar potential, wind- energy potential) 3

Principles Reforecasts will be computed with a (smaller- ensemble) version of the GEFS that will be operational in We hope that GEFS will remain in this configuration, or will be changed only slightly, for several years thereafter. Once GEFS changes, either EMC or ESRL will continue to run the reforecast version until a next-generation reforecast is in place. 4

Some anticipated users NCEP/CPC, for calibration of probabilistic 6-10 day, week-2 forecasts, development of new longer- lead threats assessments. NCEP/HPC and WFOs, for calibration of shorter-range heavy precipitation forecasts. OHD, universities, labs, and international organizations for ensemble streamflow calibration and other hydrologic calibration efforts Federal, state, local and private water, environmental and energy management agencies; e.g. these data will be used to help operate the NYC water supply system NHC, for statistical calibration of track and intensity forecasts of hurricanes, TCgenesis probabilities, etc. NCEP/SPC, for longer-lead calibrated severe weather forecast products. Universities, labs, ESRL, MDL, EMC, others for development of advanced statistical post-processing techniques, high-impact events. ESRL, NCEP/EMC, and universities, for data assimilation research into correcting bias in short-term forecasts. ESRL, as advertised in proposal, for R&D of longer-lead forecasts of solar-energy potential, wind- power potential. ERSL and EMC, for determining optimal reforecast configuration into the future. Many unforeseen users, if we make it easy for them to get the data. The agreed-upon reforecast configuration is not likely to be optimal for any one of these users, but we hope it will be satisfactory for most to make substantial progress. 5

A proposed configuration (based on 16 July 2010 telecon discussion) Assume T254L42 GFS model used for 0-8 days. From days , T190L42. Estimated CPUh for 16-day forecast on DOE Franklin. Say 30 years, 365 days/year, 10 members, 2x daily ~= 20.2M CPU h, which is greater than 14.5 M total allocation. Also, will need perhaps 10% of allocation for generating initial conditions, testing. Proposed configuration based on discussion: At 00Z, full 10-member forecast every day for 30 years out to 16 days. At 12Z, a T254L42 ensemble to day 8, 5 members, reforecast every other day. Total = M CPUh = M CPUh. Doable. Extra CPU time may be devoted to running some 45-day reforecasts at T126 for the sake of comparison against CFSR reforecasts, with coupled ocean model (it’d be nice to have a few spare cycles to do this). 6

Benchmarking the storage of full model Assume we want a permanent archive of the full model fields. Assume this data infrequently accessed, though if storage solution permits rapid access, great. The full output of a T254L42 sigma (upper-air) file is ~64 MB, and the “sflx” (surface) file is ~40 MB. “Restart” file at end of T254L28 run is ~168MB. Full output of a T190L42 sigma file is ~35 MB, and sflx file is 23 MB. “Restart” file at end of T190L28 run is ~93MB. Storage for a single 0- to 8-day T254L42 forecast: – Output saved 8x daily for 0-3 days, 4x daily for 3-16 days. Restart file at end of day 7.5 for the beginning of the T190L42 forecasts. – [3 days*8x daily days*4x daily]*[64+40 MB]+168MB restart=4.9 GB storage Storage for a single day T190L42 forecast: – Output saved 4x daily for 7.5 to 16 days, plus restart file at end of day 16. – [8.5 days*4x daily + 1]*[35+23 MB] + 93 MB = 2.2 GB 00Z run to 16 days: 10 mbrs * 365 days *( GB) * 30 years ~= 778 TB 12Z run to 8 days: 5 mbrs * (365/2 days) * 4.9 GB * 30 years ~= 134 TB Total ~= 912 TB for permanent archive of full fields 7

Benchmarking, part 2: user-selected fields on fast access. Assume users will want some subset of fields on fast-access. Fields are ~ those stored in TIGGE and NAEFS data sets, plus a few extra. See following slides. For selected fields and T254 on 768*384 grid, ~13 MB per time in Grib-2. For selected fields and T190 on 576*288 grid, ~7.3 MB per time in Grib-2. Storage of one run at 00Z cycle: For 0-3 days, 8x daily, 3-8 days, 4x daily = 45 output times. For days = 8.5 days * 4x daily + 1 = 35 output times: So 45*13 MB + 35*7.3 = 841 MB. Storage of one run to 8 days at 12Z cycle: 45 output times*13 MB = 585 MB Storage of proposed 30-year reforecast for 00Z configuration = 841 MB * 30 years * 365 days * 10 members = 92 TB. Storage of proposed 30-year reforecast for 12Z configuration = 585 MB * 30 years * (365/2 days) * 5 members = 17 TB = 109 TB for full TIGGE data set. Also, will need 2x for backup, so 218 TB. 8

To make this useful to you, we need … Fast-access storage for ~ 109 TB data, with redundancy Mass storage for ~ 912 TB data, ideally with redundancy. Servers, cooling for servers. Software to make it easy for you to fetch the data you need. Without an “enterprise” solution, costs will scale approximately linearly, so storage costs for 5 members will be roughly half that for 10 members. Lesson: we need to really have users for the data that we store. We have only $175K of committed funds now available for storage costs. – If you are urging we store even more, can you help pay for the extra archival costs? 9

Proposed fields for “fast” archive Mean and every member “Fast” archive will be on disk, readily accessible (as opposed to full model output that may be on some slower archival system) Mandatory level data: Geopotential height, temperature, u, v, specific humidity at 1000, 925, 850, 700, 500, 300, 250, 200, 100, 50, and 10 hPa. PV (K m 2 kg -1 s -1 ) on θ = 320K surface. Wind components, potential temperature on 2 PVU surface. Fixed fields saved once: – field capacity – wilting point – land-sea mask – terrain height 10 blue = new fields based on recent discussions

Proposed single-level fields for “fast” archive Surface pressure (Pa) Sea-level pressure (Pa) Surface (2-m) temperature (K) Skin temperature (K) Maximum temperature since last storage time (K) Minimum temperature since last storage time (K) Soil temperature (0-10 cm; K) Volumetric soil moisture content (proportion, 0-10 cm) – kg/m 3 in TIGGE archive Total accumulated precipitation since beginning of integration (kg/m 2 ) Precipitable water (kg/m 2, vapor only, no condensate) Specific humidity at 2-m AGL (kg/kg; instantaneous) – surface dewpoint in TIGGE archive Water equivalent of accumulated snow depth (kg/m 2 ) – from beginning of integration? CAPE (J/kg) CIN (J/kg) Total cloud cover (%) 10-m u- and v-wind component (m/s) 80-m u- and v-wind component (m/s) Sunshine duration (min) Snow depth water equivalent (kg/m 2 ) Runoff Solid precipitation Liquid precipitation Vertical velocity (850 hPa) Geopotential height of surface Wind power (=windspeed 3 at 80 m*density) Saturation vapor pressure deficit 11

Proposed fields for “fast” archive Fluxes (W/m 2 ; average since last archive time) – sensible heat net flux at surface – latent heat net flux at surface – downward long-wave radiation flux at surface – upward long-wave radiation flux at surface – upward short-wave radiation at surface – downward short-wave radiation flux at surface – upward long-wave radiation at nominal top – ground heat flux. 12