“It Slices, It Dices, and Area Subsets” SUMMARY AND FUTURE WORK

Slides:



Advertisements
Similar presentations
Conversion of CPC Monitoring and Forecast Products to GIS Format Viviane Silva Lloyd Thomas, Mike Halpert and Wayne Higgins.
Advertisements

1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
WOCE Global Data V3 WOCE-DPC Report Nathan Bindoff and David M. Legler Co-Chairs, WOCE DPC WOCE Conference November 2002 All of it.
The Model Output Interoperability Experiment in the Gulf of Maine: A Success Story Made Possible By CF, NcML, NetCDF-Java and THREDDS Rich Signell (USGS,
Integrating NOAA’s Unified Access Framework in GEOSS: Making Earth Observation data easier to access and use Matt Austin NOAA Technology Planning and Integration.
The International Surface Pressure Databank (ISPD) and Twentieth Century Reanalysis at NCAR Thomas Cram - NCAR, Boulder, CO Gilbert Compo & Chesley McColl.
Distribution of Real-Time and Retrospective NCEP Model Data Sets RT-NOMADS Jordan C. Alpert Jun Wang
Активное распределенное хранилище для многомерных массивов Дмитрий Медведев ИКИ РАН.
NOMADS NOAA Operational Model Archive and Distribution System Jordan C. Alpert NOAA System-of-Systems Workshop, Silver Spring, MD.
Introduction Downloading and sifting through large volumes of data stored in differing formats can be a time-consuming and sometimes frustrating process.
FALL 2005CSI 4118 – UNIVERSITY OF OTTAWA1 Part 4 Web technologies: HTTP, CGI, PHP,Java applets)
By Tim Ahern, Rick Benson, Rob Casey, Chad Trabant and Bruce Weertman and many more talented people at the IRIS DMC.
DATA COMMUNICATION DONE BY: ALVIN SAMPATH CARLVIN SAMPATH.
Bringing it All Together: NODC’s Geoportal Server as an Integration Tool for Interoperable Data Services Kenneth S. Casey, Ph.D. YuanJie Li NOAA National.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
Web Services for Earth Science Data Edward Armstrong, Thomas Huang, Charles Thompson, Nga Quach, Richard Kim, Zhangfan Xing Winter ESIP 2014 Washington.
NOAA’s National Weather Service National Digital Forecast Database: Status Update LeRoy Spayd Chief, Meteorological Services Division Unidata Policy Committee.
Integrated Model Data Management S.Hankin ESMF July ‘04 Integrated data management in the ESMF (ESME) Steve Hankin (NOAA/PMEL & IOOS/DMAC) ESMF Team meeting.
Integrated Grid workflow for mesoscale weather modeling and visualization Zhizhin, M., A. Polyakov, D. Medvedev, A. Poyda, S. Berezin Space Research Institute.
ESIP Federation 2004 : L.B.Pham S. Berrick, L. Pham, G. Leptoukh, Z. Liu, H. Rui, S. Shen, W. Teng, T. Zhu NASA Goddard Earth Sciences (GES) Data & Information.
DATABASE MANAGEMENT SYSTEMS CMAM301. Introduction to database management systems  What is Database?  What is Database Systems?  Types of Database.
Climate Data Formats Deniz Bozkurt
NQuery: A Network-enabled Data-based Query Tool for Multi-disciplinary Earth-science Datasets John R. Osborne.
NOAA’s National Climatic Data Center Climate Service Partnership Activities At NOAA’s National Climatic Data Center Tim Owen Climate Prediction Applications.
Information Technology: GrADS INTEGRATED USER INTERFACE Maps, Charts, Animations Expressions, Functions of Original Variables General slices of { 4D Grids.
Research & Development Building a science foundation for sound environmental decisions Remote Sensing Information Gateway (RSIG)
Product-Generation in ESG: some explorations of the user experience Steve Hankin – March, 2007.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
SCD Research Data Archives; Availability Through the CDP About 500 distinct datasets, 12 TB Diverse in type, size, and format Serving 900 different investigators.
GIS for Atmospheric Sciences and Hydrology By David R. Maidment University of Texas at Austin National Center for Atmospheric Research, 6 July 2005.
Data Stewardship at the NOAA Data Centers Sub Topic - Value Added Products ESIP Federation Meeting, Washington, DC January 6-8, 2009.
TIGGE Archive Access at NCAR Steven Worley Doug Schuster Dave Stepaniak Hannah Wilcox.
Distributed Data Servers and Web Interface in the Climate Data Portal Willa H. Zhu Joint Institute for the Study of Ocean and Atmosphere University of.
Climate-SDM (1) Climate analysis use case –Described by: Marcia Branstetter Use case description –Data obtained from ESG –Using a sequence steps in analysis,
June 27-29, DC2 Software Workshop - 1 Tom Stephens GSSC Database Programmer GSSC Data Servers for DC2.
Jordan C. Alpert/NCEP/EMC Glenn Rutledge/NCDC Jun Wang/SAIC where the nation’s climate and weather services.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
The NOAA Operational Model Archive and Distribution System NOMADS CEOS-Grid Application Status Report Glenn K. Rutledge NOAA NCDC CEOS WGISS-19 Cordoba,
NcBrowse: A Graphical netCDF File Browser Donald Denbo NOAA-PMEL/UW-JISAO
The TIGGE Model Validation Portal: An Improvement in Data Interoperability 1 Thomas Cram Doug Schuster Hannah Wilcox Michael Burek Eric Nienhouse Steven.
IRI/LDEO Climate Data Library M.Benno Blumenthal, Michael Bell, and John del Corral International Research Institute for Climate and Society Columbia University.
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
Client/Server Technology
Stavroula Balopoulou , Angelo Lykiardopoulos, Sissy Iona HCMR-HNODC
Data Browsing/Mining/Metadata
Distributed Control and Measurement via the Internet
2nd GEO Data Providers workshop (20-21 April 2017, Florence, Italy)
Development Environment
SRNWP Interoperability Workshop
MERRA Data Access and Services
Porting MM5 and BOLAM codes to the GRID
Flanders Marine Institute (VLIZ)
INITIAL WORLD OCEAN DATABASE COLLECTION
CUAHSI HIS Sharing hydrologic data
WaterWare description
CHAPTER 3 Architectures for Distributed Systems
TIGGE Data Archive and Access System at NCAR
Multidimensional Data and GIS
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System Zaihua Ji Doug Schuster Steven Worley Computational.
Prepared by: Jennifer Saleem Arrigo, Program Manager
Grid Computing Done by: Shamsa Amur Al-Matani.
Software models - Software Architecture Design Patterns
Unit# 5: Internet and Worldwide Web
Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta
SLAC monitoring Web Services
ExPLORE Complex Oceanographic Data
The Internet and Electronic mail
how users and data producers interact on WIS
OPeNDAP/Hyrax Interfaces
Comeaux and Worley, NSF/NCAR/SCD
Presentation transcript:

“It Slices, It Dices, and Area Subsets” SUMMARY AND FUTURE WORK NOAA Weather Model Output Data for Decision Support Systems, Including Aggregation of Ensembles ESIP Winter Meeting, January 6-8, 2015, Washington D.C. Earth Science and Data in Support of … Climate, Energy, Water Nexus. UPoster Jordan C. Alpert Glenn Rutledge NOAA/NWS/NCEP/Environmental Modeling Center, NOAA Center Weather and Climate Prediction , College Park, Maryland 20740 Climate Services Division, National Climate Data Center 151 Patton Avenue, Asheville NC 28801 “It Slices, It Dices, and Area Subsets” Want Data for Decision Support? Use Server Aggregated Data Sets Model output datasets at National Centers are formatted as Gridded Binary (GRIB) and in a few cases NETCDF files. NOMADS services are format independent using Grads Data Server GDS/OPeNDAP services and NOMADS also includes applications to repackage Grib data sets: The NWS/NCEP Environmental Modeling Center Want Operational Real Time Model Grid Data? Using http queries, a URL request extracts data from the GDS server: An example using GEFS aggregated global ensemble data set from a NOMADS OPeNDAP/GDS constrained query is below: EMC is responsible for the enhancements, transitions-to-operations, and maintenance of more than 20 numerical prediction systems comprising NCEP's operational production suite. NOMADS real-time services are supported by NOAA/NCEP, available at http://nomads.ncep.noaa.gov/. This server provides all the real time weather service model output data. Access to NCEP's operational data sets as they are being generated for up to a month in the past for some data sets (we still call this real time). 24x7 monitoring and data flow by NCEP staff and backup server to insure high availability for time critical access. Long-term archives (many years) for some data sets is at National Climatic Data Center (NCDC): http://nomads.ncdc.noaa.gov/ “Gribfilter” streams data from our server to yours. A web page interface filter is in place to allow the user, for any model run, to select files by forecast time, (when you want only a section of a huge data file rather than transferring the entire file) by region, vertical level(s), for any of the model’s variables. One selects the files to "slice and dice" the needed data set, and users customize their own data flow, receiving a re-packaged GRIB file streamed to the user workstation or the “URL commands” for program scripting. For details see: http://www.cpc.ncep.noaa.gov/products/wesley/scripting_grib_filter.html The resulting repackaged GRIB file is streamed to the users work station or the “Show the URL only for web programming” check box provides a custom URL address and keywords to download the repackaged dataset if used as the object of a non-interactive web download command such as wget or cURL. These can be used in a user script in virtually any language and placed in a cron scheduler for user custom high availability data flow. http://nomads.ncep.noaa.gov:9090/dods/gens/gens20090501/gep_all_00z.ascii?tmpprs[0:20][0:21][1:1][129:129][243:243] Notice the 5-Dimensional query for each variable (6-D data cube!), for temperature (tmpprs): ith Ensemble component [0:20] Lists all 20 Ensemble components plus control at, Forecast times [0:21] IC and every 6-hour interval to 5-days is indicated, Vertical levels [1:1] 975Mb indicated, and [0:0] would mean 1000mb, Latitude, [129:129] is measured from SP (0) to NP (for a 1x1 degree grid) Longitude, [243:243] Beginning at the 0 meridian – we show BWI airport The ordering of the square bracketed values constrain the query, otherwise the entire dimension contents are returned: [Ens1:Ens2][Fcst1:Fcst2][Lev1:Lev2][Lat1:Lat2][Lon1:Lon2] The “gep_all_00z” is the entry point to the 21 ensemble components, each with a realization of forecasts out to 384-hours. Units and other information are represented in the metadata descriptor file, at the above address when ascii? Is changed to .info. A stride is also possible as [start:stride:finish] using colon separated values. Normally constrained queries are composed by high level languages MATLAB and IDL on the commercial side and GrADS as a freeware example, in response to user functional commands. But one can compose these by hand or users can construct their own script and query the server with non-interactive web download commands like wget an cURL. Here is an example: Ensemble probability tool: A web based client application that can be used to provide threshold information to the user. The program obtains the Global Ensemble Forecast information matrix from the server as described above from 400 separate binary packed GRIB2 files, and returns the information to the user as a probability display. A user then determines a proper threshold of their defined weather event. See demo at http://nomads.ncdc.noaa.gov/EnsProb/ Users can determine their threshold (to tolerate false alarms) for a weather event alert, that is sent to email and cell phone (SMS) text-message when the threshold is met in a similar application http://nomads-cloud.net/cgi-bin/ensprob_wb4.pl. The returned URL for this application will repeat the alert process, that is, sending user notification only if the threshold criteria is met, otherwise no action is taken, so the user may repeat the command in cron or a scheduler and be assured of the latest forecast cycle (4X/day). “http/fast/partial” ftp protocol allows "random access" GRIB file transfer using server side metadata description index files and a downloadable perl program that supports ftp random access. NOMADS servers keep a metadata file for this purpose for all datasets. The index file metadata is accessed by a get inventory information script and a random-access program using cURL. Both are freely available, widely used, and work on many platforms. Called “http/fast/partial ftp” commands are easily scripted/automated and placed into a cron job. For examples and downloads see http://www.cpc.ncep.noaa.gov/products/wesley/fast_downloading_grib.html http://www.cpc.ncep.noaa.gov/products/wesley/wgrib2/index.html Prediction is what makes EMC unique! (0) Global Forecast System (GFS), Global Spectral Model (2) “ftp” File transfer protocol service, Yes., NOMADS does this too What is NOMADS? Want Data Values From Format Independent URL Queries? NOAA’s Operational Model Archive Distribution System GDS/OPeNDAP*: combines both GrADS, the Grid Analysis and Display System, a freeware client (from COLA**) and GrADS Data Server, to provide subsetting and analysis services across the internet. Underneath GDS is OPeNDAP data networking that makes local data accessible to remote locations. The server unpacks the model output data in response to user queries so server data values can appear to the user or client application as a local file! NOMADS serves data. Display is done by the user (client) NOMADS is a distributed data service for format independent access to climate, ocean and weather model output grid data. Weather service real time model output at the Web Operations Center has moved to the NOAA Center for Weather and Climate Prediction (NCWCP) data center At Weather Forecast Office’s, Regional Centers, University and Institutional research, NOAA often effort is wasted on data receipt and format issues with not enough infrastructure to collaborate. There is a need for Climate and Weather model output data to make decisions to preserve life, property and commerce. Applications to Slice, Dice and Area subset model output packed binary grid data sets across the complex dimensions of the NOAA Operational model output matrix (4) Use URL queries to request or extract data from the GDS/OPeNDAP server An Example: The OPeNDAP/GDS constrained query for the Aggregated Global Ensemble data set (GEFS) All NOMADS holdings have a Metadata description that is man or machine readable SUMMARY AND FUTURE WORK Aggregation of data sets is the key for users to obtain the data they need to make time critical decisions for their own projects. Data location Description Dimensions & Extent Variables, Units… Geo-spatial For an ensemble data set, an additional metadata line in the dimensions and extent would read “Ensemble:1 to 21 (21 points) Variables” indicating the 21 GEFS ensemble component forecasts aggregated over the separate files. What are Ensembles? GDS/OPeNDAP servers are an efficient way for users and web or cell phone application programs to get real time model output data on demand. Operational Conventional and non-conventional Observations need to be served in a similar way to grid fields. THREDDS Data Server is under testing at NCWCP http://nomads.ncep.noaa.gov (Data) http://nomads.ncdc.noaa.gov (Archives) NCEP global ensemble forecasts are constructed from many (~20) GFS model integrations by slightly changing the initial conditions. Each of the GEFS ensemble forecast components are in separate binary (GRIB2) files from initial condition (f00) time to 16 days (f384) in the future. Probability estimates can be defined simply as the percentage of forecasts that satisfy a specified weather event (hits)over the total sample space (the total number of components). NOMADS successful at satisfying NWS Strategic Goals Improve Weather, Climate and Water Decisions Services, Improve sector-relevant information in support of economic productivity, Enable environmental forecast services supporting healthy communities and ecosystems (1) Aggregation means all the ensemble component files representing each realization of all the separate forecast files appear as if they were one dataset with one entry point without any duplication of the data. (5) *GrADS Data Server: http://www.iges.org/grads/gds/index.html **Center for Ocean-Land-Atmosphere Studies: http://www.iges.org/grads/ (3)