New Resources in the Research Data Archive Doug Schuster.

Slides:



Advertisements
Similar presentations
Slide 1 TIGGE-LAM Workshop Bologna Jan TIGGE-LAM: Archiving at ECMWF Manuel Fuentes Data and Services Section ECMWF.
Advertisements

ECMWF June 2006Slide 1 Access to ECMWF data for Research Manuel Fuentes Data and Services Section, ECMWF ECMWF Forecast Products User Meeting.
The THORPEX Interactive Grand Global Ensemble (TIGGE) Richard Swinbank, Zoltan Toth and Philippe Bougeault, with thanks to the GIFS-TIGGE working group.
Slide 1 TECO on the WIS, Seoul, 6-8 November 2006 Slide 1 TECO on the WIS: Stakeholder Session THORPEX and TIGGE Walter Zwieflhofer ECMWF.
RAMADDA for Big Climate Data Don Murray NOAA/ESRL/PSD and CU-CIRES Boulder/Denver Big Data Meetup - June 18, 2014.
ICOADS Archive Practices at NCAR JCOMM ETMC-III 9-12 February 2010 Steven Worley.
ERA-Interim and ASR Data Management at NCAR
The Research Data Archive at NCAR Doug Schuster and Steve Worley NCAR.
The International Surface Pressure Databank (ISPD) and Twentieth Century Reanalysis at NCAR Thomas Cram - NCAR, Boulder, CO Gilbert Compo & Chesley McColl.
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System 1 Zaihua Ji Doug Schuster Steven Worley Computational.
Introduction Downloading and sifting through large volumes of data stored in differing formats can be a time-consuming and sometimes frustrating process.
GEO Work Plan Symposium 2014 WE-01 Jim Caughey THORPEX IPO.
October 16-18, Research Data Set Archives Steven Worley Scientific Computing Division Data Support Section.
EGU 2011 TIGGE, TIGGE LAM and the GIFS T. Paccagnella (1), D. Richardson (2), D. Schuster(3), R. Swinbank (4), Z. Toth (3), S.
TIGGE Archive Highlights. First Service Date ECMWF – October 2006 NCAR – October 2006 CMA – June 2007.
Data for Climate and Energy Studies Steven Worley Computational and Information Systems Laboratory NCAR.
TIGGE Data Archive and Access System at NCAR 5th GIFS-TIGGE Working Group South African Weather Service Pretoria March 2008 Steven Worley Doug Schuster.
Ensemble Forecasting: Thorpex-Tigge and use in Applications Tom Hopson.
Slide 1 TIGGE phase1: Experience with exchanging large amount of NWP data in near real-time Baudouin Raoult Data and Services Section ECMWF.
Data to Support Ocean-Atmosphere Research NCAR Research Data Archive (RDA), Zaihua Ji, NCAR Steven Worley, NCAR Scott Woodruff,
Data Access to Marine Surface Observations and Products from COADS 29 January, 2002 Steven Worley National Center for Atmospheric Research.
CISL/DSS & MMM Data Discussion 19 March Who CISL/DSS - maintain NCEP operational analyses and observation datasets – Gregg Walters, Doug Schuster,
THORPEX Interactive Grand Global Ensemble (TIGGE) China Meteorological Administration TIGGE-WG meeting, Boulder, June Progress on TIGGE Archive Center.
Improved Access to RDA from the MSS OSD Executive Meeting April 28, 2009.
ICOADS: Update Status and Data Distribution Steven J. Worley Scott D. Woodruff Sandra J. Lubker Ziahua Ji J. Eric Freeman NCAR, NOAA/ESRL, NOAA/NCDC CLIMAR-III,
Analyzed Data Products Available from NCAR that Support Marine Climate Research JCOMM ETMC-III 9-12 February 2010 Steven Worley Doug Schuster.
1 Takuya KOMORI 1 Kiyotomi SATO 1, Hitoshi YONEHARA 1 and Tetsuo NAKAZAWA 2 1: Numerical Prediction Division, Japan Meteorological Agency 2: Typhoon Research.
Data Discovery and Access to The International Surface Pressure Databank (ISPD) 1 Thomas Cram Gilbert P. Compo* Doug Schuster Chesley McColl* Steven Worley.
TIGGE, an International Data Archive and Access System Steven Worley Doug Schuster Dave Stepaniak Nate Wilhelmi (NCAR) Baudouin Raoult (ECMWF) Peiliang.
TIGGE and operational EPS 経田 正幸 KYOUDA Masayuki Numerical Prediction Division, Japan Meteorological Agency 9 th THORPEX GIFS-TIGGE Working Group meeting.
Content, Discovery, and Accessibility Enhancements to the NCAR Research Data Archive Doug Schuster and Steve Worley NCAR.
JRA-25 and JCDAS at NCAR Data from Japanese 25-year Reanalysis (JRA-25) and the operational follow- on JMA Climate Data Assimilation System (JCDAS) are.
Progress of CMA TIGGE Archive Data center (updated) Bian Xiaofeng,Li Xiang,Sun Jing (National Meteorological Information Centre,CMA) Chen Jing Hu Jiangkai,
TIGGE Data Archive and Access at NCAR November 2008 November 2008 Steven Worley National Center for Atmospheric Research Boulder, Colorado, U.S.A.
Slide 1 GO-ESSP Paris. June 2007 Slide 1 (TIGGE and) the EU Funded BRIDGE project Baudouin Raoult Head of Data and Services Section ECMWF.
Information Technology: GrADS INTEGRATED USER INTERFACE Maps, Charts, Animations Expressions, Functions of Original Variables General slices of { 4D Grids.
TIGGE Data Archive at NCAR 8th GIFS-TIGGE Working Group World Meteorological Organization Geneva February, 2010 Doug Schuster Steven Worley Dave.
The TIGGE Model Validation Portal: An Improvement in Data Interoperability 1 Thomas Cram Doug Schuster Hannah Wilcox Steven Worley National Center for.
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
TIGGE Archive Status at NCAR THORPEX Workshop and 6th GIFS-TIGGE Working Group Meetings WMO Headquarters Geneva September 2008 Steven Worley Doug.
SCD Research Data Archives; Availability Through the CDP About 500 distinct datasets, 12 TB Diverse in type, size, and format Serving 900 different investigators.
The Research Data Archive at NCAR: A System Designed to Handle Diverse Datasets Bob Dattore and Steven Worley National Center for Atmospheric Research.
TIGGE Archive Access at NCAR Steven Worley Doug Schuster Dave Stepaniak Hannah Wilcox.
Data Discovery and Access to The International Surface Pressure Databank (ISPD) 1 Thomas Cram Gilbert P. Compo* Doug Schuster Chesley McColl* Steven Worley.
THORPEX THORPEX (THeObserving system Research and Predictability Experiment) was established in 2003 by the Fourteenth World Meteorological Congress. THORPEX.
GPS Observations for Atmospheric Science Ground-based and Radio Occultation Observations for Weather, Climate and Ionosphere C. Rocken, S. Sokolovskiy,
Enabling the Transition of CPC Products to GIS Format Brian Doty Jennifer Adams Michael Halpert Viviane Silva.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
The TIGGE Model Validation Portal: An Improvement in Data Interoperability 1 Thomas Cram Doug Schuster Hannah Wilcox Michael Burek Eric Nienhouse Steven.
1. Gridded Data Sub-setting Services through the RDA at NCAR Doug Schuster, Steve Worley, Bob Dattore, Dave Stepaniak.
TIGGE-LAM archive development in the frame of GEOWOW Richard Mladek, Manuel Fuentes (ECMWF)
A41I-0105 Supporting Decadal and Regional Climate Prediction through NCAR’s EaSM Data Portal Doug Schuster and Steve Worley National Center for Atmospheric.
Introduction What purpose does a data archive center serve if users can’t find or access the holdings they might need to facilitate their research discoveries?
Status of CMA S2S Archiving & Web portal
Tom Hopson, NCAR (among others) Satya Priya, World Bank
TIGGE Archives and Access
TIGGE Data Archive and Access System at NCAR
Use of TIGGE Data: Cyclone NARGIS
Jennifer Boehnert Emily Riddle Tom Hopson
Meningitis Forecasting using Climate Information Tom Hopson
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System Zaihua Ji Doug Schuster Steven Worley Computational.
Google Meningitis Modeling
Links with GEO.
Development and Futures of Research Data Archives
TIGGE Data Archive at NCAR
Steven Worley, Douglas Schuster,
Implementation and Plans for TIGGE at NCAR and ECMWF
CXML data exchange Beth Ebert
Data Curation in Climate and Weather
Comeaux and Worley, NSF/NCAR/SCD
Presentation transcript:

New Resources in the Research Data Archive Doug Schuster

Topic Outline lNew Resources lSearch/Discovery and Data Delivery lTIGGE lJRA-25 lRoutine Updates

Data Search, Discovery and Delivery lPopular Datasets lGoogle Style Search lDrill Down Style Search lFile Level Metadata lExample: lSearch for model generated tropical cyclone track data using “Drill Down” method.

Data Search, Discovery, and Delivery

Data Search, Discovery, and Delivery (Drill Down)

Data Search, Discovery, and Delivery (File Level Metadata)

Background on TIGGE WMO World Weather Research Programme THORPEX –THe Observing system Research and Predictability EXperiment –THORPEX Interactive Global Grand Ensemble (TIGGE) Archive supports research Grand Ensemble = multiple NWP centers ensembles are combined (an ensemble of ensembles) 10 international NWP Centers contributing to TIGGE

Background on TIGGE Three mirrored archive centers NCAR ECMWF CMA {Shared System Development!} Daily Data Flow Metrics –245 GB –1.6 Million gridded fields as separate data packets –3000+ Files/day

Data Receipt Archive Centre Current Data Provider NCAR NCEP CMC UKMO ECMWF MeteoFrance JMA KMA CMA BoM CPTEC IDD/LDM HTTP FTP Unidata IDD/LDM Internet Data Distribution / Local Data Manager Commodity internet application to send and receive data NCDC

Archive Summary Online Data –Period, most recent two weeks –~ 4 TB, public products –~ 2 TB, data preparation, subsetting, DB Offline Data –Full period of record –~ 200 TB, NCAR MSS system

Major Challenges Insure data receipt, build complete archive  Exchange manifest files as part of IDD/LDM data transmission between Archive centers  Verify send, receive  Automated resend requests for missing fields  Collate data fields into different files types  Harvest and hold metadata in MySQL DB’s  Identify location of every field in file set  Updated often  Critical for users interface and background data processing

Major Challenges  Access system must accurately display what common parameters are available as users make selections  Driven by multi-center research (Grand Ensemble)  Parameters vary between centers.

Variance between centers

Get Forecast Data NCAR online file archive Selection options (Portal or RDA) Center(s) Date File type (sl, pl, etc) Initialization time Forecast length Download Options Point and click using browser, one file at a time Script to run on local machine User and password encrypted ‘wget’ commands background process to access all files User customized files Selection options (Portal) Same as for files, plus Parameter Subsets Grid Interpolation Spatial subsets Formats, GRIB2, NetCDF Delayed ModeReal Time Two User Interfaces

User access selection demonstration Animation, what you will see –Multiple centers (ECMWF, UKMO, NCEP, CMA, CMC, KMA) –Fields/Parameters (Geopotential Height, 2m Temperature) –Levels (500 hPa, Single Level) –Spatial and temporal ranges (Global, 3-days, 12Z initializations, 48 hour forecasts) –Regridding to common spatial resolution (1.5°) –Output format (netCDF)

Sample Data Request for an Event

Retrieve Completed Subset

Subset Request Animation

Gustav/Hannah Animation

Features of JRA-25/JCDAS at NCAR All data available through web/RDA portal and NCAR MSS, 11 TB Available dates, 1979 though different data products – 4 x daily, GRIB1 format –Monthly mean, netCDF (NCAR derived from binary) format All data users are registered and must agree to JMA’s ‘Condition of Use’

Typhoon Sepat, 16 August 2007 Images courtesy Dave Stepaniak

Routine Updates NCEP FNL Global Tropospheric Analysis (Daily) BUFR/PREPBUFR obs. data (Weekly) NCEP FNL Global Tropospheric Analysis (Daily) BUFR/PREPBUFR obs. data (Weekly) Unidata IDD data (Daily) NetCDF format obs collected from GTS IDD model data (GRIB-2)  GFS  NAM  RUC Unidata IDD data (Daily) NetCDF format obs collected from GTS IDD model data (GRIB-2)  GFS  NAM  RUC

Routine Updates SST NCEP OI Global SST 1x1 Deg (weekly) NOAA OI Global 0.25 x 0.25 SST (monthly) Hadley Centre Global Sea Ice and SST (monthly) SST NCEP OI Global SST 1x1 Deg (weekly) NOAA OI Global 0.25 x 0.25 SST (monthly) Hadley Centre Global Sea Ice and SST (monthly) Reanalysis NNR Yearly updates NARR Yearly updates JRA-25 Reanalysis NNR Yearly updates NARR Yearly updates JRA-25

Questions?

Lessons Learned  Manifest files and automated resend are critical for a complete archive  The impact of different contributions from the NWP centers across archive cannot be under estimated  There are important design considerations to insure prompt browser interactions  Caching data from the DB

Lessons Learned  Computational resource requirements ramp up quickly with multi-dimensional problems  D’s, center, ensemble member, parameter, forecast length, etc.  Archive file structure choices greatly impact subsetting ability  TIGGE currently based on synoptic order  Time-series by parameter could be better?

Major Challenges  Limited online storage – 4 TB, ≅ 2 weeks temporal coverage  Full archive on NCAR Mass Storage System  User registration and metrics required  Accept data policy; for research and education only  48 hour delay from forecast initialization time