Data formats and requirements in CMIP6: the climate-prediction case Pierre-Antoine Bretonnière EC-Earth meeting, Reading, May 2015.

I – Experience from previous Model Intercomparison Projects
  - CMIP5
  - SPECS
II – How to comply with strict metadata and format conventions
  - A CMOR history
  - Plans for CMIP6

Experience from CMIP5
- 1.8 PB of data sets stored in 4.3 million files on 23 data nodes; 116 experiments published (= 50 × CMIP3)
- NetCDF3 format: tas_Amon_EC-Earth_historical_r1i1p1_ nc
- Triple data quality control:
  - ESGF publisher conformance checks
  - Data consistency checks
  - Double- and cross-checks of data and metadata, and DataCite data publication
- Experiment and model descriptions shared before data publication (ES-DOC database)
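The CMIP5 file name quoted on this slide is a sequence of underscore-separated facets: variable, MIP table, model, experiment, ensemble member and an optional time period. As a minimal sketch of how such a name can be split, the Python below is illustrative only: the function name `parse_cmip5_filename` is hypothetical, and the period `185001-200512` is an assumed value, because the dates in the slide's own example are elided.

```python
import re
from typing import Dict, Optional

# Ensemble member label used in CMIP5: r<realization>i<initialization>p<physics>
MEMBER_RE = re.compile(r"^r(\d+)i(\d+)p(\d+)$")


def parse_cmip5_filename(name: str) -> Dict[str, Optional[str]]:
    """Split a CMIP5-style file name into its underscore-separated facets."""
    if not name.endswith(".nc"):
        raise ValueError(f"not a NetCDF file name: {name!r}")
    parts = name[:-3].split("_")
    if len(parts) not in (5, 6):
        raise ValueError(f"expected 5 or 6 facets, got {len(parts)}: {name!r}")
    variable, table, model, experiment, member = parts[:5]
    match = MEMBER_RE.match(member)
    if match is None:
        raise ValueError(f"bad ensemble member label: {member!r}")
    return {
        "variable": variable,
        "table": table,
        "model": model,
        "experiment": experiment,
        "member": member,
        "realization": match.group(1),
        "initialization": match.group(2),
        "physics": match.group(3),
        # The time period is optional (absent for time-independent fields).
        "period": parts[5] if len(parts) == 6 else None,
    }


# Hypothetical period: the dates in the slide's example are not recoverable.
facets = parse_cmip5_filename("tas_Amon_EC-Earth_historical_r1i1p1_185001-200512.nc")
print(facets["model"], facets["member"], facets["period"])
```

Splitting on underscores works here because model names such as EC-Earth use hyphens rather than underscores.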

Experience from CMIP5
Lessons and future requirements:
- Usability of the ESGF data access interface
- Automated data replication between ESGF data nodes
- More powerful, more stable and scalable wide-area data networks (service level agreements)
- Detailed information on initialization, physics, etc. should be more easily accessible (it was in the model documentation)

Experience from SPECS
- 80 TB of data, stored at the BADC and published on the ESGF nodes
- NetCDF4 format (1.5 to 2x space saving)
- Double time axis to encode seasonal-to-decadal predictions
- Start date added to the file name: IPSL-CM5A-LR/decadal/S /mon/ocean/tos/r3i1p1/tos_Omon_IPSL-CM5A-LR_decadal_S _r3i1p1_ nc
- New attributes: initialization and physics description, associated experiment
- Creation of “deposit receipts” when data is published
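The double time axis can be read as follows: each forecast value is located both by the start date of the prediction and by the time elapsed since that start date, and the valid time is their sum. CF Conventions provide the standard names `forecast_reference_time` and `forecast_period` for these two quantities. The Python sketch below only illustrates that arithmetic; the variable names `reftime` and `leadtimes`, the start date and the lead times are assumptions made for illustration, not values from the slides.

```python
from datetime import datetime, timedelta
from typing import List


def valid_times(reftime: datetime, leadtimes_days: List[float]) -> List[datetime]:
    """Return the valid time of each forecast step: start date plus lead time."""
    return [reftime + timedelta(days=lead) for lead in leadtimes_days]


# Hypothetical prediction started on 1 November 1990.
reftime = datetime(1990, 11, 1)
# Hypothetical lead times in days, pointing at the middle of the first three months.
leadtimes = [15.0, 45.5, 76.5]

times = valid_times(reftime, leadtimes)
for lead, time in zip(leadtimes, times):
    print(f"lead {lead:5.1f} days -> valid {time.isoformat()}")
```

Keeping the start date and the lead time as separate coordinates lets several predictions that cover the same calendar dates coexist without ambiguity.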

CMOR history
- CMOR is a library of C functions (with Fortran90 and Python interfaces) that facilitates and enforces compliance with MIP requirements.
- CMOR was designed to be adapted to the different metadata requirements of each model intercomparison project.
- CMOR2 was used in CMIP5, and a SPECS patch was developed to encode new requirements and project-specific conventions.

CMOR history
Ensures compliance with:
- NetCDF – ( )
- CF Conventions: provide a standardized description of the data contained in a file ( cf-convention.github.io )
- Data Reference Syntax (DRS): defines the vocabulary used in uniquely identifying MIP datasets and specifying file and directory names ( cmip-pcmdi.llnl.gov/cmip5/output_req.html )
- Project specific
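Because the DRS fixes both directory and file names, a path can be assembled mechanically from a handful of facets. The Python sketch below follows the layout of the SPECS example path quoted earlier in this transcript (model/experiment/S<start date>/frequency/realm/variable/member/file). It is an illustration under stated assumptions, not the CMOR implementation: the `Facets` class and `drs_path` function are hypothetical names, and the start date and period values are invented because they are elided in the slides.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath


@dataclass(frozen=True)
class Facets:
    """Facets needed to name one file (hypothetical container for this sketch)."""
    model: str
    experiment: str
    start_date: str   # assumed YYYYMMDD format
    frequency: str
    realm: str
    variable: str
    table: str
    member: str
    period: str       # assumed YYYYMM-YYYYMM format


def drs_path(f: Facets) -> PurePosixPath:
    """Build a directory and file name following the SPECS-style layout."""
    start = f"S{f.start_date}"
    filename = (
        f"{f.variable}_{f.table}_{f.model}_{f.experiment}_"
        f"{start}_{f.member}_{f.period}.nc"
    )
    return PurePosixPath(
        f.model, f.experiment, start, f.frequency,
        f.realm, f.variable, f.member, filename,
    )


# Start date and period are hypothetical values chosen for illustration.
example = Facets(
    model="IPSL-CM5A-LR", experiment="decadal", start_date="19901101",
    frequency="mon", realm="ocean", variable="tos", table="Omon",
    member="r3i1p1", period="199011-199512",
)
path = drs_path(example)
print(path)
```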

Plans for CMIP6
- The CMOR library will be used to ensure that all the data produced by the different partners have the same standardized format
- CMOR3 is under development to:
  - Better handle a wide range of models and observational data
  - Modularize CMOR input tables
  - Integrate CMIP6 format convention changes (including SPECS standards)
- New format and DRS under development
- Data quality control handled at several levels (improved in the CMORization process itself: e.g., valid max and min for each field)
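The valid maximum and minimum check mentioned on this slide amounts to comparing every value of a field against limits declared for that variable. A minimal Python sketch of the idea follows; it is not CMOR code, and the `VALID_RANGE` table, its limits and the `out_of_range` function are hypothetical and chosen only for illustration.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical per-variable limits in kelvin; real limits would come from the MIP tables.
VALID_RANGE: Dict[str, Tuple[float, float]] = {
    "tas": (150.0, 350.0),
    "tos": (260.0, 320.0),
}


def out_of_range(variable: str, values: Iterable[float]) -> List[Tuple[int, float]]:
    """Return (index, value) pairs lying outside the declared valid range.

    NaN values fail the comparison and are therefore reported as well.
    """
    low, high = VALID_RANGE[variable]
    return [(i, v) for i, v in enumerate(values) if not low <= v <= high]


# The second value looks like degrees Celsius written into a field declared in kelvin.
bad = out_of_range("tas", [288.2, 15.0, 301.7])
print(bad)
```

Running such a check while the data are being written catches unit mistakes before publication rather than during later quality control.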

Plans for CMIP6
- The SPECS contribution to CMOR will ensure that the needs of the Decadal Climate Prediction Project are taken into account
- For EC-Earth users, unifying the data formatting in CMIP6 (using tools like ece2cmor) will make the CMORization more efficient
- Use of XIOS to facilitate the formatting of the data
[Figure: map of ESGF nodes and EC-Earth users]

Thanks for your attention! Questions?