RDA uptake activities and plans: ESGF
Context „Well organized“ part of climate data management: Model Intercomparison project support Large community efforts (ESGF, COG, IS-ENES) Interfaces, APIs for related communities (impact,…) modeling centers observation data providers model metadata ESGF Data Node Index Nodes CIM repository Metadata Synchronization Data Publication / Versioning Model run Documentation Replication Community Portals Compute Derived / On-Demand data products ESGF / ENES data infrastructure End Users Search API Access API RDA links: PIDs / PIT / Collections: Data management support End user services Data Fabric IG: ESGF/ENES use cases EUDAT/ ePIC collaboration 26.05.2019
PIDs Central Role of Persistent (and Unique) Identifieres Actionable „tracking ids“ for ESGF ! From RDA Data Management Paper (https://b2share.eudat.eu/record/229/files/paris-doc-v6-1.pdf?version=1) ESGF files or filesets Replica / New Version resolver PID‘ PID PID properties properties Creation_date checksum checksumtype status_flags DRS_id tombstone flag replaced by, preceded by aggregation level, children Creation_date checksum checksumtype status_flags DRS_id tombstone flag replaced by, preceded by aggregation level, children Properties are typed type definitions supporting ESGF use cases Type registry 26.05.2019
ESGF data publication / Versioning / Replication PIDs for ESGF Actionable „tracking ids“ for ESGF: CMIP6 support PID infrastructure: Handle system / ePIC / DONA .. PID information types and type registry PID Collections Data generation Data post- processing ESGF data publication / Versioning / Replication Data usage / analysis Data archival Data citation Infrastructure and end user services Assignment of PIDs: CMOR tool Management of PIDs: Integration into ESGF data publication (and versioning/replication) process ESGF PID backend infrastructure: Handle system, message queue, operational agreements .. PID DOI transition 26.05.2019
Status and next steps Handle based PID infrastructure prototype, stable PID API (EUDAT collaboration) Next: ESGF integration, PID system hosting Tools, services exploiting PID/PIT system Message queue to manage massive PID system interactions (rabbitmq) ! Community feedback to RDA Future: Processing tool integration Also from RDA Data Management Paper (https://b2share.eudat.eu/record/229/files/paris-doc-v6-1.pdf?version=1) 26.05.2019
Thank you Questions ? 26.05.2019