RDA uptake activities and plans: ESGF



Context

"Well organized" part of climate data management:
- Model Intercomparison Project support
- Large community efforts (ESGF, COG, IS-ENES)
- Interfaces and APIs for related communities (impact, ...)

[Diagram: ESGF / ENES data infrastructure. Labels: modeling centers, observation data providers, model metadata, ESGF data node, index nodes, CIM repository, metadata synchronization, data publication / versioning, model run documentation, replication, community portals, compute, derived / on-demand data products, end users, Search API, Access API.]

RDA links:
- PIDs / PIT / Collections: data management support, end user services
- Data Fabric IG: ESGF/ENES use cases
- EUDAT / ePIC collaboration

26.05.2019

PIDs

Central role of persistent (and unique) identifiers -> actionable "tracking ids" for ESGF!

[Diagram from the RDA Data Management Paper (https://b2share.eudat.eu/record/229/files/paris-doc-v6-1.pdf?version=1). Labels: ESGF files or filesets, replica / new version, resolver, PID, PID', properties.]

PID properties (the same list appears for each PID in the diagram):
- Creation_date
- checksum, checksumtype
- status_flags
- DRS_id
- tombstone flag
- replaced by, preceded by
- aggregation level, children

Properties are typed -> type definitions supporting ESGF use cases (type registry)
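As an illustration of typed PID properties and of the "replaced by" link between versions, here is a minimal Python sketch. It is not the ESGF or Handle system implementation: the in-memory registry, the property-to-type mapping, the function names, and the example identifiers are all assumptions made for this example.

```python
# Hypothetical type registry: property name -> accepted Python type(s).
# The property names follow the slide; the types are assumptions.
TYPE_REGISTRY = {
    "creation_date": str,
    "checksum": str,
    "checksumtype": str,
    "status_flags": list,
    "drs_id": str,
    "tombstone": bool,
    "replaced_by": (str, type(None)),
    "preceded_by": (str, type(None)),
    "aggregation_level": str,
    "children": list,
}


def validate(properties):
    """Return a list of problems found when checking properties against the registry."""
    problems = []
    for name, value in properties.items():
        expected = TYPE_REGISTRY.get(name)
        if expected is None:
            problems.append(f"unregistered property: {name}")
        elif not isinstance(value, expected):
            problems.append(f"wrong type for {name}")
    return problems


def resolve_latest(pid, records):
    """Follow replaced_by links from pid to the newest version, guarding against cycles."""
    seen = set()
    while True:
        if pid in seen:
            raise ValueError("cycle in replaced_by chain")
        seen.add(pid)
        successor = records[pid].get("replaced_by")
        if successor is None:
            return pid
        pid = successor


# Made-up identifiers; "0000.TEST" is a placeholder prefix, not a real one.
records = {
    "hdl:0000.TEST/a": {"replaced_by": "hdl:0000.TEST/b"},
    "hdl:0000.TEST/b": {"replaced_by": "hdl:0000.TEST/c", "preceded_by": "hdl:0000.TEST/a"},
    "hdl:0000.TEST/c": {"replaced_by": None, "preceded_by": "hdl:0000.TEST/b"},
}
latest = resolve_latest("hdl:0000.TEST/a", records)
```

A real resolver would query the PID service rather than a dictionary, and a tombstone flag could be checked at each step to report withdrawn versions.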

PIDs for ESGF

Actionable "tracking ids" for ESGF: CMIP6 support
- PID infrastructure: Handle system / ePIC / DONA ...
- PID information types and type registry
- PID collections

[Diagram: workflow stages. Labels: data generation, data post-processing, ESGF data publication / versioning / replication, data usage / analysis, data archival, data citation, infrastructure and end user services.]

- Assignment of PIDs: CMOR tool
- Management of PIDs: integration into the ESGF data publication (and versioning / replication) process
- ESGF PID backend infrastructure: Handle system, message queue, operational agreements ...
- PID -> DOI transition
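The idea of assigning a PID when a file is written and attaching properties at publication time can be sketched as follows. This is only an illustration: it does not reproduce what the CMOR tool or the ESGF publisher does, and the function names, the "0000.TEST" prefix, the choice of SHA-256, and the DRS identifier are hypothetical.

```python
import hashlib
import uuid
from datetime import datetime, timezone


def mint_tracking_id(prefix):
    """Build a handle-style tracking id from a prefix and a random UUID."""
    return f"hdl:{prefix}/{uuid.uuid4()}"


def build_pid_record(content, drs_id, prefix="0000.TEST"):
    """Assemble a PID and a small property set for one file's content (bytes)."""
    return {
        "pid": mint_tracking_id(prefix),
        "properties": {
            "creation_date": datetime.now(timezone.utc).isoformat(),
            "checksum": hashlib.sha256(content).hexdigest(),
            "checksumtype": "SHA256",
            "drs_id": drs_id,
            "tombstone": False,
        },
    }


# Hypothetical file content and DRS identifier, for illustration only.
record = build_pid_record(b"abc", "example.drs.id")
```

Because the identifier is random, two runs over the same content yield different PIDs but the same checksum, which is what lets a replica be recognized as a copy of the same data.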

Status and next steps

- Handle-based PID infrastructure prototype, stable PID API (EUDAT collaboration)
- Next: ESGF integration, PID system hosting
- Tools and services exploiting the PID/PIT system
- Message queue to manage massive PID system interactions (RabbitMQ)
- Community feedback to RDA!
- Future: processing tool integration

[Diagram also from the RDA Data Management Paper (https://b2share.eudat.eu/record/229/files/paris-doc-v6-1.pdf?version=1)]
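The reason for placing a message queue between publication and the PID system can be shown with a small sketch: the publisher only enqueues registration messages, and a worker applies them at its own pace. The slide names RabbitMQ; this example deliberately substitutes Python's standard-library queue.Queue and an in-memory dictionary, so the message layout, function names, and store are assumptions, not the ESGF setup.

```python
import queue
import threading


def run_pid_worker(tasks, store):
    """Consume PID registration messages until a None sentinel arrives."""
    while True:
        message = tasks.get()
        if message is None:
            tasks.task_done()
            break
        # Stand-in for a call to the PID service.
        store[message["pid"]] = message["properties"]
        tasks.task_done()


def register_all(messages):
    """Enqueue all messages, let one worker drain the queue, and return the store."""
    tasks = queue.Queue()
    store = {}
    worker = threading.Thread(target=run_pid_worker, args=(tasks, store))
    worker.start()
    for message in messages:
        tasks.put(message)   # the publisher returns immediately after enqueueing
    tasks.put(None)          # sentinel: no more messages
    tasks.join()             # wait until every message has been processed
    worker.join()
    return store


# Made-up messages; "0000.TEST" is a placeholder prefix.
registered = register_all([
    {"pid": "hdl:0000.TEST/a", "properties": {"checksum": "abc"}},
    {"pid": "hdl:0000.TEST/b", "properties": {"checksum": "def"}},
])
```

With a broker such as RabbitMQ the queue would additionally persist messages, so a temporary outage of the PID service delays registrations instead of losing them.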

Thank you. Questions?