IDRP: The first distributed data management infrastructure for nanoscience
Rossella Aversa
Karlsruhe Institute of Technology (KIT) – Steinbuch Centre for Computing (SCC)
Consiglio Nazionale delle Ricerche (CNR) – Istituto Officina dei Materiali (IOM)
RDA's 13th Plenary Meeting, Philadelphia, US
RDARI IG Meeting, April 3rd, 2019
The NFFA project
- Nanoscience Foundries and Fine Analysis, for multidisciplinary research at the nanoscale
- EU-funded project, led by CNR-IOM (Trieste)
- 20 partners providing access through institutions (Transnational Access)
Image credits: nffa.eu
The previous workflow
[Diagram labels: Proposal, Facility A, Data; Proposal, Facility B, Data]
Image credits: chittagongit.com, pixabay.com
The aims of the system
- Offer an architecture for multi-institutional collaborations
- Register (or store) the high volume and variety of data generated at the NFFA facilities
- Identify and organize the metadata associated with scientific data
- Make scientific data accessible and searchable by means of a metadata search engine
- Educate users to provide FAIR data
The NFFA workflow
[Diagram labels: Proposal; Information and Data Repository Platform (IDRP); Facility A, Data, Local storage; Facility B, Data, Local storage]
Image credits: freepik.com, chittagongit.com
The NFFA platform
[Diagram labels: NFFA Portal, IDRP, Publication, Local Storage, Data Analysis, Facilities]
Image credits: flaticon.es, freepik.com, icons-for-free.com
The NFFA portal
[Diagram: NFFA Portal]
- User registration
- Proposal submission
The NFFA portal: portal.nffa.eu
The IDRP
[Diagram: NFFA Portal, IDRP]
- Single sign-on
- Metadata registration
Metadata
The IDRP includes the relevant (searchable) information:
- Basic Metadata: proposal, instruments, people involved…
Metadata Standard for Nanoscience Data
- Basic metadata model developed by the Science and Technology Facilities Council (STFC)
- Presented at the RDA Plenary in Berlin (April 2018) within the Research data needs of the Photon and Neutron Science community IG (https://rd-alliance.org/groups/rdacodata-materials-data-infrastructure-interoperability-ig.html)
- Follows common metadata standards
- Defines common terminology and the proposal execution workflow in nanoscience
Metadata at IDRP
Local storage
[Diagram: NFFA Portal, IDRP, Local Storage, Facilities]
Local storage: issues (nomad-repository.eu, datashare.nffa.eu, materialscloud.org)
Local storage: issues
Difficult for the distributed infrastructure:
- Different interfaces, no common metadata schema: not totally interoperable
- Different authentication methods: not easily accessible
- Not all using global PIDs: not totally findable
- No common agreement on data types
On the other hand:
- Data in the local storage can be enriched with new domain-specific metadata
Metadata
The IDRP includes the relevant (searchable) information:
- Basic Metadata: proposal, instruments, people involved…
- Associated Metadata: instrument information (for reproducibility)
Data Analysis
[Diagram: NFFA Portal, IDRP, Local Storage, Data Analysis, Facilities]
Data Analysis
- X-SOCS (ESRF)
- SEM (CNR-IOM): sem-classifier.nffa.eu
Metadata
The IDRP includes the relevant (searchable) information:
- Basic Metadata: proposal, instruments, people involved…
- Associated Metadata: instrument information (for reproducibility)
- Domain-specific Metadata: scientific analysis
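The three metadata layers named on this slide can be pictured as one nested record. The sketch below is only an illustration of that layering: the class names, field names, and sample values are assumptions made for this example and are not taken from the IDRP schema.

```python
from dataclasses import dataclass, field, asdict
from typing import Any


@dataclass
class BasicMetadata:
    """Proposal-level, searchable information (hypothetical field names)."""
    proposal_id: str
    instruments: list[str]
    people: list[str]


@dataclass
class MetadataRecord:
    """One record carrying the three layers described on the slide."""
    basic: BasicMetadata
    # Associated metadata: instrument information kept for reproducibility.
    associated: dict[str, Any] = field(default_factory=dict)
    # Domain-specific metadata: output of scientific analysis.
    domain_specific: dict[str, Any] = field(default_factory=dict)

    def layers_present(self) -> list[str]:
        """Return the names of the layers that currently hold information."""
        layers = ["basic"]
        if self.associated:
            layers.append("associated")
        if self.domain_specific:
            layers.append("domain_specific")
        return layers


# Placeholder values, invented for illustration only.
record = MetadataRecord(
    basic=BasicMetadata("P-0001", ["SEM"], ["A. Researcher"]),
    associated={"beam_energy_kV": 5.0},
)
```

Keeping the basic layer as a fixed structure and the other two as open dictionaries mirrors the idea that the basic model is shared, while associated and domain-specific metadata vary by instrument and by analysis.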
Metadata registration
Local storage: NFFA proposed implementation
IDRP offers:
- interfaces for registering/uploading data
- import from local storage, either by downloading from the local storage or by registering the URL
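The two import options listed above (downloading the data, or only registering its URL) differ in where the bytes end up. The following sketch shows that difference under stated assumptions: `DataEntry`, `register_url`, `download`, and the example URL are hypothetical names chosen for this illustration, not the IDRP interface, and the network transfer is replaced by a caller-supplied function.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class DataEntry:
    """A data item known to the platform (hypothetical structure)."""
    source_url: str
    mode: str                        # "registered" or "downloaded"
    content: Optional[bytes] = None  # filled only when data are copied


def register_url(url: str) -> DataEntry:
    """Record only the location; the data stay in the local storage."""
    return DataEntry(source_url=url, mode="registered")


def download(url: str, fetch: Callable[[str], bytes]) -> DataEntry:
    """Copy the data into the platform using a caller-supplied fetch function."""
    return DataEntry(source_url=url, mode="downloaded", content=fetch(url))


def fake_fetch(url: str) -> bytes:
    """Stand-in for a real transfer so the example runs offline."""
    return b"example-bytes"


url = "https://local-storage.example/dataset/1"  # placeholder URL
linked = register_url(url)
copied = download(url, fake_fetch)
```

Registering a URL keeps the entry lightweight but depends on the local storage staying reachable; downloading duplicates the data but makes the entry self-contained.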
Metadata registration at IDRP: idrp.nffa.eu
Data publication
- Open access at IDRP
- Assign a DOI through an external service (b2share.eudat.eu)
Data publication: idrp.nffa.eu
The aims of the system
- Offer an architecture for multi-institutional collaborations
- Register (or store) the high volume and variety of data generated at the NFFA facilities
- Identify and organize the metadata associated with scientific data
- Make scientific data accessible and searchable by means of a metadata search engine
- Educate users to provide FAIR data
Achieved points (FAIR)
- Metadata indexed in a searchable resource
- (Meta)data “internally” published and/or global persistent identifier assigned
- Standardized data access to all the facilities
- (Meta)data permanently registered at IDRP
- Basic metadata model for nanoscience data
- Common implementation for associated and domain-specific metadata registration
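To make the first point concrete, "metadata indexed in a searchable resource" can be illustrated with a small inverted index over metadata values. This is a generic sketch, not a description of how the IDRP search engine is built; the function names, record identifiers, and sample metadata are invented for the example.

```python
from collections import defaultdict


def build_index(records: dict[str, dict[str, str]]) -> dict[str, set[str]]:
    """Map each lowercase word found in any metadata value to record ids."""
    index: dict[str, set[str]] = defaultdict(set)
    for record_id, metadata in records.items():
        for value in metadata.values():
            for token in value.lower().split():
                index[token].add(record_id)
    return index


def search(index: dict[str, set[str]], query: str) -> set[str]:
    """Return the ids of records that contain every word of the query."""
    tokens = query.lower().split()
    if not tokens:
        return set()
    results = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        results &= index.get(token, set())
    return results


# Placeholder records, invented for illustration only.
records = {
    "rec-1": {"instrument": "SEM", "proposal": "nanowire growth"},
    "rec-2": {"instrument": "X-ray diffraction", "proposal": "nanowire strain"},
}
index = build_index(records)
```

With these placeholder records, a query for `nanowire` matches both entries, while `nanowire sem` narrows the result to `rec-1`, because every query word must appear in the record.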
Connections within RDA
- Repository Platforms for Research Data IG (https://rd-alliance.org/groups/repository-platforms-research-data.html)
- Domain Repositories IG (https://rd-alliance.org/groups/domain-repositories-interest-group.html)
- RDA/CODATA Materials Data, Infrastructure & Interoperability IG (https://rd-alliance.org/groups/rdacodata-materials-data-infrastructure-interoperability-ig.html)
- Vocabulary Services IG (https://www.rd-alliance.org/groups/vocabulary-services-interest-group.html)
- Research data needs of the Photon and Neutron Science community IG (https://www.rd-alliance.org/groups/research-data-needs-photon-and-neutron-science-community.html)
- Research Data Repository Interoperability WG (https://rd-alliance.org/groups/research-data-repository-interoperability-wg.html)
- International Materials Resource Registries WG (https://rd-alliance.org/groups/working-group-international-materials-resource-registries.html)