Research Data Repository Interoperability Thomas Jejkal.

2 RDA Working Group - Charter
"The Research Data Repository Interoperability Working Group will establish standards for interoperability between different research data repository platforms focusing on machine-machine communication. These standards may include (but are not limited to) a generic API specification and import/export formats [...]"
Source: RDRIWG Case Statement

3 RDA Working Group - History
- First meeting at P6, phone conference and BoF session at P7
- Case Statement submitted on 19th of May
- 18 months with adoptable outcome
- 2 generic use cases
  - Replication/Migration
  - Information retrieval

4 Use Case – NFFA Europe
- 18 nanoscience facilities all over Europe
- Measurement, Analysis, Simulation
- Support for multi-facility proposals
- Huge variety of research data outputs (raw, analyzed, simulated)
- Registered at the distributed Information and Data Repository Platform (IDRP)
- Find, retrieve, share via data portal
- Publication to publication repository on demand

5 NFFA Europe – Metadata Model
[Diagram: Project (NFFA); Proposal; Facility 1; Instrument; Experiment; Measurement; Raw Data; Facility 2; Sample; Data Analysis; Analysed Data]
- Data assets summarize all file-based output of experiments
- Basic metadata provided with proposal submission
- Additional metadata added after experiment

6 NFFA Europe – Architecture
[Diagram: NFFA Portal; Facility; Local Data Repository]

7 NFFA Europe – Architecture
[Diagram: NFFA Portal; Facility; Local Data Repository; Information and Data Repository Platform (IDRP); Distributed Repository]

8 NFFA Europe – Architecture
[Diagram: NFFA Portal; Facility; Local Data Repository; Information and Data Repository Platform (IDRP); Distributed Repository]

9 NFFA Europe – Architecture
[Diagram: NFFA Portal; Facility; Local Data Repository; Information and Data Repository Platform (IDRP); Distributed Repository; Publication Repository]

10 Interoperability Aspects & Questions
- Many different research data repository systems at facilities
  - How to provide transparent access via IDRP/portal to (meta-)data?
- Different models/repositories needed for internal and public data
  - How to migrate from the internal to the public model?
- Different (research) data types
  - How to identify/obtain data type information from repositories?
- Reasonable performance and flexibility needed due to focus on research data
  - What are special challenges for research data?

11 Transparent Access to (Meta-)Data
[Diagram: NFFA Portal; Facility; Local Data Repository; IDRP; Distributed Repository]
- One internal metadata model
- Custom metadata provided separately, custom extraction and retrieval needed
- Data remains at facility, registered by reference, access organized by policy
→ Import/Export formats, replication
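The "registered by reference, access organized by policy" idea on this slide can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the class names, the group-based policy, and the example URL are hypothetical and do not describe the actual IDRP implementation.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class DataReference:
    """Pointer to data that physically stays at the facility (hypothetical model)."""
    asset_id: str
    facility: str
    location: str  # hypothetical URL of the data in the local repository
    allowed_groups: frozenset = field(default_factory=frozenset)


class ReferenceRegistry:
    """In-memory stand-in for a central platform that only stores references."""

    def __init__(self):
        self._refs = {}

    def register(self, ref):
        # Only the reference is stored; no data is copied.
        self._refs[ref.asset_id] = ref

    def resolve(self, asset_id, user_groups):
        # Assumed policy: the user must share at least one group with the asset.
        ref = self._refs[asset_id]
        if ref.allowed_groups.isdisjoint(user_groups):
            raise PermissionError(f"access to {asset_id} denied by policy")
        return ref.location


registry = ReferenceRegistry()
registry.register(
    DataReference(
        asset_id="asset-1",
        facility="Facility 1",
        location="https://facility1.example.org/data/asset-1",
        allowed_groups=frozenset({"proposal-42"}),
    )
)
print(registry.resolve("asset-1", {"proposal-42"}))
```

A real platform would delegate the policy decision to the facility's own authorization service; the group check above only marks where that decision would sit.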

12 Migrate from Internal to Public Model
[Diagram: NFFA Portal; IDRP; Distributed Repository; Publication Repository]
- Transformation as crosswalk from internal model to (reduced) public model, automated PID assignment
- Data dereferenced → copied to IDRP
→ Replication/migration
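A crosswalk with automated PID assignment could look roughly like the following sketch. The field names, the mapping table, and the "pid:demo/" identifier scheme are all hypothetical placeholders, not the NFFA metadata model or a real PID service.

```python
import itertools

# Hypothetical crosswalk: internal field name -> public field name.
# Internal fields that are not listed are dropped (the "reduced" public model).
CROSSWALK = {
    "proposal_title": "title",
    "principal_investigator": "creator",
    "facility_name": "publisher",
    "measurement_date": "date",
}


def make_pid_minter(prefix="pid:demo/"):
    """Return a function that mints sequential placeholder PIDs."""
    counter = itertools.count(1)

    def mint():
        return f"{prefix}{next(counter):06d}"

    return mint


def to_public_record(internal, mint_pid):
    """Map an internal record to the public model and attach a new PID."""
    public = {
        public_name: internal[internal_name]
        for internal_name, public_name in CROSSWALK.items()
        if internal_name in internal
    }
    public["identifier"] = mint_pid()
    return public


mint = make_pid_minter()
record = to_public_record(
    {"proposal_title": "Thin film study", "sample_notes": "internal only"},
    mint,
)
print(record)
```

Passing the minting function in as a parameter keeps the crosswalk independent of whichever PID service a platform actually uses.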

13 Identify/Obtain Data Type Information
[Diagram: Facility; Local Data Repository; IDRP; Distributed Repository; Data Type Registry]
- Manual definition of data types
- Suggestion based on defined types
- Adoption of results of DTR WG
→ Retrieval of platform/content related information

14 Challenges for (NFFA) Research Data
- Stored at many different platforms (ICAT, NoMaD, iRODS, AiiDA, KIT Data Manager)
- Data formats and structure differ depending on repository platform and equipment used
- Often no clear separation between data and metadata
- Volume from 1 to 10 TB per year
- Data and metadata access restricted by default, publication optional, might be covered by data policy

15 How could Interoperability help?
- Reduce effort for realizing NFFA-like concepts
- Easier "federation" of local repositories
- Less vendor lock-in (e.g. publication repository)
- Standard ways for getting platform/content related information

16 What is There?
- State-of-the-art standards/technologies for access/information retrieval
  - OAI-PMH, OAI-ORE, SWORD, METS, ResourceSync, re3data, ...
- Confederation of Open Access Repositories (COAR)
  - Worked since 2011 on (open access) repository interoperability
  - Roadmap for Future Directions for Repositories Interoperability
- Related RDA IGs and WGs
  - Repository Platforms for Research Data IG
  - Data Fabric IG
  - The Long Tail of Research Data IG
  - Data Type Registry WG
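As an illustration of one of the listed standards, the sketch below builds an OAI-PMH ListRecords request and reads identifiers and Dublin Core titles from a response. The verb, the oai_dc metadata prefix, and the XML namespaces come from the OAI-PMH and Dublin Core specifications; the base URL, the helper names, and the sample response are made up for the example, and nothing is fetched over the network.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build a ListRecords request URL for a repository's OAI-PMH endpoint."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


def extract_titles(response_xml):
    """Return (identifier, title) pairs from a ListRecords response."""
    root = ET.fromstring(response_xml)
    pairs = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext("oai:header/oai:identifier", namespaces=NS)
        title = record.findtext(".//dc:title", namespaces=NS)
        pairs.append((identifier, title))
    return pairs


# Hand-written, heavily shortened sample response (not from a real repository).
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example dataset</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

print(list_records_url("https://repo.example.org/oai"))
print(extract_titles(SAMPLE_RESPONSE))
```

A full harvester would also need to follow resumption tokens and handle error responses, which this sketch leaves out.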

17 RDA Working Group – What's next?
- First working meeting at P8 (if endorsed)
- Start working on analyzing the state of the art and identifying gaps
  - Short talks and discussion
  - E.g. OAI-PMH, SWORD, METS, Linked Data Platform, ResourceSync
  - Go into D1: Research Data Repository Interoperability Primer (M6)
  - Basis for D2: Interface Specification Draft (M12)
- Workshop proposal submitted for IEEE BigData 2016

18 Conclusions
- Research data repository interoperability could help to
  - remove barriers,
  - support collaboration, and
  - create commonalities.
- RDA WG brings platform developers together to work on this topic
- Could greatly improve data sharing and exchange
- Potential for immediate adoption/benefit of outcomes for NFFA