

Research Data Management in Munich: a LMU-LRZ-TUM working group's perspective
S. Hachinger, H. Nguyen, T. Weber (LRZ)
U. Eisold, M. Hora, T. Mader, C. Wolter (TUM)
R. Gnan, S. Kümmet, V. Schallehn, J. Schulz, M. Spenger, A. Weiss (LMU)
05.11.2018 | RDA 12th Plenary Meeting – RDARI IG Meeting

Outline
- Academic landscape in Munich & RDM demands
- RDM services of TUM/LMU/LRZ
- Summary and discussion
RDA 12th Plenary Meeting – RDARI IG 05.11.2018 | Stephan Hachinger (LRZ)

Academic Institutions in Munich: perspective of our working group
- Ludwig-Maximilians-Universität München
- Technical University of Munich
- Leibniz Supercomputing Centre

Academic Institutions in Munich: a broader perspective

Customer Demands
- TUM (40,000 students) & LMU (50,000 students) among “top” German universities
- RDM demands: general-purpose plus individual needs, e.g.:
  - TUM → large data sets in technology, science (simulations, etc.)
  - LMU → digital humanities, complex metadata
- Increasing usage of LRZ infrastructure (IaaS, PaaS, SaaS)
  - Cloud Storage
  - Compute-Cloud and HPC-Cluster systems
- Pure data-storage solutions have existed for a long time
- Full RDM solutions have been implemented since ~2010

Technical University of Munich (TUM): University Library & RDM System
Contact: eric@ub.tum.de
- Large university library (~2 million records) → Archival of scientific output & research data
- RDM project “eRIC” consists of various components:
  - Guidelines: “customized, open-source, scalable, sustainable”
  - mediaTUM (research data & document system)
  - Workbench (research data- and project-management toolbox)
  - Consulting, dissemination, co-development
- RDM life cycle support: Plan → Create/Collect → Analyse/Process → Archive/Publish → Reuse

TUM RDM Services
Contact: eric@ub.tum.de
- Secure long-term data store (up to several TB)
- Metadata annotation at registration time (different schemas)
- Publication: DOI and permalinks
- Data access via rsync/ftp
- Access rights management

TUM RDM Services – Workbench
Contact: eric@ub.tum.de
- Electronic lab book / project management tool
- Basic RDM features
- Data management plan tool
- Collaboration and sharing tools
- Link data or documents ↔ lab book entries

Ludwig-Maximilians-Universität München (LMU Munich): University Library & RDM Services
Contact: fdm-bayern@lmu.de
- Very large university library (~5 million records) → Archive of scientific output & research data
- RDM platform “Open Data LMU” (https://data.ub.uni-muenchen.de/)
- RDM project “eHumanities – interdisziplinär”, funded by Bavarian State Ministry of Sciences, Research and the Arts
- Partners:
  - University Library of Erlangen-Nuremberg
  - Ludwig-Maximilians-Universität München (LMU)
    - IT-Group for the Humanities (ITG)
    - University Library LMU
    (ITG and University Library LMU together: LMU Data Center Digital Humanities)

LMU RDM Services – Open Data LMU (since 2011)
Contact: fdm-bayern@lmu.de
- Interdisciplinary low-barrier research data repository
- Using EPrints 3
- Publication
  - DOI
  - Subject cataloging
  - OAI-PMH API (incl. DataCite)
- Next steps
  - Discovery service based on Apache Solr
  - Adding Fedora cluster
  - Tests with Blazegraph
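The OAI-PMH API mentioned above is a standard harvesting protocol, so a client can be sketched without knowing the repository internals. The following is a minimal sketch, not the Open Data LMU implementation: the endpoint path `/cgi/oai2` is an assumption (the slides do not give the OAI-PMH URL), the function names are hypothetical, and the sample response is invented for illustration. Only the `ListRecords` verb and the `oai_dc` metadata prefix come from the OAI-PMH specification itself.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# ASSUMPTION: the slides name the platform URL but not the OAI-PMH endpoint;
# "/cgi/oai2" is a guess and may not match the real service.
BASE_URL = "https://data.ub.uni-muenchen.de/cgi/oai2"

# XML namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", from_date=None):
    """Build an OAI-PMH ListRecords request URL (hypothetical helper)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date is not None:
        # Optional selective harvesting by datestamp (YYYY-MM-DD).
        params["from"] = from_date
    return base_url + "?" + urlencode(params)


def extract_titles(xml_text):
    """Return all Dublin Core titles found in an OAI-PMH response."""
    root = ET.fromstring(xml_text)
    return [el.text for el in root.iterfind(".//dc:title", NS)]


# Invented sample response, used so the sketch runs without network access.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example dataset</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

if __name__ == "__main__":
    print(list_records_url(BASE_URL, from_date="2018-01-01"))
    print(extract_titles(SAMPLE_RESPONSE))
```

A real harvester would fetch the URL, parse the response the same way, and follow the `resumptionToken` that OAI-PMH uses for paging through large result sets.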

LMU RDM Data Center Digital Humanities
Contact: fdm-bayern@lmu.de
- IT-Group for the Humanities (ITG)
  - Primary contact for researchers
  - Research and development process
  - Data quality management
  - Provision of data
- University Library (UB LMU)
  - Metadata management
  - Hosting of research data
  - Data curation
  - Long-term archiving

LRZ RDM Services
Contact: rdm@lists.lrz.de
- LRZ is
  - a world TOP-10 HPC centre
  - computing centre for Munich universities + more
  - part of the Bavarian Academy of Sciences and Humanities (BAdW)
- LRZ-RDM project adds RDM on top of pure data repositories
- LRZ-RDM customers
  - Users of LRZ HPC and data facilities (large-volume, high-performance, high-throughput)
  - Researchers not served by TUM or LMU (e.g. BAdW)
- Main focus on consulting within LRZ’s ticket system (ITSM, ISO 20k)
- Consulting services are in place; first technical RDM services are being tested with pilot users.

Data Storage & RDM Architecture at LRZ – a simplified Overview
(see also talk of T. Weber on Tuesday / Session 267)
Contact: rdm@lists.lrz.de
- LRZ metadata store & indexing/sync services (on VMWare IaaS Cloud)
- Mid volume (<10TB): MWN Cloud Storage; CIFS or NFS export
- Large volume (100TB), HPC: Data Science Storage; NFS export (trusted IPs)
- Archive (PB): IBM Spectrum Protect; tapes, HDD buffer; currently ~50PB
- AAI & MGMT Portal
- Data transfer: GLOBUS

LRZ RDM: Collaborations with Scientists & BAdW Partners
Contact: rdm@lists.lrz.de
- Co-development with researchers:
  - ClimEx project (www.climex-project.org)
  - SeisSol project (www.seissol.org)
- Repository integration:
  - www.AlpEnDAC.eu
  - VerbaAlpina (www.lmu.de/verbaalpina)
- ClimEx Pilot Use Case
  - Collaboration between LRZ, LMU and Ouranos (Québec)
  - Aim: FAIR access to 400TB of climate simulation ensemble data; rich annotation (time, geocoding, quantities, …)
- SeisSol Use Case
  - Collaboration between LRZ, LMU and TUM
  - Aim: FAIR access to code, reproducibility
- Collaboration with researchers from the BAdW (two RDM project proposals in evaluation)

RDM at Munich – Infrastructure Summary (Hardware / Platforms)
- SaaS: LRZ RDM Services; MediaTUM, Workbench; LMU Open Data, DC Digital Humanities
- PaaS: Web, “FTP” and Database Servers
- IaaS:
  - Compute: VMWare & OpenNebula Cloud, HPC
  - Storage: Cloud Storage, Data Science Storage, Tape
  - Local Servers, Storage Clusters

Summary
- Contacts:
  - LRZ RDM Team: rdm@lists.lrz.de
  - UB TUM RDM Team: eric@ub.tum.de
  - LMU RDM Team: fdm-bayern@lmu.de
- Munich: universities with over 90,000 students
- RDM activities with focus: annotation, metadata servers, DOIs
- Pragmatic split for pushing forward efficiently:
  - UB TUM: general-purpose RDM / repository + electronic lab book
  - UB LMU & ITG: general-purpose RDM / repository + extended tools for Digital Humanities
  - LRZ: RDM on top of storage solutions + high-throughput/high-volume specialization
- Co-development (use-case driven development) in all institutions