From PL-Grid to EOSC: OneData, Marketplace, and other contributions of the ACC Cyfronet AGH
Marian Bubak (1,2), Mariusz Sterzel (1), Lukasz Dutka (1), Tomasz Szepieniec (1)
(1) ACC Cyfronet AGH, Krakow, Poland
(2) Department of Computer Science AGH, Krakow, Poland
http://dice.cyfronet.pl/; bubak@agh.edu.pl

Outline
- Bottom-up approach to FAIR: users first, then support from PLGrid and feedback
- Predecessor: Collage
- PLGrid: Engaging Scientific Communities
- Cyfronet’s involvement in the EOSC
- OneData
- Marketplace
- Medical data: VPH-Share, EurValve, and CECM

Predecessor: Collage - Executable Publications
Goal: Extending the traditional publication model with computational access and interactivity mechanisms; enabling readers to replicate and verify experimentation results and browse result spaces.
Challenges:
- Scientific: A common description schema for primary data (experimental data, algorithms, software, workflows, scripts) as part of publications; deployment mechanisms for on-demand reenactment.
- Technological: An integrated architecture for storing, annotating, publishing, referencing and reusing primary data sources.
- Organizational: Provisioning of executable paper services to a large community of users representing various branches of computational science; fostering further uptake through involvement of major players in the field of scientific publishing.
P. Nowakowski, E. Ciepiela, D. Harężlak, J. Kocot, M. Kasztelnik, T. Bartyński, J. Meizner, G. Dyk, M. Malawski: The Collage Authoring Environment. In: Proceedings of the International Conference on Computational Science, ICCS 2011 (2011). Winner of the Elsevier/ICCS Executable Paper Grand Challenge.
E. Ciepiela, D. Harężlak, M. Kasztelnik, J. Meizner, G. Dyk, P. Nowakowski, M. Bubak: The Collage Authoring Environment: From Proof-of-Concept Prototype to Pilot Service. In: Procedia Computer Science, vol. 18, 2013.

PLGrid - Polish Grid and Cloud Infrastructure for Science
- Computing resources: 5+ PFLOPS, 130 000+ cores
- Scientific software: 600+ applications, tools, libraries (http://apps.plgrid.pl)
- Storage: 60+ PB, fast scratch, distributed access
- Tools for collaboration: project tracking (JIRA), version control (Git), teleconferencing (Adobe Connect)
- Computational Cloud (based on OpenStack)
- 6000+ users
- 350+ active research grants
- All Polish academic HPC centres integrated

PLGrid: Engaging Scientific Communities
- PLGrid Plus and PLGrid NG projects
- Synergy between domain-specific researchers and IT experts
- Solutions for 27 scientific domains integrated with the PLGrid Infrastructure

Cyfronet for the EOSC
- EOSCpilot: OneData for hybrid cloud, among others in the PanCancer demonstrator
- EOSC-hub:
  - OneData for EGI-DataHub (Discovery and Access): data management and transparent access to data in a distributed cloud environment
  - Marketplace: discover, order and access services from the EOSC-hub service catalogue (general & thematic services)

OneData – Transparent Data Access and Management on Hybrid Clouds
Our main goals are to:
- deliver a data management platform for large-scale and distributed problems
- make the solution decentralized and eventually consistent in order to build a mesh of data sources
- make data owned by users, not by data centres
The work is supported by: [supporter logos on the original slide]

Data in Hybrid Cloud Environments
[Figure: data spaces in a hybrid cloud. John’s spaces: Sentinel 2, Deep Learning, Publications. Sara’s spaces: Sentinel 2, Sky Maps, My Data.]

Issues Addressed by OneData
1. Multi-protocol transparent access to data (POSIX, CDMI, HTTP, …)
2. Heterogeneity of storage technologies
3. Replica management
4. Easy data sharing without borders
5. Integrated metadata management and data discoverability
6. Flexible authentication and authorization
7. Easy integration using API with external services
8. Open data support

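To illustrate the first issue listed above (transparent POSIX access), the sketch below uses nothing but standard file-system calls, which is the point of POSIX transparency: code written for local files needs no changes. It assumes that a data space has been mounted as a local directory; the mount point and the `list_files` helper are hypothetical and are not part of any OneData API, and a temporary directory stands in for the mount.

```python
import tempfile
from pathlib import Path
from typing import Dict


def list_files(mount_point: str) -> Dict[str, int]:
    """Map each file under mount_point to its size in bytes.

    Only ordinary file-system calls are used, so the same code works on a
    local disk or on any storage exposed through a mounted directory.
    """
    root = Path(mount_point)
    return {
        p.relative_to(root).as_posix(): p.stat().st_size
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


# Demo: a temporary directory stands in for the (hypothetical) mount point.
with tempfile.TemporaryDirectory() as mount:
    scene = Path(mount) / "sentinel2" / "scene.txt"
    scene.parent.mkdir(parents=True)
    scene.write_bytes(b"band data")
    print(list_files(mount))
```
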
EOSC-hub Marketplace
Discover, order and access services from the EOSC-hub service catalogue (general & thematic services)
- Discover, Order & Access Management (integrated)
- Service options
- Visibility of the service provider’s brand
- Integrated with AAI (eduGAIN, social IDs, IGTF, etc.), AARC blueprint conformant
- First release: July 2018, https://marketplace.eosc-hub.eu/
- Work in progress to build the EOSC Portal (run by the EC)
- The current Marketplace (MP) is based on 15/03/2018 EOSC-hub/OpenAIRE-Advance/eInfraCentral

Data for In-Silico Medicine
- VPH-Share: collaborative data and models, www.vph-share.eu; VPH Institute: http://www.vph-institute.org/
- EurValve: Personalised Decision Support for Heart Valve Disease, http://www.eurvalve.eu/
- CECM: Towards a CoE in in-silico medicine (Teaming), http://dice.cyfronet.pl/projects/details/CECM

VPH-Share Federated Cloud
EurValve Model Execution Environment (MEE)
- The MEE can be accessed through a dedicated GUI (the EurValve Portal), through a RESTful API or through a command-line interface, depending on the researcher’s preferences.
- Computational tasks can be run on HPC resources or in a cloud environment, as appropriate.
- A uniform security layer is provided (for data and services).
Legend:
- API – Application Programming Interface
- REST – Representational State Transfer
- Rimrock – service used to submit jobs to an HPC cluster
- Atmosphere – provides access to cloud resources
- git – a distributed revision control system

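The combination described above (a RESTful interface, two kinds of compute backend, one security layer) can be sketched as a request builder. This is only an illustration under stated assumptions: the host `mee.example.org`, the paths, the JSON body and the bearer-token scheme are all hypothetical and do not reproduce the actual MEE, Rimrock or Atmosphere APIs. The request is prepared but never sent.

```python
import json
from urllib.request import Request

# Hypothetical values: neither the host nor the paths are real MEE endpoints.
BASE_URL = "https://mee.example.org/api"
BACKENDS = {"hpc": "hpc-jobs", "cloud": "cloud-tasks"}


def build_task_request(target: str, script: str, token: str) -> Request:
    """Prepare (but do not send) a POST request that submits a task.

    The same token is attached whatever the backend, mirroring the idea of
    a uniform security layer in front of both HPC and cloud resources.
    """
    if target not in BACKENDS:
        raise ValueError(f"unknown target: {target!r}")
    body = json.dumps({"script": script}).encode("utf-8")
    return Request(
        url=f"{BASE_URL}/{BACKENDS[target]}",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


request = build_task_request("hpc", "#!/bin/bash\necho simulation", "demo-token")
print(request.get_method(), request.full_url)
```

Keeping the backend choice in a single lookup table means a client only changes the `target` argument to move a task between HPC and cloud.
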
Flow of Medical Data
BLOB data, handled according to the confidentiality level:
- Step 1 (all levels): data is sent via an encrypted channel to the service
- Steps 2-3 (high): data is encrypted and stored on disk
- Steps 4-5 (high): data is decrypted and retrieved
- Steps A-B (low): data is stored directly on disk
- Step 6 (all levels): data is sent back to the user
DB records:
- Step 1b: data is saved via an encrypted channel to the DB service in a secured location
- Step 2b: data is retrieved from the service via an encrypted channel
At present, all EurValve data is encrypted; however, steps A and B could also proceed in an unencrypted mode (if required for performance reasons).
Legend:
- API – Application Programming Interface
- BLOB – Binary Large Object
- REST – Representational State Transfer
[Figure labels: Secure locally hosted service; REST; (1b); (2b); SQL; Database access]

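The BLOB steps above amount to routing on the confidentiality level: high-confidentiality data is encrypted before it reaches the disk and decrypted on the way back, while low-confidentiality data is written as-is. The sketch below shows only that routing. All names (`Level`, `BlobService`, `toy_cipher`) are hypothetical, a dictionary stands in for the disk, and the cipher is a deliberately simple placeholder that is not secure; a real service would use a vetted encryption library. The encrypted channel of steps 1 and 6 is outside the scope of the sketch.

```python
import hashlib
from enum import Enum
from typing import Dict, Tuple


class Level(Enum):
    LOW = "low"
    HIGH = "high"


def toy_cipher(data: bytes, key: bytes) -> bytes:
    """Placeholder cipher, NOT secure: XOR with a SHA-256-derived keystream.

    Applying it twice with the same key returns the original data.
    """
    stream = bytearray()
    counter = 0
    while len(stream) < len(data):
        stream.extend(hashlib.sha256(key + counter.to_bytes(8, "big")).digest())
        counter += 1
    return bytes(b ^ k for b, k in zip(data, stream))


class BlobService:
    """In-memory stand-in for the storage service; a dict plays the disk."""

    def __init__(self, key: bytes) -> None:
        self._key = key
        self.disk: Dict[str, Tuple[Level, bytes]] = {}

    def store(self, name: str, payload: bytes, level: Level) -> None:
        if level is Level.HIGH:
            payload = toy_cipher(payload, self._key)  # steps 2-3: encrypt
        # Low-level data reaches this line unchanged (steps A-B).
        self.disk[name] = (level, payload)

    def retrieve(self, name: str) -> bytes:
        level, stored = self.disk[name]
        if level is Level.HIGH:
            stored = toy_cipher(stored, self._key)    # steps 4-5: decrypt
        return stored                                 # step 6: back to the user


service = BlobService(key=b"demo-key")
service.store("scan", b"valve geometry", Level.HIGH)
service.store("notes", b"public notes", Level.LOW)
print(service.retrieve("scan"), service.retrieve("notes"))
```

Storing the level next to each blob lets retrieval decide on its own whether decryption is needed, so callers use the same `retrieve` call for both levels.
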
More Information
- http://www.cyfronet.krakow.pl/en/
- http://www.plgrid.pl/en
- https://onedata.org/
- http://www.eosc-hub.eu/
- http://www.eurvalve.eu
- http://dice.cyfronet.pl