Published by Damian Booth. Modified over 5 years ago.
1
From PL-Grid to EOSC: OneData, Marketplace, and other contributions of the ACC Cyfronet AGH
Marian Bubak (1,2), Mariusz Sterzel (1), Lukasz Dutka (1), Tomasz Szepieniec (1)
(1) ACC Cyfronet AGH, Krakow, Poland
(2) Department of Computer Science AGH, Krakow, Poland
2
Outline
- Bottom-up approach to FAIR: users first, then support from PLGrid and feedback
- Predecessor: Collage
- PLGrid: Engaging Scientific Communities
- Cyfronet’s involvement in the EOSC
- OneData
- Marketplace
- Medical data: VPH-Share, EurValve, and CECM
3
Predecessor: Collage - Executable Publications
Goal: Extending the traditional publication model with computational access and interactivity mechanisms; enabling readers to replicate and verify experimentation results and browse result spaces.
Challenges:
- Scientific: A common description schema for primary data (experimental data, algorithms, software, workflows, scripts) as part of publications; deployment mechanisms for on-demand reenactment.
- Technological: An integrated architecture for storing, annotating, publishing, referencing and reusing primary data sources.
- Organizational: Provisioning of executable paper services to a large community of users representing various branches of computational science; fostering further uptake through involvement of major players in the field of scientific publishing.
P. Nowakowski, E. Ciepiela, D. Harężlak, J. Kocot, M. Kasztelnik, T. Bartyński, J. Meizner, G. Dyk, M. Malawski: The Collage Authoring Environment. In: Proceedings of the International Conference on Computational Science, ICCS 2011 (2011). Winner of the Elsevier/ICCS Executable Paper Grand Challenge.
E. Ciepiela, D. Harężlak, M. Kasztelnik, J. Meizner, G. Dyk, P. Nowakowski, M. Bubak: The Collage Authoring Environment: From Proof-of-Concept Prototype to Pilot Service. In: Procedia Computer Science, vol. 18, 2013.
4
PLGrid - Polish Grid and Cloud Infrastructure for Science
- Computing resources: 5+ PTFLOPS cores
- Scientific software: 600+ applications, tools, libraries
- Storage: 60+ PB, fast scratch, distributed access
- Tools for collaboration: project tracking (JIRA), version control (Git), teleconferencing (Adobe Connect)
- Computational cloud (based on OpenStack)
- 6000+ users
- 350+ active research grants
- All Polish academic HPC centres integrated
5
PLGrid: Engaging Scientific Communities
- PLGrid Plus and PLGrid NG projects
- Synergy between domain-specific researchers and IT experts
- Solutions for 27 scientific domains integrated with the PLGrid Infrastructure
6
Cyfronet for the EOSC
EOSCpilot:
- OneData for hybrid cloud, among others the PanCancer demonstrator
EOSC-hub:
- OneData for EGI-DataHub (Discovery and Access): data management and transparent access to data in a distributed cloud environment
- Marketplace: discover, order and access services from the EOSC-hub service catalogue (general & thematic services)
7
OneData – Transparent Data Access and Management on Hybrid Clouds
Our main goals are to:
- deliver a data management platform for large-scale, distributed problems
- make the solution decentralized and eventually consistent, in order to build a mesh of data sources
- make data belong to users, not to data centres
The work is supported by:
8
Data in Hybrid Cloud Environments
[Figure: example user spaces in a hybrid cloud. John’s spaces: Sentinel 2, Deep Learning, Publications. Sara’s spaces: Sentinel 2, Sky Maps, My Data.]
9
Issues Addressed by OneData
1. Multi-protocol transparent access to data (POSIX, CDMI, HTTP, …)
2. Heterogeneity of storage technologies
3. Replica management
4. Easy data sharing without borders
5. Integrated metadata management and data discoverability
6. Flexible authentication and authorization
7. Easy integration using API with external services
8. Open data support
10
EOSC-hub Marketplace
Discover, order and access services from the EOSC-hub service catalogue (general & thematic services)
- Discover, Order & Access Management (integrated)
- Service options
- Visibility of the service provider’s brand
- Integrated with AAI (eduGAIN, social IDs, IGTF, etc.), conformant with the AARC blueprint
- First release: July 2018
- Work in progress to build the EOSC Portal (run by the EC)
- Current MP is based on 15/03/2018 EOSC-HUB/OpenAIRE-Advance/eInfraCentral
11
Data for In-Silico Medicine
- VPH-Share: collaborative data and models; VPH Institute
- EurValve: Personalised Decision Support for Heart Valve Disease
- CECM: Towards CoE in in-silico medicine (Teaming)
12
VPH-Share Federated Cloud
13
EurValve Model Execution Environment
The MEE can be interfaced from a dedicated GUI (the EurValve Portal), through a RESTful API or through a command-line interface, depending on the researcher’s preferences. Computational tasks can be run on HPC resources or in a cloud environment, as appropriate. A uniform security layer is provided (for data and services).
API – Application Programming Interface
REST – Representational State Transfer
Rimrock – service used to submit jobs to an HPC cluster
Atmosphere – provides access to cloud resources
git – a distributed revision control system
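A minimal sketch of what the RESTful route into the MEE might look like from a client's side. The slides do not describe the actual API, so the base URL, the `/jobs` path, the JSON field names and the bearer-token header are all hypothetical. The sketch only builds the request and never sends it.

```python
import json
import urllib.request

# Hypothetical value: the slides do not give the real MEE address or schema.
MEE_BASE_URL = "https://mee.example.org/api"


def build_job_request(script: str, target: str, token: str) -> urllib.request.Request:
    """Build (but do not send) a POST request describing a computational task.

    `target` is "hpc" or "cloud", mirroring the two execution options
    mentioned on the slide. All field names are assumptions.
    """
    if target not in ("hpc", "cloud"):
        raise ValueError(f"unknown target: {target!r}")
    payload = json.dumps({"script": script, "target": target}).encode("utf-8")
    return urllib.request.Request(
        url=f"{MEE_BASE_URL}/jobs",  # hypothetical endpoint
        data=payload,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # One possible form of the "uniform security layer": a token header.
            "Authorization": f"Bearer {token}",
        },
    )


request = build_job_request("#!/bin/bash\necho hello", target="hpc", token="PLACEHOLDER")
print(request.get_method(), request.full_url)
```

Keeping request construction separate from sending makes the client easy to inspect and test without touching any real service.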
14
Flow of Medical Data
BLOB data is handled according to its confidentiality level:
- Step 1 (all levels): data is sent to the service via an encrypted channel
- Steps 2-3 (high): data is encrypted and stored on disk
- Steps 4-5 (high): data is decrypted and retrieved
- Steps A-B (low): data is stored directly to disk
- Step 6 (all levels): data is sent back to the user
DB records:
- Step 1b: data is saved via an encrypted channel to the DB service in a secured location
- Step 2b: data is retrieved from the service via an encrypted channel
At present, all EurValve data is encrypted; however, steps A and B could also proceed in an unencrypted mode (if required for performance reasons).
API – Application Programming Interface
BLOB – Binary Large Object
REST – Representational State Transfer
[Diagram labels: secure locally hosted service; REST; (1b), (2b); SQL; database access]
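The BLOB steps above can be sketched as a small store that chooses its path by confidentiality level. This is an illustration under stated assumptions, not the EurValve implementation: `BlobStore`, its methods and the in-memory "disk" are hypothetical, and the cipher is a deliberately simple placeholder (XOR with a SHA-256 keystream) that is not secure and merely stands in for a real authenticated cipher. The encrypted channel of steps 1 and 6 is out of scope here.

```python
import hashlib
import secrets
from typing import Dict, Tuple


def _placeholder_cipher(key: bytes, nonce: bytes, data: bytes) -> bytes:
    """XOR data with a SHA-256 counter keystream (symmetric: same call decrypts).

    NOT secure and NOT what EurValve uses; a stand-in so the example
    runs with the standard library alone.
    """
    out = bytearray()
    for offset in range(0, len(data), 32):
        block = hashlib.sha256(key + nonce + offset.to_bytes(8, "big")).digest()
        out.extend(b ^ k for b, k in zip(data[offset:offset + 32], block))
    return bytes(out)


class BlobStore:
    """Hypothetical in-memory stand-in for the disk behind the storage service."""

    def __init__(self, key: bytes) -> None:
        self._key = key
        # name -> (confidentiality level, nonce, bytes as written to "disk")
        self._disk: Dict[str, Tuple[str, bytes, bytes]] = {}

    def put(self, name: str, data: bytes, level: str) -> None:
        if level == "high":
            # Steps 2-3: encrypt, then store.
            nonce = secrets.token_bytes(16)
            stored = _placeholder_cipher(self._key, nonce, data)
        elif level == "low":
            # Steps A-B: store directly.
            nonce, stored = b"", data
        else:
            raise ValueError(f"unknown confidentiality level: {level!r}")
        self._disk[name] = (level, nonce, stored)

    def get(self, name: str) -> bytes:
        level, nonce, stored = self._disk[name]
        if level == "high":
            # Steps 4-5: decrypt on retrieval.
            return _placeholder_cipher(self._key, nonce, stored)
        return stored

    def raw(self, name: str) -> bytes:
        """Bytes exactly as they sit on the "disk", for inspection."""
        return self._disk[name][2]


store = BlobStore(key=secrets.token_bytes(32))
store.put("scan.bin", b"patient scan", level="high")
store.put("notes.txt", b"public notes", level="low")
print(store.get("scan.bin"), store.get("notes.txt"))
```

Recording the level next to each object lets retrieval pick the matching path without the caller having to restate it.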
15
More Information