Photon & Neutron working meeting

Slides:



Advertisements
Similar presentations
1 Applications Virtualization in VPC Nadya Williams UCSD.
Advertisements

EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
Sync and Exchange Research Data b2drop.eudat.eu This work is licensed under the Creative Commons CC-BY 4.0 licence B2DROP EUDAT’s Personal.
European Grid Initiative Data Services and Solutions Part 2: Data in the cloud Enol Fernández Data Services.
European Life Sciences Infrastructure for Biological Information ELIXIR Cloud Roadmap Chairs: Steven Newhouse, EMBL-EBI & Mirek Ruda,
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
EGI… …is a Federation of over 300 computing and data centres spread across 56 countries in Europe and worldwide …delivers advanced computing.
Petr Škoda, Jakub Koza Astronomical Institute Academy of Sciences
PaaS services for Computing and Storage
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
Accessing the VI-SEEM infrastructure
The EGI Training Infrastructure
Containers as a Service with Docker to Extend an Open Platform
Diego Scardaci EGI Technical Outreach Expert
BEST CLOUD COMPUTING PLATFORM Skype : mukesh.k.bansal.
EGI Updates Check-in Matthew Viljoen – EGI Foundation
Foundations of Data Science
AAI for a Collaborative Data Infrastructure
Unified Data Access and MGMT. in Distributed hybrid Cloud
The PaaS Layer in the INDIGO-DataCloud
Federated Cloud Computing
Population Imaging Use Case - EuroBioImaging
ICOS on-demand atmospheric transport computation A use case for interoperability of EGI and EUDAT services Ute Karstens, André Bjärby, Oleg Mirzov, Roger.
Technical Meeting with CNR and INAF 7 October 2014
KER - Open Data Platform
Data Ingestion in ENES and collaboration with RDA
Fernando Aguilar, IFCA-CSIC
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
Ideas for an ICOS Competence Centre Implementation of an on-demand computation service Ute Karstens, André Bjärby, Oleg Mirzov, Roger Groth, Mitch Selander,
EGI/EUDAT/INDIGO-DataCloud Joint project proposal for EINFRA-12 A
Check-in Nicolas Liampotis
Introduction to Data Management in EGI
VI-SEEM Data Repository
EGI-Engage Engaging the EGI Community towards an Open Science Commons
An easier path? Customizing a “Global Solution”
VI-SEEM Data Repository
Identity Management and Authorization
University of Technology
EGI-Engage: Final Review Giuseppe La Rocca
Vision for Open Science
Status report of the LToS platform
Data Access and Re-use Carl Johan Håkansson EUDAT Service Area Manager
Diego Scardaci (EGI.eu)
Solutions for federated services management EGI
Introduction to the EGI cloud federations
Climate Data Analytics in a Big Data world
Introduction to D4Science
The Onedata platform Konrad Zemek, Krzysztof Trzepla ACC Cyfronet AGH
Case Study: Algae Bloom in a Water Reservoir
EGI FedCloud in Digital Humanities
Data science and machine learning at scale, powered by Jupyter
EOSCpilot All Hands Meeting 8 March 2018 Pisa
The SADE mini-project of the EGI DARIAH Competence Centre
ELIXIR Competence Center
Pre-OMB meeting Preparation for the Workshop “EGI towards H2020”
Community AAI with Check-In
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Copernicus Data in the EGI infrastructure
EGI FedCloud User Support coordination meeting
MMG: from proof-of-concept to production services at scale
Technical Outreach Expert
Enol Fernandez & Giuseppe La Rocca EGI Foundation
Joining the EOSC Ecosystem
Expand portfolio of EGI services
EOSC-hub Contribution to the EOSC WGs
Check-in Identity and Access Management solution that makes it easy to secure access to services and resources.
LifeWatch AARC Pilot Fernando Aguilar 13th FIM4R Workshop
EGI High-Throughput Compute
Presentation transcript:

Photon & Neutron working meeting Giuseppe La Rocca, EGI Foundation giuseppe.larocca@egi.eu

Outline Status of the EGI Notebooks service Access to data services with EGI Notebooks service Integration with B2DROP Reproducible Open science with EGI Notebooks, Zenodo and Binder Status of the AAI integration in the PANOSC cluster project Data archiving solutions Open discussion

EGI Notebooks Jupyter as a Service in the EGI Cloud

The Jupyter Notebook in a nutshell Non-profit, open-source, interactive platform for Data Science born out of the iPython project in 2014 Released under the BSD license Notebooks can be shared with others using email, Dropbox, GitHub Interactive widgets

The EGI Notebooks service JupyterHub hosted in the EGI Cloud Offers Jupyter notebooks ‘as Service’ One-click solution: login and start using Extra EGI Features: Login with the EGI AAI Check-In service Persistent storage for notebooks Bring your own environments/kernels Use EGI computing and storage resources from your notebooks

Single Sign-On (SSO) Completely integrated with EGI Check-in Login with eduGAIN, social (Google, Facebook, LinkedIn) or EGI SSO Fine grained authorisation VO membership Role, group, …

Persistency Persistent directory linked from home User decides what to keep NFS storage EGI DataHub/Onedata Other files coming from the notebook environment

Custom environments Easy to extend with your own notebook environment Docker container image with JupyterHub v0.9 No root user Uses $HOME for notebooks User select what to start when creating their notebooks

JupyterLab interface

Kubernetes cluster powered by EGI Cloud Compute Technology Stack https://notebooks.egi.eu EGI CheckIn Kubernetes cluster powered by EGI Cloud Compute SSL Certificate Persistent Storage Kubernetes Ingress

Service modes Catch-all instance VO/Community deployments Available via the Marketplace Limited resources: 1 CPU, 1GB RAM and 10GB of persistent storage Sponsored access (free for the users) Kills notebooks after 1 hour of inactivity (Use the persistent folder  Don’t loose your work!) VO/Community deployments Tailored to specific VO with custom computing/storage, e.g.: access to GPUs, fat nodes access to Spark, other BigData/ML environments auto-mount filesystems on notebooks Community deployment for training

Status Catch-all instance is available for production users User has to register in Check-in at the first login Deployed on INFN-CATANIA-STACK and CESGA cloud sites Community-specific deployments E.g. for AGINFRA+ community E.g. for training (deployed on CESNET cloud) Explore Binder technology

“Integrating EGI Notebooks with EGI DataHub” “EOSC-hub Data Platforms for data processing and solutions for publishing and archiving scientific data (PART II)” During the EOSC-hub week, 12 Friday at 11:00 – 12:30  Room: Cracow I+II

What do we want to do? Several of the use cases in EOSC-hub will enable scientific end-users to perform data analysis experiments on large volumes of data, by exploiting a PID-enabled, server-side, and parallel approach. Users expect easy to use interfaces like Jupyter Notebooks for interacting with the system. Producing reusable results following FAIR guidelines Findability, Accessibility, Interoperability, and Reusability.

How? Analysis Notebooks / JupyterLab FedCloud resources Data management DataHub / Onedata Space Onezone Oneprovider Oneclient AAI (OIDC) Check-in PID management B2HANDLE Handle.net Cataloguing and discovery B2FIND A User goes to a Jupyter notebooks service to run an interactive analysis of open data Notebooks is deployed on a Kubernetes cluster running on EGI FedCloud resources DataHub used for Data Management Onedata concepts and components Space: virtual volume where users organize their data. A space is supported by one or multiple Oneproviders. Onezone: the federation and authentication service, user for AAI, management and web access. Oneprovider: a data management component deploy in data centres to expose data and manage transfers Oneclient: a client application allowing to mount a space locally Management of publication with PID Exposition of dataset using OAI-PMH PID generation by B2HANDLE using Handle.net DataHub OAI-PMH endpoint is harvested and indexed by B2FIND and can be discovered Check-in for handling user authentication and authorisation to services with federated AAI Implementation of AARC’s blueprint FAIR Findability: B2FIND, B2HANDLE and Handle.net Accessibility: Oneprovider Interoperability: APIs, standards (OAI-PMH) Reproducibility: Notebooks, Oneprovider

Integrating EGI Notebooks with EUDAT B2DROP

B2DROP B2DROP offers online storage for researchers Based on owncloud Researchers can collaborate with others and work together on the data. The data can be shared with external collaborators (B2DROP users or not) The data can be shared with other platforms, which support the open cloud mesh. Based on owncloud Exposes webdav interface for accessing files

Integration with B2DROP My_folder In synch

Produce reproducible Open Science with Binder, Zenodo and EGI Notebooks Slide decks/material at: https://documents.egi.eu/document/3442

How to make reproducible Open Science Credit: Juliette Taka https://twitter.com/JulietteTaka

How it works Binderhub: Provides an interface to create, and share the sessions Pulls code from repository mybinder.org: a public instance of Binderhub hosted on Google Cloud repo2docker: Creates reproducible containers from repositories jupyterhub: Generates user sessions that serve these containers Kubernetes: manages the computing infrastructure

The Open Science story: the current step EGI Notebooks services MyBinder.org Download ipynb file Execute Re-execute Create repository Upload ipynb file Add requirements.txt GitHub Your repository Provide GitHub project reference Your laptop Specify GitHub repo Generate DOI Zenodo Obtain GitHub project reference Data repository Fellow researchers Discover Notebook (use DOI) Journal paper DOI

An Open Science story: The next step DataHub B2DROP Etc. EGI Notebooks and Binder service Distributed big data Download ipynb file Execute Create repository Upload ipynb file Add requirements.txt Provide GitHub project reference GitHub Your repository Your laptop Generate DOI Specify GitHub repo Generate DOI Zenodo Obtain GitHub project reference Data repository Fellow researchers Discover Notebook (use DOI) Journal paper DOI

Upcoming events “EOSC-hub Data Platforms for data processing and solutions for publishing and archiving scientific data (PART II)” During the EOSC-hub week, 12 Friday at 11:00 – 12:30  Room: Cracow I+II JupyterHub Developments training @ EGI Conference 2019, 06-08 May 2019, Amsterdam Agenda: https://indico.egi.eu/indico/event/4431/session/4/?slotId=0#20190506

References Onedata EGI DataHub EGI Notebooks EGI Check-in B2FIND https://onedata.org EGI DataHub https://datahub.egi.eu - http://egi-datahub.readthedocs.io/ EGI Notebooks https://www.egi.eu/services/notebooks/ - https://notebooks.egi.eu/ EGI Check-in https://www.egi.eu/services/check-in/ - https://wiki.egi.eu/wiki/AAI B2FIND https://eudat.eu/services/b2find - http://eudat7-ingest.dkrz.de/ B2HANDLE https://eudat.eu/services/b2handle - https://hdl.grnet.gr:8001/api/handles