1
Design your e-infrastructure!
Use case:
Break-out group coordinator: …
Krakow, 27 September 2016
2
Group members
MUG VRE Elixir AAI, Uni Barcelona, Supercomputing Munich, MUG, SurfSARA, Max Planck, CERN (B2share)
Apologies for missing somebody
3
First break-out: Background and users
4
Who will be the user? Can the users be characterised? How many are they?
Computational biologists and biologists, without strong technical skills
Requirements and challenges:
Simplicity
Tested pipelines
Provenance data for software, not only for data
Quality and optimisation
Software: not always validated for all aspects. Maybe a role for Elixir here?
1 GB per week
Problem retaining data used for experiments: need a repository compatible with EBI
B2share with a SPARQL endpoint
Get data distributed, just keep the metadata
User credentials managed locally
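The "get data distributed, just keep the metadata" idea, combined with provenance for software as well as data, could be sketched roughly as below. This is only an illustration under stated assumptions: the record fields, the `describe_dataset` and `verify` helpers, the tool name and the example URL are all hypothetical and were not decided in the break-out.

```python
import hashlib
import json
from dataclasses import dataclass, asdict, field


@dataclass
class SoftwareProvenance:
    # Hypothetical fields: which tool and version produced the data.
    name: str
    version: str


@dataclass
class DatasetRecord:
    # Metadata kept centrally; the data itself stays at the remote site.
    location: str      # where the data lives (hypothetical URL)
    size_bytes: int
    sha256: str        # checksum so a later download can be verified
    produced_by: list = field(default_factory=list)


def describe_dataset(payload: bytes, location: str, tools: list) -> DatasetRecord:
    """Build a metadata-only record for data that stays where it was produced."""
    return DatasetRecord(
        location=location,
        size_bytes=len(payload),
        sha256=hashlib.sha256(payload).hexdigest(),
        produced_by=tools,
    )


def verify(payload: bytes, record: DatasetRecord) -> bool:
    """Check that fetched bytes match the checksum stored in the record."""
    return hashlib.sha256(payload).hexdigest() == record.sha256


if __name__ == "__main__":
    data = b"example sequencing output"
    record = describe_dataset(
        data,
        "https://example.org/site-a/run-001",
        [SoftwareProvenance("aligner", "1.0")],
    )
    # Only this small JSON document would need to be stored centrally.
    print(json.dumps(asdict(record), indent=2))
    print(verify(data, record))
```

Keeping a checksum in the record lets a user confirm that data fetched later from the remote site is the same data the record describes, and the `produced_by` list carries the software provenance alongside it.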
5
What value will the envisaged system deliver for them (the whole setup)? What exactly will the system deliver to them?
A virtual research environment to integrate raw data, tools to analyse them, and a standard to establish that the results are correct
Data curation: some data is well curated, some is not (it is lost after publication); curation is not fully automated, and there is currently no option to comment on the validity of data at a later stage
Current software is hardly validated on simulated data
Get a set of validated procedures across communities
Join different communities
6
What's the timeline for development, testing and large-scale operation? (Consecutive releases can/should be considered.)
Two years to go
A working infrastructure in one year
7
Second break-out: Design and implementation plan
8
What should the first version include? The most basic product prototype imaginable that already brings value to the users (the so-called Minimum Viable Product, MVP).
Need solutions to pull data from experimental sites: at the moment they use B2safe (archiving), but they also want to share what they have
Experimenting with GridFTP and Aspera
Hope to use data transfer from Elixir and Elixir AAI
Access data from different repositories
Need EGI cloud for computation and analysis, and EGI cloud needs data from EUDAT storage (beginning with B2safe)
9
Which components/services already exist in this architecture?
Adapt the tools that exist and put everything into production
Enable a seamless process to link data with computing, basically between EUDAT and EGI
Basic infrastructures are already there
10
Which components/services are under development (and by who)?
Basic infrastructures are already there
Adapting existing analysis tools
Need for a repository; not clear yet what this will look like
If possible, B2share could be reused
EUDAT is interested in supporting the repository, even if B2share is not used
Ensure that data models are interoperable
11
Which components/services should still be brought into the system? Which EGI/EUDAT/GEANT/OpenAIRE partner can do it?
Transmitting gigabytes of data is the main problem; firewalls, hospitals and commercial entities may be a bottleneck
Enable a seamless process to link data with computing, basically between EUDAT and EGI
12
Are there gaps in the EGI/EUDAT/GEANT/OpenAIRE service catalogues that should be filled to implement the use case? Which service provider could fill the gap?
B2share: no support for graph data
EGI: seamless linking of services with EUDAT
Rely on research infrastructures to better serve the community
13
Next steps
What, who, when …