Get Data to Computation
B2STAGE: How to shift large amounts of data
eudat.eu/b2stage
Version 4, February 2016
This work is licensed under the Creative Commons CC-BY 4.0 licence. Attribution: EUDAT –

B2STAGE is…
a reliable, efficient, lightweight and easy-to-use service for transferring research data sets between EUDAT storage resources and high-performance computing (HPC) workspaces.

A truly pan-European Infrastructure
EUDAT offers common data services to both research communities and individuals through a network of 35 European organisations. EUDAT wants to enable European researchers from any discipline to preserve, find, access and process data in a trusted environment, as part of a Collaborative Data Infrastructure.
[Figure labels: European infrastructures, Technology Providers, Research Communities]

Community-Driven Solutions
EUDAT services are designed, built and implemented based on user community requirements.
[Figure labels: Physical Sciences & Engineering, Social Sciences & Humanities, Materials & Analytical Facilities, Environmental Sciences, MAPPER, Biomedical & Medical Sciences]

The EUDAT Service Suite

B2STAGE Features
Facilitating communities to:
- move large amounts of data between data stores and high-performance compute resources
- re-ingest computational results back into EUDAT
- deposit large data sets into EUDAT resources for long-term preservation
Features:
- high-speed transfer
- reliable and lightweight
- manages persistent identifiers (PIDs)

Why use B2STAGE? High-level benefits
- Research challenges are getting larger and more complex, e.g. full-Earth climate simulation, coupled simulations of multiple organs in the human body, and seismic analyses of earthquakes at continental scale.
- Researchers' data and compute demands are rising fast.
- Efficient transfer of data to high-performance computing (HPC) workspaces is essential, especially in distributed computing, where resources are geographically dispersed.

Why use B2STAGE? Specific user requirements
- Facilitates transfer of large data collections from EUDAT storage resources to HPC facilities.
- Provides the means to re-ingest computational results back into the EUDAT infrastructure.
- Ingests data sets into EUDAT resources for long-term preservation.
- Offers reliable, efficient, easy-to-use tools to manage data transfers.
- The Data Staging Script is the only tool handling data transfer using PIDs.

Who can use B2STAGE?
- Researchers can transfer large data collections from EUDAT storage resources to HPC facilities for processing.
- Community Managers can replicate community data through a lightweight service and ingest data sets into EUDAT storage resources for long-term preservation.

How can you use B2STAGE?
- EUDAT offers B2STAGE to all registered researchers and interested communities, so that they can stage data out of EUDAT and ingest computational results back in.
- Access to remote HPC facilities is not part of the service: individual users should negotiate and arrange it in parallel.
- To help researchers use the B2STAGE service, EUDAT offers documentation, training material and a service helpdesk.
For more information please

How can you use B2STAGE? (continued)

How does B2STAGE work?
[Figure: components are a user desktop with a GridFTP client, a GridFTP server with iRODS-DSI, an HPC GridFTP server, and a PID Registry; the arrows are labelled "data", "control" and "PID control".]

How does B2STAGE work? (continued)
[Figure: components are a user desktop with a GridFTP client and a file system, a GridFTP server with iRODS-DSI, and a PID Registry; the arrows are labelled "PID", "data" and "control".]
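The two diagrams above can be read as a two-step flow: look up a data object's location through its PID, then ask a GridFTP client to move the data between servers. The Python sketch below illustrates that reading only; it is not the Data Staging Script. The in-memory registry, the `PidRecord` structure, the PID string and the host names are all hypothetical, and the only real tool referenced is the `globus-url-copy` GridFTP client, whose command line is built here but not executed.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class PidRecord:
    """Hypothetical stand-in for one PID registry entry."""
    pid: str
    url: str  # GridFTP location of the data object


def resolve_pid(registry: Dict[str, PidRecord], pid: str) -> str:
    """PID step: map a persistent identifier to the data object's URL."""
    try:
        return registry[pid].url
    except KeyError:
        raise LookupError(f"PID not found in registry: {pid}") from None


def build_transfer_command(source_url: str, dest_url: str, streams: int = 4) -> List[str]:
    """Control step: build (not run) a globus-url-copy command line.

    -vb reports transfer performance, -p sets the number of parallel streams.
    """
    if streams < 1:
        raise ValueError("streams must be at least 1")
    return ["globus-url-copy", "-vb", "-p", str(streams), source_url, dest_url]


# Hypothetical registry content and host names, for illustration only.
registry = {
    "11100/example-pid": PidRecord(
        pid="11100/example-pid",
        url="gsiftp://eudat-node.example.org/zone/home/user/data.tar",
    )
}

src = resolve_pid(registry, "11100/example-pid")
cmd = build_transfer_command(src, "gsiftp://hpc.example.org/scratch/user/data.tar")
print(" ".join(cmd))
```

Because both URLs use the `gsiftp://` scheme, the command describes a server-to-server transfer steered from the desktop client, which matches the separate "control" and "data" arrows in the first diagram. Ingesting results back would swap the source and destination.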

B2STAGE User communities
- The VPH community is ingesting data onto EUDAT resources: approximately 12 TB will be ingested through this service.
- VPH data is also replicated between the RZG and PSNC sites.
- B2STAGE will foster collaboration with EGI and PRACE to develop cross-infrastructure usage: it will be the main service enabling interoperability between these infrastructures.
- Numerous new communities are expected to adopt it as part of the 2015 and 2016 Calls for Collaboration.
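To give a feel for what an ingest of roughly 12 TB means in practice, the following back-of-the-envelope sketch estimates transfer time at a sustained rate. The link speeds and the efficiency factor are assumptions chosen for illustration and do not come from the slides; 1 TB is taken as 10^12 bytes.

```python
def transfer_time_hours(size_tb: float, rate_gbit_s: float, efficiency: float = 1.0) -> float:
    """Estimate hours needed to move size_tb terabytes (decimal, 1e12 bytes).

    rate_gbit_s is the nominal link rate in Gbit/s; efficiency (0 < e <= 1)
    scales it down to an assumed sustained throughput.
    """
    if size_tb < 0 or rate_gbit_s <= 0 or not (0 < efficiency <= 1):
        raise ValueError("size must be >= 0, rate > 0, and 0 < efficiency <= 1")
    size_bits = size_tb * 1e12 * 8
    effective_bits_per_s = rate_gbit_s * 1e9 * efficiency
    return size_bits / effective_bits_per_s / 3600


# Assumed link rates; real sustained throughput depends on the sites involved.
for rate in (1, 10):
    print(f"12 TB at {rate} Gbit/s: {transfer_time_hours(12, rate):.1f} h")
```

At a fully used 1 Gbit/s link the estimate is a little over a day, and about a tenth of that at 10 Gbit/s, which is why sustained throughput matters for collections of this size.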

B2STAGE summary
B2STAGE offers:
- data staging functionality to transfer data easily and efficiently from EUDAT storage resources to HPC facilities
- a powerful mechanism to ingest data onto EUDAT resources
- a script to facilitate staging, ingest, and retrieval of PID information for transferred data
B2STAGE is unique in handling PIDs for the data.

Future features
The Data Staging Script will be replaced by a modular and extensible Python library that gives users a programmable interface to most of the EUDAT services.

For more info:
User documentation:
Thank you