European Grid Initiative Data Services and Solutions Part 2: Data in the cloud Enol Fernández Data Services.

Slides:



Advertisements
Similar presentations
Agile Infrastructure built on OpenStack Building The Next Generation Data Center with OpenStack John Griffith, Senior Software Engineer,
Advertisements

1 EGI Federated Clouds Task Force HEPiX Spring 2012 Workshop Matteo Turilli
SaaS, PaaS & TaaS By: Raza Usmani
Dr. Ognjen Prnjat European and Regional eInfrastructure management Greek Research and Technology Network Oceanos and synnefo - clouds.
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
January 2013 CDMI: An Introduction. Big Data Complexity Volume Speed “Big Data” refers to datasets whose size is beyond the ability of typical tools to.
Cloud based storage. Cloud Storage Storage accessed by a web service API It is a block storage, it exposes its storage to clients as Raw storage that.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 7 2/23/2015.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Ceph Storage in OpenStack Part 2 openstack-ch,
European Grid Initiative Federated Cloud update Peter solagna Pre-GDB Workshop 10/11/
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
1 NETE4631 Working with Cloud-based Storage Lecture Notes #11.
Breaking Barriers Exploding with Possibility Breaking Barriers Exploding with Possibility The Cloud Era Unveiled.
 Mike Martin  Architect  MEET Member  Crew Member of Azug  Windows Azure Insider  Windows Azure MVP  
EGI-Engage Data Services and Solutions Part 1: Data in the Grid Vincenzo Spinoso EGI.eu/INFN Data Services.
Vignesh Ravindran Sankarbala Manoharan. Infrastructure As A Service (IAAS) is a model that is used to deliver a platform virtualization environment with.
1 FedCloud Task Force Demo EGI CF2012 – Munich 28/29 March Matteo Turilli
GRNET Cloud Services and Collaborations Kostas Koumantaros {kkoum at grnet.gr}
EGI Technical Forum Madrid COMPSs in the EGI Federated Cloud Daniele Lezzi – BSC EGI Technical Forum Madrid.
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number Marios Chatziangelou, et al.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Federated Cloud 1 17 Feb 2014 Diego Scardaci, EGI.eu Technical Outreach.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Data service requirements and provisioning models Gergely Sipos With input.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI OpenSource GeoSpatial Catalogue Platform-as-a-Service Salvatore Pinto Cloud.
European Life Sciences Infrastructure for Biological Information ELIXIR Cloud Roadmap Chairs: Steven Newhouse, EMBL-EBI & Mirek Ruda,
Instituto de Biocomputación y Física de Sistemas Complejos Cloud resources and BIFI activities in JRA2 Reunión JRU Española.
DIRAC for Grid and Cloud Dr. Víctor Méndez Muñoz (for DIRAC Project) LHCb Tier 1 Liaison at PIC EGI User Community Board, October 31st, 2013.
INDIGO DATACLOUD MEETING AMSTERDAM 4-5 th APRIL 2016 Lukasz Dutka RIA INDIGO-DataCloud is co-founded by the Horizon 2020Framework Programme AMSTERDAM.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI /04/14 1 EGI Community Forum 2014 Federated Cloud image management Marios.
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number Marios Chatziangelou, et al.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
St. Petersburg, 2016 Openstack Disk Storage vs Amazon Disk Storage Computing Clusters, Grids and Cloud Erasmus Mundus Master Program in PERCCOM Author:
European Grid Initiative The EGI Federated Cloud as Educational and Training Infrastructure for Data Science Tiziana Ferrari/ EGI.eu.
Daniele Lezzi Execution of scientific workflows on federated multi-cloud infrastructures IBERGrid Madrid, 20 September 2013.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI A pan-European Research Infrastructure supporting the digital European Research.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number Federated Cloud Update.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI OpenSource GeoSpatial Catalogue Platform-as-a-Service Salvatore Pinto Cloud.
1 EGI Federated Cloud Architecture Matteo Turilli Senior Research Associate, OeRC, University of Oxford Chair – EGI Federated Clouds Task Force
EGI… …is a Federation of over 300 computing and data centres spread across 56 countries in Europe and worldwide …delivers advanced computing.
The EGI Federated Cloud
PaaS services for Computing and Storage
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
Course: Cluster, grid and cloud computing systems Course author: Prof
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
By: Raza Usmani SaaS, PaaS & TaaS By: Raza Usmani
StratusLab First Periodic Review
Unified Data Access and MGMT. in Distributed hybrid Cloud
Federated Cloud Computing
FedCloud Blueprint Update
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
Introduction to Data Management in EGI
DI4R, 30th September 2016, Krakow
EGI-Engage Engaging the EGI Community towards an Open Science Commons
Working with Cloud-based Storage
Research Data Archive - technology
AWS COURSE DEMO BY PROFESSIONAL-GURU. Amazon History Ladder & Offering.
OpenStack Ani Bicaku 18/04/ © (SG)² Konsortium.
Introduction to the EGI cloud federations
The EGI Federated Cloud
The Onedata platform Konrad Zemek, Krzysztof Trzepla ACC Cyfronet AGH
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Presentation transcript:

European Grid Initiative Data Services and Solutions Part 2: Data in the cloud Enol Fernández Data Services and Solutions Webinar1

Outline Data in the cloud Use cases and technical details Block Storage Object Storage Native cloud solutions State of the art: status in EGI Examples Future Plans Data Services and Solutions Webinar2

What is the EGI Federated Cloud 3 EGI Federated Cloud is based on: Standards and validation: federation is based on common Open-Standards – OCCI, CDMI, OVF, GLUE, etc... Heterogeneous implementation: no mandate on the cloud technology, the only condition is to expose the chosen interfaces and services. The EGI Federated Cloud is federation of institutional private Clouds, offering Cloud Infrastructure as a Service to scientists in Europe and worldwide. Data Services and Solutions Webinar

FedCloud IaaS Capabilities Computing VM Management VM Marketplace Storage Block Storage Object Storage Data Services and Solutions Webinar4

Block Storage Persistent block level storage to use with VMs Use as any other block device from VMs Snapshotable Simple usage Consistent and low- latency performance SSDs (in some sites) High Performance From GB to TB Create and attach to VMs on demand Scale to your needs VM Data Services and Solutions Webinar5

Object Storage Data storage infrastructure for storing and retrieving data from anywhere at any time Simple REST APIs for managing and accessing data API Access Store as much data as needed. Get accounted only for the space used. Scalable Define ACLs on each object, share publicly your data Sharing Data Services and Solutions Webinar6

Block Storage vs Object Storage Block StorageObject Storage Access only from within a VM only at the same site the VM is located from any device connected to the internet. Sharingnot possible possible (data can be kept private or public) Accounting for the entire volume, regardless how much of it is actually used only for the data stored Integration easy with any application capable to write/read file from a local disk requires a client to be integrated within the application Data Services and Solutions Webinar7

Use Cases Block StorageObject Storage Application hosting Data Processing Database Large Data File Storage & Backup Static Content Media Serving & Sharing Big Data Data Services and Solutions Webinar8

Block Storage: Typical Use Store your data on volumes Data persists independently of VM Stripe volumes for better performance Share via network filesystem (e.g. NFS) or use as DB store VM NFS Data Services and Solutions Webinar9

Block Storage: OCCI OCCI (Open Cloud Computing Interface) is a OGF standard API to facilitate interoperable access to cloud resources Block storage in FedCloud is managed via OCCI: create/delete volumes Attach/detach (link/unlink in OCCI terms) to VMs Once attached, use as other disk in VM Data Services and Solutions Webinar10

Object Storage: CDMI FedCloud object storage is managed via CDMI (Cloud Data Management Interface) RESTful API for operations on storage objects Developed by SNIA, now ISO/IEC Very flexible API, based on capabilities: Object basic capabilities (create/get/delete/list) Object ACLs Import from external sources, export as Filesystems … Data Services and Solutions Webinar11

Native Cloud Solutions Cloud Management Frameworks (CMF) provide their own APIs for managing cloud storage Usually more features than OCCI/CDMI However, not (yet) fully integrated in EGI’s FedCloud OpenStackOpenNebulaSynnefo Block Storage Cinder APIOpenNebula API Cinder API Object StorageSwift APIN/A Pithos API + Swift API Data Services and Solutions Webinar12

State of the Art: Block Storage Block storage is supported on all FedCloud CMFs and sites OpenStackOpenNebulaSynnefo OCCI Basic Operations Yes OCCI advanced (resize, snapshot) No Native API Basic Operations Yes Native API advanced YesPartialYes Data Services and Solutions Webinar13

State of the Art: Object Storage CDMI support CDMI server framework by Synnefo On going effort to support OpenStack Basic client available Native APIs allow basic and advanced capabilities OpenStackSynnefoOneDataOpenNebula CDMI Basic Operations In ProgressYes N/A Native APIYes N/A Data Services and Solutions Webinar14

Example: Chipster Chipster is a graphical application for data analysis, with server backend Original Chipster VM included big collection of tools and data (~200GB) Deployment at FedCloud Separated VMs from tools and data with block storage NFS server for these volumes Chipster VMs mount the NFS exports on start-up NFS Server NFS Server Tools Volume Data Volume Chipster VM Chipster VM Chipster VM Chipster VM EGI FedCloud Resource Provider Data Services and Solutions Webinar15

Example: EISCAT-3D (I) EISCAT-3D is a 3D imaging radar to be located in the northernmost parts of Europe. Open Source Geospatial Catalogue (OSGC) Portal provides access to the data stored in Object Storage providers at FedCloud Planning extra services to further process the EISCAT-3D data and make it available in the portal Data Services and Solutions Webinar16

Example: EISCAT-3D (II) 17 Open Source Geospatial Catalogue (OSGC) CESNET site (CZ) Catalogue EISCAT archive Object Storage Juelich site (DE) OpenStack SWIFT CDMI with HTTP export EGI Federated Cloud Near Real Time tool to import data automatically from receiving stations Admin tools Scientific users Data administrators Web browser wget 5m files, ~1TB in total On-site Off-site Phase 1: In ENVRI Phase 2: In EGI-Engage Pre- processing service 1 Pre- processing service N... Processing / visualization service 1 Processing / visualization service N... Data Services and Solutions Webinar

Plans EGI-ENGAGE: Effort to further develop OCCI/CDMI interfaces in FedCloud OneData development Storage Testbed Other related projects: INDIGO will develop (data) cloud solutions Data Services and Solutions Webinar18

Distributed multi-provider storage Flexible access control Intra-federations scenarios for sharing data Works with Tokens or X.509 POSIX client for mounting user’s space Scalable from Single NAS to Large Datacentre Can be deployed on top of high-performance parallel storage solutions with very small overhead < 5%. Support for open data scenarios in preparation Onedata is currently supported by: PLGrid, EGI-Engage, INDIGO-DataCloud, ESPREX for ISS Data Services and Solutions Webinar19

Storage Testbed Testbed will allow to: Test tools and setups in a distributed and big enough collection of resources Pilot applications to be migrated to production Currently looking for Resource Providers Join as users/use cases to articulate requirements and preferences for this infrastructure Data Services and Solutions Webinar20

References EGI Federated Cloud resources Wiki site: User support: User support Federated Cloud Communities: Federated Cloud Storage HOWTO: orage orage Related Standards: OCCI: CDMI: Data Services and Solutions Webinar21