Overview of the global architecture Giacinto DONVITO INFN-Bari.

Slides:



Advertisements
Similar presentations
Cloud computing is used to describe a variety of computing concepts that involve a large number of computers connected through a real-time communication.
Advertisements

DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
FI-WARE – Future Internet Core Platform FI-WARE Cloud Hosting July 2011 High-level description.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space User Oriented Provisioning of Secure Virtualized.
Software to Data model Lenos Vacanas, Stelios Sotiriadis, Euripides Petrakis Technical University of Crete (TUC), Greece Workshop.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Building service testbeds on FIRE D5.2.5 Virtual Cluster on Federated Cloud Demonstration Kit August 2012 Version 1.0 Copyright © 2012 CESGA. All rights.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI CloudBroker Platform integration into WS-PGRADE/gUSE Zoltán Farkas MTA.
The Data Bridge Laurence Field IT/SDC 6 March 2015.
European Grid Initiative Federated Cloud update Peter solagna Pre-GDB Workshop 10/11/
INFSO-RI Module 01 ETICS Overview Etics Online Tutorial Marian ŻUREK Baltic Grid II Summer School Vilnius, 2-3 July 2009.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
The Eucalyptus Open-source Cloud Computing System Daniel Nurmi Rich Wolski, Chris Grzegorczyk, Graziano Obertelli, Sunil Soman, Lamia Youseff, Dmitrii.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Creating SmartArt 1.Create a slide and select Insert > SmartArt. 2.Choose a SmartArt design and type your text. (Choose any format to start. You can change.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
EGI-Engage Data Services and Solutions Part 1: Data in the Grid Vincenzo Spinoso EGI.eu/INFN Data Services.
Next generation Science Gateways in the context of the INDIGO project: a pilot case on large scale climate-change data analytics Roberto Barbera, Riccardo.
OpenStack Chances and Practice at IHEP Haibo, Li Computing Center, the Institute of High Energy Physics, CAS, China 2012/10/15.
ALL INFORMATION PRESENTED AS WELL AS ALL SESSIONS ARE MICROSOFT CONFIDENTIAL AND UNDER YOUR NON-DISCLOSURE AGREEMENT (NDA) AND\OR TECHNOLOGY PREVIEW.
EGI Technical Forum Madrid COMPSs in the EGI Federated Cloud Daniele Lezzi – BSC EGI Technical Forum Madrid.
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
INDIGO – DataCloud WP5 introduction INFN-Bari CYFRONET RIA
INDIGO: Building a DataCloud Framework to Support Open Science Yin Chen, Fernando Aguilar,
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
PLATFORM TO EASE THE DEPLOYMENT AND IMPROVE THE AVAILABILITY OF TRENCADIS INFRASTRUCTURE IberGrid 2013 Miguel Caballer GRyCAP – I3M - UPV.
UPV-IBM’S BIG DATA OBSERVATORY & HADOOP INFRASTRUCTURE MANAGEMENT Damian Segrelles, Germán Moltó & Ignacio Blanquer,
An Open Data Platform in the framework of the EGI-LifeWatch Competence Centre Fernando Aguilar Jesús Marco
WP4 Summary Patrick Fuhrmann for the WP4 Tream RIA
Possible contributions of UPV to WP5 (NDIGO- DataCloud) Germán Moltó UPV RIA
INDIGO – DataCloud Security and Authorization in WP5 INFN RIA
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
INDIGO – DataCloud CERN CERN RIA
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI Accessing cloud resources through the WS-PGRADE/gUSE and CloudBroker integrated.
INDIGO DATACLOUD MEETING AMSTERDAM 4-5 th APRIL 2016 Lukasz Dutka RIA INDIGO-DataCloud is co-founded by the Horizon 2020Framework Programme AMSTERDAM.
Project Cumulus Overview March 15, End Goal Unified Public & Private PaaS for GlassFish/Java EE Simplify deployment of Java EE Apps on top of.
The INDIGO-DataCloud Data & Computing Platform for Scientific Communities RIA Giacinto Donvito INFN INDIGO-DataCloud Technical Director
Enabling scientific applications on hybrid e-Infrastructures: the FutureGateway framework Marco Fargetta (INFN), Riccardo Bruno (INFN), Roberto Barbera.
PaaS services for Computing and Storage
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
CMS Experience with Indigo DataCloud
User Interfaces: Science Gateways, Workflows and Toolkits
StratusLab First Periodic Review
The PaaS Layer in the INDIGO-DataCloud
Overview of the global architecture
Population Imaging Use Case - EuroBioImaging
StratusLab Final Periodic Review
StratusLab Final Periodic Review
PaaS Core Session (Notes from UPV)
Processing of Images: Orchestrating an Elastic Cloud (
INDIGO – DataCloud PaaS
An easier path? Customizing a “Global Solution”
Chapter 21: Cloud Computing and Related Security Issues
Chapter 22: Cloud Computing Technology and Security
Management of Virtual Execution Environments 3 June 2008
The Onedata platform Konrad Zemek, Krzysztof Trzepla ACC Cyfronet AGH
Case Study: Algae Bloom in a Water Reservoir
EGI FedCloud in Digital Humanities
Orchestration & Container Management in EGI FedCloud
Cloud Computing: Concepts
DBOS DecisionBrain Optimization Server
Joining the EOSC Ecosystem
LifeWatch AARC Pilot Fernando Aguilar 13th FIM4R Workshop
Presentation transcript:

Overview of the global architecture Giacinto DONVITO INFN-Bari

Before we start… We distinguish different basic use-cases: Authomated IaaS PaaS-like: Application execution Long Running Services Integrating distributed data infrastructures with INDIGO-DataCloud 2

Deploying of service/applications Integrating distributed data infrastructures with INDIGO-DataCloud 3 WP6 Service WP5 Service WP4 Service IAM Future Gateway portal Home IdP Tosca template repository The user, landing on the WP6 portal, can login using his/her own credential (Home IdP) IAM generates an INDIGO Token that is used in the next steps The user could chose a TOSCA Template for an already available repository WP6 layer (both portal and APIs) provide to the user the capability to customize the parameters on the TOSCA template Size, scalibility, data requirements, etc After a TOSCA “submission” WP6 will wait for a “call-back” from the orchestrator to be updated with the status of the teplate deployment TOSCA Template Docker repository WP6 APIs PaaS Orchestrator

Deploying a “IaaS automated service” This means that the users are requiring a service that is based on an IaaS istance installed and customized by automatic scripts The user will manage the host/service by his own. This for example could be useful for having a dynamic cluster of batch resources. The scaling will be managed directly at the level of PaaS Orchestrator Integrating distributed data infrastructures with INDIGO-DataCloud 4

Deploying a IaaS Automated service Integrating distributed data infrastructures with INDIGO-DataCloud 5 Future Gateway portal TOSCA Template PaaS Orchestrator Kubernetes IAM Monitoring/ Information System Accounting Monitoring QoS/SLA Policy Mgnt Data Services IM Modified TOSCA Template Cloud native APIs Heat/IM WP6 APIs

Deploying a IaaS Automated service Integrating distributed data infrastructures with INDIGO-DataCloud 6 Future Gateway portal TOSCA Template PaaS Orchestrator Kubernetes IAM Data Services FTS DynaFed Onedata Data The PaaS orchestrator accept the TOSCA Template from WP6 layer PaaS Orch. Will ask all the µServices a pieces of information (SLA, Monitoring status, Rules, etc) in order to choose the best IaaS site. The data services are important in order to define the site. The Orchestrator could ask, for example, FTS to import data into Onedata As soon as the Orchestrator has the “IaaS name”, it provide a modified version of a TOSCA templates, to an Heat/IM endpoint. If the IaaS is not TOSCA compliant we may use IM at the PaaS level as a bridge Posix/ REST APIs WebDav WP6 APIs

Deploying a IaaS Automated service Integrating distributed data infrastructures with INDIGO-DataCloud 7 Future Gateway portal TOSCA Template INDIGO PaaS Layer User Service INDIGO IaaS Data Services FTS DynaFed Onedata WP6 APIs

DATA Access/Management/Import Integrating distributed data infrastructures with INDIGO-DataCloud 8 The WP6 layer is able to exploit the APIs provided by WP5 storage layer to export: Onedata access FTS Data import/export WebDav access/browsing The data access/import will use: Web/WebDav/GFTP Dropbox-like ?? Posix (also in the Docker) The QoS Data management will exploit CDMI (WP5) CDMI/REST APIs (WP4)

DATA Access/Management/Import Integrating distributed data infrastructures with INDIGO-DataCloud 9 Future Gateway portal FTS DynaFed Onedata Data GFTP/ Posix/ REST APIs WebDav External repository WP6 Service WP5 Service WP4 Service WebDav CDMI (QoS) Web/Posix/GFTP CDMI (Management) Posix (Access/Transfer) REST APIs WebDav WP6 APIs Data

Integrating distributed data infrastructures with INDIGO- DataCloud 10

Software as a Service Use Case Interactive console Data Analytics Services VMI repo Container repo Scheduling End userApp dev Site A TOSCA Spec Data AAI Data External application CIRMMP INAF-LBT CMCC – ENES ALGAE-BLOSSOM INGV - MOIST WN TOSCA end-point Data Analytics Services Site B Data WN TOSCA end-point INDIGO-DataCloud RIA Automated IaaS

Deploying a “PaaS Service” both LRS or Application execution Both of this use-case are implemented by means of Mesos (+ its frameworks) A Long Running Service (LRS) [Mesos + Marathon] Is a PaaS like service (MySQL, Apache Tomcat, PostgreSQL, etc) This is “managed” by the PaaS layer It is monitored, restarted, etc The elasticity could be provided automatically by the PaaS layer Application execution [Mesos + Chronos] The end-user ask for executing and application, deployed via Docker Providing information about the data to be pocessed This will help in deploying the application in the correct site. The Docker container are supported as a virtualization driver: Not as complex solution with orchestration features (Magnum or similar) The orchestration layer will be provided by WP5 in order to be portable on all the IaaS Integrating distributed data infrastructures with INDIGO-DataCloud 12

Integrating distributed data infrastructures with INDIGO-DataCloud 13 Future Gateway portal TOSCA Template PaaS Orchestrator Kubernetes IAM Monitoring/ Information System Accounting Monitoring QoS/SLA Policy Mgnt Data Services IM Modified TOSCA Template Cloud native APIs Heat/IM Mesos/Marathon /Chronos Mesos Agent Deploying a “PaaS Service”: LRS or Application execution Mesos/Marathon /Chronos WP6 APIs

Integrating distributed data infrastructures with INDIGO-DataCloud 14 Future Gateway portal TOSCA Template PaaS Orchestrator Kubernetes IAM Data Services IM Modified TOSCA Template Cloud native APIs Heat/IM Mesos/Marathon /Chronos Mesos Agent Deploying a “PaaS Service”: LRS or Application execution Mesos/Marathon /Chronos User Service CDMI/Web Clues Onedata will enable the access of the data within the Docker containers So the users can access data directly from their code very transparently Dynafed could be used both with real remote access, via Parrot, or via fuse, depending on the use-cases. The cointainer will be automatically configured accordingly WP6 APIs Mesos Agent Clues is used for managing the scalability of the clusters: IaaS Automation and Mesos Clusters This is more “flexible” then automatic Heat scalability Will be IaaS indipendent Clues will interact directly with the orchestrator Other PaaS core Services

Scientific Computational Portal “as a Service” Use Case Portal / Access services LBaaS VMI repo Container repo Scheduling End userApp dev Site A TOSCA Spec Storage AAI Use Case Portal / Access services LRMS front- end LRMS WN Site B LRMS WN Storage Site-level Scheduling TOSCA end- point VPNaaS ELIXIR Haddock CIRMMP FedCloud DARIAH INAF-LBT CMCC – ENES CTA ALGAE-BLOSSOM INGV - MOIST INDIGO-DataCloud RIA Automated IaaS PaaS Services ???

WAN bursting The main problem is the “overlay network” On top of what WP4 will be able to provide WP5 is investigating a SW solution for federated Mesos Clusters at geographical level A fist implementation of geographical batch cluster could exploit HTCondor where the network problem is solved at the “batch system level” As soon as a solution for the overlay network is usable: we may exploit inter-IaaS Mesos clusters Integrating distributed data infrastructures with INDIGO-DataCloud 16

WP4 architecture Clear interfaces both for Computing and Storage: TOSCA directly provided by Heat/IM Will allow users exploiting TOSCA also without PaaS layer The capability to exploit Docker container natively when needed/possible A better scheduling system at the level of IaaS QoS Management Standard protocols for data access/transfers The main aim is to exploit solutions/functionalities that are available on both the supported IaaS Integrating distributed data infrastructures with INDIGO-DataCloud 17

Open issues How to specify data on TOSCA Templates We need to monitor/index also Mesos Cluster deployed on the IaaS Final decision about CDMI vs REST at WP4 Level We need a Web Inteface for managing the QoS in a user-friendly way The grid job are supported by means of gateways? Not all the services have the same level of maturity … We should find, togheter with the user communities, the right steps for introducing more complexity/features as the services are ready... Integrating distributed data infrastructures with INDIGO-DataCloud 18

… with the goals of building a platform... Integrating distributed data infrastructures with INDIGO-DataCloud 19 Credit to German Moltò