The PaaS Layer in the INDIGO-DataCloud

Slides:



Advertisements
Similar presentations
Cloud computing is used to describe a variety of computing concepts that involve a large number of computers connected through a real-time communication.
Advertisements

System Center 2012 R2 Overview
High Performance Computing Course Notes Grid Computing.
PaaS Design and Architecture: A Deep Dive into Apache Stratos Samisa Abeysinghe VP Delivery, WSO2 Member Apache Software Foundation 10 th June 2014.
A Java Architecture for the Internet of Things Noel Poore, Architect Pete St. Pierre, Product Manager Java Platform Group, Internet of Things September.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Aneka Cloud ApplicationPlatform. Introduction Aneka consists of a scalable cloud middleware that can be deployed on top of heterogeneous computing resources.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
EGI-Engage Data Services and Solutions Part 1: Data in the Grid Vincenzo Spinoso EGI.eu/INFN Data Services.
INDIGO – DataCloud WP5 introduction INFN-Bari CYFRONET RIA
Overview of the global architecture Giacinto DONVITO INFN-Bari.
INDIGO – DataCloud Security and Authorization in WP5 INFN RIA
Project Cumulus Overview March 15, End Goal Unified Public & Private PaaS for GlassFish/Java EE Simplify deployment of Java EE Apps on top of.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
PaaS services for Computing and Storage
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
CMS Experience with Indigo DataCloud
CLOUD ARCHITECTURE Many organizations and researchers have defined the architecture for cloud computing. Basically the whole system can be divided into.
Organizations Are Embracing New Opportunities
Deployment of Flows Loretta Auvil
Vincenzo Spinoso EGI.eu/INFN
OpenLegacy Training Day Four Introduction to Microservices
Blueprint of Persistent Infrastructure as a Service
Unified Data Access and MGMT. in Distributed hybrid Cloud
Overview of the global architecture
CLIF meets Jenkins Performance testing in continuous integration, and more... Bruno Dillenseger - Orange Labs CLIF is OW2's load testing framework project,
Population Imaging Use Case - EuroBioImaging
StratusLab Final Periodic Review
Consulting Services JobScheduler Architecture Decision Template
StratusLab Final Periodic Review
Overall Architecture and Component Model
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
Federated IdM Across Heterogeneous Clouding Environment
PaaS Core Session (Notes from UPV)
StoRM Architecture and Daemons
Introduction to Data Management in EGI
Grid Computing.
INDIGO – DataCloud PaaS
An easier path? Customizing a “Global Solution”
Cloud Computing By P.Mahesh
Introduction to Cloud Computing
Management of Virtual Execution Environments 3 June 2008
Kubernetes Container Orchestration
Cloud Modeling Framework CloudMF
Kubernetes intro.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
The Onedata platform Konrad Zemek, Krzysztof Trzepla ACC Cyfronet AGH
Case Study: Algae Bloom in a Water Reservoir
The XDC project Daniele Cesini
Module 01 ETICS Overview ETICS Online Tutorials
AWS Cloud Computing Masaki.
Orchestration & Container Management in EGI FedCloud
Container cluster management solutions
Technical Capabilities
Cloud Computing: Concepts
The Anatomy and The Physiology of the Grid
MMG: from proof-of-concept to production services at scale
The Anatomy and The Physiology of the Grid
Kubernetes.
Basics of Cloud Computing
ONAP Architecture Principle Review
Check-in Identity and Access Management solution that makes it easy to secure access to services and resources.
Presentation transcript:

The PaaS Layer in the INDIGO-DataCloud Giacinto Donvito INDIGO-DataCloud WP5 Leader and TC September 2016 giacinto.donvito@ba.infn.it RIA-653549

D.Salomoni - The INDIGO-DataCloud MidnightBlue Release This is the INDIGO-DataCloud General Architecture* *: see details in http://arxiv.org/abs/1603.09536 or in https://www.indigo-datacloud.eu/documents-deliverables September 2016 D.Salomoni - The INDIGO-DataCloud MidnightBlue Release

D.Salomoni - The INDIGO-DataCloud MidnightBlue Release INDIGO MidnightBlue Four main “solution blocks”: Data Center Solutions Data / Storage Solutions Automated Solutions User-Oriented Solutions And “common solutions”: Authentication and Authorization September 2016 D.Salomoni - The INDIGO-DataCloud MidnightBlue Release

D.Salomoni - The INDIGO-DataCloud Platform PaaS Features (1) Improved capabilities in the geographical exploitation of Cloud resources. End users need not know where resources are located, since the INDIGO PaaS layer is hiding the complexity of both scheduling and brokering. Standard interface to access PaaS services. Currently, each PaaS solution available on the market is using a different set of APIs, languages, etc. INDIGO uses the TOSCA standard to hide these differences. Support for data requirements in Cloud resource allocations. Resources can be allocated where data is stored. Integrated use of resources coming from both public and private Cloud infrastructures. The INDIGO resource orchestrator is capable of addressing both types of Cloud infrastructures through TOSCA templates handled at either the PaaS or IaaS level. July 20, 2016 - Jinan Cloud School D.Salomoni - The INDIGO-DataCloud Platform

D.Salomoni - The INDIGO-DataCloud Platform PaaS Features (2) Distributed data federations supporting legacy applications as well as high level capabilities for distributed QoS and Data Lifecycle Management. This includes for example remote Posix access to data. Integrated IaaS and PaaS support in resource allocations. For example, storage provided at the IaaS layer is automatically made available to higher-level allocation resources performed at the PaaS layer. Support for distributed data caching mechanisms and integration with existing storage infrastructures. INDIGO storage solutions are capable of providing efficient access to data and of transparently connecting to Posix filesystems already available in data centers. July 20, 2016 - Jinan Cloud School D.Salomoni - The INDIGO-DataCloud Platform

D.Salomoni - The INDIGO-DataCloud Platform PaaS Features (3) Deployment, monitoring and automatic scalability of existing applications. For example, existing applications such as web front-ends or R-Studio servers can be automatically and dynamically deployed in highly-available and scalable configurations. Support for dynamic and elastic clusters of resources. Resources and applications can be clustered through the INDIGO APIs. This includes for example batch systems on-demand (such as HTCondor or Torque) and extensible application platforms (such as Apache Mesos) capable of supporting both application execution and instantiation of long-running services. July 20, 2016 - Jinan Cloud School D.Salomoni - The INDIGO-DataCloud Platform

D.Salomoni - The INDIGO-DataCloud MidnightBlue Release The INDIGO PaaS core is built upon a set of services (exposing REST interfaces) that are: Deployed Scaled Managed Upgraded Monitored Self-healed through Kubernetes (http://kubernetes.io), an open source system for managing containerized applications across multiple hosts in a cluster. September 2016 D.Salomoni - The INDIGO-DataCloud MidnightBlue Release

D.Salomoni - The INDIGO-DataCloud MidnightBlue Release September 2016 D.Salomoni - The INDIGO-DataCloud MidnightBlue Release

D.Salomoni - The INDIGO-DataCloud MidnightBlue Release September 2016 D.Salomoni - The INDIGO-DataCloud MidnightBlue Release

The AAI common glue, implemented across the entire INDIGO architecture September 2016 D.Salomoni - The INDIGO-DataCloud MidnightBlue Release

D.Salomoni - The INDIGO-DataCloud Platform AAI Features INDIGO provides an advanced set of AAI features that includes: User authentication (supporting SAML, OIDC, X.509) Identity harmonization (link heterogeneous AuthN mechanisms to a single VO identity) Management of VO membership (i.e., groups and other attributes) Management of registration and enrolment flows Provisioning of VO structure and membership information to services Management, distribution and enforcement of authorization policies A Token Translation Service (TTS), creating credentials for services that do not natively support OpenID Connect. Services that do not support OpenID Connect are for example ssh, S3 storage, OpenNebula. July 20, 2016 - Jinan Cloud School D.Salomoni - The INDIGO-DataCloud Platform

D.Salomoni - The INDIGO-DataCloud MidnightBlue Release September 2016 D.Salomoni - The INDIGO-DataCloud MidnightBlue Release

Managed Services Deployment and Applications Execution through Mesos Mesos is able to manage cluster resources (cpu, mem) providing isolation and sharing across distributed applications (frameworks) Marathon and Chronos are two powerful frameworks that can be deployed on top of a Mesos Cluster. INDIGO PaaS uses: Marathon to deploy, monitor and scale Long-Running services, ensuring that they are always up and running. Chronos to run user applications (jobs), taking care of fetching input data, handling dependencies among jobs, rescheduling failed jobs. September 2016 G. Donvito - The INDIGO-DataCloud MidnightBlue Release

Managed Services Deployment and Applications Execution through Mesos Automatic deployment through Ansible recipes embedded in TOSCA and HOT templates All the services run in docker containers; High-availability of the cluster components: Leader election among master nodes managed by Zookeeper; HA Load-balancing; Service discovery through Consul that provides also DNS functionality and health checks; services are automatically registered in Consul as soon as they are deployed on the cluster The external access to the deployed services is ensured through load-balancers in HA (unique entrypoint: cluster Virtual IP) Cluster elasticity and application auto-scaling through CLUES plugin September 2016 G. Donvito - The INDIGO-DataCloud MidnightBlue Release

Distributed Data access and transfer FTS3 implements authorized data transfer from source to destination storage endpoints using various protocols via third party transfer.  Transfers are managed via command line client, a RESTful API or a web interface. The functionality comprises: Transfer auto-tuning/adaptive optimization Endpoint-centric VO configuration Transfer multi-hop VO activity shares Multiple replica support RESTful interface Retry of failed transfers Staging files from archive Support for Oracle and MySQL database back-ends Transfer and access protocols support on top of gfal2 plugin mechanism (SRM, gridFTP, HTTP, xroot) Session/connection reuse September 2016 G. Donvito - The INDIGO-DataCloud MidnightBlue Release

Distributed Data access and transfer Dynafed implements very fast loose coupling of storage endpoints as a single name-space exposed via HTTP and WebDAV. This allows to have federation of existing storage endpoints without the need of maintaining a file catalog for global to local file name  translation. On the fly built name-space of merged meta- data items taken on demand from a number of remote and local storage endpoints. September 2016 G. Donvito - The INDIGO-DataCloud MidnightBlue Release

D.Salomoni - The INDIGO-DataCloud Platform TOSCA Topology and Orchestration Specification for Cloud Applications Standardizes the language to describe: The structure of an IT service (its topology model) How to orchestrate operational behavior (plans such as build, deploy, patch, shutdown, etc.) Leveraging the BPMN (Business Process Model and Notation) standard Declarative model that spans applications, virtual and physical infrastructures. July 20, 2016 - Jinan Cloud School D.Salomoni - The INDIGO-DataCloud Platform

D.Salomoni - The INDIGO-DataCloud Platform TOSCA in a nutshell July 20, 2016 - Jinan Cloud School D.Salomoni - The INDIGO-DataCloud Platform

https://www.indigo-datacloud.eu Better Software for Better Science. Thank you https://www.indigo-datacloud.eu Better Software for Better Science. September 2016 D.Salomoni - The INDIGO-DataCloud MidnightBlue Release