Next generation Science Gateways in the context of the INDIGO project: a pilot case on large scale climate-change data analytics Roberto Barbera, Riccardo.

Slides:



Advertisements
Similar presentations
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Advertisements

Distributed Data Processing
CGW 2009 Vine Toolkit A uniform access and portal solution to existing grid middleware services P.Dziubecki, T.Kuczynski, K.Kurowski, D.Szejnfeld, D.Tarnawczyk,
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
Asper School of Business University of Manitoba Systems Analysis & Design Instructor: Bob Travica System architectures Updated: November 2014.
SaaS, PaaS & TaaS By: Raza Usmani
SPRING 2011 CLOUD COMPUTING Cloud Computing San José State University Computer Architecture (CS 147) Professor Sin-Min Lee Presentation by Vladimir Serdyukov.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
Understanding and Managing WebSphere V5
.NET, and Service Gateways Group members: Andre Tran, Priyanka Gangishetty, Irena Mao, Wileen Chiu.
Software to Data model Lenos Vacanas, Stelios Sotiriadis, Euripides Petrakis Technical University of Crete (TUC), Greece Workshop.
Lecture 8 – Platform as a Service. Introduction We have discussed the SPI model of Cloud Computing – IaaS – PaaS – SaaS.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
An Introduction to Software Architecture
Geospatial Systems Architecture Todd Bacastow. GIS Evolution
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI CloudBroker Platform integration into WS-PGRADE/gUSE Zoltán Farkas MTA.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
Architecting Web Services Unit – II – PART - III.
European Grid Initiative Federated Cloud update Peter solagna Pre-GDB Workshop 10/11/
Margherita Forcolin (Insiel S.p.A.) Thessaloniki, 13 October 2011.
1 © 2009 Cisco Systems, Inc. All rights reserved.Cisco Confidential Cloud Computing – The Value Proposition Wayne Clark Architect, Intelligent Network.
Digital Earth Communities GEOSS Interoperability for Weather Ocean and Water GEOSS Common Infrastructure Evolution Roberto Cossu ESA
Convert generic gUSE Portal into a science gateway Akos Balasko 02/07/
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Big data analytics workflows for climate
This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement n° Data Repositories.
A scalable and flexible platform to run various types of resource intensive applications on clouds ISWG June 2015 Budapest, Hungary Tamas Kiss,
Geospatial Systems Architecture
Cloud-access Author: Riccardo Bruno. cloud-access flow web portal A user accesses through any device to a portal requesting access to an interactive application.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Chapter 8 – Cloud Computing
Convert generic gUSE Portal into a science gateway Akos Balasko.
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
VLDATA Common solution for the (very-)large data challenge EINFRA-1, focus on topics (4) & (5)
INFSO-RI JRA2 Test Management Tools Eva Takacs (4D SOFT) ETICS 2 Final Review Brussels - 11 May 2010.
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Grant.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
INFN OCCI implementation on Grid Infrastructure Michele Orrù INFN-CNAF OGF27, 13/10/ M.Orrù (INFN-CNAF) INFN OCCI implementation on Grid Infrastructure.
INDIGO – DataCloud WP5 introduction INFN-Bari CYFRONET RIA
An Open Data Platform in the framework of the EGI-LifeWatch Competence Centre Fernando Aguilar Jesús Marco
Overview of the global architecture Giacinto DONVITO INFN-Bari.
INDIGO – DataCloud Security and Authorization in WP5 INFN RIA
Utilizzo di portali per interfacciamento tra Grid e Cloud Workshop della Commissione Calcolo e Reti dell’INFN, May Laboratori Nazionali del.
REST API to develop application for mobile devices Mario Torrisi Dipartimento di Fisica e Astronomia – Università degli Studi.
Convert generic gUSE Portal into a science gateway Akos Balasko.
Overview on the work performed during EPIKH Training Faiza MEDJEK /INFN, CATANIA 1.
A Data Engine for Grid Science Gateways Enabling Easy Transfers and Data Sharing Dr. Marco Fargetta (1), Mr. Riccardo Rotondo (2,*), Prof. Roberto Barbera.
Enabling scientific applications on hybrid e-Infrastructures: the FutureGateway framework Marco Fargetta (INFN), Riccardo Bruno (INFN), Roberto Barbera.
PaaS services for Computing and Storage
User Interfaces: Science Gateways, Workflows and Toolkits
FutureGateway Overview
Overview of the global architecture
Architecting Web Services
Data Ingestion in ENES and collaboration with RDA
Architecting Web Services
Extend user interfaces with new portlets
Ieva Juodelytė IT 3 kursas 4 grupė
EGI-Engage Engaging the EGI Community towards an Open Science Commons
PROCESS - H2020 Project Work Package WP6 JRA3
Cloud Computing.
Ramesh Baral Team: Marjani Peterson, Andre Guerrero
Cloud Modeling Framework CloudMF
Climate Data Analytics in a Big Data world
Module 01 ETICS Overview ETICS Online Tutorials
An Introduction to Software Architecture
Cloud Computing: Concepts
Remedy Integration Strategy Leverage the power of the industry’s leading service management solution via open APIs February 2018.
Presentation transcript:

Next generation Science Gateways in the context of the INDIGO project: a pilot case on large scale climate-change data analytics Roberto Barbera, Riccardo Bruno, Marco Fargetta, Emidio Giorgio, Davide Salomoni (INFN) Sandro Fiore (CMCC), Cosimo Palazzo (CMCC), Giovanni Aloisio (CMCC) Marcin Plociennik (PNSC)

Overview The INDIGO-DataCloud project INDIGO Work packages Climate Data Analysis Ophidia Catania Science Gateway Framework and FutureGateway Demo Future steps Q&A References

The INDIGO Project INDIGO is a H2020 project aiming at developing a data/compute platform targeting scientific communities Focus on cloud computing uptake, lacking in several scientific areas, especially at PaaS and SaaS level INDIGO will also develop a flexible and modular presentation layer connected to the underlying IaaS and PaaS frameworks innovative user experiences including web/desktop applications and mobile appliances

INDIGO Work Packages User communities (WP2) provide input to technical work packages WP6 responsible for the development of high-level UIs

WP6 Architecture, from a userperspective The Future Gateway Framework (FG) will provide the proper entry point for scientists Workflow solutions (e.g. Kepler) will be adopted to run distributed workflows across multiple sites Workflows will be published on community- based platforms (myExperiment) to address workflows sharing and re-use; Ophidia will be exploited as big data analytics framework; The Fabric layer will be represented by test sites with ESGF nodes (e.g. CMCC); Real dataset from the CMIP5 experiment will be used for implementation, test and validation. Interoperability with the ESGF ecosystem will be properly addressed

Climate Data Analysis Case study on Climate model intercomparison data analysis Directly related to the CMIP* experiments CMIP studies output from coupled ocean-atmosphere general circulation models Main Use cases Anomalies analysis, Trend analysis, Climate change signal analysis Ophidia is a big data analytics framework for eScience Primarily used for the analysis of climate data, exploitable in multiple domains “Datacube” abstraction and OLAP-based approach for big data Support for array-based data analysis and scientific data formats Parallel computing techniques and smart data distribution methods ~100 array-based primitives and ~50 datacube operators i.e.: data sub-setting, data aggregation, array-based transformations, datacube roll-up/drill- down, data cube import, etc.

From Catania SGF to FutureGateway The Catania Science Gateway Framework (CSGF) is a technology, based on Liferay, allowing to create a platform to access distributed computing and data infrastructures through web browsers and mobile devices Main components : AAI module, Grid & Cloud Engine, pluggable portable and modules FutureGateway will be an evolution of CSGF taking into account (also) INDIGO requirements such as the implementation of RESTful APIs for the Engine Interoperability : exposing the FutureGateway Engine API through this REST platform to eliminate Java constraints for the application, facilitating integration of both web portals and applications developed with other technologies Lightweightness : less client/server interactions and bandwidth through JSON usage Security : combination of state of the art security in HTTP/S protocols and support of AAI mechanisms defined by INDIGO

From CSGF to FutureGateway GridEngine JSAGA Portlet … Classic CSGF (before INDIGO) Liferay/Glassfish JSAGA Portlet … FutureGateway Approach (INDIGO) Liferay/Tomcat Comunication Portlet-GridEngine-JSAGA only possible with JAVA libraries API Server Comunication Portlet-API Server via REST APIs, this allows to serve external applications The API Server interacts via JAVA libraries to JSAGA REST APIs GridEngine JSAGA Portlet … Current (intermediate) approach Liferay/Tomcat API Server Engine Same model of the previous schema The API Server Engine instructs the existing Grid Engine cutting development efforts in re-writing ad hoc JSAGA handlers REST APIs Web/Mobi le Apps Web/Mobi le Apps

The demo The goal of this demo is to show the interoperability between already existing scientific software and INDIGO stack, starting from the IaaS/Paas level up to the high-level interface (toolkit, libraries, science gateways, etc.) In the current configuration, users provide input to Ophidia executions via FutureGateway FG submits the user created instance to an Ophidia cluster, and makes the output available when ready There are some limitations, though… Restricted set of instances Instances configured statically INDIGO PaaS orchestration services are not yet used Restricted set of models and related parameters (see the demo)

DEMO The actual demo…

Future steps Both FutureGateway and Climate Data Analysis use case will evolve : Rather than running a command to the Ophidia cluster, the application will instantiate dynamically a cluster on a private cloud by exploiting the Indigo PaaS FutureGateway will exploit a richer set of features for VM management INDIGO PaaS Orchestrator New data analytics workflow interfaces will enable more complex experiments From single-experiment, single-model to multi-model ensemble experiments Two-level workflows will be implemented to enable geographically distributed experiments on climate data

Questions ? Thanks for your attention ! References INDIGO web site INDIGO SG Portal (based on FutureGateway) Ophidia home page