The EUBrazilOpenBio-BioVeL Use Case in EGI Daniele Lezzi, Barcelona Supercomputing Center EGI-TF2013- 16 September 2013.

Slides:



Advertisements
Similar presentations
Programming a service Cloud Rosa M. Badia, Jorge Ejarque, Daniele Lezzi, Raul Sirvent, Enric Tejedor Grid Computing and Clusters Group Barcelona Supercomputing.
Advertisements

Legacy code support for commercial production Grids G.Terstyanszky, T. Kiss, T. Delaitre, S. Winter School of Informatics, University.
System Center 2012 R2 Overview
1 EGI Federated Clouds Task Force HEPiX Spring 2012 Workshop Matteo Turilli
High memory instances Monthly SLA : Virtual Machines Validated & supported Microsoft workloads Price reduction: standard Windows (22%) & Linux (29%)
Microsoft ® Application Virtualization 4.5 Infrastructure Planning and Design Series.
Cost Effort Complexity Benefit Cloud Hosted Low Cost Agile Integrated Fully Supported.
Microsoft ® Application Virtualization 4.6 Infrastructure Planning and Design Published: September 2008 Updated: February 2010.
Additional SugarCRM details for complete, functional, and portable deployment.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Raffaele Di Fazio Connecting to the Clouds Cloud Brokers and OCCI.
UI and Data Entry UI and Data Entry Front-End Business Logic Mid-Tier Data Store Back-End.
Microsoft Virtual Academy.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
Introduction to SHIWA Technology Peter Kacsuk MTA SZTAKI and Univ.of Westminster
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
ServiceSs, a new programming model for the Cloud Daniele Lezzi, Rosa M. Badia, Jorge Ejarque, Raul Sirvent, Enric Tejedor Grid Computing and Clusters Group.
1 Taverna CISTIB Ernesto Coto Taverna Open Workshop, October 2014.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Services for advanced workflow programming.
Microsoft Azure Active Directory. AD Microsoft Azure Active Directory.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
1 openModeller Presentation Plan: Overview of openModeller OMWS: an open standard for distributed ecological niche modelling openModeller in relation to.
User Scenarios in VENUS-C Focus on Structural Analysis Ignacio Blanquer I3M - UPV.
User scenario on Marine Biodiversity AquaMaps Pasquale Pagano National Research Council (CNR) – ISTI Italy.
1 FedCloud Task Force Demo EGI CF2012 – Munich 28/29 March Matteo Turilli
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI VM Management Chair: Alexander Papaspyrou 2/25/
Cloud Computing for Ecological Modeling in the D4Science Infrastructure A. Manzi (CERN), L. Candela, D. Castelli, G. Coro, P. Pagano, F. Sinibaldi (ISTI-CNR)
OpenNebula: Experience at SZTAKI Peter Kacsuk, Sandor Acs, Mark Gergely, Jozsef Kovacs MTA SZTAKI EGI CF Helsinki.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
EGI Technical Forum Madrid COMPSs in the EGI Federated Cloud Daniele Lezzi – BSC EGI Technical Forum Madrid.
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
INFN OCCI implementation on Grid Infrastructure Michele Orrù INFN-CNAF OGF27, 13/10/ M.Orrù (INFN-CNAF) INFN OCCI implementation on Grid Infrastructure.
Cloud interoperability and elasticity with COMPSs Federated Cloud F2F Jan , Amsterdam Daniele Lezzi – Barcelona Supercomputing Center.
PLATFORM TO EASE THE DEPLOYMENT AND IMPROVE THE AVAILABILITY OF TRENCADIS INFRASTRUCTURE IberGrid 2013 Miguel Caballer GRyCAP – I3M - UPV.
An Open Data Platform in the framework of the EGI-LifeWatch Competence Centre Fernando Aguilar Jesús Marco
Instituto de Biocomputación y Física de Sistemas Complejos Cloud resources and BIFI activities in JRA2 Reunión JRU Española.
1 Globe adapted from wikipedia/commons/f/fa/ Globe.svg IDGF-SP International Desktop Grid Federation - Support Project SZTAKI.
Open Data and Cloud Computing e-Infrastructure for Biodiversity Daniele Lezzi Barcelona Supercomputing Center International Workshop on Science Gateways.
Hadoop on the EGI Federated Cloud Dr Tamas Kiss, CloudSME Project Director University of Westminster, London, UK Carlos Blanco – University.
EGI Technical Forum Madrid The EUBrazilOpenBio-BioVeL Use Case in EGI Daniele Lezzi – BSC EGI Technical Forum Madrid.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The StratusLab Distribution and Its Evolution 4ème Journée Cloud (Bordeaux, France) 30 November 2012.
Daniele Lezzi Execution of scientific workflows on federated multi-cloud infrastructures IBERGrid Madrid, 20 September 2013.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI A pan-European Research Infrastructure supporting the digital European Research.
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number Federated Cloud Update.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Overview for ENVRI Gergely Sipos, Malgorzata Krakowian EGI.eu
LOFAR - Calibration, Analysis and Modelling of Radio-Astronomy Data EGI Conference May 2015, Lisbon Daniele Lezzi – Barcelona Supercomputing.
Lifemapper 2.0 Using and Creating Geospatial Data and Open Source Tools for the Biological Community Aimee Stewart, CJ Grady, Dave Vieglais, Jim Beach.
Support to user communities in EGI with COMPSs Federated Cloud F2F Jan , Amsterdam Daniele Lezzi – Barcelona Supercomputing Center.
1 EGI Federated Cloud Architecture Matteo Turilli Senior Research Associate, OeRC, University of Oxford Chair – EGI Federated Clouds Task Force
The EGI Federated Cloud
PaaS services for Computing and Storage
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
Ecological Niche Modelling in the EGI Cloud Federation
Use of HLT farm and Clouds in ALICE
Flowbster: Dynamic creation of data pipelines in clouds
INTAROS WP5 Data integration and management
FedCloud Blueprint Update
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Large scale modelling of the distribution
The EGI Federated Cloud, architecture and use cases
Brief introduction to the project
Exam : Implementing Microsoft Azure Infrastructure Solutions
An easier path? Customizing a “Global Solution”
Azure Container Instances
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Module 01 ETICS Overview ETICS Online Tutorials
Presentation transcript:

The EUBrazilOpenBio-BioVeL Use Case in EGI Daniele Lezzi, Barcelona Supercomputing Center EGI-TF September 2013

Open Modeller Web Service (Brazilian Virtual Herbarium) openModeller Web Service (single machine) One WS instance and one oM server ~50min for a single species (until the final model is generated) How to scale? request response

Complex executions angiosperms (flowering plants) Assuming that 30% will have enough points to generate models (~9 000 species): –495k models, 540k tests, 90k projections 10 months to generate all models! But what if we want to generate models for –All ~43 thousand plant species from Brazil? –Using more than one spatial resolution? –Projecting into different environmental climatic scenarios? –With global coverage? Note: models may be regenerated every time new data is available for each species...

Open Modeller Web Service 2 (OMWS2) Open Modeller Web Service 1 is a specification to expose Open Modeller Ecological Niche Modelling (ENM) features as a Web Service. It enables decoupling interface from processing and it is used to connect OM Desktop 2 applications with Web servers. Describes the complete life cycle of a single operation of ENM – Complete specification of ENM experiments, including occurrences, parameters for the algorithms and location of layers in a single document. – One set of occurrence points, one algorithm, one operation, one set of parameters. ENM processing is embarrassingly parallel though – OMWS2 addresses this need

ENM Service (OMWS2) VENUS-C Cloud Middleware COMPSs Workflow Orchestrator OCCI CDMI EGI Federated Cloud In order to provide BioVeL with a cloud enabled openModeller endpoint, the EUBrazilOpenBio ENM service is exposed through an extended openModeller Web Service interface (OMWS+ in the picture). Such interface in EUBrazilOpenBio supports multi- staging and multiparametric experiments implemented through COMPSs and the openModeller software and managed through a Virtual Research Environment (VRE) portal. The OMWS extensions are backwards compatible with the original specification, allowing existing clients, as the Taverna Workflow Management System in BioVeL, to be fully supported in the new implementation. An Experiment Orchestrator Service acts as dispatcher of user’s requests towards different infrastructures. In the case of the EGI Federated Cloud, the VENUS-C middleware is used to instantiate openModeller workflows on cloud resources. The COMPSs Workflow Orchestrator receives the execution requests and takes care of the execution of the openModeller pipelines on dynamically deployed virtual machines. An rOCCI connector is used for the VMs management while data management supports CDMI endpoints. Shared requirements between EUBrazilOpenBio and BioVeL The EUBrazilOpenBio ENM service is exposed through an extended openModeller Web Service interface (OMWS2 in the picture). Such interface in EUBrazilOpenBio supports multi-staging and multiparametric experiments implemented through COMPSs and the openModeller software and managed through a Virtual Research Environment (VRE) portal. The OMWS extensions are backwards compatible with the original specification, allowing existing clients, as the Taverna Workflow Management System in BioVeL, to be fully supported in the new implementation. In the case of the EGI Federated Cloud, the VENUS-C middleware is used to instantiate openModeller workflows on cloud resources from different providers dynamically deployed by COMPSs. An OCCI connector is used for the VMs management while data management supports CDMI endpoints.

ENM architecture Experiment Orchestrator Service Biodiversity VRE Biodiversity Use Case GUI Open Modeller Web Service Extended VENUS-C Cloud PMES Internal Storage Condor Internal Storage Workflow Orchestrator Public clouds (EC2, Azure) Private clouds (OpenNebula, EMOTIVE) Legacy Clients Information System Information System Workspace and Storage Service Result Visualisation Service Result Visualisation Service Species Discovery Service Use Case Enactment Service OMWS Cluster OMWS Server OMWS2 VENUS-C OMWS2 OMWS Monitoring

ENM Service (OMWS2) RP1 In order to provide BioVeL with a cloud enabled openModeller endpoint, the EUBrazilOpenBio ENM service is exposed through an extended openModeller Web Service interface (OMWS+ in the picture). Such interface in EUBrazilOpenBio supports multi- staging and multiparametric experiments implemented through COMPSs and the openModeller software and managed through a Virtual Research Environment (VRE) portal. The OMWS extensions are backwards compatible with the original specification, allowing existing clients, as the Taverna Workflow Management System in BioVeL, to be fully supported in the new implementation. An Experiment Orchestrator Service acts as dispatcher of user’s requests towards different infrastructures. In the case of the EGI Federated Cloud, the VENUS-C middleware is used to instantiate openModeller workflows on cloud resources. The COMPSs Workflow Orchestrator receives the execution requests and takes care of the execution of the openModeller pipelines on dynamically deployed virtual machines. An rOCCI connector is used for the VMs management while data management supports CDMI endpoints. COMPSs Execution Service RP1 VM1 VMn RP2 COMPSs Execution Service RP2 VM1 VMn ENM Service: the OpenBio ENM service receives from the BioVeL portal simple requests for the generation of models, balancing them between different RPs. COMPSs Execution Service: deployed at each site, executes the requests to pre-deployed VMs. Additional VMs are dynamically created to serve additional requests. openModeller application: the application first checks if the layer is available, otherwise downloads it from a layers repository to the storage local to the provider. THREDDS Data Server Execution Scenario 1: Static VMs

RP1 In order to provide BioVeL with a cloud enabled openModeller endpoint, the EUBrazilOpenBio ENM service is exposed through an extended openModeller Web Service interface (OMWS+ in the picture). Such interface in EUBrazilOpenBio supports multi- staging and multiparametric experiments implemented through COMPSs and the openModeller software and managed through a Virtual Research Environment (VRE) portal. The OMWS extensions are backwards compatible with the original specification, allowing existing clients, as the Taverna Workflow Management System in BioVeL, to be fully supported in the new implementation. An Experiment Orchestrator Service acts as dispatcher of user’s requests towards different infrastructures. In the case of the EGI Federated Cloud, the VENUS-C middleware is used to instantiate openModeller workflows on cloud resources. The COMPSs Workflow Orchestrator receives the execution requests and takes care of the execution of the openModeller pipelines on dynamically deployed virtual machines. An rOCCI connector is used for the VMs management while data management supports CDMI endpoints. COMPSs Execution Service RP1 COMPSs Workflow Orchestrator VM1 VMn RP2 VM1 VMn Shared Storage COMPSs Execution Service: deployed at each site, starts the execution of complex COMPSs workflows using resources of a RP. COMPSs Orchestrator: executes (in parallel) the different parts of the complex wf using additional VMs dynamically requested from other RPs. Shared Storage Issue: the layers should be accessible via a shared storage or synchronized at each site. ENM Service (OMWS2) THREDDS Data Server Execution Scenario 2

CESNET In order to provide BioVeL with a cloud enabled openModeller endpoint, the EUBrazilOpenBio ENM service is exposed through an extended openModeller Web Service interface (OMWS+ in the picture). Such interface in EUBrazilOpenBio supports multi- staging and multiparametric experiments implemented through COMPSs and the openModeller software and managed through a Virtual Research Environment (VRE) portal. The OMWS extensions are backwards compatible with the original specification, allowing existing clients, as the Taverna Workflow Management System in BioVeL, to be fully supported in the new implementation. An Experiment Orchestrator Service acts as dispatcher of user’s requests towards different infrastructures. In the case of the EGI Federated Cloud, the VENUS-C middleware is used to instantiate openModeller workflows on cloud resources. The COMPSs Workflow Orchestrator receives the execution requests and takes care of the execution of the openModeller pipelines on dynamically deployed virtual machines. An rOCCI connector is used for the VMs management while data management supports CDMI endpoints. COMPSs Execution Service COMPSs Workflow Orchestrator OpenNebula CESGA The demo testbed Extra-Large instances: 4 cores, 15GB RAM. 10GB Disk Large instances: 2 cores, 8GB RAM. 10GB Disk OpenNebula UPV ENM Service (OMWS2) Input Data: for simplicity the layers are already stored at each site. VMs: Model operations are executed on Extra-Large instances at CESNET, Test and Project on Large instances at CESGA. This is achieved through the mapping of tasks requirements of the COMPSs workflow and RP templates. Providers: OpenNebula providers have been selected but any rOCCI compliant RP is transparently supported. Cmd Line (Complex WFs) Taverna Portal/Desktop Orchestrator COMPSs Execution Service COMPSs Workflow Orchestrator