Case Study: Algae Bloom in a Water Reservoir

Slides:



Advertisements
Similar presentations
ProActive Task Manager Component for SEGL Parameter Sweeping Natalia Currle-Linde and Wasseim Alzouabi High Performance Computing Center Stuttgart (HLRS),
Advertisements

EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
DORII review Deployment and management of production infrastructure SA2 Ioannis Liabotis Greek Research and Technology Network - GRNET.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
INDIGO – DataCloud WP5 introduction INFN-Bari CYFRONET RIA
INDIGO: Building a DataCloud Framework to Support Open Science Yin Chen, Fernando Aguilar,
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
An Open Data Platform in the framework of the EGI-LifeWatch Competence Centre Fernando Aguilar Jesús Marco
Overview of the global architecture Giacinto DONVITO INFN-Bari.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number Federated Cloud Update.
LOFAR - Calibration, Analysis and Modelling of Radio-Astronomy Data EGI Conference May 2015, Lisbon Daniele Lezzi – Barcelona Supercomputing.
Scientific Data Processing Portal and Heterogeneous Computing Resources at NRC “Kurchatov Institute” V. Aulov, D. Drizhuk, A. Klimentov, R. Mashinistov,
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
Yin Chen, EGI.eu Fernando Aguilar, , IFCA-CSIC
PaaS services for Computing and Storage
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
CMS Experience with Indigo DataCloud
Accessing the VI-SEEM infrastructure
PIDs in EUDAT Webinar, 15 Februari 2013
EOSC Services for Scientists
SA2 Knowledge Commons EGI-LifeWatch Competence Center
Regional Operations Centres Core infrastructure Centres
The PaaS Layer in the INDIGO-DataCloud
aspects of archive system design
Overview of the global architecture
Federated Cloud Computing
Population Imaging Use Case - EuroBioImaging
Supporting Research on Biodiversity: LifeWatch on the Cloud
IaaS Layer – Solutions for “Enablers”
INTAROS WP5 Data integration and management
Exploitation and Sustainability updates
Some ideas on possible INDIGO participation to the EINFRA call
Defining and tracking requirements for New Communities
KER - Open Data Platform
Data Ingestion in ENES and collaboration with RDA
Fernando Aguilar, IFCA-CSIC
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
PaaS Core Session (Notes from UPV)
Ideas for an ICOS Competence Centre Implementation of an on-demand computation service Ute Karstens, André Bjärby, Oleg Mirzov, Roger Groth, Mitch Selander,
Data Ingestion in EMSO Presented by Marco Pappalardo
Processing of Images: Orchestrating an Elastic Cloud (
Robert Szuman – Poznań Supercomputing and Networking Center, Poland
DI4R, 30th September 2016, Krakow
EGI-Engage Engaging the EGI Community towards an Open Science Commons
The INDIGO-DataCloud contributions to the EOSC and next steps
An easier path? Customizing a “Global Solution”
INDIGO - DataCloud Dissemination activities
PROCESS - H2020 Project Work Package WP6 JRA3
Management of Virtual Execution Environments 3 June 2008
Connecting the European Grid Infrastructure to Research Communities
Solutions for federated services management EGI
Interoperability Pilots WP6/T6.3 Doina Cristina Duma (INFN)
& Fujitsu Innovation Gathering
The Onedata platform Konrad Zemek, Krzysztof Trzepla ACC Cyfronet AGH
ExaO: Software Defined Data Distribution for Exascale Sciences
The XDC project Daniele Cesini
Report on GLUE activities 5th EU-DataGRID Conference
Ruđer Bošković Institute, Croatia
LifeWatch Cloud Computing Workshop
Open Data from a Water Reservoir Platform
Quoting and Billing: Commercialization of Big Data Analytics
Technical Outreach Expert
Joining the EOSC Ecosystem
WP6 – EOSC integration J-F. Perrin (ILL) 15th Jan 2019
Expand portfolio of EGI services
Introduction to the SHIWA Simulation Platform EGI User Forum,
EOSC-hub Contribution to the EOSC WGs
Photon & Neutron working meeting
LifeWatch AARC Pilot Fernando Aguilar 13th FIM4R Workshop
Presentation transcript:

Case Study: Algae Bloom in a Water Reservoir Presented by Fernando Aguilar (IFCA-CSIC) aguilarf@ifca.unican.es INDIGO-DataCloud WP2 INDIGO Review, Bologna, 7th November 2016 RIA-653549

Case Study: Algae Bloom in a Water Reservoir Research Community: LifeWatch (ESFRI) Topic/Area: Biodiversity & Ecosystem research Objective of the Case Study: Monitor the evolution of the potential eutrophication of a Water Reservoir including the Data Life Cycle management. Hydrodynamic and Water Quality models for forecasting. Schedule: first version of model running by the end of the year. Prototype of Data Life Cycle Management by the second quarter of 2017. In production by third quarter 2017. Innovation challenge: Different components at different Data Life Cycle stages. Each Model test requires ~20GB and potentially o(100-1000) (multi-parametric) Teams involved: IFCA/CSIC Team + Ecohydros (SME) Team (consulting). Final user community: Researchers (LifeWatch Community), Water management authorities, ICT Groups, Limnology groups. Impact: Pro-active management actions on water reservoirs , including new policies. Definition of monitoring instrumentation and parameters to be under control.

Analysis of requirements and solution SPECIFIC REQUIREMENTS GENERIC REQUIREMENTS INDIGO SERVICES INTEGRATED LWAB#1: Model Processing CO#2 Deployment of customized computing back-ends as batch queues FutureGateWay CO#4 Automatic elasticity of computing batch queues Orchestrator (TOSCA, Mesos) LWAB#2: Distributed Storage SO#1, SO#2 Shared storage accessible like a POSIX filesystem, Persistent data storage OneData SO#11 Dropbox-like storage Testbed resources used Bari Mesos Cluster Bari OneProvider Data Center Solutions User-Oriented Solutions Data / Storage Solutions Authentication and Authorization Automated Solutions

Solution Developed provider-RECAS-BARI https://fgw01.ncg.ingrid.pt/lifewatch-test http://orchestrator01-indigo.cloud.ba.infn.it:8080 provider-RECAS-BARI https://iam.indigo-datacloud.eu/login Local OneClient

Demo description Testbed resources that will be used: Bari TestBed Teams involved: INFN/Bari, PSNC, IFCA/CSIC Prerequisites: application in Docker Sequence of actions Connect to IAM to access OneData. Input data upload. Access to the Graphical User Interface (FutureGateway). Fill the form (OneData, Access, Sweep Parameter values). Submit. The TOSCA template edited and sent to the orchestrator. Check Deployment status. After finishing, output accessible via OneData. Comparing models using Delft3D tools. Final outcome: output of models executed showing algae growth NOW WE SWITCH TO THE DEMO SCREEN…

INDIGO added value Scalable (storage and computing) resources in the cloud to perform o(100-10000) tests… …and share directly within the community User Friendly interface to use cloud resources: Final users only need to fill a form to submit a new simulation, avoiding the script edition or direct contact with the infrastructure (Supercomputer, Grid, Cloud) (very helpful for non IT experts). First time we use a flexible and “universal” user authentication (quite relevant to collaborate with SMEs also) Transparent access to shared large storage (OneData)

Few words on Data Ingestion Data from CdP Reservoir. Raw – Curated – Processed/Derived Real Time monitoring ~5GB. Model Data ~20GB for each 3D model. METADATA standards employed: EML Available in Pilot tests: Storage, transfer for processing, AAI integration (Shown in Demo). QoS needs: rules for preserving datasets. License, periodicity. And what is yet to be improved Data will be available through our Open Science Framework It can get DOI from OSF itself or OneData. Only Curated or Processed/Derived levels. Curated and Processed/Derived data need to be preserved. Regarding Data Ingestion, INDIGO services enable: Deployment of Data Management Plans Tools (Over cloud clusters). Data Collect (OneData). Deployment of Open Science Framework (Cloud Clusters, docker deployment). Curate (deploying software on the infrastructure. Storing OneData). Analyze/Process: Software deployment thanks to orchestrator (Demo). Publish: OneData. Deployment of catalogue or repository services. Preservation: QoS. Metadata management: OneData

Exploitation It will be used in the LifeWatch Environment. Lifewatch VO supporters use EGI FedCloud (e.g. IFCA, LIP). Solution presented at Delft3D Users Meeting (Netherlands, last week!) RDA 8th Plennary EGU 2017 Papers, agreements with others, any other plan for exploitation Publication being prepared with ECOHYDROS (SME) team System in production, being applied in other lakes/water reservoirs (Cogotas, Sanabria)

https://www.indigo-datacloud.eu Better Software for Better Science. Thank you https://www.indigo-datacloud.eu Better Software for Better Science.