ICOS on-demand atmospheric transport computation A use case for interoperability of EGI and EUDAT services Ute Karstens, André Bjärby, Oleg Mirzov, Roger.

Slides:



Advertisements
Similar presentations
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Advertisements

Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
TPAC Digital Library Talk Overview Presenter:Glenn Hyland Tasmanian Partnership for Advanced Computing & Australian Antarctic Division Outline: TPAC Overview.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
Quick Introduction to NorduGrid Oxana Smirnova 4 th Nordic LHC Workshop November 23, 2001, Stockholm.
Alastair Duncan STFC Pre Coffee talk STFC July 2014 The Trials and Tribulations and ultimate success of parallelisation using Hadoop within the SCAPE project.
European Life Sciences Infrastructure for Biological Information META-pipe WP6 Kick-off Lars Ailo Bongo, ELIXIR-NO.
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Processing services.
BOINC: Progress and Plans David P. Anderson Space Sciences Lab University of California, Berkeley BOINC:FAST August 2013.
Near Real-Time Verification At The Forecast Systems Laboratory: An Operational Perspective Michael P. Kay (CIRES/FSL/NOAA) Jennifer L. Mahoney (FSL/NOAA)
European Grid Initiative Data Services and Solutions Part 2: Data in the cloud Enol Fernández Data Services.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT EGI interoperability.
CLARIN EUDAT2020 uptake plan Dieter Van Uytvanck CLARIN ERIC EUDAT User Forum, Rome.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EPOS and EUDAT.
ICOS To collect high-quality observational data relevant to the greenhouse gas budget of Europe To make the ICOS data freely available to all interested.
European Life Sciences Infrastructure for Biological Information ELIXIR Cloud Roadmap Chairs: Steven Newhouse, EMBL-EBI & Mirek Ruda,
Get Data to Computation eudat.eu/b2stage B2STAGE How to shift large amounts of data Version 4 February 2016 This work is licensed under the.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Collaboration.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Support to scientific.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No TURBASE-DNS: A.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Herbadrop.
Scientific Data Processing Portal and Heterogeneous Computing Resources at NRC “Kurchatov Institute” V. Aulov, D. Drizhuk, A. Klimentov, R. Mashinistov,
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EGI - EUDAT interoperability.
EGI… …is a Federation of over 300 computing and data centres spread across 56 countries in Europe and worldwide …delivers advanced computing.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No LTER- Europe &
Petr Škoda, Jakub Koza Astronomical Institute Academy of Sciences
Onedata Eventually Consistent Virtual Filesystem for Multi-Cloud Infrastructures Michał Orzechowski (CYFRONET AGH)
Bob Jones EGEE Technical Director
ICOS and the GEO-C Initiative An overview Dr
Accessing the VI-SEEM infrastructure
Diego Scardaci EGI Technical Outreach Expert
Tokamak data mirror for JET and MAST Moving towards an open data repository for European nuclear fusion research.
EUDAT’s engagement with the Earth Sciences
Volunteer Computing for Science Gateways
Alessandro Spinuso, Andreas Rietbrock, Andrè Gemuend,
Design your e-infrastructure. egi
KER - Open Data Platform
Ideas for an ICOS Competence Centre Implementation of an on-demand computation service Ute Karstens, André Bjärby, Oleg Mirzov, Roger Groth, Mitch Selander,
Recap: introduction to e-science
EGI-Engage Engaging the EGI Community towards an Open Science Commons
Simulation use cases for T2 in ALICE
An easier path? Customizing a “Global Solution”
PROCESS - H2020 Project Work Package WP6 JRA3
Solutions for federated services management EGI
Climate Data Analytics in a Big Data world
EISCAT-3D: a data centric design for extreme scale computing
DATA SPHINX & EUDAT Collaboration
Haiyan Meng and Douglas Thain
Thursday pilot session: 7-minutes
Case Study: Algae Bloom in a Water Reservoir
EGI Webinar - Introduction -
NFFA Europe.
LifeWatch Cloud Computing Workshop
An EUDAT-based FAIR Data Approach for Data Interoperability
TeraScale Supernova Initiative
Break out group coordinator:
Copernicus Data in the EGI infrastructure
DATATURB Direct simulation data of turbulent flows
MMG: from proof-of-concept to production services at scale
Technical Outreach Expert
Joining the EOSC Ecosystem
Expand portfolio of EGI services
Maria Teresa Capria December 15, 2009 Paris – VOPlaneto 2009
EOSC-hub Contribution to the EOSC WGs
Photon & Neutron working meeting
Presentation transcript:

ICOS on-demand atmospheric transport computation A use case for interoperability of EGI and EUDAT services Ute Karstens, André Bjärby, Oleg Mirzov, Roger Groth, Mitch Selander, Maggie Hellström, Alex Vermeulen ICOS Carbon Portal @ Lund University Diego Scardaci, Matthew Viljoen EGI Foundation Peter Gille, Michaela Barth EUDAT and PDC Center for High Performance Computing, KTH Royal Institute of Technology, Stockholm

Integrated Carbon Observation System “A pan-European research infrastructure for quantifying and understanding the greenhouse gas balance of the European continent” Collect high-quality observational data relevant to the greenhouse gas budget of Europe Make the ICOS data freely available to all interested parties Promote the use of the ICOS data for further scientific study Support modelling activities of the greenhouse gas fluxes in time and space Support verification of the effectiveness of policies aiming to reduce greenhouse gas emissions

Footprint tool for atmospheric sites Web-based service at ICOS Carbon Portal On-demand computation and visualization of footprints and GHG concentrations at atmospheric measurement stations Based on the Lagrangian atmospheric transport model STILT Use case for testing interoperability between EGI and EUDAT services in WP7/Task 7.2 of EUDAT2020 Application examples: Analysis of the sensitivity of GHG concentration signals at potential and existing ICOS atmospheric measurement stations to GHG emissions and fluxes Evaluation of measurement strategies Network design studies

STILT atmospheric transport model calculations Atmospheric observations Emissions Meteorological driver fields ≈ 1 GB ≈ 0.5-1 TB ≈ 2-3 TB ≈ 1-2 TB per year Station Footprints GHG concentrations Federated Cloud STILT Lagrangian transport model ≈ 300 CPUs per footprint => 750 CPUh/station/year ICOS Carbon Portal Atmospheric observations Prior fluxes Emissions Meteorological driver fields EUDAT B2SAFE ≈ 1 GB ≈ 0.5-1 TB ≈ 2-3 TB ≈ 1-2 TB per year Station Footprints GHG concentrations EGI Federated Cloud STILT Lagrangian transport model ≈ 670 CPUs per footprint => 1700 CPUh per station per year ICOS Carbon Portal EUDAT B2SAFE ≈ 1-2 TB per year Station Footprints GHG concentrations Atmospheric observations Prior fluxes Emissions Meteorological driver fields EUDAT B2SAFE ≈ 1 GB ≈ 0.5-1 TB ≈ 2-3 TB EUDAT B2SAFE ≈ 1-2 TB per year Station Footprints GHG concentrations Atmospheric observations Prior fluxes Emissions Meteorological driver fields EUDAT B2SAFE ≈ 1 GB ≈ 0.5-1 TB ≈ 2-3 TB EUDAT B2SAFE ≈ 1-2 TB per year Station Footprints GHG concentrations EGI Federated Cloud STILT Lagrangian transport model ≈ 670 CPUs per footprint => 1700 CPUh per station per year ICOS Carbon Portal Atmospheric observations Prior fluxes Emissions Meteorological driver fields EUDAT B2SAFE ≈ 1 GB ≈ 0.5-1 TB ≈ 2-3 TB EUDAT B2SAFE ≈ 1-2 TB per year Station Footprints GHG concentrations EGI Federated Cloud STILT Lagrangian transport model ≈ 670 CPUs per footprint => 1700 CPUh per station per year ICOS Carbon Portal Atmospheric observations Prior fluxes Emissions Meteorological driver fields EUDAT B2SAFE ≈ 1 GB ≈ 0.5-1 TB ≈ 2-3 TB EGI Federated Cloud STILT Lagrangian transport model ≈ 670 CPUs per footprint => 1700 CPUh per station per year ICOS Carbon Portal EGI Federated Cloud STILT Lagrangian transport model ≈ 670 CPUs per footprint => 1700 CPUh per station per year ICOS Carbon Portal

Footprint tool workflow ICOS CP account User 1 VM Web service VM Worker AAI AAI User 2 Controller Model VM NFS Model Output Particle Location Footprints GHG conc. datahub.egi.eu OneData PDC/KTH VM ICOS Data Model Input Meteo Model Output Model Input

Footprint tool workflow ICOS CP account User 1 VM Web service VM Worker AAI VM Worker AAI User 2 Controller Model Model VM NFS Model Output Particle Location Footprints GHG conc. datahub.egi.eu OneData PDC/KTH VM ICOS Data Model Input Meteo Model Output Model Input

… Footprint tool workflow User 1 User 2 User 3 User 4 NFS VM Model Input Meteo datahub.egi.eu OneData ICOS CP account User 1 User 2 AAI Web service Controller ICOS Data Model Output PDC/KTH Worker Model NFS Particle Location Footprints GHG conc. User 3 User 4 … on demand

Components of the workflow Generic simulation Linux VMs in EGI Federated Cloud Scala for backend development Akka Cluster for orchestration Docker-based version of model (or data processing tool) Long-term storage to archive model results Intermediate storage close to the computation STILT-specific components Split model runs into small jobs to allow distribution over many cores/VMs to efficiently serve multiple users Handling of large numbers of small files (model output re-used as input) Prototype: 4 VMs of medium size (8 CPUs, 16-32 GB RAM) + 4 TB storage General framework will be applied to other types of model simulations and data processing tasks

Storage and data services Long-term storage for ICOS measurement data and elaborated products (e.g. STILT results) at B2SAFE instance at PDC/KTH (replicating to another B2SAFE node) using B2STAGE for data transfer (currently using iCommands/GridFTP) waiting for B2STAGE HTTP API for operational implementation at ICOS CP waiting for support of metadata handling for B2SAFE (GraphDB) Intermediate storage for STILT model input and output requires handling of large numbers (∼1 Mio) of small files (1-5 MB) output might be re-used as input in further model runs Network File System on dedicated VM to serve input/output data EGI DataHub for storage of data from multiple providers, e.g. meteorological 4D arrays (in future) OneData software solution already tested

Thank you ! More information about ICOS: www.icos-ri.eu and the Carbon Portal: www.icos-cp.eu Contact: ute.karstens@nateko.lu.se