The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/2007-2013) under grant agreement n° RI-261507. Tools for Building and.

Slides:



Advertisements
Similar presentations
Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid.
Advertisements

OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
An Operational Metadata Framework For Searching, Indexing, and Retrieving Distributed GIServices on the Internet By Ming-Hsiang.
UrbanFlood Towards a framework for creation, deployment and reliable operation of distributed, time-critical applications Marian Bubak and Marek Kasztelnik.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Polish Infrastructure for Supporting Computational Science in the European Research Space GridSpace Based Virtual Laboratory for PL-Grid Users Maciej Malawski,
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI Multiscale APPlications.
Course Instructor: Aisha Azeem
The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI CYFRONET Programming.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Basic Concepts The Unified Modeling Language (UML) SYSC System Analysis and Design.
Annual SERC Research Review - Student Presentation, October 5-6, Extending Model Based System Engineering to Utilize 3D Virtual Environments Peter.
SYNAT - the Polish National Research Content Infrastructure Wojtek Sylwestrzak, ICM Tomasz Rosiek, ICM Tomasz Krassowski, ICM Tartu, Estonia June 27, 2012.
February Semantion Privately owned, founded in 2000 First commercial implementation of OASIS ebXML Registry and Repository.
Apache Airavata GSOC Knowledge and Expertise Computational Resources Scientific Instruments Algorithms and Models Archived Data and Metadata Advanced.
Environment for Management of Experiments on the Grid Master of Science Thesis AGH University of Science and Technology, Krakow, Poland Faculty of Electrical.
OpenAlea An OpenSource platform for plant modeling C. Pradal, S. Dufour-Kowalski, F. Boudon, C. Fournier, C. Godin.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
EC Grant Agreement no GEOSS Interoperability for Weather Ocean and Water Enhancing the GEOSS Infrastructure for all the Stakeholders.
The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI Multiscale Applications.
San Diego Supercomputer CenterUniversity of California, San Diego Preservation Research Roadmap Reagan W. Moore San Diego Supercomputer Center
Service-enabling Legacy Applications for the GENIE Project Sofia Panagiotidi, Jeremy Cohen, John Darlington, Marko Krznarić and Eleftheria Katsiri.
New Communities: The Virtual Physiological Human Use Case Stefan Zasada University College London
Recording application executions enriched with domain semantics of computations and data Master of Science Thesis Michał Pelczar Krakow,
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI MAPPER Multiscale Programming.
Simultech 2011, July, 2011, Noordwijkerhout, The Netherlands Component Approach to Distributed Multiscale Simulations Katarzyna Rycerz(1,2), Marian.
Taverna and my Grid Open Workflow for Life Sciences Tom Oinn
In each iteration macro model creates several micro modules, sends data to them and waits for the results. Using Akka Actors for Managing Iterations in.
Cracow Grid Workshop, October 27 – 29, 2003 Institute of Computer Science AGH Design of Distributed Grid Workflow Composition System Marian Bubak, Tomasz.
Issues in (Financial) High Performance Computing John Darlington Director Imperial College Internet Centre Fast Financial Algorithms and Computing 4th.
© 2012 xtUML.org Bill Chown – Mentor Graphics Model Driven Engineering.
DataNet – Flexible Metadata Overlay over File Resources Daniel Harężlak 1, Marek Kasztelnik 1, Maciej Pawlik 1, Bartosz Wilk 1, Marian Bubak 1,2 1 ACC.
A public-private partnership building a multidisciplinary cloud platform for data intensive science Bob Jones Head of openlab IT dept CERN This document.
EC-project number: Universal Grid Client: Grid Operation Invoker Tomasz Bartyński 1, Marian Bubak 1,2 Tomasz Gubała 1,3, Maciej Malawski 1,2 1 Academic.
Framework for MDO Studies Amitay Isaacs Center for Aerospace System Design and Engineering IIT Bombay.
1 The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI Towards Environment.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Lightweight construction of rich scientific applications Daniel Harężlak(1), Marek Kasztelnik(1), Maciej Pawlik(1), Bartosz Wilk(1) and Marian Bubak(1,
Federating PL-Grid Computational Resources with the Atmosphere Cloud Platform Piotr Nowakowski, Marek Kasztelnik, Tomasz Bartyński, Tomasz Gubała, Daniel.
Convert generic gUSE Portal into a science gateway Akos Balasko.
1 ProActive GCM – CCA Interoperability Maciej Malawski, Ludovic Henrio, Matthieu Morel, Francoise Baude, Denis Caromel, Marian Bubak Institute of Computer.
Satisfying Requirements BPF for DRA shall address: –DAQ Environment (Eclipse RCP): Gumtree ISEE workbench integration; –Design Composing and Configurability,
VLDATA Common solution for the (very-)large data challenge EINFRA-1, focus on topics (4) & (5)
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI Requirements for Multiscale.
The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI CYFRONET Hands.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI CYFRONET Multiscale.
The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI EGI and PRACE ecosystem.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI Distributed Multiscale.
Seasonal School Demo and Assigments
Multiscale Applications on European e-Infrastructures
Model Execution Environment for Investigation of Heart Valve Diseases
Design and Manufacturing in a Distributed Computer Environment
DICE - Distributed Computing Environments Team
Recap: introduction to e-science
PROCESS - H2020 Project Work Package WP6 JRA3
Final Review 27th March Final Review 27th March 2019.
Scientific Workflows Lecture 15
GGF10 Workflow Workshop Summary
Presentation transcript:

The Mapper project receives funding from the EC's Seventh Framework Programme (FP7/ ) under grant agreement n° RI Tools for Building and Execution of Multiscale Applications Marian Bubak AGH Krakow PL and University of Amsterdam NL Grzegorz Dyk and Daniel Harezlak ACC Cyfronet AGH Krakow PL on behalf of the MAPPER Consortium Summer School 2012, MTA SZTAKI, Budapest, HU 3 July 2012

2 Academic Computer Centre CYFRONET AGH (1973) 120 employees Academic Computer Centre CYFRONET AGH (1973) 120 employees Department of Computer Science AGH (1980) 800 students, 70 employees Department of Computer Science AGH (1980) 800 students, 70 employees Faculty of Electrical Engineering, Automatics, Computer Science and Electronics (1946) 4000 students, 400 employees Faculty of Electrical Engineering, Automatics, Computer Science and Electronics (1946) 4000 students, 400 employees AGH University of Science and Technology (1919) 15 faculties, students; 4000 employees AGH University of Science and Technology (1919) 15 faculties, students; 4000 employees Other 14 faculties Distributed Computing Environments (DICE) Team About the speakers University of Amsterdam, Institute for Informatics, Computational Science

3 DICE team - Main research interests investigation of methods for building complex scientific collaborative applications and large-scale distributed computing infrastructures elaboration of environments and tools for e-Science development of knowledge-based approach to services, components, and their semantic composition and integration CrossGrid interactive compute- and data-intensive applications K-Wf Grid knowledge-based composition of grid workflow applications CoreGRID problem solving environments, programming models GREDIA grid platform for media and banking applications ViroLab GridSpace virtual laboratory PL-Grid; advanced virtual laboratory gSLM service level management for grid and clouds UrbanFlood Common Information Space for Early Warning Systems MAPPER VPH-Share Collage ? computational strategies, software and services for distributed multiscale simulations Federating cloud resources for development and execution of VPH computationally and data intensive applications Executable Papers; 1st award of Elsevier Competition at ICCS2011

4 Plan Motivation: multiscale applications Multiscale modeling Objectives of the MAPPER project Programming and execution tools Infrastructure for multiscale simulations Demo of tools for an irrigation canals application Summary

5 Multiscale everywhere Natural processes are multiscale – 1 H 2 O molecule – A large collection of H 2 O molecules, forming H-bonds – A fluid called water, and, in solid form, ice.

6 Why multiscale models? There is simply no hope to computationally track complex natural processes at their finest spatio-temporal scales even with the ongoing growth in computational power Minimal requirement:

7 From multiscale to single scale Identify the relevant scales on the scale separation map Design specific models which solve each scale Assess errors of a method Couple the subsystems using an appropriate method temporal scale spatial scale xx L tt T

8 Multiscale computing Inherently hybrid models are best serviced by different types of computing environments Simulations in three dimensions require large scale computing capabilities. Such large scale hybrid models require a distributed computing ecosystem, where parts of the multiscale model are executed on the most appropriate computing resource.  Distributed Multiscale Computing

9 Two paradigms Loosely Coupled – One single scale model provides input to another – Single scale models are executed once – Workflow like Tightly Coupled – Single scale models call each other in an iterative loop – Single scale models may execute many times – Dedicated coupling libraries are needed temporal scale spatial scale xx L tt T temporal scale spatial scale xx L tt T

10 MAPPER Multiscale APPlications on European e-infRastructures University of Amsterdam Max-Planck Gesellschaft zur Foerderung der Wissenschaften E.V. University of Ulster Poznan Supercomputing and Networking Centre Akademia Gorniczo- Hutnicza im. Stanislawa Staszica w Krakowie Ludwig-Maximilians- Universität München University of Geneva Chalmers Tekniska Högskola University College London

11 Motivation: user needs VPHFusion Computional Biology Material Science Engineering Distributed Multiscale Computing Needs

12 Applications 7 applications from 5 scientific domains in a common generic multiscale computing framework virtual physiological human fusion hydrology nano material science computational biology SSMCoupling topology (x)MML Task graph Scheduling

13 MAPPER main objectives Develop computational strategies, software and services for distributed multiscale simulations across disciplines exploiting existing and evolving European e-infrastructure Deploy a computational science infrastructure Deliver high quality components aiming at large-scale, heterogeneous, high performance multi-disciplinary multiscale computing. Advance state-of-the-art in high performance computing on e- infrastructures enable distributed execution of multiscale models across e-Infrastructures,

14 Multiscale application life cycle Steps Register single-scale modules in a memory Design an application Execute application in distributed environment View results and provenence Support Multiscale Description Language MML (orange) Programming and execution tools (blue) Services accessing e- infrastructure (green)

15 Multiscale modeling language Uniformly describes multiscale models and their computational implementation on abstract level Two representations: graphical (gMML), textual (xMML) Includes description of – scale submodules – scaleless submodules (so called mappers and filters) – ports and their operators (for indicating type of connections between modules) – coupling topology – implementation Submodel execution loop in pseudocode f := finit /*initialization*/ t := 0 while not EC(f, t): Oi(f, t) /*intermediate observation*/ f := S(f, t) /*solving step*/ t += theta(f) end Of(f, t) /*final observation*/ Oi Of S finit undefined Corresponding symbols in gMML Example for Instent Restenosis application IC – initial conditions DD- drug diffusion BF – blood flow SMC – smooth muscle cells

16 Programming and execution tools MAPPER Memory (MaMe) a semantics-aware persistence store to record metadata about models and scales Multiscale Application Designer (MAD) visual composition tool transforming high level MML description into executable experiment GridSpace Experiment Workbench (EW) execution and result management on e- infrastructures via interoperability layers Provenance – recording, storing and querying provenance of experiment results

17 Mapper Memory (MaMe) Semantics-aware persistence store Records MML-based metadata about models and scales Supports exchanging and reusing MML metadata for – other MAPPER tools via REST interface – users via dedicated Web interface Ports and theiroperators

18 Application Designer (MAD) (1/2) User friendly visual tool for composing multiscale applications Supports importing application structure from xMML (section A and B) Supports composing multiscale applications in gMML (section B) with additional graphical specific information - layout, color etc. (C) Transforms gMML into xMML Performs MML analysis to identify its loosely and tightly coupled parts Using information from MaMe and GridSpace EW, transforms gMML into executable formats with information needed for actual execution (D) – GridSpace Experiment – MUSCLE connection file (cxa.rb)

19 Application Designer (MAD) (2/2) Supports composing multiscale applications from submodels and mappers registered in MaMe Inport/export coupling topology represented in gMML to/from XMML file Transforms high level MML description into executable experiment for GridSpace Experiment Workbench MAD

20 GridSpace Workbench Supports execution and result management of infrastructure independent experiments Experiment – an application composed of code fragments called snippets, expressed in: – general-purpose scripting programming languages(Bash, Ruby, Perl etc.) – domain-specific languages (CxA in MUSCLE, LAMMPS, Matlab etc) Snippets are evaluated by respective programs called interpreters Executors- responsible for snippets execution on computational resources : servers, clusters, grid – direct SSH on UserInterface (UI) machine – Interoperability layer (QCG, AHE)

21 Provenance Tracing of an experiment start, stop and snippet start/stop events Provenance data stored in RDF database; OPMV-based ontology Input/output files of snippets are copied and snapshots are created – experiment result history Provenance data browser – extensive querying capabilities

22 User environment Application composition: from MML to executable experiment Application composition: from MML to executable experiment Registration of MML metadata: submodules and scales Result and provenance Management Result and provenance Management Execution of experiment using interoperability layer on e-infrastructure Execution of experiment using interoperability layer on e-infrastructure

23 …… MoU signed Taskforce established 1 st evaluation Joined task force between MAPPER, EGI and PRACE Collaborate with EGI and PRACE to introduce new capabilities and policies onto e-Infrastructures Deliver new application tools, problem solving environments and services to meet end-users needs Work closely with various end-users communities (involved directly in MAPPER) to perform distributed multiscale simulations and complex experiments 05 1 st EU review selected two apps on MAPPER e-Infrastructure (EGI and PRACE resources) Tier - 2 Tier - 1 Tier - 0 MAPPER Taskforce E-infrastructure

24 Objective Provide a mutiscale model for the irrigation canal network "La Bourne" Active control and optimal management  History of the main unusual events / perturbations  Several scenarios in order to find the optimal configuration  Real-time-control and optimization of the water exploitation Canal network "La Bourne" millions m 3 of water are distributed to ~9000 clients for a total irrigated area of 10,000 ha, 46 km of length includes several junctions: tunnels, bridges, spillway,...etc. Example: Irrigation canals

25 Canals simulation - submodels LB models for long canal reaches. LB-Shallow Water 1D The water height varies with respect to X and Y. LB-Shallow water 2D LB-Free Surface 3D - Flow around gates/transport of sediments -It requires supercomputing capabilities CxA coupling taken from: Pham van Thang et al. Journal of Computational Physics,229(19) : ,

26 Demos Canals application life cycle (Daniel) Provenance at work (Grzegorz)

27 Canal Application # declare kernels which can be launched in the CxA cxa.add_kernel(’submodel_instance1, ’my.submodelA’) cxa.add_kernel(’submodel_instance2’, ’my.submodelB’) … # configure connection scheme of the CxA cs = cxa.cs # configure unidirectional connection betweenkernels cs.attach ’ submodel_instance1’=> ’submodel_instance2’ do tie ’portA’, ’portB’ ….. end … Tightly coupled Java based canal simulation using MUSCLE Stand-alone canal visualizer and movie maker

28 Canal Application MAD

29 Summary Elaboration of a concept of an environment supporting developers and users of multiscale applications for grid,HPC and cloud infrastructures Design of the formalism for describing structures of multiscale applications Enabling efficient access to e-infrastructures Validation of the formalism against real applications structure by using tools Proof of concept for transforming high level formal description to actual execution using e-infrastructures

30 More about MAPPER

31 And more at … dice.cyfronet.pl