Scientific Workflows in e-Science. Dr Zhiming Zhao, System and Network Engineering, University of Amsterdam. August 31 2006, Elsevier, Amsterdam.

Presentation transcript:

Scientific Workflows in e-Science
Dr Zhiming Zhao
System and Network Engineering, University of Amsterdam
Virtual Laboratory for e-Science
August 31 2006, Elsevier, Amsterdam

Outline
- Background
- Scientific workflow management system
- Virtual Laboratory for e-Science
- Our approach
- Challenges and research lines
- Activities

Problem solving: a typical scenario in scientific research
[Figure. Phases: Define problems, Experiments, Data analysis, Discovery. Steps: Analysis, Hypothesis, Related work, Propose experiments, Define steps, Prototype computing systems, Perform experiments, Data collection, Visualization, Validation, Adjust experiment, Refine hypothesis, Presentation, Dissemination.]
The activities:
- Are iterative, dynamic, and human centered
- Require different levels of resources

Example scenarios
In problem analysis
- Identify domains, search for key problems, find typical methods, and review related work
In scientific experiments: scientific computing & data processing
- Define dependencies between computing and data processing tasks, and schedule their runtime behavior
In data analysis
- Visualize and compare the results of different parameters, keep meaningful configurations, and continue experiments
- Search related work, compare results
In dissemination
- Document experiments, present results, citation, publication
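The experiment scenario above (define dependencies between computing and data processing tasks, then schedule them) can be sketched with Python's standard `graphlib` module. The task names are hypothetical and do not come from the slides. This is a minimal illustration of dependency-ordered scheduling, not the scheduler of any particular workflow system.

```python
from graphlib import TopologicalSorter

# Hypothetical tasks: each key depends on the tasks in its set.
dependencies = {
    "collect_data": set(),
    "simulate": {"collect_data"},
    "filter": {"collect_data"},
    "visualize": {"simulate", "filter"},
}

def schedule(deps):
    """Return batches of tasks; tasks within one batch could run in parallel."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises graphlib.CycleError on circular dependencies
    batches = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted only for stable output
        batches.append(ready)
        sorter.done(*ready)
    return batches

batches = schedule(dependencies)
print(batches)
```

Each batch only becomes available once all of its prerequisites are marked done, which is the runtime behavior a scheduler has to respect.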

Computer support for problem solving
Problem Solving Environment (E. Gallopoulos et al., IEEE CS Eng):
- Organizes different software components/tools
- Allows a user to assemble these tools at a high level of abstraction
- Controls the runtime behavior of experiments
- Examples: MATLAB, Ptolemy, etc.
Traditional PSE: organizes and executes resources locally!
Distributed resources:
- Distributed parallel computing
- Visualization, remote resource invocation
- Distributed data sharing & dissemination
Scientific workflow management systems: a new guise of PSE!

Inside a Scientific Workflow Management System
In our view, a SWMS implements at least:
- A model for describing workflows;
- An engine for executing/managing workflows;
- Different levels of support for a user to compose, execute and control a workflow.
[Figure: A SWMS. Labels: Workflow (based on a certain model), Engine, User support, Resources, Composition, Engine level control, Resource level control.]
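The three parts listed above (a model, an engine, and user support) can be illustrated with a toy sketch. Every class and function name below is an assumption made for illustration. The "model" is just an ordered list of step names, far simpler than the models used by real systems.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

@dataclass
class Workflow:
    """Model: an ordered list of named steps (a deliberately simple model)."""
    steps: List[str] = field(default_factory=list)

class Engine:
    """Engine: executes the steps of a workflow on registered resources."""
    def __init__(self, resources: Dict[str, Callable[[int], int]]):
        self.resources = resources
        self.paused = False

    def run(self, workflow: Workflow, value: int) -> int:
        for step in workflow.steps:
            if self.paused:  # engine-level control: stop before the next step
                break
            value = self.resources[step](value)
        return value

def compose(*steps: str) -> Workflow:
    """User support: compose a workflow from step names."""
    return Workflow(list(steps))

engine = Engine({"double": lambda x: x * 2, "increment": lambda x: x + 1})
result = engine.run(compose("double", "increment"), 5)
print(result)  # 11
```

Setting `engine.paused = True` before calling `run` returns the input unchanged, a crude stand-in for the engine-level control named in the figure.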

Scientific Workflows in e-Science
Workflows vary across different:
- Phases of experiments: design, runtime control, dissemination;
- Abstractions of resources: concrete and abstract;
- Levels of activity detail: computing, data access, search/matching, human activities;
- …
[Figure labels: Experiment processes; Abstract workflows; Executable (concrete) workflows; Workflows for administration, e.g., AAA, and other issues.]

Diversity in SWMS
Taverna:
- Web services based language: Scufl
- FreeFluo: engine
- Graphical visualization of workflow
Kepler:
- Actor, director
- MoML
- Execution models
Triana:
- Components
- Task graph
- Data/control flow
DAGMan:
- Computing tasks
- DAG
Pegasus:
- Based on DAGMan
- VDL
- DAG
…

Virtual Laboratory for e-Science
[Figure: layered architecture]
- Application layer: Dutch telescience, Data intensive science, Medical diagnosis, Bioinformatics ASP, Biodiversity, Food Informatics
- Generic e-Science framework layer
- Grid layer

Mission
Effectively reuse existing workflow management systems, and provide a generic e-Science framework for different application domains.
A generic framework can:
- Improve the reuse of workflow components and of workflows across different experiments
- Reduce the learning cost for different systems
- Allow application users to work in a consistent environment when the underlying infrastructure changes

Previous work: VLAM-G environment
VLAM-G
- A Grid-enabled PSE
- Data intensive applications
- Visual interface
- Two levels of workflow support
- Human interaction support

Workflow in VLAM-G

VLAM-G PFT/Study
- Experiment Topology: graphical representation of self-contained data processing modules attached to each other in a workflow.
- Process-Flow Template (PFT): graphical representation of data elements and processing steps in an experimental procedure.
- Study: descriptions of experimental steps represented as an instance of a PFT with references to experiment topologies.
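The relationship between the three concepts above can be sketched as plain data structures. This is only an illustration under assumptions: the field names, the `attach` helper, and the example step names are hypothetical, and nothing here reflects the actual VLAM-G data model.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

@dataclass
class ExperimentTopology:
    """Data processing modules and the connections between them."""
    name: str
    modules: List[str]
    connections: List[Tuple[str, str]]  # (source module, target module)

@dataclass
class ProcessFlowTemplate:
    """Named steps of an experimental procedure."""
    name: str
    steps: List[str]

@dataclass
class Study:
    """An instance of a PFT; steps may reference experiment topologies."""
    template: ProcessFlowTemplate
    topologies: Dict[str, ExperimentTopology] = field(default_factory=dict)

    def attach(self, step: str, topology: ExperimentTopology) -> None:
        if step not in self.template.steps:
            raise ValueError(f"unknown step: {step}")
        self.topologies[step] = topology

    def unbound_steps(self) -> List[str]:
        """Steps of the template that reference no topology yet."""
        return [s for s in self.template.steps if s not in self.topologies]

# Hypothetical procedure with three steps; only "analyze" gets a topology.
pft = ProcessFlowTemplate("example_procedure", ["prepare_sample", "measure", "analyze"])
study = Study(pft)
study.attach("analyze", ExperimentTopology("analysis", ["reader", "filter"], [("reader", "filter")]))
print(study.unbound_steps())  # ['prepare_sample', 'measure']
```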

Lessons learned
How to introduce a new PSE to a domain scientist?
- Because it has a beautiful architecture?
- Or because it allows scientists to keep their current work style?
How to use existing work?
- Do scientists need one system or more options?
How to include the user in the computing loop?
- Dynamic workflows and human-in-the-loop computing are important.
Z. Zhao et al., “Scientific workflow management: between generality and applicability”, QSIC 2005, Australia

Workflow support in VL-e
Recommend suitable workflow systems for different application domains:
- Analyze typical application use cases
- Define small projects with different application domains
- Review existing workflow systems
- Recommend four workflow systems: Triana, Taverna, Kepler, and VLAM-G
In the long term:
- Extend VLAM-G and develop our own generic workflow framework

A workflow bus paradigm
[Figure labels: Workflow; Sub workflow 1, Sub workflow 2, Sub workflow 3; Workflow bus; Taverna, Kepler, Triana.]
A workflow bus is a special workflow system for executing meta workflows, in which sub workflows are executed by different engines.
Z. Zhao et al., “Workflow bus for e-Science”, in IEEE Int’l Conf. e-Science 2006, Amsterdam
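The idea of a meta workflow whose sub workflows run on different engines can be sketched as a simple dispatcher. The slides do not describe the bus interface, so the `WorkflowBus` class, the adapter signature, and the stub engines below are all assumptions. The stubs only record which engine handled which sub workflow and do not call Taverna, Kepler, or Triana.

```python
from typing import Callable, Dict, List, Tuple

# An engine adapter takes a sub workflow name and input data and returns output.
EngineAdapter = Callable[[str, str], str]

def make_stub_engine(engine_name: str) -> EngineAdapter:
    """Stand-in for a real engine; it only records which engine ran what."""
    def run(sub_workflow: str, data: str) -> str:
        return f"{data}->{engine_name}:{sub_workflow}"
    return run

class WorkflowBus:
    """Runs a meta workflow by handing each sub workflow to its engine."""
    def __init__(self) -> None:
        self.engines: Dict[str, EngineAdapter] = {}

    def register(self, name: str, adapter: EngineAdapter) -> None:
        self.engines[name] = adapter

    def run(self, meta_workflow: List[Tuple[str, str]], data: str) -> str:
        # The meta workflow here is a plain sequence of (engine, sub workflow).
        for engine_name, sub_workflow in meta_workflow:
            data = self.engines[engine_name](sub_workflow, data)
        return data

bus = WorkflowBus()
for name in ("taverna", "kepler", "triana"):
    bus.register(name, make_stub_engine(name))

# Hypothetical assignment of sub workflows to engines.
meta = [("taverna", "sub1"), ("kepler", "sub2"), ("triana", "sub3")]
output = bus.run(meta, "input")
print(output)  # input->taverna:sub1->kepler:sub2->triana:sub3
```

A sequential meta workflow keeps the sketch short. A real bus would also need data conversion between engines and support for non-linear meta workflows.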

Applications of workflow bus
Use case 1:
- A user has a workflow in Taverna
- Some functionality is missing in Taverna but can be provided by Triana
- The user can develop the workflow in the two systems and run it via the workflow bus
Use case 2:
- A user wants to execute a Taverna or Triana workflow in multiple instances with different input data
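Use case 2 amounts to running the same workflow once per input. A minimal sketch with Python's standard `concurrent.futures` module follows. `run_workflow` is a hypothetical stand-in for submitting one workflow instance to an engine and is not part of any real Taverna or Triana API.

```python
from concurrent.futures import ThreadPoolExecutor

def run_workflow(input_data: int) -> int:
    """Hypothetical stand-in for submitting one workflow instance to an engine."""
    return input_data * input_data

def run_instances(inputs, max_workers=4):
    """Run one workflow instance per input; results keep the input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_workflow, inputs))

results = run_instances([1, 2, 3, 4])
print(results)  # [1, 4, 9, 16]
```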

Ongoing research
- Web services in data intensive applications
- Execution models for Grid workflows
- Including PSE in scientific workflows
- Industrial standards in scientific workflows

Relevance between our research and Elsevier’s work
Same context: the entire lifecycle of e-Science experiments
Different focuses:
- We focus on the runtime behavior of scientific experiments, e.g., Grid computing, data/computing intensive applications, and scheduling of computing tasks
- Elsevier highlights data search and integration on well-structured databases, research preparation, and literature search and management

Relevance between our research and Elsevier’s work (cont.)
Different characteristics in workflows:
- In our workflows, processing and managing dynamic runtime data is the key pattern
- In Elsevier workflows, storing, replicating, accessing, matching and integrating static data might be more common
Facing similar challenges:
- Semantics-based data search and integration
- Workflow provenance
- Collaborative interaction (workflow development, resource sharing, knowledge transfer)
- Modeling user profiles

Activities
Int’l workshop on “Workflow systems in e-Science”, organized by Zhiming Zhao and Adam Belloum, in the context of ICCS06, Reading University, May 28
- Proceedings are in LNCS, Springer Verlag.
- A special issue will be published in Scientific Programming Journal.
Workshop on “Scientific workflows and industrial workflow standards in e-Science”, organized by Adam Belloum and Zhiming Zhao, in the context of the IEEE e-Science and Grid computing conference in Amsterdam, December
- Pegasus, Dr. Ewa Deelman (Department of Computer Science, University of Southern California)
- BPEL, Dr. Dieter König (IBM Research, Germany Development Laboratory)
- Kepler, Dr. Bertram Ludäscher (Department of Computer Science, University of California, Davis)
- Taverna, Prof. Peter Rice (European Bioinformatics Institute)
- WS and Semantic issues, Dr. Steve Ross-Talbot (CEO and co-founder of Pi4 Technologies)
- Triana, Dr. Ian J. Taylor (Department of Computer Science, Cardiff University)

References
1. Virtual Laboratory for e-Science:
2. Network and System Engineering, Faculty of Science, University of Amsterdam:
3. Z. Zhao, A. Belloum, H. Yakali, P.M.A. Sloot and L.O. Hertzberger: Dynamic Workflow in a Grid Enabled Problem Solving Environment, in Proceedings of the 5th International Conference on Computer and Information Technology (CIT2005), pp IEEE Computer Society Press, Shanghai, China, September
4. Z. Zhao, A. Belloum, A. Wibisono, F. Terpstra, P.T. de Boer, P.M.A. Sloot and L.O. Hertzberger: Scientific workflow management: between generality and applicability, in Proceedings of the International Workshop on Grid and Peer-to-Peer based Workflows in conjunction with the 5th International Conference on Quality Software, pp IEEE Computer Society Press, Melbourne, Australia, September 19th-21st
5. Z. Zhao, A. Belloum, P.M.A. Sloot and L.O. Hertzberger: Agent technology and scientific workflow management in an e-Science environment, in Proceedings of the 17th IEEE International Conference on Tools with Artificial Intelligence (ICTAI05), pp IEEE Computer Society Press, Hong Kong, China, November 14th-16th 2005.