Workflow Management System based on Service Oriented Components for Grid Applications. Ju-Ho Choi Korea University, Seoul, Rep. of Korea.

Slides:



Advertisements
Similar presentations
Three types of remote process invocation
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Legacy code support for commercial production Grids G.Terstyanszky, T. Kiss, T. Delaitre, S. Winter School of Informatics, University.
Database System Concepts and Architecture
GLOBUS PLUG-IN FOR WINGS WOKFLOW ENGINE Elizabeth Martí ITACA Universidad Politécnica de Valencia
A Dynamic World, what can Grids do for Multi-Core computing? Daniel Goodman, Anne Trefethen and Douglas Creager
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
Distributed components
Parallel Programming Models and Paradigms
1 Dr. Frederica Darema Senior Science and Technology Advisor NSF Future Parallel Computing Systems – what to remember from the past RAMP Workshop FCRC.
UvA, Amsterdam June 2007WS-VLAM Introduction presentation WS-VLAM Requirements list known as the WS-VLAM wishlist System and Network Engineering group.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
Workload Management Massimo Sgaravatto INFN Padova.
The middleware that makes real time integration a reality.
Chapter 6: An Introduction to System Software and Virtual Machines
Grids and Globus at BNL Presented by John Scott Leita.
Community Manager A Dynamic Collaboration Solution on Heterogeneous Environment Hyeonsook Kim  2006 CUS. All rights reserved.
Copyright Arshi Khan1 System Programming Instructor Arshi Khan.
Asst.Prof.Dr.Ahmet Ünveren SPRING Computer Engineering Department Asst.Prof.Dr.Ahmet Ünveren SPRING Computer Engineering Department.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing.
SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking.
CONDOR DAGMan and Pegasus Selim Kalayci Florida International University 07/28/2009 Note: Slides are compiled from various TeraGrid Documentations.
Zhonghua Qu and Ovidiu Daescu December 24, 2009 University of Texas at Dallas.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
DISTRIBUTED COMPUTING
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
INVITATION TO COMPUTER SCIENCE, JAVA VERSION, THIRD EDITION Chapter 6: An Introduction to System Software and Virtual Machines.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
Selected Topics in Software Engineering - Distributed Software Development.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Performance evaluation of component-based software systems Seminar of Component Engineering course Rofideh hadighi 7 Jan 2010.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
GSFL: A Workflow Framework for Grid Services Sriram Krishnan Patrick Wagstrom Gregor von Laszewski.
OPERATING SYSTEM SUPPORT DISTRIBUTED SYSTEMS CHAPTER 6 Lawrence Heyman July 8, 2002.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
6/12/99 Java GrandeT. Haupt1 The Gateway System This project is a collaborative effort between Northeast Parallel Architectures Center (NPAC) Ohio Supercomputer.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Grid programming with components: an advanced COMPonent platform for an effective invisible grid © 2006 GridCOMP Grids Programming with components. An.
WALSAIP Portal Automated Composition of Signal Processing Operators Mariana Mendoza Botero.
CGW 04, Stripped replication for the grid environment as a web service1 Stripped replication for the Grid environment as a web service Marek Ciglan, Ondrej.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Introduction to Grid Computing and its components.
On Using BPEL Extensibility to Implement OGSI and WSRF Grid Workflows Aleksander Slomiski Presented by Onyeka Ezenwoye CIS Advanced Topics in Software.
An approach to Web services Management in OGSA environment By Shobhana Kirtane.
From Use Cases to Implementation 1. Structural and Behavioral Aspects of Collaborations  Two aspects of Collaborations Structural – specifies the static.
Background Computer System Architectures Computer System Software.
MSF and MAGE: e-Science Middleware for BT Applications Sep 21, 2006 Jaeyoung Choi Soongsil University, Seoul Korea
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
From Use Cases to Implementation 1. Mapping Requirements Directly to Design and Code  For many, if not most, of our requirements it is relatively easy.
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
Two New UML Diagram Types Component Diagram Deployment Diagram.
Enabling Grids for E-sciencE Agreement-based Workload and Resource Management Tiziana Ferrari, Elisabetta Ronchieri Mar 30-31, 2006.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
CMPE419 Mobile Application Development
CMPE419 Mobile Application Development
Presentation transcript:

Workflow Management System based on Service Oriented Components for Grid Applications. Ju-Ho Choi Korea University, Seoul, Rep. of Korea

Distributed Supercomputing Lab in Korea University Contents Introduction Previous Work Requirements Grid Workflow System Implementation Conclusion

Distributed Supercomputing Lab in Korea University Introduction Grid Workflow init Input Data Divide job job1 job2 Output Proc.. job3 job4 job5 job6 display control Coordinated execution of multiple applications Abstract activity Control/Data flow Composition of activities Unreliable, Distributed, heterogeneous, huge-scaled computational and data resources We need a PSE which provides a transparent view in the aspect of Grid user Requirements ?

Distributed Supercomputing Lab in Korea University Previous Work – Grid/PSE Grid Very Complicate !! Distributed, heterogeneous, unreliable Various hardware, software, different administration … Grid middleware : Globus, OGSA, … Distributed resource management, information service, data management, single sign-on, Unfamiliar User environment Now being Changing & Improved Platform : Virtual homogeneous environment Need a familiar, easy, and, transparent PSE for user User Environment : MPI, NetSolve, Nimrod/G, … Grid Portal, GridPort, CoG, … User domain application : Cactus, SCIRun, … Simple job requesttext-based output Hotpage, ASC portal … Simple job, Not easy for complex job Specific domain tool

Distributed Supercomputing Lab in Korea University Previous Work – Workflow Model Workflow patterns in Business Workflow  Basic Patterns AND-Split/Merge, OR-Split/Merge, XOR-Split/Merge  Other Patterns N out of M join, deferred choice, arbitrary cycle Previous Grid Workflow System  GWSL, Triana, GridAnt, myGrid, Petrinet-based Grid Workflow (FhRG)  Execution of multiple App. on multiple resources  Composing of the existing applications  Providing Graphic User Interfaces  Simple Link element for control flow Common patterns : AND, XOR, LOOP arbitrary cycle (Petrinet-based)  Data link Node for File transfer

Distributed Supercomputing Lab in Korea University Requirements Transparency with abstract workflow model  Complicate Grid infrastructure to user  Various Grid middleware, their change and standardization  Execution of the existing applications and making a new application without suffering from the details of Grid  Parallel/distributed programming, data transfer, fault handling Advanced distributed/parallel workflow pattern  Previous patterns are too simple to describe Grid applications Insufficient expressiveness of simple link elements  New patterns to support various parallelism for easy parallel and distributed programming  Integration of Control and Data flow

Distributed Supercomputing Lab in Korea University Requirements Fault handling method  Faults from unreliable Grid resources  Three categories of fault Hardware fault : network or host problems Software fault : illegal access, user defined error, … Resource utilization fault  Allocation or reservation problem without hardware fault caused by dynamic property of Grid Efficient execution of workflow  Static/Dynamic information and Prediction CPU speed, memory size, file system size, … Load, available memory, usable bandwidth, …  Current Insufficient and unsatisfying dynamic information  semi-automatically or manually adapting dynamic Grid Fault handling, dynamic reallocation, runtime modification

Distributed Supercomputing Lab in Korea University Grid Workflow : Architecture Grid Workflow Controller Realizer Generallizer Resource Mapper A workflow instance A workflow instance Workflow instance Activity Controller GUI Editing & running workflows Information Service Resource management File transfer Abstract workflow Activity Listener Allocator Monitor Data Transfer Manager Activity Activity Repository GWDL (Grid Workflow Description Language) Parser Dynamic Manager Workflow Engine

Distributed Supercomputing Lab in Korea University Grid Workflow : Abstract Workflow EUserDefinedB AC EHardwareCrash EUserDefinedA B2B2 alloc T Data 1 Data 3 B S S Data 2 A start Data 1 A end B start Data 3 B endC endC start (a) (b) Grid Workflow  Activity  Control/ Data flow  Resource  exception Activity  Function  Input/ output data  Exception

Distributed Supercomputing Lab in Korea University GWDL Grid Workflow Description Language Data Model Activity Model Flow Model Resource Model Control flow Data flow Exception handling Predefined Activity allocation notifytimer Nodes Activity Node Group Node Resource specification Execution environment Exception type Data type Life cycle NotAvailable Allocation UserDefined Memory Storage NotExistData Function Loose-Coupled Tight-Coupled User defined Activity Input/output Data Exception Fail Time

Distributed Supercomputing Lab in Korea University Grid Workflow : Exceptions Predefined Exceptions  Efail : unknown Fail  ENotAvaliable : hardware fault  EAllocation : allocation problem  ETimeExpired : timer  ENoData : no usable input or output data  EMemory : insufficient memory  EStorage : insufficient storage User defined Exceptions  For notification of software fault

Distributed Supercomputing Lab in Korea University Grid Workflow : Patterns (a) Sequence (b) XOR-Split (c) XOR-Join (d) Loop (e) Multi-Split (f) Multi-Join (g) AND-Split (h) AND-Join (i) AND-Loop symbol (i') AND-Loop description (j) Queue (k) Wait (l) Node copy (m) Explicit termination (n) Pause/Continue (o) Group node

Distributed Supercomputing Lab in Korea University Workflow Model: Patterns Composition of basic patterns  Pipeline W Q n C n C m Q n

Distributed Supercomputing Lab in Korea University Workflow Model: Patterns Composition of basic patterns  Static/Dynamic Data distribution Static: AND-Split and AND-Join Dynamic: AND-Loop, Multi-Split and Queue M Q A … A … staticdynamic

Distributed Supercomputing Lab in Korea University Grid Workflow : Workflow Execution Creating Concrete workflow  Mapping activities to resources Statically or dynamically Automatically or manually  Activity parameters LC (Loose-coupled) type  For the existing applications  Executable file, addresses for input/output file transfer TC (Tight-coupled) type  For making new application  Executable file, identification number  Addresses for Input/output data: file, variables  Address for communication with workflow engine Runtime Modification  Pause and continue a workflow instance  To handle fault, unexpected situation, or manual reallocation.

Distributed Supercomputing Lab in Korea University Implementation Our prototype  Workflow engine and graphic workflow editor : Java XML parsing : Apache Xerces  TC type activity : C++ Making TC type activity template codes: Apache Xalan  Fault detection Throwing user defined exception State notifications of TC type activity Polling with Java CoG’s GRAM.ping() function or Timeout  Grid middleware Globus Toolkit 3.2 : GRAM, MDS, GridFTP  Client library of Grid Java CoG (Commodity of Grid)

Distributed Supercomputing Lab in Korea University Conclusion and Future work Analysis of Requirements for Grid Applications We proposed a new Grid workflow system  Abstract workflow model  Advanced workflow pattern for parallelism  Exception for handling fault on Grid  Run time modification Future works  Scalability experiment of our Grid workflow system  Appling our system to various Grid applications

Distributed Supercomputing Lab in Korea University Thanks for your attention