CCGrid 2003, Tokyo, Japan GridFlow: Workflow Management for Grid Computing Junwei Cao ( 曹军威 ) C&C Research Labs, NEC Europe Ltd., Germany Stephen A. Jarvis.

Slides:



Advertisements
Similar presentations
The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
Advertisements

High Performance Computing Course Notes Grid Computing.
The ADAMANT Project: Linking Scientific Workflows and Networks “Adaptive Data-Aware Multi-Domain Application Network Topologies” Ilia Baldine, Charles.
Workshop on HPC in India Grid Middleware for High Performance Computing Sathish Vadhiyar Grid Applications Research Lab (GARL) Supercomputer Education.
Performance-responsive Middleware for Grid Computing Dr Stephen Jarvis High Performance Systems Group University of Warwick, UK High Performance Systems.
Evaluation of Advertising Effectiveness Using Agent-Based Modeling and Simulation Junwei Cao Department of Computer Science University of Warwick.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Introduction CS 524 – High-Performance Computing.
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
Computer Science Department 1 Load Balancing and Grid Computing David Finkel Computer Science Department Worcester Polytechnic Institute.
Parallel Programming on the SGI Origin2000 With thanks to Moshe Goldberg, TCC and Igor Zacharov SGI Taub Computer Center Technion Mar 2005 Anne Weill-Zrahia.
Parallel Programming Models and Paradigms
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
1© Copyright 2015 EMC Corporation. All rights reserved. SDN INTELLIGENT NETWORKING IMPLICATIONS FOR END-TO-END INTERNETWORKING Simone Mangiante Senior.
CCGrid 2014 Panel: Architect Cloud and HPC for Big Data Era Rajkumar Buyya Cloud Computing and Distributed Systems (CLOUDS) Lab Dept. of Computer Science.
Introduction to Grid Computing Ann Chervenak Carl Kesselman And the members of the Globus Team.
Applied Architectures Eunyoung Hwang. Objectives How principles have been used to solve challenging problems How architecture can be used to explain and.
Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)
Hungarian Supercomputing GRID
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Self-Organizing Agents for Grid Load Balancing Junwei Cao, Ph.D. Research Scientist Center for Space Research Massachusetts Institute of Technology Cambridge,
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
CCA Common Component Architecture Manoj Krishnan Pacific Northwest National Laboratory MCMD Programming and Implementation Issues.
Transparent Grid Enablement Using Transparent Shaping and GRID superscalar I. Description and Motivation II. Background Information: Transparent Shaping.
What are the main differences and commonalities between the IS and DA systems? How information is transferred between tasks: (i) IS it may be often achieved.
Peter Bajcsy, Rob Kooper, Luigi Marini, Barbara Minsker and Jim Myers National Center for Supercomputing Applications (NCSA) University of Illinois at.
Jarek Nabrzyski, Ariel Oleksiak Comparison of Grid Middleware in European Grid Projects Jarek Nabrzyski, Ariel Oleksiak Poznań Supercomputing and Networking.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
The Globus Project: A Status Report Ian Foster Carl Kesselman
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
Interoperability Grids, Clouds and Collaboratories Ruth Pordes Executive Director Open Science Grid, Fermilab.
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
DAME: A Distributed Diagnostics Environment for Maintenance Duncan Russell University of Leeds.
Tools for collaboration How to share your duck tales…
The System and Software Development Process Instructor: Dr. Hany H. Ammar Dept. of Computer Science and Electrical Engineering, WVU.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
Department of Electronic Engineering Challenges & Proposals INFSO Information Day e-Infrastructure Grid Initiatives 26/27 May.
Authors: Ronnie Julio Cole David
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
ISERVOGrid Architecture Working Group Brisbane Australia June Geoffrey Fox Community Grids Lab Indiana University
May 2004NTUA1 National Technical University of Athens EGEE Project 3 rd Parties Kick off Meeting, Athens, May 27-28, 2004 Dr. Costis Christogiannis Telecommunications.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
GridLab Resource Management System (GRMS) Jarek Nabrzyski GridLab Project Coordinator Poznań Supercomputing and.
High-Speed Policy-Based Packet Forwarding Using Efficient Multi-dimensional Range Matching Lakshman and Stiliadis ACM SIGCOMM 98.
7. Grid Computing Systems and Resource Management
An Overview of Scientific Workflows: Domains & Applications Laboratoire Lorrain de Recherche en Informatique et ses Applications Presented by Khaled Gaaloul.
GCC 2003 Shanghai China Implementation of Grid-enabled Medical Simulation Applications Using Workflow Techniques Junwei Cao Jochen Fingberg Guntram Berti.
Toward a common data and command representation for quantum chemistry Malcolm Atkinson Director 5 th April 2004.
Performance-responsive Scheduling for Grid Computing Dr Stephen Jarvis High Performance Systems Group University of Warwick, UK High Performance Systems.
Agent-Based Grid Load-Balancing Daniel P. Spooner University of Warwick, UK Junwei Cao NEC Europe Ltd., Germany.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 April 28, 2005 Session 29.
Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
2004 Queue Scheduling and Advance Reservations with COSY Junwei Cao Falk Zimmermann C&C Research Laboratories NEC Europe Ltd.
IPDPS 2003, Nice, France Agent-Based Grid Load Balancing Using Performance-Driven Task Scheduling Junwei Cao (C&C Research Labs, NEC Europe Ltd., Germany)
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Agent-Based Grid Load-Balancing
Junwei Cao Darren J. Kerbyson Graham R. Nudd
University of Technology
Department of Computer Science University of Warwick
Agent-based Resource Management for Grid Computing
Scientific Workflows Lecture 15
Presentation transcript:

CCGrid 2003, Tokyo, Japan GridFlow: Workflow Management for Grid Computing Junwei Cao ( 曹军威 ) C&C Research Labs, NEC Europe Ltd., Germany Stephen A. Jarvis and Graham R. Nudd Dept. of Computer Science, Univ. of Warwick, UK Subhash Saini NASA Ames Research Center, USA

CCGrid 2003, Tokyo, Japan Outline Background – Grid Workflow System Architecture GridFlow User Portal Global Grid Workflow Management Local Grid Sub-workflow Scheduling Fuzzy Timing Techniques Summary Ongoing and Future Work

CCGrid 2003, Tokyo, Japan Background – Grid Workflow Workflow Definition WPDL, BPEL4WS, GSFL, ASCI Grid, … Workflow Systems WebFlow, Symphony, GridAnt, BPWS4J, TENT, … Component-based Systems CCA/XCAT, SCIRun, CXML, … Other Systems Condor DAGMan, UNICORE, MyGrid, GEMSS, GridLab, BioOpera, USC Grid failure handling, …

CCGrid 2003, Tokyo, Japan Grid Resources Grid Resource: A particular grid resource is a high-end computing or storage resource that can be accessed remotely. Local Grid: A local grid consists of multiple grid resources that belong to one organization. Global Grid: The global grid includes all grid resources that belong to different organizations within a virtual organization.

CCGrid 2003, Tokyo, Japan Grid Tasks Task: Tasks are the smallest elements in a grid workflow, e.g. MPI & PVM programs. Sub-workflow: A sub-workflow is a flow of closely related tasks that is to be executed in a predefined sequence on grid resources of a local grid (within one organization). Workflow: A grid application can be represented as a flow of several different activities, each activity represented by a sub- workflow.

CCGrid 2003, Tokyo, Japan Grid Management Mapping grid workflows to the global grid Mapping grid sub- workflows to local grids Mapping grid tasks to grid resources

CCGrid 2003, Tokyo, Japan System Architecture Global Grid GridFlow User Portal Grid Resources Workflow Management Resource Management (ARMS) Information Services (Globus MDS) Local Grid Sub-workflow scheduling Resource Scheduling (Titan) Performance Services (PACE … ) Grid Users

CCGrid 2003, Tokyo, Japan PACE Performance Prediction Application Tools Resource Tools Evaluation Engine Source Code Analysis Object Editor Object Library PSL Compiler CPU Network (MPI, PVM) Cache (L1, L2) HMCL Compiler

CCGrid 2003, Tokyo, Japan Titan Resource Scheduling Heuristic Evolutionary Near-optimal: Makespan Idletime Deadlines

CCGrid 2003, Tokyo, Japan ARMS Grid Management Agent structure Communication layer Decision-making layer Local management layer Agent hierarchy Service advertisement Service discovery Agent Capability Tables A AA AA Use r

CCGrid 2003, Tokyo, Japan GridFlow User Portal

CCGrid 2003, Tokyo, Japan Global Grid Workflow Management S 2 startT = 0 exeT = 3 endT = 3 S 1 startT = 0 exeT = 0 endT = 0 S 3 startT = 0 exeT = 5 endT = 5 S 4 startT = 5 exeT = 7 endT = 12 S 5 startT = 5 exeT = 4 endT = 9 S 6 startT = 12 exeT = 0 endT = 12 / 7 / 12 S2S2 S1S1 S3S3 S4S4 S5S5 S6S6 / 5

CCGrid 2003, Tokyo, Japan Local Grid Sub-workflow Scheduling Scheduling a flow of tasks onto grid resources within a local grid is very similar to the process that schedules a workflow onto different local grids. There are two challenges: It is a difficult task to provide an accurate prediction on task/workflow start, execution and end times. Multiple tasks from different sub-workflows may require the same grid resource at the same time.

CCGrid 2003, Tokyo, Japan Fuzzy Timing Techniques Turning the “prediction accuracy” into a fuzzy concept that is represented using fuzzy numbers. π 1 (τ)=0.5(0,2,6,7) π 2 (τ)=(2,4,4,6)

CCGrid 2003, Tokyo, Japan Fuzzy Number Operations latestearliest min max sum

CCGrid 2003, Tokyo, Japan Resource Conflict Solving I The start time of a task cannot be configured with the latest end time of its pre-tasks directly, since other tasks exists that may use the same resource at the same time. A first-come possibly-first-serve policy is adopted. This does not order the conflictive tasks explicitly, but adds some information on degrees of possibilities of task start times.

CCGrid 2003, Tokyo, Japan Resource Conflict Solving II All possible start sequences are considered and are combined to provide an estimation of the end time.

CCGrid 2003, Tokyo, Japan Summary GridFlow is a prototype grid workflow management system, focusing on grid workflow simulation and scheduling. GridFlow is based on a specific grid resource management infrastructure implemented using agent-based methodologies and performance- driven scheduling technologies. Making grid workflow management a reality also requires to address general grid computing challenges: openness, standards, security and QoS support.

CCGrid 2003, Tokyo, Japan Ongoing Work – Applications Developing grid enabled medical simulation services (GEMSS) using GT3 Developing grid performance services based on historical information analysis Developing medical application workflows using BPEL4WS Target Object Scanning & Preprocessing Numerical Modeling Analysis, Diagnosis, Design HPC Simulation

CCGrid 2003, Tokyo, Japan Future Work – Agile Computing Workflow techniques – one of keys for next generation agile (grid) computing Flexibility (Performance, Adaptation, QoS, Individualization) Efficiency (Cheap, Large-scale, Pervasive, Continuous, Massive) Cluster Computing HPC Supercomputing Grid Computing P2P Computing Internet Computing Agile (Grid) Computing

CCGrid 2003, Tokyo, Japan For More Information