Using SchedFlow for Performance Evaluation of Workflow Applications. Barton P. Miller (University of Wisconsin); Elisa Heymann, Gustavo Martínez, Miquel Angel Senar, Emilio Luque (Universitat Autònoma de Barcelona)

Presentation transcript:

1 Using SchedFlow for Performance Evaluation of Workflow Applications. Barton P. Miller (University of Wisconsin); Elisa Heymann, Gustavo Martínez, Miquel Angel Senar, Emilio Luque (Universitat Autònoma de Barcelona)

2 Our Problem [Diagram: workflow tasks T1 to T7, Scheduling Policies, Workflow Engines]

3 Our Solution [Diagram: workflow tasks T1 to T7, Scheduling Policies, Workflow Engines, SchedFlow]

4 Outline ›Introduction ›SchedFlow ›Experimental Study ›Conclusions

5 Introduction ›To execute a workflow in a distributed environment, we need a scheduling policy integrated into a workflow engine ›Goal: reduce makespan ›Factors that affect makespan: –Workload size –Inaccurate computing and communication times –Machines appearing/disappearing dynamically

6 Introduction ›With SchedFlow, we assessed the influence of the workload on the makespan considering: ›Different scheduling policies ›Different workflow engines

SchedFlow [Architecture diagram. Labels: workflow tasks T1 to T7, User Policy API, Scheduler, Controller, Observer, Task manager queue, Adaptor, Workflow Engine, logs]

The user submits a workflow (tasks T1 to T4).

The Scheduler applies the specified scheduling policy to the available resources discovered by the Observer (machines M1 to M4).

The Controller receives the first task-machine pairs.

The Controller tells the adaptor which engine to use. The adaptor formats the task for that engine and enqueues it.
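The adaptor step can be pictured as a lookup from engine name to a formatting function. The sketch below is only an illustration: the engine names `engine-a` and `engine-b`, the two output formats, and the `enqueue` helper are hypothetical and are not taken from SchedFlow or from any real workflow engine.

```python
import json
from typing import Callable, Dict, List

# Hypothetical formatters: each turns a task-machine pair into the text
# that a (made-up) engine would accept.
def format_for_engine_a(task: str, machine: str) -> str:
    return f"task={task};machine={machine}"

def format_for_engine_b(task: str, machine: str) -> str:
    return json.dumps({"task": task, "machine": machine})

ADAPTORS: Dict[str, Callable[[str, str], str]] = {
    "engine-a": format_for_engine_a,
    "engine-b": format_for_engine_b,
}

def enqueue(engine: str, task: str, machine: str, queue: List[str]) -> None:
    """Format a task-machine pair for the chosen engine and append it to the queue."""
    if engine not in ADAPTORS:
        raise ValueError(f"no adaptor registered for engine {engine!r}")
    queue.append(ADAPTORS[engine](task, machine))

queue: List[str] = []
enqueue("engine-a", "T1", "M1", queue)
print(queue)
```

Keeping the engine-specific formatting behind one lookup means the scheduling side only ever deals with task-machine pairs.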

The Engine sends the task to the assigned machine. The Observer checks the Engine log for finished tasks.

When the task finishes, the Observer notifies the Scheduler.

The Scheduler finds the tasks that have their dependencies satisfied and sends them to the Controller.
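Finding the tasks whose dependencies are satisfied amounts to a set check over the workflow graph. The following is a minimal sketch, assuming a four-task workflow in which T1 feeds T2 and T3, and both feed T4. The dependency map and the `ready_tasks` function are illustrative assumptions, not SchedFlow code.

```python
from typing import Dict, List, Set

def ready_tasks(deps: Dict[str, Set[str]],
                finished: Set[str],
                dispatched: Set[str]) -> List[str]:
    """Return tasks not yet dispatched whose parent tasks have all finished."""
    return sorted(
        task for task, parents in deps.items()
        if task not in dispatched and parents <= finished
    )

# Assumed workflow shape: T1 -> (T2, T3) -> T4.
deps = {"T1": set(), "T2": {"T1"}, "T3": {"T1"}, "T4": {"T2", "T3"}}

first = ready_tasks(deps, finished=set(), dispatched=set())
after_t1 = ready_tasks(deps, finished={"T1"}, dispatched={"T1"})
print(first, after_t1)
```

In this sketch T4 stays blocked until both T2 and T3 are in `finished`.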

T2 finishes OK. M3 is claimed.

The Observer detects the problem, and T3 is removed from M3 and dynamically rescheduled.

T3 is rescheduled. The Observer no longer includes M3 as an available resource.
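The rescheduling step can be sketched as removing the lost machine from the available set and assigning the task again. The `reschedule` function, its arguments, and the "first free machine" choice are assumptions made for illustration. In SchedFlow the new machine would be chosen by the configured scheduling policy.

```python
from typing import Dict, Set

def reschedule(task: str,
               assignments: Dict[str, str],
               machines: Set[str],
               lost: str) -> str:
    """Move `task` off the machine `lost` and onto another free machine."""
    available = machines - {lost}  # the lost machine is no longer offered
    busy = {m for t, m in assignments.items() if t != task}
    free = sorted(available - busy)
    if not free:
        raise RuntimeError(f"no machine available for {task}")
    assignments[task] = free[0]    # placeholder policy: first free machine
    return free[0]

assignments = {"T3": "M3"}
new_machine = reschedule("T3", assignments, {"M2", "M3", "M4"}, lost="M3")
print(new_machine, assignments)
```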

T3 finishes OK. The Observer notifies the Scheduler, which releases T4.

When T4 finishes, the Observer computes the makespan.
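Makespan is the time from the start of the first task to the end of the last one. A minimal sketch of that calculation follows, assuming the log yields one (task, start, end) record per task. The record layout and the timestamps are made up for illustration and are not measurements from the study.

```python
from typing import Iterable, Tuple

def makespan(records: Iterable[Tuple[str, float, float]]) -> float:
    """Return latest end time minus earliest start time over all task records."""
    records = list(records)
    if not records:
        raise ValueError("no task records")
    earliest_start = min(start for _, start, _ in records)
    latest_end = max(end for _, _, end in records)
    return latest_end - earliest_start

# Illustrative timestamps in seconds (hypothetical values).
log = [
    ("T1", 0.0, 10.0),
    ("T2", 10.0, 25.0),
    ("T3", 10.0, 40.0),
    ("T4", 40.0, 55.0),
]
total = makespan(log)
print(total)
```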

32 Experimental Study ›Execution environment: –140 machines ›Workflow applications: –Montage (53 tasks) –LIGO (81 tasks) ›Workflow engines: –Condor-DAGMan 7.0 –Taverna –Karajan 4_0_a1

33 Experimental Study ›Scheduling policies: –Default –Min-min –HEFT –BMCT
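Of the policies listed, Min-min is the simplest to sketch: among the unscheduled tasks, repeatedly pick the task-machine pair with the smallest completion time, assign it, and update that machine's ready time. The code below is a textbook-style sketch for a batch of independent ready tasks, not the implementation used in the study. The run-time estimates and the `min_min` function are hypothetical.

```python
from typing import Dict, List, Tuple

def min_min(exec_time: Dict[str, Dict[str, float]]) -> List[Tuple[str, str, float]]:
    """Schedule independent tasks with the Min-min heuristic.

    exec_time[task][machine] is the estimated run time of task on machine.
    Returns (task, machine, finish_time) tuples in assignment order.
    """
    ready_at = {m: 0.0 for times in exec_time.values() for m in times}
    unscheduled = set(exec_time)
    schedule: List[Tuple[str, str, float]] = []
    while unscheduled:
        best = None
        # Smallest completion time over all remaining task-machine pairs.
        for task in sorted(unscheduled):
            for machine, run in sorted(exec_time[task].items()):
                finish = ready_at[machine] + run
                if best is None or finish < best[2]:
                    best = (task, machine, finish)
        task, machine, finish = best
        ready_at[machine] = finish  # machine is busy until this task ends
        unscheduled.remove(task)
        schedule.append(best)
    return schedule

# Hypothetical run-time estimates for two ready tasks on two machines.
estimates = {
    "T2": {"M2": 4.0, "M3": 6.0},
    "T3": {"M2": 3.0, "M3": 5.0},
}
plan = min_min(estimates)
print(plan)
```

For a workflow with dependencies, a heuristic like this would be applied to each batch of ready tasks in turn. HEFT and BMCT also account for task ordering and communication costs, which this sketch leaves out.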

34 Experimental Study ›Input workload: –400 MB –1024 MB ›We studied the effect of the scheduling policies. ›We measured application makespan ›Real executions

35 Results ›Montage ran on Taverna, DAGMan, Karajan ›400 MB input workload ›120 executions ›Default scheduling policy

36 Results ›Same experiments but using SchedFlow ›Min-min, HEFT, BMCT ›Rescheduling

37 Results ›Montage ran on Taverna, DAGMan, Karajan ›1024 MB input workload ›120 executions ›Default scheduling policy

38 Results ›Same experiments but using SchedFlow ›Min-min, HEFT, BMCT ›Rescheduling

39 Results ›LIGO ran on Taverna, DAGMan, Karajan ›400 MB input workload ›120 executions ›Default scheduling policy

40 Results ›Same experiments but using SchedFlow ›Min-min, HEFT, BMCT ›Rescheduling

41 Results ›LIGO ran on Taverna, DAGMan, Karajan ›1024 MB input workload ›120 executions ›Default scheduling policy

42 Results ›Same experiments but using SchedFlow ›Min-min, HEFT, BMCT ›Rescheduling

43 Conclusions ›No single scheduling policy is the best for all scenarios ›SchedFlow allows us to obtain better performance by providing: –Flexibility regarding scheduling policies –Support for rescheduling –Integration with workflow engines
