Scheduling Algorithms Performance Evaluation in Grid Environments R, Zhang, C. Koelbel, K. Kennedy.

Slides:



Advertisements
Similar presentations
Exploiting Deadline Flexibility in Grid Workflow Rescheduling Wei Chen Alan Fekete Young Choon Lee.
Advertisements

Energy-efficient Task Scheduling in Heterogeneous Environment 2013/10/25.
Constraint Optimization We are interested in the general non-linear programming problem like the following Find x which optimizes f(x) subject to gi(x)
Scheduling in Distributed Systems Gurmeet Singh CS 599 Lecture.
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
CPE555A: Real-Time Embedded Systems
Using Parallel Genetic Algorithm in a Predictive Job Scheduling
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 14: March 3, 2004 Scheduling Heuristics and Approximation.
Wavelength Assignment in Optical Network Design Team 6: Lisa Zhang (Mentor) Brendan Farrell, Yi Huang, Mark Iwen, Ting Wang, Jintong Zheng Progress Report.
Scheduling of parallel jobs in a heterogeneous grid environment Scheduling of parallel jobs in a heterogeneous grid environment Each site has a homogeneous.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
THE SINGLE MACHINE EARLY/TARDY PROBLEM* PENG SI OW & THOMAS E. MORTON IE Paper Presentation A. İrfan Mahmutoğulları *Ow, P. S., & Morton, T. E. (1989).
Date:2011/06/08 吳昕澧 BOA: The Bayesian Optimization Algorithm.
QoS-constrained List Scheduling Heuristics for Parallel Applications on Grids 16-th Euromicro PDP Toulose, February 2008 QoS-CONSTRAINED LIST SCHEDULING.
Optimizing Schedules for Prioritized Path Planning of Multi-Robot Systems Maren Bennewitz Wolfram Burgard Sebastian Thrun.
Runtime Autonomous Component Management Systems. CMS Runtime Component Optimizer We have designed software APIs for CMS Runtime Optimizer Develop general-purpose.
Solving the Protein Threading Problem in Parallel Nocola Yanev, Rumen Andonov Indrajit Bhattacharya CMSC 838T Presentation.
Scalable Information-Driven Sensor Querying and Routing for ad hoc Heterogeneous Sensor Networks Maurice Chu, Horst Haussecker and Feng Zhao Xerox Palo.
Grid Load Balancing Scheduling Algorithm Based on Statistics Thinking The 9th International Conference for Young Computer Scientists Bin Lu, Hongbin Zhang.
Authors: Weiwei Chen, Ewa Deelman 9th International Conference on Parallel Processing and Applied Mathmatics 1.
A Budget Constrained Scheduling of Workflow Applications on Utility Grids using Genetic Algorithms Jia Yu and Rajkumar Buyya Grid Computing and Distributed.
Chapter 2 Modeling and Finding Abnormal Nodes. How to define abnormal nodes ? One plausible answer is : –A node is abnormal if there are no or very few.
Of 21 1 Low-Cost Task Scheduling for Distributed-Memory Machines Andrei Radulescu and Arjan J.C. Van Gemund Presented by Bahadır Kaan Özütam.
 Escalonamento e Migração de Recursos e Balanceamento de carga Carlos Ferrão Lopes nº M6935 Bruno Simões nº M6082 Celina Alexandre nº M6807.
Scheduling of Parallel Jobs In a Heterogeneous Multi-Site Environment By Gerald Sabin from Ohio State Reviewed by Shengchao Yu 02/2005.
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 12: February 13, 2002 Scheduling Heuristics and Approximation.
Energy Aware Task Mapping Algorithm For Heterogeneous MPSoC Based Architectures Amr M. A. Hussien¹, Ahmed M. Eltawil¹, Rahul Amin 2 and Jim Martin 2 ¹Wireless.
A Survey of Distributed Task Schedulers Kei Takahashi (M1)
ASC2003 (July 15,2003)1 Uniformly Distributed Sampling: An Exact Algorithm for GA’s Initial Population in A Tree Graph H. S.
INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY, P.P , MARCH An ANFIS-based Dispatching Rule For Complex Fuzzy Job Shop Scheduling.
Stochastic DAG Scheduling using Monte Carlo Approach Heterogeneous Computing Workshop (at IPDPS) 2012 Extended version: Elsevier JPDC (accepted July 2013,
LATA: A Latency and Throughput- Aware Packet Processing System Author: Jilong Kuang and Laxmi Bhuyan Publisher: DAC 2010 Presenter: Chun-Sheng Hsueh Date:
1 Andreea Chis under the guidance of Frédéric Desprez and Eddy Caron Scheduling for a Climate Forecast Application ANR-05-CIGC-11.
The Application of The Improved Hybrid Ant Colony Algorithm in Vehicle Routing Optimization Problem International Conference on Future Computer and Communication,
1 Short Term Scheduling. 2  Planning horizon is short  Multiple unique jobs (tasks) with varying processing times and due dates  Multiple unique jobs.
Static Process Scheduling Section 5.2 CSc 8320 Alex De Ruiter
Summary :-Distributed Process Scheduling Prepared By:- Monika Patel.
Parallelizing Video Transcoding Using Map-Reduce-Based Cloud Computing Speaker : 童耀民 MA1G0222 Feng Lao, Xinggong Zhang and Zongming Guo Institute of Computer.
Real-Time Support for Mobile Robotics K. Ramamritham (+ Li Huan, Prashant Shenoy, Rod Grupen)
A Hyper-heuristic for scheduling independent jobs in Computational Grids Author: Juan Antonio Gonzalez Sanchez Coauthors: Maria Serna and Fatos Xhafa.
CS 484 Load Balancing. Goal: All processors working all the time Efficiency of 1 Distribute the load (work) to meet the goal Two types of load balancing.
COSC 5341 High-Performance Computer Networks Presentation for By Linghai Zhang ID:
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
Critical Area Attention in Traffic Aware Dynamic Node Scheduling for Low Power Sensor Network Proceeding of the 2005 IEEE Wireless Communications and Networking.
1 Low Latency Multimedia Broadcast in Multi-Rate Wireless Meshes Chun Tung Chou, Archan Misra Proc. 1st IEEE Workshop on Wireless Mesh Networks (WIMESH),
A stochastic scheduling algorithm for precedence constrained tasks on Grid Future Generation Computer Systems (2011) Xiaoyong Tang, Kenli Li, Guiping Liao,
Genetic algorithms for task scheduling problem J. Parallel Distrib. Comput. (2010) Fatma A. Omara, Mona M. Arafa 2016/3/111 Shang-Chi Wu.
Metaheuristics for the New Millennium Bruce L. Golden RH Smith School of Business University of Maryland by Presented at the University of Iowa, March.
Scheduling Strategies for Mapping Application Workflows Onto the Grid A. Mandal, K. Kennedy, C. Koelbel, G. Marin, J. Mellor- Crummey, B. Liu, L. Johnsson.
Multiple-goal Search Algorithms and their Application to Web Crawling Dmitry Davidov and Shaul Markovitch Computer Science Department Technion, Haifa 32000,
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Machine Learning Lecture 4: Unsupervised Learning (clustering) 1.
Jérémie Sublime Sonia Yassa Development of meta-heuristics for workflow scheduling based on quality of service requirements 1.
Genetic Algorithms And other approaches for similar applications Optimization Techniques.
Robust Task Scheduling in Non-deterministic Heterogeneous Computing Systems Zhiao Shi Asim YarKhan, Jack Dongarra Followed by GridSolve, FT-MPI, Open MPI.
Real-Time Operating Systems RTOS For Embedded systems.
Resource Provision for Batch and Interactive Workloads in Data Centers Ting-Wei Chang, Pangfeng Liu Department of Computer Science and Information Engineering,
New Workflow Scheduling Techniques Presentation: Anirban Mandal
VGrADS Tools Activities
A Dynamic Critical Path Algorithm for Scheduling Scientific Workflow Applications on Global Grids e-Science IEEE 2007 Report: Wei-Cheng Lee
Namyoon Woo and Heon Y. Yeom
Richard Anderson Lecture 6 Greedy Algorithms
Richard Anderson Autumn 2016 Lecture 7
Market-based Dynamic Task Allocation in Mobile Surveillance Systems
Richard Anderson Lecture 7 Greedy Algorithms
Richard Anderson Winter 2019 Lecture 7
Richard Anderson Autumn 2015 Lecture 7
R, Zhang, A. Chien, A. Mandal, C. Koelbel,
Richard Anderson Autumn 2019 Lecture 7
Presentation transcript:

Scheduling Algorithms Performance Evaluation in Grid Environments R, Zhang, C. Koelbel, K. Kennedy

Motivation Effective scheduling is critical for the performance of an application launched onto the Grid(Distributed) environment Many scheduling algorithms have been proposed, studied and compared but there are few studies comparing their performance in Grid environments The Grid environment has the unique property of drastic cost differences between inter-cluster and the intra-cluster data transfers.

Two schemes In this paper, we compare several scheduling algorithms that represent two major schemes: Level based and List based algorithms. The list-based approach first orders the nodes in the DAG by a pre-calculated priority(i.e. b-level, critical path). Then the scheduler considers the nodes in order, assigning each to a resource that minimizes a suitable cost function(makespan). The level-based methods first organize the DAG into levels in which all the nodes are independent and then schedule the jobs within each level. We will also examine some hybrid versions.

Algorithms Heterogeneous Earliest Finish Time (HEFT) Scheduling Algorithm Use b-level to sort Heterogeneously adept version of MCP Levelized Heuristic Based Scheduling (LHBS) Algorithm Greedy Heuristic (simple LHBS) Min-min Heuristic (sophisticated LHBS) Min-max Heuristic (sophisticated LHBS) Sufferage Heuristic (sophisticated LHBS) Hybrid Heuristic(HHS) Scheduling Algorithm Different combinations, we use LHHS

Experimental Environment We defined and generated the experimental environment including the universe of compute and network resources and DAGs representing different applications. Three resource universes The universal environment: over 18,000 processors in 500 clusters The many-cluster environment: around 300 processors in 20 clusters The big-cluster environment: around 300 processors in 4 clusters

DAG Design We use DAGs taken from two real Grid applications(EMAN and Montage) and three classes of artificially-generated DAGs which abstract certain characteristics of these applications. The DAG generator can generate different formats of DAGs. Currently, we support fully random, level, and choke formats. The DAG generator also takes other parameters. For each DAG we have cost model instance to estimate the performance of tasks in such DAG.

Experiment Design We produced our DAGs with the following parameters: Type = {random, level, choke} Total number of nodes = {300, 1000, 3000} Shape = {0.5, 1.0, 5.0} Average out degree = {1.0, 2.0, 5.0} We generated 5 random DAGs for each possible parameter combination.In addition, we used 30 EMAN and 30 Montage DAGs. For each of those DAGs, we applied our cost model with the following parameters: DataSize = {20,1000},{100,1000},{500,1000} CCR = {0.1, 1.0, 10} Complexity = {0.85,1.15}, {0,6, 1.4}, {0.15, 1.85} In total,our experiments schedule and evaluate over 10,000 DAG/environment combinations.

Performance

Different DAG types

Different CCR

Different Resources

Cluster Settings

Cluster fitting

Constrained Resource

Current work Look ahead Idea: Instead of dispatching the task to the resource that results in the earliest finish time, try to find the resource that will give the earliest finish time for the child level tasks. Drawback: The complexity soar up to O(VP^2*d), d is the average out degree. Dynamic sorting Idea: Update the t-level of each task after their parents are scheduled. Then schedule tasks in one level on the order of t-level+b-level. Drawback: The preliminary results didn’t show any advantage over HEFT or LHHS. Cluster Constraint Idea: Select the clusters to schedule on. We introduced the aggregated computing power property of the cluster. Drawback: It doesn’t perform much better when the DAG is computational intensive. Hope: It works very well in other situations, outperform the one-level same algorithm by as many as 50%. To complement this, we can simply choose the fastest clusters in computational intensive cases.