A Hyper-heuristic for scheduling independent jobs in Computational Grids Author: Juan Antonio Gonzalez Sanchez Coauthors: Maria Serna and Fatos Xhafa.

Slides:



Advertisements
Similar presentations
Scheduling in Distributed Systems Gurmeet Singh CS 599 Lecture.
Advertisements

Dynamic Thread Assignment on Heterogeneous Multiprocessor Architectures Pree Thiengburanathum Advanced computer architecture Oct 24,
SLA-Oriented Resource Provisioning for Cloud Computing
Using Parallel Genetic Algorithm in a Predictive Job Scheduling
By: Chris Klumph and Kody Willman. Types of Heuristics References Terminology Static Mappings 6 Example mappings 4 Graph chromosome mappings 1 Tree mapping.
ISE480 Sequencing and Scheduling Izmir University of Economics ISE Fall Semestre.
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
Scheduling in Batch Systems
Chapter 6: CPU Scheduling. 5.2 Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Feb 2, 2005 Chapter 6: CPU Scheduling Basic.
Security-Driven Heuristics and A Fast Genetic Algorithm for Trusted Grid Job Scheduling Shanshan Song, Ricky Kwok, and Kai Hwang University of Southern.
Scheduling with Optimized Communication for Time-Triggered Embedded Systems Slide 1 Scheduling with Optimized Communication for Time-Triggered Embedded.
High Performance Computing 1 Parallelization Strategies and Load Balancing Some material borrowed from lectures of J. Demmel, UC Berkeley.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Present by Chen, Ting-Wei Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids Maria Chtepen, Filip H.A. Claeys, Bart Dhoedt,
WSN Simulation Template for OMNeT++
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Grid Load Balancing Scheduling Algorithm Based on Statistics Thinking The 9th International Conference for Young Computer Scientists Bin Lu, Hongbin Zhang.
Lab 01 Fundamentals SE 405 Discrete Event Simulation
Using Simulated Annealing and Evolution Strategy scheduling capital products with complex product structure By: Dongping SONG Supervisors: Dr. Chris Hicks.
IE 594 : Research Methodology – Discrete Event Simulation David S. Kim Spring 2009.
Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)
Efficient Scheduling of Heterogeneous Continuous Queries Mohamed A. Sharaf Panos K. Chrysanthis Alexandros Labrinidis Kirk Pruhs Advanced Data Management.
1 Enabling Large Scale Network Simulation with 100 Million Nodes using Grid Infrastructure Hiroyuki Ohsaki Graduate School of Information Sci. & Tech.
Modeling and simulation of systems Simulation optimization and example of its usage in flexible production system control.
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
1 Performance Analysis of Coexisting Secondary Users in Heterogeneous Cognitive Radio Network Xiaohua Li Dept. of Electrical & Computer Engineering State.
Meta Scheduling Sathish Vadhiyar Sources/Credits/Taken from: Papers listed in “References” slide.
1 Performance Evaluation of Computer Systems and Networks Introduction, Outlines, Class Policy Instructor: A. Ghasemi Many thanks to Dr. Behzad Akbari.
Chapter 3 System Performance and Models. 2 Systems and Models The concept of modeling in the study of the dynamic behavior of simple system is be able.
A Survey of Distributed Task Schedulers Kei Takahashi (M1)
Autonomic scheduling of tasks from data parallel patterns to CPU/GPU core mixes Published in: High Performance Computing and Simulation (HPCS), 2013 International.
1 Short Term Scheduling. 2  Planning horizon is short  Multiple unique jobs (tasks) with varying processing times and due dates  Multiple unique jobs.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems with Multi-programming Chapter 4.
O PTIMAL SERVICE TASK PARTITION AND DISTRIBUTION IN GRID SYSTEM WITH STAR TOPOLOGY G REGORY L EVITIN, Y UAN -S HUN D AI Adviser: Frank, Yeong-Sung Lin.
1 11/29/2015 Chapter 6: CPU Scheduling l Basic Concepts l Scheduling Criteria l Scheduling Algorithms l Multiple-Processor Scheduling l Real-Time Scheduling.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
Copyright © 2010, SAS Institute Inc. All rights reserved. SAS ® Using the SAS Grid.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Silberschatz and Galvin  Operating System Concepts Module 5: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor.
1 CS.217 Operating System By Ajarn..Sutapart Sappajak,METC,MSIT Chapter 5 CPU Scheduling Slide 1 Chapter 5 CPU Scheduling.
Content caching and scheduling in wireless networks with elastic and inelastic traffic Group-VI 09CS CS CS30020 Performance Modelling in Computer.
Efficient Load Balancing Algorithm for Cloud Computing Network Che-Lun Hung 1, Hsiao-hsi Wang 2 and Yu-Chen Hu 2 1 Dept. of Computer Science & Communication.
Xi He Golisano College of Computing and Information Sciences Rochester Institute of Technology Rochester, NY THERMAL-AWARE RESOURCE.
A Fast Genetic Algorithm Based Static Heuristic For Scheduling Independent Tasks on Heterogeneous Systems Gaurav Menghani Department of Computer Engineering,
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
1 MEVAL: A Practically Efficient System for Secure Multi-party Statistical Analysis Koki Hamada NTT Secure Platform Laboratories.
A stochastic scheduling algorithm for precedence constrained tasks on Grid Future Generation Computer Systems (2011) Xiaoyong Tang, Kenli Li, Guiping Liao,
Chapter 4 CPU Scheduling. 2 Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation.
Name : Mamatha J M Seminar guide: Mr. Kemparaju. GRID COMPUTING.
Scheduling Algorithms Performance Evaluation in Grid Environments R, Zhang, C. Koelbel, K. Kennedy.
OPERATING SYSTEMS CS 3502 Fall 2017
CPU SCHEDULING.
Introduction to Load Balancing:
HyperSim: High Performance Simulation Kernel
Efficient Load Balancing Algorithm for Cloud
Chapter 6: CPU Scheduling
Chapter 6: CPU Scheduling
Collaborative Offloading for Distributed Mobile-Cloud Apps
CPU Scheduling G.Anuradha
Module 5: CPU Scheduling
3: CPU Scheduling Basic Concepts Scheduling Criteria
Chapter 6: CPU Scheduling
CPU SCHEDULING.
Lecture 2 Part 3 CPU Scheduling
Operating System , Fall 2000 EA101 W 9:00-10:00 F 9:00-11:00
Chapter 6: CPU Scheduling
Module 5: CPU Scheduling
Chapter 6: CPU Scheduling
Module 5: CPU Scheduling
Presentation transcript:

A Hyper-heuristic for scheduling independent jobs in Computational Grids Author: Juan Antonio Gonzalez Sanchez Coauthors: Maria Serna and Fatos Xhafa

Overview Introduction and motivation Hyper-heuristic design Hyper-heuristic tests Conclusions Future work

Introduction and motivation Computational Grid –Parallel and distributed system that enables the sharing, selection and aggregation of geographically distributed resources Efficient scheduling of tasks in resources in a global, heterogeneous and dynamic environment Tasks –From different users –Executed in unique resource –Different types (intensive numeric computation vs data process) / (immediate vs batch) Resources –Dynamically added/dropped from the Grid –Can process one task at a time –Specialiced resources (intensive numeric computation vs data process)

Introduction and motivation Ad-hoc heuristics: –Simples –Deterministic –Short execution time E.g.: Opportunistic Load balancing, Minimum Completion Time, Minimum Execution Time, etc... No one method performs best! –Need to select them in an accordance with grid instance to yield best performance

Problem definition (instances) –Tasks to be scheduled –Resources to be used in scheduling –Workload of each task (in millions of instructions) –Computing capacity of each resource in mips –Ready time: ready[m] - when the resource m will finish executing its scheduled tasks. –ETC[t][m] – Expected Time to Compute task t in resource m (from Simulation Model of Braun et al. 2001)

Problem definition (Objectives) Makespan: finishing time of latest task max{F t : t in Tasks} or max{Cr: r in Resources} Flowtime: sum of finishing times of tasks sum {F t : t in Tasks} Note: Completion time: C r = ready[r] + sum {ETC[t][r] for tasks scheduled in r}

Ad-hoc heuristics Our Ad-hoc heuristics: –Immediate mode: tasks are scheduled as soon as they arrive in the system Opportunistic Load Balancing, Minimum Completion Time, Minimum Execution Time, Switching Algorithm and K-Percent Best –Batch mode: The schedule is done for a set of tasks (a batch). Min-Min, Max-Min, Sufferage and Relative-Cost

Evaluating instance characteristics Instance notation x_yyzz ( Braun et al. 2001) –X : computing consistency (c-consistent, i-inconsistent and s-semiconsistent) –YY: Tasks heterogeneity (hi-high and lo-low) –ZZ: Resources heterogeneity (hi-high and lo-low) Preprocess input information –Workload variance  task heterogeneity –Mips variance  resource heterogeneity –ETC matrix analysis  matrix consistency Note: ETC=Expected Time to Compute

Performance of Hyper-heuristic for Braun et al.’s instances - Makespan OLBMCTMETSAKPBMin- Min Max- Min SuffRC C_HIHIXX C_HILOXX C_LOHIXX C_LOLOXX I_HIHIXX I_HILOXX I_LOHIXX I_LOLOXX S_HIHIXX S_HILOXX S_LOHIXX S_LOLOXX X-the method was chosen most of the times out of 100 independent runs

Performance of Hyper-heuristic for Braun et al.’s instances - Flowtime OLBMCTMETSAKPBMin- Min Max- Min SuffRC C_HIHIXX C_HILOXX C_LOHIXX C_LOLOXX I_HIHIXX I_HILOXX I_LOHIXX I_LOLOXX S_HIHIXX S_HILOXX S_LOHIXX S_LOLOXX X-the method was chosen most of the times out of 100 independent runs

Proposal: Hyper-heuristic design Parameters of the hyper-heuristic: –Parameter to fix task heterogeneity threshold –Parameter to fix resource heterogeneity threshold –Parameter to work with immediate or batch methods –Parameter to fix the measure to optimize (makespan or flowtime)

Proposal: Hyper-heuristic design High-level algorithm: Input: parameters, tasks, resources, ready-times, ETC matrix 1.Evaluate task heterogeneity 2.Evaluate resource heterogeneity 3.Examine ETC matrix to deduce its consistency 4.Choose (based on parameters) the ad-hoc method to execute 5.Execute ad-hoc method Output: schedule

Performance Evaluation of Hyper-heuristic for a grid simulation environment We use a Grid Simulator implemented with a discrete event simulation library (HyperSim). Highly parametrizable: –Distributions of arriving and leaving of resources in the Grid and its mips –Distributions of task arrival to the Grid and its workloads –Initial resources/tasks in the system and maximum tasks to generate –Task and resource types –Percentage of immediate/batch tasks. For a schedule event the simulator calls the hyper- heuristic and passes it the ETC matrix, ready times, resources and tasks that will be scheduled as input

Simulator trace example time= event= EVN_NEW_HOST info: Host# 00000, Mips = time= event= EVN_NEW_HOST info: Host# 00001, Mips = time= event= EVN_ENTER info: Task# 00000, Work = time= event= EVN_ENTER info: Task# 00001, Work = time= event= EVN_ENTER info: Task# 00002, Work = time= event= EVN_ENTER info: Task# 00003, Work = time= event= EVN_SCHEDULE info: Scheduled Tasks, Hosts time= event= EVN_START info: Task# on Host# finishTime = exeTime = time= event= EVN_START info: Task# on Host# finishTime = exeTime =

Simulator + Hyperheuristic Simulator Hyper Parameters Statistic Results

Performance Evaluation of Hyper-heuristic using the simulator: static vs dynamic Two environment types for tests: –Static: generate concrete instances (a priori fixed configuration) –Dynamic: the usual use of the simulator intended for a real environment Using the Simulator for generating 3 Grid types: Small, Medium and Large size Tests for the 2 measures: Makespan and Flowtime We compare the hyper-heuristic versus an hyper- heuristic with fully random decisions Percentage ratio of tasks immediate/batch is modified too: 0%, 25%, 50%, 75% and 100%.

Static – Makespan (small, medium and large size instances)

Static – Flowtime (small, medium and large size instances)

Dynamic - Makespan

Dynamic - Flowtime

Conclusions As expected, the schedules done by the HH using guided decisions are better than decisions without any knowledge. For Makespan, we have seen that the results increase when the ratio of immediate/batch is 0.5, this indicates that both types of ad-hocs damage each other’s strategy For Flowtime, when the ratio of immediate/batch is favorable to batch, better results are produced.

Future Work Add transmission time in our simulations Make the hyperheursitic more “intelligent” in decision-taking Evaluate the HH in a real grid: –Develop an interface to use it in a real grid –Extract the state of the net (grid characteristics, job characteristics etc.)