Evaluating Meta-Scheduling Algorithms in VLAM-G Environment V.Korkhov, A.Belloum, L.O.Hertzberger FNWI, University of Amsterdam Key VLAM-G applications.

Slides:



Advertisements
Similar presentations
Analysis of Computer Algorithms
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Energy-efficient Task Scheduling in Heterogeneous Environment 2013/10/25.
Scheduling in Distributed Systems Gurmeet Singh CS 599 Lecture.
Neural and Evolutionary Computing - Lecture 4 1 Random Search Algorithms. Simulated Annealing Motivation Simple Random Search Algorithms Simulated Annealing.
Walter Binder University of Lugano, Switzerland Niranjan Suri IHMC, Florida, USA Green Computing: Energy Consumption Optimized Service Hosting.
Fundamentals of Python: From First Programs Through Data Structures
WS-VLAM: Towards a Scalable Workflow System on the Grid V. Korkhov, D. Vasyunin, A. Wibisono, V. Guevara-Masis, A. Belloum Institute.
Resource Management of Grid Computing
The Grid Background and Architecture. 1. Keys to success for IT technologies Infrastructure Open Standards.
Date:2011/06/08 吳昕澧 BOA: The Bayesian Optimization Algorithm.
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
1 HW/SW Partitioning Embedded Systems Design. 2 Hardware/Software Codesign “Exploration of the system design space formed by combinations of hardware.
Distributed Application Management Using PLuSH Jeannie Albrecht, Christopher Tuttle, Alex C. Snoeren, and Amin Vahdat UC San Diego CSE {jalbrecht, ctuttle,
Task Assignment and Transaction Clustering Heuristics.
QoS-constrained List Scheduling Heuristics for Parallel Applications on Grids 16-th Euromicro PDP Toulose, February 2008 QoS-CONSTRAINED LIST SCHEDULING.
Scheduling with Optimized Communication for Time-Triggered Embedded Systems Slide 1 Scheduling with Optimized Communication for Time-Triggered Embedded.
MAE 552 – Heuristic Optimization
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
A Platform-based Design Flow for Kahn Process Networks Abhijit Davare Qi Zhu December 10, 2004.
On Fairness, Optimizing Replica Selection in Data Grids Husni Hamad E. AL-Mistarihi and Chan Huah Yong IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS,
A New Approach for Task Level Computational Resource Bi-Partitioning Gang Wang, Wenrui Gong, Ryan Kastner Express Lab, Dept. of ECE, University of California,
Virtual Lab AMsterdam VLAM-G Project VLAM-G developers team Computer Architecture and Parallel Systems Group Department of Computer Science Universiteit.
Course Instructor: Aisha Azeem
Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing.
Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)
Dynamic and Decentralized Approaches for Optimal Allocation of Multiple Resources in Virtualized Data Centers Wei Chen, Samuel Hargrove, Heh Miao, Liang.
Department of Computer Science Engineering SRM University
A Budget Constrained Scheduling of Workflow Applications on Utility Grids using Genetic Algorithms Jia Yu and Rajkumar Buyya Grid Computing and Distributed.
Parallel Programming Models Jihad El-Sana These slides are based on the book: Introduction to Parallel Computing, Blaise Barney, Lawrence Livermore National.
Network Aware Resource Allocation in Distributed Clouds.
Service-enabling Legacy Applications for the GENIE Project Sofia Panagiotidi, Jeremy Cohen, John Darlington, Marko Krznarić and Eleftheria Katsiri.
Appraisal and Data Mining of Large Size Complex Documents Rob Kooper, William McFadden and Peter Bajcsy National Center for Supercomputing Applications.
The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Dataset Caitlin Minteer & Kelly Clynes.
Multicore Resource Management 謝政宏. 2 Outline Background Virtual Private Machines  Spatial Component  Temporal Component  Minimum and Maximum.
Query Optimization. Query Optimization Query Optimization The execution cost is expressed as weighted combination of I/O, CPU and communication cost.
20 October 2006Workflow Optimization in Distributed Environments Dynamic Workflow Management Using Performance Data David W. Walker, Yan Huang, Omer F.
A Survey of Distributed Task Schedulers Kei Takahashi (M1)
Scientific Workflow Scheduling in Computational Grids Report: Wei-Cheng Lee 8th Grid Computing Conference IEEE 2007 – Planning, Reservation,
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Analysis of algorithms Analysis of algorithms is the branch of computer science that studies the performance of algorithms, especially their run time.
Simulated Annealing.
Major objective of this course is: Design and analysis of modern algorithms Different variants Accuracy Efficiency Comparing efficiencies Motivation thinking.
Fundamentals of Algorithms MCS - 2 Lecture # 8. Growth of Functions.
Virtual Lab for e-Science Towards a new Science Paradigm.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
CS 484 Designing Parallel Algorithms Designing a parallel algorithm is not easy. There is no recipe or magical ingredient Except creativity We can benefit.
Department of Computer Science MapReduce for the Cell B. E. Architecture Marc de Kruijf University of Wisconsin−Madison Advised by Professor Sankaralingam.
Full and Para Virtualization
Optimization Problems
Introduction to Grid Computing and its components.
Globus Grid Tutorial Part 2: Running Programs Across Multiple Resources.
Computer science is a field of study that deals with solving a variety of problems by using computers. To solve a given problem by using computers, you.
Author Utility-Based Scheduling for Bulk Data Transfers between Distributed Computing Facilities Xin Wang, Wei Tang, Raj Kettimuthu,
Virtual Lab AMsterdam VLAMsterdam Abstract Machine Toolbox A.S.Z. Belloum, Z.W. Hendrikse, E.C. Kaletas, H. Afsarmanesh and L.O. Hertzberger Computer Architecture.
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
Achieving All the Time, Everywhere Access in Next- Generation Mobile Networks by Marcello Cinque, Domenico Cotroneo and Stefano Russo Presented by Ashok.
Architecture for Resource Allocation Services Supporting Interactive Remote Desktop Sessions in Utility Grids Vanish Talwar, HP Labs Bikash Agarwalla,
Memory management The main purpose of a computer system is to execute programs. These programs, together with the data they access, must be in main memory.
18 May 2006CCGrid2006 Dynamic Workflow Management Using Performance Data Lican Huang, David W. Walker, Yan Huang, and Omer F. Rana Cardiff School of Computer.
Real-Time Operating Systems RTOS For Embedded systems.
Memory Management.
Grid Computing.
Objective of This Course
The use of Neural Networks to schedule flow-shop with dynamic job arrival ‘A Multi-Neural Network Learning for lot Sizing and Sequencing on a Flow-Shop’
Resource Allocation in a Middleware for Streaming Data
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Evaluating Meta-Scheduling Algorithms in VLAM-G Environment V.Korkhov, A.Belloum, L.O.Hertzberger FNWI, University of Amsterdam Key VLAM-G applications VLAM-G VLAM-G, the Grid-based Virtual Laboratory AMsterdam, provides a science portal for distributed analysis in applied scientific research. VLAM-G provides tools and instruments designed to help scientists in performing experiments by providing high-level interface to Grid environment. Virtual laboratory can spread over multiple organizations, enabling access to resources available across different organization domains. The core of VLAM-G concept is a virtual experiment which is composed of distributed processing modules and can be considered as a meta-application. Scheduling algorithms used in evaluation Basic Greedy Algorithm (straightforward heuristic): extract set of available resources from Grid indexing service using minimal module requirement; sort resources according to available CPU power; sort modules according to CPU requirement (amount of processing cycles needed); try to map modules to different resources starting with the most powerful ones, only if no suitable resources left map more instances to single resource. Modified Basic Greedy Algorithm: modification of the previous algorithm, allowing to map several instances to single resource unless it decreases overall performance (estimation of partial run-time is made on each step) Computation-Network Prioritized Algorithm: here we introduce means to describe the level of meta- application relative intensity of data transfers and computations. Heuristic coefficient CN defines priority either of networking communications or computational operations for the experiment. The following formulae is used to rank resources: R=CPU rel *CN + BW rel /CN Simulated Annealing: based on probabilistic methods that avoid being stuck at local (non-global) minima. Here an objective function to be minimized is the overall execution time of an experiment. The execution time of each instance is equivalent to the "energy" of an instance. Then, "temperature" is the average of these times. Starting from some initial schedule and initial temperature, the algorithm randomly selects an instance to be remapped, randomly selects a suitable resource and remaps the instance. The total "energy" (execution time) of the experiment is estimated. Any downhill step is accepted and the process repeats. An uphill step may be accepted. This uphill decision is made by the Metropolis criterion. The Metropolis criterion attempts to permit small uphill moves while rejecting large uphill moves. Thus, the algorithm can escape from local minima. Simulation results For the experiments we simulated a resource pool of 20 available machines of various computing power, memory and storage capabilities. The machines were distributed across 5 domains with different bandwidths within the domains and between the domains. We combined the domains with powerful computational resources linked by a low bandwidth links with fast, but slower resource domains. The resources were represented by abstract values of CPU power (from 500 up to 3000 units), load ( ), network bandwidth (2-100 units between domains, within domains). Thus we tried to approach the real heterogeneity of Grid environment. To achieve this we simulated the information usually received from Grid indexing service (MDS). All the mentioned algorithms were tested in this simulation environment on several experiment topologies consisting of various number of module instances: from 3 to 10. On the figure we present the results for one set of experiment topologies. The Y axis on the charts corresponds to resulting schedule evaluation, proportional to overall runtime. The less the value is the more efficient is the schedule. The X axis represents all four examined algorithms. VLAM-G GUI Material analysis: MACSLab Medicine: MRI Scanner VLAM-G Architecture Front-End Session ManagerResource Manager Run-Time System Application with QoS Grid information services (NWS, Globus MDS) Grid resource allocation services (Globus GRAM) Fabric layer (hosts, networks etc.) VIMCO Resource Manager (RM): receives an experiment topology with module requirements (QoS); performs resource discovery, location and selection according to module requirements composes a number of candidate schedules that are estimated using specified cost model and resource state information; selects optimal schedule Application model and cost function VLAM-G experiment is represented by a meta-application composed of a number of components. For the application component C i we define: comp(C i ) - the computational load of the component C i, may be counted in the number of instructions to be executed; comm(C i,C j ) - the communicational load between C i and C j, may be counted in the number of bytes transferred between the components Consider S k, (k<=M), M – number of sites. We define: compT(C i,S k ) - the computation time for the component C i running on the site S k when the site provides all its resources to the application. In case other components are scheduled in then the function may be degraded. commT(C i,C j,S k,S l ) - the communication time taken for data transfers between C i scheduled on S k and C j scheduled on S l. The function may depend on the number of components sharing the link. commTT(C i,S k ) - the total communication cost for component C i placed on site S k, is a function of commT functions for the component’s links. In the simplest case is the sum of those costs. The cost function used for pipeline applications is thus: The basic greedy algorithm usually gives one of the two worst results. The modified basic greedy algorithm has different behaviors: in topologies with small number of modules it is very effective and gives good results, though the more the size of experiment grows the less effective it becomes. For the topologies with more than 6 modules BGM gives the worst results among all the algorithms. Computation-network prioritized algorithm gives better results than basic greedy algorithms (both standard and modified) only except the case of 3 modules when BGM is the most effective. The best results (except 3 modules case) have been shown by simulated annealing algorithm, though it was the most time consuming one. Thus the simulation results have shown that heuristic mapping algorithms might be effective in some system configurations, especially in small homogeneous environment, but for generic case and complex system topologies the algorithms of random search like simulated annealing are more promising. The lack of such algorithms is increased requirement for execution time caused by much more complex and extensive computations taken. Project Leader : L.O. Hertzberger Phone: Fax: Contacts: Vladimir Korkhov, Adam Belloum,