Agent-Based Grid Load-Balancing Daniel P. Spooner University of Warwick, UK Junwei Cao NEC Europe Ltd., Germany.

Slides:

Advertisements

Similar presentations

Efficient Event-based Resource Discovery Wei Yan*, Songlin Hu*, Vinod Muthusamy +, Hans-Arno Jacobsen +, Li Zha* * Chinese Academy of Sciences, Beijing.

Advertisements

Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.

A system Performance Model Instructor: Dr. Yanqing Zhang Presented by: Rajapaksage Jayampthi S.

Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.

Performance-responsive Middleware for Grid Computing Dr Stephen Jarvis High Performance Systems Group University of Warwick, UK High Performance Systems.

A SYSTEM PERFORMANCE MODEL CSC 8320 Advanced Operating Systems Georgia State University Yuan Long.

Evaluation of Advertising Effectiveness Using Agent-Based Modeling and Simulation Junwei Cao Department of Computer Science University of Warwick.

Resource Management of Grid Computing

Nadia Ranaldo - Eugenio Zimeo Department of Engineering University of Sannio – Benevento – Italy 2008 ProActive and GCM User Group Orchestrating.

Effective Coordination of Multiple Intelligent Agents for Command and Control The Robotics Institute Carnegie Mellon University PI: Katia Sycara

Universität Dortmund Robotics Research Institute Information Technology Section Grid Metaschedulers An Overview and Up-to-date Solutions Christian.

GridFlow: Workflow Management for Grid Computing Kavita Shinde.

The CrossGrid project Juha Alatalo Timo Koivusalo.

Application-specific Tools Netsolve, Ninf, and NEOS CSE 225 Chas Wurster.

Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.

RETSINA: A Distributed Multi-Agent Infrastructure for Information Gathering and Decision Support The Robotics Institute Carnegie Mellon University PI:

GHS: A Performance Prediction and Task Scheduling System for Grid Computing Xian-He Sun Department of Computer Science Illinois Institute of Technology.

Diffusion scheduling in multiagent computing system MotivationArchitectureAlgorithmsExamplesDynamics Robert Schaefer, AGH University of Science and Technology,

New Challenges in Cloud Datacenter Monitoring and Management

23 September 2004 Evaluating Adaptive Middleware Load Balancing Strategies for Middleware Systems Department of Electrical Engineering & Computer Science.

Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)

IIIAURJCUPV Task 7.1 Software architecture and computation model E. Marcos C. Acuña Task 7.2 Multiagent System Platform A. Espinosa Task.

MobSched: An Optimizable Scheduler for Mobile Cloud Computing S. SindiaS. GaoB. Black A.LimV. D. AgrawalP. Agrawal Auburn University, Auburn, AL 45 th.

Distributed Real-Time Systems for the Intelligent Power Grid Prof. Vincenzo Liberatore.

©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.

N. GSU Slide 1 Chapter 02 Cloud Computing Systems N. Xiong Georgia State University.

Self-Organizing Agents for Grid Load Balancing Junwei Cao, Ph.D. Research Scientist Center for Space Research Massachusetts Institute of Technology Cambridge,

Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.

WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.

Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.

DR Software: Essential Foundational Elements and Platform Components UCLA Smart Grid Energy Research Center (SMERC) Industry Partners Program (IPP) Meeting.

CCGrid 2003, Tokyo, Japan GridFlow: Workflow Management for Grid Computing Junwei Cao ( 曹军威 ) C&C Research Labs, NEC Europe Ltd., Germany Stephen A. Jarvis.

Your university or experiment logo here Caitriana Nicholson University of Glasgow Dynamic Data Replication in LCG 2008.

Scientific Workflow Scheduling in Computational Grids Report: Wei-Cheng Lee 8th Grid Computing Conference IEEE 2007 – Planning, Reservation,

The Grid System Design Liu Xiangrui Beijing Institute of Technology.

DataTAG Research and Technological Development for a Transatlantic Grid Abstract Several major international Grid development projects are underway at.

Dynamic Load Balancing in Charm++ Abhinav S Bhatele Parallel Programming Lab, UIUC.

Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.

Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.

The Grid the united computing power Jian He Amit Karnik.

GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.

1 BRUSSELS - 14 July 2003 Full Security Support in a heterogeneous mobile GRID testbed for wireless extensions to the.

Department of Electronic Engineering Challenges & Proposals INFSO Information Day e-Infrastructure Grid Initiatives 26/27 May.

Performance evaluation on grid Zsolt Németh MTA SZTAKI Computer and Automation Research Institute.

What is SAM-Grid? Job Handling Data Handling Monitoring and Information.

Service-oriented Resource Broker for QoS-Guaranteed in Grid Computing System Yichao Yang, Jin Wu, Lei Lang, Yanbo Zhou and Zhili Sun Centre for communication.

GSAF: A Grid-based Services Transfer Framework Chunyan Miao, Wang Wei, Zhiqi Shen, Tan Tin Wee.

6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.

Resources Management and Component Placement Presenter:Bo Sheng.

Trust and Security for Next Generation Grids, Securing Grid-Based Supply Chains Marco Di Girolamo HP Italy Innovation Center, Italy On.

Scalable and Topology-Aware Load Balancers in Charm++ Amit Sharma Parallel Programming Lab, UIUC.

Xi He Golisano College of Computing and Information Sciences Rochester Institute of Technology Rochester, NY THERMAL-AWARE RESOURCE.

Performance-responsive Scheduling for Grid Computing Dr Stephen Jarvis High Performance Systems Group University of Warwick, UK High Performance Systems.

DS-Grid: Large Scale Distributed Simulation on the Grid Georgios Theodoropoulos Midlands e-Science Centre University of Birmingham, UK Stephen John Turner,

Hierarchical Load Balancing for Large Scale Supercomputers Gengbin Zheng Charm++ Workshop 2010 Parallel Programming Lab, UIUC 1Charm++ Workshop 2010.

Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,

+ Support multiple virtual environment for Grid computing Dr. Lizhe Wang.

Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.

Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002.

2004 Queue Scheduling and Advance Reservations with COSY Junwei Cao Falk Zimmermann C&C Research Laboratories NEC Europe Ltd.

IPDPS 2003, Nice, France Agent-Based Grid Load Balancing Using Performance-Driven Task Scheduling Junwei Cao (C&C Research Labs, NEC Europe Ltd., Germany)

Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,

Agent-Based Grid Load-Balancing

StratusLab Final Periodic Review

StratusLab Final Periodic Review

Junwei Cao Darren J. Kerbyson Graham R. Nudd

Department of Computer Science University of Warwick

Agent-based Resource Management for Grid Computing

Presentation transcript:

Agent-Based Grid Load-Balancing Daniel P. Spooner University of Warwick, UK Junwei Cao NEC Europe Ltd., Germany

Outline Grid Computing & Middleware Agent-Based Methodology Distributed Service Discovery Grid Load-Balancing Experimental Results Conclusions & Future Work

Grid Computing Computational Grids - blueprint for a new computing infrastructure Grid/Web Services - distributed system and application integration Data/Service/Community Grids - large-scale resource sharing in VOs

Grid Middleware Resource Management & Scheduling Information Services (Monitoring & Discovery) Data Management and Access Application Programming Environments Security, Accounting, QoS …… Condor, LSF, Ninf, Nimrod, …… Globus, Legion, DPSS, …… Java/Jini, CORBA, Web Services, ……

Grid Resource Management Key challenges: Cross-domain Large-scale Dynamic QoS support GRM

Agent-Based Methodology An agent is: A local grid manager An user agent A broker A service provider A service requestor A matchmaker A AA A

Local Management Processor 1 Processor 2 Processor 3 Processor 4 Processor 5 Processor 6 Processor 7 Processor 8 2 n -1 Performance Prediction Task models Hardware models FIFO Algorithm Genetic Algorithm Heuristic & Evolutionary Near-optimal on makespan, deadlines and idletime.

Service Discovery Pure data-pull No advertisement Full discovery Efficient when service change more quickly Pure data-push Full advertisement No discovery Efficient when requests arrive more frequently R/A AU/A 1 TaskInfo 2 PullSerInfo 3 PullSerInfo 4 Return R/A AU/A 3 TaskInfo 1 PushSerInfo 2 PushSerInfo 4 Return R/A Centralised, not applicable for grid computing!

Optimisation Strategies Configurable data-pull or data-push Agent hierarchy Multi-step advertisement & multi-step discovery Efficient when frequencies of request arrivals and service changes are almost the same Distributed! balancing between advertisement and discovery! A AA AA User

Agent Implementation Performance Prediction FIFO Scheduling Database Management Match Maker AdvertisementDiscovery Task Model Results Sched. Time User Deadline To Another Agent Task Execution Task Management FIFO/GA Scheduling Service Info. Provider Current Makespan Service Info Hw Model Task Execution Resource Monitoring Task Execution

Load Balancing Metrics Total makespan Average advance time of task execution completions (required deadline - actual task completion time) Average processor utilisation rate (busy time / total makespan) Load balancing level (1 - mean square deviation of processor utilisation rates / average processor utilisation rate) Total number of network packages for both advertisement and discovery

Experiment Design S 1 (SGIOrigin2000, 16) S 2 (SGIOrigin2000, 16) S 4 (SunUltra10, 16) S 3 (SunUltra10, 16) S 5 (SunUltra5, 16) S 6 (SunUltra5, 16) S 12 (SunSPARCstati on2, 16) S 11 (SunSPARCstati on2, 16) S 8 (SunUltra1, 16) S 7 (SunUltra5, 16) S 10 (SunUltra1, 16) S 9 (SunUltra1, 16) Tasks: sweep3d fft improc closure jacobi memsort cpi

Experiment 1 FIFO

Experiment 2 GA

Experiment 3 GA

Task Execution Both GA and agents contribute towards the improvement in task executions.

Resource Utilisation Less powerful S11 & S12 benefit mainly from the GA. More powerful S1 & S2 benefit mainly from agents.

Load Balancing The GA contributes more to local grid load balancing. Agents contribute more to global grid load balancing.

Total Makespan The centralised pure data-pull can always achieve the best results Distributed agents with the hierarchical model can also achieve reasonably good results

Network Package The network overhead for the pure data-pull strategy to achieve better results is very high. Distributed agent- based service advertisement and discovery can scale well.

Conclusions An multi-agent paradigm provides a clear high-level abstraction of grid resource management system. Distributed service advertisement and discovery strategies can be used to improve agent performance. Agent-based framework is scalable, flexible, and extensible for further enhancements.