The Gridbus Toolkit for Building and Deploying eScience Applications on Utility Grids Rajkumar Buyya Fellow of Grid Computing Grid Computing and Distributed.

Slides:



Advertisements
Similar presentations
Convergence Characteristics for Clusters, Grids, and P2P networks
Advertisements

Nimrod/G GRID Resource Broker and Computational Economy
Grid Business Symposium 2005, Seoul, Korea
Nimrod/G and Grid Market A Case for Economy Grid Architecture for Service Oriented Global Grid Computing Rajkumar Buyya, David Abramson, Jon Giddy Monash.
Computational Grids and Computational Economy: Nimrod/G Approach David Abramson Rajkumar Buyya Jonathan Giddy.
Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
Grid Economics and Business Models: A Gridbus Perspective Rajkumar Buyya Grid and Distributed Systems (GRIDS) Laboratory Dept. of Computer Science and.
Utility-Oriented Cloud & Grid Computing: A Vision, Hype, and Reality
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
SLA-Oriented Resource Provisioning for Cloud Computing
1 GridSim 2.0 Adv. Grid Modelling & Simulation Toolkit Rajkumar Buyya, Manzur Murshed (Monash), Anthony Sulistio, Chee Shin Yeo Grid Computing and Distributed.
1 Project Overview EconomyGrid Economic Paradigm For “Resource Management and Scheduling” for Service-Oriented Grid Computing Presenter Name: Sama GovindaRamanujam.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 12 Slide 1 Distributed Systems Design 2.
High Performance Computing Course Notes Grid Computing.
® IBM India Research Lab © 2006 IBM Corporation Challenges in Building a Strategic Information Integration Infrastructure Mukesh Mohania IBM India Research.
Gridbus Middleware and Utility Grids: Building Autonomic and Market-Oriented Global Grids for Delivering IT Services as the 5 th Utility Dr. Rajkumar Buyya.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Universität Dortmund Robotics Research Institute Information Technology Section Grid Metaschedulers An Overview and Up-to-date Solutions Christian.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
The Gridbus Middleware: Building and Managing Utility Grids for Powering e-Science and e-Business Applications Dr. Rajkumar Buyya Grid Computing and Distributed.
The Gridbus Middleware: Creating and Managing Utility Grids for Powering e-Science and e-Business Applications Dr. Rajkumar Buyya Grid Computing and Distributed.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
SensorGrid: A new Cyberinfrastructure Integrating Sensor Network and Grid Computing for e-Science Applications Dr. Rajkumar Buyya Grid Computing and Distributed.
SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking.
QoS-based Scheduling of e-Research Application Workflows on Global Grids Dr. Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS) Laboratory Dept.
Apache Airavata GSOC Knowledge and Expertise Computational Resources Scientific Instruments Algorithms and Models Archived Data and Metadata Advanced.
Gridbus Resource Broker for Application Service Costs-based Scheduling on Global Grids: A Case Study in Brain Activity Analysis Srikumar Venugopal 1, Rajkumar.
1 Constructing A Grid Simulation with Differentiated Network Service using GridSim Anthony Sulistio, Gokul Poduval, Rajkumar Buyya, Chen-Kong Tham Fellow.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
GCC 2006 Panel: Grid Research and Engineering Vs Standards Dr. Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS) Laboratory Dept. of Computer.
1 Another Perspective on Grid Problem and Challenges ( A view from the Globus Project: ) M.Effatparvar Fall 1391.
Gridbus Toolkit for Belle Analysis Data Grid and Utility Computing Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS) Lab. Dept. of Computer.
Nimrod/G GRID Resource Broker and Computational Economy David Abramson, Rajkumar Buyya, Jon Giddy School of Computer Science and Software Engineering Monash.
DISTRIBUTED COMPUTING
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
Grid Resource Brokering and Cost-based Scheduling With Nimrod-G and Gridbus Case Studies Rajkumar Buyya Cloud Computing and Distributed Systems (CLOUDS)
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Grid Computing Melbourne: Gridbus Perspective
1 678 Topics Covered (1) Part A: Foundation Socket Programming Thread Programming Elements of Parallel Computing Part B: Cluster Computing Elements of.
Grid Resource Management: Challenges, Approaches, & Solutions Dr. Rajkumar Buyya Cloud Computing and Distributed Systems (CLOUDS) Lab. The University of.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
1 520 Student Presentation GridSim – Grid Modeling and Simulation Toolkit.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
“ A Distributed Computational Economy and the Nimrod-G Grid Resource Broker ”
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Grid Computing and the Gridbus Middleware: Making the Global Cyberinfrastructure for e-Science and e-Business Applications a Reality Dr. Rajkumar Buyya.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Authors: Rajkumar Buyya, David Abramson & Jonathan Giddy
Economic and On Demand Brain Activity Analysis on Global Grids A case study.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Aneka Cloud ApplicationPlatform. Introduction Aneka consists of a scalable cloud middleware that can be deployed on top of heterogeneous computing resources.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
Distributed Systems Architectures. Topics covered l Client-server architectures l Distributed object architectures l Inter-organisational computing.
David Abramson, Rajkumar Buyya, and Jonathan Giddy
Convergence Characteristics for Clusters, Grids, and P2P networks
Large Scale Distributed Computing
Building and running HPC apps in Windows Azure
Presentation transcript:

The Gridbus Toolkit for Building and Deploying eScience Applications on Utility Grids Rajkumar Buyya Fellow of Grid Computing Grid Computing and Distributed Systems (GRIDS) Lab. Dept. of Computer Science and Software Engineering The University of Melbourne, Australia

2 Outline Introduction to eScience and Challenges Introduction to the Gridbus Project An Overview of Gridbus Components Grid Service Broker Architecture Design and Implementation Scheduling Algorithms BioGrid Demo OR Performance Evaluation A Case Study in High Energy Physics Economy-based Scheduling in Data Grids Summary

3 Prominent Grid Drivers: Emerging eScinece and eBusiness Apps Next generation experiments, simulations, sensors, satellites, even people and businesses are creating a flood of data. They all involve numerous experts/resources from multiple organization in synthesis, modeling, simulation, analysis, and interpretation. Life Sciences Digital Biology Finance: Portfolio analysis ~PBytes/sec Newswire & data mining: Natural language engineering Astronomy Internet & Ecommerce High Energy PhysicsBrain Activity Analysis Quantum Chemistry

4 E-Science Elements Distributed instruments Distributed computation Distributed data Peers sharing ideas and collaborative interpretation of data/results E-Scientist 2100 Remote Visualization Data & Compute Service

5 Grids have Emerged as Scalable Cyberinfrastructure for e-Science Applications Grid Resource Broker Resource Broker Application Grid Information Service Grid Resource Broker database R2R2 R3R3 RNRN R1R1 R4R4 R5R5 R6R6 Grid Information Service

6 Type of Services Modern Grids Offer Computational Services – CPU cycles NASA IPG, TeraGrid, I-Grid,… Data Services Data replication, management, secure access-- LHC Grid/Napster Application Services Access to remote software/libraries and license management—NetSolve Information Services Extraction and presentation of data with meaning Knowledge Services The way knowledge is acquired and managed— data mining. Utility Computing Services Towards a market-based Grid computing: Leasing and delivering Grid services as ICT utilities. Computional Grid Data Grid ASP Grid Information Grid Knowledge Grid Utility Grid

7 Grid Challenges Security Resource Allocation & Scheduling Data locality Network Management System Management Resource Discovery Uniform Access Computational Economy Application Construction

8 Some Grid Initiatives Worldwide Australia Nimrod-G Gridbus DISCWorld GrangeNet. APACGrid ARC eResearch? Brazil OurGrid, EasyGrid LNCC-Grid + many others China ChinaGrid – Education CNGrid - application Europe UK eScience EU Grids.. and many more... India I-Grid  Japan NAGERI Korea... N*Grid Singapore NGP USA Globus NASA IPG AccessGrid TeraGrid Cyberinfrasture and many more... Industry Initiatives IBM On Demand Computing HP Adaptive Computing Sun N1 Microsoft -.NET Oracle 10g Infosys – Business Grid StorageTek –Grid.. and many more Public Forums Global Grid Forum Australian Grid Forum Conferences: CCGrid Grid P2P HPDC billion – 3 yrs 1 billion – 5 yrs 450million – 5 yrs 486million – 5 yrs 1.3 billion (Rs) 27 million 2? billion 120million – 5 yrs

9 The Gridbus Melbourne: Enable Leasing of ICT Services on Demand WWG World Wide Grid!  On Demand Utility Computing Gridbus Distributed Data

10 The Gridbus Project: A multi-institutional “Open Source” R&D Project with focus on: Architecture, Specification, and Open Source Reference Implementation. Service-Oriented Grid, Utility Computing & Distributed Data and Computation Economy Scaling from Desktops, Clusters, Cluster Federation, Enterprise Grids to Global Grids. Grid Market Directory and Web Services Grid Bank: Accounting and Transaction Management Visual Tools for Creation of Distributed Applications Workflow Composition and Deployment Services Data Grid Brokering and Grid Economy Services Data Replication Strategies GridSim Toolkit: Enhanced to support Data Grid, Reservation, etc. Libra: Economic Cluster Scheduler Coupling of Clusters and Computational Economy Alchemi: Harnessing.NET/Windows-based Resources WWG: Global Data Intensive Grid Testbed Application Enabler Projects: High-Energy Physics, Astronomy, Brain Activity Analysis – Osaka U., Natural Language Processing, Portfolio Analysis – Spain, BioGrid - WEHI (via APACGrid), SensorGrid (NICTA), Medical Imaging (HFI) Supported by:

11 Grid Economy: Methodology for Sustained Resourced Sharing and Managing Supply-and-Demand for Resources

12 New challenges of Grid Economy Grid Service Providers (GSPs) How do I decide service pricing models ? How do I specify them ? How do I translate them into resource allocations ? How do I enforce them ? How do I advertise & attract consumers ? How do I do accounting and handle payments? ….. Grid Service Consumers (GSCs) How do I decide expenses ? How do I express QoS requirements ? How do I trade between timeframe & cost ? How do I map jobs to resources to meet my QoS needs? ….. They need mechanisms and technologies for value expression, value translation, and value enforcement.

GRACE: Service Oriented Grid Architecture GRid Architecture for Computational Economy (GRACE)

14 Grid Node N GRACE: A Reference Service-Oriented Grid Architecture for Computational Economies Grid Consumer Programming Environments Grid Resource Broker Grid Service Providers Grid Explorer Schedule Advisor Trade Manager Job Control Agent Deployment Agent Trade Server Resource Allocation Resource Reservation R1R1 Misc. services Information Service R2R2 RmRm … Pricing Algorithms Accounting Grid Node1 … Grid Middleware Services … … Health Monitor Grid Market Services JobExec Info ? Secure Trading QoS Storage Sign-on Grid Bank Applications Data Catalogue

15 Gridbus and Complementary Grid Technologies – realizing GRACE AIX Solaris WindowsLinux.NET Grid Fabric Software Grid Applications Core Grid Middleware User-Level Middleware (Grid Tools) Grid Bank Grid Exchange & Federation JVM Grid Brokers: X-Parameter Sweep Lang. Gridbus Data Broker MPI CondorSGETomcatPBS Alchemi Workflow IRIXOSF1 Mac Libra GlobusUnicore … … Grid Market Directory PDBCDB Worldwide Grid Grid Fabric Hardware … … PortalsScienceCommerceEngineering … … Collaboratories … … Workflow Engine Grid Storage Economy Grid Economy NorduGridXGrid ExcellGrid Nimrod-G GRIDSIMGRIDSIM Gridscape

16 Gridbus Technologies Application Construction Tools Visual Parametric Modeller (VPM) Grid Economy Services Grid Market Directory A Registry for publication of GSPs and their Services – VO/VE Grid Bank A Grid Accounting Services Grid Trading Services Data Grid Service Broker QoS based Scheduling of Distributed Data Oriented Apps on global Grids Grid Workflow Management System Gridscape Interactive Grid Testbed Portal Generator G-monitor Grid Application Execution Management Portal GridSim A Grid Simulation Toolkit Libra Economy based Cluster Scheduling

17 Alchemi:.NET-based Enterprise Grid Platform & Web Services Internet Alchemi Worker Agent Alchemi Manager Alchemi Users Web Services like Model General Purpose Dedicated/Non-dedicate workers Role-based Security.NET and Web Services C# Implementation GridThread and Job Model Programming Easy to setup and use

18 On Demand Assembly of Services: Putting Them All Together Data Source (Instruments/dis tributed sources) Data Replicator (GDMP) ASP Catalogue Grid Info Service Grid Market Directory GSP (Accounting Service) Gridbus GridBank Data GSP (e.g., UofM) PE GSP (e.g., VPAC) PE GSP (e.g., IBM) CPU or PE Grid Service (GS) (Globus) Alchemi GS GTS Cluster Scheduler Grid Service Provider (GSP) (e.g., CERN) PE Cluster Scheduler Job 8 Grid Resource Broker 2 Visual Application Composer Application Code Explore data Results 97 Results+ Cost Info Bill 12 Data Catalogue

Creation and Operation of Virtual Enterprises Grid Market Directory Grid Bank

20 A Market-Oriented Grid Environment

21 Grid Market Infrastructure Grids need to provide an infrastructure that supports: (a) the creation of one or more GMP registries; (b) the contributors to register themselves as GSPs along with their resources/application services that they wish to provide; (c) GSPs to publish themselves in one or more GMPs along with service prices; and (d) Grid resource brokers to discover resources/services and their attributes (e.g., access price and usage constraints) that meet user QoS requirements.

22 Grid Bank: Grid Transactions Authorization, Accounting, & Payment Infrastructure Grid Resource Broker (GRB) GridBank Payment Module Grid Trade Server GridBank Charging Module GridBank Server Establish Service Costs ApplicationsApplications Grid Agent Grid Resource Meter GridCheque Deploy Agent and Submt Jobs Usage Agreement Resource Usage GridCheque Grid Service Provider (GSP) GridCheque + Resource Usage (GSC Account Charge Grid Service Consumer (GSC) R1R2 R3 R4 User

Grid Applications: Composition and Deployment – A Broker Perspective Nimrod-G Broker: A Grid Broker for Computational Grids Gridbus Broker: A Grid Service Broker for Data Grids

24 Grid Applications and Parametric Computing Bioinformatics: Drug Design / Protein Modelling Bioinformatics: Drug Design / Protein Modelling Sensitivity experiments on smog formation Natural Language Engineering Ecological Modelling: Control Strategies for Cattle Tick Electronic CAD: Field Programmable Gate Arrays Computer Graphics: Ray Tracing High Energy Physics: Searching for Rare Events Finance: Investment Risk Analysis VLSI Design: SPICE Simulations Aerospace: Wing Design Network Simulation Automobile: Crash Simulation Data Mining Civil Engineering: Building Design astrophysics

25 Thesis Build a task farming application (parameter sweep or bag of tasks) and execute it on Grid within “T” hours or early and cost not exceeding $M. Manual Automated Three Options/Solutions: Using pure Globus commands Build your own Distributed App & Scheduler Use Gridbus Resource Broker to compose and schedule

The Gridbus Grid Service Broker for Data Grid Applications Builds on the Nimrod-G Computational Grid Broker and Computational Economy [Buyya, Abramson, Giddy, Monash University, ] And Extends its notion for Data and Service Grids

27 A resource broker for scheduling task farming data Grid applications with static or dynamic parameter sweeps on global Grids. It uses computational economy paradigm for optimal selection of computational and data services depending on their quality, cost, and availability, and users’ QoS requirements (deadline, budget, & T/C optimisation) Key Features A single window to manage & control experiment Programmable Task Farming Engine Resource Discovery and Resource Trading Optimal Data Source Discovery Scheduling & Predications Generic Dispatcher & Grid Agents Transportation of data & sharing of results Accounting Grid Service Broker (GSB)

28 Gridbus Broker at a Glance Home Node/Portal -PBS -Condor -SGE Alchemi Globus Job manager fork()batch() Gridbus Broker Gateway Unicore fork() batch() -PBS -Condor -Alchemi Data Store Access Technology Grid FTP SRB Gridbus agent Data Catalog Credential Repository MyProxy

29 Gridbus Broker Architecture Grid Middleware Gridbus Client Gribus Client Grid Info Server Schedule Advisor Trading Manager Gridbus Farming Engine Record Keeper Grid Explorer GE GIS, NWS TM TS RM & TS Grid Dispatcher RM: Local Resource Manager, TS: Trade Server G G C U Globus enabled node. A L Alchemi enabled node. (Data Grid Scheduler) Data Catalog Data Node Unicore enabled node. $ $ $ App, T, $, Opt (Bag of Tasks Applications)

30 Gridbus Services for eScience applications Application Development Environment: XML-based language for composition of task farming (legacy) applications as parameter sweep applications. Task farming APIs for new applications. Web APIs (e.g., Portlets) for Grid portal development. Workflow interface and Gridbus-enabled workflow engine. Resource Allocation and Scheduling Dynamic discovery of optional computational and data nodes that meet user QoS requirements. Hide L ow-Level Grid Middleware interfaces Globus, Alchemi, Unicore, NorduGrid, XGrid, etc.

31 Gridbus Broker: XML file 1 1 main./calc $X $Y

32 Portal-based Access to Grid Broker for Launching and Steering Applications Grid Broker World-Wide Grid

33 Figure 3 : Logging into the portal. Drug Design Made Easy!

34 Excel Plugin to Access Gridbus Services Excel ExcelGrid Add-In ExcelGrid Runner ExcelGridJob ExcelGrid MiddlewareGridbus BrokerEnterprise Grid 210 0

35 Discover Resources Distribute Jobs Establish Rates Meet requirements ? Remaining Jobs, Deadline, & Budget ? Evaluate & Reschedule Discover More Resources Compose & Schedule Adaptive Scheduling Steps

36 Deadline (D) and Budget (B) Constrained Scheduling Algorithms AlgorithmExecution Time (D) Execution Cost (B) Compute Grid Data Grid Cost OptLimited by DMinimize Yes Cost-Time OptMinimize if possible Minimize Yes Time OptMinimizeLimited by B Yes Conservative- Time Opt MinimizeLimited by B, jobs have guaranteed minimum budget Yes

37 Sample Applications of Gridbus Broker Molecular Docking - WEHI Drug Discovery Brain Activity Analysis – Osaka University Neuroscience studies Natural Language Engineering – Melbourne NLP Indexing of newswire data High Energy Physics – School of Physics/Melbourne Belle experiment data analysis Finance - Portfolio Analysis – U. Comp. Madrid/Spain Investment risk analysis Astronomy Australian Virtual Observatory Spreadsheet Processing Microsoft Excel

Economy-based Data Grid Scheduling High Energy Physics as eScience Application Case Study

39 Case Study: High Energy Physics What is High Energy Physics? (HEP) Study of the fundamental constituents of matter and forces. High Energy Physics - using H.E. enables the probing of smaller distances/structures and study in early-universe like environ. Particle Physics - quanta of matter/forces and their properties The Belle Experiment KEK B-Factory, Japan Investigating fundamental violation of symmetry in nature (Charge Parity) which may help explain the universal matter – antimatter imbalance. Collaboration 400 people, 50 institutes 100’s TB data currently

40 Case Study: Event Simulation and Analysis B0->D*+D*-Ks Simulation and Analysis Package - Belle Analysis Software Framework (BASF) Experiment in 2 parts – Generation of Simulated Data and Analysis of the distributed data Only the Analysis is discussed here

41 Australian Belle Data Grid Testbed

42 Case Study: Input File for Analysis parameter jobf Gridfile lfn:/users/winton/fsimddks/fsimdata*.mdst; task main copy runme.grid2 node:runme.grid2 node:execute./runme.grid2 $jobf $jobname endtask Dynamic parameter defined to describe an input data file Logical file name pointing to the location in the replica catalog that contains a mapping to where the physical files are present. 100 data files (30MB each) were equally distributed among the five nodes

43 Resources Used and their Service Price Organization Node detailsRoleCost (in G$/CPU-sec) CS,UniMelbbelle.cs.mu.oz.au 4 CPU, 2GB RAM, 40 GB HD, Linux Broker host, Data host, NWS server N.A. (Not used as a compute resource) Physics, UniMelbfleagle.ph.unimelb.edu.au 1 CPU, 512 MB RAM, 40 GB HD, Linux Replica Catalog host, Data host, Compute resource, NWS sensor 2 CS, University of Adelaide belle.cs.adelaide.edu.au 4 CPU (only 1 available), 2GB RAM, 40 GB HD, Linux Data host, NWS sensor N.A. (Not used as a compute resource) ANU, Canberrabelle.anu.edu.au 4 CPU, 2GB RAM, 40 GB HD, Linux Data host, Compute resource, NWS sensor 4 Dept of Physics, USyd belle.physics.usyd.edu.au 4 CPU (only 1 available), 2GB RAM, 40 GB HD, Linux Data host, Compute resource, NWS sensor 4 VPAC, Melbournebrecca-2.vpac.org 180 node cluster (only head node used), Linux Compute resource, NWS sensor 6

44 Network Cost (in Grid $/Currency!)

45 Deploying Application Scenario A data grid scenario with 100 jobs and each accessing remote data of ~30MB Deadline: 3hrs. Budget: G$ 60K Scheduling Optimisation Scenario: Minimise Time Minimise Cost Results:

46 Time Minimization in Data Grids Time (in mins.) Number of jobs completed fleagle.ph.unimelb.edu.aubelle.anu.edu.aubelle.physics.usyd.edu.aubrecca-2.vpac.org

47 Results : Cost Minimization in Data Grids Time(in mins.) Number of jobs completed fleagle.ph.unimelb.edu.aubelle.anu.edu.aubelle.physics.usyd.edu.aubrecca-2.vpac.org

48 Observation Organization Node detailsCost (in G$/CPU-sec)Total Jobs Executed TimeCost CS,UniMelbbelle.cs.mu.oz.au 4 CPU, 2GB RAM, 40 GB HD, Linux N.A. (Not used as a compute resource) -- Physics, UniMelbfleagle.ph.unimelb.edu.au 1 CPU, 512 MB RAM, 40 GB HD, Linux CS, University of Adelaide belle.cs.adelaide.edu.au 4 CPU (only 1 available), 2GB RAM, 40 GB HD, Linux N.A. (Not used as a compute resource) -- ANU, Canberrabelle.anu.edu.au 4 CPU, 2GB RAM, 40 GB HD, Linux 42 2 Dept of Physics, USyd belle.physics.usyd.edu.au 4 CPU (only 1 available), 2GB RAM, 40 GB HD, Linux VPAC, Melbournebrecca-2.vpac.org 180 node cluster (only head node used), Linux 623 2

49 Grid Workflow Management System and Broker Services Database Workflow Submission Handler Workflow Language Parser Tasks Parameters Dependencies Resource Discovery DispatcherData Movement GMD Replica Catalog Gridbus Broker Globus Web services HTTPGridFTP Data transfer Workflow PlannerApplication Composition …… Scientific Portal Workflow Enactment Engine Workflow description & QoS Info Service MDS Workflow Scheduler

50 The GridSim Toolkit A Java based tool for Grid Scheduling Simulations Basic Discrete Event Simulation Infrastructure Virtual Machine (Java, cJVM, RMI) PCs Clusters Workstations... SMPs Distributed Resources GridSim Toolkit Application Modeling Information Services Resource Allocation Grid Resource Brokers or Schedulers’s Simulation Statistics Resource Modeling and Simulation (with Time and Space shared schedulers) Job Management ClustersSingle CPUReservationSMPsLoad Pattern Application Configuration Resource Configuration Visual Modeler Grid Scenario Network SimJavaDistributed SimJava Resource Entities Output Application, User, Grid Scenario’s Input and Results Add your own policy for resource allocation

51 Selected GridSim Users

52 Summary and Conclusion Introduced requirements for an eScience application Demonstrated suitability of Grid computing as Cyberinfrastructure for eScience and eBusiness. Grids exploit synergies that result from cooperation of autonomous entities: Resource sharing, dynamic provisioning, and aggregation at global level. Grids allow users to dynamically lease Grid services at runtime based on their quality, cost, availability, and users QoS requirements. Delivering ICT services as computing utilities. Grids offer enormous opportunities for realizing eScience and eBusiness at global level.

53 Any Questions ?

54 Testbed Details All nodes ran Globus ANU and Melbourne CS had 4 CPUs each. Sydney node was effectively 1 processor (SMP kernel was disabled) Adelaide Globus Gatekeeper was not functioning however we could get data off it. BASF was pre-installed on all the machines Gigabyte of code.