David Abramson, Rajkumar Buyya, and Jonathan Giddy

Slides:



Advertisements
Similar presentations
Nimrod/G GRID Resource Broker and Computational Economy
Advertisements

Computational Grids and Computational Economy: Nimrod/G Approach David Abramson Rajkumar Buyya Jonathan Giddy.
CSF4 Meta-Scheduler Tutorial 1st PRAGMA Institute Zhaohui Ding or
National Institute of Advanced Industrial Science and Technology Ninf-G - Core GridRPC Infrastructure Software OGF19 Yoshio Tanaka (AIST) On behalf.
The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
1 Project Overview EconomyGrid Economic Paradigm For “Resource Management and Scheduling” for Service-Oriented Grid Computing Presenter Name: Sama GovindaRamanujam.
High Performance Parametric Modeling with Nimrod/G: A Killer Application for the Global Grid ? David Abramson, Jon Giddy and Lew Kotler Presentation By:
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
Resource Management of Grid Computing
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Universität Dortmund Robotics Research Institute Information Technology Section Grid Metaschedulers An Overview and Up-to-date Solutions Christian.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
Workload Management Massimo Sgaravatto INFN Padova.
1 GRID D. Royo, O. Ardaiz, L. Díaz de Cerio, R. Meseguer, A. Gallardo, K. Sanjeevan Computer Architecture Department Universitat Politècnica de Catalunya.
Grids and Globus at BNL Presented by John Scott Leita.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
The Globus Toolkit: Description and Applications Review Steve Tuecke & Ian Foster Argonne National Laboratory The University of Chicago Globus Co-PI: Carl.
INFN-GRID Globus evaluation (WP 1) Massimo Sgaravatto INFN Padova for the INFN Globus group
Nimrod/G GRID Resource Broker and Computational Economy David Abramson, Rajkumar Buyya, Jon Giddy School of Computer Science and Software Engineering Monash.
DISTRIBUTED COMPUTING
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
From PC Clusters to a Global Computational Grid David Abramson Head of School Computer Science and Software Engineering Monash University Thanks to Jon.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
1 520 Student Presentation GridSim – Grid Modeling and Simulation Toolkit.
Nimrod & NetSolve Sathish Vadhiyar. Nimrod Sources/Credits: Nimrod web site & papers.
The Globus Project: A Status Report Ian Foster Carl Kesselman
“ A Distributed Computational Economy and the Nimrod-G Grid Resource Broker ”
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Tools for collaboration How to share your duck tales…
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Globus Toolkit Massimo Sgaravatto INFN Padova. Massimo Sgaravatto Introduction Grid Services: LHC regional centres need distributed computing Analyze.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Economic and On Demand Brain Activity Analysis on Global Grids A case study.
Globus Grid Tutorial Part 2: Running Programs Across Multiple Resources.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
6 march Building the INFN Grid Proposal outline a.ghiselli,l.luminari,m.sgaravatto,c.vistoli INFN Grid meeting, milano.
Millions of Jobs or a few good solutions …. David Abramson Monash University MeSsAGE Lab X.
The Globus Toolkit The Globus project was started by Ian Foster and Carl Kesselman from Argonne National Labs and USC respectively. The Globus toolkit.
Holding slide prior to starting show. Scheduling Parametric Jobs on the Grid Jonathan Giddy
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
GridOS: Operating System Services for Grid Architectures
Workload Management Workpackage
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Clouds , Grids and Clusters
Duncan MacMichael & Galen Deal CSS 534 – Autumn 2016
Example: Rapid Atmospheric Modeling System, ColoState U
Peter Kacsuk – Sipos Gergely MTA SZTAKI
The Open Grid Service Architecture (OGSA) Standard for Grid Computing
Globus —— Toolkits for Grid Computing
Rajkumar Buyya School of Computer Science and Software Engineering
Grid Computing.
Grid Portal Services IeSE (the Integrated e-Science Environment)
University of Technology
Convergence Characteristics for Clusters, Grids, and P2P networks
CSS490 Grid Computing Textbook No Corresponding Chapter
Large Scale Distributed Computing
Wide Area Workload Management Work Package DATAGRID project
Proposed Grid Protocol Architecture Working Group
The Anatomy and The Physiology of the Grid
The Anatomy and The Physiology of the Grid
Grid Computing Software Interface
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

David Abramson, Rajkumar Buyya, and Jonathan Giddy Computational Grids and Computational Economy: Nimrod/G Approach Nimrod/G: Economic/Market-based Resource Management and Scheduling (for Parametric Modeling) on the Global Computational Grid Project Team: David Abramson, Rajkumar Buyya, and Jonathan Giddy

A user has an application, say a simulation program, that for a given set of parameters, will calculate a single result. The user wishes to explore the effect of modifying some of those input parameters… Parametric Modeling Study the behaviour of output variables against a range of different input scenarios Execute one application repeatedly for many combinations of input parameters Coarse-grained SPMD (single program - multiple data) model for i in (10, 20, 30, 40, 50, 60, 70, 80, 90, 100): for j in (‘v’, ‘w’, ‘x’, ‘y’, ‘z’): myprog $i $j > output.$i.$j - Each computation is totally independent from the others - This pseudocode takes 50 x the time for one single execution.

Working with Small Clusters Nimrod (1994 - ) DSTC funded project Designed for department level clusters Proof of concept Clustor (Activetools) (1997 - ) Commercial version of Nimrod Re-engineered Features Workstation orientation Access to idle workstations Random allocation policy Password security

Execution Architecture Input Files Substitution Output Files Computational Nodes Root Machine

Clustor Tools

Dispatch cycle using Clustor...

Sample Applications of Clustor Bioinformatics: Protein Modeling Sensitivity experiments on smog formation Parametric study of Laser detuning Combinatorial Optimization: Simulated Annealing Ecological Modeling: Control Strategies for Cattle Tick Electronic CAD: Field Programmable Gate Arrays Computer Graphics: Ray Tracing High Energy Physics: Searching for Rare Events Physics: Laser-Atom Collisions VLSI Design: SPICE Simulations

Clustor limitations Manual resource location static file of machine names No resource scheduling first come first served No cost model all machines cost alike Single access mechanism

Requirements Users and system managers want to know where it will run when it will run how much it will cost that access is secure homogeneous access

Towards Grid Computing…. Source: www.globus.org & updated

Why “The Grid”? New applications based on high-speed coupling of people, computers, databases, instruments, etc. Computer-enhanced instruments Collaborative engineering Browsing of remote datasets Use of remote software Data-intensive computing Very large-scale simulation Large-scale parameter studies Source: www.globus.org

The Grid Vision: To offer “Dependable, consistent, pervasive access to [high-end] resources” Dependable: Can provide performance and functionality guarantees Consistent: Uniform interfaces to a wide variety of resources Pervasive: Ability to “plug in” from anywhere Source: www.globus.org

Challenging Issues Authenticate once Specify simulation (code, resources, etc.) Locate resources Negotiate authorization, acceptable use, etc. Acquire resources Initiate computation Steer computation Access remote datasets Collaborate on results Account for usage Domain 1 Domain 2 Source: www.globus.org

Standards & Commodity Tech Where appropriate, exploit standards and commodity technology in core infrastructure LDAP, SSL, X.509, GSS-API, GAA-API, http, ftp, XML, etc. Provides leverage Interface with other common standards CORBA, Java/Jini, DCOM, Web, etc While our core infrastructure may not be built on one of these distributed architectures, we must cleanly interface with them Source: www.globus.org

The Globus Project Basic research in grid-related technologies Resource management, QoS, networking, storage, security, adaptation, policy, etc. Development of Globus toolkit Core services for grid-enabled tools & applns Construction of large grid testbed: GUSTO Largest grid testbed in terms of sites & apps Application experiments Tele-immersion, distributed computing, etc. Source: www.globus.org

Layered Architecture (Grid Components) Applications High-level Services and Tools GlobusView Testbed Status DUROC MPI MPI-IO CC++ Nimrod/G globusrun Core Services Nexus GRAM Metacomputing Directory Service Globus Security Interface Heartbeat Monitor Gloperf GASS Local Services Condor MPI TCP UDP LSF Easy NQE AIX Irix Solaris Source: www.globus.org

Core Globus Services Communication infrastructure (Nexus, IO) Information services (MDS) Network performance monitoring (Gloperf) Process monitoring (HBM) Remote file and executable management (GASS and GEM) Resource management (GRAM) Security (GSI) Source: www.globus.org

Nimrod/G Architecture Nimrod/G Client Nimrod/G Client Nimrod/G Client Parametric Engine Schedule Advisor Resource Discovery Persistent Info. Dispatcher Grid Directory Services Grid Middleware Services GUSTO Test Bed

Nimrod/G Interactions Additional services used implicitly: GSI (authentication & authorization) Nexus (communication) Resource location MDS server Scheduler Resource allocation (local) Prmtc.. Engine Dispatcher GRAM server Queuing System Job Wrapper User process GASS server File access Root node Gatekeeper node Computational node

Global resource allocation Global information is hard to get and out of date Load balancing Fairness to multiple users Global limits are easy to set and fairly stable Load profiling Cost-based resource allocation

Computational Economy Resource selection on based real money and market based A large number of sellers and buyers (resources may be dedicated/shared) Negotiation: tenders/bids and select those offers meet the requirement Trading and Advance Resource Reservation Schedule computations on those resources that meet all requirements

Cost Model 1 3 2 User 5 Machine 1 User 1 Machine 5 non-uniform costing time to time one user to another usage duration encourages use of local resources first user can access remote resources, but pays a penalty in higher cost.

A Nimrod/G User Console Deadline Cost Available Machines

Some early results

Related Works AppLeS (UC. San Diego) application level scheduling & case-by-case NetSolve (UTK/ORNL) API for creating farms DISCWorld (U. Adelaide) remote information access Millennium (UC. Berkeley) remote execution environment on clusters and supports computational economy

Conclusions Nimrod/G architecture offers a scalable model for resource management and scheduling on computational grids Supports Computational Economy The current model supporting Parametric Computing can be extended to support parallel jobs or any other computational model. Plan to use the concept of Advance Resource Reservation in order to offer the feature wherein the user can say “I am willing to pay $…, can you complete my job by this time…”

Further Information Nimrod/G: www.csse.monash.edu.au/~davida/nimrod.html Active Tools (Clustor): www.activetools.com

Closed systems