GridBench: A Tool for Benchmarking Grids

Slides:



Advertisements
Similar presentations
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Advertisements

A MapReduce Workflow System for Architecting Scientific Data Intensive Applications By Phuong Nguyen and Milton Halem phuong3 or 1.
C. Grimme, A. Papaspyrou Scheduling in C3-Grid AstroGrid-D Workshop Project: C3-Grid Collaborative Climate Community Data and Processing Grid Scheduling.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
GridRPC Sources / Credits: IRISA/IFSIC IRISA/INRIA Thierry Priol et. al papers.
® IBM Software Group © 2006 IBM Corporation Rational Software France Object-Oriented Analysis and Design with UML2 and Rational Software Modeler 04. Other.
Assignment 3 Using GRAM to Submit a Job to the Grid James Ruff Senior Western Carolina University Department of Mathematics and Computer Science.
Combining Static and Dynamic Data in Code Visualization David Eng Sable Research Group, McGill University PASTE 2002 Charleston, South Carolina November.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
OpenCL Introduction A TECHNICAL REVIEW LU OCT
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Tools and Utilities for parallel and serial codes in ENEA-GRID environment CRESCO Project: Salvatore Raia SubProject I.2 C.R. ENEA-Portici. 11/12/2007.
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
INFSO-RI Enabling Grids for E-sciencE Grid Applications -- Cyprus Contribution to EGEE Organization: HPCL, University Of Cyprus.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
4.2.1 Programming Models Technology drivers – Node count, scale of parallelism within the node – Heterogeneity – Complex memory hierarchies – Failure rates.
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
Master Worker Paradigm Support in Software Component Models Hinde Bouziane, Christian Pérez PARIS Research Team INRIA/IRISA Rennes ANR CIGC LEGO (ANR-05-CICG-11)
CSC 7600 Lecture 28 : Final Exam Review Spring 2010 HIGH PERFORMANCE COMPUTING: MODELS, METHODS, & MEANS FINAL EXAM REVIEW Daniel Kogler, Chirag Dekate.
George Tsouloupas University of Cyprus Task 2.3 GridBench ● 1 st Year Targets ● Background ● Prototype ● Problems and Issues ● What's Next.
Nguyen Tuan Anh. VN-Grid: Goals  Grid middleware (focus of this presentation)  Tuan Anh  Grid applications  Hoai.
Interactive Workflows Branislav Šimo, Ondrej Habala, Ladislav Hluchý Institute of Informatics, Slovak Academy of Sciences.
Connections to Other Packages The Cactus Team Albert Einstein Institute
INFSO-RI Enabling Grids for E-sciencE Charon Extension Layer. Modular environment for Grid jobs and applications management Jan.
Globus Grid Tutorial Part 2: Running Programs Across Multiple Resources.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Globus: A Report. Introduction What is Globus? Need for Globus. Goal of Globus Approach used by Globus: –Develop High level tools and basic technologies.
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
Open Grid Services for Earth Observation Pedro Gonçalves.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
Developing GRID Applications GRACE Project
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
Review of PARK Reflectometry Group 10/31/2007. Outline Goal Hardware target Software infrastructure PARK organization Use cases Park Components. GUI /
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
CNAF - 24 September 2004 EGEE SA-1 SPACI Activity Italo Epicoco.
NAREGI PSE with ACS S.Kawata 1, H.Usami 2, M.Yamada 3, Y.Miyahara 3, Y.Hayase 4 1 Utsunomiya University 2 National Institute of Informatics 3 FUJITSU Limited.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Enabling Grids for E-sciencE University of Perugia Computational Chemistry status report EGAAP Meeting – 21 rst April 2005 Athens, Greece.
Petr Škoda, Jakub Koza Astronomical Institute Academy of Sciences
Agent Teamwork Research Assistant
Simulation Production System
Applied Operating System Concepts
Introduction to Load Balancing:
Porting MM5 and BOLAM codes to the GRID
BDII Performance Tests
Sergio Fantinel, INFN LNL/PD
Introduction to Grid Technology
Ruslan Fomkin and Tore Risch Uppsala DataBase Laboratory
Software as Data Structure
CRESCO Project: Salvatore Raia
CompChem VO: User experience using MPI
NGS computation services: APIs and Parallel Jobs
An easier path? Customizing a “Global Solution”
gLite Information System
Liang Chen Advisor: Gagan Agrawal Computer Science & Engineering
Many-core Software Development Platforms
Class project by Piyush Ranjan Satapathy & Van Lepham
Leigh Grundhoefer Indiana University
Language Processors Application Domain – ideas concerning the behavior of a software. Execution Domain – Ideas implemented in Computer System. Semantic.
EGEE Middleware: gLite Information Systems (IS)
Wide Area Workload Management Work Package DATAGRID project
Resource and Service Management on the Grid
Overview of Workflows: Why Use Them?
Database System Concepts and Architecture
Production Manager Tools (New Architecture)
Introduction to the SHIWA Simulation Platform EGI User Forum,
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

GridBench: A Tool for Benchmarking Grids George Tsouloupas Marios Dikaiakos Dept. of Computer Science University of Cyprus {georget,mdd}@ucy.ac.cy

Introduction Utility of Grid Benchmarking: The experimental evaluation of Grid Infrastructures. Researcher: Characterization of factors of application and infrastructure performance; provide insight to properties of Grid Architectures; Study the dynamic nature of the Grid. User: Evaluate fitness of a collection of resources for running a specific application. Investigate/troubleshoot application performance (by comparison to benchmark runs)

Introduction (cont’d) Utility of Grid Benchmarking (cont’d) Developer: Evaluate fitness of a collection of resources for running a general class of applications, by looking into different programming modelsadigms or which type of middleware to use. Integrator/Administrator: Evaluate the employment of specific middleware Evaluate scheduling and resource allocation algorithms.

Benchmarking the Grid: A Hierarchical Framework Targets: Individual Resources (cluster nodes, Storage Elements) Sites (clusters, SMPs) Grid Constellations (i.e. multiple sites / VOs) Middleware (e.g. mpich-g2) Benchmarks: Micro-benchmarks –isolate basic performance characteristics Micro-kernel Benchmarks –synthetic codes Application Benchmarks –derived from real applications deployed on grids

A Simple Model VO SITE Storage NODE … Site

GridBench Design: Software Architecture Components: RSL/JDL Compiler (XML benchmark descriptions to job description languages) Orchestrator (benchmark execution, collection of results) Benchmark component (the benchmark executable) Monitoring component (collects monitoring information) Archive Database (benchmark result archive) Information provider (results to information services) Benchmark Definition GUI Benchmark Browser GUI

GridBench Design: Software Architecture

GridBench Design: Archival & Publication of Results Benchmark results are archived in a native-XML database (Apache Xindice) The Benchmark results are stored together with their GBDL definition. Results are meaningless without the specific parameters etc. that were used (Reference to) Monitoring data also archived. Comprehension/Analysis of results is (hopefully considerably) enhanced when combined with infrastructure monitoring during the run. Benchmark results related to a Site are published to the Site’s MDS for easy access by users and schedulers

Micro-benchmarks & Micro-kernels Micro-benchmarks at the Worker-node level EPWhetstone: embarrassingly parallel adaptation of the serial whetstone benchmark. BlasBench: evaluate serial performance of the BLAS routines. Micro-benchmarks at the Site level Bonnie++: Storage I/O performance MPPTest: MPI performance measurements Micro-benchmarks at the VO level MPPTest: MPI performance measurements (spanning sites) gb_ftb: File Transfer Benchmark Micro-kernels at the Site level High-Performance Linpack Selected kernels from the NAS Parallel Benchmarks Micro-kernels at the VO level Computationally Intensive Grid Benchmarks

GridBench Design: Meta-data GBDL Main Parts: Components Metrics Parameters Location Co/pre-requisites Monitors

Metrics representation Benchmark-level metrics Vs component-level metrics (e.g. CIGB completion time Vs EP component completion time) Single-value metrics <metric name="epwhetstone_IPS“ type="value"> <vector unit="MIPS">53</vector> </metric> Multi-value metrics <metric name="xfer-rate" type="list"> <vector unit="bps"> 2039430,2083930,1909830,2184750,...</vector> <vector unit="second" toffset="1055327287"> 0,10,20,30,...</vector>

Monitoring GBDL can contain monitoring definitions. <component name="data-transfer" ID="xfer01">...</component> <monitor type="RGMA" source="ccwp71.in2p3.fr:3306" query="select * from NetworkTCPThroughput where NMIdSource='adc0003.cern.ch' and NMIdDestination='ccwp7.in2p3.fr' <parameter name="begin">comp-begin="xfer01"<ameter> <parameter name="end">comp-end="xfer01"<ameter> </monitor> Implies that a monitoring client must be implemented/adapted (potentially several monitoring clients can be employed)

Compilation to RSL/JDL GBDL XML-based GBDL to Job Description Support for simple jobs can be through the use of simple templates. (executable, parameters and locations are transformed to simple RSL/JDL ‘scripts’) Most benchmarks need special command-line parameter formatting, or parameter files. -> ParameterHandler for each benchmark that needs it. Param handler compiler RSL JDL …

Compilation to RSL (example) GBDL <benchmark date= name="nbody"> <component id="A" name="nbody“ type="mpi"> <location type="single"> <resource cpucount="2” name="ce1.grid.ucy"/> name="ce2.grid.ucy"/> </location> <parameter name="executable" type="attribute"> /bin/nbody.exec </parameter> <parameter name="nparticles" type="value"> 1000 </parameter> </component> </benchmark> RSL +(&(resourceManagerContact= “ce1.grid.ucy") (label="subjob 0") (environment= (GLOBUS_DUROC_SUBJOB_INDEX 0)) (count=2) (arguments="-n 1000") (executable="/bin/nbody.exec" )) (&(resourceManagerContact= "ce2.grid.ucy ") (label="subjob 1") (GLOBUS_DUROC_SUBJOB_INDEX 1)) (arguments="-n 1000")

Experiments User Interface Compiler GridBench UI Monitor CE Benchmark Component Executable RSL GBDL GRAM Orchestrator std out GRAM std out CE Benchmark Component Executable Archiver SE Xindice

Results

WIP and Future Work Complete GBDL specification Revise Metric and Monitoring representations Implementation of more benchmarks.