The NorduGrid Information System Balázs Kónya GGF5 21-24 July, 2002, Edinburgh.

Slides:



Advertisements
Similar presentations
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Advertisements

FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
NorduGrid Grid Manager developed at NorduGrid project.
A conceptual model of grid resources and services Authors: Sergio Andreozzi Massimo Sgaravatto Cristina Vistoli Presenter: Sergio Andreozzi INFN-CNAF Bologna.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Swedish participation in DataGrid and NorduGrid Paula Eerola SWEGRID meeting,
Universität Dortmund Robotics Research Institute Information Technology Section Grid Metaschedulers An Overview and Up-to-date Solutions Christian.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
AustrianGrid, LCG & more Reinhard Bischof HPC-Seminar April 8 th 2005.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grid and High Energy Physics Paula Eerola Lunarc, Artist’s view on Grid, by Ursula Wilby, Sydsvenskan
Grid Computing, B. Wilkinson, 20046c.1 Globus III - Information Services.
Grids and Globus at BNL Presented by John Scott Leita.
The NorduGrid project: Using Globus toolkit for building Grid infrastructure presented by Aleksandr Konstantinov Mattias Ellert Aleksandr Konstantinov.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
The EDG Testbed Deployment Details The European DataGrid Project
The EU DataGrid – Information and Monitoring Services The European DataGrid Project Team
INFN-GRID Globus evaluation (WP 1) Massimo Sgaravatto INFN Padova for the INFN Globus group
Overview of the NorduGrid Information System Balázs Kónya 3 rd NorduGrid Workshop 23 May, 2002, Helsinki.
Introduction to NorduGrid ARC / Arto Teräs Slide 1(16) Introduction to NorduGrid ARC Arto Teräs Free and Open Source Software Developers' Meeting.
Grid Computing - AAU 14/ Grid Computing Josva Kleist Danish Center for Grid Computing
NeSC Apps Workshop July 20 th, 2002 Customizable command line tools for Grids Ian Kelley + Gabrielle Allen Max Planck Institute for Gravitational Physics.
A. Cavalli - F. Semeria INFN Experience With Globus GIS 1 A. Cavalli - F. Semeria INFN First INFN Grid Workshop Catania, 9-11 April 2001 INFN Experience.
NorduGrid Architecture and tools CHEP2003 – UCSD Anders Wäänänen
Computational grids and grids projects DSS,
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Towards a Universal Client for Grid Monitoring Systems Towards a Universal Client for Grid Monitoring Systems Design and Implementation of the Ovid Browser.
DataGrid WP1 Massimo Sgaravatto INFN Padova. WP1 (Grid Workload Management) Objective of the first DataGrid workpackage is (according to the project "Technical.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System on gLite middleware Vincent.
1 / 18 Federal University of Rio de Janeiro – COPPE/UFRJ Author : Wladimir S. Meyer – Doctorate Student Advisors : Jano Moreira de Souza – Ph.D. Milton.
Grid Workload Management Massimo Sgaravatto INFN Padova.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Quick Introduction to NorduGrid Oxana Smirnova 4 th Nordic LHC Workshop November 23, 2001, Stockholm.
Grid Middleware Tutorial / Grid Technologies IntroSlide 1 /14 Grid Technologies Intro Ivan Degtyarenko ivan.degtyarenko dog csc dot fi CSC – The Finnish.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Performance Improvements to BDII - Grid Information.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
INFSO-RI Enabling Grids for E-sciencE OSG-LCG Interoperability Activity Author: Laurence Field (CERN)
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
DataTAG is a project funded by the European Union DataTAG WP4 meeting, Bologna 29/07/2003 – n o 1 GLUE Schema - Status Report DataTAG WP4 meeting Bologna,
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
The NorduGrid toolkit user interface Mattias Ellert Presented at the 3 rd NorduGrid workshop, Helsinki,
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
The EU DataGrid – Information and Monitoring Services The European DataGrid Project Team
GIIS Implementation and Requirements F. Semeria INFN European Datagrid Conference Amsterdam, 7 March 2001.
April 4, 2002Atlas Testbed Workshop ATLAS Hierarchical MDS Server Patrick McGuigan.
Grid Workload Management (WP 1) Massimo Sgaravatto INFN Padova.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
FESR Trinacria Grid Virtual Laboratory gLite Information System Muoio Annamaria INFN - Catania gLite 3.0 Tutorial Trigrid Catania,
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
E-science grid facility for Europe and Latin America Updates on Information System Annamaria Muoio - INFN Tutorials for trainers 01/07/2008.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
NorduGrid's ARC: A Grid Solution for Decentralized Resources Oxana Smirnova (Lund University/CERN) for the NorduGrid collaboration ISGC 2005, Taiwan.
The EDG Testbed Deployment Details
Oxana Smirnova, Jakob Nielsen (Lund University/CERN)
The EU DataGrid – Information and Monitoring Services
A conceptual model of grid resources and services
EGEE Middleware: gLite Information Systems (IS)
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

The NorduGrid Information System Balázs Kónya GGF July, 2002, Edinburgh

23/07/ what is this talk about? representation of information technical implementation (LDAP,...) a working implementation of a Grid information model:

23/07/ ● Create a Grid infrastructure in Nordic countries ● Operate a production quality Testbed ● Expose the infrastructure to end-users of different scientific communities ● Survey current Grid technologies ● Pursue basic research on Grid Computing ● Develop Middleware Solutions NorduGrid Project

23/07/ NorduGrid Project ● operates a production Grid Testbed composed of clusters ● “An overview of a Grid Architecture for Scientific Computing” ● develops the missing Middleware pieces, plugs the holes of the Globus: ● job submission, monitoring (extended RSL) ● gridftp-based replacement of the GRAM (gridmanager) ● broker ● information model NorduGrid toolkit: extension of the Globus toolkit

23/07/ Helsinki Institute of Physics Lund University, Uppsala University, Stockholm University, KTH Oslo University, Bergen University Copenhagen University: Niels Bohr Institute, Research Center COM, DIKU NorduGrid: participants

23/07/ Grid Information System a)resource characterization / description b)resource discovery c)monitoring of services / resources Resource & Job Management Data Management Information System + security The nerve system of the Grid information is a critical resource on the Grid

23/07/ Why is it so complicated? ● large number of resources => scalability ● diverse heterogeneous resources => characterization? ● decentralized, automatic maintenance ● efficient access to dynamic data ● quality and reliability of information => fake information can 'kill' the Grid

23/07/ The challenge Grid users always want prompt access to all the information inevitable compromise: load on the Grid up-to-dateness ● try to avoid continuous monitoring ● generate information on demand (pull model) ● apply elaborate caching and keep track of validity of the data (ttl) ● organize “information producers” into some kind of topology (i.e. hierarchy)

23/07/ Solution provided by the Globus Toolkit ● Monitoring and Discovery Services (MDS 2.1) ● comes as a part of Globus Toolkit TM version 2.0 ● “de facto” standard information system ● OpenLDAP based implementation ● general framework for creating Grid Information Systems ● information model (MDS schema) ● providers (populate the schema) ● GRIS (Grid Resource Information Service), LDAP backend, presents the information for the consumers ● GIIS (Grid Index Information Service), LDAP backend, link together GRISes, build hierarchy, caching

23/07/ limitations of MDS 2.1 ● not suitable for describing clusters ● the schema simply forgot about clusters ● single machine, “host” based representation ● each node of a cluster needs to run a GRIS ● users are not interested in most of the MDS attributes ● cluster management/batch system information is badly represented (hidden) ● insufficient job information ● buggy providers ● overcomplicated schema ● MDS 2.1 has never been widely deployed

23/07/ demand for a better model ● at the end of 2001 NorduGrid aimed to have a production Testbed with a usable and reliable information system: ● within a finite amount of time ● based upon the Globus MDS framework ● with a new cluster-based model ● simple & natural mirror of our Testbed architecture ● on the other hand at that time ● no GridForum standards, no available common Grid Information Model ● unrelated, uncoordinated efforts within the GridForum ● preliminary theoretical results won't help you to run a Testbed

23/07/ recipe for creating your own MDS based information system 1) Model your own Testbed architecture, define the Grid objects. Formulate this in an LDAP schema 2) Structure your objects (LDAP entries) into a GRIS Tree 3) Implement providers to populate the LDAP entries 4) Create a topology of GRISes -> GIIS hierarchy

23/07/ technical overview NorduGrid Information System: ● built upon the MDS 2.1 LDAP backends ● the NorduGrid schema gives a natural representation of our resources ● clusters (queues, jobs, users) ● storage elements ● replica catalog ● efficient providers fill the entries of the schema ● each “grid unit” runs its own GRIS ● GRISes are organized into a dynamic country- based GIIS hierarchy

23/07/ DIT of a cluster cluster queue jobs users job-01 job-02 job-03 user-01 user-02 queue jobs users job-04 job-05 user-02 user-03 user-01

23/07/ cluster entry

23/07/ DIT of a cluster cluster queue jobs users job-01 job-02 job-03 user-01 user-02 queue jobs users job-04 job-05 user-02 user-03 user-01

23/07/ queue entry

23/07/ DIT of a cluster cluster queue jobs users job-01 job-02 job-03 user-01 user-02 queue jobs users job-04 job-05 user-02 user-03 user-01

23/07/ job entry job status monitoring = information system query

23/07/ another job entry - the job entry is generated on the execution cluster - when the job is completed and the results are retrieved the job disappears from the information system

23/07/ DIT of a cluster cluster queue jobs users job-01 job-02 job-03 user-01 user-02 queue jobs users job-04 job-05 user-02 user-03 user-01

23/07/ personalized information user based information is essential on the Grid: ● users are not really interested in the total number of cpus of a cluster, but how many of those are available for them! ● number of queuing jobs are irrelevant if the submission gets immediately executed ● instead of total disk space the user's quota is interesting nordugrid-authuser objectclass ● freecpus ● diskspace ● queuelength

23/07/ user entry

23/07/ available information: SE, RC (preliminary) ● Storage Element ● se-baseurl gsiftp://bambi.quark.lu.se:2811/gamma/scratch ● se-freespace ● se-authuser ● Replica Catalog ● rc-baseurl ldap://grid.uio.no:389/rc=NorduGrid,dc=nordugrid,dc=org ● rc-authuser these objectclasses are 'under construction' they will gain real importance with the NorduGrid Storage Manager

23/07/ Hierarchy Hierarchy of GRISes/GIISes

23/07/ interfaces ● The information system speaks LDAP, easy to interface: ● users with command line ldapsearch ● ng-userinterface (submission, brokering, job monitoring) through LDAP C API ● Load Monitor, MDS browser through PHP LDAP API

23/07/ broker & jobsubmission 1)searches through the NorduGrid Testbed for available clusters 2)loops through all the clusters and selects those queues (possible targets) where: ● the user is authorized to run ● the requested software (RuntimeEnvironment) is available ● the cluster & queue parameters match the job requests 3)selects a job destination from the matching targets a)randomly selects among the free resources (where user-freecpus >0) b)in case there are no free matching resources some of the “load” attributes (i.e. user-queuelength) are taken into account

23/07/ a brokering session [konyab]$./ ngsub -d 1 -f ~/gm_test/ui_sleep.rsl User subject name: /O=Grid/O=NorduGrid/OU=quark.lu.se/CN=Balazs Konya Remaining proxy lifetime: 5 hours, 1 minute Initializing LDAP connection to grid.nbi.dk:2135 Initializing LDAP query to grid.nbi.dk:2135 Getting LDAP query results from grid.nbi.dk:2135 Initializing LDAP connection to grid.uio.no Initializing LDAP connection to grid.fi.uib.no Initializing LDAP connection to fire.ii.uib.no Initializing LDAP connection to grid.nbi.dk Initializing LDAP connection to ns1.nordita.dk Initializing LDAP connection to hepax1.nbi.dk Initializing LDAP connection to lscf.nbi.dk Initializing LDAP connection to grid.tsl.uu.se Initializing LDAP connection to grendel.it.uu.se Initializing LDAP connection to grid.quark.lu.se Initializing LDAP query to grid.uio.no Initializing LDAP query to grid.fi.uib.no Initializing LDAP query to fire.ii.uib.no Initializing LDAP query to grid.nbi.dk Initializing LDAP query to ns1.nordita.dk Initializing LDAP query to hepax1.nbi.dk Initializing LDAP query to lscf.nbi.dk Initializing LDAP query to grid.tsl.uu.se Initializing LDAP query to grendel.it.uu.se Initializing LDAP query to grid.quark.lu.se Getting LDAP query results from grid.uio.no Getting LDAP query results from grid.fi.uib.no Getting LDAP query results from fire.ii.uib.no Getting LDAP query results from grid.nbi.dk Getting LDAP query results from ns1.nordita.dk Getting LDAP query results from hepax1.nbi.dk Getting LDAP query results from lscf.nbi.dk Getting LDAP query results from grid.tsl.uu.se Getting LDAP query results from grendel.it.uu.se Getting LDAP query results from grid.quark.lu.se Cluster: Oslo Grid Cluster (grid.uio.no) Queue: default Queue accepted as possible submission target Cluster: Oslo Grid Cluster (grid.uio.no) Queue: veryshort Queue rejected because it does not match the XRSL specification Cluster: Bergen Grid Cluster (grid.fi.uib.no) Queue: default Queue accepted as possible submission target Cluster: Parallab IBM Cluster (fire.ii.uib.no) Queue: dque Queue rejected because user not authorized Cluster: Copenhagen Grid Cluster (grid.nbi.dk) Queue: long Queue accepted as possible submission target Cluster: Copenhagen Grid Cluster (grid.nbi.dk) Queue: short Queue accepted as possible submission target Cluster: Copenhagen Nordita Cluster (ns1.nordita.dk) Queue: p-long Queue rejected because it does not match the XRSL specification Cluster: Copenhagen Nordita Cluster (ns1.nordita.dk) Queue: p-medium Queue rejected because it does not match the XRSL specification Cluster: Copenhagen Nordita Cluster (ns1.nordita.dk) Queue: p-short Queue rejected due to status: inactive Cluster: Copenhagen Alpha Linux Machine (hepax1.nbi.dk) Queue: long Queue rejected due to status: Cluster: Copenhagen Alpha Linux Machine (hepax1.nbi.dk) Queue: short Queue rejected due to status: Cluster: Copenhagen LSCF Cluster (lscf.nbi.dk) Queue: gridlong Queue rejected due to status: Cluster: Copenhagen LSCF Cluster (lscf.nbi.dk) Queue: gridshort Queue rejected due to status: Cluster: Uppsala Grid Cluster (grid.tsl.uu.se) Queue: default Queue accepted as possible submission target Cluster: Uppsala Grendel Cluster (grendel.it.uu.se) Queue: workq Queue accepted as possible submission target Cluster: Lund Grid Cluster (grid.quark.lu.se) Queue: pc Queue accepted as possible submission target Cluster: Lund Grid Cluster (grid.quark.lu.se) Queue: pclong Queue rejected because it does not match the XRSL specification Uppsala Grendel Cluster (grendel.it.uu.se) selected queue workq selected Job submitted with jobid grendel.it.uu.se:2119/jobmanager-ng/

23/07/ Summary ● NorduGrid Testbed runs over an MDS based, hierarchically distributed Information System ● We have designed and implemented an information model which ● naturally maps our architecture ● contains job information ● cluster oriented ● provides user-based information ● simple, functional and extensible ● Our system continuously evolves as new sites & users provide their feedback

23/07/ Future ● our work is not “THE” information model, but can serve as a good starting point ● The Grid needs a common information model ● without a common schema the futureTestbeds of different Grid projects will not be able to talk to each other! ● The GGF should coordinate these efforts ● we hope that our experience can contribute to this challenge

23/07/ The NorduGrid core team : Mattias Ellert Aleksandr Konstantinov Balázs Kónya Jakob Langgaard Nielsen Oxana Smirnova Anders Wäänänen