The EU DataGrid – Information and Monitoring Services

Slides:



Advertisements
Similar presentations
21 Sep 2005LCG's R-GMA Applications R-GMA and LCG Steve Fisher & Antony Wilson.
Advertisements

Grid Application Builders Teach In31/01/02Antony Wilson Information & Monitoring Services WP3.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Directory & Naming Services CS-328 Dick Steflik. A Directory.
Grid Computing, B. Wilkinson, 20046c.1 Globus III - Information Services.
The EU DataGrid – Information and Monitoring Services The European DataGrid Project Team
A Java Based Prototype Grid User Interface Janice Drohan Project Supervisor: Prof. Peter Clarke.
WP3 RGMA Deployment Laurence Field / RAL Steve Fisher / RAL.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Middleware: gLite Information Systems (IS) EGEE Tutorial 23 rd APAN Meeting,
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Information System (IS) Valeria Ardizzone.
A. Cavalli - F. Semeria INFN Experience With Globus GIS 1 A. Cavalli - F. Semeria INFN First INFN Grid Workshop Catania, 9-11 April 2001 INFN Experience.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System on gLite middleware Vincent.
Application code Registry 1 Alignment of R-GMA with developments in the Open Grid Services Architecture (OGSA) is advancing. The existing Servlets and.
GLite Information System(s) Antonio Juan Rubio Montero CIEMAT 10 th EELA Tutorial. Madrid, May 7 th -11 th,2007.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
An information and monitoring system for static and dynamic information about grid resources, applications, networks … RDBMS Servlet aware of API during.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
EGEE is a project funded by the European Union under contract IST R-GMA: Production Services for Information and Monitoring in the Grid John.
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
7. Grid Computing Systems and Resource Management
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
E-infrastructure shared between Europe and Latin America gLite Information System(s) Manuel Rubio del Solar CETA-CIEMAT EELA Tutorial, Mérida,
The EU DataGrid – Information and Monitoring Services The European DataGrid Project Team
EGEE is a project funded by the European Union under contract IST Information Systems Flavia Donno Section Leader for LCG Experiment Integration.
GIIS Implementation and Requirements F. Semeria INFN European Datagrid Conference Amsterdam, 7 March 2001.
April 4, 2002Atlas Testbed Workshop ATLAS Hierarchical MDS Server Patrick McGuigan.
EGEE is a project funded by the European Union under contract IST Information and Monitoring Services within a Grid R-GMA (Relational Grid.
FESR Trinacria Grid Virtual Laboratory gLite Information System Muoio Annamaria INFN - Catania gLite 3.0 Tutorial Trigrid Catania,
The AstroGrid-D Information Service Stellaris A central grid component to store, manage and transform metadata - and connect to the VO!
E-science grid facility for Europe and Latin America Updates on Information System Annamaria Muoio - INFN Tutorials for trainers 01/07/2008.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGEE is a project funded by the European Union under contract INFSO-RI DGAS Grid accounting L.Gaido on behalf of A.Guarise LCG Workshop November.
OGSA Information Abdeslem Djaoui OGSA Information, OGSA-WG #11
The Information System in gLite middleware
The Operations Portal and the Grid Operations Interoperability
The EDG Testbed Deployment Details
gLite Information System
Classic Storage Element
Introduction to LDAP Frank A. Kuse.
gLite Information System(s)
Practical: The Information Systems
BDII Performance Tests
R-GMA as an example of a generic framework for information exchange
Globus —— Toolkits for Grid Computing
The Information System in gLite
Middleware independent Information Service
Resource monitoring and discovery in OGSA
Sergio Fantinel, INFN LNL/PD
Information System Virginia Martín-Rubio Pascual
gLite Information System
Grid Metadata Management
SAGA API for gLite Service Discovery
gLite Information System
A conceptual model of grid resources and services
gLite Information System(s)
The Globus Toolkit™: Information Services
Author: Laurence Field (CERN)
Report on GLUE activities 5th EU-DataGRID Conference
Introduction to Name and Directory Services
gLite Information System overview
WP7 Network Monitoring Schemata
EGEE Middleware: gLite Information Systems (IS)
Information and Monitoring System
RELATIONAL GRID MONITORING ARCHITECHTURE
gLite Information System
The gLite Information System
Information System (BDII)
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

The EU DataGrid – Information and Monitoring Services The European DataGrid Project Team http://www.eu-datagrid.org

Aim The aim of the Information and Monitoring Service is to deliver a flexible infrastructure that provides information on both The EU DataGrid itself Mainly for the middleware packages Grid applications For users The current system is based on MDS which only offers the first service - to the middleware. The user may however query it to understand the status of the DataGrid

Overview of Talk Requirements of an information and monitoring service Globus MDS (Metacomputing Directory Service or Monitoring and Discovery Service as it is now called) OpenLDAP, a hierarchical database EDG GRIS/GIIS hierarchy EDG information providers Searching the information system R-GMA (Relational Grid Monitoring Architecture) A relational implementation of the Global Grid Forums GMA

Some Requirements of a Grid Information & Monitoring Service The system must be able to cope with nodes in a distributed environment Dynamic addition and deletion of information providers should be supported It must have a security system able to address the access to information at a fine level of granularity It must be able to work well on an unreliable Wide Area Network (WAN) The system must allow new data types to be defined

Monitoring and Discovery Services Globus MDS Monitoring and Discovery Services

LDAP attributes EDG currently uses Globus MDS which is built on OpenLDAP A schema describes the attributes and the types of the attributes associated with data objects Example - some attributes of SiteInfo: siteName: RALDEV sysAdminContact: grid.sysadmin@rl.ac.uk userSupportContact: grid.support@rl.ac.uk siteSecurityContact: grid.security@rl.ac.uk dataGridVersion: 1.2 InstallationDate: 20020704142800Z This is an introduction to some LDAP terminology These terms will be used, put into context in the next few slides

LDAP hierarchy Lightweight Directory Assess Protocol (LDAP) offers a hierarchical view of information The objects are arranged in a Directory Information Tree (DIT) One or more attributes represent the Relative Distinguished Name (RDN) An object is identified by its Distinguished name This is its RDN with the Distinguished name of its parent

RDNs and DNs RDN DN status supported protocols SE site SE Status seId=dev02.hepgrid.clrc.ac.uk Status in=status Protocols seProtocol=gridftp seProtocol=rfio seProtocol=file DN Site Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid SE seId=dev02.hepgrid.clrc.ac.uk,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid Status in=status,seId=dev02.hepgrid.clrc.ac.uk,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid Protocols seProtocol=gridftp, seId=dev02.hepgrid.clrc.ac.uk,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid seProtocol=rfio, seId=dev02.hepgrid.clrc.ac.uk,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid seProtocol=file, seId=dev02.hepgrid.clrc.ac.uk,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid status supported protocols SE site Each of the icon represents a data object published by an information provider. In some cases one information provider publishes several data objects. Each site provides information about itself, its computing elements, its storage elements and network monitoring information We will now have a look at some example data for these objects.

MDS GRISs & GIISs Information providers are scripts which when invoked by the LDAP server make available the desired information Information is cached by the server to improve performance LDAP was designed for use with relatively static data, not rapidly changing data Within MDS the EDG information providers are invoked by a local LDAP server, the Grid Resource Information Server (GRIS) “Aggregate directories”, Grid Information Index Servers (GIIS), are used to group resources The GRISs use soft state registration to register with one or more GIISs The GIIS can then act as a single point of contact for a number of resources A GIIS may represent a site, country, virtual organization, etc. In turn a GIIS may register with another GIIS

EDG GRIS/GIIS Hierarchy GRID Data siteA siteD siteC siteB countryA countryB datagrid information providers There is a top level datagrid GIIS to which all of the country GIISs register Each country has a GIIS to which all of the site GIISs register Each Site has a Grid Information Index Server (GIIS) which acts as a single point of contact for all of the sites resources. The GRISs register with their site GIIS <RETURN> We will now look at the EDG information providers Information providers publish information to a local LDAP server known as a Grid Resource Information Server (GRIS)

EDG Information Providers The EDG have produced information providers: Site information The Computing Element The Storage Element Network Monitoring All of the EDG data objects are dynamic, they have a time stamp and a time to live (used by the cache mechanism) associated with them

EDG Information Providers & the Directory Information Tree network information between this and other sites CE storage elements that are close (not necessarily at the same site) status supported protocols file statistics SE site information site Each of the icon represents a data object published by an information provider. In some cases one information provider publishes several data objects. Each site provides information about itself, its computing elements, its storage elements and network monitoring information We will now have a look at some example data for these objects. Note that there are 2 hierarchies: The GIIS/GRIS structure The DIT

Siteinfo in=siteinfo,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid objectClass: SiteInfo objectClass: DataGridTop objectClass: DynamicObject siteName: RALDEV sysAdminContact: grid.sysadmin@rl.ac.uk userSupportContact: grid.support@rl.ac.uk siteSecurityContact: grid.security@rl.ac.uk dataGridVersion: 1.2 installationDate: 20020704142800Z This is an example of the information published about a site. As already mentioned the schema defines the attributes and their types that are associated with an object.

Storage Element seId=dev02.hepgrid.clrc.ac.uk,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid objectClass: StorageElement objectClass: DataGridTop objectClass: DynamicObject SEId: dev02.hepgrid.clrc.ac.uk CloseCE: dev01.hepgrid.clrc.ac.uk:2119/jobmanager-pbs-M SEtypearchitecture: disk SEsize: 13177 SEResourceContactString: grid.support@rl.ac.uk SEvo: wpsix, :/flatfiles/05/wpsix This is an example of the information published about a storage element <<NB only go through a couple of attributes >>

Storage Element Protocols seProtocol=gridftp, seId=dev02.hepgrid.clrc.ac.uk,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid objectClass: StorageElementProtocol objectClass: DataGridTop objectClass: DynamicObject SEId: dev02.hepgrid.clrc.ac.uk SEProtocol: gridftp Port: 2811 seProtocol=rfio, seId=dev02.hepgrid.clrc.ac.uk,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid SEProtocol: rfio Port: 3147 seProtocol=file, seId=dev02.hepgrid.clrc.ac.uk,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid SEProtocol: file This is an example of the information published by the storage element about the protocols that it supports <<NB only go through a couple of attributes >>

Storage Element Status in=status,seId=dev02.hepgrid.clrc.ac.uk,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid objectClass: StorageElementStatus objectClass: DataGridTop objectClass: DynamicObject SEfreespace: 12031 SEId: dev02.hepgrid.clrc.ac.uk This is an example of the information published by the storage element about its current status <<NB only go through a couple of attributes >>

Map Centre An easy way to browse the data available in the Information and Monitoring Service is to use Map Centre http://ccwp7.in2p3.fr/mapcenter/datagrid-d/ We can access the data in a number of ways You will get to use map centre in one of the tutorials

Querying the Information & Monitoring Service Queries can be posed to the current Information and Monitoring Service using LDAP search commands: $ldapsearch\ -x\ -H ldap://lxshare0225.cern.ch:2135\ -b 'Mds-Vo-name=datagrid,o=grid\ 'objectclass=StorageElment‘\ seId SEsize \ -s base|one|sub “simple” authentication uniform resource identifier base distinguished name for search filter attributes to be returned scope of the search specifying just the base object, one-level or the complete subtree

Querying the GRIS/GIIS Hierarchy Mds-Vo-name=datagrid,o=grid This will look at all the data GRID Data Mds-Vo-name =siteA Mds-Vo-name =siteD Mds-Vo-name =siteC Mds-Vo-name =siteB Mds-Vo-name =countryA Mds-Vo-name =countryB Mds-Vo-name =datagrid Mds-Vo-name=siteB, Mds-Vo-name=countryA,Mds-Vo-name=datagrid,o=grid This will look at all the data from siteB Mds-Vo-name=countryA,o=grid This will look at all the data from countryA Mds-Vo-name=siteB,Mds-Vo-name=countryA,o=grid This will look at all the data from siteB MDS uses a soft state registration and a hierarchy of grid information index servers, GIISs. The current configuration is that we have a GIIS for each site which registers with a GIIS for the country. The country GIISs are registered with the top level datagrid GIIS The information you can see and the way you access specific data depends on where you view the information form <<RETURN>> If for example you wish to see information about site b you could query the server directly using the site name in your search If you wished to view all of the information about a country you could query the country GIIS using “Mds-Vo-name=countryA,o=grid” You could also only retrieve the information from site b but this time you would have to refer to it as “Vo-name=siteB, Vo-name=countryA,o=grid” When querying the top level GIIS you have access to all of the published information To see only the information from site b this time you must refer to it as “Vo-name=siteB, Vo-name=countryA, Vo-name=datagrid, o=grid” Mds-Vo-name=siteB,o=grid This will look at all the data from siteB

Relational - Grid Monitoring Architecture R-GMA Relational - Grid Monitoring Architecture

R-GMA LDAP does not allow queries over different objects I.e. you can only query based on attributes of an object (no “Joins”) MDS is not designed for applications to publish their own data It has relatively static descriptions of the data being published – the schema. R-GMA is a relational implementation of the Grid Monitoring Architecture (GMA) of the GGF The relational model is very flexible and allows complex queries which make use of information in multiple objects R-GMA provides a means for anyone to publish any information on the Grid – can also do the job of the current MDS It is highly dynamic – with new Producers of information being noticed by existing Consumers

R-GMA The Consumer Producer Model GRID Data GRID Data Producer Use the Grid Monitoring Architecture from Global Grid Forum A relational implementation Applied to both information and monitoring Creates impression that you have one RDBMS per Virtual Organization Registry Command flow Information flow Consumer

Relational Approach Not a general distributed RDBMS system, but a way to use the relational model in a distributed environment where ACID properties are not generally important. Producers announce: SQL “CREATE TABLE” publish: SQL “INSERT” Consumers collect: SQL “SELECT” The mediator is a component within the Consumer which locates one or more Producers and combines the information as necessary Atomicity, Consistency, Isolation, and Durability

Summary The current information system is Globus MDS This uses LDAP as its underlying data interface The are a number of EDG information providers The new Information and Monitoring Service is R-GMA Early version in 1.2 It can support complex SQL queries It can return a single result set or stream data Grid applications can publish data

The End Information and Monitoring Services http://hepunx.rl.ac.uk/edg/wp3/