The Information System in gLite middleware



The Information System in gLite middleware Dott.ssa Valeria Ardizzone Consorzio COMETA

Outline
- Overview of the LCG Information System
- Differences between Top and Site BDII
- Generic Information Provider
- BDII update process

Information System
- What is it?
  - A system that collects information on the state of resources
- Why?
  - To discover the resources of the grid and their nature
  - To give whoever manages the workload the data needed to do it more efficiently
  - To check the health status of resources
- How?
  - By monitoring the state of resources locally and publishing the right information to the information system
  - By adopting a data model that MUST be well known to all components that want to access the monitored information
  - By using different approaches, examined in the next slides

Design of Information Systems
- About measures
  - Measures SHOULD be relevant to what the users want to achieve
  - Measures SHOULD be accurate enough to be considered valid
  - The rate at which measures are taken MUST be adequate for their use
- About the gathering of information
  - How and when should collected info be published?
  - Where should collected info be stored?
  - How long should this info be kept in storage?
- Querying the Information System
  - Where should queries be sent to get a response?
  - What syntax and protocols have to be adopted to make queries?
  - What data model is adopted to describe resources?
- Security
  - Who is allowed to execute queries against the IS, and which types of queries?
  - Management of user rights and credentials

Adopted Information Systems
- The BDII (Berkeley DB Information Index)
  - Adopted in the LCG middleware as the Information System provider
  - It is an evolution of the Globus Monitoring and Discovery Service (MDS)
  - LCG-2 currently adopts BDII as its Information System
  - It is based on Lightweight Directory Access Protocol (LDAP) servers
- The Relational Grid Monitoring Architecture (R-GMA)
  - It is a relational implementation of the Grid Monitoring Architecture (GMA) standardized by the Global Grid Forum (GGF)
  - It is strongly Web Services oriented
  - It will be adopted by the next releases of the gLite middleware

LCG Information System

LCG Information System
- LCG adopts a combination of solutions
- Globus MDS
  - At the lowest level of the information system
  - Used to discover and monitor resources and to publish information
  - Grid Security Infrastructure (GSI) credentials
  - Caching
- BDII
  - At the highest level of the system
  - Introduced because MDS had scalability problems
  - Used by the Resource Broker for the matchmaking process
  - Can be configured by each VO
  - Queries the underlying systems periodically (every 2 minutes)
- Hierarchical system
  - Information is collected at the leaves of a hierarchical tree and travels towards the root
  - Clients can query the tree at every level
  - The higher the level queried, the older the information obtained

Collecting Information
- Information is gathered at different levels
- Lower level: Grid Resource Information Server (GRIS)
  - Collects information on the state of a given resource
  - One GRIS on top of each resource
  - A set of scripts and sensors that extract useful info about the resource
- Medium level: Grid Index Information Server (GIIS)
  - Collects information on the resources of a given site
  - One GIIS for each site
- Higher level: BDII
  - Collects information on the resources of a given VO
  - One BDII for each VO (suggested solution)
- Way of collecting info
  - Pull model (higher-level servers periodically query lower-level servers)
  - LDAP query model
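The pull model and its effect on the age of information can be illustrated with a small simulation. This is only a sketch: the server names, the 30 and 60 second periods, and the rule that higher levels pull before lower levels refresh are assumptions made for illustration (only the 2-minute BDII period comes from the slides), and real servers exchange LDAP entries rather than timestamps.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class InfoServer:
    """A GRIS, GIIS or BDII that caches what it last collected."""
    name: str
    period: int  # seconds between two collections (illustrative values)
    children: List["InfoServer"] = field(default_factory=list)
    cache: Dict[str, int] = field(default_factory=dict)  # resource -> time measured

    def refresh(self, now: int) -> None:
        if now % self.period != 0:
            return  # not yet time for the next collection
        if not self.children:
            # Leaf (GRIS): take a fresh measurement of the local resource.
            self.cache = {self.name: now}
        else:
            # Higher level: pull (copy) whatever the lower level currently caches.
            pulled: Dict[str, int] = {}
            for child in self.children:
                pulled.update(child.cache)
            self.cache = pulled

    def ages(self, now: int) -> Dict[str, int]:
        """Age in seconds of every cached measurement."""
        return {res: now - measured for res, measured in self.cache.items()}


def simulate(until: int, step: int = 30) -> Dict[str, Dict[str, int]]:
    gris = InfoServer("ce.example.org", period=30)            # hypothetical resource
    giis = InfoServer("site-giis", period=60, children=[gris])
    bdii = InfoServer("vo-bdii", period=120, children=[giis])
    for now in range(0, until + 1, step):
        # Higher levels pull first, so they see what was cached before this tick.
        for server in (bdii, giis, gris):
            server.refresh(now)
    return {s.name: s.ages(until) for s in (gris, giis, bdii)}


if __name__ == "__main__":
    print(simulate(150))
```

With these assumed periods, the same measurement is progressively older at the GIIS and at the BDII than at the GRIS, which is the behaviour the slides describe for queries made higher in the tree.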

Globus MDS (The past)
- Globus Monitoring and Discovery Service (MDS)
  - It is a hierarchical system
  - Based on LDAP servers
  - GRISes are the leaves of the tree
  - GIISes are the intermediate nodes of the tree
  - The user can query the system at every level
  - The higher the information is in the tree, the older it is
- Grid Resource Information Service (GRIS)
  - One for each grid resource (CE or SE)
  - Collects info on that resource
  - Static or dynamic info
  - Uses measurement techniques (such as sensors)
- Grid Index Information Service (GIIS)
  - One for each site
  - Collects info from the GRISes described above
  - Caches info according to its validity time
  - Queries those GRISes, or other GIISes, when needed

Berkeley Database Information Index (BDII)
- The Berkeley Database Information Index (BDII)
  - Developed within the context of the LCG project
  - Solves the instability problems that MDS shows when the number of sites grows too large
  - Sits on top of the site GIISes
  - One for each VO
  - Centralized system
  - Three levels of hierarchy
  - Accessed by the Workload Management System
- Way of working
  - One GRIS for each resource
  - One GIIS for each site, collecting info from the GRISes below it
  - One BDII for a given VO, collecting information from the GIISes below it
  - Two LDAP servers, one for write access and one for read access
  - Every two minutes a cron job runs a script that collects info from a list of site GIISes
  - The list of GIISes is kept in the BDII configuration file

Berkeley Database Information Index (BDII)
- Every node (except the UI, BDII and WNs) runs a GRIS service to publish its information
- One node at every site collects the data of all the site GRISes and publishes it through a site BDII, installed by default on the CE
- The top BDII collects the data of all site BDIIs
- Users can run a set of commands to query the top BDII
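Because the BDII is an LDAP server, one generic way to query it is the standard OpenLDAP `ldapsearch` client. The helper below only builds the command line and does not run it. The host name is hypothetical, port 2170 is assumed here as the conventional BDII read port, and the base `mds-vo-name=local,o=grid` is taken from the SiteInfo example later in these slides; check all three against the actual deployment.

```python
import shlex
from typing import List, Sequence


def ldapsearch_argv(host: str,
                    port: int = 2170,                         # assumed BDII read port
                    base: str = "mds-vo-name=local,o=grid",   # assumed search base
                    ldap_filter: str = "(objectClass=*)",
                    attributes: Sequence[str] = ()) -> List[str]:
    """Build an anonymous ldapsearch command line for a BDII."""
    return [
        "ldapsearch",
        "-x",                              # simple (anonymous) authentication
        "-LLL",                            # plain LDIF output, no comments
        "-H", f"ldap://{host}:{port}",     # server URI
        "-b", base,                        # search base
        ldap_filter,
        *attributes,                       # optional list of attributes to return
    ]


if __name__ == "__main__":
    argv = ldapsearch_argv("bdii.example.org",
                           ldap_filter="(objectClass=SiteInfo)",
                           attributes=["siteName"])
    print(shlex.join(argv))
```

Returning an argument list rather than a shell string keeps the filter, which contains parentheses, safe to pass to `subprocess.run` without extra quoting.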

GLUE Schema

GLUE Schema
- Grid Laboratory Uniform Environment (GLUE) Schema
  - A data model that describes information on grid resources (static and dynamic) in a meaningful way
  - The result of a collaboration between the EU-DataTAG and iVDGL projects
  - EGEE, NorduGrid, LCG and Grid3/OSG contributed to the definition of the schema
- XML Schema
  - The GLUE Schema is now being mapped to an XML representation
  - http://infnforge.cnaf.infn.it/glueinfomodel/Spec/V12/R1

Site Element

Cluster Element

Computing Element

LCG Information System Hierarchy
[Diagram: the top-level BDIIs (BDII-A, BDII-B, BDII-C) collect from the site BDIIs of Site 1, Site 2 and Site 3; each site BDII collects from the local GRISes running on that site's CE, SE and RB nodes.]

Top BDII vs Site BDII
- Site BDII
  - By default it is installed on the CE
  - It collects the data of all the site GRISes (for example SE, RB, LFC, etc.)
  - The name of the service is bdii
- Top BDII
  - It collects the data of all site BDIIs (CE1, CE2, ..., CEn)
  - It gives the RB all the information needed to match and dispatch users' jobs
  - It can run on the same machine as the RB, which makes responses faster

Generic Information Provider (GIP)
The Generic Information Provider, GIP, is a highly configurable information provider that separates static from dynamic information. It can be used to produce any kind of information for use with LDAP-based grid information systems. An information provider in its simplest form is a script that prints an LDIF file to standard output. The LDIF file should conform to the schema used in the GIP. The problem is with dynamic information: usually only a few attributes have to be found dynamically. The idea of the GIP is to use dynamic plug-ins to obtain these values and a common framework for everything else.

Generic Information Provider (GIP)
The GIP script reads in all the LDIF files from the static_dir and runs all the providers in the plugin_dir. It then runs any dynamic plug-ins found in the plugin_dir to obtain the dynamic values. It prints the LDIF to standard output, using any dynamic values obtained from the plug-ins. Plug-ins and providers should not be put directly into the directory; instead, a symbolic link or wrapper script should be used.
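The separation between static LDIF and dynamic plug-in output described above can be sketched as a merge of two sets of entries. This is not the GIP code: the function names are hypothetical, the parser handles only simple LDIF (no folded lines, no base64 values), the host name and attribute values are invented, and the rule that plug-ins may only override entries that already exist in the static data is an assumption.

```python
from typing import Dict, List

Entries = Dict[str, Dict[str, List[str]]]  # dn -> attribute -> values


def parse_ldif(text: str) -> Entries:
    """Parse simple LDIF text into {dn: {attribute: [values]}}."""
    entries: Entries = {}
    dn = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            dn = None  # a blank line ends the current entry
            continue
        if line.startswith("#"):
            continue   # LDIF comment
        key, _, value = line.partition(":")
        key, value = key.strip(), value.strip()
        if key.lower() == "dn":
            dn = value
            entries.setdefault(dn, {})
        elif dn is not None:
            entries[dn].setdefault(key, []).append(value)
    return entries


def merge_dynamic(static: Entries, dynamic: Entries) -> Entries:
    """Return the static entries with attributes overridden by plug-in output."""
    merged = {dn: {a: list(v) for a, v in attrs.items()} for dn, attrs in static.items()}
    for dn, attrs in dynamic.items():
        if dn not in merged:
            continue  # assumption: plug-ins cannot create new entries
        for attr, values in attrs.items():
            merged[dn][attr] = list(values)
    return merged


def to_ldif(entries: Entries) -> str:
    """Serialize the entries back to LDIF, one blank line between entries."""
    blocks = []
    for dn, attrs in entries.items():
        lines = [f"dn: {dn}"]
        for attr, values in attrs.items():
            lines.extend(f"{attr}: {value}" for value in values)
        blocks.append("\n".join(lines))
    return "\n\n".join(blocks) + "\n"


CE_DN = "GlueCEUniqueID=ce.example.org:2119/jobmanager-pbs-short,mds-vo-name=local,o=grid"
STATIC = f"dn: {CE_DN}\nobjectClass: GlueCE\nGlueCEStateFreeCPUs: 0\n"
DYNAMIC = f"dn: {CE_DN}\nGlueCEStateFreeCPUs: 12\n"  # what a plug-in might print

if __name__ == "__main__":
    print(to_ldif(merge_dynamic(parse_ldif(STATIC), parse_ldif(DYNAMIC))), end="")
```

Keeping the static entry complete and letting the plug-in print only the attributes it measures means the plug-in stays small, which matches the motivation given on the previous slide.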

SiteInfo Schema
  objectClass: DataGridTop
  objectClass: SiteInfo
  in: information
  siteName: Name of site
  sysAdminContact: Contact details for local system administrator
  userSupportContact: Contact details for local user support
  siteSecurityContact: Contact details for site security
  dataGridVersion: Overall DataGrid software version
  installationDate: Date of installation
  cpuResourceDescription: Description of available CPU resources
  diskResourceDescription: Description of available disk resources
  supportedFileSystem: Supported file systems (AFS, NFS, local)
  batchSystem: Available batch system (PBS, LSF, BQS, Condor, fork, etc.)
  massStorageDescription: Description of available mass storage
  experimentalSoftware: Availability of software (ATLAS, CMS, ESA, etc.)

SiteInfo Information Provider
  dn: in=host.invalid/siteinfo,Mds-Vo-name=local,o=grid
  objectClass: SiteInfo
  objectClass: DataGridTop
  objectClass: DynamicObject
  siteName: SITE-NAME
  sysAdminContact: admin@host.invalid
  userSupportContact: support@host.invalid
  siteSecurityContact: security@host.invalid
  dataGridVersion: LCG-2_2_0
  installationDate: 20040106120000Z
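Since an information provider in its simplest form is a script that prints LDIF to standard output, an entry like the SiteInfo one above could be produced as follows. This is a sketch rather than the provider shipped with the middleware: the function name is hypothetical and the values are the placeholders from the slide.

```python
import sys
from typing import Dict


def site_info_ldif(host: str, attributes: Dict[str, str]) -> str:
    """Render one SiteInfo entry as LDIF text."""
    lines = [
        f"dn: in={host}/siteinfo,Mds-Vo-name=local,o=grid",
        "objectClass: SiteInfo",
        "objectClass: DataGridTop",
        "objectClass: DynamicObject",
    ]
    lines.extend(f"{name}: {value}" for name, value in attributes.items())
    return "\n".join(lines) + "\n"


# Placeholder values copied from the slide, not real site data.
EXAMPLE = {
    "siteName": "SITE-NAME",
    "sysAdminContact": "admin@host.invalid",
    "userSupportContact": "support@host.invalid",
    "siteSecurityContact": "security@host.invalid",
    "dataGridVersion": "LCG-2_2_0",
    "installationDate": "20040106120000Z",
}

if __name__ == "__main__":
    sys.stdout.write(site_info_ldif("host.invalid", EXAMPLE))
```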

BDII Update Process (1)
The BDII consists of two or more standard LDAP databases that are populated by an update process. The update process obtains LDIF (LDAP Data Interchange Format) either by running ldapsearch against a list of LDAP URLs or by running a local script that generates LDIF. The LDIF is then inserted into the LDAP database. Options exist to update the list of LDAP URLs from a web page and to use an LDIF file from a web page to modify the data before inserting it into the database. The daemon script that starts the update process is bdii-update.

BDII Update Process (2)
The update process does the following in an endless loop:
1. Update the configuration file that contains the LDAP URLs from the web.
2. Fork off processes to obtain the LDIF from the URLs.
3. Write the LDIF files in the temporary directory.
4. Try to obtain an LDIF file that will modify (filter) the data.
5. Modify the data read from the temporary files.
6. Kill the oldest database instance along with any stale clients.
7. Clear the corresponding database.
8. Add the fresh data to a new database.
9. Start the new database.
10. Forward traffic from the BDII read port to the new database.
11. Sleep for a while.
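The loop above amounts to filling a spare database while the current one keeps answering queries, then switching readers over. The sketch below models that idea in memory only. The class and function names are hypothetical, dictionaries stand in for the LDAP databases, callables stand in for the LDAP URLs, and skipping an unreachable source is an assumption rather than documented behaviour.

```python
import time
from typing import Callable, Dict, Iterable, List, Optional

Entries = Dict[str, Dict[str, str]]  # dn -> attributes


class BdiiSketch:
    """In-memory stand-in for the BDII databases and its read port."""

    def __init__(self, instances: int = 2) -> None:
        self.databases: List[Entries] = [{} for _ in range(instances)]
        self.read_index = 0  # instance the read port currently forwards to

    def update_once(self,
                    sources: Iterable[Callable[[], Entries]],
                    ldif_filter: Optional[Callable[[Entries], Entries]] = None) -> None:
        fresh: Entries = {}
        for fetch in sources:              # steps 2-3: obtain the data
            try:
                fresh.update(fetch())
            except Exception:
                continue                   # assumption: skip unreachable sources
        if ldif_filter is not None:        # steps 4-5: modify (filter) the data
            fresh = ldif_filter(fresh)
        # Steps 6-9: with round-robin switching, the next instance is the oldest.
        target = (self.read_index + 1) % len(self.databases)
        self.databases[target] = dict(fresh)
        self.read_index = target           # step 10: readers now see fresh data

    def search(self) -> Entries:
        """What a client reading from the BDII would currently get."""
        return dict(self.databases[self.read_index])


def run(bdii: BdiiSketch, sources: List[Callable[[], Entries]],
        cycles: int, pause_seconds: float = 120.0) -> None:
    """Bounded version of the endless loop, with step 11 as the pause."""
    for _ in range(cycles):
        bdii.update_once(sources)
        time.sleep(pause_seconds)
```

Clients are never pointed at a database that is being rebuilt, which is the reason the real BDII keeps at least two LDAP databases.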

lcg-info

Practicals: lcg-info
  -h, --help: shows the help.
  --list-attrs: prints the list of the possible attributes.
  --list-ce: lists the CEs which satisfy a query, or all the CEs if no query is given.
  --list-se: lists the SEs which satisfy a query, or all the SEs if no query is given.
  --bdii: specifies a BDII in the form <hostname>:<port>. If not given, the value of the environment variable LCG_GFAL_INFOSYS is used. If that is not defined, the command returns an error.
  --vo: restricts the output to CEs or SEs where the given VO is authorized.
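The fallback described for --bdii (command-line value first, then LCG_GFAL_INFOSYS, otherwise an error) can be written down as a small function. This is an illustration of the rule, not code from lcg-info: the function name is hypothetical and it assumes a single <hostname>:<port> value.

```python
import os
from typing import Mapping, Optional, Tuple


def resolve_bdii(cli_value: Optional[str],
                 environ: Mapping[str, str] = os.environ) -> Tuple[str, int]:
    """Pick the BDII endpoint the way the --bdii description states."""
    value = cli_value or environ.get("LCG_GFAL_INFOSYS")
    if not value:
        raise ValueError("no BDII given: use --bdii or set LCG_GFAL_INFOSYS")
    host, sep, port = value.rpartition(":")
    if not sep or not host or not port.isdigit():
        raise ValueError(f"expected <hostname>:<port>, got {value!r}")
    return host, int(port)


if __name__ == "__main__":
    # Hypothetical host name; an explicit value wins over the environment.
    print(resolve_bdii("bdii.example.org:2170", {"LCG_GFAL_INFOSYS": "other.example.org:2170"}))
```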

lcg-infosites

Practicals: lcg-infosites
  -h, --help: help option
  --vo: VO name (mandatory)
  --is: specifies a non-default top BDII
Some options:
  se: the names of the SEs supported by the user's VO
  ce: information on the number of CPUs, running jobs, etc.
  rb: names of the RBs available for each VO
  sitenames: names of the LCG sites
  tag: the names of the tags for the software installed at each site, printed together with the corresponding CE
  closeSE: the names of the CEs where the user's VO is allowed to run, together with their corresponding closest SEs
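The rules listed above (a mandatory VO, one option from a fixed set, an optional non-default BDII) can be checked before the command is launched. The helper below only assembles the argument list. Its name is hypothetical, the VO and host in the example are invented, and the order of the arguments is an assumption that should be checked against lcg-infosites --help.

```python
from typing import List, Optional

# Option names taken from the slide above.
OPTIONS = ("se", "ce", "rb", "sitenames", "tag", "closeSE")


def lcg_infosites_argv(vo: str, option: str, bdii: Optional[str] = None) -> List[str]:
    """Assemble an lcg-infosites command line (argument order is assumed)."""
    if not vo:
        raise ValueError("--vo is mandatory")
    if option not in OPTIONS:
        raise ValueError(f"unknown option {option!r}; expected one of {OPTIONS}")
    argv = ["lcg-infosites", "--vo", vo, option]
    if bdii:
        argv += ["--is", bdii]  # non-default top BDII
    return argv


if __name__ == "__main__":
    print(" ".join(lcg_infosites_argv("gilda", "ce", bdii="bdii.example.org")))
```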

References
- Practical exercise (Esercitazione): http://gilda.ct.infn.it/twiki/bin/view/GILDA/InformationSystems#Information_System_BDII
- LCG-2 User Guide: https://edms.cern.ch/file/454439//LCG-2-UserGuide.html
- GLUE Schema: http://infnforge.cnaf.infn.it/glueinfomodel/
