The gLite Information System

The gLite Information System Domenico Vicinanza, CERN EELA Tutorial, Santiago, September 2006

Information System
What?
- A system that collects information on the state of resources.
Why?
- To discover the resources of the grid and their nature.
- To give whoever is in charge of managing the workload the data needed to do it more efficiently.
- To check the health status of resources.
How?
- By monitoring the state of resources locally and publishing fresh data to the information system.
- By adopting a data model that MUST be well known to all components that want to access the monitored information.
- By using different approaches, which are investigated in the next slides.
Santiago, Chile, EELA Tutorial, 06-07.09.2006

Uses of the IS in Grid
- If you are a middleware developer
  - Workload Management System: matching job requirements and Grid resources
  - Monitoring Services: retrieving information on the status and availability of Grid resources
- If you are a user
  - Retrieve information on Grid resources and their status
  - Get information on the status of your jobs
- If you are a site manager or service
  - You "generate" the information, for example related to your site or to a given service

Elements behind the IS
A user's question: "I need to know all the CEs which serve my VO to send my jobs in bunches. What about the SEs capacities?"
She will use some gLite tools and, after a few moments, she gets:

*******************************************************************************
These are the data for alice: (in terms of CPUs)
*******************************************************************************
#CPU Free Total Jobs Running Waiting Computing Element
----------------------------------------------------------------------------
52 51 0 0 0 ce.prd.hp.com:2119/jobmanager-lcgpbs-long
14 3 2 1 lcg06.sinp.msu.ru:2119/jobmanager-lcgpbs-long
[…………]
The total values are:
----------------------
10347 5565 2717 924 1793

¤ Something has managed this information: (General IS architecture)
¤ Something has provided it: (Providers, Servers)
¤ It is following a certain "schema": (GLUE Schema)
¤ And she has accessed it following a protocol: (Access Protocol: LDAP)

LCG Information System
LCG adopts a combination of solutions:
- MDS (Globus Monitoring and Discovery Service)
  - Sits at the lowest level of the information system
  - Discovers and monitors resources and publishes information
  - Uses Grid Security Infrastructure (GSI) credentials
  - Caching
- BDII (Berkeley DB Information Index)
  - Sits at the highest level of the system
  - Introduced because MDS had scalability problems
  - Used by the Resource Broker for the matchmaking process
  - Can be configured by each VO
  - Queries the underlying systems periodically (every 2 minutes)
- Hierarchical system
  - Information is collected on the leaves of a hierarchical tree and travels towards the root
  - Clients can query the hierarchical tree at every level
  - The higher the level against which a query is made, the older the information obtained

Information System
- The BDII (Berkeley DB Information Index) has been adopted in the LCG middleware as the Information System provider.
  - It is an evolution of the Globus Monitoring and Discovery Service (MDS).
  - LCG-2 currently adopts BDII as its Information System.
  - It is based on a Lightweight Directory Access Protocol (LDAP) server.
- The Relational Grid Monitoring Architecture (R-GMA)
  - It is a relational implementation of the Grid Monitoring Architecture (GMA) standardized by the Global Grid Forum (GGF).
  - It is strongly Web Services oriented.
  - It uses standard SQL query syntax.

Collecting Information
- Information is gathered at different levels
  - Lower level: Grid Resource Information Server (GRIS), part of MDS
    - Collects information on the state of a given resource
    - One GRIS on top of each resource: CE, SE, RB, MyProxy
    - A set of scripts and sensors that try to extract useful information on the resource
  - Medium level: Grid Index Information Server (GIIS), the local BDII
    - Collects information on the resources of a given site
    - One GIIS for each site
  - Higher level: top-level BDII
    - Collects information on the resources of a given VO
    - One BDII for each VO (suggested solution)
- Way of collecting information
  - Pull model (higher-level servers periodically query lower-level servers)
  - LDAP query model

The hierarchy
Way of working:
- One GRIS for each resource
- One GIIS for each site, collecting information from the GRISes below it
- One BDII for a given VO, collecting information from the GIISes below it
- Two LDAP servers, one for write access and one for read access
- Every two minutes a cron job runs a script that collects information from a list of site GIISes
- The list of GIISes is kept in the configuration file of the BDII
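The pull model and its effect on data freshness can be sketched in a few lines of Python. This is a toy simulation, not BDII code: the class names, the host name `ce01.example.org`, the CPU counts and the timestamps are all assumptions made for illustration. Only the two-minute refresh period and the GRIS, GIIS, BDII layering come from the slides.

```python
from dataclasses import dataclass, field
from typing import Optional

REFRESH_PERIOD = 120  # seconds; the two-minute cycle mentioned in the slides


@dataclass
class Gris:
    """Resource-level server: always reports the live state of one resource."""
    resource: str
    free_cpus: int

    def query(self, now: int) -> dict:
        return {self.resource: {"free_cpus": self.free_cpus, "measured_at": now}}


@dataclass
class CachingIndex:
    """Stand-in for a site GIIS or a top-level BDII: pulls from below, then caches."""
    name: str
    sources: list
    cache: dict = field(default_factory=dict)
    last_refresh: Optional[int] = None

    def refresh(self, now: int) -> None:
        # Pull model: the higher level queries every server one level below.
        merged = {}
        for source in self.sources:
            merged.update(source.query(now))
        self.cache = merged
        self.last_refresh = now

    def query(self, now: int) -> dict:
        # Refresh only when the cached copy is at least one period old.
        if self.last_refresh is None or now - self.last_refresh >= REFRESH_PERIOD:
            self.refresh(now)
        return dict(self.cache)


ce = Gris("ce01.example.org", free_cpus=10)   # hypothetical resource
giis = CachingIndex("site-giis", [ce])
bdii = CachingIndex("top-bdii", [giis])

giis.refresh(0)        # the site GIIS pulls from the GRIS at t=0
bdii.refresh(100)      # the BDII pulls the GIIS cache at t=100 (data measured at t=0)
ce.free_cpus = 3
giis.query(120)        # the GIIS cache has expired, so it pulls again at t=120
ce.free_cpus = 1

now = 130
views = {"GRIS": ce.query(now), "site GIIS": giis.query(now), "top BDII": bdii.query(now)}
ages = {level: now - view[ce.resource]["measured_at"] for level, view in views.items()}
free = {level: view[ce.resource]["free_cpus"] for level, view in views.items()}
print(ages)
print(free)
```

In this run the same query returns three different answers depending on the level asked, and the answer gets older towards the root of the hierarchy.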

The LDAP Protocol
► LDAP structures data as a tree, the Directory Information Tree (DIT)
► The values of each entry are uniquely named
► Following a path from the node back to the root of the DIT, a unique name is built (the DN):
  "id=dv,ou=IT,or=CERN,st=Geneva,c=Switzerland,o=grid"

o=grid (root of the DIT)
  c=US
  c=Switzerland
    st=Geneva
      or=CERN
        ou=IT
          id=dv  (objectClass: person, cn: Vicinanza D., phone: 5555666, office: 28-r026)
          id=gv
          id=fd
        ou=EP
  c=Spain
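As a small illustration of how the DN is built, the sketch below joins the path from the root to the entry in reverse order. The helper names `build_dn` and `parse_dn` are made up for this example, and the parsing is deliberately naive: it assumes values contain no escaped commas or equals signs, which real LDAP DNs may.

```python
def build_dn(path_from_root):
    """Build a DN from (attribute, value) pairs ordered from the root to the entry."""
    # A DN lists the entry first and the root last, so the path is reversed.
    return ",".join(f"{attr}={value}" for attr, value in reversed(path_from_root))


def parse_dn(dn):
    """Naive inverse of build_dn: split a DN back into a root-to-entry path."""
    return [tuple(rdn.split("=", 1)) for rdn in reversed(dn.split(","))]


# Path taken from the example tree in the slide.
path = [
    ("o", "grid"),
    ("c", "Switzerland"),
    ("st", "Geneva"),
    ("or", "CERN"),
    ("ou", "IT"),
    ("id", "dv"),
]

dn = build_dn(path)
print(dn)  # id=dv,ou=IT,or=CERN,st=Geneva,c=Switzerland,o=grid
```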

R-GMA
- The Relational Grid Monitoring Architecture (R-GMA)
  - It is the relational implementation of the GMA defined by the GGF
  - Adopts a database model with tables and relations between tables
  - Implements a virtual database
  - The user queries R-GMA as if querying a conventional database (with an SQL string)
  - Implements different types of queries
- The information
  - Is produced and accessed locally at its site
  - Is always new
  - Can be collected by an entity (secondary producer) so that it can be accessed faster

GMA Architecture and Relational Model
- The Producer stores its location (URL) in the Registry.
- The Consumer looks up producer URLs in the Registry.
- The Consumer contacts the Producer to get all the data.
- Or the Consumer can listen to the Producer for new data.

[Diagram: Producer -> Registry (store location); Consumer -> Registry (look up location); Consumer <-> Producer (execute or stream data)]

Example table and query:
name  ID  birth       Group
Tom   4   1977-08-20  HR

SELECT * FROM people WHERE group='HR'
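The store, look-up and execute steps can be sketched with plain Python objects. This is not the R-GMA API: the `Registry` and `Producer` classes, the `consume` function, the URL and the second table row are hypothetical, and a dictionary stands in for contacting a producer over the network. Only the row for Tom and the `Group = 'HR'` condition come from the slide.

```python
class Registry:
    """Maps a table name to the URLs of the producers that publish it."""

    def __init__(self):
        self._producers = {}

    def register(self, table, url):
        self._producers.setdefault(table, []).append(url)

    def lookup(self, table):
        return list(self._producers.get(table, []))


class Producer:
    """Holds tuples for one table and answers filtered queries."""

    def __init__(self, url, table, rows):
        self.url, self.table, self.rows = url, table, rows

    def execute(self, predicate):
        return [row for row in self.rows if predicate(row)]


def consume(registry, network, table, predicate):
    """Consumer side: look up producer URLs, then contact each producer."""
    results = []
    for url in registry.lookup(table):
        results.extend(network[url].execute(predicate))
    return results


registry = Registry()
producer = Producer(
    "http://producer.example.org/people",  # hypothetical URL
    "people",
    [
        {"name": "Tom", "ID": 4, "birth": "1977-08-20", "Group": "HR"},
        {"name": "Ann", "ID": 7, "birth": "1980-01-15", "Group": "IT"},  # made-up row
    ],
)
network = {producer.url: producer}              # stand-in for the network
registry.register(producer.table, producer.url)  # step 1: store location

# Steps 2 and 3: look up the location, then execute the query on the producer.
hr_people = consume(registry, network, "people", lambda row: row["Group"] == "HR")
print(hr_people)
```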

Multiple Producers
- The Consumer will get all the URLs that could satisfy the query.
- The Consumer will connect to all the Producers.
- Producers that can satisfy the query will send the tuples to the Consumer.
- The Consumer will merge these tuples to form one result set.

[Diagram: the Registry maps TableName to URL 1 and URL 2; Producer 1 holds TableName with Value 1 and Value 2; Producer 2 holds TableName with Value 3 and Value 4; the Consumer sees TableName with Value 1, Value 2, Value 3 and Value 4]

Select * from CPULoad

CPULoad (Producer 1)
Country  Site  Facility  Load  Timestamp
UK       RAL   CDF       0.3   19055711022002
               ATLAS     1.6   19055611022002

CPULoad (Producer 2)
Country  Site  Facility  Load  Timestamp
UK       GLA   CDF       0.4   19055811022002
               ALICE     0.5   19055611022002

CPULoad (Producer 3)
Country  Site  Facility  Load  Timestamp
CH       CERN  ATLAS     1.6   19055611022002
               CDF       0.6   19055511022002

CPULoad (Consumer): the merged result set, with the same columns (Country, Site, Facility, Load, Timestamp), holding the tuples published by the three producers.
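The consumer's view of CPULoad as one virtual table can be imitated with an in-memory SQLite database. This is a sketch of the idea only, since R-GMA does not work by copying tuples into SQLite. The rows are the producer tuples from the slide, under the assumption that the second row of each producer shares the Country and Site of the first.

```python
import sqlite3

# Tuples published by each producer (Country, Site, Facility, Load, Timestamp).
producers = {
    "Producer 1": [
        ("UK", "RAL", "CDF", 0.3, "19055711022002"),
        ("UK", "RAL", "ATLAS", 1.6, "19055611022002"),
    ],
    "Producer 2": [
        ("UK", "GLA", "CDF", 0.4, "19055811022002"),
        ("UK", "GLA", "ALICE", 0.5, "19055611022002"),
    ],
    "Producer 3": [
        ("CH", "CERN", "ATLAS", 1.6, "19055611022002"),
        ("CH", "CERN", "CDF", 0.6, "19055511022002"),
    ],
}

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE CPULoad "
    "(Country TEXT, Site TEXT, Facility TEXT, Load REAL, Timestamp TEXT)"
)

# The consumer merges the tuples of every producer into one result set.
for rows in producers.values():
    conn.executemany("INSERT INTO CPULoad VALUES (?, ?, ?, ?, ?)", rows)

result = conn.execute("SELECT * FROM CPULoad").fetchall()
for row in result:
    print(row)
```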

Joins

Service
URI          VO     type  emailContact    site
gppse01      alice  SE    sysad@rl.ac.uk  RAL
             atlas
gppse02      cms
lxshare0404               sysad@cern.ch   CERN

ServiceStatus
URI          VO     type  up  status
gppse01      alice  SE    y   SE is running
             atlas
gppse02      cms          n   SE ERROR 101
lxshare0404

Result Set (Consumer)
URI      emailContact
gppse02  sysad@rl.ac.uk

SELECT S.URI, S.emailContact FROM Service S, ServiceStatus SS WHERE (S.URI = SS.URI AND SS.up = 'n')
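The join can be tried out with SQLite. The rows below are a simplified reconstruction, assumed for illustration: one row per URI, with the VO, status and contact of `lxshare0404` guessed where the slide text does not show them clearly. The query uses the table aliases consistently and separates the selected columns with a comma.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Service (URI TEXT, VO TEXT, type TEXT, emailContact TEXT, site TEXT)"
)
conn.execute(
    "CREATE TABLE ServiceStatus (URI TEXT, VO TEXT, type TEXT, up TEXT, status TEXT)"
)

# Assumed rows, loosely based on the slide (one row per URI for simplicity).
conn.executemany(
    "INSERT INTO Service VALUES (?, ?, ?, ?, ?)",
    [
        ("gppse01", "alice", "SE", "sysad@rl.ac.uk", "RAL"),
        ("gppse02", "cms", "SE", "sysad@rl.ac.uk", "RAL"),
        ("lxshare0404", "alice", "SE", "sysad@cern.ch", "CERN"),
    ],
)
conn.executemany(
    "INSERT INTO ServiceStatus VALUES (?, ?, ?, ?, ?)",
    [
        ("gppse01", "alice", "SE", "y", "SE is running"),
        ("gppse02", "cms", "SE", "n", "SE ERROR 101"),
        ("lxshare0404", "alice", "SE", "y", "SE is running"),
    ],
)

# Who should be contacted about services that are down?
down_contacts = conn.execute(
    "SELECT S.URI, S.emailContact "
    "FROM Service S, ServiceStatus SS "
    "WHERE S.URI = SS.URI AND SS.up = 'n'"
).fetchall()
print(down_contacts)
```

With these rows the query returns the single pair shown in the slide's result set: `gppse02` and `sysad@rl.ac.uk`.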

GLUE Schema

Definition and main goals
- Schema: a description of the objects and attributes needed to describe Grid resources, and of the relationships between the objects.
- Main goals:
  - Define a minimum common schema required for interoperability, covering Compute Elements, Network Elements and Storage Elements
  - Address the need for common schemas between projects
  - Be framework independent (LDAP, SQL, XML)

Glue Schema
- Grid Laboratory Uniform Environment (GLUE) Schema
  - A data model that describes information on grid resources in a meaningful way (static and dynamic information)
  - The result of a collaboration between the EU-DataTAG and iVDGL projects
  - EGEE, NorduGrid, LCG and Grid3/OSG contributed to the definition of the schema
- XML Schema
  - The GLUE Schema is now being mapped to an XML representation
  - http://infnforge.cnaf.infn.it/glueinfomodel/Spec/V12/R1

Example of attributes
- Operating System: OSName, OSRelease, OSVersion
- QueueState: RunningJobs, TotalJobs, QueueStatus, WaitQueueLength, WorstResponseTime, EstimatedResponseTime
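To show how such attributes might be used for matchmaking, here is a hedged sketch that filters computing elements by operating system and queue status and ranks them by estimated response time. The field names mirror the attributes listed above. Everything else is an assumption for illustration: the `ComputingElementInfo` class, the `match` function, the host names, the status values and all numbers. It is not the Resource Broker's actual algorithm.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ComputingElementInfo:
    ce_id: str                    # hypothetical identifier
    os_name: str                  # OSName
    os_release: str               # OSRelease
    running_jobs: int             # RunningJobs
    total_jobs: int               # TotalJobs
    queue_status: str             # QueueStatus
    estimated_response_time: int  # EstimatedResponseTime, in seconds
    worst_response_time: int      # WorstResponseTime, in seconds


def match(ces, required_os):
    """Return usable CEs with the required OS, fastest estimated response first."""
    candidates = [
        ce for ce in ces
        if ce.os_name == required_os and ce.queue_status == "Production"
    ]
    return sorted(candidates, key=lambda ce: ce.estimated_response_time)


# Made-up records, as an information system might publish them.
published = [
    ComputingElementInfo("ce-a.example.org", "Scientific Linux", "3.0.6", 40, 55, "Production", 900, 7200),
    ComputingElementInfo("ce-b.example.org", "Scientific Linux", "3.0.6", 5, 5, "Production", 60, 600),
    ComputingElementInfo("ce-c.example.org", "Scientific Linux", "3.0.6", 0, 0, "Closed", 0, 0),
    ComputingElementInfo("ce-d.example.org", "Debian", "3.1", 1, 1, "Production", 10, 100),
]

ranked = match(published, required_os="Scientific Linux")
print([ce.ce_id for ce in ranked])
```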

Site Element

Cluster Element

Computing Element

References
- gLite 3.0 User Guide: https://edms.cern.ch/file/722398/1.1/gLite-3-UserGuide.pdf
- R-GMA home page: http://www.r-gma.org/
- GLUE Schema: http://infnforge.cnaf.infn.it/glueinfomodel/

Questions…
Thanks to Roberto Barbera, who first developed these slides.