gLite Information System Overview

Slides:



Advertisements
Similar presentations
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Advertisements

The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
Grid Computing, B. Wilkinson, 20046c.1 Globus III - Information Services.
Systems Architecture, Fourth Edition1 Internet and Distributed Application Services Chapter 13.
INFSO-RI Enabling Grids for E-sciencE Practicals on VOMS and MyProxy Emidio Giorgio INFN Retreat between GILDA and ESR VO, Bratislava,
The EU DataGrid – Information and Monitoring Services The European DataGrid Project Team
E-science grid facility for Europe and Latin America Installation and configuration of a top BDII Gianni M. Ricciardi – Consorzio COMETA.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Middleware: gLite Information Systems (IS) EGEE Tutorial 23 rd APAN Meeting,
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Information System (IS) Valeria Ardizzone.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System on gLite middleware Vincent.
Implementing LDAP Client/Server System for Directory Service By Maochun Sun Project Advisor: Dr. Chung-E Wang Department of Computer Science California.
LDAP (Lightweight Directory Access Protocol ) Speaker: Chang-Yu Wu Adviser: Quincy Wu Date:2007/08/22.
By Rashid Khan Lesson 6-Building a Directory Service.
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
The EU DataGrid – Information and Monitoring Services The European DataGrid Project Team
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical: The Information Systems.
Active Directory. Computers in organizations Computers are linked together for communication and sharing of resources There is always a need to administer.
Directory Services CS5493/7493. Directory Services Directory services represent a technological breakthrough by integrating into a single management tool:
FESR Trinacria Grid Virtual Laboratory gLite Information System Muoio Annamaria INFN - Catania gLite 3.0 Tutorial Trigrid Catania,
EGI-InSPIRE RI Grid Training for Power Users EGI-InSPIRE N G I A E G I S Grid Training for Power Users Institute of Physics Belgrade.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) 马兰馨 IHEP, CAS Hands on gLite Security.
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
First South Africa Grid Training Installation and configuration of BDII Gianni M. Ricciardi Consorzio COMETA First South Africa Grid Training Catania,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks BDII Server Installation & Configuration.
E-science grid facility for Europe and Latin America Updates on Information System Annamaria Muoio - INFN Tutorials for trainers 01/07/2008.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Job monitoring and accounting data visualization
The Information System in gLite middleware
Practicals on gLite Information Systems
gLite Information System
OGF PGI – EDGI Security Use Case and Requirements
Classic Storage Element
Information System: Hands On
How to connect your DG to EDGeS? Zoltán Farkas, MTA SZTAKI
Practicals on VOMS and MyProxy
Installation and configuration of a top BDII
Introduction to LDAP Frank A. Kuse.
gLite Information System(s)
Practical: The Information Systems
Introductions Using gLite Grid Miguel Angel Díaz Corchero
BDII Performance Tests
gLite Information System overview
lcg-infosites documentation (v2.1, LCG2.3.1) 10/03/05
The EU DataGrid – Information and Monitoring Services
The Information System in gLite
Middleware independent Information Service
Sergio Fantinel, INFN LNL/PD
Introduction to Grid Technology
Workload Management System
Information System Virginia Martín-Rubio Pascual
gLite Information System
Practicals on gLite Information System
gLite Information System
a VO-oriented perspective
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Hands-on on Information System
Application layer Lecture 7.
gLite Information System(s)
The Globus Toolkit™: Information Services
Report on GLUE activities 5th EU-DataGRID Conference
Introduction to Name and Directory Services
gLite Information System overview
EGEE Middleware: gLite Information Systems (IS)
Information and Monitoring System
Architecture of the gLite Data Management System
gLite Information System
The gLite Information System
Information System (BDII)
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

gLite Information System Overview Raquel Pezoa (raquel.pezoa@usm.cl) LA 3 2010 - EPIKH School for Application Porting Valparaíso, 29.11.2010

Outline This presentation is based on previous presentations from EPIKH Schools for Application Porting

Outline Motivation / Introduction Background GLUE Schema LDAP MDS Architecture Monitoring Information system in practice lcg-infosites lcg-info

Introduction

The power Grid paradigm wind coal Electric Power Grid System Users can access electrical power coming from different (and heterogeneous) sources nuclear hydroelectric

The computing Grid approach System Users can access storage and computing resources coming from different (and heterogeneous) sources The resources shared within the Grid can be physical objects (CPUs, storage devices) or logical resources (computing queues, distributed file systems)

Information System An Information System is a vital component in a grid, because of in a heterogeneous and dynamic grid environment it is necessary to be aware about the diverse resources and services that compose the grid. A Grid Information System (GIS) provides special services to discover, characterize and monitor the resources and services.

Information System What is? Why? How? System to collect information on the state of resources Why? To discover resources of the grid and their nature To have useful data in order to who is in charge of managing the workload to do it more efficiently. To check for health status of resources. How? Monitoring state of resources locally and publishing fresh data on the information system. Adopting a data model that MUST be well known to all components that want to access monitored information Using different approaches that we are going to investigate in the next slides

The Info. system scope Discover new added resources Make available updated information about the resources Monitor resources load and status How does it work? Collecting information from the resources and publishing fresh data periodically. Adopting a data model that MUST be well known to all Grid components.

How it is used and by whom? Users Retrieve information about resources: Where can I run my job? Where can I copy my files? Which software packages are available on a given CE? Site managers Publish information about the resources and services they provide Middleware WMS: matching job requirements and allocating the resources Monitoring Services: retrieving information about the status and availability of resources.

GLUE Schema LDAP MDS Architecture Background GLUE Schema LDAP MDS Architecture

GLUE Schema - Overview Stands for “Grid Laboratory Uniform Environment” Provides a standardized description of a Grid computing system It is not tied to any particular implementation The Grid components are represented as objects which have attributes and relations to other objects

GLUE Schema – Why we need it? The Grid is a paradigm of distributed computing that enables the coordination of resources and services not subject to centralized control. These resources are geographically dispersed, span multiple trust domains and are heterogeneous. Resources can be dynamically contributed by different owner institutions, so a precise and shared description of resources among information consumers and resource providers is necessary. This description should also be common to different Grid infrastructures in order to contribute to the interoperability among them.

GLUE Schema – Core entities

GLUE Schema – Site properties Site entity – Properties: Property Type Description UniqueID string Unique Identifier of the Site Name Human-readable name Short description of this site EmailContact The main email contact for the site. Syntax rule: "mailto:" followed by a list of email addresses separated by a comma UserSupportContact E-mail addresses of the support service SysAdminContact E-mail addresses of the system administrator SecurityContact E-mail addresses of the security manager Location Geographical location of this site (e.g., city, state, country) Latitude real32 The position of a place north or south of the equator measured from -90º to 90º, with positive values going north and negative values going south Longitude The position of a place east or west of Greenwich, England measured from -180º to 180º with positive values going east and negative values going west Web uri The URI identifying a web page with more information about this site Sponsor VO sponsoring the site; the syntax should allow the expression of the percentage of sponsorship OtherInfo This attribute is to be used to publish info that does not fit in any other attribute of the site entity

GLUE Schema – Service properties Service entity – Properties: Property Type Description UniqueID string Unique Identifier of this service Name Human-friendly name serviceType_t The service type Version Version of the service: <major version number>.<minor version number>.<patch version number> Endpoint uri Network endpoint for this service Status serviceStatus_t Status of the service. String enumeration: OK, Warning, Critical, Unknown, Other StatusInfo Textual explanation for the status of the service WSDL URI of the WSDL describing the service Semantics URL of detailed description StartTime dateTime_xs_t The timestamp related to last start time of this service Owner Owner of the service AccessControlBase.Rule ACL_t Authorization rule for this entity

GLUE Schema – CE entity

LDAP - Overview Stands for “Lightweight Directory Access Protocol” It is a protocol that defines the method by which directory data is accessed Optimized for reading, browsing and searching information (‘write-once-read-many-times’ service) Data is represented as a hierarchy of objects (entities) forming a tree structure Data Information Tree (DIT)

LDAP – Data Information Tree Attribute Type String representation Distinguished Name (DN) Unique name (path) that unambiguously identifies a single entry dc=grid (root of the DIT) Attribute Type String domainComponent DC CountryName C OrganizationName O OrganizationUnitName OU CommonName CN c= Brazil c=Italy c=Spain o=INFN ou=Catania ou=Roma cn=Elisa Ingra cn=Elisa Ingra,ou=Catania,o=INFN,c=Italy,dc=grid

LDAP – Data Information Tree The tree hierarchy is described by textual files following the LDIF format (LDAP Data Interchange Files) dn: dc=example,dc=root dc: example description: My company objectClass: dcObject objectClass: organization o: Example, Inc. ## FIRST Level hierarchy - people # this is an ENTRY sequence and is preceded by a BLANK line dn: ou=people, dc=example,dc=root ou: people description: All people in organisation objectClass: organizationalUnit ## SECOND Level hierarchy - people entries dn: cn=Joe Schmo,ou=people,dc=example,dc=root objectclass: inetOrgPerson cn: Joe Schmo sn: Schmo uid: jschmo mail: joe@example.com ou: sales

LDAP – Referrals An individual LDAP server might not store the entire DIT Servers need to be linked together in order to form a distributed directory that contains the whole DIT All client requests start at the global directory LDAP 1 LDAP 1 server LDAP 2 server LDAP 3 server c=USA,dc=example o=IBM,c=USA,dc=example dc=example referral c= USA o=CISCO o=IBM ou=IT ou=Web referral

MDS gLite adopts the Globus Monitoring and Discovery Service (MDS) architecture as its Information System It implements the GLUE Schema using OpenLDAP Grid Resource Information Server (GRIS) LDAP server at Grid Elements level (e.g. GRIS runs in a CE) Local CEs and SEs run an Information Provider software, which collects info about the resource Grid Index Information Server (or site BDII) (GIIS) LDAP server at a site level each Grid site to collect information by all GRISes present in that site Berkeley Database Information Index (BDII) Top level database Store and publish data gathered from the site Uses LDAP as protocol and GLUE Schema as data model

MDS - Architecture Site level BDII Computational resources available within a confederation of laboratories, the distributed data storage systems owned by a particular agency. Site level BDII Grid Index Information Server (GIIS) GIIS GIIS allows to onstruct aggregate directories and it provides a means of combining arbitrary GRIS Operating System , versions, CPU and memory availability

MDS - Architecture BDII consists of 2 servers: one contains a ready-only database and the other a write-only database Every 2 minutes a cron-job runs a script and collects info from lower-level servers (GIIS) Once updated, the 2 servers change roles

LDAP browser showing BDII info Top BDII http://gstat-prod.cern.ch/gstat/site/EELA-UTFSM/

LDAP browser showing BDII info Site BDII LDAP browser: LUMA, http://luma.sourceforge.net/

lcg-infosites In this section, two higher level tools will be presented: lcg-infosites and lcg-info. These tools should be enough for most common user needs and will usually avoid the necessity of raw LDAP queries. The lcg-infosites command is actually just a perl script wrapping a series of LDAP commands and was developed to allow users to retrieve information on Grid resources for the most common cases

Practicals: lcg-infosites USAGE: lcg-infosites --vo options -v --is -h/--help: help option --vo: VO name (mandatory) --is: it's possible to specify a not default Top BDII Some options: se: The names of the SEs supported by the user's VO ce: The information relative to number of CPUs, running jobs,etc. rb: Names of the Rbs available for each VO sitenames: Names of the LCG sites tag: The names of the tags relative to the software installed in site is printed together with the corresponding CE closeSE: The names of the CEs where the user's VO is allowed to run together with their corresponding closest SEs are provided

Practicals: lcg-infosites Login to a UI machine and do the following: rpezoa@suiza:~$ ssh cluster29.fis.utfsm.cl -l valparaiso18 valparaiso18@cluster29.fis.utfsm.cl's password: Last login: Mon Nov 29 08:09:50 2010 from ip116.11.priv.inf.utfsm.cl

Practicals: lcg-infosites Initiate a proxy to be able to run the grid commands [valparaiso22@cluster29 ~]$ voms-proxy-init --voms gilda Enter GRID pass phrase: Your identity: /C=IT/O=GILDA/OU=Personal Certificate/L=VALPARAISO/CN=VALPARAISO22 Creating temporary proxy .................................................................................................................................... Done Contacting voms.ct.infn.it:15001 [/C=IT/O=INFN/OU=Host/L=Catania/CN=voms.ct.infn.it] "gilda" Done Creating proxy .......................................................................................................... Done Your proxy is valid until Mon Nov 29 21:53:48 2010

Practicals: lcg-infosites [valparaiso18@cluster29 ~]$ lcg-infosites --vo gilda ce #CPU Free Total Jobs Running Waiting ComputingElement ---------------------------------------------------------- 8 8 0 0 0 ce-edu.grid.acad.bg:2119/jobmanager-pbs-gilda 28 27 0 0 0 ce.hpc.iit.bme.hu:2119/jobmanager-lcgpbs-gilda 168 157 33 3 30 ce1-egee.srce.hr:2119/jobmanager-sge-prod 22 20 2 0 2 grid010.ct.infn.it:2119/jobmanager-lcgpbs-gilda 12 12 1171 0 1171 gilda-01.pd.infn.it:2119/jobmanager-lcgpbs-gilda 18 17 1 0 1 sirius-ce.ct.infn.it:2119/jobmanager-lcgpbs-gilda 1952 1777 34 34 0 grisuce.scope.unina.it:2119/jobmanager-lcgpbs-grisu_short 1952 1777 2 2 0 grisuce.scope.unina.it:2119/jobmanager-lcgpbs-grisu_long 48 48 0 0 0 iceage-ce-01.ct.infn.it:2119/jobmanager-lcgpbs-gilda 16 16 0 0 0 gn0.hpcc.sztaki.hu:2119/jobmanager-lcgpbs-gilda 0 0 0 0 0 ce1.grid.edges-grid.eu:2119/jobmanager-edges-szdgr 0 0 0 0 0 ce1.grid.edges-grid.eu:2119/jobmanager-edges-xwlal 0 0 0 0 0 ce1.grid.edges-grid.eu:2119/jobmanager-edges-homeboinc 0 0 0 0 0 ce1.grid.edges-grid.eu:2119/jobmanager-edges-uow 0 0 0 0 0 ce1.grid.edges-grid.eu:2119/jobmanager-edges-xwlripub 0 0 0 0 0 ce1.grid.edges-grid.eu:2119/jobmanager-edges-asgctwdg [...]

Practicals: lcg-infosites [valparaiso18@cluster29 ~]$ lcg-infosites --vo gilda se Avail Space(Kb) Used Space(Kb) Type SEs ---------------------------------------------------------- n.a n.a n.a se-edu.grid.acad.bg 16241329 3464018 n.a se.hpc.iit.bme.hu 2390269117 97445067 n.a se1-egee.srce.hr 24377546 13973911 n.a gilda-02.pd.infn.it 80605789 4411630 n.a sirius-se.ct.infn.it 4089111554 704961140 n.a grisuse.scope.unina.it 60819307 6465967 n.a iceage-se-01.ct.infn.it 97172774 6523424 n.a dgt02.ui.savba.sk 1054708194 44668166 n.a fn2.hpcc.sztaki.hu 16740248674 301017243 n.a se.scope.unina.it

lcg-info The lcg-info command can be used to list either CEs or the SEs that satisfy a given set of conditions on their attributes, and to print, for each of them, the values of a given set of attributes. The information is taken from the BDII specified by the $LCG_GFAL_INFOSYS environment variable or in the command line. [valparaiso18@cluster29 ~]$ echo $LCG_GFAL_INFOSYS gilda-bdii.ct.infn.it:2170

Practicals: lcg-info -h/--help: to see the help USAGE: lcg-info [--list-ce | --list-se] [--query ] [--attrs ] -h/--help: to see the help –list-attrs: print the list of the possible attributes --list-ce: lists the CEs which satisfy a query, or all the CEs if no query is given. --list-se: lists the SEs which satisfy a query, or all the SEs if no query is given. --bdii: allows to specify a BDII in the form <hostname>:<port>. If not given, the value of the environmental variable LCG_GFAL_INFOSYS is used. If that is not defined, the command returns an error. --vo: restricts the output to CEs or SEs where the given VO is authorized.

Practicals: lcg-info [valparaiso18@cluster29 ~]$ lcg-info --vo gilda --list-ce --query 'TotalCPUs>=10' --attrs 'RunningJobs,FreeCPUs' - CE: ce.hpc.iit.bme.hu:2119/jobmanager-lcgpbs-gilda - RunningJobs 0 - FreeCPUs 27 - CE: ce.scope.unina.it:2119/jobmanager-lcgpbs-egee_long - RunningJobs 4 - FreeCPUs 1777 - CE: ce.scope.unina.it:2119/jobmanager-lcgpbs-egee_short - RunningJobs 70 List all the Computing Elements in the BDII satisfying the given conditions (TotalCPUs=10) and print the desired attributes (FreeCPUs,!Total Jobs)

Practicals: lcg-info [valparaiso18@cluster29 ~]$ lcg-info --vo gilda --list-ce --attrs Tag - CE: ce-edu.grid.acad.bg:2119/jobmanager-pbs-gilda - Tag GLITE-3_0_0 GLITE-3_0_1 GLITE-3_0_2 GLITE-3_1_0 LCG-2 LCG-2_1_0 LCG-2_1_1 [...] MPICH MPICH-1.2.7 MPICH-1.2.7p1 MPIEXEC MPIRUN MPI_SHARED_HOME R-GMA - CE: ce.hpc.iit.bme.hu:2119/jobmanager-lcgpbs-gilda - Tag ASTRA-STK One of the most common use of the "lcg-info" is to check which software packages are available on CE's. The software packages are stored in the "Tag" attribute.

Practicals $ lcg-info --vo gilda --list-ce --query 'Tag=MPICH' --attrs 'FreeCPUs‘ $ lcg-infosites --vo gilda ce $ lcg-infosites --vo gilda ce -v 2 $ lcg-infosites --vo gilda se $ lcg-infosites --vo eumed tag $ lcg-infosites --vo eumed closeSE $ lcg-info --vo eumed --list-ce --query 'Tag=MPICH' --attrs 'FreeCPUs‘ $lcg-info --vo eumed --list-ce --query 'TotalCPUs>=10' --attrs 'RunningJobs,FreeCPUs’ One of the most common use of the "lcg-info" is to check which software packages are available on CE's. The software packages are stored in the "Tag" attribute.

References gLite documentation LDAP Concepts & Overview http://glite.web.cern.ch/glite/documentation/ LDAP Concepts & Overview http://www.zytrax.com/books/ldap/ch2/ OpenLDAP community http://www.openldap.org GLUE Schema Specification version 1.3 Final http://forge.gridforum.org/sf/go/doc14185?nav=1 Lilongwe, EELA-2 Workshop, 13.11.2008