Presentation is loading. Please wait.

Presentation is loading. Please wait.

Introduction to the current EDG Testbed Software Krak ó w, December 2002 Steve Fisher – on behalf of The European.

Similar presentations


Presentation on theme: "Introduction to the current EDG Testbed Software Krak ó w, December 2002 Steve Fisher – on behalf of The European."— Presentation transcript:

1 Introduction to the current EDG Testbed Software Krak ó w, December 2002 Steve Fisher s.m.fisher@rl.ac.uk – RALs.m.fisher@rl.ac.uk on behalf of The European DataGrid Project Team http://www.edg.org/

2 The EDG Testbed Software - 2 The European DataGrid u Funded by the European Union n Jan 1, 2001 - Dec 31, 2003 u Develop, implement and exploit a large-scale data and CPU-oriented computational GRID. u Develop middleware, in collaboration with some of the leading centres of competence in GRID technology. u Complement, and help to coordinate at a European level, several on- going national GRID projects.

3 The EDG Testbed Software - 3 The EDG Main Partners  CERN – International (Switzerland/France)  CNRS - France  ESA/ESRIN – International (Italy)  INFN - Italy  NIKHEF – The Netherlands  PPARC - UK

4 The EDG Testbed Software - 4 Research and Academic Institutes CESNET (Czech Republic) Commissariat à l'énergie atomique (CEA) – France Computer and Automation Research Institute, Hungarian Academy of Sciences (MTA SZTAKI) Consiglio Nazionale delle Ricerche (Italy) Helsinki Institute of Physics – Finland Institut de Fisica d'Altes Energies (IFAE) - Spain Istituto Trentino di Cultura (IRST) – Italy Konrad-Zuse-Zentrum für Informationstechnik Berlin - Germany Royal Netherlands Meteorological Institute (KNMI) Ruprecht-Karls-Universität Heidelberg - Germany Stichting Academisch Rekencentrum Amsterdam (SARA) – Netherlands Swedish Research Council - Sweden EDG Assistant Partners Industrial Partners Datamat (Italy) IBM-UK (UK) CS-SI (France)

5 The EDG Testbed Software - 5 } } Applications u WP1: Work Load Management System u WP2: Data Management u WP3: Information and Monitoring u WP4: Fabric Management u WP5: Storage Element u WP6: Testbed and demonstrators u WP7: Network Monitoring u WP8: High Energy Physics u WP9: Earth Observation u WP10: Biology u WP11: Dissemination u WP12: Management EDG structure: work packages

6 The EDG Testbed Software - 6 Current EDG Testbed CERN Lyon RAL ManchesterNIKHEF Reference site: CERN Testbed1 EDG sites NorduGrid Italy: Bologna Cagliari Catania Milano Napoli Padova Parma Pisa Roma Torino NorduGrid: Bergen Copenhagen Helsinki Lund Oslo Stockholm Uppsala Karlsruhe Barcelona Madrid Lisboa

7 The EDG Testbed Software - 7 Security: Authentication/Authorization u Authentication n Who you are s users identified by certificates signed by a CA u Authorization n What you are allowed to do s based on membership of Virtual Organizations (VO).

8 The EDG Testbed Software - 8 Certificate Request VO user cert-request grid-cert-request once in every two- three years

9 The EDG Testbed Software - 9 Requesting a Certificate u grid-cert-request A certificate request and private key is being created. [...] Using configuration from /usr/local/grid/globus/etc/globus-user-ssleay.conf Generating a 1024 bit RSA private key [...] A private key and a certificate request has been generated with the subject: /O=Grid/O=CERN/OU=cern.ch/CN=Akos Frohner [...] Your private key is stored in.../.globus/userkey.pem Your request is stored in.../.globus/usercert_request.pem Please e-mail the certificate request to the CERN CA cat.../.globus/usercert_request.pem | mail cern-globus-ca@cern.ch Your certificate will be mailed to you within two working days.

10 The EDG Testbed Software - 10 Certificate Signing CA user cert-request grid-cert-request certificate cert signing

11 The EDG Testbed Software - 11 Registration/Authorization User registration in an EDG Virtual Organisation u convert your certificate: n openssl pkcs12 –export –in ~/.globus/usercert.pem –inkey ~/.globus/userkey.pem –out user.p12 –name ’Joe Smith’ u import your certificate in your browser u sign the usage guidelines: https://marianne.in2p3.fr/cgi-bin/datagrid/register/account.pl https://marianne.in2p3.fr/cgi-bin/datagrid/register/account.pl u ask an account from your VO administrator by email -> You are registered in the VO server and have a user account.

12 The EDG Testbed Software - 12 Registration VO user registration cert.pkcs12 convert certificate Usage guidelines Account Registration once for the lifetime of the VO – you may change the certificate keys!

13 The EDG Testbed Software - 13 Starting a Session user proxy-cert grid-proxy-init cert.pkcs12 certificate every 12/24 hours

14 The EDG Testbed Software - 14 Usage You must have a valid certificate from a trusted CA! u “login”: grid-proxy-init short lifetime certificate: 24 hours Enter PEM pass phrase:...........................+++++....................................+++++  checking the proxy: grid-proxy-info -subject /O=Grid/O=CERN/OU=cern.ch/CN=Akos Frohner/CN=proxy -> use the grid services u “logout”: grid-proxy-destroy

15 The EDG Testbed Software - 15 Configuration on the Server CA service host-cert cert signing host-request grid-cert-request ca-certificate crl cert/crl update crl automatically updated periodically

16 The EDG Testbed Software - 16 Authorization Information VO-server service host-cert gridmap mkgridmap ca-certificates crls automatically updated periodically

17 The EDG Testbed Software - 17 Using a Service user service proxy-cert grid-proxy-init cert.pkcs12 certificate host/proxy certs exchanged host-cert gridmap ca-certificates crls

18 The EDG Testbed Software - 18 EDG Logical Machine Types 1.User Interface (UI) 2.Resource Broker (RB) 3.Information Service (IS) 4.Computing Element (CE) n Gatekeeper (Front-end Node) n Worker Nodes (WN) 5.Storage Element (SE) 6.Replica Catalog (RC)

19 The EDG Testbed Software - 19 Information Systems overview u The aim of the Information and Monitoring Service is to deliver a flexible infrastructure that provides information on n the EU DataGrid itself n grid applications u EDG info systems are based upon Globus MDS (Metacomputing Directory Service or Monitoring and Discovery Service as it is now called) n Based on OpenLDAP, a hierarchical database u The information system is currently used mainly by the middleware. n You can use it to find out what is going on

20 The EDG Testbed Software - 20 LDAP attributes u A schema describes the attributes and the types of the attributes associated with data objects u Example - some attributes of SiteInfo: n siteName: RALDEV n sysAdminContact: grid.sysadmin@rl.ac.uk n userSupportContact: grid.support@rl.ac.uk n siteSecurityContact: grid.security@rl.ac.uk n dataGridVersion: 1.2 n InstallationDate: 20020704142800Z

21 The EDG Testbed Software - 21 LDAP hierarchy u Lightweight Directory Assess Protocol (LDAP) offers a hierarchical view of information u The objects are arranged in a Directory Information Tree (DIT) u One or more attributes represent the Relative Distinguished Name (RDN) u An object is identified by its Distinguished name n This is its RDN with the Distinguished name of its parent

22 The EDG Testbed Software - 22 RDNs and DNs u RDN n SE s seId=dev02.hepgrid.clrc.ac.uk n Protocols s seProtocol=gridftp s seProtocol=rfio s seProtocol=file u DN n Site s Mds-Vo-name=ral-dev,Mds-Vo- name=uk,o=Grid n SE s seId=dev02.hepgrid.clrc.ac.uk,Mds- Vo-name=ral-dev,Mds-Vo- name=uk,o=Grid n Protocols s seProtocol=gridftp, seId=dev02.hepgrid.clrc.ac.uk,Mds- Vo-name=ral-dev,Mds-Vo- name=uk,o=Grid s seProtocol=rfio, seId=dev02.hepgrid.clrc.ac.uk,Mds- Vo-name=ral-dev,Mds-Vo- name=uk,o=Grid s seProtocol=file, seId=dev02.hepgrid.clrc.ac.uk,Mds- Vo-name=ral-dev,Mds-Vo- name=uk,o=Grid supported protocols SE site

23 The EDG Testbed Software - 23 MDS GRISs & GIISs u Information providers are scripts which when invoked by the LDAP server make available the desired information n Information is cached by the server to improve performance u Within MDS the EDG information providers are invoked by a local LDAP server, the Grid Resource Information Server (GRIS) u “Aggregate directories”, Grid Information Index Servers (GIIS), are used to group resources u The GRISs use soft state registration to register with one or more GIISs u The GIIS can then act as a single point of contact for a number of resources n A GIIS may represent a site, country, virtual organization, etc. u In turn a GIIS may register with another GIIS

24 The EDG Testbed Software - 24 EDG Information Providers & the Directory Information Tree computing element storage elements that are close (not necessarily at the same site) status supported protocols file statistics network information between this and other sites storage element site information site

25 The EDG Testbed Software - 25 EDG GRIS/GIIS Hierarchy u Information providers publish information to a local LDAP server known as a Grid Resource Information Server (GRIS) u Each country has a GIIS to which all of the site GIISs register u There is a top level datagrid GIIS to which all of the country GIISs register u Each Site has a Grid Information Index Server (GIIS) which acts as a single point of contact for all of the sites resources. The GRISs register with their site GIIS

26 The EDG Testbed Software - 26 EDG Information Providers u The EDG have produced information providers: n Site information n The Computing Element n The Storage Element n Network Monitoring u All of the EDG data objects are dynamic, they have a time stamp and a time to live (used by the cache mechanism) associated with them

27 The EDG Testbed Software - 27 Siteinfo in=siteinfo,Mds-Vo-name=ral-dev,Mds-Vo-name=uk,o=Grid objectClass: SiteInfo objectClass: DataGridTop objectClass: DynamicObject siteName: RALDEV sysAdminContact: grid.sysadmin@rl.ac.uk userSupportContact: grid.support@rl.ac.uk siteSecurityContact: grid.security@rl.ac.uk dataGridVersion: 1.2 installationDate: 20020704142800Z

28 The EDG Testbed Software - 28 Computing Element ceId=dev01.hepgrid.clrc.ac.uk:2119/jobmanager- pbs-M,hn=dev01.hepgrid.clrc.ac.uk,Mds-Vo- name=ral-dev,Mds-Vo-name=uk,o=Grid objectClass: DataGridTop objectClass: ComputingElement CEId: dev01.hepgrid.clrc.ac.uk:2119/jobmanager- pbs-M GlobusResourceContactString:dev01.hepgrid.clrc.ac. uk:2119/jobmanager- pbs:/O=Grid/O=UKHEP/CN=dev01.hepgrid.clrc.a c.uk GRAMVersion: ? Architecture: intel OpSys: RH 6.2 MinPhysicalMemory: 258 MinLocalDiskSpace: 2048 TotalCPUs: 1 FreeCPUs: 1 NumSMPs: 0 MinSPUProcessors: 0 MaxSPUProcessors: 0 TotalJobs: 0 RunningJobs: 0 IdleJobs: 0 MaxTotalJobs: 1 MaxRunningJobs: 1 WorstTraversalTime: 108000 EstimatedTraversalTime: 0 Active: TRUE Priority: 20 MaxCPUTime: 108000 MaxWallClockTime: 432000 AverageSI00: 300 MinSI00: 300 MaxSI00: 300 AuthorizedUser:/O=Grid/O=UKHEP/OU=hepgrid.cl rc.ac.uk/CN=Tim Eves AuthorizedUser:/O=Grid/O=UKHEP/OU=hepgrid.cl rc.ac.uk/CN=Tim Folkes RunTimeEnvironment: RALDEV AFSAvailable: FALSE OutboundIP: TRUE InboundIP: FALSE QueueName: M LRMSType: PBS LRMSVersion: OpenPBS_2.3

29 The EDG Testbed Software - 29 u Queries can be posed to the current Information and Monitoring Service using LDAP search commands u An LDAP search consists of the following components Querying the Information & Monitoring Service $ldapsearch\ -x\ -H ldap://lxshare0225.cern.ch:2135\ -b 'Mds-Vo-name=datagrid,o=grid\ 'objectclass=ComputingElment‘\ CEId FreeCPUs \ -s base|one|sub “simple” authentication uniform resource identifier base distinguished name for search filter attributes to be returned scope of the search specifying just the base object, one-level or the complete subtree

30 The EDG Testbed Software - 30 Querying the GRIS/GIIS Hierarchy u Mds-Vo-name=siteB,o=grid n This will look at all the data from siteB Mds-Vo-name =siteA Mds-Vo-name =siteD Mds-Vo-name =siteC Mds-Vo-name =siteB Mds-Vo-name =countryA Mds-Vo-name =countryB Mds-Vo-name =datagrid u Mds-Vo-name=countryA,o=grid n This will look at all the data from countryA u Mds-Vo-name=datagrid,o=grid n This will look at all the data u Mds-Vo-name=siteB,Mds-Vo- name=countryA,o=grid n This will look at all the data from siteB u Mds-Vo-name=siteB, Mds-Vo- name=countryA,Mds-Vo- name=datagrid,o=grid n This will look at all the data from siteB

31 The EDG Testbed Software - 31 The EDG WMS u The user interacts with GRID via a Workload Management System u The Goal of WMS is the distributed scheduling and resource management in a GRID environment. u What does it allow GRID users to do? To submit their jobs To execute them To get information about their status To retrieve their output u The WMS tries to optimize the usage of resources

32 The EDG Testbed Software - 32 WMS Components u WMS is currently composed of the following parts: 1.User Interface (UI) : access point for the user to the GRID 2.Resource Broker (RB) : the broker of GRID resources, performing the match-making 3.Job Submission System (JSS) : provides a reliable submission system 4.Information Index (II) : a specialized Globus GIIS (LDAP server) used by the Resource Broker as a filter to the information service (IS) to select resources 5.Logging and Bookkeeping services (LB) : store Job Info available for users to query

33 The EDG Testbed Software - 33 WMS UI Commands  dg-job-submit submits a job  dg-job-list-match lists resources matching a job description  dg-job-cancel cancels a given job  dg-job-status displays the status of the job (submitted, waiting, ready, scheduled, running, chkpt, done, outputready, aborted, cleared)  dg-job-get-output returns the job-output to the user  dg-job-get-logging-info displays logging information about submitted jobs  dg-job-id-info is a utility for the user to display job info in a formatted style

34 The EDG Testbed Software - 34 Example of UI Command Options  dg-job-submit –r –n -c -o -r the job is submitted by the RB directly to the computing element identified by -n an e-mail message containing basic information regarding the job (status and identification) is sent to the specified when the job enters one of the following status: DONE or ABORTED READY RUNNING -c the configuration file is pointed by the UI instead of the standard configuration file -o the generated dg_jobId is written in the  dg-job-status –i (or dg_jobId) -i the bookkeeping information about dg_jobId contained in the are displayed

35 The EDG Testbed Software - 35 Job Description Language (JDL) uMandatory for every single JDL file: Executable (contains the command name) Other attributes: InputSandbox OutputSandbox uMandatory for JDL file dealing with Data Management: ReplicaCatalog (contains the Replica Catalog Identifier) DataAccessProtocol (contains the protocol or the list of protocols which the application is able to speak with for accessing InputData on a given SE) If InputData contains at least one PFN and no LFNs, only DataAccessProtocol is mandatory. If InputData contains at least one LFN, both ReplicaCatalog and DataAccessProtocol are mandatory.

36 The EDG Testbed Software - 36 Example JDL File Executable = “gridTest”; InputData = “LF:testbed0-00019”; ReplicaCatalog = “ldap://sunlab2g.cnaf.infn.it:2010/ \ rc=WP2 INFN Test, dc=infn, dc=it”; DataAccessProtocol = “gridftp”; StdError = “stderr.log”; StdOutput = “stdout.log”; OutputSandbox = {“stderr.log”, “stdout.log”}; InputSandbox = {“home/joda/test/gridTest”}; Rank = “other.MaxCpuTime”; Requirements = other.Architecture==“INTEL” && \ other.OpSys==“LINUX” && other.FreeCpus >=4;

37 The EDG Testbed Software - 37 A Job Submission Example UI JDL Logging & Book-keeping (LB) Resource Broker (RB) Job Submission Service (JSS) Storage Element (SE) Compute Element CE) Information Service (IS) Replica Catalogue (RC)

38 The EDG Testbed Software - 38 A Job Submission Example UI JDL Logging & Book-keeping (LB) Resource Broker (RB) Job Submission Service (JSS) Storage Element (SE) Compute Element (CE) Information Service (IS) Replica Catalogue (RC) Job Submit Event Input Sandbox Job Status submitted

39 The EDG Testbed Software - 39 A Job Submission Example UI JDL Logging & Book-keeping (LB) Resource Broker (RB) Job Submission Service (JSS) Storage Element (SE) Compute Element (CE) Information Service (IS) Replica Catalogue (RC) Job Status submitted waiting

40 The EDG Testbed Software - 40 A Job Submission Example UI JDL Logging & Book-keeping (LB) Resource Broker (RB) Job Submission Service (JSS) Storage Element (SE) Compute Element (CE) Information Service (IS) Replica Catalogue (RC) Job Status submitted waiting ready

41 The EDG Testbed Software - 41 A Job Submission Example UI JDL Logging & Book-keeping (LB) Resource Broker (RB) Job Submission Service (JSS) Storage Element (SE) Compute Element (CE) Information Service (IS) Replica Catalogue (RC) Job Status submitted waiting ready BrokerInfo scheduled

42 The EDG Testbed Software - 42 A Job Submission Example UI JDL Logging & Book-keeping (LB) Resource Broker (RB) Job Submission Service (JSS) Storage Element (SE) Compute Element (CE) Information Service (IS) Replica Catalogue (RC) Job Status submitted waiting ready scheduled Input Sandbox running

43 The EDG Testbed Software - 43 A Job Submission Example UI JDL Logging & Book-keeping (LB) Resource Broker (RB) Job Submission Service (JSS) Storage Element (SE) Compute Element (CE) Information Service (IS) Replica Catalogue (RC) Job Status submitted waiting ready scheduled Job Status running

44 The EDG Testbed Software - 44 A Job Submission Example UI JDL Logging & Book-keeping Resource Broker Job Submission Service Storage Element ComputeElement Information Service Replica Catalogue submitted waiting ready scheduled running Job Status done Job Status

45 The EDG Testbed Software - 45 A Job Submission Example UI JDL Logging & Book-keeping Resource Broker Job Submission Service Storage Element ComputeElement Information Service Replica Catalogue submitted waiting ready scheduled running done Job Status outputready Output Sandbox

46 The EDG Testbed Software - 46 A Job Submission Example UI JDL Logging & Book-keeping (LB) Resource Broker (RB) Job Submission Service (JS) Storage Element (SE) Compute Element (CE) Information Service (IS) Replica Catalogue (RC) Output Sandbox cleared submitted waiting ready scheduled running done Job Status outputready

47 The EDG Testbed Software - 47 EDG Data Management Tools u Tools for n Locating data n Copying data n Managing and replicating data n Meta Data management u On EDG Testbed you have n EDG Replica Catalog n globus-url-copy (GridFTP) n EDG Replica Manager n Grid Data Mirroring Package (GDMP)

48 The EDG Testbed Software - 48 EDG Replica Catalog u Based upon the Globus LDAP Replica Catalog (will be replaced by RLS) u Stores LFN/PFN mappings and additional information (e.g. filesize): n Physical File Name (PFN): host + full path & and file name n Logical File Name (LFN): logical name that may be resolved to PFNs n LFN : PFN = 1 : n u Only files on storage elements may be registered u Each VO has a specific storage dir on an SE  Example PFN: lxshare0222.cern.ch/flatfiles/SE1/iteam/file1.dat host storage dir  LFN must be full path of file starting from storage dir LFN of above PFN: file1.dat

49 The EDG Testbed Software - 49 EDG Replica Catalog u API and command line tools n addLogicalFileName n getLogicalFileName n deleteLogicalFileName n getPhysicalFileName n addPhysicalFileName n deletePhysicalFileName n addLogicalFileAttribute n getLogicalFileAttribute n deleteLogicalFileAttribute http://cmsdoc.cern.ch/cms/grid/userguide/gdmp-3-0/node85.html

50 The EDG Testbed Software - 50 globus-url-copy u Low level tool for secure copying globus-url-copy :// \ :// u Main Protocols: n gsiftp – for secure transfer, only available on SE and CE n file – for accessing files stored on the local file system on e.g. UI, WN globus-url-copy file://`pwd`/file1.dat \ gsiftp://lxshare0222.cern.ch/ \ flatfiles/SE1/EDGTutorial/file1.dat

51 The EDG Testbed Software - 51 The EDG Replica Manager u Extends the Globus replica manager u Client side tool u Allows replication (copy) and registering of files in RC u Keeps RC consistent with stored data.

52 The EDG Testbed Software - 52 The Replica Manager APIs u (un)registerEntry(LogicalFileName lfn, FileName source) n Replica Catalogue operations only - no file transfer u copyFile(FileName source, FileName destination, String protocol) n allows for third-party transfer n transfer between: s two StorageElements or s ComputingElement and Storage Element s Space management policies under development

53 The EDG Testbed Software - 53 u copyAndRegisterFile(LogicalFileName lfn, FileName source, FileName destination, String protocol) n third-party transfer but : files can only be registered in Replica Catalogue if destination PFN contains a valid SE u replicateFile(LogicalFileName lfn, FileName source, FileName destination, String protocol) u deleteFile(LogicalFileName lfn, FileName source) The Replica Manager APIs

54 The EDG Testbed Software - 54 u based on CMS requirements for replicating Objectivity files for High Level Trigger studies u production prototype project for evaluating Grid technologies (especially Globus) u http://cern.ch/GDMP

55 The EDG Testbed Software - 55 Overview of Components EDG Replica Catalogue Site1 Site3 Site2 GDMP client

56 The EDG Testbed Software - 56 Subscription Model n All the sites that subscribe to a particular site get notified whenever there is an update in its catalog. Site 1 Site 3 Site 2 Subscriber list Subscriber list subscribe

57 The EDG Testbed Software - 57 Export / Import Catalogue n Export Catalog s information about the new files produced. s is published n Import Catalog s information about the files which have been published by other sites but not yet transferred locally s As soon as the file is transferred locally, it is removed from the import catalogue. n Possible to pull the information about new files into your import catalogue. Site 1 Site 3 export catalog import catalog Site 2 export catalog 1)register, publish new files 2) transfer files 1) get info about new files 3) delete files

58 The EDG Testbed Software - 58 Usage u gdmp_ping n Ping a GDMP server and get its status u gdmp_host_subscribe n first thing to be done by a site u gdmp_register_local_file n Registers a file in local file catalogue but NOT in Replica Catalogue (RC) u gdmp_publish_catalogue n send information of newly created files to subscribed hosts (no real data transfer) – update RC u gdmp_replicate_get - gdmp_replicate_put n get/put all the files from the import catalogue – update RC u gdmp_remove_local_file n Delete a local file and update RC u gdmp_get_catalogue n Get remote catalogue contents – for error recovery

59 The EDG Testbed Software - 59 GDMP vs. EDG Replica Manager u GDMP n Replicates sets of files n Replication between SEs n Mass storage interface n File size as logical attribute n Subscription model n Event notification n CRC file size check n Support for Objectivity u Replica Manager n Replicates single files n Replication between SEs, CEs to SE.


Download ppt "Introduction to the current EDG Testbed Software Krak ó w, December 2002 Steve Fisher – on behalf of The European."

Similar presentations


Ads by Google