E. Ronchieri – n° 1 EDG release 2 Elisabetta Ronchieri INFN CNAF - DataGrid WP1 – Workload Management System


1 E. Ronchieri – n° 1 EDG release 2 Elisabetta Ronchieri INFN CNAF - DataGrid WP1 – Workload Management System elisabetta.ronchieri@cnaf.infn.it

2 Outline
- What is Grid?
- Grid Projects: focus on the EU DataGrid project
- Selected Areas + Technologies
  - Security
  - Information and Monitoring Services
  - Storage Management
  - Data Management
  - Workload Management
- Installation

3 Grid Vision
- Researchers, Grid middleware, scientific instruments and experiments, and resources are the major players
  - Researchers interact with colleagues, and share and access data
  - Grid middleware provides part of the software infrastructure
  - Experiments provide huge amounts of data
- Grid is:
  - A special form of distributed computing
    - Computing and storage resources are distributed over several sites
    - Sites are typically connected via wide-area network links
- It is best applied to applications that have the following features:
  - A distributed user community
  - A need for lots of computing power (Computational Grid)
  - A need for lots of storage capacity (Data Grid)
- Currently, it is applied mainly in computing sciences

4 Grid Today
- Many steps remain (especially to make the Grid accessible to a conventional user)
- Considerable expertise is still required (especially to use Grid technology efficiently)
- There is no single Grid (several projects, …)
- Grids need to work together towards standardization
  - Global Grid Forum (GGF, http://www.ggf.org)
    - Its mission is to promote and develop Grid technologies and applications
    - There are many working groups in several different areas (Scheduling and Resource Management, Security, …)

6 Major US & European Grid Projects, many with strong HEP participation
- US projects: The Virtual Data Toolkit (VDT)
- European projects: The DataGrid Toolkit
- Many national and regional Grid projects: GridPP (UK), INFN-grid (I), NorduGrid, Dutch Grid, …

8 EDG Globus-based middleware architecture
- EDG is built on emerging Grid technology
- Start: Jan 1, 2001; End: Dec 31, 2003
- Current EDG architectural functional blocks:
  - Basic Services provided by Globus 2.2.x (such as authentication, authorization, info providers, replica catalog, secure file transfers) and Condor (such as job submission, effective job cancellation, event monitoring, and support for monitoring)
  - Higher Level EDG Middleware developed within EDG
  - Applications (such as HEP, BIO, and EO)
[Figure: layered architecture, bottom to top: OS & Net services; Basic Services (Globus 2.2.x and Condor); High level Grid middleware; LHC VOs common application layer and other apps; Specific application layer (ALICE, ATLAS, CMS, LHCb) and other apps]

10 Selected Areas for Grid Technologies in EU DataGrid (and partly Globus)
- Security
  - All access to and interaction with Grid resources must be done in a secure way
  - Major technologies: PKI (Public Key Infrastructure) and GSS
- Information and Monitoring Services
  - Before you start using the Grid, you need to know what resources exist and which ones you can use
  - Major technologies: LDAP-based or Web Service approach
- Data Management
  - Main focus of a Data Grid
  - Major technologies: LDAP-based or Web Service approach
- Workload Management
  - Submit your application to the Grid, where it is executed

12 Security in EDG
- Why:
  - User jobs might access several remote resources
  - Users need to be
    - Authenticated (Who am I?)
    - Authorized (What can I do?)
- Mainly uses:
  - The security infrastructure provided by Globus
    - Based on PKI (Public Key Infrastructure) and GSS

13 Grid Security Requirements
User View
1) Easy to use
2) Single sign-on
3) Run applications
Resource Owner View
1) Specify local access control
2) Auditing, accounting, etc.
3) Integration with local systems: Kerberos, AFS, license manager

14 Grid Security Infrastructure (GSI)
- Extensions to existing standard protocols & APIs
  - Standards: SSL/TLS, X.509 & CA, GSS
  - Extensions for single sign-on and delegation
- Globus Toolkit reference implementation of GSI
  - SSLeay/OpenSSL + GSS-API + delegation + single sign-on

15 Example of GSI usage
[Figure: diagram with labels Site A (Unix), Site B, Site N (Unix), User, Computer, Storage system, Proxy Credential, Restricted Proxy, Grid Service, GridFTP Server, Remote file access request]

16 VO-LDAP Architecture
[Figure: a VO Directory rooted at o=xyz, dc=eu-datagrid, dc=org, with branches ou=People, ou=Testbed1 and ou=??? and entries CN=Mario Rossi, CN=Franz Elmer and CN=John Smith (authentication certificates); mkgridmap, together with the local users and ban list, produces the grid-mapfile]
- Adopted by
  - DataGrid Testbed0 (2001/02)
  - DataGrid Testbed1 (2003)
  - DataTAG Testbed (2003)
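
The idea behind mkgridmap can be sketched roughly as follows. This is only an illustration, not the actual tool: the function name, the certificate subjects and the local account name are hypothetical, and the only detail taken as given is the usual grid-mapfile line format (a quoted certificate subject followed by a local account).

```python
def make_gridmap(members, local_account, banned=()):
    """Return grid-mapfile lines for every VO member that is not banned.

    `members` and `banned` are certificate subject strings; in a real
    deployment they would come from the VO LDAP directory and the site's
    ban list (both are plain Python lists here, as an assumption).
    """
    banned_set = set(banned)
    lines = []
    for subject in members:
        if subject in banned_set:
            continue  # the ban list overrides VO membership
        lines.append(f'"{subject}" {local_account}')
    return lines


# Hypothetical subjects reusing the names shown on the slide.
vo_members = [
    "/O=Example/CN=Mario Rossi",
    "/O=Example/CN=Franz Elmer",
    "/O=Example/CN=John Smith",
]
ban_list = ["/O=Example/CN=Franz Elmer"]

for line in make_gridmap(vo_members, "testbed001", ban_list):
    print(line)
```

With these inputs, only the two non-banned subjects end up mapped to the hypothetical `testbed001` account.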

18 Grid Information and Monitoring Services

                      MDS 2.x                                     R-GMA
Data model            LDAP (hierarchical)                         Relational
Communication         LDAP                                        HTTP
Information storage   LDAP-based backends re-written by Globus    Relational database
Queries               LDAP queries                                SQL queries
Components            GRIS, GIIS (diagram labels: SE, CE, WN)     Producer, Consumer, Registry

Example LDAP query (MDS 2.x):

    ldapsearch -x -H ldap://lxshare0225.cern.ch:2135 \
      -b 'Mds-Vo-name=datagrid,o=grid' \
      'objectclass=StorageElement' \
      seId SEsize

Example SQL query (R-GMA):

    SELECT * FROM StorageElement
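
To make the relational view concrete, here is a small sketch that runs the same kind of SQL query against an in-memory SQLite table. SQLite is only a stand-in here and says nothing about how R-GMA is implemented; the column names reuse the `seId` and `SEsize` attributes from the LDAP example, and the rows are invented.

```python
import sqlite3


def storage_elements(rows):
    """Load (seId, SEsize) rows into a table and return SELECT * results."""
    conn = sqlite3.connect(":memory:")
    try:
        # Table and column names follow the slide; the schema is an assumption.
        conn.execute("CREATE TABLE StorageElement (seId TEXT, SEsize INTEGER)")
        conn.executemany("INSERT INTO StorageElement VALUES (?, ?)", rows)
        return conn.execute("SELECT * FROM StorageElement").fetchall()
    finally:
        conn.close()


# Invented sample data, for illustration only.
sample = [("se01.example.org", 500), ("se02.example.org", 1200)]
for se_id, se_size in storage_elements(sample):
    print(se_id, se_size)
```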

19 Grid Information and Monitoring Services in EDG
- EDG release 1.x is based entirely on MDS 2.x
  - Because of stability problems with this component, a pure LDAP server has recently been deployed in front of a top-level GIIS
- EDG release 2.x is based on both MDS 2.x and R-GMA
  - Since the GIS is a vital service for the WM, the Broker will rely on MDS 2.x until R-GMA proves to be reliable

21 Interfaces to SE
- First release of the SE control system
- The three interfaces to the outside world are:
  - Data transfer
    - GridFTP will be used to transfer files over the WAN, and the files will be available to local nodes via NFS
  - Information
    - Existing MDS information providers will be extended to provide the extra information in the GLUE storage schema
  - Control
    - Functions such as reservation for reading and writing, metadata modification, and access via GridFTP
- It is an implementation of the Storage Resource Management (SRM) specification (http://sdm.lbl.gov/srm-wg)
- The SE control interface to a generic MSS has already been tailored for CERN and RAL
  - Work is under way with IN2P3, WP10 and WP9 to adapt it to their MSS

23 Naming Schemes
- GUID: Global Unique Identifier
  - guid:f81d4fae-7dec-11d0-a765-00a0c91e6bf6
- LFN: Logical File Name
  - lfn://event20030612
- SFN: Storage File Name
  - sfn://ibm139.cnaf.infn.it/edg/storageelement/dev/wpsix/pippo
    - Host + path + filename
[Figure: one GUID linked to LFN1, LFN2, LFN3 and to SFN1, SFN2, SFN3]
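
The "host + path + filename" structure of an SFN can be illustrated with a short sketch that splits the example SFN using Python's standard URL parser. The helper name `split_sfn` is hypothetical and is not part of any EDG tool.

```python
import posixpath
from urllib.parse import urlsplit


def split_sfn(sfn):
    """Split an SFN into (host, directory path, filename)."""
    parts = urlsplit(sfn)
    if parts.scheme != "sfn" or not parts.netloc:
        raise ValueError(f"not an SFN: {sfn!r}")
    directory, filename = posixpath.split(parts.path)
    return parts.netloc, directory, filename


host, directory, filename = split_sfn(
    "sfn://ibm139.cnaf.infn.it/edg/storageelement/dev/wpsix/pippo"
)
print(host)       # ibm139.cnaf.infn.it
print(directory)  # /edg/storageelement/dev/wpsix
print(filename)   # pippo
```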

24 Replication Services: EDG Replica Manager
[Figure: the edg-replica-manager client and the Replica Manager, with the services Replica Metadata Catalog, Replica Location Service, File Transfer and Optimization]
- RMC (Replica Metadata Catalog): used for querying and assigning LFNs
- RLS (Replica Location Service): used for locating replicas and assigning SFNs
- GridFTP: used for transferring files

25 VO Replication Services Architecture
[Figure: each site hosts a Replica Manager, Storage Element, Computing Element and Optimiser; the diagram also shows the Resource Broker and User Interface. The Replica Metadata Catalog maps LFNs -> GUID; the Replica Location Service, with its Local Replica Catalog, maps GUID -> SFNs]
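
The two-step lookup (LFN to GUID, then GUID to SFNs) can be sketched with two plain dictionaries standing in for the catalogs. This is only a toy model: the real catalogs are network services, the function name is hypothetical, and the second replica host is invented.

```python
def resolve_replicas(lfn, metadata_catalog, location_service):
    """Resolve an LFN to its list of SFNs via the GUID.

    `metadata_catalog` plays the role of the Replica Metadata Catalog
    (LFN -> GUID) and `location_service` the Replica Location Service
    (GUID -> SFNs); both are ordinary dicts in this sketch.
    """
    guid = metadata_catalog[lfn]  # raises KeyError for an unknown LFN
    return location_service.get(guid, [])


# Sample entries built from the identifiers on the naming-scheme slide.
rmc = {"lfn://event20030612": "guid:f81d4fae-7dec-11d0-a765-00a0c91e6bf6"}
rls = {
    "guid:f81d4fae-7dec-11d0-a765-00a0c91e6bf6": [
        "sfn://ibm139.cnaf.infn.it/edg/storageelement/dev/wpsix/pippo",
        "sfn://se.example.org/edg/storageelement/dev/wpsix/pippo",  # invented
    ]
}

for sfn in resolve_replicas("lfn://event20030612", rmc, rls):
    print(sfn)
```

Keeping the GUID as the pivot means several LFNs can point at the same file and replicas can be added or removed without touching the logical names.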

27 Review of WMS architecture
- WMS architecture reviewed
  - To apply the lessons learned and address the shortcomings that emerged with the first release of the software
  - To address the scalability problems
  - To increase the reliability of the system
  - To favor interoperability with other Grid frameworks, by allowing WP1 modules (e.g. the RB) to be used outside the EDG WMS as well

28 WMS Revised Architecture
[Figure: diagram with components UI; RB node (Network Server, Workload Manager, Match-Maker/Broker, Job Adapter, Job Controller - CondorG, Log Monitor, RB storage); Replica Manager; Information Service (CE characteristics & status, SE characteristics & status); Logging & Bookkeeping]

29 Improvements
- Duplication of persistent job-related information avoided
  - LB is the only repository of job information
  - Multiple LB servers per RB are possible (to avoid bottlenecks)
- Techniques to recover quickly from failures
  - E.g. communication among WMS components is much more reliable (done via persistent queues in the file system)
- Also less exposed to memory leaks (which come not only from EDG software)
- Flexibility and interoperability increased
  - E.g. RB-Matchmaker as a pluggable module
  - GLUE Schema compliance
- Other enhancements in design and implementation
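
As an illustration of why a file-system queue helps recovery, here is a minimal sketch of a persistent queue in which each message is a file, so pending messages survive a restart of either component. It is not the WMS implementation: the class name and file layout are hypothetical, and it assumes a single writer and a single reader.

```python
import json
import os
import tempfile


class FileQueue:
    """FIFO queue that stores each message as a JSON file in a directory."""

    def __init__(self, directory):
        self.directory = directory
        os.makedirs(directory, exist_ok=True)

    def _pending(self):
        # Zero-padded numeric names sort in arrival order.
        return sorted(n for n in os.listdir(self.directory) if n.endswith(".msg"))

    def put(self, message):
        pending = self._pending()
        next_id = int(pending[-1].split(".")[0]) + 1 if pending else 0
        final = os.path.join(self.directory, f"{next_id:012d}.msg")
        tmp = final + ".tmp"
        with open(tmp, "w") as fh:
            json.dump(message, fh)
            fh.flush()
            os.fsync(fh.fileno())
        # Atomic rename: a reader never sees a half-written message.
        os.replace(tmp, final)

    def get(self):
        pending = self._pending()
        if not pending:
            return None
        path = os.path.join(self.directory, pending[0])
        with open(path) as fh:
            message = json.load(fh)
        os.remove(path)
        return message


queue_dir = tempfile.mkdtemp()
FileQueue(queue_dir).put({"job": "job-1", "action": "submit"})
# A new object on the same directory models a component that has restarted.
print(FileQueue(queue_dir).get())
```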

30 New functionalities
- User APIs
  - Including a Java GUI
- Trivial job checkpointing service
  - The user can save the state of the job (as defined by the application) from time to time
  - A job can be restarted from an intermediate (i.e. previously saved) job state
- Gang-matching
  - Allows both CE and SE information to be taken into account in the matchmaking
  - For example, to require a job to run on a CE close to an SE with enough space
- Support for parallel MPI jobs
- Support for interactive jobs
  - Jobs running on a CE worker node where a channel to the submitting (UI) node is available for the standard streams (by integrating the Condor Bypass software)
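
The gang-matching example (a CE close to an SE with enough space) can be sketched as a joint filter over CE and SE records. The record fields `close_ses` and `free_space`, the host names and the function name are all hypothetical; the actual matchmaking is expressed in the job description and evaluated by the broker.

```python
def gang_match(required_space, computing_elements, storage_elements):
    """Return (CE, SE) pairs where the SE is close to the CE and has room.

    `computing_elements` is a list of dicts with a name and the names of
    their close SEs; `storage_elements` maps an SE name to its attributes.
    """
    matches = []
    for ce in computing_elements:
        for se_name in ce["close_ses"]:
            se = storage_elements.get(se_name)
            if se is not None and se["free_space"] >= required_space:
                matches.append((ce["name"], se_name))
    return matches


# Invented resources, for illustration only.
ces = [
    {"name": "ce01.example.org", "close_ses": ["se01.example.org"]},
    {"name": "ce02.example.org", "close_ses": ["se02.example.org"]},
]
ses = {
    "se01.example.org": {"free_space": 50},
    "se02.example.org": {"free_space": 900},
}

print(gang_match(200, ces, ses))
```

Here only `ce02.example.org` qualifies, because its close SE is the only one with at least 200 units of free space.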

32 Installation
- EDG SW:
  - Is delivered via RPMs
  - Is maintained in a CVS repository
- Globus + Condor SW:
  - Are provided via VDT (delivered as RPMs)
  - Upgraded to Globus 2.2.4 and Condor 6.5.1
- LCFGng:
  - Is an automatic installation tool based on RPMs
  - Is also used for configuring the middleware components
  - Works on RH 6.2 and RH 7.3
- Sites:
  - Development testbed

33 EDG Deploying
- R-GMA, RM, RLS, ROS, RMC, and WMS + GLUE schema
- EDG release 2.0
  - A temporary tag contains the functionalities for EDG 2.0 (deployed at CERN, NIKHEF, CNAF, and RAL)
  - Not officially tagged as EDG 2.0 until the basic functionalities work (e.g. job submission, data transfers, etc.)
  - The first EDG 2.0 tag is hoped for at the end of this week
  - The move to gcc 3.2.2 for all software is planned for this September
  - The integration of more functionalities is entirely at the mercy of LCG

34 Conclusion
- Many improvements and many new functionalities
- Preliminary results are encouraging
- A more comprehensive evaluation will come from real tests performed by real users on the large-scale testbed

