1
The EU DataGrid - Introduction
The European DataGrid Project Team
http://www.eu-datagrid.org/
Peter.Kunszt@cern.ch
2
The EDG Intro – Tutorial - n° 2
Contents
- The EDG Project scope
- Achievements
- EDG structure
- Middleware Workpackages: Goals, Achievements, Issues
- Testbed Release Plans
3
Glossary
RB    Resource Broker
VO    Virtual Organisation
CE    Computing Element
SE    Storage Element
GDMP  Grid Data Mirroring Package
LDAP  Lightweight Directory Access Protocol
LCFG  Local Configuration System
LRMS  Local Resource Management System (batch; PBS, LSF)
WMS   Workload Management System
LFN   Logical File Name (like MyMu.dat)
SFN   Site File Name (like storageEl1.cern.ch:/home/data/MyMu.dat)
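The LFN/SFN distinction in the glossary can be illustrated with a small sketch: one logical name resolves to one or more site-specific copies. This is a toy in-memory model, not the EDG Replica Catalog interface; the class name, method names and the second SFN (`storageEl2.example.org`) are assumptions made up for illustration.

```python
from collections import defaultdict


class ReplicaCatalog:
    """Toy catalog (hypothetical API): one LFN maps to many SFNs."""

    def __init__(self):
        self._replicas = defaultdict(set)

    def register(self, lfn, sfn):
        """Record that a physical copy of `lfn` exists at `sfn`."""
        self._replicas[lfn].add(sfn)

    def lookup(self, lfn):
        """Return all known SFNs for `lfn`, sorted; empty list if unknown."""
        return sorted(self._replicas.get(lfn, ()))


def split_sfn(sfn):
    """Split 'host:/path' into (host, path), following the glossary example."""
    host, _, path = sfn.partition(":")
    return host, path


catalog = ReplicaCatalog()
catalog.register("MyMu.dat", "storageEl1.cern.ch:/home/data/MyMu.dat")
# Hypothetical second replica, not from the slides.
catalog.register("MyMu.dat", "storageEl2.example.org:/grid/MyMu.dat")
replicas = catalog.lookup("MyMu.dat")
```

A job would refer to `MyMu.dat` only by its LFN; which SFN is actually read is decided at run time from the catalog contents.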
4
The Grid vision
Flexible, secure, coordinated resource sharing among dynamic collections of individuals, institutions, and resources
- From “The Anatomy of the Grid: Enabling Scalable Virtual Organizations”
Enable communities (“virtual organizations”) to share geographically distributed resources as they pursue common goals, assuming the absence of:
- central location,
- central control,
- omniscience,
- existing trust relationships.
5
Grids: Elements of the Problem
Resource sharing
- Computers, storage, sensors, networks, …
- Sharing always conditional: issues of trust, policy, negotiation, payment, …
Coordinated problem solving
- Beyond client-server: distributed data analysis, computation, collaboration, …
Dynamic, multi-institutional virtual orgs
- Community overlays on classic org structures
- Large or small, static or dynamic
6
EDG overview: goals
DataGrid is a project funded by the European Union whose objective is to exploit and build the next generation computing infrastructure providing intensive computation and analysis of shared large-scale databases.
Enable data intensive sciences by providing world wide Grid test beds to large distributed scientific organisations (“Virtual Organisations”, VOs)
Start (kick off): Jan 1, 2001
End: Dec 31, 2003
Applications/End User Communities: HEP, Earth Observation, Biology
Specific Project Objectives:
- Middleware for fabric & grid management
- Large scale testbed
- Production quality demonstrations
- Collaborate and coordinate with other projects (Globus, Condor, CrossGrid, DataTAG, etc.)
- Contribute to Open Standards and international bodies (GGF, Industry & Research forum)
7
DataGrid Main Partners
- CERN – International (Switzerland/France)
- CNRS – France
- ESA/ESRIN – International (Italy)
- INFN – Italy
- NIKHEF – The Netherlands
- PPARC – UK
8
Assistant Partners
Research and Academic Institutes
- CESNET (Czech Republic)
- Commissariat à l'énergie atomique (CEA) – France
- Computer and Automation Research Institute, Hungarian Academy of Sciences (MTA SZTAKI)
- Consiglio Nazionale delle Ricerche (Italy)
- Helsinki Institute of Physics – Finland
- Institut de Fisica d'Altes Energies (IFAE) – Spain
- Istituto Trentino di Cultura (IRST) – Italy
- Konrad-Zuse-Zentrum für Informationstechnik Berlin – Germany
- Royal Netherlands Meteorological Institute (KNMI)
- Ruprecht-Karls-Universität Heidelberg – Germany
- Stichting Academisch Rekencentrum Amsterdam (SARA) – Netherlands
- Swedish Research Council – Sweden
Industrial Partners
- Datamat (Italy)
- IBM-UK (UK)
- CS-SI (France)
9
Project Schedule
Project started on 1/Jan/2001
TestBed 0 (early 2001)
- International test bed 0 infrastructure deployed
  - Globus 1 only, no EDG middleware
TestBed 1 (now)
- First release of EU DataGrid software to defined users within the project:
  - HEP experiments (WP 8), Earth Observation (WP 9), Biomedical applications (WP 10)
Successful Project Review by EU: March 1st 2002
TestBed 2 (October 2002)
- Builds on TestBed 1 to extend facilities of DataGrid
TestBed 3 (March 2003) & 4 (September 2003)
Project stops on 31/Dec/2003
10
EDG Highlights
The project is up and running!
- All 21 partners are now contributing at contractual level
- Total of ~60 man years for first year
All EU deliverables (40, >2000 pages) submitted
- In time for the review according to the contract technical annex
First test bed delivered with real production demos
All deliverables (code & documents) available via www.edg.org
- http://eu-datagrid.web.cern.ch/eu-datagrid/Deliverables/default.htm
- Requirements, surveys, architecture, design, procedures, testbed analysis etc.
11
DataGrid work packages
The EDG collaboration is structured in 12 Work Packages
- WP1: Work Load Management System
- WP2: Data Management
- WP3: Grid Monitoring / Grid Information Systems
- WP4: Fabric Management
- WP5: Storage Element
- WP6: Testbed and demonstrators – Production quality International Infrastructure
- WP7: Network Monitoring
- WP8: High Energy Physics Applications
- WP9: Earth Observation
- WP10: Biology
- WP11: Dissemination
- WP12: Management
12
Objectives for the first year of the project
Collect requirements for middleware
- Take into account requirements from application groups
Survey current technology
- For all middleware
Core Services testbed
- Testbed 0: Globus (no EDG middleware)
First Grid testbed release
- Testbed 1: first release of EDG middleware
WP1: workload
- Job resource specification & scheduling
WP2: data management
- Data access, migration & replication
WP3: grid monitoring services
- Monitoring infrastructure, directories & presentation tools
WP4: fabric management
- Framework for fabric configuration management & automatic software installation
WP5: mass storage management
- Common interface for Mass Storage Systems
WP7: network services
- Network services and monitoring
13
DataGrid Architecture
[Figure: layered architecture diagram. Layer and box labels recovered from the slide:]
Local Computing: Local Application, Local Database
Grid Application Layer: Data Management, Job Management, Metadata Management, Object to File Mapping
Collective Services: Information & Monitoring, Replica Manager, Grid Scheduler
Underlying Grid Services: Computing Element Services, Authorization Authentication and Accounting, Replica Catalog, Storage Element Services, SQL Database Services
Fabric services: Configuration Management, Node Installation & Management, Monitoring and Fault Tolerance, Resource Management, Fabric Storage Management
Other labels: Grid, Fabric, Logging & Book-keeping
14
EDG Interfaces
[Figure: the DataGrid architecture diagram (Grid Application Layer, Collective Services, Underlying Grid Services, Fabric services), surrounded by the external entities it interfaces with:]
- Scientists
- Application Developers
- System Managers
- Certificate Authorities
- User Accounts
- Computing Elements
- Storage Elements
- Operating Systems
- File Systems
- Batch Systems (PBS, LSF)
- Mass Storage Systems (HPSS, Castor)
15
WP1: Work Load Management
Goals
- Maximize use of resources by efficient scheduling of user jobs
Achievements
- Analysis of work-load management system requirements & survey of existing mature implementations Globus & Condor (D1.1)
- Definition of architecture for scheduling & resource management (D1.2)
- Development of "super scheduling" component using application data and computing elements requirements
Issues
- Integration with software from other WPs
- Advanced job submission facilities
Current components
- Job Description Language
- Resource Broker
- Job Submission Service
- Information Index
- User Interface
- Logging & Bookkeeping Service
[Figure: DataGrid architecture diagram]
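The "super scheduling" idea above, selecting a Computing Element from both the job's resource requirements and the location of its input data, can be sketched as a filter-then-rank step. This is only an illustration under stated assumptions: the attribute names (`free_cpus`, `max_wall_minutes`, `close_storage`), the ranking rule (most free CPUs first) and all site names are hypothetical, and it does not reproduce the actual Resource Broker or Job Description Language.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ComputingElement:
    name: str
    free_cpus: int
    max_wall_minutes: int
    close_storage: frozenset  # names of Storage Elements close to this CE


@dataclass(frozen=True)
class Job:
    min_cpus: int
    wall_minutes: int
    input_storage: str  # Storage Element holding the job's input data


def match(job, elements):
    """Keep CEs that satisfy the job, then rank by free CPUs (descending)."""
    candidates = [
        ce for ce in elements
        if ce.free_cpus >= job.min_cpus
        and ce.max_wall_minutes >= job.wall_minutes
        and job.input_storage in ce.close_storage
    ]
    return sorted(candidates, key=lambda ce: ce.free_cpus, reverse=True)


# Hypothetical resource snapshot, standing in for the Information Index.
ces = [
    ComputingElement("ce-a", 4, 600, frozenset({"se-1"})),
    ComputingElement("ce-b", 16, 600, frozenset({"se-1"})),
    ComputingElement("ce-c", 32, 600, frozenset({"se-2"})),  # data not close
    ComputingElement("ce-d", 64, 30, frozenset({"se-1"})),   # queue too short
]
job = Job(min_cpus=2, wall_minutes=120, input_storage="se-1")
ranked = [ce.name for ce in match(job, ces)]
```

Filtering on data proximity before ranking reflects the slide's point that scheduling uses application data as well as computing element requirements; a real broker would likely weigh many more attributes.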
16
WP2: Data Management
Goals
- Coherently manage and share petabyte-scale information volumes in high-throughput production-quality grid environments
Achievements
- Survey of existing tools and technologies for data access and mass storage systems (D2.1)
- Definition of architecture for data management (D2.2)
- Deployment of Grid Data Mirroring Package (GDMP) in testbed 1
- Close collaboration with Globus, PPDG/GriPhyN & Condor
- Working with GGF on standards
Issues
- Security: clear mechanisms handling authentication and authorization
Current components
- GDMP
- Replica Catalog
- Replica Manager
- Spitfire
[Figure: DataGrid architecture diagram]
17
WP3: Grid Monitoring Services
Goals
- Provide information system for discovering resources and monitoring status
Achievements
- Survey of current technologies (D3.1)
- Coordination of schemas in testbed 1
- Development of Ftree caching backend based on OpenLDAP (Lightweight Directory Access Protocol) to address shortcomings in MDS v1
- Design of Relational Grid Monitoring Architecture (R-GMA) (D3.2), to be further developed with GGF
- GRM and PROVE adapted to grid environments to support end-user application monitoring
Issues
- MDS vs. R-GMA
Components
- MDS/Ftree
- R-GMA
- GRM/PROVE
[Figure: DataGrid architecture diagram]
18
WP4: Fabric Management
Goals
- Manage clusters (~thousands) of nodes
Achievements
- Survey of existing tools, techniques and protocols (D4.1)
- Defined an agreed architecture for fabric management (D4.2)
- Initial implementations deployed at several sites in testbed 1
Issues
- How to ensure the node configurations are consistent and handle updates to the software suites
Components
- LCFG
- PBS & LSF info providers
- Image installation
- Config. Cache Mgr
[Figure: DataGrid architecture diagram]
19
WP5: Mass Storage Management
Goals
- Provide common user and data export/import interfaces to existing local mass storage systems
Achievements
- Review of Grid data systems, tape and disk storage systems and local file systems (D5.1)
- Definition of Architecture and Design for DataGrid Storage Element (D5.2)
- Collaboration with Globus on GridFTP/RFIO
- Collaboration with PPDG on control API
- First attempt at exchanging Hierarchical Storage Manager (HSM) tapes
Issues
- Scope and requirements for storage element
- Inter-working with other Grids
Components
- Storage Element info. providers
- RFIO
- MSS staging
[Figure: DataGrid architecture diagram]
20
WP7: Network Services
Goals
- Review the network service requirements for DataGrid
- Establish and manage the DataGrid network facilities
- Monitor the traffic and performance of the network
- Deal with the distributed security aspects
Achievements
- Analysis of network requirements for testbed 1 & study of available network physical infrastructure (D7.1)
- Use of European backbone GEANT since Dec. 2001
- Initial network monitoring architecture defined (D7.2) and first tools deployed in testbed 1
- Collaboration with Dante & DataTAG
- Working with GGF (Grid High Performance Networks) & Globus (monitoring/MDS)
Issues
- Resources for study of security issues
- End-to-end performance for applications depends on a complex combination of components
Components
- Network monitoring tools: PingER, Udpmon, Iperf
[Figure: DataGrid architecture diagram]
21
WP6: TestBed Integration
Goals
- Deploy testbeds for the end-to-end application experiments & demos
- Integrate successive releases of the software components
Achievements
- Integration of EDG software release 1.0 and deployment
- Working implementation of multiple Virtual Organisations (VOs) & basic security infrastructure
- Definition of acceptable usage contracts and creation of Certification Authorities group
Issues
- Procedures for software integration
- Test plan for software release
- Support for production-style usage of the testbed
Components
- Globus packaging & EDG config
- Build tools
- End-user documents
[Figure: DataGrid architecture diagram, with additional labels "WP6 additions to Globus", "Globus" and "EDG release"]
22
Grid aspects covered by EDG testbed 1
- VO servers: LDAP directory for mapping users (with certificates) to correct VO
- Storage Element: Grid-aware storage area, situated close to a CE
- User Interface: Submit & monitor jobs, retrieve output
- Replica Manager: Replicates data to one or more CEs
- Job Submission Service: Manages submission of jobs to Resource Broker
- Replica Catalog: Keeps track of multiple data files “replicated” on different CEs
- Information index: Provides info about grid resources via GIIS/GRIS hierarchy
- Information & Monitoring: Provides info on resource utilization & performance
- Resource Broker: Uses Info Index to discover & select resources based on job requirements
- Grid Fabric Mgmt: Configures, installs & maintains grid software packages and environment
- Logging and Bookkeeping: Collects resource usage & job status
- Network performance, security and monitoring: Provides efficient network transport, security & bandwidth monitoring
- Computing Element: Gatekeeper to a grid computing resource
- Testbed admin.: Certificate authorities, user registration, usage policy etc.
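The role of the VO servers listed above, mapping a user's certificate to the correct VO, can be sketched with a plain dictionary lookup. This is a minimal stand-in for the LDAP directory, not its schema or query interface; the VO names, certificate subject strings and function names are all hypothetical.

```python
# Hypothetical VO membership table: VO name -> set of certificate subjects.
VO_MEMBERS = {
    "hep-demo": {"/O=Grid/O=Example/CN=Alice Example"},
    "biomed-demo": {
        "/O=Grid/O=Example/CN=Bob Example",
        "/O=Grid/O=Example/CN=Alice Example",
    },
}


def vos_for_subject(subject, members=VO_MEMBERS):
    """Return the sorted list of VOs whose member list contains `subject`."""
    return sorted(vo for vo, subjects in members.items() if subject in subjects)


def is_member(subject, vo, members=VO_MEMBERS):
    """True if the certificate subject belongs to the given VO."""
    return subject in members.get(vo, set())


alice = "/O=Grid/O=Example/CN=Alice Example"
alice_vos = vos_for_subject(alice)
```

A site could consult such a mapping before accepting a job, so that access decisions are made per VO and not per individual user.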
23
Tasks for the WP6 integration team
- Testing and integration of the Globus package
- Exact definition of RPM lists (components) for the various testbed machine profiles (CE service, RB, UI, SE service, NE, WN); check dependencies
- Perform preliminary, centrally managed (CERN) tests on EDG middleware before giving the green light for deployment across EDG testbed sites
- Provide and update end user documentation for installers/site managers, developers and end users
- Define EDG release policies and coordinate the integration team staff with the various Work Package managers, keeping inter-coordination high
- Assign the reported bugs to the corresponding developers/site managers (BugZilla)
- Complete support for the iTeam testing VO
24
EDG overview: Middleware release schedule
Planned intermediate release schedule
- Release 1.1: January 2002
- Release 1.2: March 2002
- Release 1.3: May 2002
- Release 1.4: July 2002
Similar schedule for 2003
Each release includes
- feedback from use of previous release by application groups
- planned improvements/extension by middleware WPs
- use of WP6 software infrastructure
- feeds into architecture group
[Slide annotations next to the schedule: July, 1.1.3, Internal, August]
25
Release Plan details
Current release EDG 1.1.4
- Deployed on testbed under RedHat 6.2
Finalising build of EDG 1.2 (now)
- GDMP 3.0
- GSI-enabled RFIO client and server
EDG 1.3 (internal)
- Build using autobuild tools, to ease future porting
- Support for MPI on single site
EDG 1.4 (August)
- Support RH 6.2 & 7.2
- Basic support for interactive jobs
- Integration of Condor DAGman
- Use MDS 2.2 with first GLUE schema
EDG 2.0 (Oct)
- Still based on Globus 2.x (pre-OGSA)
- Use updated GLUE schema
- Job partitioning & check-pointing
- Advanced reservation/co-allocation
See http://edms.cern.ch/document/333297 for further details
26
EDG overview: testbed schedule
Planned intermediate testbed schedule
- Testbed 0: March 2001
- Testbed 1: November 2001-January 2002
- Testbed 2: September-October 2002
- Testbed 3: March 2003
- Testbed 4: September-October 2003
The number of EDG testbed sites is steadily increasing: currently 9 sites are visible to the CERN resource broker
Each site normally implements, at least:
- A central install & config server (LCFG server)
- WMS (WP1) dedicated machines: UI, CE (gatekeeper & worker node(s))
- MDS Info Providers to the global EDG GIIS/GRIS
- Network Monitoring
27
Development & Production testbeds
Development
- Initial set of 5 sites will keep small cluster of PCs for development purposes to test new versions of the software, configurations etc.
Production
- More stable environment for use by application groups
  - more sites
  - more nodes per site (grow to meaningful size at major centres)
  - more users per VO
- Usage already foreseen in Data Challenge schedules for LHC experiments
  - harmonize release schedules