The EDG Testbed The European DataGrid Project Team
EU DataGrid: Testbed 2 Contents u Hardware Components of an EDG Testbed u The EDG Testbed Configuration u How to set up an EDG Testbed n Obtaining code n Configuring different machines
EU DataGrid: Testbed 3 Some EDG Logical Machine Types u User Interface (UI) u Resource Broker (RB) u Computing Element (CE) n Frontend Node n Worker Nodes (WN) u Storage Element (SE) u Information Catalog (IC) u Monitoring (MON)
EU DataGrid: Testbed 4 A Simple Testbed Configuration User Interface Resource Broker etc. Storage Element 1 Storage Element 2 Computing Element 1 Computing Element 2 “CLOSE”
EU DataGrid: Testbed 5 Testbeds u Work Package Testbeds and development machines n Development versions u Development Testbed: Integration & Evaluation of SW n Current tagged release + new pkg. New tagged release. u Certification Testbed: Extended, Detailed Testing n Tagged release u Application Testbed: End-user Applications n Stable, certified release
EU DataGrid: Testbed 6 DataGrid testbeds Application testbed: More than 1000 CPUs 5 Terabyte of storage EDG sw installed at more than 40 sites
EU DataGrid: Testbed 7 Application Testbed Resources u All numbers are always out of date u Hidden Infrastructure n Information and Monitoring n Resource Brokers n User Interfaces n VO Membership Servers n Certification Authorities SiteCountryCPU s Storage CC-IN2P3*FR GB CERN*CH GB CNAF*IT GB Ecole Poly.FR6220 GB Imperial Coll.UK92450 GB LiverpoolUK210 GB ManchesterUK915 GB NIKHEF*NL GB OxfordUK130 GB PadovaIT11666 GB RAL*UK6332 GB SARANL GB TOTAL GB *also Dev. TB; +200 TB including tape
EU DataGrid: Testbed 8 EDG Software Distribution u All software available as source & binary RPMs u Binaries for RedHat 7.3 with gcc and u Packages include n Complete Globus distribution (via VDT) n EDG packages (~50 packages) n Support tools (perl, ant, jdk, …) u RPM lists for different machine types
EU DataGrid: Testbed 9 Automatic EDG Fabric Management Setup Tasks u Node Installation & Management u Configuration Management Runtime Tasks u Monitoring & Fault Tolerance u Resource Management Runtime tasks may automatically trigger setup tasks n New machines join the grid n Failure detection/repair (e.g. restarting daemons)
EU DataGrid: Testbed 10 LCFG (Local ConFiGuration system) u Developed at University of Edinburgh u Widely used fabric installation & configuration tool u Automated installation and configuration in a very diverse and evolving environment LCFG configuration files Compiler (mkxprof) Web Server XML Profile LCFG SERVER HTTP ldxprof Generic Component Generic Component rdxprof LCFG Components DBM File LCFG CLIENT Notif y UDP Acknowledge
EU DataGrid: Testbed 11 Example LCFG Configuration File +inet.services telnet login ftp +inet.allow telnet login ftp sshd +inet.allow_telnet ALLOWED_NETWORKS +inet.allow_login ALLOWED_NETWORKS +inet.allow_ftp ALLOWED_NETWORKS +inet.allow_sshd ALL +inet.daemon_sshd yes auth.users myckey +auth.userhome_mickey /home/mickey +auth.usershell_mickey /bin/tcsh +inet.services telnet login ftp +inet.allow telnet login ftp sshd +inet.allow_telnet ALLOWED_NETWORKS +inet.allow_login ALLOWED_NETWORKS +inet.allow_ftp ALLOWED_NETWORKS +inet.allow_sshd ALL +inet.daemon_sshd yes auth.users myckey +auth.userhome_mickey /home/mickey +auth.usershell_mickey /bin/tcsh Config files , /home/MickeyMouseHome /bin/tcsh , /home/MickeyMouseHome /bin/tcsh XML profiles mkxprof
EU DataGrid: Testbed 12 Fabric Monitoring & Fault Tolerance Sensor Collector agent Local Node Cache monitoring Decision unit Actuator agent Actuator Rule config Central Repository DB Consumer
EU DataGrid: Testbed 13 LCFG Installation Server setup: u Download rpms (perl + lcfg + apache) u Install rpms u Start http server (apache, …) u Create configuration files u Run mkxprof on them Client setup: u Download rpms (perl + lcfg) u Install rpms u Reboot (rdxprof will be started) Configuration management (server): u Update config files u Run mkxprof
EU DataGrid: Testbed 14 EDG Machine Installation On the LCFG server: u Create directories for rpms u Download rpms from central edg repository u Create LCFG profile for each client machine: n Filename = hostname; includes machine type specific config file and site specific config file (needs to be customized!) n Example templates + rpm-lists are provided n Run mkxprof on each of these files On the LCFG clients: u Setup clients as describe before D O N E
EU DataGrid: Testbed 15 Manual Setup (without LCFG) u Download rpms directly on machine (RPM-lists per machine type exist) u Install rpms u Configure individual services (see installation guide)
EU DataGrid: Testbed 16 Issues when Adding new Sites to the Testbed u EDG is currently setting-up procedures explaining how to add new sites n Variations already tested with Taiwan and Romania n Step-by-step instructions produced which we expect to become simpler over time u Need to clarify the “minimum requirements” for a site to become a member of the testbed n A number of regular tasks have to be performed by the sites administrators n A maximum delay needs to be defined for responding to requests/problems if the testbed is to run efficiently u Sites from new countries have to identify/create a supporting CA n Since CAs need mutual trust this could lead to an explosion of inspection activities u Some tasks will fall on the people responsible for managing the VOs n HEP experiment secretariats already perform some level of authentication of their institutes and members. How an we get some leverage from this?
EU DataGrid: Testbed 17 Further Information u EDG Testbed homepage: u u Fabric management: u u LCFG on EDG Testbed information: u u doc/ doc/