EGEE-II TCD 22 nd -25 th May 2007 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks Experiences with a distributed.

Slides:



Advertisements
Similar presentations
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Supporting MPI applications on the EGEE Grid.
Advertisements

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite Release Process Maria Alandes Pradillo.
Combining a Virtual Grid Testbed and eLearning Courseware Kathryn Cassidy, Jason McCandless, Stephen Childs, John Walsh, Brian Coghlan, Declan Dagger Department.
INFSO-RI Enabling Grids for E-sciencE Status of LCG-2 porting Stephen Childs, Brian Coghlan and Eamonn Kenny Grid-Ireland/EGEE October.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks C. Loomis (CNRS/LAL) M.-E. Bégin (SixSq.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite IPv6 compliance project tests Further.
EGEE-II TCD 22 nd -25 th May 2007 Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-Platform Support Presenters:
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Extensions to the ETICS Build System Client.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GRNET SA3 Progress Report Ioannis Liabotis.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks C. Loomis (CNRS/LAL) M.-E. Bégin (SixSq.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Middleware Deployment and Support in EGEE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks JRA1 summary Claudio Grandi EGEE-II JRA1.
9th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATORS E-infrastructure shared between Europe and Latin America CE + WN installation and configuration.
INFSO-RI Enabling Grids for E-sciencE Strategy for gLite multi-platform support Author:Eamonn Kenny Meeting:SA3 All Hands Meeting.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CERN status report SA3 All Hands Meeting.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Stuart Kenny and Stephen Childs Trinity.
4th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATORS E-infrastructure shared between Europe and Latin America CE + WN installation and configuration.
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The GILDA t-Infrastructure Roberto Barbera.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Tools and techniques for managing virtual machine images Andreas.
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GLite testing status and future Gianni Pucciani.
Cloud Computing Lecture 5-6 Muhammad Ahmad Jan.
EGEE-II INFSO-RI Enabling Grids for E-sciencE YAIM Overview MiMOS Grid tutorial HungChe, ASGC OPS Team.
INFSO-RI Enabling Grids for E-sciencE /10/20054th EGEE Conference - Pisa1 gLite Configuration and Deployment Models JRA1 Integration.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks APEL CPU Accounting in the EGEE/WLCG infrastructure.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Update Authorization Service Christoph Witzig,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Progress report from University of Cyprus.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA3 partner collaboration tasks & process.
12th EELA TUTORIAL - USERS AND SYSTEM ADMINISTRATORS FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin.
EGEE-III INFSO-RI Enabling Grids for E-sciencE SA3 All Hands Meeting 'Cluster of Competence' Experience SA3 INFN Cyprus May 7th-8th.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Middleware Update Maria Alandes Pradillo.
INFSO-RI Enabling Grids for E-sciencE gLite Certification and Deployment Process Markus Schulz, SA1, CERN EGEE 1 st EU Review 9-11/02/2005.
Grid testing using virtual machines Stephen Childs*, Brian Coghlan, David O'Callaghan, Geoff Quigley, John Walsh Department of Computer Science Trinity.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite configuration (plans) Robert Harakaly.
TP: Grid site installation BEINGRID site installation.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Patch Preparation SA3 All Hands Meeting.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSA3.4.1 “The process document” Oliver Keeble.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
TCD Site Report Stuart Kenny*, Stephen Childs, Brian Coghlan, Geoff Quigley.
Grid-Ireland test facilities Stephen Childs Dept. of Computer Science Trinity College Dublin.
EGEE-II INFSO-RI Enabling Grids for E-sciencE SA3 Report Markus Schulz EGEE-II SA3 Activity Leader IT Department,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Imperial College SA3 Status David Colling.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Implementing product teams Oliver Keeble.
EGEE-III INFSO-RI Enabling Grids for E-sciencE JRA1 and SA3 All Hands Meeting December 2009, CERN, Geneva Product Teams –
INFSOM-RI ETICS and EGEE Testing Alberto Di Meglio.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
StratusLab is co-funded by the European Community’s Seventh Framework Programme (Capacities) Grant Agreement INFSO-RI Demonstration StratusLab First.
II EGEE conference Den Haag November, ROC-CIC status in Italy
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Management Claudio Grandi.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CYFRONET site report Marcin Radecki CYFRONET.
2007/05/22 Integration of virtualization software Pierre Girard ATLAS 3T1 Meeting
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarksEGEE-III INFSO-RI MPI on the grid:
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Towards an Information System Product Team.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGEE-II INFSO-RI Enabling Grids for E-sciencE SA3 Report Markus Schulz For EGEE-II SA3 IT Department, CERN Final EU.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios Grid Monitor E. Imamagic, SRCE OAT.
Status & Strategy for gLite multi-platform Support
Andreas Unterkircher CERN Grid Deployment
Partner Status HPCL-University of Cyprus
Quattor Usage at Nikhef
Future Test Activities SA3 All Hands Meeting Dublin
Quality Control in the dCache team.
Virtualization in the gLite Grid Middleware software process
Presentation transcript:

EGEE-II TCD 22 nd -25 th May 2007 Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Experiences with a distributed patch certification Presenter: John Walsh Location: PIC, Barcelona, ES

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 Motivations My view of Testing and Certification –Should adhere to general scientific principles and methods –‘Deployment and Testing’ is an ‘experiment’ –Must be independently repeatable –Results must be independently reproducible 2

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 EGEE Testbed types Testbed Types –Multi Site TB  Wide area network  Medium to Large Scale deployments  Must be highly coordinated  “Controlled” environment difficult  A single service may make whole TB unusable for periods 3

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 EGEE Testbed types Testbed Types –Multi Site TB  Wide area network  Medium to Large Scale deployments  Must be highly coordinated  “Controlled” environment difficult  A single service may make whole TB unusable for periods –Single Isolated TB  Generally small scale, limited external access  Can replicate components / simulate conditions of “real world”  Does not reproduce all conditions of the “real world”  Need not reproduce complete infrastructure  Highly controlled, less variables  Single tester can control environment  SAM integration difficult, but SAM standalone possible 4

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 TCD Testbeds TCD runs a medium scale set of Isolated TBs –Xen extensively used –Non-trivial setup –Isolated ELgrid, e-Learning grid:  Replicates core Grid-Ireland infrastructure  18 sites with 4 WNs each, and national services  Look and feel without impacting on production services 5

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 TCD Testbeds TCD runs a medium scale set of Isolated TBs –Xen extensively used –Non-trivial setup –Isolated ELgrid, e-Learning grid:  Replicates core Grid-Ireland infrastructure  18 sites with 4 WNs each, and national services  Look and feel without impacting on production services –Isolated TestGrid, allows multiple Grid Infrastructures:  Certification infrastructure: 18 Grid-Ireland sites,4 WNs each, and national services Tests Quattor profile changes Quality control before deployment on production  R-GMA testing infrastructure: WMS, R-GMA, 1 site, 4 WNs  Experimental and Porting infrastructure: >150 nodes, multiple sites  TestGrid allows mixed public and private network address spaces 6

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 TCD infrastructure 7 Certification TB R-GMA TB

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 R-GMA testbed Implements core set of service nodes –Top level  R-GMA registry/browser/schema  gLite WMS (Xen) –Site  R-GMA site mon  gLite UI (Xen)  gLite CE + site BDII + torque (Xen)  gLite Classic SE (Xen)  >2 WN (Xen) Installation via YAIM –Quattor in catchup mode (even on Production) 5 TB Fileserver for image backups R-GMA registryR-GMA MON XEN hypervisor VM1: WMS VM2: CE VM3: UI VM4: SE VM5-n: WNs Example R-GMA Certification TB

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 Simple Install Procedure Xen Nodes –Basic SL3 image (copied from repository) –Java –NTP –Minimal network settings –APT  Basic SL repository For each node –Install latest (certified) YAIM Central YAIM configuration –Defines Basic Site Configuration –3 way diff can check for changes in configurations Each node configured as per type WMS requires extra Condor repository 9

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 Simple Upgrade Procedure Nice thing is way images can be used Each node image should be copied to backup server –Known (good) state –Rollback possible Then can use images to instantiate nodes very quickly Can prepare siteInfo.def off-line and copy it to node Do YAIM install –Fixes up repos in /etc/apt/sources.list.d/lcg.list –Problems?  Raise problem in savannah Patch discussion Do YAIM configure –Problems(?)  As above 10

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 R-GMA certification Hey presto, now have a TB R-GMA testbed can only be used for testing: –Correctness behaviour of YAIM –That a patch fixes its target problem –Basic R-GMA components:  rgma-client-check OK  rgma-server-check (mon and reg) OK  Daemon startup scripts OK  Basic R-GMA testsuite OK –That the R-GMA daemons are stable(?) –Whether there are any new tests that can be added to TestSuite –A new SAM test(?) 11

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 Stability R-GMA stability –Tomcat daemon can take days to become unstable –How stable are the components R-GMA depends on ?  MySQL, java JDBC connectors, etc –Is the default configuration OK ?  Can it be improved ? Stress testing is vital –Should attempt to keep stats on system and component behaviour  Memory usage (any leaks?)  Disk usage, number of files, etc  File descriptors (any descriptors leaking?)  Log files OK ? Rotation policy OK ? 12

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 Patch Problems Patch may introduce a new problem –Important to discuss with the developers and within SA3  Issues involved  Evaluation of problem  Will applying this patch cause more problems than solve? Will it become a showstopper? 13

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 Summary Isolated testbed experience has been positive –Xen lessens hardware costs –Can create custom TBs on demand –Large range of testing scenarios possible –Extra layer of quality control Non-trivial setup –But once completed it becomes a good scientific testbed Requires extra infrastructure nodes to be installed Simple store/test/rollback procedure Isolated testbed does not capture all scenarios –Scaling of tests may not always be possible –In future intend to add network emulation to help –PPS plays critical post-certification role 14

Enabling Grids for E-sciencE EGEE-II TCD 22nd-25th May 2007 TestGrid Simple CA Many nodes are (re)installed repeatedly –Host certificates must be securely saved  Copy to chosen media and store safely –TestGrid now uses a simpleCA for private network nodes  Allows greater flexibility in generating certs CA controlled by small team of administrators Does not require standard cert issuing procedure Faster turn around on cert generation Certificates cannot used outside of Testbed environment Namespace is disjoint to EUGridPMA namespace  Initial overhead in setting up simpleCA Learning curve Best setup with local CA expert –Extra RPM for the simpleCA deployed on required nodes 15