EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org www.glite.org SA3 Report Markus Schulz EGEE-II SA3 Activity Leader IT Department,

Slides:



Advertisements
Similar presentations
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status of Interoperability Markus Schulz.
Advertisements

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite Release Process Maria Alandes Pradillo.
EGEE is a project funded by the European Union under contract IST JRA1 Testing Activity: Status and Plans Leanne Guy EGEE Middleware Testing.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
EGEE is a project funded by the European Union under contract IST Testing processes Leanne Guy Testing activity manager JRA1 All hands meeting,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite IPv6 compliance project tests Further.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse EGEE’s plans for transition.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks General relationships with EGEE JRA1 SA3.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks PPS All sites Meeting: Introduction & Agenda.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Extensions to the ETICS Build System Client.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Oliver Keeble SA3 Activity Leader CERN EGEE-III.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Middleware Deployment and Support in EGEE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks JRA1 summary Claudio Grandi EGEE-II JRA1.
INFSO-RI Enabling Grids for E-sciencE SA3 Status and Plans Markus Schulz SA3 CERN-IT-GD.
INFSO-RI Enabling Grids for E-sciencE SA1 and gLite: Test, Certification and Pre-production Nick Thackray SA1, CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Oliver Keeble SA3 Activity Leader CERN EGEE-III.
INFSO-RI Enabling Grids for E-sciencE Integration and Testing, SA3 Markus Schulz CERN IT JRA1 All-Hands Meeting 22 nd - 24 nd March.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
INFSO-RI Enabling Grids for E-sciencE Strategy for gLite multi-platform support Author:Eamonn Kenny Meeting:SA3 All Hands Meeting.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks State of Interoperability Laurence Field.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite Build Programme and Multi-Platform.
GLite – An Outsider’s View Stephen Burke RAL. January 31 st 2005gLite overview Introduction A personal view of the current situation –Asked to be provocative!
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CERN status report SA3 All Hands Meeting.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Antonio Retico CERN, Geneva 19 Jan 2009 PPS in EGEEIII: Some Points.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The GILDA t-Infrastructure Roberto Barbera.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The future of the gLite release process Oliver.
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
WLCG Software Lifecycle First ideas for a post EMI approach 0.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GLite testing status and future Gianni Pucciani.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA3 partner collaboration tasks & process.
EGEE-III INFSO-RI Enabling Grids for E-sciencE SA3 All Hands Meeting 'Cluster of Competence' Experience SA3 INFN Cyprus May 7th-8th.
INFSO-RI Enabling Grids for E-sciencE gLite Certification and Deployment Process Markus Schulz, SA1, CERN EGEE 1 st EU Review 9-11/02/2005.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Patch Preparation SA3 All Hands Meeting.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSA3.4.1 “The process document” Oliver Keeble.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
EGEE-II TCD 22 nd -25 th May 2007 Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Experiences with a distributed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status and outlook for testing in the LCG.
INFSO-RI Enabling Grids for E-sciencE Operations Parallel Session Summary Markus Schulz CERN IT/GD Joint OSG and EGEE Operations.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Technical Overview EGEE-II’s achievements.
Components Selection Validation Integration Deployment What it could mean inside EGI
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Implementing product teams Oliver Keeble.
INFSO-RI Enabling Grids for E-sciencE Upcoming Releases Markus Schulz CERN SA1 15 th June 2005.
Enabling Grids for E-sciencE EGEE-III-INFSO-RI EGEE and gLite are registered trademarks Francesco Giacomini JRA1 Activity Leader.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
Status of gLite-3.0 deployment and uptake Ian Bird CERN IT LCG-LHCC Referees Meeting 29 th January 2007.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
EMI INFSO-RI SA2: Quality Assurance Status Report Alberto Aimar(SA2) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
EMI INFSO-RI Testbed for project continuous Integration Danilo Dongiovanni (INFN-CNAF) -SA2.6 Task Leader Jozef Cernak(UPJŠ, Kosice, Slovakia)
INFSO-RI Enabling Grids for E-sciencE Software Process Author: Laurence Field (CERN) Presented by David Smith JRA1 All Hands meeting,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Management Claudio Grandi.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Towards an Information System Product Team.
INFSO-RI Enabling Grids for E-sciencE Padova site report Massimo Sgaravatto On behalf of the JRA1 IT-CZ Padova group.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CREAM: current status and next steps EGEE-JRA1.
EGEE-II INFSO-RI Enabling Grids for E-sciencE SA3 Report Markus Schulz For EGEE-II SA3 IT Department, CERN Final EU.
JRA1 Middleware re-engineering
Status & Strategy for gLite multi-platform Support
SA3’s Responds to the Review Report
Andreas Unterkircher CERN Grid Deployment
Testing for patch certification
Leanne Guy EGEE JRA1 Test Team Manager
Francesco Giacomini – INFN JRA1 All-Hands Nikhef, February 2008
Presentation transcript:

EGEE-II INFSO-RI Enabling Grids for E-sciencE SA3 Report Markus Schulz EGEE-II SA3 Activity Leader IT Department, CERN 1 st EU Review of EGEE-II CERN, th May 2007

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Outline Activity Goals Main Achievements Status –Integration and Release Management –Testing –Interoperability –Porting Issues for SA3 Future Plans Summary

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, SA3 in Numbers EGEE-II Budget Manpower: 12 partners, 9 countries, 30 FTE

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Activity Goals Manage the process of building middleware distributions –Integrating middleware components from a variety of sources –Define acceptance criteria for accepting components –Test and certify middleware  Ensure: reliability,robustness, scalability, security and usability –Decouple middleware distributions from middleware development –Software selection and priorities to be set by the TCG SA3 is a new activity –Tasks had been covered by SA1 and JRA1 during EGEE

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Tasks Integration and Packaging Testing and Certification –Functional and Stress Testing –Security, Vulnerability Testing –Operate Certification and Testing Test Beds –Project Testing Coordination Debugging, Analysis, Support Interoperation Capture Requirements + Support for porting and contribution to standardization Details of resource allocation can be found in the Execution Plan

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Achievements Integrated release of LCG-2.7 and gLite-1.5 –Different build systems –Different configuration management –Different, overlapping functionality –Different process……  LCG-2 process tailored to production  gLite process tailored to rapid development Released on May 4th –4 days later than planned LCG-2 prototyping product product gLite 2006 gLite 3.0

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Achievements II Introduced new software lifecycle process –Based on experience with the gLite lifecycle and LCG practice –No “big bang” releases  Components are updated independently –Component updates delivered on a weekly basis to PPS  Every second week to production –Acceptance criteria for new components defined –Clear link between component versions, Patches and Bugs  Semiautomatic release notes production –Clear prioritization by stakeholders  TCG for medium term and EMT for short term goals –Documented in MSA3.2 –In use since July 2006

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Achievements III Test strategy, process,framework and external testbeds –SAM framework for automated testing (SA1 product) –Central repository for tests –Formal follow-up on test development –Increased test cases (depth)  Distributed approach  Development of tests mostly by partners –Formal process of Patch certification –Extended test beds  8 sites roughly 100 nodes  External partners cover additional deployment scenarios –Extensive use of virtualized test beds –Introduced concept of “Experimental Services”  Massive scalability tests can’t be conducted on test infrastructures

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Status Integration and Release Management Preproduction ensures user input and large scale testing TCG prioritization driven by, users, sites, developers, and operations, short term planning via EMT

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Basic Concepts Two distinct entities are tracked by the process Problems and Solutions –Problems = Bugs –Solutions = Bug Fixes = Patches –New features are tracked as “Enhancement”  Missing feature = Problem Process defines for these entities: –States and conditions for state transitions –Roles and responsibilities of actors

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Releases and Updates A software release is a set of packages (baseline) –These packages are continuously updated by Patches The baseline contains a core. –Changes to the core make the release non-backwards compatible  At the software level rather than on the service level –Changes to the core will require a new release –VDT-1.6, globus4, + SL4 as reference OS == gLite-3.1 –VDT-1.2, globus2, + SL3 as reference OS == gLite-3.0 All Patches that pass the Preproduction state at a given date form an update to the release –No fix or enhancement has to wait for other components

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Simplified Process TCGSA3 Software Provider JRA1, VDT,.... New components and major changes Requests changes Endorses SA3 proposals Negotiates with providers Proposal for TCG starts change by Bug creation

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Simplified Process TCGEMT SA3 Integration Software Provider JRA1, VDT,.... SA3 Configuration Prioritization: EMT twice a week TCG every second week Bug and Patch processing Installation tests Functional Tests Patch Specific Tests Scalability Tests Tests on external testbeds SA3 Test Process Continuous, several Bugs and Patches in parallel SA1 PPS Updates and Operates Users Test & Reject Once a week Patches that pass certification move to PPS Rejected Patches SA3 Release Manager Coordinates SA1 Production Service Updates and Operates Every second week Patches are moved to Production Experimental Services Use production service Users Stress tests

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Usage Process is in active use since July 2006 Produced 23 updates to the production system –26 since May 2006 Processed 269 patches –Addressing 835 Bugs

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Statistics gLite-3.0 gLite-3.1LCG-2.7 Ratio between Config. Cert and PPS indicates that change rate is above what SA3 can handle Yaim patches due to merger Patches reflect activity in an area

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Statistics We have to manage a very large number of different node types

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Configuration Management gLite-1 configuration based on XML and Python LCG-2 configuration based on Key-Value pairs + bash –YAIM Site administrators preferred YAIM (result of survey) –Wrappers for gLite components –Process started to move to single layer configuration  FTS, WN, UI, and WMS are already in single layer mode Installation tool –APT for (semi) automatic RPM updates –RPM lists for other tools –Tarballs for Uis and WNs

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Build Systems Currently 3 systems in use LCG build system for legacy components –To be phased out during the year gLite build system –Used for the gLite-3.0 branch ETICS –Used for the gLite-3.1 branch –Migration process to ETICS started in early August  Requires large fraction of SA3 integration resources –Will be finished around August 2007

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Testing Test plans and process documented in MSA3.5 Test strategy –Multi level tests (from simple quick tests, to stress tests)  To abort as early as possible –As much steps in parallel as possible  Component by component Install, configure, functional tests, first patch certification  Requires many temporary testbeds We use virtualization (Xen based) to save time and resources –Automate as much testing as possible  But first ensure coverage –First local then external testbeds –Moved towards testing components against a “Baseline Release”  Required significant reorganization of the testbed operation

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Testing Framework We have chosen SAM as our framework for testing –Maintained and used by SA1 –Several tests can be used in certification and production –Tests need very little modification  Concept is compatible with testing in ETICS ---> easy port –Provides Web based, customizable views and history

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Test Status Test development mainly by partners –Partners signed up for tests –Progressed monitored and documented every 2 weeks –Steady progress New class of tests: Security testing –Done by Posznan  Code reviews (VOMS and R-GMA)  Penetration tests  Independent testbed Interoperability tests –Not jet integrated in the tests process

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Test Status ComponentAvailable testsResponsibleSAM BLAH 5 tests for basic functionality INFNno Batch systems Torque has been tested and scripts developed. Partners should extend these tests to other batch systems GRNET (Torque), INFN (LSF), PIC (Condor), CESGA (SGE) no CE 19 SAM tests CERNyes gLite CE 19 SAM tests & manual test result page CERNyes DGAS 5 tests INFNno DPM 41 tests CERNno FTS 7 tests CERNyes Information System 1 basic test & GIS mon & performance and scalability tests INFN (until April 2007), CERNno LB 4 functionality tests University of Brusselsno LFC 2 SAM tests, 2 API tests, LFC performance test page CERN, LALYes, No MyProxy 1 SAM test CERNyes

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Test Status ComponentAvailable testsResponsibleSAM RGMA RGMA test results page, source code auditing TCD, PSNCno RB The old edg-tests suite is used occasionally CERNno SE 3 SAM tests CERNyes SRM v.2 S2 testuite, SRM2 test in DPM testsuite CERNyes UI Extensive test suite testing all commands listed in the LCG User Guide (30+) CERNno WN Most of the UI tests are also applicable to the WN. CERNno VOMS 28 tests, VOMS source code auditing CERN, PSNCyes WMS Tests for: bulk submission, interactive jobs, parametric jobs. Glite version of edg-tests, WMS tests result page CERN, IMPERIAL (since April 2007), CSIC (WMProxy) partly

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Test Beds Virtual testbeds for individual testers ( about 5 ) Dynamical allocated test nodes ( > 50 nodes) Central certification testbed

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Test Beds External testbeds linked to the certification testbed –CESGA (SGE) –PIC (Condor) –GRNET ( Torque) –UCY (Torque) –INFN (LSF) –LAL (DPM,LFC) –DESY (dcache) Standalone testbeds –Posznan (Security) –IMPERIAL (WMS) –TCD (Porting) Setup and coordination took a long time, last site joined end of 2006.

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Interoperability OSG –In production since almost 10 months  Used extensively by CMS –Interoperability testbed in preparation ARC –Problem has been analysed in depth –Plan documented in MSA3.4 –First prototype exists  Still a long way  ARC’s focus is on the CREAM CE

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Interoperability UNICORE –Problem has been analysed in depth  Very complex  Minimal overlap between concepts –Plan documented in MSA3.3 –First components exists  Slower progress than expected in the Plan  Proof of principle tests have bee successful NAREGI –Close contact during 2006 –NAREGI demonstrated first set of interoperable tools

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Interoperability GIN info –Links information systems of many grid infrastructures  The maps with the grid infrastructures are based on the GIN-BDII Generic Information Provider Provider EGEE Provider OSG Provider NDGF GIN BDII ARC BDII Provider Naregi Provider Teragrid Provider Pragma EGEE Site OSG Site NDGF Site Naregi Grid Teragrid Grid Pragma Grid

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Interoperability SA3 is participating actively in the GLUE standardization process Process has been moved to OGF –SA3 member is co-chairing the working group

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Porting Main partners are TCD and Posznan Problems with porting –Software dependencies and interdependencies  --->“Plan for glite restructuring” –Up to now only “post release” porting  Difficult to follow change rate  Other platforms have to be supported at release time –TCD is moving to ETICS  Supports better concurrent multi platform build and tests 

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Porting Status table at TCD: –

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Issues Slow start of the activity –New activity –Recruitment process took until completion 3 months –Several partner required training Merging 2 middleware stacks, tool sets and processes –While keeping changes flowing to production –Was very difficult, done under high pressure by the applications Introducing change while supporting a production service –More than 200 individual updates –How to handle major changes like moving to ETICS? –Current resource level is adequate to support steady state

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Issues Testing –Partner contribution started slow  Testbeds needed more than 6 months Hiring, hardware procurements, –Still most tests originate from local team –Introduced more frequent communication  Phone conferences  Formal follow-up on status Interoperability –Underestimated UNICORE interoperation complexity  Review of plan at the next meeting next month –ARC struggled with some technical issues  But mainly a partner issue  Review of plan at the next allhands meeting

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Issues Move to ETICS –Will be very beneficial when achieved –Significant upfront investment and training –ETICS is now maturing quickly  But relative timing of both projects was problematic ??????? Or should I ignore the ETICS problems

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Plans Complete the move to the ETICS build system –A significant investment for the next 6 months Move install and configuration tests to ETICS Automate more test cases with SAM Move to single layer, component centric configuration tools (component YAIM) –Well underway, in certification Support at least 2 additional platforms for all releases –To be defined by TCG –Can be restricted to some components (Uis, WN) Contribute to the “gLite Restructuring Plan”

Enabling Grids for E-sciencE EGEE-II INFSO-RI st EU Review, May 15-16, Summary SA3 got off the ground Integrated LCG-2.7 and gLite-1.5 Defined and implemented Software Life Cycle process –Component based updates work! ( 269 patches since June) Test process defined and implemented –Many additional tests –Common framework with SA1 (SAM) –External testbeds to cover deployment scenarios Move to ETICS is well underway –Will improve portability Interoperation made visible progress –OSG interoperation used on daily basis