JRA2: Quality Assurance

Slides:



Advertisements
Similar presentations
Module N° 4 – ICAO SSP framework
Advertisements

Stepan Potiyenko ISS Sr.SW Developer.
DataGrid is a project funded by the European Commission under contract IST rd EU Review – 19-20/02/2004 DataGrid Quality Assurance On behalf.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE-III Program of Work Erwin Laure EGEE-II / EGEE-III Transition Meeting CERN,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks JRA2: Quality Assurance & Security Coordination.
08/11/908 WP2 e-NMR Grid deployment and operations Technical Review in Brussels, 8 th of December 2008 Marco Verlato.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
EGI: A European Distributed Computing Infrastructure Steven Newhouse Interim EGI.eu Director.
Cybersecurity: Engineering a Secure Information Technology Organization, 1st Edition Chapter 7 Software Supporting Processes and Software Reuse.
EGEE is a project funded by the European Union under contract IST JRA1 Testing Activity: Status and Plans Leanne Guy EGEE Middleware Testing.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
Software System Engineering: A tutorial
EMI SA2: Quality Assurance (EMI-SA2 Work Package) Alberto Aimar (CERN) WP Leader.
EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.
Module N° 8 – SSP implementation plan. SSP – A structured approach Module 2 Basic safety management concepts Module 2 Basic safety management concepts.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse EGEE’s plans for transition.
INFSO-RI Enabling Grids for E-sciencE EGEE and Industry Bob Jones EGEE-II Project Director Final EGEE Review CERN, May 2006.
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
GridPP Deployment & Operations GridPP has built a Computing Grid of more than 5,000 CPUs, with equipment based at many of the particle physics centres.
INFSO-RI Enabling Grids for E-sciencE Plan until the end of the project and beyond, sustainability plans Dieter Kranzlmüller Deputy.
EGEE is a project funded by the European Union under contract IST JRA1-SA1 requirement gathering Maite Barroso JRA1 Integration and Testing.
SA1/SA2 meeting 28 November The status of EGEE project and next steps Bob Jones EGEE Technical Director EGEE is proposed as.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team James Casey EGEE’08.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA2 Quality Plan for EGEE III Geneviève.
JRA Execution Plan 13 January JRA1 Execution Plan Frédéric Hemmer EGEE Middleware Manager EGEE is proposed as a project funded by the European.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
EGEE-III-INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-III All Activity Meeting Brussels,
EGEE is a project funded by the European Union under contract IST Network Resources Provision Jean-Paul Gautier SA2 manager Cork meeting,
JRA2: Quality Assurance Overview EGEE is proposed as a project funded by the European Union under contract IST JRA.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
Enabling Grids for E-sciencE EGEE-II Meeting EGEE-II SA2 activity Tziouvaras Chrysostomos, MSc NTUA, 14 th March 2006.
EMI INFSO-RI SA1 Session Report Francesco Giacomini (INFN) EMI Kick-off Meeting CERN, May 2010.
EMI INFSO-RI Software Quality Assurance in EMI Maria Alandes Pradillo (CERN) SA2.2 Task Leader.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Deliverable DSA1.4 Jules Wolfrat ARM-9 –
EGEE is a project funded by the European Union under contract IST Roles & Responsibilities Ian Bird SA1 Manager Cork Meeting, April 2004.
INFSO-RI Enabling Grids for E-sciencE Quality Assurance Gabriel Zaquine - JRA2 Activity Manager - CS SI EGEE Final EU Review
EGEE Project Review Fabrizio Gagliardi EDG-7 30 September 2003 EGEE is proposed as a project funded by the European Union under contract IST
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA2 Networking support for EGEE III Xavier.
Induction: What is EGEE? –April 26-28, EGEE organisation and procedures John Murison, EGEE Training Team EGEE is funded by the European Union.
Components Selection Validation Integration Deployment What it could mean inside EGI
Capacity Building in: GEO Strategic Plan 2016 – 2025 and Work Programme 2016 Andiswa Mlisa GEO Secretariat Workshop on Capacity Building and Developing.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Country Reports AAM Brussels.
JRA1 Middleware re-engineering
Bob Jones EGEE Technical Director
Regional Operations Centres Core infrastructure Centres
BIL 424 NETWORK ARCHITECTURE AND SERVICE PROVIDING.
NA5: Policy and International Cooperation
SA1 Status Report EGEE Grid Operations & Management
JRA3 Introduction Åke Edlund EGEE Security Head
SA1 Execution Plan Status and Issues
Integrated Management System and Certification
Ian Bird GDB Meeting CERN 9 September 2003
Long-term Grid Sustainability
JRA1 (Middleware) Overview
Service Level Agreement/Description between CE ROC and Sites
Report on SLA progress Ioannis Liabotis <ilaboti at grnet.gr>
Infrastructure Support
Networking support (SA2) tasks for EGI
Maite Barroso, SA1 activity leader CERN 27th January 2009
Funded by the Erasmus+ Programme EPP JO-EPPKA2-CBHE-JP Lina Tsakalou
LCG Operations Workshop, e-IRG Workshop
Leigh Grundhoefer Indiana University
Collaboration Board Meeting
JRA2 Quality Assurance Geneviève Romier CNRS-UREC
Presentation transcript:

JRA2: Quality Assurance Gabriel Zaquine - BT EGEE-II Final EU Review (CERN) 8-9 July 2008

Contents Goals and organisation Processes and procedures Metrics programme Industry QA standards follow-up Summary JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Goals Coordinate the overall Quality Assurance (QA) aspects of the project to ensure its processes, services and deliverables are of high-quality Establish quality procedures, methods and associated tools for all project activities Establish a metrics programme for key targets of the project and each activity including measures of the performance of each partner through the project activities and their contribution to the programme of work Foster industry QA standards best practices JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

JRA2 in numbers Manpower: 3 partners, 2 countries, 3 FTE EGEE-II Budget JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

QA organisation The EGEE-II Quality Assurance activity is organised through the Quality Assurance Group (QAG) composed of QA representatives from each activity coordinated by the JRA2 Quality coordinator The main role of the Activity QA representatives is to: define the Activity Quality & Measurement Plans ensure that quality measures as agreed are applied inside their activity The execution of the verification and testing activities is the responsibility of the individual activities JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Processes and Procedures Capitalize on best practices JRA2 Quality Assurance Processes and Procedures Capitalize on best practices JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Processes / Procedures Used by several other projects Fully documented JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Project Management The Project management procedures and related material have proven successful in managing a large project since EGEE phase I In particular, many mechanisms have been established by the project to assess project progress: PEB weekly meetings, all-activity meetings (AAM) on a quarterly basis Quarterly reports, periodical reports, cost claims Activities’ execution plans and associated project tracking tool (PPT) Deliverables and milestones reviews Users’ surveys Metrics program: Overall metrics, Activity Metrics, Partner activity review External Advisory Committee reports EU reviews Many of this material has been used by related projects For EGEE-III, in order to monitor the progress of the programme of work in each country and the involvement of individual partners, a country-based review process will be used systematically across the whole project JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Middleware As a means of reinforcing the long-term stability of gLite, several actions have been performed by the project, such as: gLite restructuring including dependency challenge and dependency review (cross activity code review) More robust software and better maintainability & portability Review of gLite authorization (MJRA1.7) Framework re-design Adoption of the ETICS build system More control of the software structure Moving to a continuous release process Allowing independent component releases JRA1 and SA3 will continue Working closely with the ETICS-2 project particularly with the automated build systems Automating the testing and certification procedures will continue The acceptance criteria will be enforced with a check list that defines the criteria at each stage in the release process JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Operations (1/2) Quality Assurance is an inherent part of the everyday work of the Operations Operations have shown a continual improvement in the quality of the service with clarified work procedures of deployment and day-to-day operations of both production and pre-production services as well as the improvement of various automated tools A wide range of metrics are available in a metrics portal, and used in reporting and as part of assessments of both sites and services Several mechanisms have been put in place to gather feedback weekly operations meeting operations workshops feedback from GGUS for user support survey of the user community on all aspects of the operations of the infrastructure JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Operation (2/2) The SA1 activity is introducing SLAs and the means of collecting the metrics that will be used to monitor them A general strategy of the Service Activities in EGEE-III will be to optimise the activities in order to reduce the overall level of effort required in the future to manage a sustainable infrastructure improving tools needed to support operations monitoring automation of alarms develop a full set of Service Level Agreements (SLA) at several levels, associated with mechanisms to monitor and verify these SLAs JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Security Management Best practices have been considered and described in MJRA2.2.1 “Security audit, strategy and plan”, and progressively implemented Security audits have been performed on aspects of Grid operations and middleware by various groups within the project as well as external third parties (BARC/EUIndiaGrid and CNES) Security Service Challenges (SSC) is an ongoing process on many grid sites, running in its 3rd phase http://cern.ch/osct/ssc.html The goal of theses challenges is to test the incident of our security tools, policies and procedures and identify where improvements can be made Procedures generally understood Proposed enhancements for Access control (ex: blocking a user at the authZ level) and tractability - Logging and audit trails (logs) Review of gLite authorization (MJRA1.7) Framework re-design In addition, the OSG is part of the overall work as partner in the security groups, with exchange of experiences from similar efforts within the US This will be expended to other projects for the future JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

JRA2 Quality Assurance Metrics programme Provide a powerful tool for assessment at each level : - Project overall metrics - Activity metrics - Partner metrics JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Overall main metrics The project identified overall metrics, with “target” values defined in the Technical Annex http://egee-jra2.web.cern.ch/EGEE-JRA2/EGEE-II/QoS/Follow-up/Overall.htm The project has exceeded all targets set for the project JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Activity metrics A full set of activity metrics have been provided by each activity. These are grouped into themes, namely: project management, training, dissemination, usage, operation, middleware JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

SA1 metrics … as part of everyday monitoring tool The project has implemented the metrics defined in MSA1.1 More than 40 metrics including timeseries data as well as snapshot data providing more finely-grained data http://egee-docs.web.cern.ch/egee-docs/list.php?dir=./mig/production/& Size metrics (CPU, Storage, Users) Operation metrics (Site availability, Site reliability) Usage metrics (Jobs, Data transfer, CPU, Storage) User support tickets (Numbers, Types, Response times) Services metrics (CE, SE, SRM, BDII, RB, LFC, FTS, VOMS, etc) The site availability and reliability metrics calculated by SAM & GridView are used for sites’ SLA conformance JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Availability of a Site / Services Site availability is calculated as: logical AND of all service types If a site has several instances of a service type (e.g. Computing Elements), the service is deemed to be available if any of the instances are available (logical OR) https://edms.cern.ch/document/860386 ..... █ D  = Down (unscheduled) █ NA = No test result Available █ M = Maintenance (scheduled) █ U = Up JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

From Partner Activity Review to Federation & Country review Given the size of the EGEE-II (more than 90 partners), the partners’ metrics programme was devised to ensure that all partners are performing as expected The Partner Activity Reviews have been organized with a format depending on the nature of each activity http://egee-jra2.web.cern.ch/EGEE-JRA2/EGEE-II/QoS/Follow-up/PartnerMetrics.htm The Activity reviews went well. The Activity Managers found these reviews useful. It was a forum to discuss issues, examine progress & contributions and establish better communication between partners This is being expanded to “federation reviews” in EGEE-III Monitor the progress of the programme of work in each country and the involvement of individual partners Examine progress and contribution and identify and promote regional successes and initiatives for re-use in other countries Verify the state of readiness for the transition to EGI JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

JRA2 Quality Assurance Industry QA standards follow-up “Evaluate best practices that could be progressively applied to EGEE” - ITIL group - Relationships with ETICS JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Following up industry standards and best practices (1/2) The EGEE ITIL group has been set up with the goal to assess ITIL best practices Trainings “Introduction to ITIL” and dedicated presentations at EGEE conferences ITIL Services Level Management (SLA modeled on ITIL recommendations) ROC / Site HARDWARE AND CONNECTIVITY CRITERIA DESCRIPTION OF SERVICES COVERED SERVICE HOURS AVAILABILITY SUPPORT SERVICE REPORTING AND REVIEWING https://edms.cern.ch/document/860386 GGUS/ROCs for users support service https://edms.cern.ch/document/888089 ITIL Service Desk and Incident Management Evaluation for EGEE Networks Operations Centre (ENOC) Importance of a common terminology e.g. Incident, Problem, Known error Evaluation vs GGUS Out of 103 ITIL items analysed from the ITIL “Incident Management”, most are fulfilled by the present arrangements of GGUS Improvement of Interface between GGUS central part and regional parts in order to improve Problem Management Enforcement of Incident Manager ITIL will be useful at EGI stage as a set of best practice and common terminology The QAG will pursue the work on ITIL based on ITILV 3 within EGEE-III ITIL 3 JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Following up industry standards and best practices (2/2) Relationships with ETICS have continued JRA2 has followed the work of the ETICS QA software certification model based on ISO and CMMi standards, including: automated evaluation modules for Static analysis Coding style Structural testing Functional testing Standards compliance For EGEE-III, the QAG will encourage ISO and CMMi best practices toward Software Engineering part JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008

Summary The QA objectives planned for the project have been achieved, and QA is in active use across all activities The quality of the service of the EGEE infrastructure has been continuously increased, with clarified work procedures, as well as the improvement of various automated tools The project has exceeded all targets set for the second year of the project In the list of measures for continuous quality improvement foreseen within EGEE-III Quality Assurance will be pervasive throughout all EGEE-III activities and each activity will plan for their QA measures as part of the activity tasks and milestones Further implement the Services Level Management process with a full set of Service Level Agreements (SLA) at several levels, associated with mechanisms to monitor and verify these SLAs The acceptance software component criteria will be enforced at each stage Security auditing to cooperate more with other projects To monitor the progress of the programme of work in each country and the involvement of individual partners, a federation reviews process will be used systematically across the whole project The QAG group will also pursue the work on ITIL based on ITIL 3, and the work with ETICS concerning Software Engineering, encouraging ISO and CMMi best practices JRA2- G. Zaquine - EGEE-II Final EU Review - 8-9 July 2008