MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons Maarten Litmaath On behalf of the WG participants GDB 09/09/2015.

Slides:



Advertisements
Similar presentations
The Middleware Readiness Working Group LHCb Computing Workshop LHCb Computing Workshop Maria Dimou IT/SDC 2014/05/22.
Advertisements

New VOMS servers campaign GDB, 8 th Oct 2014 Maarten Litmaath IT/SDC.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
WLCG Operations Coordination report Maria Dimou / CERN With input and on behalf of the WLCG Operations Coordination team May 2015 GDB CERN indico event.
LHCC Comprehensive Review – September WLCG Commissioning Schedule Still an ambitious programme ahead Still an ambitious programme ahead Timely testing.
Status of WLCG Tier-0 Maite Barroso, CERN-IT With input from T0 service managers Grid Deployment Board 9 April Apr-2014 Maite Barroso Lopez (at)
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
OSG Services at Tier2 Centers Rob Gardner University of Chicago WLCG Tier2 Workshop CERN June 12-14, 2006.
SRM 2.2: status of the implementations and GSSD 6 th March 2007 Flavia Donno, Maarten Litmaath INFN and IT/GD, CERN.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) WLCG GDB, CERN 8 July 2015.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP35, Liverpool 11 Sep 2015.
2 Sep Experience and tools for Site Commissioning.
Marian Babik, Luca Magnoni SAM Test Framework. Outline  SAM Test Framework  Update on Job Submission Timeouts  Impact of Condor and direct CREAM tests.
First attempt for validating/testing Testbed 1 Globus and middleware services WP6 Meeting, December 2001 Flavia Donno, Marco Serra for IT and WPs.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
Maarten Litmaath (CERN), GDB meeting, CERN, 2006/02/08 VOMS deployment Extent of VOMS usage in LCG-2 –Node types gLite 3.0 Issues Conclusions.
MW Readiness Verification Status Andrea Manzi IT/SDC 21/01/ /01/15 2.
CERN Using the SAM framework for the CMS specific tests Andrea Sciabà System Analysis WG Meeting 15 November, 2007.
Towards a Global Service Registry for the World-Wide LHC Computing Grid Maria ALANDES, Laurence FIELD, Alessandro DI GIROLAMO CERN IT Department CHEP 2013.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
CERN11 th February WLCG Ops Coordination [GDB Report] Josep Flix (PIC/CIEMAT) On behalf of the WLCG Operations Coordination Team GDB – CERN.
Efi.uchicago.edu ci.uchicago.edu FAX status developments performance future Rob Gardner Yang Wei Andrew Hanushevsky Ilija Vukotic.
Stefano Belforte INFN Trieste 1 Middleware February 14, 2007 Resource Broker, gLite etc. CMS vs. middleware.
MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons 10/12/2014.
1 User Analysis Workgroup Discussion  Understand and document analysis models  Best in a way that allows to compare them easily.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
WLCG infrastructure monitoring proposal Pablo Saiz IT/SDC/MI 16 th August 2013.
Information System Status and Evolution Maria Alandes Pradillo, CERN CERN IT Department, Grid Technology Group GDB 13 th June 2012.
Report from the WLCG Operations and Tools TEG Maria Girone / CERN & Jeff Templon / NIKHEF WLCG Workshop, 19 th May 2012.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Tools and techniques for managing virtual machine images Andreas.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Jan 2010 OSG Update Grid Deployment Board, Feb 10 th 2010 Now having daily attendance at the WLCG daily operations meeting. Helping in ensuring tickets.
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
WLCG Software Lifecycle First ideas for a post EMI approach 0.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
CERN IT Department CH-1211 Geneva 23 Switzerland t A proposal for improving Job Reliability Monitoring GDB 2 nd April 2008.
Andrea Manzi CERN On behalf of the DPM team HEPiX Fall 2014 Workshop DPM performance tuning hints for HTTP/WebDAV and Xrootd 1 16/10/2014.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA3 partner collaboration tasks & process.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Grid Technology SL Section Software Lifecycle Duarte Meneses.
WLCG ‘Weekly’ Service Report ~~~ WLCG Management Board, 5 th August 2008.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
WLCG Operations Coordination Andrea Sciabà IT/SDC 10 th July 2013.
The Grid Storage System Deployment Working Group 6 th February 2007 Flavia Donno IT/GD, CERN.
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
SAM Status Update Piotr Nyczyk LCG Management Board CERN, 5 June 2007.
Status of gLite-3.0 deployment and uptake Ian Bird CERN IT LCG-LHCC Referees Meeting 29 th January 2007.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) HEPIX, BNL 13 Oct 2015.
The HEPiX IPv6 Working Group David Kelsey (STFC-RAL) EGI OMB 19 Dec 2013.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
Acronyms GAS - Grid Acronym Soup, LCG - LHC Computing Project EGEE - Enabling Grids for E-sciencE.
INFN/IGI contributions Federated Clouds Task Force F2F meeting November 24, 2011, Amsterdam.
Outcome should be a documented strategy Not everything needs to go back to square one! – Some things work! – Some work has already been (is being) done.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Status of ARGUS support Peter Solagna – EGI.eu.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI GLUE 2: Deployment and Validation Stephen Burke egi.eu EGI OMB March 26 th.
WLCG Operations Coordination report Maria Dimou Andrea Sciabà IT/SDC On behalf of the WLCG Operations Coordination team GDB 12 th November 2014.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
ALICE WLCG operations report Maarten Litmaath CERN IT-SDC ALICE T1-T2 Workshop Torino Feb 23, 2015 v1.2.
HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP33 Ambleside 22 Aug 2014.
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
WLCG IPv6 deployment strategy
OpenSSL and Java 7 vs. 512-bit proxy keys
gLite->EMI2/UMD2 transition
Andrea Manzi, Oliver Keeble
CREAM Status and Plans Massimo Sgaravatto – INFN Padova
Grid status ALICE Offline week Nov 3, Maarten Litmaath CERN-IT v1.0
DPM releases and platforms status
Operations Officer, EGI
Andrea Manzi, Oliver Keeble
Presentation transcript:

MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons Maarten Litmaath On behalf of the WG participants GDB 09/09/2015

Outline The Products The VOs workflows The Volunteer Sites Verification results EGI Liaisons The Software Next Steps MW Readiness WG Update2

The Products tested DPM (ATLAS and CMS) CREAM CE (ATLAS and CMS) dCache (ATLAS and CMS) StoRM (ATLAS) EOS (CMS) HTCondor (Condor-g) (ATLAS) ARC-CE (CMS) VOMS-Clients (LHCb) MW Readiness WG Update3 in RED products that started to be tested since the latest GDB report in DEC 2014

The Products yet to test WN ( CMS) Xrootd (CMS) ARGUS, as discussed in the ARGUS Collaboration FTS3, first decided not to be included but IT-SDC would like to some MW readiness tests are done using the FTS3 CERN pilot Any others? Readiness#Product_Teams MW Readiness WG Update4

The VOs ATLAS, workflow made for this effortworkflow – Panda jobs and Rucio transfers CMS, workflow made for this effortworkflow – HC jobs and Phedex transfers LHCb, generic document for certificationdocument ALICE, not yet… MW Readiness WG Update5

The Volunteer Sites Edinburgh (DPM for ATLAS) INFN-Napoli (CREAM CE for ATLAS ) Triumf & NDGF (dCache for ATLAS) QMUL & CNAF (StoRM for ATLAS) GRIF (DPM and WN for CMS ) Legnaro (CREAM CE for CMS) PIC (dCache for CMS) CERN (EOS for CMS and ARGUS - pending) Brunel (ARC-CE for CMS) MW Readiness WG Update6 in RED the sites that started testing after the latest GDB report in DEC 2014

Versions recently verified for ATLAS DPM-Xrootd ( with Xrootd4) OK CREAM-CE : OK dCache 2.10.x at TRIUMF. Issue found on FTS -> SRM interaction, fixed 2.12.x, 2.13.x at NDGF, OK Storm 1.11.[5,9] at QMUL ( issue found on fixed in ) CNAF still some issues when running HC jobs HTCondor ( condor-g) OK MW Readiness WG Update7

Versions recently verified for CMS DPM-Xrootd ( with Xrootd4) OK CREAM-CE : OK dCache , issue reported with Xrootd monitoring plugin EOS 0.7.x OK ARC-CE 15.0 : issue found with WMS submission by Brunel ( not more relevant to WLCG) MW Readiness WG Update8

Versions recently Verified for LHCb voms-clients One option in voms-proxy-init compared to the v2 was not working and it could not be integrated within LHCb framework should have fix this, waiting for feedback MW Readiness WG Update9

Argus During the Argus collaboration meeting in December it has been agreed to perform Argus scalability tests in the context of Middleware Readiness WG Mainly to understand what is causing strange service outages under high load It has been agreed with CERN T0 to have a testbed installed here The Argus collaboration has then discussed to wait for a new CentOS7 version of the service running with JAVA 8 – newer versions of Java and other 3rd party dependencies might already have issues fixed that are relevant to Argus 10 MW Readiness WG Update

CentOS 7 We started collecting info from the different volunteer sites about their availability in testing CentOS 7 versions of MW. GRIF and Edinburgh are available for now We are waiting for the first versions certified by the PTs – At the moment we know that FTS3, DM Clients, dCache should be ready We have also to discuss with the Experiment representatives the priorities of Products to validate, because with a new platform in theory all MW components have to be verified.. 11 MW Readiness WG Update

MW Readiness and EGI – Quite good communication with EGI via participation in URT meetings – Some sites participating in MW readiness are also reporting to EGI their verifications ( e.g. QMUL and Brunel) – We already discussed to work together for CentOS7 MW verification as they are preparing the first release ( UMD4) 12 MW Readiness WG Update

Software 13 MW Readiness WG Update Pakiti client MWR Package DB MWR APP MWR APP YUM repos SSB Site Admins MW officer

Pakiti-client Software responsible for publishing packages installed on a node to a set of Collectors. MWR Package DB Pakiti Server 2.0 adapted by EGI CSIRT The new version supporting the "tag" option is in EPEL the next version will support Debian, FreeBSD and OpenBSD thanks to patches submitted through GitHub no release date yet since these are not a priority for MWR 14 MW Readiness WG Update

MWR Package Database The service has been moved to new virtual machines ( puppet based) the package database fully supports the "tag" option default is "UNKNOWN" if not given via the Pakiti client should be set to "MWR" for volunteer sites' machines, and to “PROD” for future production nodes the instructions on Twiki have been updated to use the new VMs ePackageReporter ePackageReporter 15 MW Readiness WG Update

Pakiti client related actions for sites for all volunteer sites: upgrade the pakiti-client rpm (yum update) change the configuration file to use the new service set the tag by adding "--tag MWR" to Pakiti client's invocation switch off the old VMs when everybody has moved Deadline was the end of August ( waiting for few sites ) 16 MW Readiness WG Update

MW Readiness App v0.1 The software responsible for storing and accessing information about MW Products, baselines, Products version and related packages, input for SSB metrics Production Instance is available This first version offers WLCG MW products and baseline management Admin interface (local accounts for now, integration with SSO next) REST APIs 17 MW Readiness WG Update

Baselines/Product Views 18 MW Readiness WG Update

Admin Views 19 MW Readiness WG Update

REST-API HTML View – – Basic CRUD operations Non CRUD ops examples: Get current baseline for a product – curl ' readiness.cern.ch/apis/products/DPM/get_baselin e/’ 20 MW Readiness WG Update

Integration with PK DB updated to make use of the TAG information: – To distinguish packages installed on the production and MWR nodes Correct handling of platforms/ packages repositories type ( yum or url) New View to show the Product versions installed at the Site (based on the packages installed) – New View to show the MW Product verification reports – MW Readiness App v0.2 MW Readiness WG Update

SSB Integration: – It calculates for each site running the given product, the percentage of hosts running a version >= the current baseline Info cached and polled by SSB metrics – dev.cern.ch/dashboard/request.py/siteview?view =MWR#currentView=MWR dev.cern.ch/dashboard/request.py/siteview?view =MWR#currentView=MWR 22 MW Readiness App v0.2 MW Readiness WG Update

New View to give site managers access to the packages installed at their site hosts (access based on certificates) is almost there Similar to what Pakiti server does dev.cern.ch/sitedetail/CERN-DPM-TESTBED/ dev.cern.ch/sitedetail/CERN-DPM-TESTBED/ Access given to a predefined list of DNs. TODO : gather info from GOCDB in order to give access to Site Managers only to their host packages information 23 MW Readiness App v0.2 MW Readiness WG Update

Software : What’s next 24 Perform MW app internal WG testing Complete SSO integration Start inputting data ( i.e. packages from repositories) Give access to Volunteer sites’ managers Based on the testing activity decide then the deployment in production – Pakiti-client in WLCG – MW app dedicated to WLCG prod MW Readiness WG Update

WG Desires / Next steps More Volunteer site participation in the meetings More involvement from LHCb & ALICE Note the next meeting date 2015/09/16 4pm CEST AgendaAgenda Check our twiki and Jira tracker (also its dashboard view)twiki Jira tracker dashboard view MW Readiness WG Update25

26 Questions? mwreadiness-app MW Readiness WG Update