Presentation is loading. Please wait.

Presentation is loading. Please wait.

WLCG Operations Coordination Andrea Sciabà IT/SDC 10 th July 2013.

Similar presentations


Presentation on theme: "WLCG Operations Coordination Andrea Sciabà IT/SDC 10 th July 2013."— Presentation transcript:

1 WLCG Operations Coordination Andrea Sciabà IT/SDC 10 th July 2013

2 Outline  Status of task forces  News from the WLCG Operations Planning meeting  Experiment plans  New activities  Conclusions 10 July 2013 WLCG Operations Coordination – A. Sciabà 2

3 Middleware  EMI-3 WN and UI already installed for testing at few sites but not yet recommended  Issue with VOMS client (fix available in EMI-3)  Validation ongoing  Under test at Liverpool and DESY  Baseline version in EMI-3 for some services  BDII_top, L&B, StoRM, WMS 10 July 2013 WLCG Operations Coordination – A. Sciabà 3

4 SL6 migration  Deployment status  Total number of Tier1s Done: 6/15 (Alice 4/9, Atlas 4/12, CMS 2/9, LHCb 3/8)  3 of the “not done” are in progress (PIC will finish this week)  Total number of Tier2s Done: 30/124 (Alice 7/39, Atlas 13/86, CMS 18/61, LHCb 8/45)  CVMFS issue with SL6 reported at previous meeting was fixed by a kernel update 10 July 2013 WLCG Operations Coordination – A. Sciabà 4

5 gLExec  The tentative deadline for enabling gLExec at sites was October 1 st  the actual deadline will likely be coupled to the timeline for the WN migration to SL6  ~100 tickets already opened to sites (~10 already solved and verified)  USCMS ~100% OK, USATLAS no plans yet  Deployment status trackedtracked 10 July 2013 WLCG Operations Coordination – A. Sciabà 5

6 SHA-2  New CERN CA certificate available in IGTF  DIRAC services ready for SHA-2  Several ATLAS services tested and ready (AGIS, PanDA, DDM, …)  EGI just started to run SAM tests for SHA-2 compliance of site services  Migration to VOMS-Admin to be carefully planned  VO managers will need time to learn 10 July 2013 WLCG Operations Coordination – A. Sciabà 6

7 CVMFS  Deployment for ALICE has begun  GGUS tickets sent to ALICE sites, already some closed  Issue with latest version (2.1.11), fix soon to be released: sites should install it when available and skip 2.1.11 10 July 2013 WLCG Operations Coordination – A. Sciabà 7

8 FTS-3  RAL server production-ready, CERN very soon  Pilot services also at ASGC, BNL  BNL, KIT, CNAF will deploy production servers  Under discussion at PIC, IN2P3-CC  Next milestones  July: migrate some production transfers to FTS-3 at CERN and RAL in “FTS-2-like mode”  August: gain experience and include other servers 10 July 2013 WLCG Operations Coordination – A. Sciabà 8

9 xrootd  Both AAA and FAX have ~40 sites each  Not all of them produce monitoring information  Almost all needed plugins in the WLCG repository  The dCache one missing, needs some finishing touches  Request to register all xrootd endpoints and redirectors in GOCDB/OIM  Allows to declare downtimes, run ad-hoc SAM tests, etc.  Need to solve an issue with DPM  Only local traffic is monitored 10 July 2013 WLCG Operations Coordination – A. Sciabà 9

10 Tracking tools evolution  Savannah-to-JIRA migration status  Instructions updated Instructions  GGUS tracker transition status updatedstatus  Further development will wait for the upgrade to JIRA 6 this month  Savannah-to-GGUS bridge for CMS being moved to GGUS-only  Progress trackedtracked  Today the new GGUS SU for “Grid monitoring” will be created  Will eventually supersede Dashboard and SAM SUs 10 July 2013 WLCG Operations Coordination – A. Sciabà 10

11 perfSONAR  Version 3.3 released, will be deployed in the next three months on WLCG  Sites are strongly encouraged to upgrade to/install this version  Sites which did not do it already should publish their instances in GOCDB/OIM  Testing the new modular Dashboard, including the API 10 July 2013 WLCG Operations Coordination – A. Sciabà 11

12 ALICE plans  ALICE increasingly committed to CVMFS  AliEn being developed and tested for it  Working on rationalising SAM tests  Import results from MonALISA  Xrootd, VOBOX 10 July 2013 WLCG Operations Coordination – A. Sciabà 12

13 ATLAS plans  Residual need for shared area soon to be eliminated  Simulation validated for multicore  Sites encouraged to deploy more queues  All sites should deploy perfSONAR  All sites should provide WebDAV access for storage management operations (or discuss an alternative with ATLAS) by September  Widely use xrootd for WAN and LAN data access after summer  Main use cases for FAX are fail-over for local access failures and breaking jobs-to-data locality  Russian Proto-T1 is contributing to production (but no tape yet)  Migrate ATLAS central services to OpenStack VMs with SL6 during 2014  Start stress testing RUCIO in July and release first official version of JEDI by end of summer 10 July 2013 WLCG Operations Coordination – A. Sciabà 13

14 CMS plans  Multicore  Pilots can now run several single-threaded CMSSW processes  Commission multi-threaded CMSSW by end 2013  CRAB3-PanDA integration open to beta testers  Xrootd federation  Integrate > 90% of sites by autumn  Disk-tape separation  Start testing in autumn  Opportunistic resources  Non-CMS sites, clouds, HPCs, via Parrot and CVMFS  Interest in using grid.cern.ch for Grid clients 10 July 2013 WLCG Operations Coordination – A. Sciabà 14

15 LHCb plans  Introducing Tier2Ds (with disk storage for analysis jobs)  Looking into using perfSONAR data as quality metric and to choose Tier-2’s for reprocessing campaigns  Start working on algorithms to take decisions based on data popularity metrics  Enhance SAM tests by publishing information from DIRAC  Align strategy with WLCG monitoring consolidation project  By end 2013, new software releases only for SL6 to use C++11 features  T1 sites should provide SL6 resources 10 July 2013 WLCG Operations Coordination – A. Sciabà 15

16 New activities  Just started collaborating with the Hepix IPv6 working group on WLCG application testing  Contribute the site perspective to the new WLCG Monitoring Consolidation project  All monitoring experts from sites welcome to contribute via mailing list  Pepe Flix will represent the OCCT in the project  New task force on Job/Machine Features just launched  Coordinated by Stefan Roiser 10 July 2013 WLCG Operations Coordination – A. Sciabà 16

17 Conclusions  Steady progress for all task forces  Last quarter of 2013 as target date for many of them  Experiment plans focus on common topics  CVMFS adoption  Monitoring (SAM, perfSONAR)  New data management  Tools, storage federations, protocols, etc.  Multicore in production  Virtualisation, clouds, opportunistic resources 10 July 2013 WLCG Operations Coordination – A. Sciabà 17


Download ppt "WLCG Operations Coordination Andrea Sciabà IT/SDC 10 th July 2013."

Similar presentations


Ads by Google