WLCG Operations Coordination
Andrea Sciabà, IT/SDC
10th July 2013

Outline
- Status of task forces
- News from the WLCG Operations Planning meeting
- Experiment plans
- New activities
- Conclusions
10 July 2013 WLCG Operations Coordination – A. Sciabà

Middleware
- EMI-3 WN and UI already installed for testing at a few sites but not yet recommended
  - Issue with VOMS client (fix available in EMI-3)
  - Validation ongoing
  - Under test at Liverpool and DESY
- Baseline version in EMI-3 for some services
  - BDII_top, L&B, StoRM, WMS

SL6 migration
- Deployment status
  - Tier1s done: 6/15 (ALICE 4/9, ATLAS 4/12, CMS 2/9, LHCb 3/8)
    - 3 of the “not done” are in progress (PIC will finish this week)
  - Tier2s done: 30/124 (ALICE 7/39, ATLAS 13/86, CMS 18/61, LHCb 8/45)
- The CVMFS issue with SL6 reported at the previous meeting was fixed by a kernel update

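The overall deployment fractions quoted above can be turned into percentages with a few lines of Python. This is only an illustrative sketch: the counts are copied from the slide, while the names `done`, `percent_done` and `summary` are made up for the example.

```python
# Overall (done, total) site counts copied from the slide.
done = {"Tier1": (6, 15), "Tier2": (30, 124)}


def percent_done(done_count, total):
    """Return the share of migrated sites as a percentage."""
    return 100.0 * done_count / total


# Round to one decimal place for reporting.
summary = {tier: round(percent_done(d, t), 1) for tier, (d, t) in done.items()}

for tier, pct in summary.items():
    print(f"{tier}: {pct}% of sites migrated to SL6")
```

With these counts, 40% of the Tier1s and about 24% of the Tier2s had completed the migration.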
gLExec
- The tentative deadline for enabling gLExec at sites was October 1st
  - The actual deadline will likely be coupled to the timeline for the WN migration to SL6
- ~100 tickets already opened to sites (~10 already solved and verified)
- USCMS ~100% OK, USATLAS no plans yet
- Deployment status tracked

SHA-2
- New CERN CA certificate available in IGTF
- DIRAC services ready for SHA-2
- Several ATLAS services tested and ready (AGIS, PanDA, DDM, …)
- EGI just started to run SAM tests for SHA-2 compliance of site services
- Migration to VOMS-Admin to be carefully planned
  - VO managers will need time to learn

CVMFS
- Deployment for ALICE has begun
  - GGUS tickets sent to ALICE sites, some already closed
- Issue with the latest version (2.1.11), fix soon to be released
  - Sites should install the fixed version when available and skip 2.1.11

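The "skip 2.1.11" recommendation could be encoded in a site's upgrade tooling roughly as follows. This is a hedged sketch, not an official procedure: it assumes that CVMFS versions are dotted integers and that the fix will ship under a higher version number than 2.1.11. The helper names and the example version "2.1.12" are hypothetical.

```python
def parse_version(text):
    """Turn a dotted version string such as '2.1.11' into a comparable tuple."""
    return tuple(int(part) for part in text.split("."))


# The release the slide tells sites to skip.
AFFECTED = parse_version("2.1.11")


def should_install(candidate):
    """Accept only releases strictly newer than the affected one (assumption)."""
    return parse_version(candidate) > AFFECTED


# "2.1.12" is a placeholder; the slide does not name the fixed release.
for version in ("2.1.11", "2.1.12"):
    print(version, "install" if should_install(version) else "skip")
```

Comparing integer tuples rather than raw strings avoids the pitfall where "2.1.9" would sort after "2.1.11" lexically.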
FTS-3
- RAL server production-ready, CERN very soon
- Pilot services also at ASGC, BNL
- BNL, KIT, CNAF will deploy production servers
- Under discussion at PIC, IN2P3-CC
- Next milestones
  - July: migrate some production transfers to FTS-3 at CERN and RAL in “FTS-2-like mode”
  - August: gain experience and include other servers

xrootd
- Both AAA and FAX have ~40 sites each
  - Not all of them produce monitoring information
- Almost all needed plugins are in the WLCG repository
  - The dCache one is missing and needs some finishing touches
- Request to register all xrootd endpoints and redirectors in GOCDB/OIM
  - Allows sites to declare downtimes, run ad-hoc SAM tests, etc.
- Need to solve an issue with DPM
  - Only local traffic is monitored

Tracking tools evolution
- Savannah-to-JIRA migration status
  - Instructions updated
  - GGUS tracker transition status updated
  - Further development will wait for the upgrade to JIRA 6 this month
- Savannah-to-GGUS bridge for CMS being moved to GGUS-only
  - Progress tracked
- Today the new GGUS SU for “Grid monitoring” will be created
  - Will eventually supersede Dashboard and SAM SUs

perfSONAR
- Version 3.3 released, will be deployed on WLCG in the next three months
  - Sites are strongly encouraged to install or upgrade to this version
  - Sites which have not yet done so should publish their instances in GOCDB/OIM
- Testing the new modular Dashboard, including the API

ALICE plans
- ALICE increasingly committed to CVMFS
  - AliEn being developed and tested for it
- Working on rationalising SAM tests
  - Import results from MonALISA
  - Xrootd, VOBOX

ATLAS plans
- Residual need for shared area soon to be eliminated
- Simulation validated for multicore
  - Sites encouraged to deploy more queues
- All sites should deploy perfSONAR
- All sites should provide WebDAV access for storage management operations (or discuss an alternative with ATLAS) by September
- Widely use xrootd for WAN and LAN data access after summer
  - Main use cases for FAX are fail-over for local access failures and breaking jobs-to-data locality
- Russian Proto-T1 is contributing to production (but no tape yet)
- Migrate ATLAS central services to OpenStack VMs with SL6 during 2014
- Start stress testing RUCIO in July and release first official version of JEDI by end of summer

CMS plans
- Multicore
  - Pilots can now run several single-threaded CMSSW processes
  - Commission multi-threaded CMSSW by end 2013
- CRAB3-PanDA integration open to beta testers
- Xrootd federation
  - Integrate > 90% of sites by autumn
- Disk-tape separation
  - Start testing in autumn
- Opportunistic resources
  - Non-CMS sites, clouds, HPCs, via Parrot and CVMFS
- Interest in using grid.cern.ch for Grid clients

LHCb plans
- Introducing Tier2Ds (with disk storage for analysis jobs)
- Looking into using perfSONAR data as a quality metric and to choose Tier-2s for reprocessing campaigns
- Start working on algorithms to take decisions based on data popularity metrics
- Enhance SAM tests by publishing information from DIRAC
  - Align strategy with WLCG monitoring consolidation project
- By end 2013, new software releases only for SL6, to use C++11 features
  - T1 sites should provide SL6 resources

New activities
- Just started collaborating with the HEPiX IPv6 working group on WLCG application testing
- Contribute the site perspective to the new WLCG Monitoring Consolidation project
  - All monitoring experts from sites welcome to contribute via mailing list
  - Pepe Flix will represent the OCCT in the project
- New task force on Job/Machine Features just launched
  - Coordinated by Stefan Roiser

Conclusions
- Steady progress for all task forces
  - Last quarter of 2013 as target date for many of them
- Experiment plans focus on common topics
  - CVMFS adoption
  - Monitoring (SAM, perfSONAR)
  - New data management
    - Tools, storage federations, protocols, etc.
  - Multicore in production
  - Virtualisation, clouds, opportunistic resources