T0 report. WLCG Operations Workshop, Barcelona, 07/07/2014. Maite Barroso, CERN IT. CERN IT Department, CH-1211 Geneva 23, Switzerland, www.cern.ch/it

Presentation transcript:

T0 report
WLCG Operations Workshop, Barcelona, 07/07/2014
Maite Barroso, CERN IT

Outline
  – Facilities
  – Next Linux version
  – Network
  – Cloud
  – Grid and batch services
  – Databases
  – Summary

Facilities
Wigner (Budapest)
  – Additional capacity installed: mainly for OpenStack and EOS, plus some for business continuity of DB services
  – Wigner participated for the first time in the last HEPiX workshop
  – Network to Wigner
      Extensive testing done on the GÉANT 100 Gbps link to identify the source of the flaps observed
      All segments of the link have been tested without errors; the source of the problem is still not identified
      It is possible that cleaning the fibres ahead of the tests has resolved the problem. If not, then only an incompatibility between the Brocade and Alcatel equipment remains as a possible cause.

Linux: next version
Plan: adopt CentOS 7
  – Adding CERN-specific setup via add-on repositories
  – Details: Next Linux version at CERN.pdf
CentOS 7 is approaching release
  – Within a few weeks
We expect to have a CERN-customized test installation available in July/August
CERN's own version certification?
  – Is it still necessary?
  – To be discussed with the Linux Certification Committee

Network (1)
LHCONE
  – Increased CERN LHCONE bandwidth to 30 Gbps (was 20 Gbps)
  – Working on the definition of an LHCONE AUP that guarantees enough security while remaining feasible in practice
  – Organization of the LHCONE Asian workshop is ongoing. It aims to expand LHCONE connectivity to sites in Asia.
LHCOPN
  – Connected the KI and JINR sites of the Russian Tier1s. They have two 10G links to CERN, one via Amsterdam and one via Wigner.
  – Bandwidth to US Tier1s will increase with the upcoming deployment of the ESnet PoP at CERN

Network (2)
IPv6
  – From the network point of view, IPv6 deployment is finished
  – IT services are becoming dual stack. Right now: (smtp, imap, pop, owa), lxplus-ipv6, Ldap web redirection
  – The HEPiX IPv6 WG is testing the IPv6 compliance of WLCG applications, taking advantage of the deployment of IPv6 at CERN
  – CERN, KIT, PIC, NDGF and IN2P3 have IPv6 connectivity over the LHCOPN
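As an illustration of what "dual stack" means for a service, here is a minimal sketch that checks whether a hostname resolves to both IPv4 and IPv6 addresses. It is not taken from the slides: the function names `address_families` and `is_dual_stack`, the default port, and the example hostname are assumptions made for this illustration, and it relies only on Python's standard `socket.getaddrinfo`.

```python
import socket


def address_families(addrinfo):
    """Return the IP families ('IPv4', 'IPv6') present in getaddrinfo-style results."""
    # Each entry is a tuple (family, type, proto, canonname, sockaddr).
    names = {socket.AF_INET: "IPv4", socket.AF_INET6: "IPv6"}
    return {names[entry[0]] for entry in addrinfo if entry[0] in names}


def is_dual_stack(hostname, port=443):
    """True if hostname resolves to at least one IPv4 and one IPv6 address.

    The port is a hypothetical default; resolution does not open a connection.
    """
    try:
        infos = socket.getaddrinfo(hostname, port, proto=socket.IPPROTO_TCP)
    except socket.gaierror:
        # Name does not resolve at all from this host.
        return False
    return address_families(infos) == {"IPv4", "IPv6"}


if __name__ == "__main__":
    # "example.org" is a placeholder, not one of the CERN services listed above.
    print(is_dual_stack("example.org"))
```

Note that this only tests name resolution as seen from the machine running it; a service can publish an AAAA record and still be unreachable over IPv6, so a real compliance test would also attempt a connection over each family.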

Cloud (1)
All components now run the latest Havana-3 release
  – Planning the upgrade to Icehouse
Continues to grow
  – Today: 2800 servers, 7000 VMs, 150 TB volumes
Work in progress:
  – Commissioning resources in Wigner for experiments
      Until now: only batch service
  – SSO, Kerberos integration, accounting with Ceilometer
  – Adding hardware
      Aim: 6000 compute nodes this year

Cloud (2)

Cloud (3): VM provisioning

Services (1)
VOMRS to VOMS-admin migration
  – ATLAS, ALICE, CMS and LHCb still run VOMRS
      We need the new release to migrate these VOs, as they need to sync with the CERN HR DB and this does not work in the current version
      Expected mid-July
  – voms-admin in production for the rest of the VOs (test, ops, geant4, ...)
LFC
  – Decommissioned for ATLAS in early June; all data is kept for the moment
  – In contact with LHCb about the expected end date of their need for an LFC service
FTS: agreed to stop FTS2 on August 1st

Services (2)
Batch: SLC6 migration: SLC5 CEs decommissioned, no grid job submission to SLC5
  – Final migration of SLC5 WNs ongoing
Batch system migration, from LSF to HTCondor
  – Goals: scalability, dynamism, dispatch rate, query scaling
  – Replacement candidates:
      SLURM: feels too young
      HTCondor: mature and promising
      Son of Grid Engine: fast, a bit rough
  – More details of the selection process: /22/material/slides/0.pdf

Services (3)
Batch system migration, from LSF to HTCondor
  – Setting up a pilot, which will be opened to experiments
      Start with 10 nodes, plus a CREAM CE for Condor, for grid submissions
      Work is ongoing on integrating AFS token granting and extension
  – Full capacity test in parallel, ~5000 nodes
  – Close contact with developers
New Squid service:
  – Request from ATLAS for a more generic Squid service covering their needs for Frontier as well as the already covered CVMFS needs
      Implementation will be an extension of the existing service: different alias, same instance

Databases (1)
Oracle version upgrade
  – Majority of DB services upgraded to (including half of the Tier1 sites)
  – Few DB services upgraded to (LHCb offline, ATLARC, COMPASS, LANDB, …)
  – End of 11.2 support in January 2018; looking at moving to 12c gradually
HW and storage evolution
  – New HW installation RAC50 in BARN; migration of production services completed by May
  – New HW installations being prepared: RAC51 in BARN and Wigner (for Disaster Recovery)
  – New generation of storage from NetApp
Integration with the Agile

Databases (2)
Replication evolution
  – Replication Technology Evolution Workshop took place on June 3rd-4th
  – Replication tests from T0 to T1 using production data are ongoing
  – Plan to migrate from Streams to GoldenGate agreed with experiments and Tier0
Database as a Service evolution (DBoD)
  – New HW and storage installations
  – SW upgrades: MySQL (migrating to 5.6) and Oracle (migrating to 12c multi-tenancy)
  – PostgreSQL (version 9.2) since September 2013
More details: Evolution of Database Services, today at 17:10

Summary
Getting experience with recent changes:
  – Wigner
  – Cloud VM provisioning
  – IPv6
And preparing the next ones:
  – Quattor phase out
  – Next Linux version
  – HTCondor
In a continuous feedback loop with the experiments and WLCG