December 2012 - GDB Summary See also: https://www.gridpp.ac.uk/wiki/GDB_12th_December_2012 Jeremy’s notes.

Slides:



Advertisements
Similar presentations
WLCG Operations and Tools TEG Monitoring – Experiment Perspective Simone Campana and Pepe Flix Operations TEG Workshop, 23 January 2012.
Advertisements

Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
EGI-Engage Recent Experiences in Operational Security: Incident prevention and incident handling in the EGI and WLCG infrastructure.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
Climate Sciences: Use Case and Vision Summary Philip Kershaw CEDA, RAL Space, STFC.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
LCG and HEPiX Ian Bird LCG Project - CERN HEPiX - FNAL 25-Oct-2002.
Network and Transfer WG Metrics Area Meeting Shawn McKee, Marian Babik Network and Transfer Metrics Kick-off Meeting 26 h November 2014.
PanDA Multi-User Pilot Jobs Maxim Potekhin Brookhaven National Laboratory Open Science Grid WLCG GDB Meeting CERN March 11, 2009.
Workshop summary Ian Bird, CERN WLCG Workshop; DESY, 13 th July 2011 Accelerating Science and Innovation Accelerating Science and Innovation.
European Middleware Initiative (EMI) – Release Process Doina Cristina Aiftimiei (INFN) EGI Technical Forum, Amsterdam 17. Sept.2010.
1 Resource Provisioning Overview Laurence Field 12 April 2015.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GStat 2.0 Joanna Huang (ASGC) Laurence Field.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
Impact of end of EMI+EGI-SA3 April 2013: EMI project finishes EGI-Inspire-SA3 finishes (mainly CERN affected) EGI-Inspire continues until April 2014 EGI.eu.
Virtualised Worker Nodes Where are we? What next? Tony Cass GDB /12/12.
WLCG operations A. Sciabà, M. Alandes, J. Flix, A. Forti WLCG collaboration workshop July , Barcelona.
1 User Analysis Workgroup Discussion  Understand and document analysis models  Best in a way that allows to compare them easily.
INFSO-RI Enabling Grids for E-sciencE Enabling Grids for E-sciencE Pre-GDB Storage Classes summary of discussions Flavia Donno Pre-GDB.
Your university or experiment logo here The European Landscape John Gordon GridPP24 RHUL 15 th April 2010.
Information System Status and Evolution Maria Alandes Pradillo, CERN CERN IT Department, Grid Technology Group GDB 13 th June 2012.
Security Policy Update David Kelsey UK HEP Sysman, RAL 1 Jul 2011.
Handling ALARMs for Critical Services Maria Girone, IT-ES Maite Barroso IT-PES, Maria Dimou, IT-ES WLCG MB, 19 February 2013.
HEPiX IPv6 Working Group David Kelsey GDB, CERN 11 Jan 2012.
Report from the WLCG Operations and Tools TEG Maria Girone / CERN & Jeff Templon / NIKHEF WLCG Workshop, 19 th May 2012.
2012 Objectives for CernVM. PH/SFT Technical Group Meeting CernVM/Subprojects The R&D phase of the project has finished and we continue to work as part.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
EMI INFSO-RI SA1 Session Report Francesco Giacomini (INFN) EMI Kick-off Meeting CERN, May 2010.
European Middleware Initiative (EMI) The Software Engineering Model Alberto Di Meglio (CERN) Interim Project Director.
WLCG Software Lifecycle First ideas for a post EMI approach 0.
LCG Support for Pilot Jobs John Gordon, STFC GDB December 2 nd 2009.
Plans for Service Challenge 3 Ian Bird LHCC Referees Meeting 27 th June 2005.
Julia Andreeva on behalf of the MND section MND review.
EMI INFSO-RI European Middleware Initiative (EMI) Alberto Di Meglio (CERN)
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
Ian Bird Overview Board; CERN, 8 th March 2013 March 6, 2013
1 Update at RAL and in the Quattor community Ian Collier - RAL Tier1 HEPiX FAll 2010, Cornell.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
WLCG Operations Coordination Andrea Sciabà IT/SDC 10 th July 2013.
LCG Issues from GDB John Gordon, STFC WLCG MB meeting September 28 th 2010.
Components Selection Validation Integration Deployment What it could mean inside EGI
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
ATLAS Distributed Computing ATLAS session WLCG pre-CHEP Workshop New York May 19-20, 2012 Alexei Klimentov Stephane Jezequel Ikuo Ueda For ATLAS Distributed.
WLCG Status Report Ian Bird Austrian Tier 2 Workshop 22 nd June, 2010.
SAM Status Update Piotr Nyczyk LCG Management Board CERN, 5 June 2007.
Ian Bird LCG Project Leader Status of EGEE  EGI transition WLCG LHCC Referees’ meeting 21 st September 2009.
WLCG Operations Coordination and Commissioning Maria Girone, CERN IT On behalf of the Operations Coordination Team 11 th March OSG All Hands Meeting,
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
Setting up NGI operations Ron Trompert EGI-InSPIRE – ROD teams workshop1.
Outcome should be a documented strategy Not everything needs to go back to square one! – Some things work! – Some work has already been (is being) done.
WLCG Operations Coordination report Maria Dimou Andrea Sciabà IT/SDC On behalf of the WLCG Operations Coordination team GDB 12 th November 2014.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Site notifications with SAM and Dashboards Marian Babik SDC/MI Team IT/SDC/MI 12 th June 2013 GDB.
Maria Alandes Pradillo, CERN Training on GLUE 2 information validation EGI Technical Forum September 2013.
The HEPiX Virtualisation Working Group Towards a Grid of Clouds Tony Cass CHEP 2012 May 24 th 2012.
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
EGI-InSPIRE RI EGI-InSPIRE RI EGI-InSPIRE Software provisioning and HTC Solution Peter Solagna Senior Operations Manager.
Operations Coordination Team Maria Girone, CERN IT-ES GDB, 11 July 2012.
Ian Bird, CERN WLCG Project Leader Amsterdam, 24 th January 2012.
WLCG IPv6 deployment strategy
Update from the HEPiX IPv6 WG
WLCG Collaboration Workshop;
Connecting the European Grid Infrastructure to Research Communities
Leigh Grundhoefer Indiana University
Unsupported middleware migration update
Presentation transcript:

December GDB Summary See also: Jeremy’s notes

Operations Coordination Team Link: IGTF would like CAs to move from SHA-1 to SHA-2 signatures ASAP, to anticipate concerns about the long- term safety of the former. Overview of status: Middleware: EMI-2 SL5 WNs deployment is proceeding at WLCG sites. Test queues page Strong warning against upgrading to SL6. It has not yet been validated by ATLAS. SL6 is OK for ALICE, CMS and LHCb. Glexec: The "ops" gLExec SAM test has been added to the EGI ROC_OPERATORS profile => failures raise alarms in the EGI operations dashboard. 1st Goal: EGI ROD teams to try and get the relevant sites in their regions to reach 75% availability (Dec-Jan). Status = Tracking tools :E.g. savannah-GGUS bridge technical issues and savannah-JIRA migration. FTS3: Functional tests for the "FTS2-like configuration" ongoing for ATLAS and CMS (RAL, ASGC and CERN) Squid monitoring: Objective: since squids are multipurpose and multi-VO, move monitoring to WLCG responsibility and integrate with common WLCG operations. ATLAS/CMS ( move to Aggregated Topology Provider (ATP). CVMFS: Used for - Software installation, conditions data, nightly build releases beta includes (last resort) NFS support. All expts. want full CVMFS deployment – target 30 th April. Current status perfSONAR: Mesh tested – script needed at the moment but native in Sites will be asked to move to the mesh configuration. MDM vs PS interoperation not yet clear… backend data is compatible.

EMI-2 deployment Upgrade deadline for Sept. EOL components is 17 th Dec. Jan 31st: all DPM, LFC and WN must be upgraded to EMI or decommissioned From 10 th Dec: ROD teams open GGUS tickets asking for a plan in 10 working days. Alarms not handled properly by ROD, and tickets assigned to unresponsive sites are escalated to NGIs and then COD. EMI-1 probes being developed -> alarms in March. Requirements for Glue 2 More sites publishing glue 2 since November. Profile document For Jan: Integrate glue-validator in SAM. Longer term: Integrate glue-validator in the resource BDII Feedback for ginfo wanted.

Oracle at T1s ATLAS Distributed databases in LS1 and Run2. Oracle + Frontier – 4 T1s continue with present system. Remove HOTDISK space token from CVMFS sites. Take conditions files from CVMFS. Tag evolution using Hadoop/Hbase in place of Oracle backend. CMS Stable Frontier infrastructure for conditions. (Looking at backup) Need Oracle for FTS only at T1s. Have 4 other DBs: PhEDEx; To tracking; Organised processin; conditions.

DPM community + workshop A lot of recent development for DM-Lite but the old interfaces remain unchanged Feedback areas: I/O concurrency lmits; drain performance; hot file replication; storage ‘rebalancing’; metadata consistency utitlies; inter-VO quotas Question on enabling http/xrootd. Should be a couple of hours work. “During a recent DPM workshop at LAL (on 5th December 2012), representatives of several countries met to discuss the future of DPM. Those present recognised the importance of DPM and agreed to establish and contribute effort to a DPM Collaboration with the express aim of driving the DPM project forwards after EMI ends on 31st March The effort indicated as available across the collaboration partners was more than adequate to maintain, develop and support DPM as an efficient storage solution. A collaboration agreement and the assignment of responsibilities will be completed in due course”

Virtualized WNs Virtualized WN : where we are? what next ? HEPiX vwg model (and S/w) endorsed by the EGI federated cloud task force. The HEPiX virtualisation working group was formed to facilitate the instantiation of user-generated virtual machine images at HEPiX (and WLCG) sites. The HEPiX VWG developed a policy that introduced the concept of image endorsers: people who would guarantee that generated images could be used safely at sites. The policy was approved. Now looking at implementations.

How this could be used Central Task Queue Site A Site B Site C Shared Image Repository (VMIC) User VO service Instance requests Commercial cloud Payload pull Image maintainer Cloud bursting Slide courtesy of Ulrich Schwickerath

Software Defined Networks for Big-Data Science TCP is the underlying data transfer protocol Fragile; sensitive to loss; poor deployment of infrastructure Discussed OpenFlow Powerful network abstraction Files / Storage Benefits Simplicity for the end-site Works with off-the-shelf, open-source controller Topology simplification Generic code for the network provider Virtual switch can be layered over optical, routed or switched network elements OpenFlow support needed on edge devices only, core stays same Programmability for applications Allows end-sites to innovate and use the WAN effectively

MW provisioning lifecycle Overall approach has been accepted Most feedback was very constructive – very few comments reflect a significant lack of awareness of emi’s status and work Some major and several minor issues that need clarification: – Role of EGI, open-ended-EMI-consortium, ScienceSoft – What about other sciences? – How to deal with the different middleware stacks? – Where are the gaps? what can we get from EPEL/Fedora what not where will testing resources come from – Organization of level 1,2,3 support and the role of GGUS – Rollback – Meta-Packages – Version management – Repositories (a potential can of worms) – OS Platforms – PTs’ resources for communication and coordination – PTs’ ability to run Pilot services – Release frequency – Configuration Management (YAIM or not YAIM ) – -> Some ‘answers’ given in Markus’s talk. New proposal draft soon… for discussion in Jan.

NDGF Tier1 and NeIC – status update Grid-centric NDGF is evolving into generic Nordic e-Infrastructure WLCG and EGI services are integral part of NeIC NDGF-T1 is kept as a name for the Nordic Tier1 (for practical reasons) It is business as usual, despite re-branding