Network and Transfer WG Metrics Area Meeting Shawn McKee, Marian Babik Network and Transfer Metrics Kick-off Meeting 26 h November 2014.

Slides:



Advertisements
Similar presentations
Nagios Integration January , perfSONAR-PS Developers Meeting Jason Zurawski, Internet2 Brian Tierney, ESnet.
Advertisements

Update on OSG/WLCG perfSONAR infrastructure Shawn McKee, Marian Babik HEPIX Spring Workshop, Oxford 23 rd - 27 th March 2015.
Integrating Network and Transfer Metrics to Optimize Transfer Efficiency and Experiment Workflows Shawn McKee, Marian Babik for the WLCG Network and Transfer.
PerfSONAR in ATLAS/WLCG Shawn McKee, Marian Babik ATLAS Jamboree / Network Section 3 rd December 2014.
Network Performance Measurement Atlas Tier 2 Meeting at BNL December Joe Metzger
Use Cases. Summary Define and understand slow transfers – Identify weak links, narrow down the source – Understand what perfSONAR measurements mean wrt.
CERN IT Department CH-1211 Genève 23 Switzerland t Service Management GLM 15 November 2010 Mats Moller IT-DI-SM.
Integration and Sites Rob Gardner Area Coordinators Meeting 12/4/08.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EG recent developments T. Ferrari/EGI.eu ADC Weekly Meeting 15/05/
Connect communicate collaborate perfSONAR MDM updates: New interface, new possibilities Domenico Vicinanza perfSONAR MDM Product Manager
PerfSONAR Information Services Update Jason Zurawski Feb 2, 2009 Winter Joint Techs 2009, College Station Texas.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP35, Liverpool 11 Sep 2015.
Connect communicate collaborate perfSONAR MDM updates: New interface, new weathermap, towards a complete interoperability Domenico Vicinanza perfSONAR.
Network and Transfer Metrics WG Meeting Shawn McKee, Marian Babik Network and Transfer Metrics WG Meeting 8 th April 2015.
Network and Transfer Metrics WG Meeting Shawn McKee, Marian Babik perfSONAR Operations Sub-group 22 nd October 2014.
Update on OSG/WLCG Network Services Shawn McKee, Marian Babik 2015 WLCG Collaboration Workshop 12 th April 2015.
Connect. Communicate. Collaborate perfSONAR MDM Service for LHC OPN Loukik Kudarimoti DANTE.
MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons 10/12/2014.
Update on WLCG/OSG perfSONAR Infrastructure Shawn McKee, Marian Babik HEPiX Fall 2015 Meeting at BNL 13 October 2015.
Network and Transfer Metrics WG Meeting Shawn McKee, Marian Babik Network and Transfer Metrics WG Meeting 18 h March 2015.
WLCG perfSONAR-PS Update Shawn McKee/University of Michigan WLCG Network and Transfers Metrics Co-Chair Spring 2014 HEPiX LAPP, Annecy, France May 21 st,
WLCG Network and Transfer Metrics WG After One Year Shawn McKee, Marian Babik GDB 4 th November
HEPiX IPv6 Working Group David Kelsey GDB, CERN 11 Jan 2012.
Network and Transfer WG perfSONAR operations Shawn McKee, Marian Babik Network and Transfer Metrics WG Meeting 28 h January 2015.
Update on Network and Transfer Metrics WG Shawn McKee, Marian Babik GDB 8 th October 2014.
Connect communicate collaborate LHCONE Diagnostic & Monitoring Infrastructure Richard Hughes-Jones DANTE Delivery of Advanced Network Technology to Europe.
PerfSONAR Update Shawn McKee/University of Michigan LHCONE/LHCOPN Meeting Cambridge, UK February 9 th, 2015.
WLCG Technical Evolution Group: Operations and Tools Maria Girone & Jeff Templon Kick-off meeting, 24 th October 2011.
Julia Andreeva on behalf of the MND section MND review.
Data Transfer Service Challenge Infrastructure Ian Bird GDB 12 th January 2005.
PerfSONAR for LHCOPN/LHCONE Update Shawn McKee/University of Michigan LHCONE/LHCOPN Meeting Amsterdam, NL October 28 th, 2015.
US LHC Tier-2 Network Performance BCP Mar-3-08 LHC Community Network Performance Recommended BCP Eric Boyd Deputy Technology Officer Internet2.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
WLCG Latency Mesh Comments + – It can be done, works consistently and already provides useful data – Latency mesh stable, once configured sonars are stable.
GEMINI: Active Network Measurements Martin Swany, Indiana University.
Connect. Communicate. Collaborate JRA1 Status Update Stephan Kraft, RRZE FAU Erlangen-Nürnberg JRA1 Montpellier Meeting, October 2006.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
Strawman LHCONE Point to Point Experiment Plan LHCONE meeting Paris, June 17-18, 2013.
David Foster, CERN GDB Meeting April 2008 GDB Meeting April 2008 LHCOPN Status and Plans A lot more detail at:
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
ATLAS Distributed Computing ATLAS session WLCG pre-CHEP Workshop New York May 19-20, 2012 Alexei Klimentov Stephane Jezequel Ikuo Ueda For ATLAS Distributed.
MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons Maarten Litmaath On behalf of the WG participants GDB 09/09/2015.
WLCG Operations Coordination and Commissioning Maria Girone, CERN IT On behalf of the Operations Coordination Team 11 th March OSG All Hands Meeting,
Using Check_MK to Monitor perfSONAR Shawn McKee/University of Michigan North American Throughput Meeting March 9 th, 2016.
HEPiX IPv6 Working Group David Kelsey david DOT kelsey AT stfc DOT ac DOT uk (STFC-RAL) HEPiX, Vancouver 26 Oct 2011.
1 Network related topics Bartosz Belter, Wojbor Bogacki, Marcin Garstka, Maciej Głowiak, Radosław Krzywania, Roman Łapacz FABRIC meeting Poznań, 25 September.
WLCG Operations Coordination news and meeting restructuring Maria Alandes Pradillo Josep Flix Alessandra Forti Andrea Sciabà WLCG operations coordination.
Setting up NGI operations Ron Trompert EGI-InSPIRE – ROD teams workshop1.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) HEPIX, BNL 13 Oct 2015.
The HEPiX IPv6 Working Group David Kelsey (STFC-RAL) EGI OMB 19 Dec 2013.
WLCG Operations Coordination report Maria Dimou Andrea Sciabà IT/SDC On behalf of the WLCG Operations Coordination team GDB 12 th November 2014.
Campana (CERN-IT/SDC), McKee (Michigan) 16 October 2013 Deployment of a WLCG network monitoring infrastructure based on the perfSONAR-PS technology.
Dissemination and User Feedback Castor deployment team Castor Readiness Review – June 2006.
Open Science Grid Configuring RSV OSG Resource & Service Validation Thomas Wang Grid Operations Center (OSG-GOC) Indiana University.
HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP33 Ambleside 22 Aug 2014.
Operations Coordination Team Maria Girone, CERN IT-ES GDB, 11 July 2012.
PerfSONAR operations meeting 3 rd October Agenda Propose changes to the current operations of perfSONAR Discuss current and future deployment model.
T0-T1 Networking Meeting 16th June Meeting
WLCG IPv6 deployment strategy
Shawn McKee, Marian Babik for the
perfSONAR-PS Deployment: Status/Plans
LHCOPN/LHCONE perfSONAR Update
Support for IPv6-only CPU – an update from the HEPiX IPv6 WG
Update from the HEPiX IPv6 WG
Monitoring at a Multi-Site Tier 1
Alerting/Notifications (MadAlert)
Deployment & Advanced Regular Testing Strategies
LHCONE perfSONAR: Status and Plans
WLCG and support for IPv6-only CPU
Presentation transcript:

Network and Transfer WG Metrics Area Meeting Shawn McKee, Marian Babik Network and Transfer Metrics Kick-off Meeting 26 h November 2014

● Status and progress in perfSONAR – T2.1 Commissioning/Operations – T2.2 Storage – T2.3 Configuration ● Metrics area – T1.1: Gather requirements and use cases – T1.2: Review existing transfer and network metrics – T1.3: Determine current test coverage – T1.4: Topology mapping Outline Network Monitoring and Metrics WG Meeting

ALL: Send comments and suggestions on the proposed list of topics/tasks and on the way WG will be organized ALL: Volunteer to lead tasks in the metrics area (T1s) Julia: Send a list of topics concerning xRootD tasks to the WG. To be discussed with WLCG OPS Coordination – Separate meeting held on the topic – agreed to follow up on status of GLED deployment and support Marian: Setup WG JIRA and report to WLCG OPS coordination every 2 weeks on the status of ongoing tasks. – JIRA at – WLCG OPS Reports at s#Reports s#Reports Marian, Shawn: Prepare abstract for CHEP2015 (done) Network Monitoring and Metrics WG Meeting 3 Actions from last meeting

4 Network Monitoring Status perfSONAR Network Monitoring and Metrics WG Meeting

perfSONAR 3.4 released Oct 14 th Restructuring support and operations – Introduced site-level support via GGUS Rewritten documentatio n – Responded to ShellShock and Poodle – Sites advised to terminated their instances – Performed security audit and established security procedures Testing and validation of the new perfSONAR central configuration perfSONAR 3.4 update campaign – Includes migration to the new configuration system – Security considerations documented – Progressing well (111 sonars updated out of 214) – Deadline 8 th January 5 perfSONAR ops Network Monitoring and Metrics WG Meeting

Deployed in OSG production Introduces central interface to reconfigure the entire network – All aspects – tests parameters, mesh participation – List of available sonars taken from GOCDB and OIM – Supports hierarchical support model (per mesh admins) – Web interface – Connected to OSG crawler and perfSONAR infrastructure monitoring Site reconfiguration needed to adopt – Run as part of 3.4 campaign perfSONAR data store – Deployed in OSG ITB – several major issues fixed – Scale tests on-going this week – Operationally ready for production Network Monitoring and Metrics WG Meeting 6 perfSONAR config and store

7 perfSONAR metrics Network Monitoring and Metrics WG Meeting

We gather a number of metrics: – Topology/path-information via traceroute – One-way delay via OWAMP – Packet-loss via OWAMP – Usable bandwidth via BWCTL WLCG perfSONAR coverage – ESnet has some nice pages on using perfSONAR to identify problems – testing/evaluating-network-performance/ testing/evaluating-network-performance/ Some examples from Jason Zurawski follow Network Monitoring and Metrics WG Meeting 8 Metrics and Their Use

Traceroute is fundamental to any of the other metrics. Without it we don’t know what path was being measured. In the toolkit: Network Monitoring and Metrics WG Meeting 9 Traceroute

Network Monitoring and Metrics WG Meeting 10 OWAMP: Overloaded Link Example of a campus with an overloaded uplink to the WAN. You can see daily overloads as campus gets busy each day

Network Monitoring and Metrics WG Meeting 11 Bandwidth (iperf) perfSONAR measures usable bandwidth Bandwidth changes as MTU and window setting adjusted

Network Monitoring and Metrics WG Meeting 12 Bandwidth: Bad vs Good Routing

Network Monitoring and Metrics WG Meeting 13 Drastic BW Change Spikes of packet loss, almost always during business hours Function of the load on the line/time of day This was traced to regional network

The monitoring infrastructure is sensitive for a reason – so that it finds the problems in all layers of the OSI stack. End-to-end data transmission (or just about any other use case) suffers because of a problems that may be unseen or not understood. Understanding comes from learning to use the tools, learning to trust them, and having universal availability. Comprehensive solutions will save time in the end Network Monitoring and Metrics WG Meeting 14 Key Messages about Metrics

15 Metrics Area Network Monitoring and Metrics WG Meeting

Available as Google document – OuvbEHZnZp0XkWkwdkPQTQic0VbH1mc/edit?usp=sha ring Asking for your input – FTS, FAX, PhEDEx, Rucio, PanDA by Dec 5 – Experiments by Dec 12 Next year – Strawman – important to receive initial input on coverage, test characteristics, etc. – Regular meetings next year 28 Jan, 18 Feb, 18 March, 8 Apr (all at 4pm CEST) Network Monitoring and Metrics WG Meeting 16 Questionnaire

Shawn at CERN next week – perfSONAR office on Thursday (4 th Dec) Network Monitoring and Metrics WG Meeting 17 AOB

18 Backup Network Monitoring and Metrics WG Meeting

Network Monitoring and Metrics WG Meeting 19 iperf INFN PIC

Network Monitoring and Metrics WG Meeting 20 owamp+iperf INFN PIC