Deployment & Advanced Regular Testing Strategies

Slides:



Advertisements
Similar presentations
Chapter 3: Planning a Network Upgrade
Advertisements

Nagios Integration January , perfSONAR-PS Developers Meeting Jason Zurawski, Internet2 Brian Tierney, ESnet.
Networks Unit 3 & 4 IP&M JEOPARDY Acknowledgements: VITTA for the jeopardy pro-forma Mark Kelly’s Network slideshow.
Network Time Protocol (NTP) August 9 th 2011, OSG Site Admin Workshop Jason Zurawski – Internet2 Research Liaison.
Network+ Guide to Networks, Fourth Edition
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
This is the way an organisation distributes the data across its network. It uses different types of networks to communicate the information across it.
Network+ Guide to Networks, Fourth Edition Chapter 1 An Introduction to Networking.
1 ESnet Network Measurements ESCC Feb Joe Metzger
COMP1321 Digital Infrastructure Richard Henson February 2014.
Computer Networks. A computer network is defined as the interconnection of 2 or more independent computers or/and peripherals. Computer Network.
Internet2 Network Observatory Update Matt Zekauskas, Measurement SIG 2006 Fall Member Meeting 4-Dec-2006.
Summary - Part 2 - Objectives The purpose of this basic IP technology training is to explain video over IP network. This training describes how video can.
PerfSONAR-PS Functionality February 11 th 2010, APAN 29 – perfSONAR Workshop Jeff Boote, Assistant Director R&D.
© 2002 Global Knowledge Network, Inc. All rights reserved. Windows Server 2003 MCSA and MCSE Upgrade Clustering Servers.
WLCG Latency Mesh Comments + – It can be done, works consistently and already provides useful data – Latency mesh stable, once configured sonars are stable.
TCP Traffic Characteristics—Deep buffer Switch
GEMINI: Active Network Measurements Martin Swany, Indiana University.
DICE Diagnostic Service Joe Metzger Joint Techs Measurement Working Group January
Day 13 Intro to MANs and WANs. MANs Cover a larger distance than LANs –Typically multiple buildings, office park Usually in the shape of a ring –Typically.
Connect communicate collaborate Performance Metrics & Basic Tools Robert Stoy, DFN EGI TF, Madrid September 2013.
Advanced Network Diagnostic Tools Richard Carlson EVN-NREN workshop.
Campana (CERN-IT/SDC), McKee (Michigan) 16 October 2013 Deployment of a WLCG network monitoring infrastructure based on the perfSONAR-PS technology.
Redundancy. Single point of failure Hierarchical design produces many single points of failure Redundancy provides alternate paths, but may undermine.
1 Network Measurement Challenges LHC E2E Network Research Meeting October 25 th 2006 Joe Metzger Version 1.1.
GGF 17 - May, 11th 2006 FI-RG: Firewall Issues Overview Document update and discussion The “Firewall Issues Overview” document.
Chapter 3 Part 1 Switching and Bridging
Troubleshooting Ben Fineman,
Featrues of Compputer Networks
Instructor Materials Chapter 1: LAN Design
Module 9: Preparing to Administer a Server
Bentley Systems, Incorporated
Lecture (2).
Asynchronous Transfer Mode
Report from WLCG Workshop 2017: WLCG Network Requirements GDB - CERN 12th of July 2017
Frame Relay lab1.
Instructor Materials Chapter 9: Testing and Troubleshooting
Optical-Based Switching Solutions
Semester 4 - Chapter 3 – WAN Design
Introduction & Motivation
Networking for the Future of Science
What is Fibre Channel? What is Fibre Channel? Introduction
The pScheduler Command-Line Interface
Network Performance - Theory
Monitoring the US ATLAS Network Infrastructure with perfSONAR-PS
Monitoring Appliance Status
Objectives Differentiate between the different editions of Windows Server 2003 Explain Windows Server 2003 network models and server roles Identify concepts.
MONITORING MICROSOFT WINDOWS SERVER 2003
Introduction & Motivation
Lecture 6: TCP/IP Networking By: Adal Alashban
Introduction to Networks
Chapter 16: Distributed System Structures
Installation & Basic Configuration
Basic Configuration & Deployment
Installation & Basic Configuration
Technical Communication
ESnet Network Measurements ESCC Feb Joe Metzger
Introduction & Motivation
An Introduction to Computer Networking
Deployment & Advanced Regular Testing Strategies
Computer networking In this section of notes you will learn the rudiments of networking, the components of a network and how to secure a network.
Good Things Come In Small Packages
Big-Data around the world
Unit 11- Computer Networks
Network Performance Measurement
Network+ Guide to Networks, Fourth Edition
Performance Measuring & Monitoring
Internet2 E2E piPEs Project
Module 9: Preparing to Administer a Server
Overview of Networking
Presentation transcript:

Deployment & Advanced Regular Testing Strategies JISC perfSONAR training Antoine Delvaux, PSNC, antoine.delvaux@man.poznan.pl Ivan Garnizov, FAU – DFN, ivan.garnizov@fau.de 22-23 November 2017 This document is a result of work by the perfSONAR Project (http://www.perfsonar.net) and is licensed under CC BY-SA 4.0 (https://creativecommons.org/licenses/by-sa/4.0/). November 22, 2017 © 2017, http://www.perfsonar.net

Importance of Regular Testing We can’t wait for users to report problems and then fix them (soft failures can go unreported for years!) Things just break sometimes Failing optics Somebody messed around in a patch panel and kinked a fiber Hardware goes bad Problems that get fixed have a way of coming back System defaults come back after hardware/software upgrades New employees may not know why the previous employee set things up a certain way and back out fixes Important to continually collect, archive, and alert on active throughput test results Soft failures are everywhere – and they may hide other soft failures November 22, 2017 © 2017, http://www.perfsonar.net

© 2017, http://www.perfsonar.net Performance History November 22, 2017 © 2017, http://www.perfsonar.net

MaDDash: http://ps-dashboard.es.net This is a clean dashboard image – note to people that the dashboard they look at ‘live’ may not be clean. This is ok – dashboards are meant to find problems, not be green all the time. November 22, 2017 © 2017, http://www.perfsonar.net

© 2017, http://www.perfsonar.net Regular Testing There are a couple of ways to do this: Beacon: Let others test to you (e.g. no regular configuration is needed) Island: Pick some hosts to test to – you store the data locally. No coordination with others is needed Mesh: full coordination between you and others (e.g. consume a testing configuration that includes tests to everyone, and incorporate into a visualization) Most people will use this way of testing instead of the mesh November 22-23, 2017 © 2017, http://www.perfsonar.net

Regular Testing - Beacon The beacon setup is typically employed by a network provider (regional, backbone, exchange point) A service to the users (allows people to test into the network) Can be configured with Layer 2 connectivity if needed If no regular tests are scheduled, minimum requirements for local storage. Makes the most sense to enable all services (bandwidth and latency) November 22-23, 2017 © 2017, http://www.perfsonar.net

Regular Testing - Island The island setup allows a site to test against any number of the 1200+ perfSONAR nodes around the world, and store the data locally. No coordination required with other sites Allows a view of near horizon testing (e.g. short latency – campus, regional) and far horizon (backbone network, remote collaborators). OWAMP is particularly useful for determining packet loss in the previous cases. Throughput will not be as valuable when the latency is small November 22-23, 2017 © 2017, http://www.perfsonar.net

© 2017, http://www.perfsonar.net Regular Testing - Mesh A full mesh requires more coordination: A full mesh means all hosts involved are running the same test configuration A partial mesh could mean only a small number of related hosts are running a testing configuration In either case – bandwidth and latency will be valuable test cases November 22-23, 2017 © 2017, http://www.perfsonar.net

© 2017, http://www.perfsonar.net Develop a Test Plan What are you going to measure? Achievable bandwidth 2-3 regional destinations 4-8 important collaborators 4-8 times per day to each destination 20 second tests within a region, longer across oceans and continents Loss/Availability/Latency OWAMP: ~10-20 collaborators over diverse paths Interface Utilization & Errors (via SNMP) What are you going to do with the results? NAGIOS Alerts Reports to user community MaDDash On bandwidth – make note that localized (e.g. in the LAN or metro) bandwidth tests don’t give you much and may case loss themselves. Test bandwidth to things further out to get the most out of the measurement. Packet loss should be done everywhere (especially LAN/MAN distances) November 22, 2017 © 2017, http://www.perfsonar.net

perfSONAR Deployment Locations Critical to deploy near key resources such as DTNs More perfSONAR hosts allow segments of the path to be tested separately Reduced visibility for devices between perfSONAR hosts Must rely on counters or other means where perfSONAR can’t go Effective test methodology derived from protocol behavior TCP suffers much more from packet loss as latency increases TCP is more likely to cause loss as latency increases Testing should leverage this in two ways Design tests so that they are likely to fail if there is a problem Mimic the behavior of production traffic as much as possible Note: don’t design your tests to succeed The point is not to “be green” even if there are problems The point is to find problems when they come up so that the problems are fixed quickly A lot of people want to prove that they are not the problem, and design their tests to succeed. This is not helpful. November 22, 2017 © 2017, http://www.perfsonar.net

Sample Site Deployment Place nodes where the people are located. End of the 14:00 session. November 22, 2017 © 2017, http://www.perfsonar.net

MaDDash, Archives and other considerations November 22, 2017 © 2017, http://www.perfsonar.net

MaDDash & The Mesh Config Measurement results are more useful when they can be “seen”, because this implies they will be acted on. MaDDash is a software package that can be used to visualize the results of many perfSONAR tests The Mesh Config is a way to manage multiple nodes, by giving them a uniform testing schedule E.g. this is in contrast to the other method of configuration the “Island” model Changes node from ‘testing as an island’ to being a part of a larger testing strategy More at: http://docs.perfsonar.net/manage_choose.html November 22, 2017 © 2017, http://www.perfsonar.net

© 2017, http://www.perfsonar.net Types of archives Local archiving Default configuration Need disk space on the perfSONAR node Centralised archiving Need to be setup and managed Disk space requirements depending on the number of nodes Need to grant access to all nodes for archiving November 22, 2017 © 2017, http://www.perfsonar.net

© 2017, http://www.perfsonar.net NTP Note that it may take a day to fully stabilize the clock Pick 4 – 5 Close servers for NTP We have a fast way to do this, or you can manually select Can also add your own servers if you don’t like ours November 22, 2017 © 2017, http://www.perfsonar.net

© 2017, http://www.perfsonar.net NTP Services like BWCTL and OWAMP require a stable time source. The toolkit interface allows you to configure your own host, or choose a public one we know of. November 22, 2017 © 2017, http://www.perfsonar.net

Deployment & Advanced Regular Testing Strategies JISC perfSONAR training Antoine Delvaux, PSNC, antoine.delvaux@man.poznan.pl Ivan Garnizov, FAU – DFN, ivan.garnizov@fau.de 22-23 November 2017 This document is a result of work by the perfSONAR Project (http://www.perfsonar.net) and is licensed under CC BY-SA 4.0 (https://creativecommons.org/licenses/by-sa/4.0/). November 22, 2017 © 2017, http://www.perfsonar.net