Networking for the Future of Science

Presentation transcript:

perfSONAR Software for LHC Network Measurement
Joe Metzger
Energy Sciences Network, Lawrence Berkeley National Laboratory
July 27, 2007
LHCOPN Meeting at SARA in Amsterdam

Disclaimer
The information in this presentation represents my understanding of the growing consensus within the perfSONAR measurement community. It may not represent the views of all of the interested parties (LHC scientists, T[0123] center operators, NRENs, backbone networks, GigaPoPs, campus network operators, etc.).
The consensus-building process has started, but:
  It will take a long time to get consensus on the requirements, and even longer on the tools.
  The LHC community needs measurement infrastructure now.

Measurement Infrastructure Scope
The tools must meet the needs of the whole LHC community
  Tier 0
    Tier 0 to Tier 1 networks (LHCOPN)
  Tier 1s
    Tier 0 to Tier 1 link(s)
    Tier 1 to other Tier 1 network
    Tier 1 to Tier 2 & 3 network
  Tier 2s
    Tier 1 to Tier 2 network
  Tier 3s
    Tier 1 to Tier 3 network
    Tier 2 to Tier 3 network
The tools must meet the needs of other application communities
  The backbone networks expect to use the same tool set for other user communities.
Not all requirements are satisfied by existing tools.
  Additional development and a phased deployment will be necessary.

High Level Requirements
Allow distinguishing between application and network problems.
Quickly identify network problems, even if they are not currently affecting applications.
Identify and react to changes in the underlying network.
Allow application managers to understand and react to changes in topology & capacity and re-tune applications as necessary.
Provide the network data necessary to correlate application performance changes with network topology changes.
Eventually allow applications to automatically react to network changes (network awareness).
The infrastructure & tools used to support the LHCOPN must be able to simultaneously support the Tier 1 centers and other application communities as well.
Provide access to capacity and topology trend data.
Support application managers' network capacity planning.
Facilitate identifying and quantifying when the LHC experiment traffic is impacting other users.

Technical Requirements: Circuit Status
Monitor up/down status of cross domain circuits
What
  Determine the status of a circuit
  Publish status via a web services interface
  Provide tools to visualize state
  Generate NOC alarms when circuits change states
Why
  Determine when circuits are available
  Simplify debugging of end to end circuit problems

Technical Requirements: Interface Statistics
Monitor Link/Circuit Capacity, Utilization & Errors
What
  Publish statistics via a web services interface
  Provide tools to visualize the data
  Generate NOC alarms when thresholds are crossed
Why
  Allow determining usage patterns
  Simplify throughput problem diagnosis
  Capacity planning
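As an illustration of the utilization and threshold-alarm requirement, the following is a minimal sketch that is not taken from the slides. It assumes the archive stores raw octet counters sampled at a fixed polling interval; the function names, metric names, and threshold values are hypothetical and do not correspond to any perfSONAR service API.

```python
def utilization_percent(octets_before, octets_after, interval_s, capacity_bps,
                        counter_bits=64):
    """Average link utilization over one polling interval, as a percentage.

    Assumes at most one counter wrap between the two samples.
    """
    if interval_s <= 0 or capacity_bps <= 0:
        raise ValueError("interval and capacity must be positive")
    delta = octets_after - octets_before
    if delta < 0:  # the counter wrapped around between samples
        delta += 2 ** counter_bits
    return 100.0 * (delta * 8) / (interval_s * capacity_bps)


def crossed_thresholds(sample, thresholds):
    """Return the sorted names of metrics whose value exceeds its threshold."""
    return sorted(name for name, limit in thresholds.items()
                  if sample.get(name, 0) > limit)


# Hypothetical usage: a 10 Gb/s link polled every 300 seconds.
pct = utilization_percent(0, 37_500_000_000, 300, 10_000_000_000)
alarms = crossed_thresholds(
    {"utilization_pct": pct, "input_errors": 25},
    {"utilization_pct": 90.0, "input_errors": 10},
)
```

A NOC would presumably tune the thresholds per link; the point here is only that alarms are derived from the same published statistics that the visualization tools read.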

Technical Requirements: Latency
Continuously measure end-to-end delay
What
  Manage multiple sparse meshes of continuous tests and store results in an MA
  Publish results via a standardized web service interface
  Provide a tool to visualize the data
  Provide tools to automatically analyze data and generate NOC alarms
Why
  Measure & document actual availability
  Provide time references for when problems occurred and when they were fixed
  Detect & assist in diagnosing common causes of performance degradation
    Packet loss
      Congestion related
      Non-congestion related
    Queuing & jitter caused by congestion
    Routing issues: changes, asymmetry, flapping, etc.
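The kind of automatic analysis asked for above can be sketched as follows. This is an illustrative example under stated assumptions, not the method used by Hades or OWAMP: each test session is assumed to be a list of one-way delays in milliseconds with `None` marking a lost probe, and a shift in the minimum delay is used as a rough hint of a route change. All names and the tolerance value are hypothetical.

```python
from statistics import median


def summarize_delays(delays_ms):
    """Summarize one test session; None marks a lost probe."""
    if not delays_ms:
        raise ValueError("no probes in session")
    received = [d for d in delays_ms if d is not None]
    lost = len(delays_ms) - len(received)
    summary = {"sent": len(delays_ms),
               "loss_pct": 100.0 * lost / len(delays_ms)}
    if received:
        summary["min_ms"] = min(received)
        summary["median_ms"] = median(received)
        # Delay above the session minimum is a rough indicator of queuing.
        summary["spread_ms"] = max(received) - min(received)
    return summary


def suggests_route_change(baseline_min_ms, session_min_ms, tolerance_ms=1.0):
    """Heuristic: a shift in minimum delay hints that the path changed."""
    return abs(session_min_ms - baseline_min_ms) > tolerance_ms


session = summarize_delays([10.0, 12.0, None, 11.0])
```

Separating loss, delay spread, and minimum-delay shifts in this way loosely mirrors the three causes listed on the slide: packet loss, queuing caused by congestion, and routing issues.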

Technical Requirements: Bandwidth
Make regular scheduled bandwidth measurements across paths of interest
What
  Manage multiple regularly scheduled sparse meshes of tests and store results in an MA
  Publish results via a standardized web service interface
  Provide a tool to visualize the data
  Provide tools to automatically analyze data and generate NOC alarms
Why
  Detect performance problems
  Identify when problems appeared
  Document performance delivered

Technical Requirements: Topology
Measure & publish topology of primary and backup paths
What
  Publish logical topology via a web services interface
  Provide tools to visualize the data over time
Why
  Set user expectations
  Facilitate network problem diagnosis
  Allow correlating logical topology to measurements of the physical topology
  Understand …

Implementation Details: Circuit Status
Monitor up/down status of cross domain circuits
Tools
  E2Emp or SQLMA
  E2Emon
Configuration
  Each domain publishes the status of its portions of cross domain circuits.
  E2Ecu monitors all LHC circuits?
  Any NOC can run E2Emon to monitor the subset of circuits that it is responsible for.
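The configuration above implies that an end-to-end circuit status has to be derived from the segment statuses published by each domain. The sketch below shows one plausible aggregation rule; it is an assumption made for illustration and is not taken from the E2Emon implementation. The status values, circuit names, and function names are all hypothetical.

```python
from enum import Enum


class Status(Enum):
    UP = "up"
    DOWN = "down"
    UNKNOWN = "unknown"


def end_to_end_status(segment_statuses):
    """Combine per-domain segment statuses into one circuit status.

    Assumed rule: any DOWN segment makes the circuit DOWN; otherwise any
    UNKNOWN segment (or no data at all) makes it UNKNOWN; only all-UP is UP.
    """
    statuses = list(segment_statuses)
    if Status.DOWN in statuses:
        return Status.DOWN
    if not statuses or Status.UNKNOWN in statuses:
        return Status.UNKNOWN
    return Status.UP


def state_changes(previous, current):
    """Return {circuit: (old, new)} for circuits whose status changed."""
    return {
        name: (previous.get(name, Status.UNKNOWN), status)
        for name, status in current.items()
        if previous.get(name, Status.UNKNOWN) != status
    }


# Hypothetical circuit made of three domain segments.
now = {"circuit-A": end_to_end_status([Status.UP, Status.DOWN, Status.UP])}
changes = state_changes({"circuit-A": Status.UP}, now)
```

A NOC alarm would then be raised for each entry in `changes`, which matches the requirement to alarm on state transitions rather than on every poll.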

Implementation Details: Interface Statistics
Monitor Link/Circuit Capacity, Utilization & Errors
Tools
  RRDma, PS-SNMP MA
Configuration
  Each domain sets up a Measurement Archive publishing statistics about their network interfaces supporting LHC
    Capacity
    Utilization
    Input Errors
    Output Drops

Implementation Details: Latency
Continuously measure end-to-end delay
Tools
  Hades
  OWAMP/AMI
  Pinger
Configuration
  Each domain deploys 1 or more Measurement Points inside their LHC center
    Hades and/or OWAMP
  Deploy 1 Scheduler & MA for each cluster or community
    One for LHCOPN
    One for each Tier 1 that wants to measure their customers
  Deploy a Pinger MA in any community where all of the customers are not able or willing to maintain stable Hades/OWAMP MPs.

Implementation Details: Bandwidth
Make regular scheduled bandwidth measurements across paths of interest
Tools
  BWCTL & BWCTL MP
  AMI Scheduler & MA
Configuration
  Deploy 1 GE connected MP inside each LHC center
  Deploy 1 Scheduler & MA per cluster of MPs
    One for the LHCOPN
    One per Tier 1 that wants to measure their Tier 2 service
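To make the idea of a sparse, regularly scheduled mesh concrete, here is a minimal sketch. It is not based on the AMI scheduler or BWCTL interfaces; the site names, slot length, and function names are hypothetical. It assumes one scheduler per cluster and gives every test pair its own time slot so that bandwidth tests in the same cluster never overlap.

```python
from itertools import combinations


def star_mesh(hub, spokes):
    """Test pairs between a hub and each spoke, e.g. a Tier 0 and its Tier 1s."""
    return [(hub, spoke) for spoke in spokes]


def full_mesh(sites):
    """Every unordered pair of sites, shown for comparison with the sparse mesh."""
    return list(combinations(sites, 2))


def schedule(pairs, start_s, slot_s):
    """Give each pair its own slot so tests in one cluster never overlap."""
    return [(start_s + i * slot_s, src, dst)
            for i, (src, dst) in enumerate(pairs)]


# Hypothetical cluster: one Tier 0 and three Tier 1 measurement points.
tier1s = ["T1-A", "T1-B", "T1-C"]
sparse = star_mesh("T0", tier1s)
plan = schedule(sparse, start_s=0, slot_s=600)
```

With three Tier 1 sites the star mesh needs 3 tests per cycle while a full mesh over the same four sites needs 6, and the gap grows quadratically with the number of sites, which is one reason to prefer sparse meshes for intrusive bandwidth tests.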

Implementation Details: Topology
Measure & publish topology of primary and backup paths
Tools
  Still under development
    CNIS
    Internet2 Topology Service
This is not a significant concern for the LHCOPN as long as it continues to be a well defined static topology fully described with the E2Emon tools.
This is an issue when considering Tier 2 traffic, which will stress the R&E networking infrastructure!

Implementation Details: Other Issues
Additional perfSONAR services
  Lookup Service
  AAA
Priorities
Schedule
Implementation Structure
  Software
  Service

perfSONAR Solutions: Current Status
(Need to update with dates from Jeff's email.)
Columns: Attribute, Functionality, perfSONAR Tool(s), Date
Circuit Up/Down
  Measure & Archive: E2E_MP & SQLma, Deployed
  Visualize: E2Emon
  Alarm
Link Utilization, Errors & Capacity
  RRDMA Utilization & Capacity RRDMA Input Errors & Output Drops PS-SNMPMA Done ?? Beta Aug 1, Package Sep 1 perfSONARUI, Visual Traceroute ?
Round Trip Delay (ICMP) & Traceroute
  PingerMA Ping MP Aug 15? perfSONARUI Plugin?
One way Delay Tests between MPs
  Schedule On-demand AMI MA & Scheduler Hades OWAMP MP Beta Sep 15, Package Oct 1 October Archive AMI MA Oct 1 PerfSONARUI CNM (If same as HADES MA) I2 CGI (Done in Aug, packaged in Oct) Done ?? Beta Aug, Package Oct Being worked on in Internet2. Generate a plan in December 07, implement 08
Bandwidth Tests between MPs
  Schedule & Measure BWCTL BWCTL_MP (DFN one) AMI scheduler Done AMI_MA DFN MA? PerfSONAR UI Plugin Web CGI scripts Fall? Look at it Spring 08