Measurement and Fault-Finding Using MAGGIE and PIPES
Presented at the HENP SIG, Internet2 Members Meeting, Indianapolis, October
Paola Grosso (SLAC) and Warren Matthews (GA Tech)

Background
Many tools are available to help diagnose network problems.
Many networks have a network monitoring project, often with a web page showing graphs.

Toward a Monitoring Infrastructure
Grid tools and data-intensive science require a cross-domain monitoring infrastructure with standardized publishing.
–Resource Brokers
–DoE science community
Monitoring Infrastructure
–IEPM-BW and MAGGIE (SLAC and DoE)
–AMI and PIPES (Internet2)

IEPM-BW
[Figure: map of monitored routes. Labels: SLAC, Stanford, CENIC, Abilene, SNV, HSTN, SEA, ATL, CLV, IPLS, NY, Rice, UTDallas, NCSA, UMich, I2, SOX, UFL, APAN, RIKEN, INFN-Roma, INFN-Milan, CESnet, Geant]
(Only routes from SLAC crossing Abilene are shown)

AMI
[Figure: network map. Node labels: STTL, SNVA, LOSA, NYCM, WASH, ATLA, CHIN, IPLS, KSCY, DNVR, HSTN]

Mysterious low throughput peak. Human intervention required.

[Graph annotation: the graph contains 99.31% of the data; the tail (not shown) contains 0.69% of the data.]
By using other measurement projects, the location of the problem can be narrowed down.
Measurements indicate a very tight distribution of delays across Abilene.

Publishing
It is infeasible to look at (possibly hundreds of) graphs.
Publish the data in a standard way.
W3C Web Service, GGF OGSI Grid Service
–Currently XML-RPC and SOAP servers

Demo
Client:
    #!/usr/local/bin/perl
    use SOAP::Lite;

    # The WSDL URL ('...') and the last two arguments were not preserved in this transcript.
    my $web_service = SOAP::Lite
        -> service('...')
        -> NetworkMeasurement('V4', 'SNVA', 'IPLS', ' ', ' ');

    # Print the minimum, average and maximum from the returned summary data.
    print $web_service->{resultSet}->{result}->{summaryData}->{minimum}, "\n";
    print $web_service->{resultSet}->{result}->{summaryData}->{average}, "\n";
    print $web_service->{resultSet}->{result}->{summaryData}->{maximum}, "\n";
(SOAP::Lite extracts the data from the XML)
Response: [XML response; content not preserved]
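
The Perl client above reads minimum, average and maximum out of a nested resultSet / result / summaryData structure. As a rough illustration of what that extraction involves, the Python sketch below pulls the same three fields from an XML fragment. The element names come from the Perl accessors; the sample document, its values, the missing SOAP envelope and namespaces, and the function name `summary_from_xml` are all assumptions made for illustration, not the service's actual response format.

```python
import xml.etree.ElementTree as ET

# Hypothetical response body: element names mirror the Perl hash keys,
# the numeric values are placeholders, not real measurements.
SAMPLE_RESPONSE = """
<resultSet>
  <result>
    <summaryData>
      <minimum>1.0</minimum>
      <average>2.0</average>
      <maximum>3.0</maximum>
    </summaryData>
  </result>
</resultSet>
"""


def summary_from_xml(xml_text):
    """Return the minimum, average and maximum fields as floats."""
    root = ET.fromstring(xml_text)  # root element is <resultSet>
    summary = root.find("./result/summaryData")
    if summary is None:
        raise ValueError("no summaryData element in response")
    return {
        tag: float(summary.findtext(tag))
        for tag in ("minimum", "average", "maximum")
    }


if __name__ == "__main__":
    for name, value in summary_from_xml(SAMPLE_RESPONSE).items():
        print(name, value)
```

A real SOAP response would be wrapped in an envelope and would probably use namespaces, so the path expression would need adjusting.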

Visualization
Numerous tools are being developed to assist with visualizing measurements.
–Monalisa, Advisor, Internet Detective, Anemos
Some will leverage third-party measurements via web services.

Fault Finding
Tools will (or should) provide some automated fault-finding as a starting point for human intervention.
Use techniques such as Tony McGregor's plateau concept for AMP and the RIPE-TT rolling average to detect unusual behaviour.
Write to log.
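
A minimal sketch of the rolling-average idea, in Python. This is a generic rolling-mean check, not McGregor's plateau algorithm and not the RIPE-TT implementation. The function name `find_unusual`, the window size, the 3-sigma rule and the 5% tolerance floor are all assumptions chosen for illustration.

```python
from collections import deque
from statistics import mean, pstdev


def find_unusual(samples, window=5, n_sigma=3.0, min_rel_tol=0.05):
    """Return indices of samples that stray too far from the rolling mean.

    Each sample is compared with the mean of the previous `window`
    accepted samples. Flagged samples are kept out of the baseline so
    that one outlier does not distort later comparisons.
    """
    history = deque(maxlen=window)
    flagged = []
    for index, value in enumerate(samples):
        if len(history) == window:
            baseline = mean(history)
            spread = pstdev(history)
            # The relative floor avoids flagging tiny wobbles when the
            # recent history is almost perfectly flat.
            tolerance = max(n_sigma * spread, min_rel_tol * abs(baseline))
            if abs(value - baseline) > tolerance:
                flagged.append(index)
                continue
        history.append(value)
    return flagged


if __name__ == "__main__":
    # Hypothetical throughput readings with one sharp drop at index 6.
    readings = [100, 101, 99, 100, 102, 100, 40, 101, 99]
    print(find_unusual(readings))
```

Because flagged samples never enter the baseline, a sustained change in level keeps being reported until someone intervenes, which fits the "starting point for human intervention" role described above.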

Trouble Detection
$ tail maggie.log
09/29/ :58:47 (1:14) gnt Alarm (AThresh=38.33)
09/29/ :25:45 (1:16) gnt Concern (CThresh=87.08)
09/29/ :55:21 (1:17) gnt Within boundaries
Columns: Date and Time, Bin, Node, Throughput (iperf), Status
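
The three status values in the log suggest a two-threshold classification. The Python sketch below shows one way such a status could be derived. It assumes that throughput below the alarm threshold yields "Alarm", throughput below the concern threshold yields "Concern", and anything else is "Within boundaries". The slides do not state this rule; the function name `classify_throughput` and the sample throughput values are hypothetical.

```python
def classify_throughput(throughput, alarm_thresh, concern_thresh):
    """Map a throughput reading to a status string like those in maggie.log.

    Assumed rule (not confirmed by the slides): lower throughput is worse,
    and alarm_thresh <= concern_thresh.
    """
    if alarm_thresh > concern_thresh:
        raise ValueError("alarm threshold must not exceed concern threshold")
    if throughput < alarm_thresh:
        return f"Alarm (AThresh={alarm_thresh:.2f})"
    if throughput < concern_thresh:
        return f"Concern (CThresh={concern_thresh:.2f})"
    return "Within boundaries"


if __name__ == "__main__":
    # Thresholds are taken from the log excerpt; the readings are made up.
    for reading in (20.0, 50.0, 120.0):
        print(reading, classify_throughput(reading, 38.33, 87.08))
```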

NetRAT
In addition to quantitative measurements from ping, iperf, etc., NetRAT will provide qualitative classification of nodes, hops and routes.
Cross-referencing data will identify hotspots (hop.performance = "poor").
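
One way to picture the cross-referencing step is the Python sketch below, which reports hops rated "poor" on several routes. It does not reflect NetRAT's actual data model. The `Hop` class, the `find_hotspots` function, the `min_routes` cutoff, the rating labels other than "poor", and the route and hop names are all hypothetical.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Hop:
    name: str
    performance: str  # qualitative label; only "poor" appears in the slides


def find_hotspots(routes, min_routes=2):
    """Return sorted names of hops rated "poor" on at least `min_routes` routes.

    `routes` maps a route name to the list of Hop observations along it.
    """
    poor_counts = Counter()
    for hops in routes.values():
        # Count each hop at most once per route.
        for name in {hop.name for hop in hops if hop.performance == "poor"}:
            poor_counts[name] += 1
    return sorted(name for name, count in poor_counts.items() if count >= min_routes)


if __name__ == "__main__":
    # Hypothetical observations: hop-b is poor on two of three routes.
    observed = {
        "route-1": [Hop("hop-a", "good"), Hop("hop-b", "poor")],
        "route-2": [Hop("hop-b", "poor"), Hop("hop-c", "good")],
        "route-3": [Hop("hop-c", "poor"), Hop("hop-d", "good")],
    }
    print(find_hotspots(observed))
```

Requiring a poor rating on more than one route is a design choice: a hop that looks bad from a single vantage point may reflect a problem elsewhere on that route.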

Summary
Sites, networks and Grids must interoperate to meet the needs of modern science and demanding applications.
A monitoring infrastructure consisting of measurements from many projects, published in standard ways, is required.

Links
IEPM Home Page
Internet2 e2epi and PIPES home page
OWAMP data
GGF NMWG
Monalisa, Advisor, Internet Detective, Anemos
AMP Event Detection