1 SLAC IEPM PingER and BW monitoring & tools PingER Presented by Les Cottrell, SLAC At LBNL, Jan 21, 2003 www.slac.stanford.edu/grp/scs/net/talk03/lbl-jan04.ppt.

Slides:



Advertisements
Similar presentations
Ronn Ritke Tony McGregor NLANR/MNA (UCSD/SDSC) Funded by the National Science Foundation/CISE/SCI cooperative agreement no. ANI
Advertisements

Active Measurement Project on KREONET & APAN Manhee Lee Supercomputing Center Korea Institute of Science & Technology Information.
1 Traceanal: a tool for analyzing and representing traceroutes Les Cottrell, Connie Logg, Ruchi Gupta, Jiri Navratil SLAC, for the E2Epi BOF, Columbus.
PingER Management1 Error Reporting Model for Ping End-to-End Reporting (PingER Management)
1 Correlating Internet Performance & Route Changes to Assist in Trouble- shooting from an End-user Perspective Les Cottrell, Connie Logg, Jiri Navratil.
1 SLAC Internet Measurement Data Les Cottrell, Jerrod Williams, Connie Logg, Paola Grosso SLAC, for the ISMA Workshop, SDSC June,
1 Quantifying the Digital Divide from Within and Without Les Cottrell, SLAC Internet2 Members Meeting SIG on Hard to Reach Network Places, Washington,
1 Evaluation of Techniques to Detect Significant Performance Problems using End-to-end Active Network Measurements Les Cottrell, SLAC 2006 IEEE/IFIP Network.
MAGGIE NIIT- SLAC On Going Projects Measurement & Analysis of Global Grid & Internet End to end performance.
1 Network Monitoring for SCIC Les Cottrell, SLAC For ICFA meeting September, 2005 Initially funded by DoE Field Work proposal. Currently partially funded.
1 PingER: Methodology, Uses & Results Les Cottrell SLAC, Warren Matthews GATech Extending the Reach of Advanced Networking: Special International Workshop.
Internet Bandwidth Measurement Techniques Muhammad Ali Dec 17 th 2005.
1 ICFA/SCIC Network Monitoring Prepared by Les Cottrell, SLAC, for ICFA
Network Monitoring grid network performance measurement, simulation & analysis Presented by Warren Matthews at the Performance.
Network Performance Measurement Atlas Tier 2 Meeting at BNL December Joe Metzger
Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.
1 Monitoring Internet connectivity of Research and Educational Institutions Les Cottrell – SLAC/Stanford University Prepared for the workshop on “Developing.
What we have learned from developing and running ABwE Jiri Navratil, Les R.Cottrell (SLAC)
PingER: Research Opportunities and Trends R. Les Cottrell, SLAC University of Malaya.
ICFA/SCIC Monitoring WG Les Cottrell – SLAC representing the ICFA/SCIC Monitoring WG Prepared for the ICFA-SCIC, phone meeting, Jan 15, 2003
Quantifying the Digital Divide: A scientific overview of the connectivity of South Asian and African Countries Les Cottrell SLAC, Aziz Rehmatullah NIIT,
POSTECH DP&NM Lab. Internet Traffic Monitoring and Analysis: Methods and Applications (1) 4. Active Monitoring Techniques.
1 Quantifying the Digital Divide from Within and Without Les Cottrell, SLAC International ICFA Workshop on HEP Networking, Grid and Digital Divide Issues.
LAN and WAN Monitoring at SLAC Connie Logg September 21, 2005.
1 Using Netflow data for forecasting Les Cottrell SLAC and Fawad Nazir NIIT, Presented at the CHEP06 Meeting, Mumbai India, February
Comparison of Public End-to-End Bandwidth Estimation tools on High-Speed Links Alok Shriram, Margaret Murray, Young Hyun, Nevil Brownlee, Andre Broido,
Comparison of Public End-to-End Bandwidth Estimation tools on High- Speed Links Alok Shriram, Margaret Murray, Young Hyun, Nevil Brownlee, Andre Broido,
DoE SciDAC high-performance networking research project: INCITE INCITE.rice.edu 2004 Technical Challenges INCITE R. Baraniuk, E. Knightly, R. Nowak, R.
DataGrid Wide Area Network Monitoring Infrastructure (DWMI) Connie Logg February 13-17, 2005.
Measurement & Analysis of Global Grid & Internet End to end performance (MAGGIE) Network Performance Measurement.
1 ESnet/HENP Active Internet End-to-end Performance & ESnet/University performance Les Cottrell – SLAC Presented at the ESSC meeting Albuquerque, August.
1 Overview of IEPM-BW - Bandwidth Testing of Bulk Data Transfer Tools Connie Logg & Les Cottrell – SLAC/Stanford University Presented at the Internet 2.
IEPM-BW: Bandwidth Change Detection and Traceroute Analysis and Visualization Connie Logg, Joint Techs Workshop February 4-9, 2006.
1 The PingER Project: Measuring the Digital Divide PingER Presented by Les Cottrell, SLAC At the SIS Show Palexpo/Geneva December 2003.
1 Network Monitoring for SCIC Les Cottrell, SLAC ICFA/SCIC meeting August 24, aug05.ppt Initially.
1 Measurements of Internet performance for NIIT, Pakistan Jan – Feb 2004 PingER From Les Cottrell, SLAC For presentation by Prof. Arshad Ali, NIIT.
1 Network Measurement Summary ESCC, Feb Joe Metzger ESnet Engineering Group Lawrence Berkeley National Laboratory.
Multiplicative Wavelet Traffic Model and pathChirp: Efficient Available Bandwidth Estimation Vinay Ribeiro.
1 Internet End-to-end Monitoring Project - Overview Les Cottrell – SLAC/Stanford University Partially funded by DOE/MICS Field Work Proposal on Internet.
1 Quantifying the Digital Divide: focus Africa Prepared by Les Cottrell, SLAC for the NSF IRNC meeting, March 11,
IEPM. Warren Matthews (SLAC) Presented at the ESCC Meeting Miami, FL, February 2003.
Measurement in the Internet Measurement in the Internet Paul Barford University of Wisconsin - Madison Spring, 2001.
1 IEPM/PingER Project Les Cottrell, SLAC DoE 2004 PI Network Research Meeting, FNAL Sep ‘04
1 Internet Performance Monitoring for the HENP Community Les Cottrell & Warren Matthews – SLAC Presented.
DoE SciDAC high-performance networking research project: INCITE INCITE.rice.edu 2004 Technical Challenges INCITE R. Baraniuk, E. Knightly, R. Nowak, R.
Internet Connectivity and Performance for the HEP Community. Presented at HEPNT-HEPiX, October 6, 1999 by Warren Matthews Funded by DOE/MICS Internet End-to-end.
Digital Divide and PingER Prepared by Les Cottrell for the ICFA meeting, August 15, aug03.html Partially.
Navigating PingER Les Cottrell – SLAC Presented at the Optimization Technologies for Low-Bandwidth Networks, ICTP Workshop,
1 PingER performance to Bangladesh Prepared by Les Cottrell, SLAC for Prof. Hilda Cerdeira May 27, 2004 Partially funded by DOE/MICS Field Work Proposal.
Igniting Internet Innovation
1 WAN Monitoring Prepared by Les Cottrell, SLAC, for the Joint Engineering Taskforce Roadmap Workshop JLab April 13-15,
1 IEPM / PingER project & PPDG Les Cottrell – SLAC Presented at the NGI workshop, Berkeley, 7/21/99 Partially funded by DOE/MICS Field Work Proposal on.
1 Quantifying the Digital Divide Prepared by Les Cottrell, SLAC for the Internet2/World Bank meeting, Feb 7,
1 PingER6 Preliminary PingER Monitoring Results from the 6Bone/6REN. Warren Matthews Les Cottrell.
Pinger and IEPM-BW activity at FNAL By Frank Nagy FTP/CCF Computing Division Fermilab.
Toward a Measurement Infrastructure. Warren Matthews (SLAC) Presented at the e2e Workshop Miami, FL, February 2003.
Milestones/Dates/Status Impact and Connections
Using Netflow data for forecasting
Prepared by Les Cottrell & Hadrien Bullot, SLAC & EPFL, for the
The PingER Project: Measuring the Digital Divide
Wide Area Networking at SLAC, Feb ‘03
Digital Divide and PingER
Experiences in Traceroute and Available Bandwidth Change Analysis
PingER: An Effort to Quantify the Digital Divide
Experiences in Traceroute and Available Bandwidth Change Analysis
SLAC monitoring Web Services
Correlating Internet Performance & Route Changes to Assist in Trouble-shooting from an End-user Perspective Les Cottrell, Connie Logg, Jiri Navratil SLAC.
MAGGIE NIIT- SLAC On Going Projects
Quantifying the Global Digital Divide
The PingER Project: Measuring the Digital Divide
Presentation transcript:

1 SLAC IEPM PingER and BW monitoring & tools PingER Presented by Les Cottrell, SLAC At LBNL, Jan 21,

2 History of the PingER Project Early 1990’s: SLAC begins pinging nodes around the world to evaluate the quality of Internet connectivity between SLAC and other HEP Institutions. Around 1996: The PingER project was funded making it the first Internet end-to-end monitoring tool available to the HEP community. Today: Believed to be the most extensive Internet end-to-end performance monitoring tool in the world PingER

3 PingER Today Today, the PingER Project includes 35 Monitoring- hosts in 12 countries. They are monitoring Remote- hosts in 80 countries. Over 55 remote sites. PingER THESE COUNTRIES COVER 75% OF THE WORLD POPULATION AND 99% OF THE INTERNET CONNECTED POPULATION!!! Just added Pakistan! Colored by region Colored countries have remote PingER hosts

4 PingER Architecture There are three types of hosts Remote-hosts: hosts being monitored Monitoring-hosts: Make ping measurements to remote hosts Archive/Analysis- hosts: gather data from Monitoring-sites, analyze & make reports Archive Monitoring REMOTE PingER

5 Methodology Every 30 mins send 11*100Byte followed by 10*1000Byte pings from monitor to remote host Low impact: –By default < 100bits/s per monitor-remote host pair –Can reduce to ~ 10bits/s –No need for co-scheduling of monitors Uses ubiquitous ping –No software to install at any of over 500 remote hosts –Very important for hosts in developing countries By centrally gathering the data, archiving, analyzing and reporting, the requirements for monitoring hosts are minimal (typically 1-2 days to install etc.)

6 Performance is improving Developed world improving factor of 10 in 4-5 years S.E. Europe, Russia, catching up India & Africa worse off & falling behind Developing world 3-10 years behind Worldwide performance Many institutes in developing world have less performance than a household in N. America or Europe

7 Current State – Aug ‘03 (throughput Mbps) Within region performance better –E.g. Ca|EDU|GOV-NA, Hu-SE Eu, Eu-Eu, Jp-E Asia, Au-Au, Ru- Ru|Baltics Africa, Caucasus, Central & S. Asia all bad Bad < 200kbits/s < DSL Poor > 200 < 500kbits/s Acceptable > 500kbits/s, < 1000kbits/s Good > 1000kbits/s Monitoring Country Remote regions

8 Network Readiness Index vs Throughput NRI from Center for International Development, Harvard U. NRI correlates reasonably well with Network Readiness Internet for all focus A&R focus NRI Top 14 Finland 5.92 US 5.79 Singapore 5.74 Sweden 5.58 Iceland 5.51 Canada 5.44 UK5.35 Denmark 5.33 Taiwan5.31 Germany5.29 Netherlands 5.28 Israel 5.22 Switzerland 5.18 Korea 5.10

9 Typical uses Troubleshooting  Discerning if a reported problem is network related  Identify the time a problem started  Provide quantitative analysis for Network specialists  Identifying step functions, periodic network behavior, and recognize problems affecting multiple sites.  Setting expectations (e.g. SLAs)  Identifying need to upgrade  Providing quantitative information to Policy makers & Funding agencies  Seeing the effects of upgrades PingER

10 Pakistan performance Karachi NIIT/Rawalpindi Islamabad Lahore Loss % RTT ms Routes: ESnet (hops 3-6) - SNV SINGTEL (7-12) - Karachi Pakistan Telecom Karachi Rawalpindi Routes: ESnet (hops 3-6) - SNV SINGTEL (7-12) - Karachi Pakistan Telecom Karachi Lahore Routes: ESnet (hops 3-8) - DC ATT (9-21) - Karachi

11 NIIT performance from U.S. (SLAC) Ping RTT & Loss Nb. Heavy losses during congested day-times Bandwidth measurements using packet pair dispersion & TCP ABW (pkt-pair dispersion):Average To NIIT: ~350Kbits/s From NIIT: 365 Kbits/s Iperf/TCP: Average: To NIIT: ~320Kbits/s From: NIIT 40Kbits/s Can also derive throughput (assuming standard TCP) from RTT & loss using: BW~1.2*S(1460B)/(RTT*sqrt(loss)  ~ 260Kbits/s Nominal path bottleneck capacity 1Mbits/s Preliminary results, started measurements end Dec Avg daily: loss~1-2%, RTT~320ms

12 In Summary PingER provides ongoing support for monitoring and maintaining the quality of Internet connectivity for the world wide scientific community. Information is available publicly on the web www-iepm.slac.stanford.edu/cgi-wrap/pingtable.pl PingER also quantifies the extent of the “Digital Divide” and provides information to policy makers and funding agencies. PingER

13 IEPM-BW Need something for high-performance links –10pings/30 mins, i.e. min=0.21% in day, or 0.007% in month (10 -8 BER) – today’s better links exceed this –Ping losses may not be like TCP losses Need for Grid, HENP applications and high- performance network connections –Set expectations, planning –Trouble-shooting, improving performance –Application steering –Testing new transports (e.g. FAST, HS-TCP, RBUDP, UDT), applications, monitoring tools (e.g. QIperf, packet-pair techniques …) in production environments –Compare with passive measurements, advertised capacities

14 Methodology Monitoring host every 90 minutes (+- randomization) cycles through collaborating hosts at several remote sites: –Sends active probes in-turn for: bbftp, gridtcp, bbcp, iperf1, iperf, (qiperf), ping, abwe … Also measures traceroutes at 15min intervals Uses ssh for code deployment, management and to start & stop servers remotely –Deploy server code for iperf, ABwE, bbftp, GridFTP & various utilities 10 monitoring sites, each with between 2 and 40 remote hosts monitored –Main users SLAC (BaBar) & FNAL (D0, CDF, CMS) Data archived, analyzed, displayed at monitoring hosts

15 Deployment 100Mbits/s host Monitor 125 measured bw Aug ‘02 HENPGbits/s host Net research

16 Visualization Time series: –Overplot multiple metrics –+ route changes –Zoom, history –Choose individual metrics Histograms Scatter plots Access to data

17 Traceroutes Analyse for unique routes, assign route #s Display route # at start, then “.” if no change If significant change, the display route # in red Hour of day Links to: –History –Reverse –Single host –Raw data –Summary for ing –Available BW –Topology Host Several routes changes simultaneously Hour of day Demo

18 Topology Select times & hosts & direction on table Mouse_over to see router name Click on router to see sub path below Colored by deduced AS Click on end nodes to see names of all hops

19 Performance (ABwE) Current bottleneck capacity (Usually limited by 100FE) Cross-traffic Available bandwidth Iperf (90m) Mbits/s 24 hours Requires ABwE server (mirror) at remote sites Gets performance for both directions Low impact 40 * 1000 byte packets Less than a second for result Can do “real-time” performance monitoring

20

21 Heavy load (xtraffic) appeared It shows new DBC on the path Normal situation ABwE/Iperf match: Hadrian to UFL IPLS shows traffic Mbits/s CALREN shows sending traffic 600 Mbits/s

22 Abing CLI Demo abing command line tool –Since low impact (40*1000 packets) can run like ping

23 Navigation MonALISA

24 Prediction, trouble shooting For ABwE: Working on auto detection of long term (many minutes) step changes in bandwidth –Developed simple algorithm and qualifying effectiveness –Looking at NLANR (McGregor/H-W Braun plateau change detector) /Abstracts/talk_03.htmlhttp:// /Abstracts/talk_03.html –Look at correlation between performance & route changes & RTT –For significant changes, gather: RTT, routes (fwd/rev, before & after if changed), NDT info, bandwidth info (fwd & rev) –Fold in diurnal changes –Generate real-time alerts with filtering Predictions Diurnal demo

25 Program API Not realistic to look at thousands of graphs Programs also want to look at data. E.g. –Data placement for replica servers –Analysis, visualization (e.g. MonALISA) –Trouble shooting Correlate data from many sources when suspect/spot problem Publish the data in standard way W3C Web Service, GGF OGSI Grid Service –Currently XMLRPC and SOAP servers –Using Network Measurement Working Group schema ( NM-WG.xsd)NM-WG.xsd Demo mainly proof of principal, to access IEPM single & multistream iperf, multistream GridFTP & bbftp, ABwE and PingER data –Not pushing deployment and use until schema more solid

26 IEPM SOAP Client #!/usr/local/bin/perl -w use SOAP::Lite; my $node = "node1.cacr.caltech.edu"; my $timePeriod=" T143000"; my $measurement = SOAP::Lite ->service(' ->GetBandwidthAchievableTCP("$node", "$timePeriod"); print “Host=“.$measurement->{'subject'}->{'destination'}->{'name'},"\n"; print $measurement->{'subject'}->{'destination'}->{'address'}->{'IP'},"\n"; print “Times:\n”.$measurement->{'path.bandwidth.achievable.TCP'} ->{'timestamp'}->{'startTime'},"\n"; print “Values:\n”.$measurement->{'path.bandwidth.achievable.TCP'} ->{'achievableThroughputResult'}->{'value'},"\n"; Host=node1.cacr.caltech.edu Not-disclosed Times: Values: Results For more see: Demo:

27 For More Information PingER: –www-iepm.slac.stanford.edu/pinger /www-iepm.slac.stanford.edu/pinger / ICFA/SCIC Network Monitoring report, Jan04 – The PingER Project: Active Internet Performance Monitoring for the HENP Community, IEEE Communications Magazine on Network Traffic Measurements and Experiments.The PingER Project: Active Internet Performance Monitoring for the HENP Community IEPM-BW – ABWE: www-iepm.slac.stanford.edu/bw/abwe/abwe-cf-iperf.html and moat.nlanr.net/PAM2003/PAM2003papers/3781.pdf www-iepm.slac.stanford.edu/bw/abwe/abwe-cf-iperf.html moat.nlanr.net/PAM2003/PAM2003papers/3781.pdf PingER