1 Internet End-to-end Monitoring Project - Overview Les Cottrell – SLAC/Stanford University Partially funded by DOE/MICS Field Work Proposal on Internet.

Slides:



Advertisements
Similar presentations
Web100 at SLAC Presented at the Web100 Workshop, Boulder, CO, August 2002.
Advertisements

1 QoS on Best-effort IP Networks Les Cottrell – SLAC Presented at the Joint SG13/SG16 Workshop Panel.
QoS Solutions Confidential 2010 NetQuality Analyzer and QPerf.
1 High Performance Active End-to- end Network Monitoring Les Cottrell, Connie Logg, Warren Matthews, Jiri Navratil, Ajay Tirumala – SLAC Prepared for the.
1 Traceanal: a tool for analyzing and representing traceroutes Les Cottrell, Connie Logg, Ruchi Gupta, Jiri Navratil SLAC, for the E2Epi BOF, Columbus.
1 Internet End-to-end Monitoring Project at SLAC Les Cottrell, Connie Logg, Jerrod Williams, Gary Buhrmaster Site visit to SLAC by DoE program managers.
PingER Management1 Error Reporting Model for Ping End-to-End Reporting (PingER Management)
1 SLAC Internet Measurement Data Les Cottrell, Jerrod Williams, Connie Logg, Paola Grosso SLAC, for the ISMA Workshop, SDSC June,
1 Quantifying the Digital Divide from Within and Without Les Cottrell, SLAC Internet2 Members Meeting SIG on Hard to Reach Network Places, Washington,
MAGGIE NIIT- SLAC On Going Projects Measurement & Analysis of Global Grid & Internet End to end performance.
1 Network Monitoring for SCIC Les Cottrell, SLAC For ICFA meeting September, 2005 Initially funded by DoE Field Work proposal. Currently partially funded.
1 PingER: Methodology, Uses & Results Les Cottrell SLAC, Warren Matthews GATech Extending the Reach of Advanced Networking: Special International Workshop.
Internet Bandwidth Measurement Techniques Muhammad Ali Dec 17 th 2005.
Federation – Smokeping, PingER Integration Asma Shamshad Bit-4A.
1 ICFA/SCIC Network Monitoring Prepared by Les Cottrell, SLAC, for ICFA
Network Monitoring grid network performance measurement, simulation & analysis Presented by Warren Matthews at the Performance.
Reading Report 14 Yin Chen 14 Apr 2004 Reference: Internet Service Performance: Data Analysis and Visualization, Cross-Industry Working Team, July, 2000.
KEK Network Qi Fazhi KEK SW L2/L3 Switch for outside connections Central L2/L3 Switch A Netscreen Firewall Super Sinet Router 10GbE 2 x GbE IDS.
1 Monitoring Internet connectivity of Research and Educational Institutions Les Cottrell – SLAC/Stanford University Prepared for the workshop on “Developing.
PingER: Research Opportunities and Trends R. Les Cottrell, SLAC University of Malaya.
ICFA/SCIC Monitoring WG Les Cottrell – SLAC representing the ICFA/SCIC Monitoring WG Prepared for the ICFA-SCIC, phone meeting, Jan 15, 2003
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
POSTECH DP&NM Lab. Internet Traffic Monitoring and Analysis: Methods and Applications (1) 4. Active Monitoring Techniques.
LAN and WAN Monitoring at SLAC Connie Logg September 21, 2005.
GridNM Network Monitoring Architecture (and a bit about my phd) Yee-Ting Li, 1 st Year UCL, 17 th June 2002.
workshop eugene, oregon What is network management? System & Service monitoring  Reachability, availability Resource measurement/monitoring.
1 Using Netflow data for forecasting Les Cottrell SLAC and Fawad Nazir NIIT, Presented at the CHEP06 Meeting, Mumbai India, February
The ProactiveWatch Monitoring Service. Are These Problems For You? Your business gets disrupted when your IT environment has issues Your employee and.
Measurement & Analysis of Global Grid & Internet End to end performance (MAGGIE) Network Performance Measurement.
1 ESnet/HENP Active Internet End-to-end Performance & ESnet/University performance Les Cottrell – SLAC Presented at the ESSC meeting Albuquerque, August.
1 Overview of IEPM-BW - Bandwidth Testing of Bulk Data Transfer Tools Connie Logg & Les Cottrell – SLAC/Stanford University Presented at the Internet 2.
1 The PingER Project: Measuring the Digital Divide PingER Presented by Les Cottrell, SLAC At the SIS Show Palexpo/Geneva December 2003.
1 Quantifying the Digital Divide Les Cottrell – SLAC Prepared for the ICFA-SCIC video meeting, May 2003
1 Network Monitoring for SCIC Les Cottrell, SLAC ICFA/SCIC meeting August 24, aug05.ppt Initially.
1 Measurements of Internet performance for NIIT, Pakistan Jan – Feb 2004 PingER From Les Cottrell, SLAC For presentation by Prof. Arshad Ali, NIIT.
1 Measuring The Digital Divide Prepared by: Les Cottrell SLAC, Shahryar Khan NIIT/SLAC, Jared Greeno SLAC, Qasim Lone NIIT/SLAC Presentation to Princess.
1 Quantifying the Digital Divide: focus Africa Prepared by Les Cottrell, SLAC for the NSF IRNC meeting, March 11,
1 SLAC IEPM PingER and BW monitoring & tools PingER Presented by Les Cottrell, SLAC At LBNL, Jan 21,
IEPM. Warren Matthews (SLAC) Presented at the ESCC Meeting Miami, FL, February 2003.
1 IEPM/PingER Project Les Cottrell, SLAC DoE 2004 PI Network Research Meeting, FNAL Sep ‘04
1 Internet Performance Monitoring for the HENP Community Les Cottrell & Warren Matthews – SLAC Presented.
ICFA Standing Committee on Interregional Connectivity (SCIC) ICFA Standing Committee on Interregional Connectivity (SCIC) Harvey B. Newman Harvey B. Newman.
Internet Connectivity and Performance for the HEP Community. Presented at HEPNT-HEPiX, October 6, 1999 by Warren Matthews Funded by DOE/MICS Internet End-to-end.
Grid Network Performance Monitoring for e-Science.
INDIANAUNIVERSITYINDIANAUNIVERSITY Status of FAST TCP and other TCP alternatives John Hicks TransPAC HPCC Engineer Indiana University APAN Meeting – Hawaii.
1 PingER performance to Bangladesh Prepared by Les Cottrell, SLAC for Prof. Hilda Cerdeira May 27, 2004 Partially funded by DOE/MICS Field Work Proposal.
1 WAN Monitoring Prepared by Les Cottrell, SLAC, for the Joint Engineering Taskforce Roadmap Workshop JLab April 13-15,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Mario Reale – GARR NetJobs: Network Monitoring Using Grid Jobs.
1 IEPM / PingER project & PPDG Les Cottrell – SLAC Presented at the NGI workshop, Berkeley, 7/21/99 Partially funded by DOE/MICS Field Work Proposal on.
1 Quantifying the Digital Divide Prepared by Les Cottrell, SLAC for the Internet2/World Bank meeting, Feb 7,
1 PingER6 Preliminary PingER Monitoring Results from the 6Bone/6REN. Warren Matthews Les Cottrell.
Pinger and IEPM-BW activity at FNAL By Frank Nagy FTP/CCF Computing Division Fermilab.
Toward a Measurement Infrastructure. Warren Matthews (SLAC) Presented at the e2e Workshop Miami, FL, February 2003.
Paola Grosso SLAC October
Introduction to Networks
Milestones/Dates/Status Impact and Connections
High Speed File Replication
Warren Matthews and Les Cottrell (SLAC)
Using Netflow data for forecasting
Prepared by Les Cottrell & Hadrien Bullot, SLAC & EPFL, for the
The PingER Project: Measuring the Digital Divide
Wide Area Networking at SLAC, Feb ‘03
Digital Divide and PingER
High Performance Active End-to-end Network Monitoring
File Transfer Issues with TCP Acceleration with FileCatalyst
PingER: An Effort to Quantify the Digital Divide
IEPM. Warren Matthews (SLAC)
MAGGIE NIIT- SLAC On Going Projects
Quantifying the Global Digital Divide
The PingER Project: Measuring the Digital Divide
Presentation transcript:

1 Internet End-to-end Monitoring Project - Overview Les Cottrell – SLAC/Stanford University Partially funded by DOE/MICS Field Work Proposal on Internet End-to-end Performance Monitoring (IEPM), also supported by IUPAP

2 Why Driven by users: HENP physicists with worldwide collaborations with hundreds to thousands of scientists in many tens to hundreds of institutes –Set expectations for planning Where to locate clusters, data, how to replicate data –Trouble-shooting –SLAs First project starting 1995 was PingER

3 Measurement Architecture Uses existing ubiquitous Internet ping infrastructure, no tools to install Hierarchical vs. full mesh, each monitoring site chooses remote sites Lightweight – –low network impact (100bits/s/path) –no special machines –trivial to add monitored sites Runs continuously since 1995 WWW Archive Monitoring Remote HEPNRC Reports & Data Cache Monitoring SLAC Ping HTTP Archive 1 monitor host remote host pair

4 PingER Measurement Methodology Measurement host admin choose remote hosts of interest –sends 21 pings each 30 mins to each chosen remote host –Records RTT, loss, jitter, unreachable, out of order … –Records data in local cache Archive host gathers data from measurements hosts regularly (at least daily) –Archives, analyzes and generates reports from data –Make reports and data publicly available via the web Requirements: –Remote host: need a host accessible to pings, and a contact in case host does not respond (almost no effort) –Monitoring host: a low end host to make measurements, file space for cache, admin to install toolkit, choose remote hosts, build configuration file, respond to archivers in case unable to get data & keep it running (<<10% FTE) –Archive site: probably about 20% of an FTE

5 PingER deployment Measurements from –34 monitors in 14 countries –Over 600 remote hosts –Over 77 countries –Over 3300 monitor-remote site pairs –Measurements go back to Jan-95 –Reports on RTT, loss, reachability, jitter, reorders, duplicates … Countries monitored –Contain 78% of world population –99% of online users of Internet –Mainly A&R sites Monitoring Sites Remote Sites Recently added: –BD, CO, GH, GU, JO, MO, NG, PK Mature, low impact, excellent view of world Internet, e.g. for quantifying Digital Divide

6 Results

7 History - Round Trip Time (RTT) Improving by % year More direct paths Replacing satellites with land lines –Satellite >~550ms Faster lines & network equipment Lower limit speed of light in fiber Typical lower limit today ~ distance/(0.3 * (0.6 * c)) Speed of light in fiber

8 History - Loss Loss more critical than RTT Losses cause timeouts of typically seconds 40-50% improve/yr Best networks below 0.1% Russia, SE Europe, China several years behind

9 Loss to world from US Using year 2000, fraction of world’s population/country from

10 Losses: World by region, Jan ‘02 5%=bad Russia, S America bad Balkans, M East, Africa, S Asia, Caucasus poor

11 History - Throughput quality improvements from US TCP BW < MSS/(RTT*sqrt(loss)) (1) (1) Macroscopic Behavior of the TCP Congestion Avoidance Algorithm, Matthis, Semke, Mahdavi, Ott, Computer Communication Review 27(3), July % annual improvement ~ factor 10/4yr ~Factor 100 improvement in 8 years

12 Summary - results Internet A&R connectivity performance is improving –RTT 10-20%/yr, loss 50%/yr, throughput 80%/yr –Reduced use of satellites, mainly use for new hard to get to areas (e.g. S. Russian Republics) China, S.E. Europe, Russia rate of change keeps up but several years behind India, S. America performance is where N. America & W. Europe were 4 – 5 years ago Improvements need constant investments to understand & improve

13 More Information IEPM/PingER home site: –www-iepm.slac.stanford.edu/www-iepm.slac.stanford.edu/ African connectivity –

14 IEPM-BW = PingER NG Driven by data replication needs of HENP, PPDG, DataGrid –No longer ship plane/truck loads of data Latency is poor Now ship all data by network (TB/day today, double each year) –Complements PingER, but for high performance nets Build an infrastructure to make E2E network (e.g. iperf, packet pair dispersion) & application (FTP) measurements for high-performance A&R networking Started SC2001

15 Tasks Develop/deploy a simple, robust ssh based E2E app & net measurement and management infrastructure for making regular measurements –Major step is setting up collaborations, getting trust, accounts/passwords –Can use dedicated or shared hosts, located at borders or with real applications –COTS hardware & OS (Linux or Solaris) simplifies application integration Integrate base set of measurement tools (ping, iperf, bbcp …), provide simple (cron) scheduling Develop data extraction, reduction, analysis, reporting, simple forecasting & archiving

16 Purposes Compare & validate tools –With one another (pipechar vs pathload vs iperf or bbcp vs bbftp vs GridFTP vs Tsunami) –With passive measurements, –With web100 Evaluate TCP stacks (FAST, Sylvain, HS TCP, Frank Kelley …) –Trouble shooting –Set expectations, planning –Understand requirements for high performance performance issues, in network, OS, cpu, disk/file system etc. –Provide public access to results for people & applications

17 Deployment SLAC monitoring about 40 remote hosts 10 other monitoring sites running code –APAN, FNAL, NIKHEF, INFN SLAC running production –U Mich, I2, Manchester, UCL, GA Tech evaluating If everything goes right it takes about minutes to install a new monitoring site –Usually longer due to need to get web server, ssh keys, ports unblocked, disk space

18 Results Time series data, scatter plots, histograms CPU utilization required (MHz/Mbits/s) jumbo and standard, new stacks Forecasting Diurnal behavior characterization Disk throughput as function of OS, file system, caching Correlations with passive, web100

19 Next steps Rewrite (again) based on experiences –Improved ability to add new tools to measurement engine and integrate into extraction, analysis GridFTP, tsunami, UDPMon, pathload … –Improved robustness, error diagnosis Need improved scheduling Want to look at other security mechanisms