UT-BATTELLE U.S. Department of Energy Oak Ridge National Laboratory Net100: developing network-aware operating systems New (9/01) DOE-funded (Office of.


UT-BATTELLE U.S. Department of Energy Oak Ridge National Laboratory
Net100: developing network-aware operating systems
New (9/01) DOE-funded (Office of Science) project ($1M/yr, 3 yrs)
Principal investigators
  – Matt Mathis, PSC
  – Brian Tierney, LBNL
  – Tom Dunigan, ORNL
Objective:
  – measure and understand end-to-end network and application performance
  – tune network applications (grid and bulk transfer)
Components
  – active network probes and passive sensors (leverage Web100)
  – network metrics database
  – tuning daemon (WAD) to tune network flows based on network metrics

Net100: applied Web100
Web100
  – Linux 2.4 kernel mods
  – 100+ TCP variables per flow
Net100
  – add Web100 to iperf/ttcp
  – monitoring/tuning daemon
Java applet bandwidth/client tester
  – fake WWW server provides HTML and applet
  – applet connects to bwserver
      – 3 sockets (control, bwin, bwout)
      – server reports Web100 variables to applet (window sizes, losses, RTT)

Net100 network measurement
Active measurement
  – Net100 probes at LBL, ORNL, NCAR, PSC, NERSC
  – scheduled set of path probes (iperf with Web100 mods, traceroute, pipechar)
  – local and centralized database (netlogger)
  – interface to other probers (NIMI, surveyor, Pinger, ?)
Passive measurement
  – Web100 daemon records TCP info on designated flows
      – Web100 data collected when flow terminates
      – Web100 TCP info: losses, timeouts, reordering, cwnd, ssthresh, RTT, …
      – use netlogger to report to central database
  – other passive sensors (SNMP data, LBL’s tcpdump monitor, ?)
Query tools
  – for dynamic application tuning
  – for network engineering and statistical studies

Net100: tuning
Work-around Daemon (WAD) Version 0
  – use network performance data to tune flows
  – tune unknowing sender/receiver
  – config file with “tuning info” ?
  – based on Web100/Linux 2.4
To be done
  – “applying” measurement info
  – adding more knobs to kernel
  – tune on non-Linux OS
Related work
  – Feng’s Dynamic Right Sizing
  – Linux 2.4 auto-tuning/caching
  – Mathis TCP buffer tuning

TCP losses
TCP is lossy by design
Changing: bandwidths
  – 9.6 Kb/s … 1.5 Mb/s … 45 … 100 … 1000 … ? Mb/s
Unchanging:
  – speed of light (RTT)
  – MTU (still 1500 bytes)
  – TCP congestion avoidance
Recovery after a loss can be very slow on today’s high delay/bandwidth links
  – recovery rate proportional to MSS/RTT^2
[Figure labels: instantaneous bandwidth; average bandwidth; early startup losses; linear recovery at 0.5 Mb/s!]
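
The MSS/RTT^2 relationship on this slide can be made concrete with a little arithmetic. The Python sketch below is an illustration only, not taken from the Net100 project. It assumes standard congestion avoidance (cwnd grows by about one MSS per RTT, with one ACK per segment and no delayed ACKs) and a single loss that halves cwnd. The function names, the 100 ms RTT, the 1 Gb/s target rate, and the MSS values (1460 bytes for a 1500-byte MTU, 8960 bytes for a 9000-byte jumbo frame) are all assumptions chosen for the example.

```python
def recovery_rate_bps_per_s(mss_bytes: float, rtt_s: float) -> float:
    """Growth of throughput during congestion avoidance, in (bits/s) per second.

    cwnd grows by roughly one MSS per RTT, and throughput is cwnd/RTT,
    so throughput grows by MSS/RTT every RTT, i.e. MSS/RTT^2 per second.
    """
    return mss_bytes * 8 / rtt_s ** 2


def recovery_time_s(target_bps: float, mss_bytes: float, rtt_s: float) -> float:
    """Seconds to climb from half the target rate back to the full target
    rate after a single loss has halved cwnd (assumed loss model)."""
    return (target_bps / 2) / recovery_rate_bps_per_s(mss_bytes, rtt_s)


if __name__ == "__main__":
    rtt = 0.100      # assumed round-trip time: 100 ms
    target = 1e9     # assumed target rate: 1 Gb/s
    for mss in (1460, 8960):  # assumed MSS for 1500-byte and 9000-byte MTUs
        rate = recovery_rate_bps_per_s(mss, rtt)
        secs = recovery_time_s(target, mss, rtt)
        print(f"MSS {mss} B: +{rate / 1e6:.3f} Mb/s per second, "
              f"{secs:.1f} s to recover")
```

Because the rate scales with MSS and with 1/RTT^2, a larger MSS shortens recovery proportionally, while doubling the RTT makes it four times slower.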

Net100 tuning
Avoid losses
  – use “optimal” buffer sizes determined from network measurements
  – ECN-capable routers/hosts
  – reduce bursts (TCP Vegas, ?)
Faster recovery
  – bigger MSS (jumbo frames)
  – speculative recovery (D-SACK)
  – modified congestion avoidance?
Autotune (WAD variables)
  – buffer sizes
  – Dupthresh (reordering resilience)
  – Del ACK, Nagle
  – AIMD
  – Virtual MSS
  – initial window, ssthresh
Non-TCP solutions (rate-based, ?)
  (tests with TCP-over-UDP, atou, NERSC to ORNL)
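
One common way to derive an "optimal" buffer size from network measurements is the bandwidth-delay product (BDP): the number of bytes that must be in flight to keep the path full. The slide does not say how Net100 computes its buffer sizes, so the Python sketch below is an assumption about the general technique rather than the WAD's actual method. The function names, the 4096-byte rounding granularity, and the sample path figures are hypothetical.

```python
import math


def bdp_bytes(bandwidth_bps: float, rtt_s: float) -> float:
    """Bandwidth-delay product in bytes: data in flight on a full path."""
    return bandwidth_bps * rtt_s / 8


def suggested_buffer_bytes(bandwidth_bps: float, rtt_s: float,
                           granularity: int = 4096) -> int:
    """Round the BDP up to a multiple of `granularity` bytes.

    The rounding step is an illustrative choice, not a Net100 rule.
    """
    raw = bdp_bytes(bandwidth_bps, rtt_s)
    return math.ceil(raw / granularity) * granularity


if __name__ == "__main__":
    # Assumed measurements for a hypothetical path: 100 Mb/s, 80 ms RTT.
    bandwidth = 100e6
    rtt = 0.080
    print(f"BDP: {bdp_bytes(bandwidth, rtt):.0f} bytes")
    print(f"suggested buffer: {suggested_buffer_bytes(bandwidth, rtt)} bytes")
```

An application could pass such a value to `setsockopt` with `SO_SNDBUF` and `SO_RCVBUF`. The slides describe the WAD as tuning flows on behalf of an unknowing sender or receiver, so this application-level route is a different mechanism from the one Net100 proposes.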

Net100 status
Completed
  – network probes at ORNL, PSC, NCAR, LBL, NERSC
  – preliminary schema for network data
  – initial Web100 sensor daemon and tuning daemon
In progress
  – TCP tuning extensions to Linux/Web100 kernel
  – analysis of TCP tuning options
  – deriving tuning info from network measurements
Future
  – interactions with other network measurement sources
  – multipath/parallel path selection/tuning