DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 1 High Throughput: Progress and Current Results Lots of people helped: MB-NG team at UCL MB-NG.

Slides:



Advertisements
Similar presentations
GridPP Meeting Edinburgh 4-5 Feb 04 R. Hughes-Jones Manchester 1 High Performance Networking for ALL Members of GridPP are in many Network collaborations.
Advertisements

E-Science All Hands Meeting 1-4 Sep 03 R. Hughes-Jones Manchester 1 High Bandwidth High Throughput in the MB-NG & DataTAG Projects Richard Hughes-Jones,
TCP transfers over high latency/bandwidth network & Grid TCP Sylvain Ravot
MB - NG MB-NG Meeting UCL 1 Nov 02 R. Hughes-Jones Manchester 1 Status of Task 2 Traffic Generation and Measurement.
MB - NG MB-NG Technical Meeting 03 May 02 R. Hughes-Jones Manchester 1 Task2 Traffic Generation and Measurement Definitions Pass-1.
MB - NG MB-NG Jan 2002 R. Hughes-Jones Manchester Some Edge (Bbone) Router requirements Connect to the test systems in the IP domain. Accept marked packets.
DataTAG CERN Oct 2002 R. Hughes-Jones Manchester Initial Performance Measurements With DataTAG PCs Gigabit Ethernet NICs (Work in progress Oct 02)
August 10, Circuit TCP (CTCP) Helali Bhuiyan
CALICE, Mar 2007, R. Hughes-Jones Manchester 1 Protocols Working with 10 Gigabit Ethernet Richard Hughes-Jones The University of Manchester
JIVE VLBI Network Meeting 15 Jan 2003 R. Hughes-Jones Manchester The EVN-NREN Project Richard Hughes-Jones The University of Manchester.
GNEW2004 CERN March 2004 R. Hughes-Jones Manchester 1 End-2-End Network Monitoring What do we do ? What do we use it for? Richard Hughes-Jones Many people.
Meeting on ATLAS Remote Farms. Copenhagen 11 May 2004 R. Hughes-Jones Manchester Networking for ATLAS Remote Farms Richard Hughes-Jones The University.
GridPP meeting Feb 03 R. Hughes-Jones Manchester WP7 Networking Richard Hughes-Jones.
CAIDA Bandwidth Estimation Meeting San Diego June 2002 R. Hughes-Jones Manchester 1 EU DataGrid - Network Monitoring Richard Hughes-Jones, University of.
CdL was here DataTAG/WP7 Amsterdam June 2002 R. Hughes-Jones Manchester 1 EU DataGrid - Network Monitoring Richard Hughes-Jones, University of Manchester.
Slide: 1 Richard Hughes-Jones PFLDnet2005 Lyon Feb 05 R. Hughes-Jones Manchester 1 Investigating the interaction between high-performance network and disk.
Optical Networking Status of Discussion in the UK Richard Hughes-Jones The University of Manchester Particle Physics Network Coordination Group TERENA.
DataGrid WP7 Meeting CERN April 2002 R. Hughes-Jones Manchester Some Measurements on the SuperJANET 4 Production Network (UK Work in progress)
JIVE VLBI Network Meeting 28 Jan 2004 R. Hughes-Jones Manchester Brief Report on Tests Related to the e-VLBI Project Richard Hughes-Jones The University.
CALICE UCL, 20 Feb 2006, R. Hughes-Jones Manchester 1 10 Gigabit Ethernet Test Lab PCI-X Motherboards Related work & Initial tests Richard Hughes-Jones.
EDG WP7 Networking Demonstration uDemonstration sending HEP data CERN to SARA Multiple streams of TCP packets Tuned TCP parameters: ifconfig eth0 txqueuelen.
PFLDNet Argonne Feb 2004 R. Hughes-Jones Manchester 1 UDP Performance and PCI-X Activity of the Intel 10 Gigabit Ethernet Adapter on: HP rx2600 Dual Itanium.
© 2006 Open Grid Forum Interactions Between Networks, Protocols & Applications HPCN-RG Richard Hughes-Jones OGF20, Manchester, May 2007,
Slide: 1 Richard Hughes-Jones CHEP2004 Interlaken Sep 04 R. Hughes-Jones Manchester 1 Bringing High-Performance Networking to HEP users Richard Hughes-Jones.
ESLEA Bedfont Lakes Dec 04 Richard Hughes-Jones Network Measurement & Characterisation and the Challenge of SuperComputing SC200x.
CdL was here DataTAG CERN Sep 2002 R. Hughes-Jones Manchester 1 European Topology: NRNs & Geant SuperJANET4 CERN UvA Manc SURFnet RAL.
MB - NG MB-NG Meeting UCL 17 Jan 02 R. Hughes-Jones Manchester 1 Discussion of Methodology for MPLS QoS & High Performance High throughput Investigations.
02 nd April 03Networkshop Managed Bandwidth Next Generation F. Saka UCL NETSYS (NETwork SYStems centre of excellence)
GGF4 Toronto Feb 2002 R. Hughes-Jones Manchester Initial Performance Measurements Gigabit Ethernet NICs 64 bit PCI Motherboards (Work in progress Mar 02)
13th-14th July 2004 University College London End-user systems: NICs, MotherBoards, TCP Stacks & Applications Richard Hughes-Jones.
KEK Network Qi Fazhi KEK SW L2/L3 Switch for outside connections Central L2/L3 Switch A Netscreen Firewall Super Sinet Router 10GbE 2 x GbE IDS.
Slide: 1 Richard Hughes-Jones e-VLBI Network Meeting 28 Jan 2005 R. Hughes-Jones Manchester 1 TCP/IP Overview & Performance Richard Hughes-Jones The University.
UDT as an Alternative Transport Protocol for GridFTP Raj Kettimuthu Argonne National Laboratory The University of Chicago.
Slide: 1 Richard Hughes-Jones Mini-Symposium on Optical Data Networking, August 2005, R. Hughes-Jones Manchester 1 Using TCP/IP on High Bandwidth Long.
Experience with Loss-Based Congestion Controlled TCP Stacks Yee-Ting Li University College London.
High TCP performance over wide area networks Arlington, VA May 8, 2002 Sylvain Ravot CalTech HENP Working Group.
Transport Layer3-1 Announcements r Collect homework r New homework: m Ch3#3,4,7,10,12,16,18-20,25,26,31,33,37 m Due Wed Sep 24 r Reminder: m Project #1.
Increasing Web Server Throughput with Network Interface Data Caching October 9, 2002 Hyong-youb Kim, Vijay S. Pai, and Scott Rixner Rice Computer Architecture.
MB - NG MB-NG Meeting Dec 2001 R. Hughes-Jones Manchester MB – NG SuperJANET4 Development Network SuperJANET4 Production Network Leeds RAL / UKERNA RAL.
ESLEA Bits&Bytes, Manchester, 7-8 Dec 2006, R. Hughes-Jones Manchester 1 Protocols DCCP and dccpmon. Richard Hughes-Jones The University of Manchester.
Robin HJ & R. Hughes-Jones Manchester Sep 1999 Gigabit Ethernet in Ptolemy Status Sep 99 : Stars that exist : –GigEChipTranslate between GigEPacket and.
Experiences Tuning Cluster Hosts 1GigE and 10GbE Paul Hyder Cooperative Institute for Research in Environmental Sciences, CU Boulder Cooperative Institute.
CAIDA Bandwidth Estimation Meeting San Diego June 2002 R. Hughes-Jones Manchester UDPmon and TCPstream Tools to understand Network Performance Richard.
Transport Layer3-1 TCP throughput r What’s the average throughout of TCP as a function of window size and RTT? m Ignore slow start r Let W be the window.
PFLDNet Workshop February 2003 R. Hughes-Jones Manchester Some Performance Measurements Gigabit Ethernet NICs & Server Quality Motherboards Richard Hughes-Jones.
Collaboration Meeting, 4 Jul 2006, R. Hughes-Jones Manchester 1 Collaborations in Networking and Protocols HEP and Radio Astronomy Richard Hughes-Jones.
TERENA Networking Conference, Zagreb, Croatia, 21 May 2003 High-Performance Data Transport for Grid Applications T. Kelly, University of Cambridge, UK.
Transport Layer 3-1 Chapter 3 Transport Layer Computer Networking: A Top Down Approach 6 th edition Jim Kurose, Keith Ross Addison-Wesley March
DataGrid WP7 Meeting Amsterdam Nov 01 R. Hughes-Jones Manchester 1 UDPmon Measuring Throughput with UDP  Send a burst of UDP frames spaced at regular.
MB - NG MB-NG Meeting UCL 17 Jan 02 R. Hughes-Jones Manchester 1 Discussion of Methodology for MPLS QoS & High Performance High throughput Investigations.
GNEW2004 CERN March 2004 R. Hughes-Jones Manchester 1 Lessons Learned in Grid Networking or How do we get end-2-end performance to Real Users ? Richard.
18/09/2002Presentation to Spirent1 Presentation to Spirent 18/09/2002.
TCP transfers over high latency/bandwidth networks & Grid DT Measurements session PFLDnet February 3- 4, 2003 CERN, Geneva, Switzerland Sylvain Ravot
Midterm Review Chapter 1: Introduction Chapter 2: Application Layer
Networks ∙ Services ∙ People Richard-Hughes Jones eduPERT Training Session, Porto A Hands-On Session udpmon for Network Troubleshooting 18/06/2015.
Final EU Review - 24/03/2004 DataTAG is a project funded by the European Commission under contract IST Richard Hughes-Jones The University of.
ESLEA VLBI Bits&Bytes Workshop, 31 Aug 2006, R. Hughes-Jones Manchester 1 vlbi_udp Throughput Performance and Stability. Richard Hughes-Jones The University.
Connect. Communicate. Collaborate 4 Gigabit Onsala - Jodrell Lightpath for e-VLBI Richard Hughes-Jones.
DataGrid WP7 Meeting Jan 2002 R. Hughes-Jones Manchester Initial Performance Measurements Gigabit Ethernet NICs 64 bit PCI Motherboards (Work in progress)
MB MPLS MPLS Technical Meeting Sep 2001 R. Hughes-Jones Manchester SuperJANET Development Network Testbed – Cisco GSR SuperJANET4 C-PoP – Cisco GSR.
iperf a gnu tool for IP networks
Approaches towards congestion control
R. Hughes-Jones Manchester
Networking between China and Europe
Transport Protocols over Circuits/VCs
Mar 2001 ATLAS T2UK Meeting R. Hughes-Jones
MB-NG Review High Performance Network Demonstration 21 April 2004
MB – NG SuperJANET4 Development Network
TCP flow and congestion control
Achieving reliable high performance in LFNs (long-fat networks)
Presentation transcript:

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 1 High Throughput: Progress and Current Results Lots of people helped: MB-NG team at UCL MB-NG team at Manchester Andrew McNab MB - NG

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 2 What’s Involved in Performance  End Hosts  NIC: operation & PCI design – use of modern PCI commands  Chipset & PCIX  CPU, memory & memory-bus  OS kernel: drivers & TCP UDP IP stack  Disk sub-system: disks, controllers & interconnects  Routers  Blades: operation  Switching / routing fabric: bus, crossbar, non-blocking operation  Policies  The Network (s)  Framing  Bandwidth  Load & Congestion

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 3 PC MB – NG SuperJANET4 Development Network (22 Mar 02) UCL OSM- 1OC48- POS-SS MCC OSM- 1OC48- POS-SS MAN Gigabit Ethernet 2.5 Gbit POS Access 2.5 Gbit POS core MPLS Admin. Domains SJ4 Dev PC 3ware RAID0 PC 3ware RAID0

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 4 Initial Measurements  UDP throughput very variable  TCP throughput very variable  Lon  Man < Man  Lon ?  Investigations:  Packet loss several %  Errors on interface (ifconfig)  Overruns

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 5 txqueuelen-vs-sendstalls  TCP throughput very variable  Lon  Man < Man  Lon ?  throughput very variable

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 6 Interrupt Coalescence Investigations  TCP mem-mem lon2-man1  Tx 64 Tx-abs 64  Rx 0 Rx-abs 128  Mbit/s Mbit/s  Tx 64 Tx-abs 64  Rx 20 Rx-abs 128  Mbit/s Mbit/s  Tx 64 Tx-abs 64  Rx 80 Rx-abs 128  Mbit/s +- 1 Mbit/s

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 7 24 Hours HighSpeed TCP mem-mem  TCP mem-mem lon2-man1  Tx 64 Tx-abs 64  Rx 64 Rx-abs 128  Mbit/s Mbit/s

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 8 Raid0 Performance (1)  Maxdor 3.5 Series DiamondMax PLus Gb ATA/133  Write Slight increase with number of disks  Read  3 Disks OK  Write 100 MBytes/s  Read 130 MBytes/s

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 9 Raid0 Performance (2)  Maxdor 3.5 Series DiamondMax PLus Gb ATA/133  No difference for Write  Larger Stripe lower the performance  Write 100 MBytes/s  Read 120 MBytes/s

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 10 Gridftp Throughput HighSpeedTCP  Int Coal  Txqueuelen 2000  TCP buffer 1 M byte (rtt*BW = 750kbytes)  Interface throughput  Acks received  Data moved  520 Mbit/s  Same for B2B tests  So its not that simple!

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 11 Gridftp Throughput + Web100  Throughput Mbit/s:  See alternate 600/800 Mbit and zero  Cwnd smooth  No dup Ack / send stall / timeouts

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 12 Gridftp Throughput + Web100  Throughput Mbit/s vs Recv Window Size  Zero throughput independent of Recv Window Size  Bytes sent  Bytes received  Waits 0.4s at start !

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 13 http data transfers HighSpeed TCP  Apachie web server out of the box!  prototype client - curl http library  1Mbyte TCP buffers  2Gbyte file  Throughput 72 MBytes/s  Cwnd - some variation  No dup Ack / send stall / timeouts

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 14 http data transfers (2)  Limited by:  Sender  Receive window size

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 15  Int Coal  Txqueuelen 500 (no stall in rtt)  TCP buffer 750k byte (rtt*BW = 750k bytes)  1 stream every 60 s:  man1  lon2  man2  lon2  man3  lon2  Sample ever 10ms  Send rates:  940 Mbit/s  450 Mbit/s  300 Mbit/s TCP sharing man1-lon2 mem-mem web100

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 16  Int Coal  Txqueuelen 500 (no stall in rtt)  TCP buffer 750k byte (rtt*BW = 750k bytes)  1 stream every 60 s:  man1  lon2  man2  lon2  man3  lon2  Sample ever 10ms  Time in send limit:  Sender  Cwind  Recv wind TCP sharing man1-lon2 mem-mem web100

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 17  1Stream:  No Dup ACKs  No SACKs  No Sendstalls  Why does Cwnd vary TCP sharing man1-lon2 the WHY?

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 18  2Streams:  Many Dup ACKs  Many SACKs   Why does Cwnd have large variations 2 TCP streams man1-lon2 - the WHY?

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 19  2Streams:  Dips in throughput due to Dup ACK   ~4 losses /sec  A bit regular ?  Cwnd decreases:  1 point 33%  Ramp starts at 62%  Slope 70Bytes/us 2 TCP streams man1-lon2 - the WHY? (2) 1 sec

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 20  3Streams:  Dips in throughput due to Dup ACK  3 TCP streams man1-lon2 - the WHY? 10 sec

DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 21 TCP sharing man1-lon2 - the WHY?  There is (a) correlation 