Connect. Communicate. Collaborate 4 Gigabit Onsala - Jodrell Lightpath for e-VLBI The iNetTest Unit Development of Real Time eVLBI at Jodrell Bank Observatory.

Slides:



Advertisements
Similar presentations
Progress With iBOBs at Jodrell Bits & Bytes Meeting, JBO, th Dec 2007 Jonathan Hargreaves Electronic Engineer, Jodrell Bank Observatory.
Advertisements

Performance Analysis of Daisy- Chained CPUs Based On Modeling Krzysztof Korcyl, Jagiellonian University, Krakow Radoslaw Trebacz Jagiellonian University,
Project Partners Project Collaborators The Council for the Central Laboratory of the Research Councils Funded by EPSRC GR/T04465/01
CALICE, Mar 2007, R. Hughes-Jones Manchester 1 Protocols Working with 10 Gigabit Ethernet Richard Hughes-Jones The University of Manchester
Implement a 2x2 MIMO OFDM-based channel measurement system (no data yet) at 2.4 GHz Perform baseband processing and digital up and down conversion on Nallatech.
JIVE VLBI Network Meeting 15 Jan 2003 R. Hughes-Jones Manchester The EVN-NREN Project Richard Hughes-Jones The University of Manchester.
Meeting on ATLAS Remote Farms. Copenhagen 11 May 2004 R. Hughes-Jones Manchester Networking for ATLAS Remote Farms Richard Hughes-Jones The University.
Slide: 1 Richard Hughes-Jones T2UK, October 06 R. Hughes-Jones Manchester 1 Update on Remote Real-Time Computing Farms For ATLAS Trigger DAQ. Richard Hughes-Jones.
CdL was here DataTAG/WP7 Amsterdam June 2002 R. Hughes-Jones Manchester 1 EU DataGrid - Network Monitoring Richard Hughes-Jones, University of Manchester.
IEEE Real Time 2007, Fermilab, 29 April – 4 May R. Hughes-Jones Manchester 1 Using FPGAs to Generate Gigabit Ethernet Data Transfers & The Network Performance.
DataGrid WP7 Meeting CERN April 2002 R. Hughes-Jones Manchester Some Measurements on the SuperJANET 4 Production Network (UK Work in progress)
JIVE VLBI Network Meeting 28 Jan 2004 R. Hughes-Jones Manchester Brief Report on Tests Related to the e-VLBI Project Richard Hughes-Jones The University.
EE 4272Spring, 2003 Chapter 14 LAN Systems Ethernet (CSMA/CD)  ALOHA  Slotted ALOHA  CSMA  CSMA/CD Token Ring /FDDI Fiber Channel  Fiber Channel Protocol.
DataTAG Meeting CERN 7-8 May 03 R. Hughes-Jones Manchester 1 High Throughput: Progress and Current Results Lots of people helped: MB-NG team at UCL MB-NG.
Introduction. 2 What Is SmartFlow? SmartFlow is the first application to test QoS and analyze the performance and behavior of the new breed of policy-based.
PFLDNet Argonne Feb 2004 R. Hughes-Jones Manchester 1 UDP Performance and PCI-X Activity of the Intel 10 Gigabit Ethernet Adapter on: HP rx2600 Dual Itanium.
© 2006 Open Grid Forum Interactions Between Networks, Protocols & Applications HPCN-RG Richard Hughes-Jones OGF20, Manchester, May 2007,
CdL was here DataTAG CERN Sep 2002 R. Hughes-Jones Manchester 1 European Topology: NRNs & Geant SuperJANET4 CERN UvA Manc SURFnet RAL.
DataLink Layer1 Ethernet Technologies: 10Base2 10: 10Mbps; 2: 200 meters (actual is 185m) max distance between any two nodes without repeaters thin coaxial.
GigE Knowledge. BODE, Company Profile Page: 2 Table of contents  GigE Benefits  Network Card and Jumbo Frames  Camera - IP address obtainment  Multi.
IP Network Basics. For Internal Use Only ▲ Internal Use Only ▲ Course Objectives Grasp the basic knowledge of network Understand network evolution history.
Document Number ETH West Diamond Avenue - Third Floor, Gaithersburg, MD Phone: (301) Fax: (301)
5 Feb 2002Alternative Ideas for the CALICE Backend System 1 Alternative Ideas for the CALICE Back-End System Matthew Warren and Gordon Crone University.
Straw electronics Straw Readout Board (SRB). Full SRB - IO Handling 16 covers – Input 16*2 links 400(320eff) Mbits/s Control – TTC – LEMO – VME Output.
Introduction1-1 Data Communications and Computer Networks Chapter 5 CS 3830 Lecture 27 Omar Meqdadi Department of Computer Science and Software Engineering.
Data Comm. & Networks Instructor: Ibrahim Tariq Lecture 3.
DLS Digital Controller Tony Dobbing Head of Power Supplies Group.
EVN-NREN Meeting, Zaandan, 31 Oct 2006, R. Hughes-Jones Manchester 1 FABRIC 4 Gigabit Work & VLBI-UDP Performance and Stability. Richard Hughes-Jones The.
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
High Bandwidth Data Acquisition and Network Streaming in VLBI Jan Wagner, Guifré Molera et al. TKK / Metsähovi Radio Observatory.
Nov 3, 2009 RN - 1 Jet Propulsion Laboratory California Institute of Technology Current Developments for VLBI Data Acquisition Equipment at JPL Robert.
Introduction VLBI_UDP is an application being developed to transfer VLBI data using the UDP protocol. Initially developed by Richard Hughes- Jones for.
1 Network Performance Optimisation and Load Balancing Wulf Thannhaeuser.
MB - NG MB-NG Meeting Dec 2001 R. Hughes-Jones Manchester MB – NG SuperJANET4 Development Network SuperJANET4 Production Network Leeds RAL / UKERNA RAL.
Lecture 4 Overview. Ethernet Data Link Layer protocol Ethernet (IEEE 802.3) is widely used Supported by a variety of physical layer implementations Multi-access.
CAIDA Bandwidth Estimation Meeting San Diego June 2002 R. Hughes-Jones Manchester UDPmon and TCPstream Tools to understand Network Performance Richard.
High bit rate tests between Manchester and JIVE Looking at data rates attainable with UDP along with packet loss and reordering statistics Simon Casey,
FPGA firmware of DC5 FEE. Outline List of issue Data loss issue Command error issue (DCM to FEM) Command lost issue (PC with USB connection to GANDALF)
VLBI_UDP An application for transferring VLBI data via UDP protocol Simon Casey e-VLBI meeting, Haystack 20 Sep 2006.
An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.
1 Level 1 Pre Processor and Interface L1PPI Guido Haefeli L1 Review 14. June 2002.
Networks ∙ Services ∙ People Richard-Hughes Jones eduPERT Training Session, Porto A Hands-On Session udpmon for Network Troubleshooting 18/06/2015.
Final EU Review - 24/03/2004 DataTAG is a project funded by the European Commission under contract IST Richard Hughes-Jones The University of.
1 eVLBI Developments at Jodrell Bank Observatory Ralph Spencer, Richard Hughes- Jones, Simon Casey, Paul Burgess, The University of Manchester.
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
3 Sep 2009SLM1 of 12 SLM performance and limitations based on HW tests.
Data delayed Real Time Data Transfer for Very Long Baseline Interferometry Simon Casey, Richard Hughes-Jones, Stephen Kershaw, Ralph Spencer, Matt Strong.
Building A Network: Cost Effective Resource Sharing
LHCb upgrade Workshop, Oxford, Xavier Gremaud (EPFL, Switzerland)
S.Anvar, V.Gautard, H.Le Provost, F.Louis, K.Menager, Y.Moudden, B.Vallage, E.Zonca, on behalf of the KM3NeT consortium 1 IRFU/SEDI-CEA Saclay F
Broadband Interfacing eMerlin-JIVE (1-10Gbps) Onsala-Jodrell Bank (
Connect communicate collaborate Performance Metrics & Basic Tools Robert Stoy, DFN EGI TF, Madrid September 2013.
UniBoard Progress Meeting, December 2009 Jonathan Hargreaves, JIVE EVN Correlator Design UniBoard Progress Meeting, December 2009 Contract no
FABRIC WP1.2 Broadband Data Path: Protocols and Processor Interface Bonn 20/09/07 Ralph Spencer The University of Manchester.
UniBoard Meeting, October 12-13th 2010 Jonathan Hargreaves, JIVE EVN Correlator Design UniBoard Meeting, October th 2010 Contract no
ROD Activities at Dresden Andreas Glatte, Andreas Meyer, Andy Kielburg-Jeka, Arno Straessner LAr Electronics Upgrade Meeting – LAr Week September 2009.
Connect communicate collaborate Supporting Pan-European advanced applications - GÉANT Project liaison and support Richard Hughes-Jones, Domenico Vicinanza,
PC-based L0TP Status Report “on behalf of the Ferrara L0TP Group” Ilaria Neri University of Ferrara and INFN - Italy Ferrara, September 02, 2014.
Connect. Communicate. Collaborate 4 Gigabit Onsala - Jodrell Lightpath for e-VLBI Richard Hughes-Jones.
Progress on EXPReS at JBO
JIVE UniBoard Correlator External Review
Overview of 4Gbps connectivity for Nordic-MERLIN fringe tests
Networking between China and Europe
Transport Protocols over Circuits/VCs
JIVE UniBoard Correlator (JUC) Firmware
John Conway Bonn, Sept 20th, 2007
MB – NG SuperJANET4 Development Network
Progress on EXPreS at JBO Ralph Spencer
Building A Network: Cost Effective Resource Sharing
New DCM, FEMDCM DCM jobs DCM upgrade path
Presentation transcript:

Connect. Communicate. Collaborate 4 Gigabit Onsala - Jodrell Lightpath for e-VLBI The iNetTest Unit Development of Real Time eVLBI at Jodrell Bank Observatory 7 th International eVLBI Workshop, Shanghai, th June 2008 Richard Hughes-Jones DANTE Jonathan Hargreaves JBO

Connect. Communicate. Collaborate The 4 Gig Path for EXPReS Onsala – Jodrell Multi: Domain Vendor Technology

Connect. Communicate. Collaborate First Test Path 4 Gig TDM/SDH Lighpath over GÉANT2+ 10 Gigabit Ethernet Lambda over NORDUnet

Connect. Communicate. Collaborate UDP Throughput Stoc-Lon Alcatel Metro Core Connect MCC Flow control OFF Kernel web100_pktd-plus Myricom 10G-PCIE-8A-R Fibre –rx-usecs=25 Coalescence ON MTU 9000 bytes Max throughput 4.05 Gbit/s Packet loss as expected Sending host, 3 CPUs idle For 90% in kernel mode Receiving host 3 CPUs idle For <20 µs packets, 1 CPU is ~30% in kernel mode

Connect. Communicate. Collaborate UDP Jitter Stoc-Lon Alcatel Metro Core Connect MCC Flow control OFF Kernel web100_pktd-plus Myricom 10G-PCIE-8A-R Fibre –rx-usecs=0 Coalescence OFF 8972 byte packets 16.5 μs spacing Peak in packet spacing at 16 μs as expected Other peaks at 19 and 51 µs

Connect. Communicate. Collaborate UDP Throughput Lon-Stoc Alcatel Metro Core Connect MCC Flow control ON Kernel web100_pktd Myricom 10G-PCIE-8A-R Fibre –rx-usecs=25 Coalescence ON MTU 9000 bytes Max throughput 4.05 Gbit/s No Packet loss – flow control Sending host, 3 CPUs idle For 60% in kernel mode Receiving host 3 CPUs idle For <20 µs packets, 1 CPU is ~30% in kernel mode

Connect. Communicate. Collaborate TCP reno Throughput Lon-Stoc Alcatel Metro Core Connect MCC Flow control ON 27 VC-4 RTT 31 ms Kernel web100_pktd-plus Myricom 10G-PCIE-8A-R Fibre –rx-usecs=25 Coalescence ON MTU 9000 bytes TCP buffer 32 MByte 2*BDP iperf Ave throughput 3.85 Gbit/s iperf Max throughput 4.02 Gbit/s 3 re-transmits Due to “other reductions”

Connect. Communicate. Collaborate TCP reno Throughput Stoc-Lon MCC NO Flow control 27 VC-4 RTT 31 ms Kernel web100_pktd-plus Myricom 10G-PCIE-8A-R Fibre –rx-usecs=25 Coalescence ON MTU 9000 bytes TCP buffer 32 MByte 2*BDP iperf Ave throughput 192 Mbit/s iperf Max throughput 465 Mbit/s TCP Cwnd to 0 Many re-transmits

Connect. Communicate. Collaborate Alcatel MCC Buffer size Classic Bottleneck 10 Gbit/s input 4 Gbit/s output Use udpmon to send a stream of spaced UDP packets Measure packet number of first lost frame as function of w packet spacing Slope gives buffer size ~57 kBytes

Connect. Communicate. Collaborate Use UDP to emulate TCP slowstart udpmon sends bursts of spaced packets: –32 packets –Jumbo 8000 bytes –back2back –4 ms between bursts Path Lon-Ams_FF-Prague-Paris- Lon Rtt 55.5 ms See 13 packets then loose 1 in 3 Confirm the TCP problem!

Connect. Communicate. Collaborate iNetTest: iBoB FPGA with 10 GE

Connect. Communicate. Collaborate iNetTest: Control & Operation 10/100 Ethernet 10Gb Port 0 iNetTest iBOB 1 Send and receive packets of selected length and spacing Count packets transmitted and received, calculate histograms of arrival times Control PC IP control of multiple iBOBs 10Gb Port 1 10Gb Port 0 10Gb Port 1 Ten Gb Network iNetTest iBOB n Expansion to more than two iBOBs iNetTest iBOB 2 Send and receive packets of selected length and spacing Count packets transmitted and received, calculate histograms of arrival times

Connect. Communicate. Collaborate iNetTest: Simulink design Jonathan Hargreaves JBO Network test parameter CSRs 10GE MAC core Inter-packet time histogram Transmit & receive event time log

Connect. Communicate. Collaborate iNetTest: Details iNetTest FPGA runs with a 200 MHz (5 ns) clocks Two iNetTest units can be controlled over ethernet IP from a PC iNetTest Ethernet IP address and MAC preset in code – last digit selected by jumper Ten GE IP address, Port and Gateway can be user configured Automatic or manual ARPing. ARP tables for each port can be examined iNetTest responds to and generates PING UDP packets can be sent by firmware between two iNetTests or between iNetTest and PCs User selects the number of packets to send, their length, and the time between them iBOBs count received and transmitted packets, and store up to 2048 time stamped events per port iBOBs generate histograms of the arrival time distribution for received packets

Connect. Communicate. Collaborate UDP Throughput vs. Packet Spacing PC Kernel web100_pktd-plus Myricom 10G-PCIE-8A-R CX4 –rx-usecs=25 Coalescence ON MTU 9000 bytes UDP Packets Max throughput 9.4 Gbit/s iBoB Packet 8234 Data: Header: MHz clock Max rate 6.6 Gbit/s See 6.44Gbit/s 200 MHz clock - Linespeed

Connect. Communicate. Collaborate PC to iNetTest Packet Jitter Kernel web100_pktd-plus Myricom 10G-PCIE-8A-R Fibre Packet separation 100 µs iBoB bins 2320 ns Width ± 34 µs Similar to PC-PC but no extra peaks Structure caused by PC The PC – PC plots look like:

Connect. Communicate. Collaborate iNetTest – iBOB to iBOB Results 10 GE 3m CX4 cable link 8192 byte packets 1 million transmitted at line rate between two iNetTest units. 5 ns bins B2B 10 ns FWHM With Switch: –Fujutsu XG ns –HP 6400cl 35 ns long tail Number of Packets Time between packet arrivals (ns)

Connect. Communicate. Collaborate Back to the 4 Gig Path

Connect. Communicate. Collaborate Stockholm – Manc. with TSS 4 Gig TDM/SDH Lighpath over GÉANT2+ 4 Gigabit Ethernet Path over TSS NORDUnet 4 Gig on Ethernet Optical Transmission JANET & NNW

Connect. Communicate. Collaborate UDP Throughput Stoc-Man with TSS Alcatel TSS – 10GE – MCC Kernel web100_pktd Myricom 10G-PCIE-8A-R Fibre –rx-usecs=75 Coalescence ON MTU 9000 bytes Max throughput 3.78 Gbit/s (Previously was 4.05 Gbit/s) Packet loss 10 % at Gbit/s for 8192 Bytes Man-Stoc: No packet Gbit/s

Connect. Communicate. Collaborate Packet loss Stoc-Man with TSS Alcatel TSS – 10GE – MCC Send 1M packets from PC in Stockholm 8192 bytes All cross the TSS cloud All enter Alcatel MCC as 10 GE frames Packet loss in MCC Card ~10% at 4096 Mbit/s Classic “bottleneck” performance But WHY???

Connect. Communicate. Collaborate UDP Jitter Man-Stoc with TSS Alcatel MCC – 10GE – TSS – PC Kernel web100_pktd Myricom 10G-PCIE-8A-R Fibre –rx-usecs=0 Coalescence OFF 8192 byte packets Alcatel MCC is TDM 28 VC-4 Can only send packets at 16 µs Not at 10 Gbit/s Peak packet spacing ~ 6 µs ie packets arriving at 10 Gbit/s Suggests packet bunching in the network Sto-Man no TSS 16 µs

Connect. Communicate. Collaborate UDP 1-way delay Man-Stoc with TSS Alcatel MCC – 10GE – TSS – PC Kernel web100_pktd Myricom 10G-PCIE-8A-R Fibre –rx-usecs=0 Coalescence OFF 8192 byte packets Structure similar to interrupt coalescence but this is OFF As function of recv time see gaps of ~ 130 us packets at ~ 10Gbit/s Again suggests packet bunching in the network

Connect. Communicate. Collaborate Last week: Onsala – Jodrell iNetTest - iNetTest 4 Gig TDM/SDH Lighpath over GÉANT2+ 4 Gigabit Ethernet Path over TSS NORDUnet 4 Gig on Ethernet Optical Transmission JANET & NNW

Connect. Communicate. Collaborate iNetTest: Onsala  JBO UDP Packet Jitter 1M packets 8192 byte packets 16 µs spacing Gbit/s 10 % packet loss Peaks 14 & 15.7 µs Tail to 130 µs Cyclic variations spacing 3.25 µs 45 packets at 6.5 µs (line rate)

Connect. Communicate. Collaborate Development of Real Time eVLBI at JBO eMERLIN Import – Onsala to Jodrell eMERLIN Export – Jodrell to JIVE

Connect. Communicate. Collaborate Overview of eVLBI at JBO eMERLIN CORRELATOR JBOJIVE 4Gbps light path Onsala JBO CX4 4Gbps Or fibre if > 15m CX4 4Gbps iBOB 5SwitchiBOB 4SwitchADCStation BoardVSI 4Gbps light path CX4 1Gbps iBOB 0SwitchStation BoardVSI VSI to ZDOK Switch iBOB 1Station BoardVSI VSI to ZDOK iBOB 2Station BoardVSI VSI to ZDOK iBOB 3Station BoardVSI VSI to ZDOK VLBI Mk V b receiv ers

Connect. Communicate. Collaborate Onsala to JBO: Transmit eMERLIN Import ZDOK0ZDOK0 iADC 2 channels 1024MSPS per channel 8 bit sampling 10GB PORT. 4GB/s out ASSEMBLE PACKETS Divide data into 8192 byte packets plus 16 byte MkVC compatible header Alternatively generate test packets as per iNetTest Count packets transmitted DATA Left Pol ADCCLK ONEPPS Convert data to 2 bit Measure signal power on both channels Generate real time clock based on ADC clock Pass data & sync from ADC clock to iBOB 200MHz clock domain Power PC iBOB control and monitoring via registers on the OBP bus 10/100 ethernet Measured RMS Signal Power Read and Set Real Time Clock Set up test mode parameters Control Data Flow ON/OFF Set Network Source and Destination DATA Right Pol iBOB

Connect. Communicate. Collaborate Onsala to JBO: Receive ZDOK0ZDOK0 10GB PORT. 4GB/s in RE-ORDER PACKETS Store data in SRAM location according to its time stamp and sequence number Regenerate Sync and Data Valid signals FIFO 16k deep Smooth out bunching due to network delay variations Power PC iBOB control and monitoring via registers on the OBP bus 10/100 ethernet iBOB STREAM OUT Clock data out of SRAM on 128MHz correlator clock Stream the left and right polarisations to the Station Board VSI chips ZDOK1ZDOK1 VSI Chip 0 Route left polarisation data to input stage of Station Board Send correlator clock to iBOB VSI Chip 1 Route right polarisation data to input stage of Station Board Station Board SRAM0 512 packets L pol. data SRAM1 512 packets R pol. data DATA 32 bit 128MHz clock

Connect. Communicate. Collaborate Onsala to JBO - Status iADC and transmit iBOB tested with ADC clocked at 1GS/s in diagnostic mode – ADC is set to output a fixed bit pattern iBOB receiver and VSI chip coded in Simulink and VHDL Next Steps Test iADC and transmit iBOB with sinusoidal test signal Generated deliberately out-of-order packets and bursts of greater than 4Gbps to test receiver buffering

Connect. Communicate. Collaborate JBO to JIVE: Transmit – Overview eMERLIN Export ZDOK0ZDOK0 10GB PORT. 1 Gb/s out REMOVE DELAY MODEL (COURSE) Use 72 bit wide bt 512k deep SRAM to remove eMERLIN delay model with 16 microsecond resolution FORMAT Generate byte MkVb frames Power PC iBOB control and monitoring via registers on the OBP bus 10/100 ethernet iBOB ZDOK1ZDOK1 VSI Chip 0 Process left polarisation data Detail on next slide VSI Chip 1 Process right polarisation data Detail on next slide Station Board DATA 32 bit CLOCK VALID PDATA CLOCK VALID PACKETISE Divide each MkVb frame into two 5008 byte packets (incl header) Add Mk5C header to each packet

Connect. Communicate. Collaborate JBO to JIVE: Transmit -VSI Chip Detail VSI Chip 0 - Process left polarisation data Monitor & Control Bus (MCB) Station Board DATA 32 bit CLOCK VALID PDATA Select a 128MHz band from the filter bank Subtract Merlin Delay (Fine) Variable coefficient FIR filter changes delay in steps of 1/16th of original sample period Reclock for delays up to 16ns Mixer 0 FIR Low pass filter Decimate by 8/ taps Convert 4/8 bit data to 2 bit VLBI Mixer 1 Mixer 8/16 Stream data out to iBOB over VSI cable MCB Interface Load band selection, delay model and filter parameters Mixer Bank Select 8 or 16 VLBI bands from 128MHz input Remove eMERLIN offset (a multiple of 10kHz)

Connect. Communicate. Collaborate JBO to JIVE: Status VSI chip DSP functions currently under development and simulation Transmit iBOB re-uses packetising code from iNetTest Next Steps VSI chip MCB interface – based on Verilog code provided by Dave Fort at Penticton Transmit iBOB needs coarse delay and MkVb formatting

Connect. Communicate. Collaborate Summary UDP on a 28 VC-4, 4.2 Gigabit lightpath provides the performance required for EXPReS TCP performance poor due to small Ethernet buffer size on the equipment connecting 10GE to the 4 Gig Lightpath Ethernet flow-control with short tails helps TCP The network behaves correctly. Care needed on choice of PC for 10 Gigabit. FPGA iNetTest solutions work very well. Onsala – Jodrell 4 Gigabit path now in place. Packet loss and packet spacing need more understanding. VLBI FPGA application progressing well.

Connect. Communicate. Collaborate Any Questions ?

Connect. Communicate. Collaborate For More Information For latest news and factsheets For research activities [note: only one “s”] Contact information: Richard Hughes-Jones, DANTE, Jonathan Hargreaves, JBO, EXPReS is made possible through the support of the European Commission (DG- INFSO), Sixth Framework Programme, Contract #02664

Connect. Communicate. Collaborate GÉANT2 Topology November 2006

Connect. Communicate. Collaborate TCP BIC Throughput Lon-Stoc Alcatel Metro Core Connect MCC Flow control ON 27 VC-4 RTT 31 ms Kernel web100_pktd-plus Myricom 10G-PCIE-8A-R Fibre –rx-usecs=25 Coalescence ON MTU 9000 bytes TCP buffer 32 MByte 2*BDP iperf Ave throughput 3.98 Gbit/s iperf Max throughput 4.02 Gbit/s Several re-transmits

Connect. Communicate. Collaborate TCP BIC Throughput Stoc-Lon MCC NO Flow control 27 VC-4 RTT 31 ms Kernel web100_pktd-plus Myricom 10G-PCIE-8A-R Fibre –rx-usecs=25 Coalescence ON MTU 9000 bytes TCP buffer 32 MByte 2*BDP iperf Ave throughput 71.6 Mbit/s iperf Max throughput 192 Mbit/s TCP Cwnd rapidly opens Many more re-transmits

Connect. Communicate. Collaborate TCP BIC Throughput Lon-Stoc But Faster and Longer MCC Flow control 48 VC-4 RTT 60 ms Kernel web100_pktd-plus Myricom 10G-PCIE-8A-R Fibre –rx-usecs=25 Coalescence ON MTU 9000 bytes TCP buffer 59 MByte 1 * BDP txquelen 1300 … iperf Ave throughput 6.5 Gbit/s iperf Max throughput ~6.9 Gbit/s Some glitches & re-transmits