Ken Wong Jon Turner and Prashanth Pappu Washington University Distributed Queueing Gigabit Kits (June 2002)

Slides:



Advertisements
Similar presentations
Martin Suchara, Ryan Witt, Bartek Wydrowski California Institute of Technology Pasadena, U.S.A. TCP MaxNet Implementation and Experiments on the WAN in.
Advertisements

Technische universiteit eindhoven 1 Problem 16: Design-space Exploration Jeroen Voeten, Bart Theelen Eindhoven University of Technology Embedded Systems.
RED Enhancement Algorithms By Alina Naimark. Presented Approaches Flow Random Early Drop - FRED By Dong Lin and Robert Morris Sabilized Random Early Drop.
William Stallings Data and Computer Communications 7 th Edition Chapter 13 Congestion in Data Networks.
24-1 Chapter 24. Congestion Control and Quality of Service (part 1) 23.1 Data Traffic 23.2 Congestion 23.3 Congestion Control 23.4 Two Examples.
Abhay.K.Parekh and Robert G.Gallager Laboratory for Information and Decision Systems Massachusetts Institute of Technology IEEE INFOCOM 1992.
Jon Turner Extreme Networking Achieving Nonstop Network Operation Under Extreme Operating Conditions.
High-performance bulk data transfers with TCP Matei Ripeanu University of Chicago.
MULTIMEDIA TRAFFIC MANAGEMENT ON TCP/IP OVER ATM-UBR By Dr. ISHTIAQ AHMED CH.
EE 122: Router Design Kevin Lai September 25, 2002.
CS 268: Lecture 12 (Router Design) Ion Stoica March 18, 2002.
CS4514 Networks1 Distributed Dynamic Channel Selection in Chaotic Wireless Networks By: Matthias Ihmig and Peter Steenkiste Presented by: James Cialdea.
Medium Start in TCP-Friendly Rate Control Protocol CS 217 Class Project Spring 04 Peter Leong & Michael Welch.
Lecture 5: Congestion Control l Challenge: how do we efficiently share network resources among billions of hosts? n Last time: TCP n This time: Alternative.
Performance and Robustness Testing of Explicit-Rate ABR Flow Control Schemes Milan Zoranovic Carey Williamson October 26, 1999.
UNIVERSITY OF ELECTRONIC SCIENCE & TECHNOLOGY OF CHINA IEEE INFOCOM 2015, Hong Kong RAPIER: Integrating Routing and Scheduling for Coflow-aware Data Center.
UCB Improvements in Core-Stateless Fair Queueing (CSFQ) Ling Huang U.C. Berkeley cml.me.berkeley.edu/~hlion.
1 A State Feedback Control Approach to Stabilizing Queues for ECN- Enabled TCP Connections Yuan Gao and Jennifer Hou IEEE INFOCOM 2003, San Francisco,
Localized Asynchronous Packet Scheduling for Buffered Crossbar Switches Deng Pan and Yuanyuan Yang State University of New York Stony Brook.
1 IP routers with memory that runs slower than the line rate Nick McKeown Assistant Professor of Electrical Engineering and Computer Science, Stanford.
Computer Networks Switching Professor Hui Zhang
Load Balanced Birkhoff-von Neumann Switches
Cost-Performance Tradeoffs in MPLS and IP Routing Selma Yilmaz Ibrahim Matta Boston University.
Jon Turner (and a cast of thousands) Washington University Design of a High Performance Active Router Active Nets PI Meeting - 12/01.
Washington WASHINGTON UNIVERSITY IN ST LOUIS How to Implement the WaveVideo Plugin in an MSR Router.
1 Robust Transport Protocol for Dynamic High-Speed Networks: enhancing XCP approach Dino M. Lopez Pacheco INRIA RESO/LIP, ENS of Lyon, France Congduc Pham.
Time Parallel Simulations II ATM Multiplexers and G/G/1 Queues.
Control Processor Switch Fabric ATM Switch Core Port Processors FPX SPC LC IPPOPP FPX SPC LC IPPOPP FPX SPC LC IPPOPP FPX SPC LC IPPOPP FPX SPC LC IPPOPP.
ACN: RED paper1 Random Early Detection Gateways for Congestion Avoidance Sally Floyd and Van Jacobson, IEEE Transactions on Networking, Vol.1, No. 4, (Aug.
Univ. of TehranAdv. topics in Computer Network1 Advanced topics in Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
High-speed TCP  FAST TCP: motivation, architecture, algorithms, performance (by Cheng Jin, David X. Wei and Steven H. Low)  Modifying TCP's Congestion.
Engineering Jon Turner Computer Science & Engineering Washington University Coarse-Grained Scheduling for Multistage Interconnects.
1 IEEE Meeting July 19, 2006 Raj Jain Modeling of BCN V2.0 Jinjing Jiang and Raj Jain Washington University in Saint Louis Saint Louis, MO
Cisco 3 - Switching Perrine. J Page 16/4/2016 Chapter 4 Switches The performance of shared-medium Ethernet is affected by several factors: data frame broadcast.
1 Dr. Ali Amiri TCOM 5143 Lecture 8 Capacity Assignment in Centralized Networks.
Stress Resistant Scheduling Algorithms for CIOQ Switches Prashanth Pappu Applied Research Laboratory Washington University in St Louis “Stress Resistant.
Interconnect simulation. Different levels for Evaluating an architecture Numerical models – Mathematic formulations to obtain performance characteristics.
Interconnect simulation. Different levels for Evaluating an architecture Numerical models – Mathematic formulations to obtain performance characteristics.
Advanced Technology Laboratories Practical Considerations for Smoothing Multimedia Traffic over Packet- Switched Networks Christos Tryfonas
CEN 5501C - Computer Networks - Spring UF/CISE - Newman1 Computer Networks Chapter 5 – Hubs, Switches, VLANs, Fast Ethernet.
Department of Computer Science MapReduce for the Cell B. E. Architecture Marc de Kruijf University of Wisconsin−Madison Advised by Professor Sankaralingam.
Washington WASHINGTON UNIVERSITY IN ST LOUIS 1 DTI Visit - John DeHart- 4/25/2001 Agenda l WU/ARL Background – John DeHart (15 minutes) l DTI Background.
T. S. Eugene Ngeugeneng at cs.rice.edu Rice University1 COMP/ELEC 429/556 Introduction to Computer Networks Principles of Congestion Control Some slides.
Network teleology Damon Wischik
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Real-Time Networks – WAN Packet Scheduling.
Scheduling Determines which packet gets the resource. Enforces resource allocation to each flows. To be “Fair”, scheduling must: –Keep track of how many.
We used ns-2 network simulator [5] to evaluate RED-DT and compare its performance to RED [1], FRED [2], LQD [3], and CHOKe [4]. All simulation scenarios.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
ISP and Egress Path Selection for Multihomed Networks Amogh Dhamdhere, Constantine Dovrolis Networking and Telecommunications Group Georgia Institute of.
Multi-objective Topology Synthesis and FPGA Prototyping Framework of Application Specific Network-on-Chip m Akram Ben Ahmed Xinyu LI, Omar Hammami.
Washington WASHINGTON UNIVERSITY IN ST LOUIS SPC II Architecture.
HP Labs 1 IEEE Infocom 2003 End-to-End Congestion Control for InfiniBand Jose Renato Santos, Yoshio Turner, John Janakiraman HP Labs.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Ethernet Packet Filtering – Part 2 Øyvind Holmeide 10/28/2014 by.
Instructor Materials Chapter 6: Quality of Service
OPERATING SYSTEMS CS 3502 Fall 2017
Topics discussed in this section:
CS 268: Router Design Ion Stoica February 27, 2003.
Traffic Engineering with AIMD in MPLS Networks
Data Center Networks and Switching and Queueing
TCP, XCP and Fair Queueing
Lecture 19 – TCP Performance
Demonstration of a High Performance Active Router DARPA Demo - 9/24/99
Presented by LINGLING MENG( ), XUN XU( )
Multi-hop Coflow Routing and Scheduling in Data Centers
Computer Science Division
Design Review.
EE 122: Lecture 7 Ion Stoica September 18, 2001.
Understanding Congestion Control Mohammad Alizadeh Fall 2018
Presentation transcript:

Ken Wong Jon Turner and Prashanth Pappu Washington University Distributed Queueing Gigabit Kits (June 2002)

2 -Ken Wong, June 2002 Distributed Queueing Switch Fabric TI IOIOIO IO IO IO Control Processor Routing Sched. Routing Sched. Routing Sched. Routing Sched. Routing Sched. Routing Sched. queue per output periodic queue length reports Scheduler paces each queue according to backlog share

3 -Ken Wong, June 2002 What is Distributed Queueing? Performs like an output queueing system »I.e., Maximize output link utilization »Without the need for a switch with speed-up of N Goals of the DQ Algorithm »Avoid switch fabric congestion (and therefore cell loss) »Avoid output underflow (maximize output link utilization) Topics »The STRESS experiment »Discrete-event simulation results »SPC-only prototype measurements »Current work

4 -Ken Wong, June 2002 Stress Test can vary number of inputs and outputs used, and length of “phases” K = 4 Sources R = 5 Phases 70 Mbps S x L = 2 x 70 = 140 Mbps L = 70 Mbps

5 -Ken Wong, June 2002 Stress Test Simulation - Backlog

6 -Ken Wong, June 2002 Basic Ideas To Avoid Output Underflow »Input-side backlog B(i,j) indicates need for switch bandwidth »Output-side backlog B(j) indicates less need for switch bandwidth (back pressure) Apportion switch capacity based on relative input- side backlogs to avoid switch congestion and output underflow

7 -Ken Wong, June 2002 Basic DQ Algorithm Goal: Avoid switch congestion and output queue underflow. Let hi(i,j) be input i’s share of input-side backlog to output j. »Avoid switch congestion by sending from input i to output j at rate  L  S  hi(i,j) »where L is external link rate and S is switch speedup Let lo(i,j) be input i’s share of total backlog for output j. »Avoid underflow of queue at output j by sending from input i to output j at rate  L  lo(i,j) »This works if L  (lo(i,1)+···+lo(i,n))  L  S for all i Let wt(i,j) be the ratio of lo(i,j) to lo(i,1)+···+lo(i,n). Let rate(i,j)=L  S  min ( wt(i,j), hi(i,j) ). Note: Algorithm avoids congestion and output underflow for large enough S.

8 -Ken Wong, June 2002 Stress Test Simulation - Backlog

9 -Ken Wong, June 2002 Stress Test Simulation - Min Rates

10 -Ken Wong, June 2002 Stress Test Simulation - Actual Rates

11 -Ken Wong, June 2002 SPC Code DQ cell (40 of 48 bytes for N=8 ports) »Port number »Output backlog B(i) »N input backlogs B(i,j) SPC kernel at each port (Every D = 500 usec) »Multicast 1 DQ cell containing its backlogs to all ports using vpi/vci 0/61 »Read incoming DQ cells from all ports »Call dq_set_pace() dq_set_pace »Compute rate(i,j) for all j »Set APIC pacing rates for each j

12 -Ken Wong, June 2002 Stress Test Measurement Results

13 -Ken Wong, June 2002 Improving Basic Algorithm Allocated rates will oscillate when backlogs are near 0 »Use artificial minimum backlog in hi(i,j) and artificial minimum rate in lo(i,j) Does not always make full use of available input bandwidth »Does not reallocate bandwidth that is lost when queues are “output limited” »Extend algorithm to reallocate “excess” rate Allocate rates in decreasing order of largest lo(i,j)/hi(i,j) I.e., rate(i,j) can “donate” excess rate to remaining rates

14 -Ken Wong, June 2002 Current/Future Work “Design and Evaluation of a High-Performance Dynamically Extensible Router,” DANCE Conference, May FCFS property across input streams SPC »Output-limited rate redistribution algorithm »Dynamic configuration of DQ parameters and algorithm INFOCOM 2003 paper (Pappu, Turner, Wong) »Handles unequal link speed case FPGA implementation Extension to fair queueing Extension to reserved bandwidth flows

15 -Ken Wong, June 2002 Summary Fluid model simulator (C++) Discrete-event simulator (C++) »Study alternative DQ algorithms »Understand operating characteristics in controlled environment »Pre-testing of SPC (integer) algorithm SPC-only prototype »Basically works »Great monitoring tools (Java GUI, DQ-cell capture, Other utilities) »Great traffic generator (AAL5Generator) FPX version coming Lots of fun

16 -Ken Wong, June 2002 Backlog Shares Let hi(i,j) be input i’s share of input-side backlog to output j: Let lo(i,j) be input i’s share of total backlog for output j:

17 -Ken Wong, June 2002 Allocated Rates r(i,j) where

18 -Ken Wong, June 2002 Artificial Backlogs & Rates Let hi(i,j) be input i’s share of input-side backlog to output j: Let lo(i,j) be input i’s share of total backlog for output j:

19 -Ken Wong, June 2002 Rate Redistribution Algorithm Revised rate allocation at input i: R = S  L repeat n times Let j be unassigned queue with largest ratio lo(i,j)/hi(i,j) Let wt(i,j) = lo(i,j)/(sum of lo(i,q) for unassigned queues q) rate(i,j) = min{R  wt(i,j), S  L  hi(i,j)} R = R - rate(i,j)