Jon Turner Resilient Cell Resequencing in Terabit Routers.

Slides:



Advertisements
Similar presentations
Switching Techniques In large networks there might be multiple paths linking sender and receiver. Information may be switched as it travels through various.
Advertisements

Lecture 4. Topics covered in last lecture Multistage Switching (Clos Network) Architecture of Clos Network Routing in Clos Network Blocking Rearranging.
1 IK1500 Communication Systems IK1330 Lecture 3: Networking Anders Västberg
Jaringan Komputer Lanjut Packet Switching Network.
1 CONGESTION CONTROL. 2 Congestion Control When one part of the subnet (e.g. one or more routers in an area) becomes overloaded, congestion results. Because.
TELE202 Lecture 8 Congestion control 1 Lecturer Dr Z. Huang Overview ¥Last Lecture »X.25 »Source: chapter 10 ¥This Lecture »Congestion control »Source:
William Stallings Data and Computer Communications 7 th Edition Chapter 13 Congestion in Data Networks.
24-1 Chapter 24. Congestion Control and Quality of Service (part 1) 23.1 Data Traffic 23.2 Congestion 23.3 Congestion Control 23.4 Two Examples.
Traffic Shaping Why traffic shaping? Isochronous shaping
1.  Congestion Control Congestion Control  Factors that Cause Congestion Factors that Cause Congestion  Congestion Control vs Flow Control Congestion.
CS 408 Computer Networks Congestion Control (from Chapter 05)
Priority Scheduling and Buffer Management for ATM Traffic Shaping Authors: Todd Lizambri, Fernando Duran and Shukri Wakid Present: Hongming Wu.
Router Architecture : Building high-performance routers Ian Pratt
IP Telephony Project By: Liane Lewin Shahar Eytan Guided By: Ran Cohen - IBM Vitali Sokhin - Technion.
Nick McKeown CS244 Lecture 6 Packet Switches. What you said The very premise of the paper was a bit of an eye- opener for me, for previously I had never.
Making Parallel Packet Switches Practical Sundar Iyer, Nick McKeown Departments of Electrical Engineering & Computer Science,
Scaling Internet Routers Using Optics Producing a 100TB/s Router Ashley Green and Brad Rosen February 16, 2004.
EE 4272Spring, 2003 Chapter 14 LAN Systems Ethernet (CSMA/CD)  ALOHA  Slotted ALOHA  CSMA  CSMA/CD Token Ring /FDDI Fiber Channel  Fiber Channel Protocol.
COMP680E by M. Hamdi 1 Course Exam: Review April 17 (in-Class)
Internetworking Fundamentals (Lecture #2) Andres Rengifo Copyright 2008.
Error Checking continued. Network Layers in Action Each layer in the OSI Model will add header information that pertains to that specific protocol. On.
Dragonfly Topology and Routing
Router Architectures An overview of router architectures.
Load Balanced Birkhoff-von Neumann Switches
ATM SWITCHING. SWITCHING A Switch is a network element that transfer packet from Input port to output port. A Switch is a network element that transfer.
TO p. 1 Spring 2006 EE 5304/EETS 7304 Internet Protocols Tom Oh Dept of Electrical Engineering Lecture 9 Routers, switches.
1 Copyright © Monash University ATM Switch Design Philip Branch Centre for Telecommunications and Information Engineering (CTIE) Monash University
CPS 356: Introduction to Computer Networks Lecture 7: Switching technologies Ch 2.8.2, 3.1, 3.4 Xiaowei Yang
Advance Computer Networking L-8 Routers Acknowledgments: Lecture slides are from the graduate level Computer Networks course thought by Srinivasan Seshan.
Survey of Performance Analysis on Banyan Networks Written By Nathan D. Truhan Kent State University.
CS 447 Network & Data Communication QoS (Quality of Service) & DiffServ Introduction Department of Computer Science Southern Illinois University Edwardsville.
Chapter 12 Transmission Control Protocol (TCP)
Univ. of TehranAdv. topics in Computer Network1 Advanced topics in Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
CSC 581 Communication Networks II Chapter 7c: Congestion Control Dr. Cheer-Sun Yang.
Applied research laboratory 1 Scaling Internet Routers Using Optics Isaac Keslassy, et al. Proceedings of SIGCOMM Slides:
ISLIP Switch Scheduler Ali Mohammad Zareh Bidoki April 2002.
Packet Forwarding. A router has several input/output lines. From an input line, it receives a packet. It will check the header of the packet to determine.
Case Study: The Abacus Switch CS Goals and Considerations Handles cell relay (fixed-size packets) Can be modified to handle variable-sized packets.
Stress Resistant Scheduling Algorithms for CIOQ Switches Prashanth Pappu Applied Research Laboratory Washington University in St Louis “Stress Resistant.
MASCON: A Single IC Solution to ATM Multi-Channel Switching With Embedded Multicasting Ali Mohammad Zareh Bidoki April 2002.
1 IEX8175 RF Electronics Avo Ots telekommunikatsiooni õppetool, TTÜ raadio- ja sidetehnika inst.
An End-to-End Service Architecture r Provide assured service, premium service, and best effort service (RFC 2638) Assured service: provide reliable service.
1 How scalable is the capacity of (electronic) IP routers? Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University
CCNA3 Module 4 Brierley Module 4. CCNA3 Module 4 Brierley Topics LAN congestion and its effect on network performance Advantages of LAN segmentation in.
Indian Institute of Technology Bombay 1 Communication Networks Prof. D. Manjunath
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
An End-to-End Service Architecture r Provide assured service, premium service, and best effort service (RFC 2638) Assured service: provide reliable service.
Timestamp snooping: an approach for extending SMPs Milo M. K. Martin et al. Summary by Yitao Duan 3/22/2002.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Chapter 10 Congestion Control in Data Networks and Internets 1 Chapter 10 Congestion Control in Data Networks and Internets.
Chapter 5 Peer-to-Peer Protocols and Data Link Layer Timing Recovery.
Congestion Control in Data Networks and Internets
Instructor Materials Chapter 6: Quality of Service
Topics discussed in this section:
Chapter 6 TCP Congestion Control
Queue Management Jennifer Rexford COS 461: Computer Networks
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 6: Quality of Service Connecting Networks.
CONGESTION CONTROL.
Advance Computer Networking
Congestion Control (from Chapter 05)
Chapter 6 TCP Congestion Control
Switching Techniques.
EE 122: Lecture 7 Ion Stoica September 18, 2001.
Congestion Control (from Chapter 05)
TCP Overview.
Chapter 2 Switching.
Congestion Control (from Chapter 05)
Congestion Control (from Chapter 05)
Congestion Control (from Chapter 05)
Switching Chapter 2 Slides Prepared By: -
Presentation transcript:

Jon Turner Resilient Cell Resequencing in Terabit Routers

2 - Jonathan Turner Why Resequencing? Multistage interconnection networks with buffered switch elements and dynamic routing. »scalable, bandwidth-efficient architecture »makes good use of modern CMOS ICs - single chip can provide 100 Gb/s thruput and buffer thousands of cells Load Balancing Stage Input Line Cards Output Line Cards Shared Memory Switch Elements

3 - Jonathan Turner Resequencing with Sequence Numbers Drawbacks »each output needs N resequencing arrays »initialization when line card comes on-line »timeouts needed to cope with lost cells »multicast requires per flow resequencing arrays Sequence Numbers Added Reseq. Arrays indexed by seq. #

4 - Jonathan Turner Time-Based Resequencing Single resequencing buffer per output. Cells held until “age” exceeds threshold (T ). Options for late cells. »discard (strict resequencing) or buffer (loose) Timestamps Added Timestamp Ordered Reseq. Buffer Output Buffer

5 - Jonathan Turner Henrion’s Strict Resequencer Design Implemented using linked lists in common memory. Constant time per cell. arriving cell ts “ready” cells T slot timing wheel now insert at timestamp mod T current time modulo T

6 - Jonathan Turner now Implementing Loose Resequencing Only approximates loose resequencing. »must still discard “really late” cells. »low cost of large timing wheel allows good approximation lag now T arriving cell ts 1 ts 1 +T late arriving cell ts 2 ts 2 +T “Normal” insertion range “ready” cells “waiting” cells next lag new cell inserted at timestamp+T Cannot just insert late cells into output list.

7 - Jonathan Turner Fast-Forwarding the Lag Pointer Lag pointer must advance to next non-empty list on every clock tick for constant time operation. »no time to check successive pointers in timing wheel »use fast-forward bits to speed-up process summary word bit for current time next non-empty timing wheel slot Two memory reads suffice to find next slot. »32 bit words allows range of 1024 »128 bit words allows range of 16,384

8 - Jonathan Turner Synchronization Time-based resequencing requires synchronization of all line cards. »in small routers, requires just a common backplane signal »in large routers, line cards connected to network only by optical data cables Requires low-level clock synchronization protocol. »“master” line card issues periodic broadcast synchronization messages »network forwards sync messages with constant delay only approximate synchronization is necessary »new clock master selected on failure Independent line card clocks require adjustments. »suspend transmission when delaying clock

9 - Jonathan Turner Performance of Strict Resequencing Simple random traffic. 3 stage network, 8 port SEs, 512 (shared) cell buffers. 1st stage SEs use round robin load balancing for each input. Late cells rare with small speedup For systems with 10G links, delay for 256 cells is 10  s.

10 - Jonathan Turner Performance on Adversarial Traffic Growing network delay Cells discarded when delay exceeds T Delay drops as SE buffers drain Resequencer recovers when delay drops below T 2:1 overload at “target” output

11 - Jonathan Turner Performance on Bursty Traffic 100% input load. Input picks an output at random – overload at 1 in 4 outputs Stays with target output for geometrically distributed time. Poor performance even for large speedups.

12 - Jonathan Turner Loose Reseq. with Adversarial Traffic Arriving cells are younger than oldest waiting cells, so no resequencing errors

13 - Jonathan Turner Loose Reseq. with Bursty Traffic Tolerates about 3x longer bursts than strict reseq.

14 - Jonathan Turner Adaptive Resequencing Adjust age threshold to match observed delay. »parameters: window size (W), short term delay difference bound (  ) »variables: max delay in current measurement window (d 0 ) and previous measurement window (d -1 ) »age threshold =  + max{d 0, d -1 } »implement by extending loose, fixed threshold design Theorem. (simplified). If cell c 1 enters interconnection network just before c 2 and exits no later than  after c 2, then adaptive resequencer with W   forwards c 1 before c 2. Resequencing errors caused by excessive delay variability, rather than large delays.

15 - Jonathan Turner Performance on Adversarial Traffic Arriving cells are younger than oldest waiting cells, so no resequencing errors Resequencer adds small increment to network delay. Age threshold tracks network delay. Resequencer occupancy stays bounded.

16 - Jonathan Turner Performance on Bursty Traffic Tolerates about 2x longer bursts than loose reseq. Less sensitive to extremely large bursts Performance degrades when switch buffers fill. Caused by delay variation in first stage.

17 - Jonathan Turner Boosting Performance for Long Bursts Limiting first stage buffering cuts variability. Small first stage buffer gives good performance with modest “extra” delay Smallest buffer reduces switch throughput by 2%.

18 - Jonathan Turner Speeding up Threshold Reductions Downward age threshold adjustments are delayed by window mechanism. Can reduce delay using “finer-grained” windows. »max delay d 0,d -1,d -2,... in k measurement windows »age threshold =  + max{d 0, d -1,d -2,...} Theorem. (simplified). If cell c 1 enters interconnection network just before c 2 and exits no later than  after c 2, then adaptive resequencer with (k  1)W   forwards c 1 before c 2. For large k, max delay in threshold adjustment is cut almost in half.

19 - Jonathan Turner Closing Remarks Adaptive resequencing can virtually eliminate resequencing errors in multistage networks. »to handle most extreme traffic, need to limit delay variability in load balancing stages »in systems that regulate the overall flow of traffic, extreme cases should not arise at all Henrion’s strict resequencer can be modified for adaptive resequencing – O(1) time per cell. More sophisticated interconnection network can increase throughput and reduce delay variability. »per destination queues with controlled buffer sharing »per destination flow control and load balancing »timestamp-ordered switch element queues