Presentation is loading. Please wait.

Presentation is loading. Please wait.

The Concurrent Matching Switch Architecture Bill Lin (University of California, San Diego) Isaac Keslassy (Technion, Israel)

Similar presentations


Presentation on theme: "The Concurrent Matching Switch Architecture Bill Lin (University of California, San Diego) Isaac Keslassy (Technion, Israel)"— Presentation transcript:

1 The Concurrent Matching Switch Architecture Bill Lin (University of California, San Diego) Isaac Keslassy (Technion, Israel)

2 IEEE INFOCOM, Barcelona, April 23-29, 20062 Motivation  Traffic demands expected to grow, driven in part by increasing broadband adoption  10x increase in broadband subscription in just last 3 years, already over 100 million subscribers  1.25-2.4 Gbps fiber to homes emerging (GPON, GEPON, EPON, BPON …)  Larger routers needed for consolidation  Operators need scalable routers that provide good performance

3 IEEE INFOCOM, Barcelona, April 23-29, 20063 Limitations of Previous Routers  Output-Queueing (OQ) Switch  Well-known to provide good performance, but scalability hampered by need for internal N speedup  Crossbar Switches, using Input-Queueing (IQ) or Combined Input-Output Queueing (CIOQ)  Huge body of literature, but scalability hampered by need for centralized scheduling and arbitrary per- packet switch configurations

4 IEEE INFOCOM, Barcelona, April 23-29, 20064 Limitations of Previous Routers  Load-Balanced Routers  No centralized scheduler  Scalable fixed configuration switch fabric in optics  Guarantees 100% throughput  100 Tb/s design with 160 Gb/s linecards shown  But packets may be delivered “out-of-order”

5 IEEE INFOCOM, Barcelona, April 23-29, 20065 Out R R R R/N R R R Basic Load-Balanced Router R/N In Linecards A1 A2 A3 B1 C1 C2 B1 B2 C1

6 IEEE INFOCOM, Barcelona, April 23-29, 20066 Out R R R R/N R R R Basic Load-Balanced Router R/N In Linecards A1 A2 A3 B1 C1 C2 B1 B2 C1 Many Fabric Options (any spreading device)  Space: Full uniform mesh  Wavelength: Static WDM  Time: Round-robin switches Just need fixed uniform rate channels at R/N No dynamic switch reconfigurations Many Fabric Options (any spreading device)  Space: Full uniform mesh  Wavelength: Static WDM  Time: Round-robin switches Just need fixed uniform rate channels at R/N No dynamic switch reconfigurations

7 IEEE INFOCOM, Barcelona, April 23-29, 20067 Out R R R R/N R R R Basic Load-Balanced Router R/N In Linecards A1 A2 A3 B1 C1 C2 B1 B2 C1 Out of Order !

8 IEEE INFOCOM, Barcelona, April 23-29, 20068 Packet Ordering Problem  Out-of-order packet delivery is undesirable (e.g. bad for TCP)  Previous techniques (e.g. EDF, UFS, FOFF)  Accumulate and delay packets at input/middle ports  And/or delay and re-order packets at middle/output ports  However, these techniques are unsatisfactory because they add substantial delays

9 IEEE INFOCOM, Barcelona, April 23-29, 20069 Impact on Avg. Delay (N = 128, uniform traffic) Basic Load-Balanced UFS FOFF Significant Delay

10 IEEE INFOCOM, Barcelona, April 23-29, 200610 Concurrent Matching Switch (CMS)  Basic idea  Retain load-balanced router structure and scalability of a fixed optical mesh, no dynamic reconfiguration  Instead of packets, load-balance “request tokens” to N parallel “schedulers”  Each scheduler independently solves its own matching  Packets delivered in order based on matching results Goal is to provide much lower average delay than accumulation-based methods for ensuring packet order while retaining 100% throughput and scalability

11 IEEE INFOCOM, Barcelona, April 23-29, 200611 Out R R R R R R Architecture Linecards A1 B1 C1 C2 C1 B2 C2 Retain Fixed Configuration Meshes BUT move packet buffers to INPUT A2 A3 A4

12 IEEE INFOCOM, Barcelona, April 23-29, 200612 Out R R R R R R Architecture Linecards A1 B1 C1 C2 C1 B2 C2 A2 A3 A4 2 0 1 1 0 1 1 0 0 0 0 1 0 0 1 0 1 1 0 1 0 0 0 0 0 0 0 Add N 2 Token Counters

13 IEEE INFOCOM, Barcelona, April 23-29, 200613 Out R R R R R R Arrival Phase Linecards A1 C1 C2 C1 C2 A2 A3 A4 2 0 1 1 0 1 1 0 0 0 0 1 0 0 1 0 1 1 0 1 0 0 0 0 0 0 0 B1 B2 A1 A2 B1 B2 C2 C3 C4

14 IEEE INFOCOM, Barcelona, April 23-29, 200614 Out R R R R R R Arrival Phase Linecards A1 C1 C2 C1 C2 A2 A3 A4 2 0 1 1 0 1 1 0 0 1 0 1 0 0 1 0 1 1 1 1 0 0 0 0 1 0 0 B1 B2 B1 B2 C2 C3 C4 A1 A2

15 IEEE INFOCOM, Barcelona, April 23-29, 200615 Out R R R R R R Arrival Phase Linecards A1 C1 C2 C1 C2 A2 A3 A4 2 1 1 1 0 1 1 0 0 1 0 1 0 1 1 0 1 1 1 1 0 0 1 0 1 0 0 B1 B2 A1 A2 B1 B2 C2 C3 C4

16 IEEE INFOCOM, Barcelona, April 23-29, 200616 Out R R R R R R Arrival Phase Linecards A1 C1 C2 C1 C2 A2 A3 A4 2 1 1 1 0 1 1 0 0 1 0 1 0 1 1 0 1 2 1 1 1 0 1 0 1 0 1 B1 B2 A1 A2 B1 B2 C2 C3 C4

17 IEEE INFOCOM, Barcelona, April 23-29, 200617 Out R R R R R R Matching Phase Linecards A1 C1 C2 A2 A3 A4 2 1 1 1 0 1 1 0 0 1 0 1 0 1 1 0 1 2 1 1 1 0 1 0 1 0 1 B1 B2 A1 A2 B1 B2 C1 C2 C1 C2 C3 C4

18 IEEE INFOCOM, Barcelona, April 23-29, 200618 Out R R R R R R Matching Phase Linecards A1 C1 C2 2 1 1 1 0 1 1 0 0 1 0 1 0 1 1 0 1 2 1 1 1 0 1 0 1 0 1 B1 A2 A3 A4 B1 A1 A2 C1 B1 B2 C2 C1 C2 C3 C4

19 IEEE INFOCOM, Barcelona, April 23-29, 200619 Out R R R R R R Matching Phase Linecards A1 C1 C2 1 1 1 0 0 1 0 0 0 1 0 0 0 0 1 0 0 2 1 1 0 0 0 0 1 0 0 B1 A2 A3 A4 B1 B2 C3 C4 A1 A2 C1 B1 C1 B2 C2

20 IEEE INFOCOM, Barcelona, April 23-29, 200620 Out R R R R R R Departure Phase Linecards A1 C1 C2 1 1 1 0 0 1 0 0 0 1 0 0 0 0 1 0 0 2 1 1 0 0 0 0 1 0 0 B1 A2 A3 A4 B1 B2 C3 C4 A1 A2 C1 B1 C1 B2 C2

21 IEEE INFOCOM, Barcelona, April 23-29, 200621 Distributed Operation  All linecards operate in parallel in a fully distributed manner  Arrival, matching, and departure phases overlap in a pipeline manner

22 IEEE INFOCOM, Barcelona, April 23-29, 200622 Main Ideas  Each middle linecard acts as a “micro-router” with 1/N th of the arrival traffic  Therefore, it gets N time slots to think about the schedule, time complexity amortized by a factor of N  If each micro-router can guarantee 100% throughput, so can the overall switch  Each micro-router can work the way that it wants, leveraging huge body of existing work on scheduling CMS provides a new way of aggregating routers together. Therefore, provides a new way of thinking about scaling routers.

23 IEEE INFOCOM, Barcelona, April 23-29, 200623 Practicality  Well-studied randomized approximations to Maximum Weighted Matching have been shown to achieve very good results [Tassiulas 1998] [Giaccone, Prabhakar & Shah, 2003]  These algorithms only require O(N) complexity using sequential hardware, but can provide 100% throughput guarantees with no speedup and good delay results  Amortized over N time slots, CMS with these scheduling algorithms can achieve  O(1) time complexity (independent of switch size)  100% throughput  Good delay results  Packet ordering

24 IEEE INFOCOM, Barcelona, April 23-29, 200624 Experimental Results (N = 128, uniform traffic) Basic Load-Balanced UFS FOFF CMS Difference of N time slots for matching phase

25 IEEE INFOCOM, Barcelona, April 23-29, 200625 Conclusions  CMS is scalable  Leverages scalability of fixed optical meshes  Fully distributed  Can achieve O(1) time complexity  CMS achieves good performance  Guarantees 100% throughput  Guarantees packet ordering  Experimentally achieves low packet delays  CMS provides new way of thinking about scaling routers and connects huge body of existing literature on scheduling to load-balanced routers

26 Thank You


Download ppt "The Concurrent Matching Switch Architecture Bill Lin (University of California, San Diego) Isaac Keslassy (Technion, Israel)"

Similar presentations


Ads by Google