A Scalable Switch for Service Guarantees Bill Lin (University of California, San Diego) Isaac Keslassy (Technion, Israel)

Slides:



Advertisements
Similar presentations
1 EE384Y: Packet Switch Architectures Part II Load-balanced Switch (Borrowed from Isaac Keslassys Defense Talk) Nick McKeown Professor of Electrical Engineering.
Advertisements

1 Maintaining Packet Order in Two-Stage Switches Isaac Keslassy, Nick McKeown Stanford University.
A Novel 3D Layer-Multiplexed On-Chip Network
Configuring a Load-Balanced Switch in Hardware Srikanth Arekapudi, Shang-Tse (Da) Chuang, Isaac Keslassy, Nick McKeown Stanford University.
Fast Buffer Memory with Deterministic Packet Departures Mayank Kabra, Siddhartha Saha, Bill Lin University of California, San Diego.
Clean Slate Design for the Internet Designing a Predictable Backbone Network with Valiant Load Balancing NSF 100 x 100 Clean.
High-Performance Networking Group Isaac Keslassy, Nick McKeown
KARL NADEN – NETWORKS (18-744) FALL 2010 Overview of Research in Router Design.
Submitters: Erez Rokah Erez Goldshide Supervisor: Yossi Kanizo.
Frame-Aggregated Concurrent Matching Switch Bill Lin (University of California, San Diego) Isaac Keslassy (Technion, Israel)
A Load-Balanced Switch with an Arbitrary Number of Linecards Isaac Keslassy, Shang-Tse Chuang, Nick McKeown.
Scaling Internet Routers Using Optics UW, October 16 th, 2003 Nick McKeown Joint work with research groups of: David Miller, Mark Horowitz, Olav Solgaard.
Isaac Keslassy, Shang-Tse (Da) Chuang, Nick McKeown Stanford University The Load-Balanced Router.
Making Parallel Packet Switches Practical Sundar Iyer, Nick McKeown Departments of Electrical Engineering & Computer Science,
Packet-Mode Emulation of Output-Queued Switches David Hay, CS, Technion Joint work with Hagit Attiya (CS, Technion), Isaac Keslassy (EE, Technion)
1 Comnet 2006 Communication Networks Recitation 5 Input Queuing Scheduling & Combined Switches.
The Concurrent Matching Switch Architecture Bill Lin (University of California, San Diego) Isaac Keslassy (Technion, Israel)
Scaling Internet Routers Using Optics Producing a 100TB/s Router Ashley Green and Brad Rosen February 16, 2004.
1 Architectural Results in the Optical Router Project Da Chuang, Isaac Keslassy, Nick McKeown High Performance Networking Group
Packet-Mode Emulation of Output-Queued Switches David Hay, CS, Technion Joint work with Hagit Attiya (CS) and Isaac Keslassy (EE)
Using Load-Balancing To Build High-Performance Routers Isaac Keslassy, Shang-Tse (Da) Chuang, Nick McKeown Stanford University.
048866: Packet Switch Architectures Dr. Isaac Keslassy Electrical Engineering, Technion MSM.
048866: Packet Switch Architectures Dr. Isaac Keslassy Electrical Engineering, Technion The.
Guaranteed Smooth Scheduling in Packet Switches Isaac Keslassy (Stanford University), Murali Kodialam, T.V. Lakshman, Dimitri Stiliadis (Bell-Labs)
048866: Packet Switch Architectures Dr. Isaac Keslassy Electrical Engineering, Technion Scaling.
A Load-Balanced Switch with an Arbitrary Number of Linecards Isaac Keslassy, Shang-Tse (Da) Chuang, Nick McKeown Stanford University.
048866: Packet Switch Architectures Dr. Isaac Keslassy Electrical Engineering, Technion Course.
Scaling Internet Routers Using Optics Isaac Keslassy, Shang-Tse Da Chuang, Kyoungsik Yu, David Miller, Mark Horowitz, Olav Solgaard, Nick McKeown Department.
1 Internet Routers Stochastics Network Seminar February 22 nd 2002 Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University.
1 EE384Y: Packet Switch Architectures Part II Load-balanced Switches Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University.
Scheduling in Delay Graphs with Applications to Optical Networks Isaac Keslassy (Stanford University), Murali Kodialam, T.V. Lakshman, Dimitri Stiliadis.
1 Trend in the design and analysis of Internet Routers University of Pennsylvania March 17 th 2003 Nick McKeown Professor of Electrical Engineering and.
1 Achieving 100% throughput Where we are in the course… 1. Switch model 2. Uniform traffic  Technique: Uniform schedule (easy) 3. Non-uniform traffic,
Optimal Load-Balancing Isaac Keslassy (Technion, Israel), Cheng-Shang Chang (National Tsing Hua University, Taiwan), Nick McKeown (Stanford University,
1 Netcomm 2005 Communication Networks Recitation 5.
048866: Packet Switch Architectures Dr. Isaac Keslassy Electrical Engineering, Technion Maximal.
048866: Packet Switch Architectures Dr. Isaac Keslassy Electrical Engineering, Technion Scheduling.
Distributed Scheduling Algorithms for Switching Systems Shunyuan Ye, Yanming Shen, Shivendra Panwar
1 Growth in Router Capacity IPAM, Lake Arrowhead October 2003 Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University.
Pipelined Two Step Iterative Matching Algorithms for CIOQ Crossbar Switches Deng Pan and Yuanyuan Yang State University of New York, Stony Brook.
Localized Asynchronous Packet Scheduling for Buffered Crossbar Switches Deng Pan and Yuanyuan Yang State University of New York Stony Brook.
1 IP routers with memory that runs slower than the line rate Nick McKeown Assistant Professor of Electrical Engineering and Computer Science, Stanford.
Load Balanced Birkhoff-von Neumann Switches
Nick McKeown CS244 Lecture 7 Valiant Load Balancing.
Merits of a Load-Balanced AAPN 1.Packets within a flow are transported to their correct destinations in sequence. This is due to the 1:1 logical connection.
High Speed Stable Packet Switches Shivendra S. Panwar Joint work with: Yihan Li, Yanming Shen and H. Jonathan Chao New York State Center for Advanced Technology.
Summary of switching theory Balaji Prabhakar Stanford University.
Advance Computer Networking L-8 Routers Acknowledgments: Lecture slides are from the graduate level Computer Networks course thought by Srinivasan Seshan.
Designing Packet Buffers for Internet Routers Friday, October 23, 2015 Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford.
EE384y EE384Y: Packet Switch Architectures Part II Scaling Crossbar Switches Nick McKeown Professor of Electrical Engineering and Computer Science,
Applied research laboratory 1 Scaling Internet Routers Using Optics Isaac Keslassy, et al. Proceedings of SIGCOMM Slides:
1 Performance Guarantees for Internet Routers ISL Affiliates Meeting April 4 th 2002 Nick McKeown Professor of Electrical Engineering and Computer Science,
Guaranteed Smooth Scheduling in Packet Switches Isaac Keslassy (Stanford University), Murali Kodialam, T.V. Lakshman, Dimitri Stiliadis (Bell-Labs)
Belgrade University Aleksandra Smiljanić: High-Capacity Switching Switches with Input Buffers (Cisco)
Buffered Crossbars With Performance Guarantees Shang-Tse (Da) Chuang Cisco Systems EE384Y Thursday, April 27, 2006.
SNRC Meeting June 7 th, Crossbar Switch Scheduling Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University
1 A quick tutorial on IP Router design Optics and Routing Seminar October 10 th, 2000 Nick McKeown
1 How scalable is the capacity of (electronic) IP routers? Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University
Improving Matching algorithms for IQ switches Abhishek Das John J Kim.
Reduced Rate Switching in Optical Routers using Prediction Ritesh K. Madan, Yang Jiao EE384Y Course Project.
Block-Based Packet Buffer with Deterministic Packet Departures Hao Wang and Bill Lin University of California, San Diego HSPR 2010, Dallas.
Scheduling algorithms for CIOQ switches Balaji Prabhakar.
A Load Balanced Switch with an Arbitrary Number of Linecards I.Keslassy, S.T.Chuang, N.McKeown ( CSL, Stanford University ) Some slides adapted from authors.
A Load-Balanced Switch with an Arbitrary Number of Linecards Offense Anwis Das.
Input buffered switches (1)
EE384Y: Packet Switch Architectures Scaling Crossbar Switches
CS 740: Advance Computer Networks Hand-out on Router Design
Advance Computer Networking
EE 122: Lecture 7 Ion Stoica September 18, 2001.
Techniques and problems for
Presentation transcript:

A Scalable Switch for Service Guarantees Bill Lin (University of California, San Diego) Isaac Keslassy (Technion, Israel)

IEEE Hot Interconnects XIII, August 17-19, Motivation  Scalability: Traffic demands growing, driven in part by increasing broadband adoption  10x increase in broadband subscription in just last 3 years, already over 100 million subscribers  Gbps fiber to homes emerging (GPON, GEPON, EPON, BPON …)  Service Guarantees: Operators need bandwidth partitioning capabilities  Provide guaranteed rates in service-level agreements  Enable logical partitioning of converged networks  Traffic engineering in general

IEEE Hot Interconnects XIII, August 17-19, Router Wish List  Scalable in line rates and number of linecards  e.g. R = 160 Gbps (new packet every 2ns), thousands of linecards, petabit capacity  No centralized scheduler  No per-packet dynamic switch reconfigurations  Low complexity linecards  Provide performance guarantees  100% throughput guarantee  Service guarantees  No packet reordering

IEEE Hot Interconnects XIII, August 17-19, Existing Architectures  Output-Queueing (OQ) Switch  Well-known rate guarantees possible with Weighted Fair Queueing or Deficit Round-Robin scheduling But OQ switches require speedup of N  Crossbar Switches, using Input-Queueing (IQ) or Combined Input-Output Queueing (CIOQ)  OQ emulation possible But expensive centralized scheduling and per-packet dynamic switch reconfigurations  Birkhoff-von Neumann decomposition If traffic matrix known, can provide rate guarantees with distributed scheduling, but still requires per-packet dynamic switch reconfigurations

IEEE Hot Interconnects XIII, August 17-19, Existing Architectures (cont’d)  Load-Balanced Switches  Chang et al., “Load balanced Birkhoff-von Neumann switches, Part I: one-stage buffering”, Computer Communications, 2002  Keslassy et al., “Scaling Internet routers using optics”, ACM SIGCOMM 2003 A key idea: fixed configuration uniform meshes in optics, no dynamic switch reconfigurations Showed 100 Tb/s load-balanced router with R = 160 Gbps and N = 640 linecards  Showed 100% throughput for “best effort” traffic, but no service guarantees

IEEE Hot Interconnects XIII, August 17-19, This Talk  Presents the Interleaved Matching Switch (IMS)  Like a load-balanced switch, use fixed configuration uniform meshes, implemented with an optical fabric  No arbitrary per-packet switch reconfiguration  Can emulate any IQ or CIOQ switch  Can emulate a Birkhoff-von Neumann switch  If traffic matrix known, can ensure 100% throughput, service guarantees, and packet ordering  Show we can use O(1) distributed online scheduling

IEEE Hot Interconnects XIII, August 17-19, Out R R R R/N R R R Generic Load-Balanced Switch Using Fixed Configuration Uniform Meshes R/N In Linecards

IEEE Hot Interconnects XIII, August 17-19, Out R R R R/N R R R Generic Load-Balanced Switch Using Fixed Configuration Uniform Meshes R/N Linecards In

IEEE Hot Interconnects XIII, August 17-19, Out R R R R/N R R R Generic Load-Balanced Switch Using Fixed Configuration Uniform Meshes R/N Linecards In Many Fabric Options (any spreading device)  Space: Full uniform mesh  Wavelength: Static WDM  Time: Round-robin switches Just need fixed uniform rate channels at R/N No dynamic switch reconfigurations Many Fabric Options (any spreading device)  Space: Full uniform mesh  Wavelength: Static WDM  Time: Round-robin switches Just need fixed uniform rate channels at R/N No dynamic switch reconfigurations

IEEE Hot Interconnects XIII, August 17-19, Out R R R R/N R R R From Load-Balanced Switch R/N Linecards In

IEEE Hot Interconnects XIII, August 17-19, Out R R R R/N R R R To Interleaved Matching Switch R/N Linecards Move main packet buffers to INPUT Add coordination slots in MIDDLE Retain Fixed Configuration Meshes

IEEE Hot Interconnects XIII, August 17-19, How It Works  IMS works by emulating an IQ or CIOQ crossbar switch, but without per-packet dynamic switch reconfigurations (will show how centralized scheduling can be avoided later)

IEEE Hot Interconnects XIII, August 17-19, How It Works

IEEE Hot Interconnects XIII, August 17-19, How It Works R/N R R R Linecards R/N Linecards A1 A2 A1 B1 C1 C2 C1 B2 C2 A1 A2 R R R Out Interleaved Matching Switch R R R XBARLinecards Out R R R R R R Linecards A1 A2 A1 B1 C1 C2 C1 B2 C2 A1 A2 R R R Crossbar Switch

IEEE Hot Interconnects XIII, August 17-19, How It Works R/N R R R Linecards R/N Linecards A1 A2 A1 B1 C1 C2 C1 B2 C2 A1 A2 R R R Out Interleaved Matching Switch R R R XBARLinecards Out R R R R R R Linecards A1 A2 A1 B1 C1 C2 C1 B2 C2 A1 A2 R R R Crossbar Switch

IEEE Hot Interconnects XIII, August 17-19, How It Works R/N R R R Linecards R/N Linecards A1 A2 A1 B1 C1 C2 C1 B2 C2 A2 R R R Out Interleaved Matching Switch R R R XBARLinecards Out R R R R R R Linecards A1 A2 A1 B1 C1 C2 C1 B2 C2 A1 A2 R R R Crossbar Switch B1 C1 A1

IEEE Hot Interconnects XIII, August 17-19, How It Works R/N R R R Linecards R/N Linecards A1 A2 A1 B1 C1 C2 C1 B2 C2 A2 R R R Out Interleaved Matching Switch R R R XBARLinecards Out R R R R R R Linecards A1 A2 A1 B1 C1 C2 C1 B2 C2 A1 A2 R R R Crossbar Switch R R B1 C1 A1 Differences with crossbar switch  No dynamic switch reconfigurations  Departure times delayed by 2N time slots, N time slots per mesh, otherwise same sequence  Packet transfers initiated at each time slot to next MIDDLE linecard in round-robin order Differences with crossbar switch  No dynamic switch reconfigurations  Departure times delayed by 2N time slots, N time slots per mesh, otherwise same sequence  Packet transfers initiated at each time slot to next MIDDLE linecard in round-robin order

IEEE Hot Interconnects XIII, August 17-19, How It Works R/N R R R Linecards R/N Linecards A1 A2 A1 B1 C1 C2 B2 C2 A2 R R R Out Interleaved Matching Switch R R R XBARLinecards Out R R R R R R Linecards A1 A2 A1 B1 C1 C2 B2 C2 A2 R R R Crossbar Switch R R C1 C2 C1 C2

IEEE Hot Interconnects XIII, August 17-19, How It Works R/N R R R Linecards R/N Linecards A2 A1 B1 C1 C2 B2 C2 A2 R R R Out Interleaved Matching Switch R R R XBARLinecards Out R R R R R R Linecards A1 A2 A1 B1 C1 C2 B2 C2 A2 R R R Crossbar Switch R R C2 C1 C2 A1 B1 C1

IEEE Hot Interconnects XIII, August 17-19, How It Works R/N R R R Linecards R/N Linecards A2 A1 C1 C2 B2 C2 A2 R R R Out Interleaved Matching Switch R R R XBARLinecards Out R R R R R R Linecards A2 A1 C1 C2 B2 C2 A2 R R R Crossbar Switch R R C2 B1 B2 B1 B2 Crossbar MATCHINGS are INTERLEAVED across MIDDLE linecards (analogous to memory interleaving)

IEEE Hot Interconnects XIII, August 17-19, IQ and CIOQ Switch Emulation  An IMS can emulate any IQ or CIOQ switch.

IEEE Hot Interconnects XIII, August 17-19, When Traffic Matrix is Known  When traffic matrix is known, can perform Birkhoff-von Neumann decomposition offline  Given any admissible traffic matrix  Can decompose into a series of permutation matrices ( ) such that where

IEEE Hot Interconnects XIII, August 17-19, Example  Consider following example:  Use weighted fair queueing to schedule each permutation matrix proportionally to its corresponding weight

IEEE Hot Interconnects XIII, August 17-19, Distributed Storage and Scheduling  Distributed storage: each input linecard only stores its corresponding “rows”  Distributed scheduling: each input linecard only responsible for scheduling its own VOQs  O(1) time/hardware complexity: use deficit round-robin scheduling (many efficient variants)

IEEE Hot Interconnects XIII, August 17-19, Birkhoff-von Neumann Emulation  If traffic matrix known, an IMS can guarantee 100% throughput and guaranteed flow rates when combined with Birkhoff-von Neumann decomposition and online fair scheduling

IEEE Hot Interconnects XIII, August 17-19, Frame-Based Decomposition  If traffic matrix  can be converted to an integer matrix by multiplying by an integer F, then  can be decomposed into F permutations  Known decomposition algorithms (if F is integer multiple of N )  Birkhoff-von Neumann: O( N 3.5 )  Slepian-Duguid: O( N 3 )  New efficient formulation using edge-coloring  O( N 2 log N)

IEEE Hot Interconnects XIII, August 17-19, Conclusions  Scalability  IMS leverages scalability of fixed optical meshes  If traffic matrix known, distributed online scheduling can achieve O(1) time and hardware complexity  Emulation  IMS can emulate any IQ or CIOQ switch under same speedup and matching  Guarantees  If traffic matrix known, can ensure 100% throughput, service guarantees, and packet ordering via Birkhoff-von Neumann switch emulation  For integer matrices, new edge coloring formulation

Thank You