Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 High Speed Router Design Shivkumar Kalyanaraman Rensselaer Polytechnic Institute

Slides:



Advertisements
Similar presentations
Router Internals CS 4251: Computer Networking II Nick Feamster Spring 2008.
Advertisements

Router Internals CS 4251: Computer Networking II Nick Feamster Fall 2008.
Based on slides from Dave Andersen and Nick Feamster
Router Architecture : Building high-performance routers Ian Pratt
Chen-Nien Tsai 2008/10/16 1 Wireless and Broadband Network Laboratory (WBNLAB) Dept. of CSIE, NTUT.
Lecture 11 Switching & Forwarding EECS 122 University of California Berkeley.
May 28th, 2002Nick McKeown 1 Scaling routers: Where do we go from here? HPSR, Kobe, Japan May 28 th, 2002 Nick McKeown Professor of Electrical Engineering.
MEMS and its Applications Optical Routing, an example Shashi Mysore Computer Science UCSB.
High-Speed Router Design. Content Classes of Routers Components of a Router High-Speed Router Lookup Advances in Switching Fabrics Speeding Up Output.
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 High Speed Router Design Shivkumar Kalyanaraman Rensselaer Polytechnic Institute
1 Circuit Switching in the Core OpenArch April 5 th 2003 Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University
10 - Network Layer. Network layer r transport segment from sending to receiving host r on sending side encapsulates segments into datagrams r on rcving.
1 Architectural Results in the Optical Router Project Da Chuang, Isaac Keslassy, Nick McKeown High Performance Networking Group
1 OR Project Group II: Packet Buffer Proposal Da Chuang, Isaac Keslassy, Sundar Iyer, Greg Watson, Nick McKeown, Mark Horowitz
1 Internet Routers Stochastics Network Seminar February 22 nd 2002 Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University.
EE 122: Router Design Kevin Lai September 25, 2002.
IEE, October 2001Nick McKeown1 High Performance Routers Slides originally by Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford.
048866: Packet Switch Architectures Dr. Isaac Keslassy Electrical Engineering, Technion Introduction.
Nick McKeown 1 Memory for High Performance Internet Routers Micron February 12 th 2003 Nick McKeown Professor of Electrical Engineering and Computer Science,
1 Trend in the design and analysis of Internet Routers University of Pennsylvania March 17 th 2003 Nick McKeown Professor of Electrical Engineering and.
Katz, Stoica F04 EECS 122: Introduction to Computer Networks Switch and Router Architectures Computer Science Division Department of Electrical Engineering.
Analysis of a Memory Architecture for Fast Packet Buffers Sundar Iyer, Ramana Rao Kompella & Nick McKeown (sundaes,ramana, Departments.
August 20 th, A 2.5Tb/s LCS Switch Core Nick McKeown Costas Calamvokis Shang-tse Chuang Accelerating The Broadband Revolution P M C - S I E R R.
1 Growth in Router Capacity IPAM, Lake Arrowhead October 2003 Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University.
Router Architectures An overview of router architectures.
Can Google Route? Building a High-Speed Switch from Commodity Hardware Guido Appenzeller, Matthew Holliman Q2/2002.
Router Architectures An overview of router architectures.
Router Design (Nick Feamster) February 11, Today’s Lecture The design of big, fast routers Partridge et al., A 50 Gb/s IP Router Design constraints.
4: Network Layer4b-1 Router Architecture Overview Two key router functions: r run routing algorithms/protocol (RIP, OSPF, BGP) r switching datagrams from.
Chapter 4 Queuing, Datagrams, and Addressing
1 IP routers with memory that runs slower than the line rate Nick McKeown Assistant Professor of Electrical Engineering and Computer Science, Stanford.
Computer Networks Switching Professor Hui Zhang
Professor Yashar Ganjali Department of Computer Science University of Toronto
Optics in Internet Routers Mark Horowitz, Nick McKeown, Olav Solgaard, David Miller Stanford University
Advance Computer Networking L-8 Routers Acknowledgments: Lecture slides are from the graduate level Computer Networks course thought by Srinivasan Seshan.
Department of Computer and IT Engineering University of Kurdistan Computer Networks II Router Architecture By: Dr. Alireza Abdollahpouri.
1 Processing packets in packet switches CS343 May 7 th 2003 Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University.
Designing Packet Buffers for Internet Routers Friday, October 23, 2015 Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford.
Winter 2006EE384x1 EE384x: Packet Switch Architectures I Parallel Packet Buffers Nick McKeown Professor of Electrical Engineering and Computer Science,
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 ECSE-6600: Internet Protocols Informal Quiz #14 Shivkumar Kalyanaraman: GOOGLE: “Shiv RPI”
ISLIP Switch Scheduler Ali Mohammad Zareh Bidoki April 2002.
Nick McKeown1 Building Fast Packet Buffers From Slow Memory CIS Roundtable May 2002 Nick McKeown Professor of Electrical Engineering and Computer Science,
1 Performance Guarantees for Internet Routers ISL Affiliates Meeting April 4 th 2002 Nick McKeown Professor of Electrical Engineering and Computer Science,
1 Router Design Bruce Davie with help from Hari Balakrishnan & Nick McKeown.
An Introduction to Packet Switching Nick McKeown Assistant Professor of Electrical Engineering and Computer Science, Stanford University
CS 4396 Computer Networks Lab Router Architectures.
Forwarding.
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 Challenges in Modern Multi-Tera- bit Class Switch Design.
Winter 2006EE384x Handout 11 EE384x: Packet Switch Architectures Handout 1: Logistics and Introduction Professor Balaji Prabhakar
Opticomm 2001Nick McKeown1 Do Optics Belong in Internet Core Routers? Keynote, Opticomm 2001 Denver, Colorado Nick McKeown Professor of Electrical Engineering.
IEE, October 2001Nick McKeown1 High Performance Routers IEE, London October 18 th, 2001 Nick McKeown Professor of Electrical Engineering and Computer Science,
Lecture Note on Switch Architectures. Function of Switch.
1 A quick tutorial on IP Router design Optics and Routing Seminar October 10 th, 2000 Nick McKeown
1 How scalable is the capacity of (electronic) IP routers? Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University
Packet Switch Architectures The following are (sometimes modified and rearranged slides) from an ACM Sigcomm 99 Tutorial by Nick McKeown and Balaji Prabhakar,
The Fork-Join Router Nick McKeown Assistant Professor of Electrical Engineering and Computer Science, Stanford University
Network Layer4-1 Chapter 4 Network Layer All material copyright J.F Kurose and K.W. Ross, All Rights Reserved Computer Networking: A Top Down.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Univ. of TehranComputer Network1 Computer Networks Computer Networks (Graduate level) University of Tehran Dept. of EE and Computer Engineering By: Dr.
Techniques for Fast Packet Buffers Sundar Iyer, Nick McKeown Departments of Electrical Engineering & Computer Science, Stanford.
Network layer (addendum) Slides adapted from material by Nick McKeown and Kevin Lai.
1 Building big router from lots of little routers Nick McKeown Assistant Professor of Electrical Engineering and Computer Science, Stanford University.
Weren’t routers supposed
Addressing: Router Design
Chapter 4: Network Layer
Lecture 11 Switching & Forwarding
EE 122: Lecture 7 Ion Stoica September 18, 2001.
Project proposal: Questions to answer
Techniques and problems for
Chapter 4: Network Layer
Presentation transcript:

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 High Speed Router Design Shivkumar Kalyanaraman Rensselaer Polytechnic Institute Also based on slides of S. Keshav (Ensim), Douglas Comer (Purdue), Minkenberg (IBM Zurich), Sonia Fahmy (Purdue) Raj Yavatkar (Intel), Cyriel Minkenberg (IBM Zurich), Sonia Fahmy (Purdue) Many slides thanks to Nick McKeown (Stanford),

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 2 q Introduction q Evolution of High-Speed Routers q High Speed Router Components: q Lookup Algorithm q Switching q Classification, Scheduling q Multi-Tbps Routers: Challenges & Trends Overview

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 3 What do switches/routers look like? Access routers e.g. ISDN, ADSL Core router e.g. OC48c POS Core ATM switch

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 4 Dimensions, Power Consumption Cisco GSR 12416Juniper M160 6ft 19 ” 2ft Capacity: 160Gb/s Power: 4.2kW 3ft 2.5ft 19 ” Capacity: 80Gb/s Power: 2.6kW

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 5 Where high performance packet switches are used Enterprise WAN access & Enterprise Campus Switch - Carrier Class Core Router - ATM Switch - Frame Relay Switch The Internet Core Edge Router

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 6 Where are routers? Ans: Points of Presence (POPs) A B C POP1 POP3 POP2 POP4 D E F POP5 POP6 POP7 POP8

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 7 POP with smaller routersPOP with large routers q Interfaces: Price >$200k, Power > 400W q Space, power, interface cost economics! q About 50-60% of i/fs are used for interconnection within the POP. q Industry trend is towards large, single router per POP. Why the Need for Big/Fast/Large Routers?

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 8 Job of router architect q For a given set of features:

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 9 Performance metrics 1. Capacity  “maximize C, s.t. volume < 2m 3 and power < 5kW” 2. Throughput q Maximize usage of expensive long-haul links. q Trivial with work-conserving output-queued routers 3. Controllable Delay q Some users would like predictable delay. q This is feasible with output-queueing plus weighted fair queuing (WFQ). WFQ

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 10 DWDM Link speed x2/8 months Router capacity x2.2/18 months Moore’s law x2/18 m DRAM access rate x1.1/18 m Internet x2/yr

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 11 Alt: Memory Bandwidth Commercial DRAM q Memory speed is not keeping up with Moore’s Law. DRAM 1.1x / 18months Moore’s Law 2x / 18 months Router Capacity 2.2x / 18months Line Capacity 2x / 7 months

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 12 An Example: Packet buffers 40Gb/s router linecard Buffer Memory Write Rate, R One 40B packet every 8ns Read Rate, R One 40B packet every 8ns 10Gbits Buffer Manager  Use SRAM? + Fast enough random access time, but - Too low density to store 10Gbits of data.  Use DRAM? + High density means we can store data, but - Can’t meet random access time.

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 13 Eg: Problems w/ Output Queuing q Output queued switches are impractical R R R R DRAM NR data R R R R output 1 N Can’t I just use N separate memory devices per output?

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 14 Packet processing is getting harder CPU Instructions per minimum length packet since 1996

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 15 Basic Ideas: Part I

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 16 First-Generation IP Routers  Most Ethernet switches and cheap packet routers  Bottleneck can be CPU, host-adaptor or I/O bus  What is costly? Bus ? Memory? Interface? CPU? Shared Backplane Line Interface CPU Memory CPU Buffer Memory Line Interface DMA MAC Line Interface DMA MAC Line Interface DMA MAC

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 17 First Generation Routers Shared Backplane Line Interface CPU Memory Route Table CPU Buffer Memory Line Interface MAC Line Interface MAC Line Interface MAC Fixed length “DMA” blocks or cells. Reassembled on egress linecard Fixed length cells or variable length packets Typically <0.5Gb/s aggregate capacity

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 18 Output 2 Output N First Generation Routers Queueing Structure: Shared Memory Large, single dynamically allocated memory buffer: N writes per “cell” time N reads per “cell” time. Limited by memory bandwidth. Input 1 Output 1 Input N Input 2 Numerous work has proven and made possible: q Fairness q Delay Guarantees q Delay Variation Control q Loss Guarantees q Statistical Guarantees

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 19 Second-Generation IP Routers CPU Buffer Memory Line Card DMA MAC LocalBufferMemory Line Card DMA MAC LocalBufferMemory Line Card DMA MAC LocalBufferMemory  Port mapping intelligence in line cards  Higher hit rate in local lookup cache  What is costly? Bus ? Memory? Interface? CPU?

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 20 Second Generation Routers Route Table CPU Line Card Buffer Memory Line Card MAC Buffer Memory Line Card MAC Buffer Memory Fwding Cache Fwding Cache Fwding Cache MAC Slow Path Drop Policy Drop Policy Or Backpressure Output Link Scheduling Buffer Memory Typically <5Gb/s aggregate capacity

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 21 Route Table CPU Second Generation Routers As caching became ineffective Line Card Buffer Memory Line Card MAC Buffer Memory Line Card MAC Buffer Memory Fwding Table Fwding Table Fwding Table MAC Exception Processor

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 22 Second Generation Routers Queuing Structure: Combined Input and Output Queuing (CIOQ) Bus 1 write per “cell” time1 read per “cell” time Rate of writes/reads determined by bus speed

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 23 Third-Generation Switches/Routers Line Card MAC Local Buffer Memory CPU Card Line Card MAC Local Buffer Memory Switched Backplane Line Interface CPU Memory  Third generation switch provides parallel paths (fabric)  What’s costly? Bus? Memory, CPU?

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 24 Third Generation Routers Line Card MAC Local Buffer Memory CPU Card Line Card MAC Local Buffer Memory Switched Backplane Line Interface CPU Memory Fwding Table Routing Table Fwding Table Typically <50Gb/s aggregate capacity

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 25 Arbiter Third Generation Routers Queueing Structure Switch 1 write per “cell” time1 read per “cell” time Rate of writes/reads determined by switch fabric speedup

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 26 Arbiter Third Generation Routers Queueing Structure: VOQs Switch 1 write per “cell” time1 read per “cell” time Rate of writes/reads determined by switch fabric speedup Per-flow/class or per- output queues (VOQs) Per-flow/class or per- input queues Flow-control backpressure

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 27 Third Generation Routers: limits 19” or 23” 7’ Size-constrained: 19” or 23” wide. Power-constrained: ~<8kW. Supply: 100A/200A maximum at 48V

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 28 Fourth Generation: Clustering/Multi-stage Switch Core Linecards Optical links 100’s of feet

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 29 Key: Physically Separating Switch Core and Linecards q Distributes power over multiple racks. q Allows all buffering to be placed on the linecard: q Reduces power. q Places complex scheduling, buffer mgmt, drop policy etc. on linecard.

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 30 Fourth Generation Routers/Switches Switch Core Linecards Optical links 100’s of feet The LCS Protocol

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 31 Linecard LCSLCS LCSLCS 1: Req Physical Separation 3: Data Switch Scheduler Switch Scheduler 2: Grant/credit Seq num Switch Fabric Switch Fabric Switch Port Req Grant 1 RTT Per Queue Counters

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 32 Physical Separation Aligning Cells Switch Scheduler Switch Scheduler Switch Fabric Switch Fabric LCSLCS LCSLCS LCSLCS Switch Core Linecard

Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 33 Fourth Generation Routers/Switches Queueing Structure 1 write per “cell” time 1 read per “cell” time Rate of writes/reads determined by switch fabric speedup Lookup & Drop Policy Output Scheduling Virtual Output Queues Output Scheduling Output Scheduling Switch Fabric Switch Arbitration Linecard Switch Core (Bufferless) Lookup & Drop Policy Lookup & Drop Policy Typically <5Tb/s aggregate capacity