Jon Turner (and a cast of thousands) Washington University Design of a High Performance Active Router Active Nets PI Meeting - 12/01.

Slides:



Advertisements
Similar presentations
IP Router Architectures. Outline Basic IP Router Functionalities IP Router Architectures.
Advertisements

NetFPGA Project: 4-Port Layer 2/3 Switch Ankur Singla Gene Juknevicius
Jon Turner Extreme Networking Achieving Nonstop Network Operation Under Extreme Operating Conditions.
Ken Wong Jon Turner and Prashanth Pappu Washington University Distributed Queueing Gigabit Kits (June 2002)
Router Architecture : Building high-performance routers Ian Pratt
Jon Turner Extreme Networking Achieving Nonstop Network Operation Under Extreme Operating Conditions Fred.
Spring 2002CS 4611 Router Construction Outline Switched Fabrics IP Routers Tag Switching.
4-1 Network layer r transport segment from sending to receiving host r on sending side encapsulates segments into datagrams r on rcving side, delivers.
10 - Network Layer. Network layer r transport segment from sending to receiving host r on sending side encapsulates segments into datagrams r on rcving.
EE 122: Router Design Kevin Lai September 25, 2002.
CS 268: Lecture 12 (Router Design) Ion Stoica March 18, 2002.
ECE 526 – Network Processing Systems Design
Localized Asynchronous Packet Scheduling for Buffered Crossbar Switches Deng Pan and Yuanyuan Yang State University of New York Stony Brook.
Computer Networks Switching Professor Hui Zhang
Washington WASHINGTON UNIVERSITY IN ST LOUIS Design of the MultiService Router (MSR): A Platform for Networking Research Fred Kuhns.
High Speed Digital Design Project SpaceWire Router Student: Asaf Bercovich Instructor: Mony Orbach Semester: Winter 2009/ Semester Project Date:
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
Washington WASHINGTON UNIVERSITY IN ST LOUIS How to Implement the WaveVideo Plugin in an MSR Router.
A 50-Gb/s IP Router 참고논문 : Craig Partridge et al. [ IEEE/ACM ToN, June 1998 ]
Applied research laboratory David E. Taylor Users Guide: Fast IP Lookup (FIPL) in the FPX Gigabit Kits Workshop 1/2002.
Control Processor Switch Fabric ATM Switch Core Port Processors FPX SPC LC IPPOPP FPX SPC LC IPPOPP FPX SPC LC IPPOPP FPX SPC LC IPPOPP FPX SPC LC IPPOPP.
Washington WASHINGTON UNIVERSITY IN ST LOUIS Packet Routing Within MSR Fred Kuhns
Salim Hariri HPDC Laboratory Enhanced General Switch Management Protocol Salim Hariri Department of Electrical and Computer.
Engineering Jon Turner Computer Science & Engineering Washington University Coarse-Grained Scheduling for Multistage Interconnects.
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 ECSE-6600: Internet Protocols Informal Quiz #14 Shivkumar Kalyanaraman: GOOGLE: “Shiv RPI”
Routing Prefix Caching in Network Processor Design Huan Liu Department of Electrical Engineering Stanford University
January 10, Kits Workshop 1 Washington WASHINGTON UNIVERSITY IN ST LOUIS A Smart Port Card Tutorial --- Software John DeHart Washington University.
4/19/20021 TCPSplitter: A Reconfigurable Hardware Based TCP Flow Monitor David V. Schuehler.
Lecture 13: Reconfigurable Computing Applications October 10, 2013 ECE 636 Reconfigurable Computing Lecture 11 Reconfigurable Computing Applications.
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
Field Programmable Port Extender (FPX) 1 Modular Design Techniques for the FPX.
Jon Turner Extreme Networking Achieving Nonstop Network Operation Under Extreme Operating Conditions DARPA.
Washington WASHINGTON UNIVERSITY IN ST LOUIS 1 DTI Visit - John DeHart- 4/25/2001 Agenda l WU/ARL Background – John DeHart (15 minutes) l DTI Background.
21-Dec-154/598N: Computer Networks Cell Switching (ATM) Connection-oriented packet-switched network Used in both WAN and LAN settings Signaling (connection.
Field Programmable Port Extender (FPX) 1 NCHARGE: Remote Management of the Field Programmable Port Extender (FPX) Todd Sproull Washington University, Applied.
Univ. of TehranComputer Network1 Advanced topics in Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr. Nasser Yazdani.
Lecture Note on Switch Architectures. Function of Switch.
Washington WASHINGTON UNIVERSITY IN ST LOUIS SPC II Architecture.
Spring 2000CS 4611 Router Construction Outline Switched Fabrics IP Routers Extensible (Active) Routers.
Field Programmable Port Extender (FPX) 1 Remote Management of the Field Programmable Port Extender (FPX) Todd Sproull Washington University, Applied Research.
Jon Turner Extreme Networking Achieving Nonstop Network Operation Under Extreme Operating Conditions DARPA.
Network layer (addendum) Slides adapted from material by Nick McKeown and Kevin Lai.
Graciela Perera Department of Computer Science and Information Systems Slide 1 of 18 INTRODUCTION NETWORKING CONCEPTS AND ADMINISTRATION CSIS 3723 Graciela.
CS 268: Lecture 10 Router Design and Packet Lookup
CS 268: Router Design Ion Stoica February 27, 2003.
Addressing: Router Design
Reference Router on NetFPGA 1G
Chapter 4: Network Layer
CprE / ComS 583 Reconfigurable Computing
An NP-Based Router for the Open Network Lab
Using the Open Network Lab
A Proposed Architecture for the GENI Backbone Platform
Demonstration of a High Performance Active Router DARPA Demo - 9/24/99
An NP-Based Router for the Open Network Lab Overview by JST
Next steps for SPP & ONL 2/6/2007
Network Core and QoS.
Bridges and Extended LANs
Packet Switch Architectures
Remote Management of the Field Programmable Port Extender (FPX)
Layered Protocol Wrappers Design and Interface review
Router Construction Outline Switched Fabrics IP Routers
EE 122: Lecture 7 Ion Stoica September 18, 2001.
Chapter 4 Network Layer Computer Networking: A Top Down Approach 5th edition. Jim Kurose, Keith Ross Addison-Wesley, April Network Layer.
Network Layer: Control/data plane, addressing, routers
Project proposal: Questions to answer
Reference Router on NetFPGA 1G
Lecture 12 Input/Output (programmer view)
Chapter 4: Network Layer
Packet Switch Architectures
Network Core and QoS.
Presentation transcript:

Jon Turner (and a cast of thousands) Washington University Design of a High Performance Active Router Active Nets PI Meeting - 12/01

2 - Jonathan Turner - December 5, 2001 Switch Fabric IPPOPP SPC TI IPPOPP SPC TI IPPOPP SPC TI IPPOPP SPC TI IPPOPP SPC TI IPPOPP SPC TI Control Processor Washington University Active Router Smart Port Card Sys. FPGA 64 MB Pentium Cache North Bridge APIC ATM Switch Core Transmisson InterfacesEmbedded Processors Control Processor global coordination & control routing protocols build routing tables and otherinformation needed by SPCs active plugin code server

3 - Jonathan Turner - December 5, 2001 SPC Software Architecture Gen. Filters Flow & Route Lookup... virtual output queues... Plugin Control plugins Input Side Processing Distributed Queueing Gen. Filters Flow Lookup output queues... Plugin Control plugins Rate Control... reassembly queues Output Side Processing

4 - Jonathan Turner - December 5, 2001 SPC Throughput - Packets Per Second

5 - Jonathan Turner - December 5, 2001 Comparison with SPC 2

6 - Jonathan Turner - December 5, 2001 SPC Throughput - Mb/s

7 - Jonathan Turner - December 5, 2001 SPC Throughput vs. Packet Length

8 - Jonathan Turner - December 5, 2001 Distributed Queueing Switch Fabric TI IOIOIO IO IO IO Control Processor Routing Sched. Routing Sched. Routing Sched. Routing Sched. Routing Sched. Routing Sched. queue per output periodic queue length reports Scheduler paces each queue according to backlog share

9 - Jonathan Turner - December 5, 2001 Distributed Queueing Algorithm Goal: avoid switch congestion and output queue underflow. Let hi(i,j) be input i’s share of input-side backlog to output j. »can avoid switch congestion by sending from input i to output j at rate  L  S  hi(i,j) »where L is external link rate and S is switch speedup Let lo(i,j) be input i’s share of total backlog for output j. »can avoid underflow of queue at output j by sending from input i to output j at rate  L  lo(i,j) »this works if L  (lo(i,1)+···+lo(i,n))  L  S for all i Let wt(i,j) be the ratio of lo(i,j) to lo(i,1)+···+lo(i,n). Let rate(i,j)=L  S  lo(wt(i,j),hi(i,j)). Note: algorithm avoids congestion and for avoids underflow for large enough S. »what is the smallest value of S for which underflow cannot occur?

10 - Jonathan Turner - December 5, 2001 Stress Test

11 - Jonathan Turner - December 5, 2001 Stress Test Simulation - Min Rates

12 - Jonathan Turner - December 5, 2001 Stress Test Simulation - Actual Rates

13 - Jonathan Turner - December 5, 2001 Stress Test Simulation - Backlog

14 - Jonathan Turner - December 5, 2001 Stress Test Measurement Results

15 - Jonathan Turner - December 5, 2001 Switch Fabric IPPOPP FPX SPC TI IPPOPP FPX SPC TI IPPOPP FPX SPC TI IPPOPP FPX SPC TI IPPOPP FPX SPC TI IPPOPP FPX SPC TI Control Processor Reconfigurable Hardware Extension Field Programmable Port Extenders Field Programmable Port Ext. Network Interface Device Reprogrammable Application Device SDRAM 128 MB SRAM 4 MB

16 - Jonathan Turner - December 5, 2001 Switch Fabric IPPOPP FPX SPC TI IPPOPP FPX SPC TI IPPOPP FPX SPC TI IPPOPP FPX SPC TI IPPOPP FPX SPC TI IPPOPP FPX SPC TI Control Processor Active Packet Processing Smart Port Card Sys. FPGA MB Pentium Cache North Bridge APIC

17 - Jonathan Turner - December 5, 2001 Logical Port Architecture Gen. Filters Flow Lookup active flow queues return queues... output queues... PCU plugins SPC FPX Output Side Processing Gen. Filters Flow & Route Lookup active flow queues return queues... virtual output queues... PCU plugins SPC FPX Input Side Processing

18 - Jonathan Turner - December 5, 2001 Fast IP Lookup (Eatherton & Dittia) Multibit trie with clever data encoding. »small memory requirements (4-6 bytes per prefix typical) »small memory bandwidth, simple lookup yields fast lookup rates »updates have negligible impact on lookup performance Avoid impact of external memory latency on throughput by interleaving several concurrent lookups. »8 lookup engine config. uses about 10% of Virtex 1000E logic cells address: , *010,001, * 1, internal bit vector external bit vector

19 - Jonathan Turner - December 5, 2001 Lookup Throughput & Latency linear throughput gain negligible latency increase

20 - Jonathan Turner - December 5, 2001 Update Performance reasonable update rates have little impact 1 update every 10  s

21 - Jonathan Turner - December 5, 2001 Performance of Combined Traffic

22 - Jonathan Turner - December 5, 2001 Summmary and Status Latest version of SPC software nearly complete. »additional testing of distributed queueing »testing of new output queueing subsystem - QSDRR »porting active applications to new plugin environment SPC2 almost ready for production. »finalizing details of PC board schematic and layout »overload performance testing on development system Completion of FPX design & integration with SPC. »low level debugging of FPX interface circuit »distributed queueing implementation in FPX »FIPL extension for flow classification »enhance active flow, output queueing subsystems