Jon Turner Extreme Networking Achieving Nonstop Network Operation Under Extreme Operating Conditions DARPA.

Slides:



Advertisements
Similar presentations
IP Router Architectures. Outline Basic IP Router Functionalities IP Router Architectures.
Advertisements

NetFPGA Project: 4-Port Layer 2/3 Switch Ankur Singla Gene Juknevicius
A Scalable and Reconfigurable Search Memory Substrate for High Throughput Packet Processing Sangyeun Cho and Rami Melhem Dept. of Computer Science University.
OpenFlow overview Joint Techs Baton Rouge. Classic Ethernet Originally a true broadcast medium Each end-system network interface card (NIC) received every.
Jon Turner Extreme Networking Achieving Nonstop Network Operation Under Extreme Operating Conditions.
1 ELEN 602 Lecture 18 Packet switches Traffic Management.
Router Architecture : Building high-performance routers Ian Pratt
Jon Turner Extreme Networking Achieving Nonstop Network Operation Under Extreme Operating Conditions Fred.
Traffic Management - OpenFlow Switch on the NetFPGA platform Chun-Jen Chung( ) SriramGopinath( )
What's inside a router? We have yet to consider the switching function of a router - the actual transfer of datagrams from a router's incoming links to.
Spring 2002CS 4611 Router Construction Outline Switched Fabrics IP Routers Tag Switching.
4-1 Network layer r transport segment from sending to receiving host r on sending side encapsulates segments into datagrams r on rcving side, delivers.
Chapter 4 Network Layer slides are modified from J. Kurose & K. Ross CPE 400 / 600 Computer Communication Networks Lecture 14.
10 - Network Layer. Network layer r transport segment from sending to receiving host r on sending side encapsulates segments into datagrams r on rcving.
FF-1 9/30/2003 UTD Practical Priority Contention Resolution for Slotted Optical Burst Switching Networks Farid Farahmand The University of Texas at Dallas.
ECE 526 – Network Processing Systems Design Packet Processing II: algorithms and data structures Chapter 5: D. E. Comer.
Router Architectures An overview of router architectures.
Chapter 4 Queuing, Datagrams, and Addressing
Computer Networks Switching Professor Hui Zhang
Paper Review Building a Robust Software-based Router Using Network Processors.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
CS/CoE 536 : Lockwood 1 CS/CoE 536 Reconfigurable System On Chip Design Lecture 9 : MP3 Working Draft Washington University Fall 2002
Jon Turner (and a cast of thousands) Washington University Design of a High Performance Active Router Active Nets PI Meeting - 12/01.
A 50-Gb/s IP Router 참고논문 : Craig Partridge et al. [ IEEE/ACM ToN, June 1998 ]
Applied research laboratory David E. Taylor Users Guide: Fast IP Lookup (FIPL) in the FPX Gigabit Kits Workshop 1/2002.
Control Processor Switch Fabric ATM Switch Core Port Processors FPX SPC LC IPPOPP FPX SPC LC IPPOPP FPX SPC LC IPPOPP FPX SPC LC IPPOPP FPX SPC LC IPPOPP.
Router Architecture Overview
Washington WASHINGTON UNIVERSITY IN ST LOUIS Packet Routing Within MSR Fred Kuhns
1 The Internet and Networked Multimedia. 2 Layering  Internet protocols are designed to work in layers, with each layer building on the facilities provided.
Univ. of TehranAdv. topics in Computer Network1 Advanced topics in Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
EECB 473 DATA NETWORK ARCHITECTURE AND ELECTRONICS PREPARED BY JEHANA ERMY JAMALUDDIN Basic Packet Processing: Algorithms and Data Structures.
4/19/20021 TCPSplitter: A Reconfigurable Hardware Based TCP Flow Monitor David V. Schuehler.
Lecture 13: Reconfigurable Computing Applications October 10, 2013 ECE 636 Reconfigurable Computing Lecture 11 Reconfigurable Computing Applications.
Hot Interconnects TCP-Splitter: A Reconfigurable Hardware Based TCP/IP Flow Monitor David V. Schuehler
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
Field Programmable Port Extender (FPX) 1 Modular Design Techniques for the FPX.
Jon Turner Extreme Networking Achieving Nonstop Network Operation Under Extreme Operating Conditions DARPA.
Jon Turner Resilient Cell Resequencing in Terabit Routers.
Washington WASHINGTON UNIVERSITY IN ST LOUIS 1 DTI Visit - John DeHart- 4/25/2001 Agenda l WU/ARL Background – John DeHart (15 minutes) l DTI Background.
Forwarding.
High-Speed Policy-Based Packet Forwarding Using Efficient Multi-dimensional Range Matching Lakshman and Stiliadis ACM SIGCOMM 98.
OpenFlow MPLS and the Open Source Label Switched Router Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan,
Lecture Note on Switch Architectures. Function of Switch.
1 A quick tutorial on IP Router design Optics and Routing Seminar October 10 th, 2000 Nick McKeown
Washington WASHINGTON UNIVERSITY IN ST LOUIS SPC II Architecture.
Washington WASHINGTON UNIVERSITY IN ST LOUIS MSR Tasks for Fall 2001 Fred Kuhns, John DeHart and Ken Wong.
Packet Switch Architectures The following are (sometimes modified and rearranged slides) from an ACM Sigcomm 99 Tutorial by Nick McKeown and Balaji Prabhakar,
Spring 2000CS 4611 Router Construction Outline Switched Fabrics IP Routers Extensible (Active) Routers.
CS/CoE 536 : Lockwood 1 CS/CoE 536 Reconfigurable System On Chip Design Lecture 11 : Priority and Per-Flow Queuing in Machine Problem 3 (Revision 2) Washington.
Network Layer4-1 Chapter 4 Network Layer All material copyright J.F Kurose and K.W. Ross, All Rights Reserved Computer Networking: A Top Down.
CS/CoE 536 : Lockwood 1 CS/CoE 536 Reconfigurable System On Chip Design Lecture 10 : MP3 Working Draft Washington University Fall 2002
Field Programmable Port Extender (FPX) 1 Remote Management of the Field Programmable Port Extender (FPX) Todd Sproull Washington University, Applied Research.
Graciela Perera Department of Computer Science and Information Systems Slide 1 of 18 INTRODUCTION NETWORKING CONCEPTS AND ADMINISTRATION CSIS 3723 Graciela.
Washington University
Using the Open Network Lab
CprE / ComS 583 Reconfigurable Computing
An NP-Based Router for the Open Network Lab
Using the Open Network Lab
Demonstration of a High Performance Active Router DARPA Demo - 9/24/99
An NP-Based Router for the Open Network Lab Overview by JST
Next steps for SPP & ONL 2/6/2007
Network Core and QoS.
Remote Management of the Field Programmable Port Extender (FPX)
Implementing an OpenFlow Switch on the NetFPGA platform
Fred Kuhns and Alex Chandra Applied Research Laboratory
Chapter 4 Network Layer Computer Networking: A Top Down Approach 5th edition. Jim Kurose, Keith Ross Addison-Wesley, April Network Layer.
Network Layer: Control/data plane, addressing, routers
Multicast Support in the NSP (the “low-level” details)
Packet Switch Architectures
Network Core and QoS.
Presentation transcript:

Jon Turner Extreme Networking Achieving Nonstop Network Operation Under Extreme Operating Conditions DARPA PI Meeting, January 27-29, 2003

2 - Jonathan Turner – January 27-29, 2003 Project Overview Motivation »data networks have become mission-critical resource »networks often subject to extreme traffic conditions »need to design networks for worst-case conditions »technology advances making extreme defenses practical Extreme network services »Lightweight Flow Setup (LFS) »Network Access Service (NAS) »Reserved Tree Service (RTS) Key router technology components »Super-Scalable Packet Scheduling (SPS) »Dynamic Queues with Auto-aggregation (DQA) »Scalable Distributed Queueing (SDQ)

3 - Jonathan Turner – January 27-29, 2003 Switch Fabric IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card Control Processor Prototype Extreme Router

4 - Jonathan Turner – January 27-29, 2003 Switch Fabric IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card Control Processor Prototype Extreme Router

5 - Jonathan Turner – January 27-29, 2003 Switch Fabric IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card Control Processor Prototype Extreme Router Field Programmable Port Ext. Network Interface Device Reprogrammable Application Device SDRAM 128 MB SRAM 4 MB Field Programmable Port Extenders ATM Switch Core

6 - Jonathan Turner – January 27-29, 2003 Switch Fabric IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card Control Processor Prototype Extreme Router Embedded Processors Smart Port Card 2 Flash Disk 128 MB Pentium Cache North Bridge FPGA APIC

7 - Jonathan Turner – January 27-29, 2003 Switch Fabric IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card IPPOPP FPX SPC Line Card Control Processor Prototype Extreme Router Gigabit Ethernet GBIC Framer FPGA

8 - Jonathan Turner – January 27-29, 2003 Performance of SPC-2 Largest gain at small packet sizes. PCI bus limits performance for large packets

9 - Jonathan Turner – January 27-29, 2003 More SPC-2 Performance Throughput loss at high loads due to PCI bus contention and input priority.

10 - Jonathan Turner – January 27-29, 2003 Field Programmable Port Extender (FPX) n Network Interface Device (NID) routes cells to/from RAD. n Reprogrammable Application Device (RAD) functions: »will implement core router functions in extensible router »may also implement arbitrary packet processing functions n Functions for extreme router. »high speed packet storage manager »packet classification & route lookup –fast route lookup –exact match filters –32 general filters »flexible queue manager –per-flow queues for reserved flows –route packets to/from SPC Network Interface Device Reprogrammable App. Device (400 Kg+80 KB) SDRAM (64 MB) SRAM (1 MB) 2 Gb/s interface   6.4 Gb/s 100 MHz SRAM (1 MB) SDRAM (64 MB) MHz

11 - Jonathan Turner – January 27-29, 2003 Logical Port Architecture Input Side Processing special flow queues... virtual output queues... PCU plugins SPC FPX reassembly contexts DQ Packet Classification & Route Lookup Output Side Processing special flow queues... output queues... PCU plugins SPC FPX reassembly contexts RC Packet Classification

12 - Jonathan Turner – January 27-29, 2003 FPX Packet Processor Block Diagram Control Data Path Control Cell Processor DQ Status & Rate Control Route & Filter Updates Register Set Updates & Status SRAM Register Set Queue Manager ISAR SDRAM from SW from LC to SW to LC OSAR Packet Storage Manager (includes free space list) Header Pointer Discard Pointer Header Proc. Classification and Route Lookup

13 - Jonathan Turner – January 27-29, 2003 Classification and Route Lookup (CARL) Three lookup engines. »route lookup for routing datagrams - best prefix »flow filters for multicast & reserved flows - exact »general filters (32) for management - exhaustive Input processing. »parallel check of all three »return highest priority exclusive and highest priority non-exclusive »general filters have unique priority »all flow filters share single priority »ditto for routes Route Lookup General Filters Flow Filters Input Demux Result Proc. & Priority Resolution bypass headers Output processing. »no route lookup on output Route lookup & flow filters share off-chip SRAM General filters processed on-chip

14 - Jonathan Turner – January 27-29, 2003 on-chip SRAM 11tag+data tag+data -- tag+data-- off-chip SRAM Exact Match Lookup Exact match lookup table used for reserved flows. »includes LFS, signaled QOS flows and multicast »and, flows requiring processing by SPCs »each of these flows has separate queue in QM »multicast flows have two queues (recycling multicast) »implemented using hashing srcdst packet 56 simple hash ingress valid egress valid tag =[src,dst,sport, dport,proto] data includes 2 outputs+2 QIDs LFS rates packet,byte counters flags separate memory areas for ingress and egress packets

15 - Jonathan Turner – January 27-29, 2003 General Filter Match General filter match considers full 5-tuple »prefix match on source and destination addresses »range match on source and destination ports »exact or wildcard match on protocol »each filter has a priority and may be exclusive or non- exclusive Intended primarily for management filters. »firewall filters »class-based monitoring »class-based special processing Implemented using parallel exhaustive search. »limit of 32 filters matcher filter memory matcher

16 - Jonathan Turner – January 27-29, 2003 Fast IP Lookup (Eatherton & Dittia) Multibit trie with clever data encoding. »small memory requirements (<7 bytes per prefix) »small memory bandwidth, simple lookup yields fast lookup rates »updates have negligible impact on lookup performance Avoid impact of external memory latency on throughput by interleaving several concurrent lookups. »8 lookup engine config. uses about 6% of Virtex 2000E logic cells address: , *010,001, * 1, internal bit vector external bit vector

17 - Jonathan Turner – January 27-29, 2003 SRAM Bandwidth – 450 MB/s Lookup Throughput linear throughput gain Split tree cuts storage by 30%

18 - Jonathan Turner – January 27-29, 2003 Update Performance reasonable update rates have little impact 1 update per  s

19 - Jonathan Turner – January 27-29, 2003 Queue Manager Logical View (QM) arriving packets... SPC pkt. sched. to SPC from SPC... res. flow queues VOQ pkt. sched. datagram queue to output 1 to output 8      to output 0 DQ to switch datagram queues... link pkt. sched.... to link res. flow queues 64 hashed datagram queues for traffic isolation separate queues for each reserved flow separate queue set for each output. separate queue for each SPC flow

20 - Jonathan Turner – January 27-29, 2003 Backlogged TCP Flows with Tail Discard with large buffers get large delay variance with small buffers get underflow and low throughput

21 - Jonathan Turner – January 27-29, 2003 DRR with Discard from Longest Queue Smaller fluctuations, but still significant.

22 - Jonathan Turner – January 27-29, 2003 Queue State DRR Add hysteresis to packet discard policy »discard from same queue until shortest non-empty queue. low variation, even with small queues, low delay, no tuning

23 - Jonathan Turner – January 27-29, 2003 Packet Scheduling with Approx. Radix Sorting To implement virtual time schedulers, need to quickly find the queue whose “lead packet” has the smallest virtual finish time. »using priority queue, this requires O (log n) time for n queues Use approximate radix sorting, with compensation – O (1). »timing wheels with increasing granularity and range »approximate sorting produces inter-packet timing errors »observe errors & compensate when next packet scheduled Fast-forward bits used to skip to empty slots. Scheduler puts no limit on number of queues. Two copies of data structure needed for approx. version of WF2Q+. wheel 1wheel 2wheel 3 output list fast forward bits

24 - Jonathan Turner – January 27-29, 2003 Resource Usage Estimates Key resources in Xilinx FPGAs »flip flops - 38,400 »lookup tables (LUTs) - 38,400 each can implement any 4 input Boolean function »block RAMs (4 Kbits each) - 160

25 - Jonathan Turner – January 27-29, 2003 FPGA Performance Characteristics

26 - Jonathan Turner – January 27-29, 2003 Summary Version 1 Hardware status. »hardware operating in lab, passing packets »but, still have some bugs to correct »one day for typical test-diagnose-correction cycle »version 1 has simplified queue manager Planning several system demos in next month. »system level throughput testing – focus on lookup proc. »verifying basic fair queueing behavior »TCP SYN attack suppressor SPC-resident plugin monitors new TCP connections going to server when too many “half-open” connections, oldest are reset flow filters inserted for stable connections, enabling hw forwarding Expect to complete version 2 hardware in next six months.