ECE 526 – Network Processing Systems Design

Slides:



Advertisements
Similar presentations
IP Router Architectures. Outline Basic IP Router Functionalities IP Router Architectures.
Advertisements

Layer 3 Switching. Routers vs Layer 3 Switches Both forward on the basis of IP addresses But Layer 3 switches are faster and cheaper However, Layer 3.
CS 4700 / CS 5700 Network Fundamentals Lecture 7: Bridging (From Hub to Switch by Way of Tree) Revised 1/14/13.
Chapter 4 Conventional Computer Hardware Architecture
1 ELEN 602 Lecture 18 Packet switches Traffic Management.
CSC457 Seminar YongKang Zhu December 6 th, 2001 About Network Processor.
Chapter 8 Hardware Conventional Computer Hardware Architecture.
Router Architecture : Building high-performance routers Ian Pratt
ECE 526 – Network Processing Systems Design Software-based Protocol Processing Chapter 7: D. E. Comer.
What's inside a router? We have yet to consider the switching function of a router - the actual transfer of datagrams from a router's incoming links to.
Spring 2002CS 4611 Router Construction Outline Switched Fabrics IP Routers Tag Switching.
1 COMP 206: Computer Architecture and Implementation Montek Singh Mon, Dec 5, 2005 Topic: Intro to Multiprocessors and Thread-Level Parallelism.
Processor Technology and Architecture
Computational Astrophysics: Methodology 1.Identify astrophysical problem 2.Write down corresponding equations 3.Identify numerical algorithm 4.Find a computer.
Chapter 10 Switching Fabrics. Outline Physical Interconnection Physical box with backplane Individual blades plug into backplane slots Each blade contains.
1 Multi - Core fast Communication for SoPC Multi - Core fast Communication for SoPC Technion – Israel Institute of Technology Department of Electrical.
1 K. Salah Module 4.0: Network Components Repeater Hub NIC Bridges Switches Routers VLANs.
ECE 526 – Network Processing Systems Design IXP XScale and Microengines Chapter 18 & 19: D. E. Comer.
Chapter 9 Classification And Forwarding. Outline.
Connecting Devices and Multi-Homed Machines. Layer 1 (Physical) Devices Repeater: Extends distances by repeating a signal Extends distances by repeating.
1 Chapter 4 The Central Processing Unit and Memory.
Chapter 17 Microprocessor Fundamentals William Kleitz Digital Electronics with VHDL, Quartus® II Version Copyright ©2006 by Pearson Education, Inc. Upper.
Router Architectures An overview of router architectures.
Router Architectures An overview of router architectures.
4: Network Layer4b-1 Router Architecture Overview Two key router functions: r run routing algorithms/protocol (RIP, OSPF, BGP) r switching datagrams from.
Chapter 4 Queuing, Datagrams, and Addressing
1 Instant replay  The semester was split into roughly four parts. —The 1st quarter covered instruction set architectures—the connection between software.
Computer Architecture and Organization
Paper Review Building a Robust Software-based Router Using Network Processors.
LECTURE 9 CT1303 LAN. LAN DEVICES Network: Nodes: Service units: PC Interface processing Modules: it doesn’t generate data, but just it process it and.
Information and Communication Technology Fundamentals Credits Hours: 2+1 Instructor: Ayesha Bint Saleem.
Introduction to Interconnection Networks. Introduction to Interconnection network Digital systems(DS) are pervasive in modern society. Digital computers.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
E0001 Computers in Engineering1 The System Unit & Memory.
The Computer Systems By : Prabir Nandi Computer Instructor KV Lumding.
A 50-Gb/s IP Router 참고논문 : Craig Partridge et al. [ IEEE/ACM ToN, June 1998 ]
TO p. 1 Spring 2006 EE 5304/EETS 7304 Internet Protocols Tom Oh Dept of Electrical Engineering Lecture 9 Routers, switches.
Types of Parallelism Chapter 17 Justin Bellomi. Characterizations of Parallelism  Computer Architects characterize the type and amount of parallelism.
Computer Architecture Lecture10: Input/output devices Piotr Bilski.
Router Architecture Overview
ECE 526 – Network Processing Systems Design Networking: protocols and packet format Chapter 3: D. E. Comer Fall 2008.
Department of Computer and IT Engineering University of Kurdistan Computer Networks II Router Architecture By: Dr. Alireza Abdollahpouri.
Local-Area-Network (LAN) Architecture Department of Computer Science Southern Illinois University Edwardsville Fall, 2013 Dr. Hiroshi Fujinoki
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 ECSE-6600: Internet Protocols Informal Quiz #14 Shivkumar Kalyanaraman: GOOGLE: “Shiv RPI”
Chapter 2 Data Manipulation © 2007 Pearson Addison-Wesley. All rights reserved.
ECE 526 – Network Processing Systems Design Computer Architecture: traditional network processing systems implementation Chapter 4: D. E. Comer.
Chapter 2 Data Manipulation. © 2005 Pearson Addison-Wesley. All rights reserved 2-2 Chapter 2: Data Manipulation 2.1 Computer Architecture 2.2 Machine.
CS 4396 Computer Networks Lab Router Architectures.
ECE 526 – Network Processing Systems Design Network Processor Introduction Chapter 11,12: D. E. Comer.
Computer Hardware A computer is made of internal components Central Processor Unit Internal External and external components.
21-Dec-154/598N: Computer Networks Cell Switching (ATM) Connection-oriented packet-switched network Used in both WAN and LAN settings Signaling (connection.
EECB 473 Data Network Architecture and Electronics Lecture 1 Conventional Computer Hardware Architecture
Rehab AlFallaj.  Network:  Nodes: Service units: PC Interface processing Modules: it doesn’t generate data, but just it process it and do specific task.
Chapter 2 Data Manipulation © 2007 Pearson Addison-Wesley. All rights reserved.
1 Packet Network Simulator-on-Chip Henry Wong Danyao Wang University of Toronto Connections 2009 ECE Graduate Symposium.
Chapter 2: Data Manipulation
1 ECE 526 – Network Processing Systems Design System Implementation Principles I Varghese Chapter 3.
Spring 2000CS 4611 Router Construction Outline Switched Fabrics IP Routers Extensible (Active) Routers.
A Low-Area Interconnect Architecture for Chip Multiprocessors Zhiyi Yu and Bevan Baas VLSI Computation Lab ECE Department, UC Davis.
WAN Technologies. 2 Large Spans and Wide Area Networks MAN networks: Have not been commercially successful.
Hardware Architecture
Chapter 4: Network Layer
CS 31006: Computer Networks – The Routers
Bridges and Extended LANs
Packet Switch Architectures
Router Construction Outline Switched Fabrics IP Routers
Chapter 4: Network Layer
Packet Switch Architectures
Presentation transcript:

ECE 526 – Network Processing Systems Design Hardware Architecture for Protocol Processing Chapter 8: D. E. Comer

Goal Understand hardware architecture of protocol processing Learn the key metric of protocol processing system: aggregated packet rate Learn the key requirements of protocol processing system High throughput Scalability Survey mechanisms to design scalable protocol processing systems Ning Weng ECE 526

Outline First generation of network processing system architecture Figure of metric of network processing system Possible ways to improve performance of network processing systems Ning Weng ECE 526

1st Generation Network System Traditional software-based router Using conventional hardware Single general-purpose processor handles most tasks Single shared memory I/O over a shared bus NIC use same design as other I/O devices Cheap but performance is poor! Ning Weng ECE 526

Figure of Metric of Network Systems Interface data rate Rate at which data enters /leaves Aggregate data rate Sum of interface rates Measure of total data rate system can handle Note: aggregate rate crucial if CPU handles traffic from all interfaces Could be misleading if packet size varying and processing cost constant Aggregate packet rate Sum of the number of packets enters / leaves system More important for protocol processing (no touch payload) Why? Packet rate vs. data rate CPU metric: per-packet rate Interface hardware metric: per-bit (data) rate Small packet is critical for constant data rate and constant processing cost per packet Header processing: forwarding Payload processing: encrypting or string matching Ning Weng ECE 526

Data Rate vs. Packet Rate Packet size: small 64 byte; large 1518 byte For protocol processing, with same data rate, which is more difficult for network processing system? Smallest packet or Biggest packet How to calculate the packet rate? Ning Weng ECE 526

Aggregate Packet Rate Ning Weng ECE 526

Time per Packet Aggregate packet rate determines time per packet Each packet processing requires in the order of 100s to 1000s instruction per packet Ning Weng ECE 526

Feasibility Analysis Design a software router data rate 10Gbps Assuming small packets (64B) Assuming each packet need 10,000 instruction to process Can Intel 80986@2009 do the job? CPU:24Ghz 1 billion transistors Address bus bit: 64 CPU is a RISC machine which can execute an instruction per clock cycle Hint: What is the packet rate? What is the processing requirement in MIPS? Single CPU router lacks scalability. How multi-core? Ning Weng ECE 526

Scalability The capability of a system that can be easily extended in “size” and performance E.g., CPU with more memory slots and disk slots; router can add more ports or faster links Why we care scalability? Design a new system is timing consuming and expensive Performance requirement increase fast Others How can we make a network system more scalable? Optimized processing engines Intelligent NICs Parallel processing by duplicating processing engines + NICs Ning Weng ECE 526

Processing Power Overcoming processing bottlenecks: Other improvements Specialized hardware (ASICs) Fine-grained parallelism Symmetric coarse-grain parallelism Asymmetric coarse-grain parallelism Special-purpose coprocessors Other improvements NICs with onboard processing Smart NICs => basically same as per-port processing engines Ning Weng ECE 526

Parallelism in Processors Fine-grained parallelism Exploits instruction-level parallelism Examples: VLIW, SMT, etc. Limited due to workload Symmetric coarse-grain parallelism Multiple parallel identical CPUs Inter-processor communication can limit performance Asymmetric coarse-grain parallelism Multiple parallel different CPUs E.g., one processor for layer 2, one for layer 3 Special-purpose coprocessors Custom logic for lookups, checksums, etc. High-performance but not (fully) programmable Key question: how such a system can be programmed Duplicate processing engines --- Advance router architecture Ning Weng ECE 526

Advanced Router Architecture S.Keshav etc. IEEE Communication 1998 Port: point of attachment for a physical link Switching Fabric (SF): interconnect input & output ports Line Card: the device connecting between link and SF Routing Processor: create forwarding tables using routing protocol Queues: buffers between input port and SF or SF and output port Ning Weng ECE 526

Advanced Router Architecture Changing requirements Increasing link speed Increasing number of ports Increasing routing tables Increasing processing complexity Scalable system design: Exploit parallelism wherever possible Per-port, per-flow, per-packet, instruction-level One Processing engine per port (instead of single CPU) Multiple processors per port “Better” processors Ning Weng ECE 526

Reminder Read Comer: chapter 11 & 12 Sep. 19: project group leader email me your group members for the project Sep. 24: homework 1 due Ning Weng ECE 526