Chapter 8 Hardware Conventional Computer Hardware Architecture.

Slides:



Advertisements
Similar presentations
IP Router Architectures. Outline Basic IP Router Functionalities IP Router Architectures.
Advertisements

Layer 3 Switching. Routers vs Layer 3 Switches Both forward on the basis of IP addresses But Layer 3 switches are faster and cheaper However, Layer 3.
A Scalable and Reconfigurable Search Memory Substrate for High Throughput Packet Processing Sangyeun Cho and Rami Melhem Dept. of Computer Science University.
Khaled A. Al-Utaibi  Computers are Every Where  What is Computer Engineering?  Design Levels  Computer Engineering Fields  What.
Chapter 4 Conventional Computer Hardware Architecture
Computer Architecture and Data Manipulation Chapter 3.
EECB 473 Data Network Architecture and Electronics Lecture 3 Packet Processing Functions.
CSC457 Seminar YongKang Zhu December 6 th, 2001 About Network Processor.
© 2009 Pearson Education Inc., Upper Saddle River, NJ. All rights reserved.1 Computer Networks and Internets, 5e By Douglas E. Comer Lecture PowerPoints.
What's inside a router? We have yet to consider the switching function of a router - the actual transfer of datagrams from a router's incoming links to.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Computer Science: An Overview Tenth Edition by J. Glenn Brookshear Chapter.
Spring 2002CS 4611 Router Construction Outline Switched Fabrics IP Routers Tag Switching.
4-1 Network layer r transport segment from sending to receiving host r on sending side encapsulates segments into datagrams r on rcving side, delivers.
Multiprocessors ELEC 6200: Computer Architecture and Design Instructor : Agrawal Name: Nam.
10 - Network Layer. Network layer r transport segment from sending to receiving host r on sending side encapsulates segments into datagrams r on rcving.
Chapter 10 Switching Fabrics. Outline Physical Interconnection Physical box with backplane Individual blades plug into backplane slots Each blade contains.
Chapter 6 Packet Processing Functions
ECE 526 – Network Processing Systems Design
Chapter 9 Classification And Forwarding. Outline.
1 Chapter 4 The Central Processing Unit and Memory.
Router Architectures An overview of router architectures.
Router Architectures An overview of router architectures.
4: Network Layer4b-1 Router Architecture Overview Two key router functions: r run routing algorithms/protocol (RIP, OSPF, BGP) r switching datagrams from.
Chapter 4 Queuing, Datagrams, and Addressing
Chapter 6 High-Speed LANs Chapter 6 High-Speed LANs.
Computer System Architectures Computer System Software
Introduction to Interconnection Networks. Introduction to Interconnection network Digital systems(DS) are pervasive in modern society. Digital computers.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
1 Internetworking: Concepts, Architecture, and Protocols.
Basics and Architectures
ATM SWITCHING. SWITCHING A Switch is a network element that transfer packet from Input port to output port. A Switch is a network element that transfer.
Paper Review: XiSystem - A Reconfigurable Processor and System
A 50-Gb/s IP Router 참고논문 : Craig Partridge et al. [ IEEE/ACM ToN, June 1998 ]
Types of Parallelism Chapter 17 Justin Bellomi. Characterizations of Parallelism  Computer Architects characterize the type and amount of parallelism.
Computers Are Your Future Eleventh Edition Chapter 2: Inside the System Unit Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall1.
Computers organization & Assembly Language Chapter 0 INTRODUCTION TO COMPUTING Basic Concepts.
Router Architecture Overview
Data and Computer Communications Circuit Switching and Packet Switching.
SJSU SPRING 2011 PARALLEL COMPUTING Parallel Computing CS 147: Computer Architecture Instructor: Professor Sin-Min Lee Spring 2011 By: Alice Cotti.
The variety Of Processors And Computational Engines CS – 355 Chapter- 4 `
CCNA 3 Week 4 Switching Concepts. Copyright © 2005 University of Bolton Introduction Lan design has moved away from using shared media, hubs and repeaters.
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 ECSE-6600: Internet Protocols Informal Quiz #14 Shivkumar Kalyanaraman: GOOGLE: “Shiv RPI”
Chapter 2 Data Manipulation © 2007 Pearson Addison-Wesley. All rights reserved.
Chapter 2 Data Manipulation. © 2005 Pearson Addison-Wesley. All rights reserved 2-2 Chapter 2: Data Manipulation 2.1 Computer Architecture 2.2 Machine.
CS 4396 Computer Networks Lab Router Architectures.
Data Management for Decision Support Session-4 Prof. Bharat Bhasker.
Forwarding.
21-Dec-154/598N: Computer Networks Cell Switching (ATM) Connection-oriented packet-switched network Used in both WAN and LAN settings Signaling (connection.
EECB 473 Data Network Architecture and Electronics Lecture 1 Conventional Computer Hardware Architecture
Chapter 2 Data Manipulation © 2007 Pearson Addison-Wesley. All rights reserved.
Chapter 2: Data Manipulation
1 ECE 526 – Network Processing Systems Design System Implementation Principles I Varghese Chapter 3.
Spring 2000CS 4611 Router Construction Outline Switched Fabrics IP Routers Extensible (Active) Routers.
Network Layer4-1 Chapter 4 Network Layer All material copyright J.F Kurose and K.W. Ross, All Rights Reserved Computer Networking: A Top Down.
Chapter 11 System Performance Enhancement. Basic Operation of a Computer l Program is loaded into memory l Instruction is fetched from memory l Operands.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
COMPUTER NETWORKS CS610 Lecture-22 Hammad Khalid Khan.
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
Graciela Perera Department of Computer Science and Information Systems Slide 1 of 18 INTRODUCTION NETWORKING CONCEPTS AND ADMINISTRATION CSIS 3723 Graciela.
The Concept of Universal Service
COMPUTER GRAPHICS CHAPTER 38 CS 482 – Fall 2017 GRAPHICS HARDWARE
Chapter 4: Network Layer
What’s “Inside” a Router?
Packet Switch Architectures
Router Construction Outline Switched Fabrics IP Routers
Chapter 4 Network Layer Computer Networking: A Top Down Approach 5th edition. Jim Kurose, Keith Ross Addison-Wesley, April Network Layer.
Chapter 4: Network Layer
ECE 671 – Lecture 8 Network Adapters.
Packet Switch Architectures
Presentation transcript:

Chapter 8 Hardware Conventional Computer Hardware Architecture

Outline The Traditional Software Router Measures Of Speed Fine-grain parallelism Symmetric coarse-grain parallelism Asymmetric coarse-grain parallelism Special-purpose coprocessors NICs with onboard processing Smart NICs with onboard stacks Cell switching Data pipelines

The Traditional Software Router The hardware architecture used with a software- based network system The CPU handles all protocol processing tasks except for framing and onboard address recognition framing & address recognition framing & address recognition all other processing Standard CPUNIC 2 NIC 1

Two Measures Of Speed Data rate (bits per second) – Per interface rate – Aggregate rate Packet rate (packets per second) – Per interface rate – Aggregate rate

Processing Speed For Two Reasons A router must be able to handle packets as they arrive from a given network, the processing speed determines the maximum data rate of a network that can be attached to the router A router must be able to handle packets arriving from multiple networks, the processing speed limits the possible topologies with which the router can be used

Aggregate Data Rate Total rate at which data can arrive or leave a network system The maximum aggregate data rate of a system is important because it limits the type and numbers of networks connections the system can handle

Aggregate Packet Rate For protocol processing tasks that have a fixed cost per packet, the number of packets processed is more important than the aggregate data rate How many packets arrive per second over a network – Depends on the network’s throughput rate and the size of the packets

Digital Circuit Speeds Technology Network Packet Rate Packet Rate Data Rate For small Packets For large Packets In Gbps In Kpps In Kpps 10Base-T Base-T OC OC , Base-T , OC , OC , OC , ,278.4 Key concept: maximum packet rate occurs with minimum-size packets

Bar Chart Of Example Packet Rates Gray areas show rates for large packets

Packet Rate And Software Router Feasibility The exact rate depends on the CPU speeds, bus bandwidth, and memory latency as well as the amount of processing The amount of processing required depends on the packet content Software running on a general-purpose processor is an insufficient architecture to handle high-speed networks because the aggregate packet rate exceeds the capabilities of current CPUs

Maximum per-packet processing time in microseconds of small and large packets for various technologies Technology Time per Packet Time per Packet For small Packets For large Packets (In μs) (In μs) 10Base-T , Base-T OC OC Base-T OC OC OC

Possible Ways To Solve The CPU Bottleneck Fine-grain parallelism Symmetric coarse-grain parallelism Asymmetric coarse-grain parallelism Special-purpose coprocessors NICs with onboard processing Smart NICs with onboard stacks Cell switching Data pipelines

Fine-Grain Parallelism (Instruction- Level Parallelism) Multiple CPU to work together Instruction-level parallelism does not achieve significantly higher performance – Few packet processing functions are amenable to fine-grain optimization – A program must spend time setting up the parallel instructions – Only improves CPU performance – Expensive

Symmetric Coarse-Grain Parallelism Offer a set of N identical CPUs Advantages – Network system designers did not need to invent new symmetric multiprocessor hardware – Vender had ported a conventional Unix operating system to their multiprocessor hardware  Familiar

Processing Capability Processing capability does not scale linearly as the number of processors increases – Most multiprocessor systems use a shared memory paradigm where all processors share a kernel address space – Packet processing software must coordinate access to data structure such as packet queues – A multiprocessor architecture does not automatically increase the I/O bandwidth

Asymmetric Coarse-Grain Parallelism Uses multiple, heterogeneous processor that can operate simultaneously The Advantage arises from the ability to specialize – Each processor in an Asymmetric system can be optimized for a specific task Drawbacks – Need general-purpose instructions – Difficult to program – May not perform well for a specific task or a specific protocol – Expensive to design and build

Special-Purpose Coprocessors Coprocessors : an architecture that contains a general-purpose CPU plus one or more special- purpose processor Each coprocessor is designed to perform a specific function  all coprocessors function under of the CPU The chief advantage lies in the freedom it gives a designer It can also be a small logic circuit that performs one operation  does not need general-purpose instructions, and does not need a fetch-execute cycle

Special-Purpose Coprocessors (con’t) A coprocessor is a piece of hardware that operates under control of the CPU A processor need not be sophisticated; the coprocessor only need to perform on specific task To optimize computation, move operations that account for the most CPU time from software into hardware

ASIC Coprocessor Implementation Application Specific Integrated Circuit (ASIC) refers to an integrated circuit (IC) that has been customer-designed for a specific need The availability of ASIC technology is especially pertinent to coprocessors Designers attempt to make the coprocessor general enough to work with many protocol

NICs With Onboard Processing Many protocol processing tasks are I/O bound An obvious optimization consists of moving processing onto NIC  IP checksum, packet encryption or compression The chief advantage of onboard processing lies in reduce CPU load  a NIC only needs to handle packets from a single interface What components are used to create smart NICs ? – ASIC hardware : incorporate special-purpose chips in to a NIC – Embedded RISC hardware : contains an onboard RAM and an onboard ROM

An optimized system with smart NIC Standard CPUSmart NIC 2 Smart NIC 1 Most layer 2 processing some layer 3 processing all other processing Most layer 2 processing some layer 3 processing NIC handles layers 2 and 3 CPU only handles exceptions

Smart NICs With Onboard Stacks A RISC processor makes it possible to add more protocol processing functionality to a NIC Constrains arise that limit the scalability of a system that uses smart NICs in a conventional computer  the data path between NICs becomes a bottleneck In a traditional computer system, the data path includes the bus to which the NIC attaches and memory

Existing protocols Redesign protocols – Allow sender to choose a size up to the maximum – Make hardware design more difficult – Are not well-suited to applications like voice that require bounded latency Variable-size packets – Each address is globally known – Arises from forwarding overhead

Cells And Connection-Oriented Addressing Requires new protocol, new packet formats, and a connection-oriented paradigm Fixed-size packets – Allows fixed-size buffers – Guaranteed time to transmit/receive Relative (connection-oriented) addressing – Smaller address size – Label on packet changes at each switch – Requires connection setup Example: ATM

Data Pipelines Move each packet through series of processors Each processor handles some tasks Assessment – Well-suited to many protocol processing tasks – Individual processor can be fast Advantage – Much less complex and run faster – All stages can operate at the same time

5-stage data pipeline Lookup the des. Add.Computing the outgoing checksum fragmentationEncapsulation the datagramDecode the datagram

QUESTION ?