Networks: Routing, Deadlock, Flow Control, Switch Design, Case Studies Alvin R. Lebeck CPS 220.

Slides:



Advertisements
Similar presentations
Compsci 221 / ECE 259 Advanced Computer Architecture II (Parallel Computer Architecture) Interconnection Networks Copyright 2012 Alvin R. Lebeck Duke.
Advertisements

What is Flow Control ? Flow Control determines how a network resources, such as channel bandwidth, buffer capacity and control state are allocated to packet.
1 Lecture 12: Interconnection Networks Topics: dimension/arity, routing, deadlock, flow control.
1 Lecture 23: Interconnection Networks Topics: communication latency, centralized and decentralized switches (Appendix E)
10 - Network Layer. Network layer r transport segment from sending to receiving host r on sending side encapsulates segments into datagrams r on rcving.
NUMA Mult. CSE 471 Aut 011 Interconnection Networks for Multiprocessors Buses have limitations for scalability: –Physical (number of devices that can be.
Communication operations Efficient Parallel Algorithms COMP308.
1 Lecture 13: Interconnection Networks Topics: flow control, router pipelines, case studies.
1 Lecture 25: Interconnection Networks Topics: flow control, router microarchitecture Final exam:  Dec 4 th 9am – 10:40am  ~15-20% on pre-midterm  post-midterm:
1 Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control Final exam reminders:  Plan well – attempt every question.
Chapter 10 Introduction to Wide Area Networks Data Communications and Computer Networks: A Business User’s Approach.
1 Lecture 24: Interconnection Networks Topics: communication latency, centralized and decentralized switches (Sections 8.1 – 8.5)
Issues in System-Level Direct Networks Jason D. Bakos.
EECS 570: Fall rev1 1 Chapter 10: Scalable Interconnection Networks.
1 Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control.
1 Lecture 26: Interconnection Networks Topics: flow control, router microarchitecture.
1 Lecture 25: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E) Review session,
Computer Architecture II 1 Computer architecture II Network topologies.
IP-UDP-RTP Computer Networking (In Chap 3, 4, 7) 건국대학교 인터넷미디어공학부 임 창 훈.
Chapter 4 Queuing, Datagrams, and Addressing
Switching, routing, and flow control in interconnection networks.
Interconnection Networks. Applications of Interconnection Nets Interconnection networks are used everywhere! ◦ Supercomputers – connecting the processors.
1 Lecture 23: Interconnection Networks Topics: Router microarchitecture, topologies Final exam next Tuesday: same rules as the first midterm Next semester:
1 The Turn Model for Adaptive Routing. 2 Summary Introduction to Direct Networks. Deadlocks in Wormhole Routing. System Model. Partially Adaptive Routing.
On-Chip Networks and Testing
Winter 2006 ENGR 9861 – High Performance Computer Architecture March 2006 Interconnection Networks.
Networks-on-Chips (NoCs) Basics
Dynamic Networks CS 213, LECTURE 15 L.N. Bhuyan CS258 S99.
1 Lecture 7: Interconnection Network Part I: Basic Definitions Part II: Message Passing Multicomputers.
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
Dynamic Interconnect Lecture 5. COEN Multistage Network--Omega Network Motivation: simulate crossbar network but with fewer links Components: –N.
CPS 356: Introduction to Computer Networks Lecture 7: Switching technologies Ch 2.8.2, 3.1, 3.4 Xiaowei Yang
1 Flow Identification Assume you want to guarantee some type of quality of service (minimum bandwidth, maximum end-to-end delay) to a user Before you do.
Multiprocessor Interconnection Networks Todd C. Mowry CS 740 November 3, 2000 Topics Network design issues Network Topology.
ECE669 L21: Routing April 15, 2004 ECE 669 Parallel Computer Architecture Lecture 21 Routing.
Anshul Kumar, CSE IITD CSL718 : Multiprocessors Interconnection Mechanisms Performance Models 20 th April, 2006.
ECE 259 / CPS 221 Advanced Computer Architecture II (Parallel Computer Architecture) Interconnection Networks Copyright 2004 Daniel J. Sorin Duke University.
1 Lecture 15: Interconnection Routing Topics: deadlock, flow control.
Anshul Kumar, CSE IITD ECE729 : Advanced Computer Architecture Lecture 27, 28: Interconnection Mechanisms In Multiprocessors 29 th, 31 st March, 2010.
Interconnection Networks Alvin R. Lebeck CPS 220.
Forwarding.
NC2 (No6) 1 Maximally Adaptive Routing Maximize adaptivity for a double-x routing based on turn model. Virtual network 0 Virtual network 1 Maximally adaptive.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
Virtual-Channel Flow Control William J. Dally
1 Lecture 24: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix F)
1 Switching and Forwarding Sections Connecting More Than Two Hosts Multi-access link: Ethernet, wireless –Single physical link, shared by multiple.
1 Lecture 14: Interconnection Networks Topics: dimension vs. arity, deadlock.
1 Lecture 22: Interconnection Networks Topics: Routing, deadlock, flow control, virtual channels.
COMP8330/7330/7336 Advanced Parallel and Distributed Computing Communication Costs in Parallel Machines Dr. Xiao Qin Auburn University
Graciela Perera Department of Computer Science and Information Systems Slide 1 of 18 INTRODUCTION NETWORKING CONCEPTS AND ADMINISTRATION CSIS 3723 Graciela.
The network-on-chip protocol
Advanced Computer Networks
Lecture 23: Interconnection Networks
Physical constraints (1/2)
Azeddien M. Sllame, Amani Hasan Abdelkader
Interconnection Network Design Contd.
Introduction to Scalable Interconnection Network Design
Switching, routing, and flow control in interconnection networks
Lecture 14: Interconnection Networks
Introduction to Scalable Interconnection Networks
Copyright 2012 Alvin R. Lebeck
Interconnection Networks Contd.
Chapter 4 Network Layer Computer Networking: A Top Down Approach 5th edition. Jim Kurose, Keith Ross Addison-Wesley, April Network Layer.
Lecture: Interconnection Networks
CS 6290 Many-core & Interconnect
Networks: Routing and Design
Lecture 25: Interconnection Networks
Switching, routing, and flow control in interconnection networks
Virtual LAN (VLAN).
Presentation transcript:

Networks: Routing, Deadlock, Flow Control, Switch Design, Case Studies Alvin R. Lebeck CPS 220

2 © Alvin R. Lebeck 1999 CPS 220 Admin Homework #5 Due Dec 3 Projects… Final… (yes it will be cumulative)

3 © Alvin R. Lebeck 1999 CPS 220 Review: Terms Network characterized by Topology –physical structure of the graph Routing Algorithm –which paths through network can message flow Switching Strategy –How data in message traverses its route –Circuit Switched vs. Packet Switched Flow Control –When does a packet (or portions of it) move along its route

4 © Alvin R. Lebeck 1999 CPS 220 Review: Organization Given topology constructed by linking switches and network interfaces, must deliver packet from node A to node B Link: cable with connectors on each end –connect switches to other switches or network interfaces Switch: N inputs N outputs (degree N) Phit: Minimum # of bits physically moved across link in one cycle (Can pipeline on single wire) Flit: Minimum # of bits move across link as a single unit Packet: unit that requires routing information, some number of flits

5 © Alvin R. Lebeck 1999 CPS 220 Review: Switches At minimum, must route inputs to outputs Cross-bar Control Routing, Scheduling Receiver Input Buffer Output Buffer Transmitter Output Ports Input Ports VLSI makes it easier to create larger fully connected switches

6 © Alvin R. Lebeck 1999 CPS 220 Review: Routing Store-and-forward Cut-through Virtual cut-through Wormhole

7 © Alvin R. Lebeck 1999 CPS 220 Routing Algorithm How do I know where a packet should go? Arithmetic Source-Based Table Lookup Adaptive—route based on network state (e.g., contention)

8 © Alvin R. Lebeck 1999 CPS 220 Arithmetic Routing For regular topology, simple arithmetic to determine route 2D Mesh (Also called NEWS network) –packet header contains signed offset to destination –switch ++ or -- one field of header (x or y dimension) –when x == 0 and y == 0, then at correct processor Requires ALU in switch Must recompute CRC

9 © Alvin R. Lebeck 1999 CPS 220 Source Based and Table Lookup Routing Source Based Source specifies output port for each switch in route Very Simple Switches –no control state –strip output port off header Myrinet uses this Table Lookup Very Small Header, index into table for output port Big tables, must be kept up to date...

10 © Alvin R. Lebeck 1999 CPS Deterministic v.s. Adaptive Routing Deterministic—follows a pre- specified route –mesh: dimension-order routing »(x1, y1) -> (x2, y2) »first Dx = x2 - x1, »then Dy = y2 - y1, –hypercube: edge-cube routing »X = x0x1x2...xn -> Y = y0y1y2...yn »R = X xor Y »Traverse dimensions of differing address in order –tree: common ancestor Adaptive—route determined by contention for output port

11 © Alvin R. Lebeck 1999 CPS 220 Deadlock Key is to allow receiving even if you can’t send. What is livelock?

12 © Alvin R. Lebeck 1999 CPS 220 Deadlock Free Routing Virtual Channels –Not virtual cut-through –Add buffers so, flits of wormhole packets can be interleaved Up*-Down* –Number switches: higher = farther away from processors –route up, make one turn, route down Turn Model Routing –Restrict order of turns »West First »North Last »Negative First –Can increase number of hops –Can handle faulty nodes

13 © Alvin R. Lebeck 1999 CPS 220 A Generic Switch At minimum, must route inputs to outputs Cross-bar Control Routing, Scheduling Receiver Input Buffer Output Buffer Transmitter Output Ports Input Ports VLSI makes it easier to create larger fully connected switches

14 © Alvin R. Lebeck 1999 CPS 220 Switch Design Issues ports, pin limited data path (cross bar designs) non-blocking crossbar routing logic per input –ALU –table –Finite State Machine for cut-through

15 © Alvin R. Lebeck 1999 CPS 220 Switch Buffering need to absorb some transient peaks Centralized shared buffer pool –need high bandwidth –one output could hog all buffer space Distributed input buffers

16 © Alvin R. Lebeck 1999 CPS 220 Input Buffering buffer per port routing logic head of line (HOL) blocking –subsequent packet may be routed to unused output port

17 © Alvin R. Lebeck 1999 CPS 220 Output Buffering Buffers “logically” associated with output –split on either side of cross bar Arbitration for physical link (output scheduling) –static priority –random –round-robin –oldest-first Effects of adaptive routing? –Select output based on availability –requires feedback from output port

18 © Alvin R. Lebeck 1999 CPS 220 Stacked Dimension Switch Uses only 2x2 switch to build higher dimension switch 2x2 z in z ou t y in x in y out x out Host in Host out

19 © Alvin R. Lebeck 1999 CPS 220 Congestion Control Packet switched networks do not reserve bandwidth; this leads to contention Solution: prevent packets from entering until contention is reduced (e.g., metering lights) Options: –End-to-end Flow Control –Link-level Flow Control

20 © Alvin R. Lebeck 1999 CPS 220 Flow Control Packet discarding: If a packet arrives at a switch and there is no room in the buffer, the packet is discarded –no communication between switches, requires higher level protocol Flow control: between pairs of receivers and senders; use feedback to tell the sender when it is allowed to send the next packet

21 © Alvin R. Lebeck 1999 CPS 220 Link-Level Flow Control Data Ready Transfer single flit when receiver is ready Could have long links with many flits in flight

22 © Alvin R. Lebeck 1999 CPS 220 Credit-based (Window) Flow Control Receiver gives N credits to sender –sender decrements count –stops sending if zero –receiver sends back credit as it drains its buffer –bundle credits to reduce overhead Must account for link latency

23 © Alvin R. Lebeck 1999 CPS 220 Water Level high water, low water stop & go back to source switch (Myrinet) can send redundant stop/go Stop Go Incoming phits Outgoing phits

24 © Alvin R. Lebeck 1999 CPS 220 Case Study Cray T3D 1024 switch nodes each connected to 2 processors 3D Torus, bidirectional, 300 MB/s Link: 16 bits, 8 control bits Variable size packet (multiple of 16 bits) Logical request & response networks –2 virtual channels each for deadlock Stacked dimension routing Wormhole for large packets, virtual cut-through for small packets

25 © Alvin R. Lebeck 1999 CPS 220 IBM SP-2 (Vulcan) Switch has eight bidirectional 40 MB/s links Link: 8 data bits, 1 tag, 1 reverse flow-control Flit is 16 bits, phit is 8 input FIFO + output FIFO + central buffer byte segments

26 © Alvin R. Lebeck 1999 CPS 220 Next Time Multiprocessor Architectures