Auburn University http://www.eng.auburn.edu/~xqin COMP8330/7330/7336 Advanced Parallel and Distributed Computing Interconnection Networks (Part 2) Dr.

Slides:



Advertisements
Similar presentations
Shantanu Dutt Univ. of Illinois at Chicago
Advertisements

Super computers Parallel Processing By: Lecturer \ Aisha Dawood.
Taxanomy of parallel machines. Taxonomy of parallel machines Memory – Shared mem. – Distributed mem. Control – SIMD – MIMD.
CSCI 8150 Advanced Computer Architecture Hwang, Chapter 2 Program and Network Properties 2.4 System Interconnect Architectures.
1 Lecture 23: Interconnection Networks Topics: communication latency, centralized and decentralized switches (Appendix E)
Advanced Topics in Algorithms and Data Structures An overview of the lecture 2 Models of parallel computation Characteristics of SIMD models Design issue.
1 CSE 591-S04 (lect 14) Interconnection Networks (notes by Ken Ryu of Arizona State) l Measure –How quickly it can deliver how much of what’s needed to.
NUMA Mult. CSE 471 Aut 011 Interconnection Networks for Multiprocessors Buses have limitations for scalability: –Physical (number of devices that can be.
Interconnection Network PRAM Model is too simple Physically, PEs communicate through the network (either buses or switching networks) Cost depends on network.
Parallel Computing Platforms
Models of Parallel Computation Advanced Algorithms & Data Structures Lecture Theme 12 Prof. Dr. Th. Ottmann Summer Semester 2006.
Interconnection Network Topologies
1 Lecture 25: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E) Review session,
Interconnection Networks in Multiprocessor Systems By: Wallun Chan Course: CS 147 Text: Chapter 12, p Professor: Sin-Min Lee.
Chapter 5 Array Processors. Introduction  Major characteristics of SIMD architectures –A single processor(CP) –Synchronous array processors(PEs) –Data-parallel.
Interconnect Network Topologies
Interconnection Networks. Applications of Interconnection Nets Interconnection networks are used everywhere! ◦ Supercomputers – connecting the processors.
1 The Turn Model for Adaptive Routing. 2 Summary Introduction to Direct Networks. Deadlocks in Wormhole Routing. System Model. Partially Adaptive Routing.
Computer Science Department
Interconnect Networks
Network Topologies Topology – how nodes are connected – where there is a wire between 2 nodes. Routing – the path a message takes to get from one node.
Lecture 12: Parallel Sorting Shantanu Dutt ECE Dept. UIC.
Multiprocessor systems Objective n the multiprocessors’ organization and implementation n the shared-memory in multiprocessor n static and dynamic connection.
PPC Spring Interconnection Networks1 CSCI-4320/6360: Parallel Programming & Computing (PPC) Interconnection Networks Prof. Chris Carothers Computer.
CSE Advanced Computer Architecture Week-11 April 1, 2004 engr.smu.edu/~rewini/8383.
Dynamic Interconnect Lecture 5. COEN Multistage Network--Omega Network Motivation: simulate crossbar network but with fewer links Components: –N.
Parallel Computer Architecture and Interconnect 1b.1.
CHAPTER 12 INTRODUCTION TO PARALLEL PROCESSING CS 147 Guy Wong page
Course Wrap-Up Miodrag Bolic CEG4136. What was covered Interconnection network topologies and performance Shared-memory architectures Message passing.
1 Dynamic Interconnection Networks Miodrag Bolic.
Lecture 3 Innerconnection Networks for Parallel Computers
Anshul Kumar, CSE IITD CSL718 : Multiprocessors Interconnection Mechanisms Performance Models 20 th April, 2006.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 January Session 4.
An Overview of Parallel Computing. Hardware There are many varieties of parallel computing hardware and many different architectures The original classification.
Anshul Kumar, CSE IITD ECE729 : Advanced Computer Architecture Lecture 27, 28: Interconnection Mechanisms In Multiprocessors 29 th, 31 st March, 2010.
Birds Eye View of Interconnection Networks
1 Interconnection Networks. 2 Interconnection Networks Interconnection Network (for SIMD/MIMD) can be used for internal connections among: Processors,
Computer Science and Engineering Copyright by Hesham El-Rewini Advanced Computer Architecture.
Super computers Parallel Processing
Basic Communication Operations Carl Tropper Department of Computer Science.
Unit-8 Sorting Algorithms Prepared By:-H.M.PATEL.
COMP8330/7330/7336 Advanced Parallel and Distributed Computing Tree-Based Networks Cache Coherence Dr. Xiao Qin Auburn University
COMP8330/7330/7336 Advanced Parallel and Distributed Computing Communication Costs in Parallel Machines Dr. Xiao Qin Auburn University
1 Computer Architecture & Assembly Language Spring 2001 Dr. Richard Spillman Lecture 26 – Alternative Architectures.
INTERCONNECTION NETWORK
Network Connected Multiprocessors
Auburn University COMP8330/7330/7336 Advanced Parallel and Distributed Computing Parallel Hardware Dr. Xiao Qin Auburn.
Overview Parallel Processing Pipelining
Parallel Architecture
Interconnect Networks
CS 704 Advanced Computer Architecture
Dynamic connection system
Lecture 23: Interconnection Networks
Connection System Serve on mutual connection processors and memory .
Interconnection topologies
Refer example 2.4on page 64 ACA(Kai Hwang) And refer another ppt attached for static scheduling example.
Parallel and Multiprocessor Architectures
Multiprocessors Interconnection Networks
Lecture 14: Interconnection Networks
Indirect Networks or Dynamic Networks
Interconnection Network Design Lecture 14
Mesh-Connected Illiac Networks
Static Interconnection Networks
High Performance Computing & Bioinformatics Part 2 Dr. Imad Mahgoub
Advanced Computer and Parallel Processing
Multiprocessors Interconnection Networks
Dynamic Interconnection Networks
CS 6290 Many-core & Interconnect
Birds Eye View of Interconnection Networks
Advanced Computer and Parallel Processing
Presentation transcript:

Auburn University http://www.eng.auburn.edu/~xqin COMP8330/7330/7336 Advanced Parallel and Distributed Computing Interconnection Networks (Part 2) Dr. Xiao Qin Auburn University http://www.eng.auburn.edu/~xqin xqin@auburn.edu Lec02

Recap: Static and Dynamic Interconnection Networks Classification of interconnection networks: (a) a static network; and (b) a dynamic network.

Recap: Bus A collection of parallel communication wires together with hardware controlling the bus. Communication wires are shared by the devices Pros: easy to build, good broadcast Some of the simplest and earliest parallel machines used buses. Cons: As the number of devices connected to the bus increases, contention for use of the bus increases, and performance decreases. However, the bandwidth of the shared bus is a major bottleneck. All processors access a common bus for exchanging data. The distance between any two nodes is O(1) in a bus. The bus also provides a convenient broadcast media. Typical bus based machines are limited to dozens of nodes. Sun Enterprise servers and Intel Pentium based shared-bus multiprocessors are examples of such architectures.

Recap: Crossbars A crossbar network uses an p×m grid of switches to connect p inputs to m outputs in a non-blocking manner. A completely non-blocking crossbar network connecting p processors to b memory banks.

Multistage Interconnection Networks Network Topologies: Multistage Networks Multistage interconnection networks (MINs) are a class of high-speed computer networks usually composed of processing elements (PEs) on one end of the network and memory elements (MEs) on the other end, connected by switching elements (SEs). The switching elements themselves are usually connected to each other in stages, hence the name. Such networks include networks, omega networks, delta networks and many other types. MINs are typically used in high-performance or parallel computing as a low-latency interconnection (as opposed to traditional packet switching networks), though they could be implemented on top of a packet switching network. Though the network is typically used for routing purposes, it could also be used as a co-processor to the actual processors for such uses as sorting; cyclic shifting, as in a perfect shufflenetwork; and bitonic sorting. Looks like a pipeline One of the most commonly used multistage interconnects This network consists of log p stages, where p is the number of inputs/outputs. The schematic of a typical multistage interconnection network.

Multistage Omega Network At each stage, input i is connected to output j if: Q1: What does this expression mean? Each stage of the Omega network implements a perfect shuffle. A perfect shuffle interconnection for eight inputs and outputs. Multistage networks include networks, omega networks, delta networks and many other types.

Multistage Omega Network The perfect shuffle patterns are connected using 2×2 switches. The switches operate in two modes – crossover or passthrough. Draw a black box (2 in and 2 out) representing a switch component Two switching configurations of the 2 × 2 switch: (a) Pass-through; (b) Cross-over. Q2: How many modes does a switch component operate in?

Multistage Omega Network A complete Omega network with the perfect shuffle interconnects and switches can now be illustrated: Q3: If the number of processors is p, what is the cost? Give one minute to compute the cost A complete omega network connecting eight inputs and eight outputs. An omega network has p/2 × log p switching nodes, and the cost of such a network grows as (p log p).

Multistage Omega Network: Routing Let s be the binary representation of the source and d be that of the destination processor. The data traverses the link to the first switching node. If the most significant bits of s and d are the same, then the data is routed in pass-through mode by the switch else, it switches to crossover. This process is repeated for each of the log p switching stages. Note that this is not a non-blocking switch. Give the students two minutes to read this policy, then explain This is a reference slide. Q4: How does this routing policy work?

Multistage Omega Network Routing Walkthrough this example An example of blocking in omega network: one of the messages (010 to 111 or 110 to 100) is blocked at link AB.

Completely Connected and Star Connected Networks Example of an 8-node completely connected network. (a) A completely-connected network of eight nodes; (b) a star connected network of nine nodes.

Completely Connected Network Each processor is connected to every other processor. The number of links in the network scales as? O(p2). While the performance scales very well, the hardware complexity is not realizable for large values of p. In this sense, these networks are static counterparts of crossbars. Q5: What is a problem here?

Star Connected Network Every node is connected only to a common node at the center. Distance between any pair of nodes is? O(1). However, the central node becomes a bottleneck. In this sense, star connected networks are static counterparts of buses. O(1) = O(2) Reference: http://stackoverflow.com/questions/1997262/what-are-the-differences-between-o1-and-o2-in-algorithm-analysis There is no difference between O(1) and O(2). Algorithms classifying as O(1) are O(2) and vice versa. In fact, O(c1) is O(c2) for any positive constants c1 and c2. O(c) where c is a positive constants simply means that the runtime is bounded independent of the input or problem size. From this it is clear (informally) that O(1) and O(2) are equal. Formally, consider a function f in O(1). Then there is a constant c such that f(n) <= c * 1 for all n. Let d = c / 2. Then f(n) <= c = (c / 2) * 2 = d * 2 which shows that f is O(2). Similarly if g is O(2) there is a constant c such that g(n) <= c * 2 for all n. Let d = 2 * c. Then g(n) <= c * 2 = d = d * 1 which shows that g is O(1). Therefore O(1) = O(2). Q6: What is a problem here?

Linear Arrays, Meshes, and k-d Meshes A special case of a d-dimensional mesh is a hypercube. Here, d = log p, where p is the total number of nodes.

Network Topologies: Linear Arrays Linear arrays: (a) with no wraparound links; (b) with wraparound link. In a linear array, each node has two neighbors, one to its left and one to its right. If the nodes at either end are connected, we refer to it as a 1-D torus or a ring.

Two- and Three Dimensional Meshes Two and three dimensional meshes: (a) 2-D mesh with no wraparound; (b) 2-D mesh with wraparound link (2-D torus); and (c) a 3-D mesh with no wraparound. A generalization to 2 dimensions has nodes with 4 neighbors, to the north, south, east, and west. Q7: How many neighbors for 3D? A further generalization to d dimensions has nodes with 2d neighbors.

Hypercubes and their Construction Construction of hypercubes from hypercubes of lower dimension. Q8: How many neighbors? Number of neighbors = D -> log(p)

Properties of Hypercubes The distance between any two nodes is at most log p. (Why?) Each node has log p neighbors. The distance between two nodes is given by the number of bit positions at which the two nodes differ.

Summary Multistage Omega Network Completely Connected Star Connected Networks Linear Arrays, Meshes, and k-d Meshes Hypercubes