1 Lecture 1: Parallel Architecture Intro Course organization:  ~18 parallel architecture lectures (based on text)  ~10 (recent) paper presentations 

Slides:



Advertisements
Similar presentations
Multiple Processor Systems
Advertisements

© 2009 Fakultas Teknologi Informasi Universitas Budi Luhur Jl. Ciledug Raya Petukangan Utara Jakarta Selatan Website:
Distributed Systems CS
Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
Multiple Processor Systems
CSCI 8150 Advanced Computer Architecture Hwang, Chapter 1 Parallel Computer Models 1.2 Multiprocessors and Multicomputers.
Multiprocessors CSE 4711 Multiprocessors - Flynn’s Taxonomy (1966) Single Instruction stream, Single Data stream (SISD) –Conventional uniprocessor –Although.
Introduction to MIMD architectures
Background Computer System Architectures Computer System Software.
1 Introduction to MIMD Architectures Sima, Fountain and Kacsuk Chapter 15 CSE462.
Multiple Processor Systems Chapter Multiprocessors 8.2 Multicomputers 8.3 Distributed systems.
Multiprocessors ELEC 6200: Computer Architecture and Design Instructor : Agrawal Name: Nam.
11/14/05ELEC Fall Multi-processor SoCs Yijing Chen.
1 Lecture 1: Parallel Architecture Intro Course organization:  ~5 lectures based on Culler-Singh textbook  ~5 lectures based on Larus-Rajwar textbook.
1 Lecture 1: Introduction Course organization:  4 lectures on cache coherence and consistency  2 lectures on transactional memory  2 lectures on interconnection.
Multiprocessors CSE 471 Aut 011 Multiprocessors - Flynn’s Taxonomy (1966) Single Instruction stream, Single Data stream (SISD) –Conventional uniprocessor.
1 Lecture 23: Multiprocessors Today’s topics:  RAID  Multiprocessor taxonomy  Snooping-based cache coherence protocol.
CPE 731 Advanced Computer Architecture Multiprocessor Introduction
1 Lecture 1: Introduction Course organization:  13 lectures on parallel architectures  ~5 lectures on cache coherence, consistency  ~3 lectures on TM.
Lecture 37: Chapter 7: Multiprocessors Today’s topic –Introduction to multiprocessors –Parallelism in software –Memory organization –Cache coherence 1.
Computer System Architectures Computer System Software
1 Lecture 20: Parallel and Distributed Systems n Classification of parallel/distributed architectures n SMPs n Distributed systems n Clusters.
Multiple Processor Systems. Multiprocessor Systems Continuous need for faster and powerful computers –shared memory model ( access nsec) –message passing.
Fall 2000M.B. Ibáñez Lecture 01 Introduction What is an Operating System? The Evolution of Operating Systems Course Outline.
August 15, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 12: Multiprocessors: Non-Uniform Memory Access * Jeremy R. Johnson.
Parallel Computer Architecture and Interconnect 1b.1.
CSIE30300 Computer Architecture Unit 15: Multiprocessors Hsin-Chou Chi [Adapted from material by and
Multiple Processor Systems Chapter Multiprocessors 8.2 Multicomputers 8.3 Distributed systems.
ECE200 – Computer Organization Chapter 9 – Multiprocessors.
Distributed Shared Memory Based on Reference paper: Distributed Shared Memory, Concepts and Systems.
Spring 2003CSE P5481 Issues in Multiprocessors Which programming model for interprocessor communication shared memory regular loads & stores message passing.
Copyright © 2011 Curt Hill MIMD Multiple Instructions Multiple Data.
MODERN OPERATING SYSTEMS Third Edition ANDREW S. TANENBAUM Chapter 8 Multiple Processor Systems Tanenbaum, Modern Operating Systems 3 e, (c) 2008 Prentice-Hall,
PARALLEL PROCESSOR- TAXONOMY. CH18 Parallel Processing {Multi-processor, Multi-computer} Multiple Processor Organizations Symmetric Multiprocessors Cache.
Outline Why this subject? What is High Performance Computing?
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 2.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 1.
3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-2.
Multiprocessor So far, we have spoken at length microprocessors. We will now study the multiprocessor, how they work, what are the specific problems that.
August 13, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 11: Multiprocessors: Uniform Memory Access * Jeremy R. Johnson Monday,
Multiprocessor  Use large number of processor design for workstation or PC market  Has an efficient medium for communication among the processor memory.
1 Lecture 17: Multiprocessors Topics: multiprocessor intro and taxonomy, symmetric shared-memory multiprocessors (Sections )
CDA-5155 Computer Architecture Principles Fall 2000 Multiprocessor Architectures.
Background Computer System Architectures Computer System Software.
Introduction Goal: connecting multiple computers to get higher performance – Multiprocessors – Scalability, availability, power efficiency Job-level (process-level)
The University of Adelaide, School of Computer Science
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
INTRODUCTION TO HIGH PERFORMANCE COMPUTING AND TERMINOLOGY.
Feeding Parallel Machines – Any Silver Bullets? Novica Nosović ETF Sarajevo 8th Workshop “Software Engineering Education and Reverse Engineering” Durres,
Group Members Hamza Zahid (131391) Fahad Nadeem khan Abdual Hannan AIR UNIVERSITY MULTAN CAMPUS.
Multiprocessor System Distributed System
CHAPTER SEVEN PARALLEL PROCESSING © Prepared By: Razif Razali.
CMSC 611: Advanced Computer Architecture
CS5102 High Performance Computer Systems Thread-Level Parallelism
CS 147 – Parallel Processing
CMSC 611: Advanced Computer Architecture
Shared Memory Multiprocessors
Lecture 1: Parallel Architecture Intro
Chapter 17 Parallel Processing
Multiprocessors - Flynn’s taxonomy (1966)
Multiple Processor Systems
Multiple Processor Systems
Introduction to Multiprocessors
Parallel Processing Architectures
Distributed Systems CS
Multiple Processor and Distributed Systems
High Performance Computing
Chapter 4 Multiprocessors
Lecture 24: Virtual Memory, Multiprocessors
Lecture 23: Virtual Memory, Multiprocessors
Presentation transcript:

1 Lecture 1: Parallel Architecture Intro Course organization:  ~18 parallel architecture lectures (based on text)  ~10 (recent) paper presentations  2 lectures on parallel algorithms  student project presentations Textbook: Parallel Computer Architecture, Culler, Singh Projects: simulation-based, creative, be prepared to spend time towards end of semester Grading: 50% project, 10% paper presentation, 10% class participation, 30% take-home final

2 Parallel Architecture History A parallel computer is a collection of processing elements that communicate and cooperate to solve large problems fast Up to 80’s: intended for large applications, costing millions of dollars 90’s: every organization owns a “cluster” or an “SMP”, costing an order of magnitude less 21 st century: most desktops/workstations will have multiple cores and simultaneous threads on a single die – perhaps, motivated by designers’ inability to scale single core performance

3 Parallel Architecture Trends Source: Mark Hill, Ravi Rajwar

4 CMP/SMT Papers CMP/SMT/Multiprocessor papers in recent conferences:  ISCA:  HPCA:

5 Bottomline Expect an increase in multiprocessor papers Performance stagnates unless we learn to transform traditional applications into parallel threads It’s all about the data! Data management: distribution, coherence, consistency It’s also about the programming model: onus on application writer / compiler / hardware

6 Communication Architecture At first, the hardware for shared memory and message passing were different – today, a single parallel machine can efficiently support different programming models CAD Database Scientific modeling Multiprogramming Shared Message address passing Compilation or library OS support Communication hardware Parallel applications Programming models User/System boundary HW/SW boundary

7 Shared Memory Architectures Key differentiating feature: the address space is shared, i.e., any processor can directly address any memory location and access them with load/store instructions Cooperation is similar to a bulletin board – a processor writes to a location and that location is visible to reads by other threads

8 Shared Address Space Shared Private Process P1 Process P2 Process P3 Shared Pvt P1 Pvt P2 Pvt P3 Virtual address space of each process Physical address space

9 Symmetric Multiprocessors (SMP) A collection of processors, a collection of memory: both are connected through some interconnect (usually, the fastest possible) Symmetric because latency for any processor to access any memory is constant – uniform memory access (UMA) Proc 1Proc 2Proc 3Proc 4 Mem 1Mem 2Mem 3Mem 4

10 Distributed Memory Multiprocessors Each processor has local memory that is accessible through a fast interconnect The different nodes are connected as I/O devices with (potentially) slower interconnect Local memory access is a lot faster than remote memory – non-uniform memory access (NUMA) Advantage: can be built with commodity processors and many applications will perform well thanks to locality Proc 1Mem 1Proc 2Mem 2Proc 3Mem 3Proc 4Mem 4

11 Message Passing Programming model that can apply to clusters of workstations, SMPs, and even a uniprocessor Sends and receives are used for effecting the data transfer – usually, each process ends up making a copy of data that is relevant to it Each process can only name local addresses, other processes, and a tag to help distinguish between multiple messages

12 Title Bullet