ECE/CS 757: Advanced Computer Architecture II

Slides:



Advertisements
Similar presentations
The University of Adelaide, School of Computer Science
Advertisements

SE-292 High Performance Computing
4. Shared Memory Parallel Architectures 4.4. Multicore Architectures
Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
CS 284a, 7 October 97Copyright (c) , John Thornley1 CS 284a Lecture Tuesday, 7 October 1997.
1 Lecture 1: Parallel Architecture Intro Course organization:  ~5 lectures based on Culler-Singh textbook  ~5 lectures based on Larus-Rajwar textbook.
1 Lecture 1: Introduction Course organization:  4 lectures on cache coherence and consistency  2 lectures on transactional memory  2 lectures on interconnection.
Parallel Computer Architectures
Parallel Computer Architecture: Essentials for Both Computer Scientists and Engineers Edward F. Gehringer †* Yan.
Lecture 37: Chapter 7: Multiprocessors Today’s topic –Introduction to multiprocessors –Parallelism in software –Memory organization –Cache coherence 1.
Multi-core Processing The Past and The Future Amir Moghimi, ASIC Course, UT ECE.
CS668- Lecture 2 - Sept. 30 Today’s topics Parallel Architectures (Chapter 2) Memory Hierarchy Busses and Switched Networks Interconnection Network Topologies.
Introduction CSE 410, Spring 2008 Computer Systems
Multi-Core Architectures
Multi-core systems System Architecture COMP25212 Daniel Goodman Advanced Processor Technologies Group.
Grad Student Visit DayUniversity of Wisconsin-Madison Wisconsin Computer Architecture Guri SohiMark HillMikko LipastiDavid WoodKaru Sankaralingam Nam Sung.
Spring 2003CSE P5481 Issues in Multiprocessors Which programming model for interprocessor communication shared memory regular loads & stores message passing.
1 Wisconsin Computer Architecture Guri SohiMark HillMikko LipastiDavid WoodKaru Sankaralingam Nam Sung Kim.
Anshul Kumar, CSE IITD Other Architectures & Examples Multithreaded architectures Dataflow architectures Multiprocessor examples 1 st May, 2006.
YEAR 2006 The University of Auckland | New Zealand PRESENTATION Computer Science 703 Advance Computer Architecture 2006 Semester 1 Preparation for Test.
CS5222 Adv. Comp. Arch. Part 0 Page.1 Chi C.H. Fall 2003 NUS CS5222 Advanced Computer Architecture Part 0: Course Introduction Fall Term, 2003/2004 Chi.
ECE/CS 757: Advanced Computer Architecture II Instructor:Mikko H Lipasti Spring 2009 University of Wisconsin-Madison Lecture notes based on slides created.
August 13, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 11: Multiprocessors: Uniform Memory Access * Jeremy R. Johnson Monday,
ECE/CS 552: Shared Memory © Prof. Mikko Lipasti Lecture notes based in part on slides created by Mark Hill, David Wood, Guri Sohi, John Shen and Jim Smith.
Lecture 0. Course Introduction Prof. Taeweon Suh Computer Science Education Korea University COM515 Advanced Computer Architecture.
CDA-5155 Computer Architecture Principles Fall 2000 Multiprocessor Architectures.
ECE/CS 552: Multithreading and Multicore © Prof. Mikko Lipasti Lecture notes based in part on slides created by Mark Hill, David Wood, Guri Sohi, John.
Introduction Goal: connecting multiple computers to get higher performance – Multiprocessors – Scalability, availability, power efficiency Job-level (process-level)
Lecture 1: Introduction CprE 585 Advanced Computer Architecture, Fall 2004 Zhao Zhang.
The University of Adelaide, School of Computer Science
EE 382 Processor DesignWinter 98/99Michael Flynn 1 EE382 Processor Design Winter 1998 Chapter 8 Lectures Multiprocessors, Part I.
Elec/Comp 526 Spring 2015 High Performance Computer Architecture Instructor Peter Varman DH 2022 (Duncan Hall) rice.edux3990 Office Hours Tue/Thu.
Introduction CSE 410, Spring 2005 Computer Systems
ECE/CS 757: Advanced Computer Architecture II
Lynn Choi School of Electrical Engineering
Parallel Architecture
Lecture 5 Approaches to Concurrency: The Multiprocessor
ECE 3055: Computer Architecture and Operating Systems
Prof. Onur Mutlu Carnegie Mellon University
ECE/CS 757: Advanced Computer Architecture II Midterm 2 Review
The University of Adelaide, School of Computer Science
The University of Adelaide, School of Computer Science
Multiprocessors Oracle SPARC M core, 64MB L3 cache (8 x 8 MB), 1.6TB/s. 256 KB of 4-way SA L2 ICache, 0.5 TB/s per cluster. 2 cores share 256 KB,
Multi-Processing in High Performance Computer Architecture:
ECE/CS 757: Advanced Computer Architecture II Interconnects
CS775: Computer Architecture
Multi-Processing in High Performance Computer Architecture:
The University of Adelaide, School of Computer Science
CMSC 611: Advanced Computer Architecture
The University of Adelaide, School of Computer Science
Advanced Computer Architectures
Lecture 1: Parallel Architecture Intro
Multiprocessors - Flynn’s taxonomy (1966)
Coe818 Advanced Computer Architecture
/ Computer Architecture and Design
ECE/CS 752: Midterm 2 Review ECE/CS 752 Fall 2017
EE 4xx: Computer Architecture and Performance Programming
ECE/CS 552: Pipelining to Superscalar
The University of Adelaide, School of Computer Science
Chapter 4 Multiprocessors
The University of Adelaide, School of Computer Science
Lecture 17 Multiprocessors and Thread-Level Parallelism
Course Outline for Computer Architecture
Advanced Computer Architecture 5MD00 / 5Z033 Overview
Lecture 17 Multiprocessors and Thread-Level Parallelism
CSL718 : Multiprocessors 13th April, 2006 Introduction
The University of Adelaide, School of Computer Science
The University of Adelaide, School of Computer Science
Lecture 17 Multiprocessors and Thread-Level Parallelism
Multiprocessor System Interconnects
Presentation transcript:

ECE/CS 757: Advanced Computer Architecture II Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison Lecture notes based on slides created by John Shen, Mark Hill, David Wood, Guri Sohi, and Jim Smith, Natalie Enright Jerger, and probably others

Midterm 1 Review Introductory material Cores, multithreading, multicore Multiprocessor software Memory systems & coherence Consistency © Lipasti

Introduction Thread-level parallelism Multiprocessor Systems Cache Coherence Basics Snoopy Scalable Flynn Taxonomy (SISD/SIMD/MIMD) UMA vs. NUMA © Lipasti

Introduction - Readings Skim Smith beta ch 1 Gem5 Sutter&Larus 21st Century white paper © Lipasti

Cores (Review of 752) Iron law Beyond pipelining Superscalar challenges Instruction flow Register data flow Memory Dataflow Modern memory interface What was not covered Virtual memory Power Many implementation/design details © Lipasti

Multithreading Historical Multi-threaded Cores 6600, HEP In-Order Multi-threaded Cores Out-of-Order Multi-threaded Cores Resource Sharing Thread Scheduling Case Studies Intel Pentium 4 © Lipasti

Multicore Summary Objective: Design Issues Case studies resource sharing Design Issues Where to connect Shared vs private caches Coherence L1, L2 protocols, policies Interconnect Bus, crossbar, ring, mesh Case studies © Lipasti

Cores/Multithreading/Multicore - Readings Skim Smith ch. 3 Olukuton CMP Kumar Heterogeneous CMP © Lipasti

MP Software Important Multiprocessor Interfaces Programming Models API ABI ISA Programming Models Major Abstractions Processes & threads Communication Synchronization Shared Memory API description Implementation at ABI, ISA levels ISA support Message Passing (cursory coverage only) © Lipasti

MP Software - Readings Skim Smith Ch. 2 Scott synthesis lecture Hillis & Steele © Lipasti

Memory Systems & Cache Coherence DRAM & memory controllers Memory hierarchy review Coherence States Snoopy bus-based Invalidate Protocols Invalidate protocol optimizations Update Protocols (brief) Directory protocols Implementation issues © Lipasti

Coherence Readings Skim Smith Ch. 4 Synthesis lecture on coherence & consistency Conway et al. (Opteron) © Lipasti

Consistency Models UNDERSTANDING CONSISTENCY MODELS Atomicity Program Ordering Visibility/Causality POPULAR CONSISTENCY MODELS Sequential Consistency IBM/370 Processor Consistency SPARC TSO/PSO/RMO Weak Ordering PowerPC Weak Consistency VISIBILITY MEMORY REFERENCE REORDERING IMPLEMENTATION ISSUES © Lipasti

Consistency - Readings Synthesis lecture on coherence & consistency Hill: position paper on simple models © Lipasti

Midterm 1 Review Introductory material Multiprocessor software Cores and multithreading Multicore Coherence Consistency © Lipasti