YEAR 2006 The University of Auckland | New Zealand PRESENTATION Computer Science 703 Advance Computer Architecture 2006 Semester 1 Preparation for Test.

Slides:



Advertisements
Similar presentations
CSE 502: Computer Architecture
Advertisements

SE-292 High Performance Computing
Multiple Processor Systems
Transactional Memory Overview Olatunji Ruwase Fall 2007 Oct
Transactional Memory (TM) Evan Jolley EE 6633 December 7, 2012.
Multiprocessors CSE 4711 Multiprocessors - Flynn’s Taxonomy (1966) Single Instruction stream, Single Data stream (SISD) –Conventional uniprocessor –Although.
[ 1 ] Agenda Overview of transactional memory (now) Two talks on challenges of transactional memory Rebuttals/panel discussion.
Lock vs. Lock-Free memory Fahad Alduraibi, Aws Ahmad, and Eman Elrifaei.
1 Multiprocessors. 2 Idea: create powerful computers by connecting many smaller ones good news: works for timesharing (better than supercomputer) bad.
1 Lecture 7: Consistency Models Topics: sequential consistency, requirements to implement sequential consistency, relaxed consistency models.
1 Lecture 1: Parallel Architecture Intro Course organization:  ~5 lectures based on Culler-Singh textbook  ~5 lectures based on Larus-Rajwar textbook.
1 Lecture 1: Introduction Course organization:  4 lectures on cache coherence and consistency  2 lectures on transactional memory  2 lectures on interconnection.
1 Lecture 24: Multiprocessors Today’s topics:  Directory-based cache coherence protocol  Synchronization  Consistency  Writing parallel programs Reminder:
1 Lecture 18: Coherence Protocols Topics: coherence protocols for symmetric and distributed shared-memory multiprocessors (Sections )
1 CSE SUNY New Paltz Chapter Nine Multiprocessors.
Topic ? Course Overview. Guidelines Questions are rated by stars –One Star Question  Easy. Small definition, examples or generic formulas –Two Stars.
1 Lecture 20: Protocols and Synchronization Topics: distributed shared-memory multiprocessors, synchronization (Sections )
Processor Consistency [Goodman 1989]* Processor Consistency is a memory model in which the result of any execution is the same as if the operations of.
Parallel Computer Architecture: Essentials for Both Computer Scientists and Engineers Edward F. Gehringer †* Yan.
Lecture 37: Chapter 7: Multiprocessors Today’s topic –Introduction to multiprocessors –Parallelism in software –Memory organization –Cache coherence 1.
Network Topologies Topology – how nodes are connected – where there is a wire between 2 nodes. Routing – the path a message takes to get from one node.
18-447: Computer Architecture Lecture 30B: Multiprocessors Prof. Onur Mutlu Carnegie Mellon University Spring 2013, 4/22/2013.
A brief overview about Distributed Systems Group A4 Chris Sun Bryan Maden Min Fang.
Multi-core systems System Architecture COMP25212 Daniel Goodman Advanced Processor Technologies Group.
August 15, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 12: Multiprocessors: Non-Uniform Memory Access * Jeremy R. Johnson.
Course Wrap-Up Miodrag Bolic CEG4136. What was covered Interconnection network topologies and performance Shared-memory architectures Message passing.
CPE731: Advanced Computer Architecture Course Introduction Dr. Gheith Abandah د. غيث علي عبندة.
In-network cache coherence MICRO’2006 Noel Eisley et.al, Princeton Univ. Presented by PAK, EUNJI.
MODERN OPERATING SYSTEMS Third Edition ANDREW S. TANENBAUM Chapter 8 Multiple Processor Systems Tanenbaum, Modern Operating Systems 3 e, (c) 2008 Prentice-Hall,
RSIM: An Execution-Driven Simulator for ILP-Based Shared-Memory Multiprocessors and Uniprocessors.
1 Introduction ELG 6158 Digital Systems Architecture Miodrag Bolic.
1 Lecture 19: Scalable Protocols & Synch Topics: coherence protocols for distributed shared-memory multiprocessors and synchronization (Sections )
Super computers Parallel Processing
Lecture 27 Multiprocessor Scheduling. Last lecture: VMM Two old problems: CPU virtualization and memory virtualization I/O virtualization Today Issues.
Corse Overview Miodrag Bolic ELG7187 Topics in Computers: Multiprocessor Systems on Chip.
Computer Architecture Lecture 27: Multiprocessors Prof. Onur Mutlu Carnegie Mellon University Spring 2015, 4/6/2015.
COMP8330/7330/7336 Advanced Parallel and Distributed Computing Tree-Based Networks Cache Coherence Dr. Xiao Qin Auburn University
VU-Advanced Computer Architecture Lecture 1-Introduction 1 Advanced Computer Architecture CS 704 Advanced Computer Architecture Lecture 1.
740: Computer Architecture Memory Consistency Prof. Onur Mutlu Carnegie Mellon University.
Elec/Comp 526 Spring 2015 High Performance Computer Architecture Instructor Peter Varman DH 2022 (Duncan Hall) rice.edux3990 Office Hours Tue/Thu.
1 Computer Architecture & Assembly Language Spring 2001 Dr. Richard Spillman Lecture 26 – Alternative Architectures.
Multiprocessors – Locks
Maurice Herlihy and J. Eliot B. Moss,  ISCA '93
COSC6385 Advanced Computer Architecture
Parallel Architecture
18-447: Computer Architecture Lecture 30B: Multiprocessors
Transactional Memory : Hardware Proposals Overview
ECE/CS 757: Advanced Computer Architecture II Midterm 2 Review
The University of Adelaide, School of Computer Science
The University of Adelaide, School of Computer Science
Lecture 11: Consistency Models
CPE731: Advanced Computer Architecture Course Introduction
Lecture 18: Coherence and Synchronization
The University of Adelaide, School of Computer Science
CMSC 611: Advanced Computer Architecture
The University of Adelaide, School of Computer Science
Lecture 1: Parallel Architecture Intro
ECE/CS 757: Advanced Computer Architecture II
ECE/CS 752: Midterm 2 Review ECE/CS 752 Fall 2017
Lecture 25: Multiprocessors
Lecture 25: Multiprocessors
The University of Adelaide, School of Computer Science
Lecture 17 Multiprocessors and Thread-Level Parallelism
Lecture 24: Multiprocessors
Lecture: Coherence Topics: wrap-up of snooping-based coherence,
Lecture 17 Multiprocessors and Thread-Level Parallelism
Lecture 19: Coherence and Synchronization
Lecture 18: Coherence and Synchronization
The University of Adelaide, School of Computer Science
Lecture 17 Multiprocessors and Thread-Level Parallelism
Presentation transcript:

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Computer Science 703 Advance Computer Architecture 2006 Semester 1 Preparation for Test James Goodman

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Test Information Time: Tuesday, May 2: Place: CS 169 (note change from previously announced) Open-book, notes Calculators permitted Coverage: lecture material through Tuesday, 11Apr

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Required Readings You are responsible for material in these papers P. Sweazey and A.J. Smith, “A class of compatible cache consistency protocols and their support by the IEEE Futurebus,” Proc. Thirteenth International Symposium on Computer Architecture (ISCA-13), Tokyo, Japan, pp , June M. Herlihy and J.E.B. Moss, “Transactional Memory: Architectural Support for Lock-Free Data Structures,” Proc. International Symposium on Computer Architecture (ISCA- 93), ACM Press, 1993, pp R. Rajwar & J.R. Goodman, “Speculative Lock Elision: enabling highly concurrent multithreaded execution,” 34th Annual International Symposium on Microarchitecture (MICRO-34), December 2001, pp

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Readings You will likely find these papers to be helpful in understanding the lecture material Hennessy & Patterson, “Distributed shared-memory architectures,” Section 6.5 from Computer Architecture: A Quantitative Approach (3rd Ed.), W.-H. Wang, J.-L Baer, & H.M. Levy, “Organization and performance of a two-level virtual-real cache hierarchy,” ISCA- 16, pp , June Mark Hill, “Processors should support simple memory- consistency models,” IEEE Computer, 31(8), pp , August 1998.

YEAR 2006 The University of Auckland | New Zealand PRESENTATION First-week topics Overview, Moore’s Law Multiprocessing, Multithreading, & Multicores Multiprocessing Issues –Lost updates –Memory ordering –Cache coherence

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Threads and Thread programming From assignment, you should have experience with threads Creation Use of Mutex Condition variables Thread-safe, MT-safe functions

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Interconnection networks & topologies Crossbar vs. buses Direct vs. indirect networks Trees, fat trees, mesh, torus, ring, hypercube Perfect shuffle, Omega Topological measures (diameter, degree, bisection bandwidth) Avoiding deadlock in routing, virtual circuits Store-and-Forward routing Wormhole & Cut-through routing

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Simulation General Simulation techniques –time-based, event-based and process-based simulation –validation of results Architecture-specific simulation –instruction emulation –trace collection, reduction, processing Simulation tools

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Scalable Memory Systems Interaction of Virtually-addressed cache Multi-level cache Cache coherence Non-blocking cache

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Scalable Memory Systems (2) Directory-based protocols vs. snooping –Snooping has serious limitations of scale –Directory-based is always slower, but scalable –Basic protocol is simpler (3 states), but requires more serial events Maintaining a sharing list in the directory Distributed writes (why they are slow) Dealing with races

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Cache Coherence-MOESI Model Attributes –ownership –exclusiveness –validity Five states –Allowed state changes –Permitted combination of states Example of lock contention

YEAR 2006 The University of Auckland | New Zealand PRESENTATION Better programming Models Critical sections Atomic RMW operations –T&S, T&T&S, Atomic Swap –Compare & Swap –LL/SC Notion of transactional memory –Atomic insertion of a transaction (linearizability) –Hardware support (SLE) –Implementing Transactional Memory Hardware Software