by Brian Wheatman and Elaine Gan

Presentation transcript:

Cache Coherence by Brian Wheatman and Elaine Gan

How Far We Got…
  Our sixcache deadlocks on mc_multiply2
  Our stq and lhusm fail on only 2 tests: mc_incrementers and mc_spin_lock
  Deadlocking and livelocking

Difficulties Coding and Debugging
  Parent Protocol Processor: getting things mutually exclusive without imposing an arbitrary order
  Deadlocking in Ex5: MessageRouter could not fire due to an implicit guard; splitting into two rules
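The implicit-guard problem above can be illustrated with a small model. This is a Python sketch, not Bluespec and not the authors' code. It assumes the router had a single rule that touched two queues, so the rule's guard was the conjunction of both queues being ready. The names `combined_rule`, `split_rules`, `resp_q`, and `req_q` are hypothetical.

```python
from collections import deque


def combined_rule(resp_q, req_q, out):
    """Model of ONE rule that dequeues from both queues.

    Its guard is the AND of both queues being non-empty, so it cannot
    fire when only one of them holds a message.
    """
    if resp_q and req_q:
        out.append(resp_q.popleft())
        out.append(req_q.popleft())
        return True
    return False


def split_rules(resp_q, req_q, out):
    """Model of TWO rules, each guarded only by its own queue."""
    fired = False
    if resp_q:  # rule 1: forward a response if one is waiting
        out.append(resp_q.popleft())
        fired = True
    if req_q:   # rule 2: forward a request if one is waiting
        out.append(req_q.popleft())
        fired = True
    return fired


# Only a response is waiting; no request has arrived.
stuck_out = []
stuck = combined_rule(deque(["resp"]), deque(), stuck_out)

moving_out = []
moving = split_rules(deque(["resp"]), deque(), moving_out)

print("combined fired:", stuck, stuck_out)
print("split fired:", moving, moving_out)
```

In this model the combined rule never forwards the waiting response, which is one way a router can stall, while the split version makes progress on whichever queue is ready.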

Difficulties Coding and Debugging
  Trouble with Ex. 8 (mc_spin_lock & mc_multiply2): spent hours on what ended up being a typo
  Where we are now: SixCache, debugging why there is livelocking

Difficulties Understanding the Protocol
  Fairness in dealing with child requests in MessageRouter
  Didn’t surface until mc_incrementers in Ex 8
  When to downgrade or invalidate
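One common way to get fairness between child requests is round-robin arbitration. The Python sketch below contrasts it with fixed priority. It is an illustration under assumptions, not the authors' MessageRouter: the function names and the `pending` list (one flag per child) are hypothetical.

```python
def fixed_priority_pick(pending):
    """Always serve the lowest-numbered child with a pending request."""
    for idx, has_request in enumerate(pending):
        if has_request:
            return idx
    return None


def round_robin_pick(pending, last):
    """Serve the next pending child after `last`, wrapping around."""
    n = len(pending)
    for offset in range(1, n + 1):
        idx = (last + offset) % n
        if pending[idx]:
            return idx
    return None


# Two children that both always have a request pending.
pending = [True, True]

fixed_order = [fixed_priority_pick(pending) for _ in range(4)]

rr_order = []
last = len(pending) - 1  # start so that child 0 is considered first
for _ in range(4):
    last = round_robin_pick(pending, last)
    rr_order.append(last)

print("fixed priority:", fixed_order)
print("round robin:", rr_order)
```

With fixed priority, child 1 is never served while child 0 keeps requesting. A workload where several cores contend for the same line could expose that kind of starvation.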

What Improvements? For the Class
  A better understanding of what Bluespec does with the code
  Perhaps some examples from actual processors

What Improvements? For the Final Project
  More even split between Part 1 and Part 2
  More individual testing
  Exercise 5 – whole memory hierarchy
  Give out the assignment earlier or have some stages be done for earlier labs