Download presentation
Presentation is loading. Please wait.
Published byClaude Norton Modified over 9 years ago
1
Understanding Parallel Computers Parallel Processing EE 613
2
Balancing Machine Specifics With Portability How much do we need to know about the machine architecture –Performance goal Game developers, embedded systems, and hardware vendors Coding specific to machine –Lifetime goal Portability Generalized code
3
A Look At Six Parallel Computers Chip Multiprocessors –Intel Core Duo –AMD Dual Core Opteron Heterogeneous Chip Designs –GPU, FPGA, Cell –Vector Clusters –Node – processors, RAM, disk, memory not shared –Typical - eight nodes, control processor, switch –Blade server – includes com ports and cooling fans Supercomputers – BlueGene/L (440 PowerPC’s)
4
CSE524 Parallel Algorithms Lawrence Snyder
8
Now we can put multiple cores on a single chip.
9
CSE524 Parallel Algorithms Lawrence Snyder MESI – Modified, Exclusive, Shared, Invalid
10
CSE524 Parallel Algorithms Lawrence Snyder
12
MOESI – Modified, Owned, Exclusive, Shared, Invalid
13
CSE524 Parallel Algorithms Lawrence Snyder Both designs implement a coherent shared memory.
14
CSE524 Parallel Algorithms Lawrence Snyder
15
Symmetric Multiprocessor (SMP) Each processor makes memory requests over the common memory bus All cache controllers snoop the memory bus and adjust the tags on their cached values to ensure coherent cache usage.
16
CSE524 Parallel Algorithms Lawrence Snyder Example – po & p1 have copies of Block X, but p2 writes to Block X.
18
CSE524 Parallel Algorithms Lawrence Snyder
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.