Download presentation
Presentation is loading. Please wait.
Published byMarcia Webb Modified over 9 years ago
1
Parallel Computers Today Oak Ridge / Cray Jaguar > 1.75 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS TFLOPS = 10 12 floating point ops/sec PFLOPS = 1,000,000,000,000,000 / sec (10 15 )
2
Supercomputers 1976:Cray-1, 133 MFLOPS (10 6 ) Supercomputers 1976: Cray-1, 133 MFLOPS (10 6 )
3
Trends in processor clock speed
4
AMD Opteron 12-core chip
5
AMD Opteron 6-core layout detail
6
The nVidia G80 GPU 128 streaming floating point processors @1.5Ghz 1.5 Gb Shared RAM with 86Gb/s bandwidth 500 Gflop on one chip (single precision)
7
More Detail on GPU Architecture
8
Cray XMT (highly multithreaded shared memory)
9
Top 500 List http://www.top500.org/list/2010/11/100 Graph 500 List http://www.graph500.org/Results.html
10
Generic Parallel Machine Architecture Key architecture question: Where is the interconnect, and how fast? Key algorithm question: Where is the data? Proc Cache L2 Cache L3 Cache Memory Storage Hierarchy Proc Cache L2 Cache L3 Cache Memory Proc Cache L2 Cache L3 Cache Memory potential interconnects
11
4-core Intel Nehalem chip (2 per Triton node):
12
Triton memory hierarchy Node Memory Proc Cache L2 Cache L3 Cache Proc Cache L2 Cache Proc Cache L2 Cache Proc Cache L2 Cache Proc Cache L2 Cache L3 Cache Proc Cache L2 Cache Proc Cache L2 Cache Proc Cache L2 Cache Chip Node
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.