STARFIRE Extending the SMP Envelope

Slides:



Advertisements
Similar presentations
6-April 06 by Nathan Chien. PCI System Block Diagram.
Advertisements

Slides Prepared from the CI-Tutor Courses at NCSA By S. Masoud Sadjadi School of Computing and Information Sciences Florida.
Thoughts on Shared Caches Jeff Odom University of Maryland.
Multiple Processor Systems
Cache Coherent Distributed Shared Memory. Motivations Small processor count –SMP machines –Single shared memory with multiple processors interconnected.
Distributed Processing, Client/Server, and Clusters
The University of Adelaide, School of Computer Science
IBM RS6000/SP Overview Advanced IBM Unix computers series Multiple different configurations Available from entry level to high-end machines. POWER (1,2,3,4)
1 Multiprocessors. 2 Idea: create powerful computers by connecting many smaller ones good news: works for timesharing (better than supercomputer) bad.
CS252/Patterson Lec /23/01 CS213 Parallel Processing Architecture Lecture 7: Multiprocessor Cache Coherency Problem.
1 Lecture 23: Multiprocessors Today’s topics:  RAID  Multiprocessor taxonomy  Snooping-based cache coherence protocol.
Sun FIRE Jani Raitavuo Niko Ronkainen. Sun FIRE 15K Most powerful and scalable Up to 106 processors, 576 GB memory and 250 TB online disk storage Fireplane.
CPE 731 Advanced Computer Architecture Snooping Cache Multiprocessors Dr. Gheith Abandah Adapted from the slides of Prof. David Patterson, University of.
Parallel Computer Architectures
CPU Chips The logical pinout of a generic CPU. The arrows indicate input signals and output signals. The short diagonal lines indicate that multiple pins.
Lecture 37: Chapter 7: Multiprocessors Today’s topic –Introduction to multiprocessors –Parallelism in software –Memory organization –Cache coherence 1.
Semester 1 Module 8 Ethernet Switching Andres, Wen-Yuan Liao Department of Computer Science and Engineering De Lin Institute of Technology
1 Chapter 7: Storage Systems Introduction Magnetic disks Buses RAID: Redundant Arrays of Inexpensive Disks.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
1 Fault Tolerance in the Nonstop Cyclone System By Scott Chan Robert Jardine Presented by Phuc Nguyen.
PCI Team 3: Adam Meyer, Christopher Koch,
STARFIRE: Extending the SMP Envelope Alan Charlesworth Presented By Bob Koutsoyannis.
Frank Casilio Computer Engineering May 15, 1997 Multithreaded Processors.
ECE200 – Computer Organization Chapter 9 – Multiprocessors.
BUS IN MICROPROCESSOR. Topics to discuss Bus Interface ISA VESA local PCI Plug and Play.
Parallel Programming on the SGI Origin2000 With thanks to Igor Zacharov / Benoit Marchand, SGI Taub Computer Center Technion Moshe Goldberg,
Computer Architecture System Interface Units Iolanthe II approaches Coromandel Harbour.
I/O Computer Organization II 1 Interconnecting Components Need interconnections between – CPU, memory, I/O controllers Bus: shared communication channel.
Data Management for Decision Support Session-4 Prof. Bharat Bhasker.
Sun Starfire: Extending the SMP Envelope Presented by Jen Miller 2/9/2004.
Computer Architecture System Interface Units Iolanthe II in the Bay of Islands.
Understanding Parallel Computers Parallel Processing EE 613.
Spring EE 437 Lillevik 437s06-l22 University of Portland School of Engineering Advanced Computer Architecture Lecture 22 Distributed computer Interconnection.
1 Lecture 17: Multiprocessors Topics: multiprocessor intro and taxonomy, symmetric shared-memory multiprocessors (Sections )
CDA-5155 Computer Architecture Principles Fall 2000 Multiprocessor Architectures.
Introduction Goal: connecting multiple computers to get higher performance – Multiprocessors – Scalability, availability, power efficiency Job-level (process-level)
The University of Adelaide, School of Computer Science
Multi Processing prepared and instructed by Shmuel Wimer Eng. Faculty, Bar-Ilan University June 2016Multi Processing1.
Lynn Choi School of Electrical Engineering
Bus Systems ISA PCI AGP.
Architecture and Design of AlphaServer GS320
Computer Engineering 2nd Semester
The University of Adelaide, School of Computer Science
The University of Adelaide, School of Computer Science
CS 147 – Parallel Processing
Cache Memory Presentation I
BIC 10503: COMPUTER ARCHITECTURE
CMSC 611: Advanced Computer Architecture
Directory-based Protocol
Lecture 2: Snooping-Based Coherence
Chip-Multiprocessor.
Multiprocessor Introduction and Characteristics of Multiprocessor
Introduction to Computing
Comparison of Two Processors
Interconnect with Cache Coherency Manager
Computer Science Division
CS703 - Advanced Operating Systems
Leveraging Optical Technology in Future Bus-based Chip Multiprocessors
High Performance Computing
CSC3050 – Computer Architecture
The University of Adelaide, School of Computer Science
Computer Science Division
Co-designed Virtual Machines for Reliable Computer Systems
Lecture 24: Virtual Memory, Multiprocessors
Lecture 23: Virtual Memory, Multiprocessors
Introduction To Distributed Systems
The University of Adelaide, School of Computer Science
Multiprocessors and Multi-computers
Multiprocessor System Interconnects
Presentation transcript:

STARFIRE Extending the SMP Envelope Alan Charlesworth, Sun Microsystems Presented by Mahmut Yilmaz 01/30/2006

Introduction Cache-coherent symmetric multi-processor architecture (24-64 UltraSparc II microprocessors running @ 250MHz, sharing max 64GB memory) Goals: Increase system memory bandwidth Reduce memory latency as much as possible Provide Unix server flexibility (Dynamic System Domains) Improve system reliability, availability, and serviceability How? Multiple outstanding cache misses, split-transaction buses, separate address and data paths, wider data paths, higher system clocks… Starfire cabinet1 1 http://www.filibeto.org/~aduritz/truetrue/e10000/starfire-interconnect.html

Improvements in Bus Capacity Bus snooping rate increased from 2.5M/s (1991) to 167M/s (1997). How? : Bus clock rate: 40 MHz  100 MHz Circuit switched protocol  Packet switched protocol (separates requests from replies) Separate address and data bus Interleaved multiple snoop buses (4 address buses) Cache block size: 32 bytes  64 bytes Data bus: 8 bytes  16 bytes

System

STARFIRE Interconnect Ultra Port Architecture interconnect (writeback MOESI – X-Modified, S-Modified, X-Clean, S-Clean, Invalid) Packet-switched data transaction with ECC codes (16bytes + 2bytes) 2 level interconnect On-board: Processor, SBUS cards, memory  Address & data ports Centerplane: Transfers address & data between boards 2 clock cycle address transaction 2 low-order cache-block address bits determine the address bus to use 8 clock cycle board to board data transfer Buses vs Point-to-point routers? Buses: Faster Point-to-point routers: Bandwidth, partitioning, reliability, availability, and serviceability

Interconnect Reliability ECC for both address and data bus (ASIC help) Failed components: System attempts to recover without any service interruption Redundant components: Optional Crash recovery: Automatic System Recovery (ASR) – requires redundant components

Dynamic System Domains Starfire can be easily subdivided into domains for boards using System Service Processor: Each domain is a totally isolated SMP Good for many applications: LAN consolidation; development, production and test; software migration; special I/O or network functions … Picture: http://www.filibeto.org/~aduritz/truetrue/e10000/starfire-interconnect.html

Price/Performance Throughput is higher than other HPC computers with similar cost Easy administration, reliable and serviceable Starfire Cost2 2 Alan Charlesworth, Nicholas Aneshansley, Mark Haakmeester, Dan Drogichen, Gary Gilbert , Ricki Williams, Andrew Phelps, “The Starfire SMP Interconnect”

Questions Is the price/performance comparison to other HPC computers fair? What if we have a cluster of Starfire servers? Uniform latencies? What is the performance cost adding redundancy? Centerplane: Single Point of Failure? Can we replace centerplane?

Fair Comparison? Graph: Alan Charlesworth, Nicholas Aneshansley, Mark Haakmeester, Dan Drogichen, Gary Gilbert , Ricki Williams, Andrew Phelps, “The Starfire SMP Interconnect”