● Cell Broadband Engine Architecture Processor ● Ryan Layer ● Ben Kreuter ● Michelle McDaniel ● Carrie Ruppar.

Slides:



Advertisements
Similar presentations
Larrabee Eric Jogerst Cortlandt Schoonover Francis Tan.
Advertisements

Parallel Processing with PlayStation3 Lawrence Kalisz.
Systems and Technology Group © 2006 IBM Corporation Cell Programming Tutorial - JHD24 May 2006 Cell Programming Tutorial Jeff Derby, Senior Technical Staff.
4. Shared Memory Parallel Architectures 4.4. Multicore Architectures
Slides Prepared from the CI-Tutor Courses at NCSA By S. Masoud Sadjadi School of Computing and Information Sciences Florida.
AMD OPTERON ARCHITECTURE Omar Aragon Abdel Salam Sayyad This presentation is missing the references used.
Instructor Notes We describe motivation for talking about underlying device architecture because device architecture is often avoided in conventional.
Cell Broadband Engine. INF5062, Carsten Griwodz & Pål Halvorsen University of Oslo Cell Broadband Engine Structure SPE PPE MIC EIB.
Lecture Objectives: 1)Explain the limitations of flash memory. 2)Define wear leveling. 3)Define the term IO Transaction 4)Define the terms synchronous.
1 Microprocessor-based Systems Course 4 - Microprocessors.
Using Cell Processors for Intrusion Detection through Regular Expression Matching with Speculation Author: C˘at˘alin Radu, C˘at˘alin Leordeanu, Valentin.
1 Sec (2.1) Computer Architectures. 2 For temporary storage of information, the CPU contains cells, or registers, that are conceptually similar to main.
EECS 470 Superscalar Architectures and the Pentium 4 Lecture 12.
Synergistic Processing In Cell’s Multicore Architecture Michael Gschwind, et al. Presented by: Jia Zou CS258 3/5/08.
Cell Broadband Processor Daniel Bagley Meng Tan. Agenda  General Intro  History of development  Technical overview of architecture  Detailed technical.
PlayStation 2 Architecture Irin Jose Farid Momin Quy Ngo Olivia Wong.
Emotion Engine A look at the microprocessor at the center of the PlayStation2 gaming console Charles Aldrich.
J. A. Kahle, M. N. Day, H. P. Hofstee, C. R. Johns, T. R. Maeurer, and D. Shippy IBM Systems and Technology Group IBM Journal of Research and Development.
Programming the Cell Multiprocessor Işıl ÖZ. Outline Cell processor – Objectives – Design and architecture Programming the cell – Programming models CellSs.
Basic Operational Concepts of a Computer
Cell Architecture. Introduction The Cell concept was originally thought up by Sony Computer Entertainment inc. of Japan, for the PlayStation 3 The architecture.
Introduction to the Cell multiprocessor J. A. Kahle, M. N. Day, H. P. Hofstee, C. R. Johns, T. R. Maeurer, D. Shippy (IBM Systems and Technology Group)
Cell Systems and Technology Group. Introduction to the Cell Broadband Engine Architecture  A new class of multicore processors being brought to the consumer.
Evaluation of Multi-core Architectures for Image Processing Algorithms Masters Thesis Presentation by Trupti Patil July 22, 2009.
Agenda Performance highlights of Cell Target applications
1/21 Cell Processor (Cell Broadband Engine Architecture) Mark Budensiek.
1 The IBM Cell Processor – Architecture and On-Chip Communication Interconnect.
Kevin Eady Ben Plunkett Prateeksha Satyamoorthy.
Super computers Parallel Processing By Lecturer: Aisha Dawood.
Interconnection Networks Computer Architecture: A Quantitative Approach 4 th Edition, Appendix E Timothy Mark Pinkston University of Southern California.
Sam Sandbote CSE 8383 Advanced Computer Architecture The IBM Cell Architecture Sam Sandbote CSE 8383 Advanced Computer Architecture April 18, 2006.
LYU0703 Parallel Distributed Programming on PS3 1 Huang Hiu Fung Wong Chung Hoi Supervised by Prof. Michael R. Lyu Department of Computer.
Optimizing Ray Tracing on the Cell Microprocessor David Oguns.
A Level Computing for AQA Teacher’s Resource CD-ROM 4 CHAPTER: Functional characteristics of a processor.
Presented by Jeremy S. Meredith Sadaf R. Alam Jeffrey S. Vetter Future Technologies Group Computer Science and Mathematics Division Research supported.
Aarul Jain CSE520, Advanced Computer Architecture Fall 2007.
IBM Cell Processor Ryan Carlson, Yannick Lanner-Cusin, & Cyrus Stoller CS87: Parallel and Distributed Computing.
1/21 Cell Processor Systems Seminar Diana Palsetia (11/21/2006)
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
CIT 140: Introduction to ITSlide #1 CSC 140: Introduction to IT Operating Systems.
Introduction to Operating Systems Concepts
Computer Hardware What is a CPU.
Computer Organization and Architecture Lecture 1 : Introduction
Itanium® 2 Processor Architecture
Lynn Choi School of Electrical Engineering
Protection in Virtual Mode
Chapter 10: Computer systems (1)
CS 286 Computer Architecture & Organization
Multiscalar Processors
RAM, CPUs, & BUSES Egle Cebelyte.
Cell Architecture.
Architecture & Organization 1
Buses.
Architecture & Organization 1
BIC 10503: COMPUTER ARCHITECTURE
Microprocessor & Assembly Language
Introduction to CUDA Programming
ECEG-3202 Computer Architecture and Organization
Introduction to Computing
Comparison of Two Processors
Operating Systems Chapter 5: Input/Output Management
Coe818 Advanced Computer Architecture
ECEG-3202 Computer Architecture and Organization
Overview Prof. Eric Rotenberg
What is Computer Architecture?
What is Computer Architecture?
CS 143A Principles of Operating Systems
Large data arrays processing on Cell Broadband Engine
Interconnection Network and Prefetching
Presentation transcript:

● Cell Broadband Engine Architecture Processor ● Ryan Layer ● Ben Kreuter ● Michelle McDaniel ● Carrie Ruppar

Element Interconnect Bus (EIB) ● Communication bus connecting the on-chip elements including an arbitration unit (traffic light) ● Circular ring of four 16B wide unidirectional channels – Counter-rotate in pairs – Each channel can handle up to 3 transactions concurrently – Each participant on bus has a 16B read port and 16B write port

Flexibility of Cell's Resources ● Job Queue – PPE maintains job queue, schedules jobs in SPEs, monitors progress – SPE runs “mini kernel” to fetch and execute job and synch with PPE ● Self-multitasking of SPEs – Kernel and scheduling distributed across SPEs – Task synchronization done via mutexes or semaphores ● Stream Processing – Each SPE runs a distinct program

Memory Hierarchy: PPE ● PPE – Power Processor Element ● Caches: – Instruction Cache: 32K; 2-way set associative – L1 Cache: 32K; 4-way set associative – L2 Cache: 512K; 8-way set associative ● Contains Memory Management Units for accessing memory

Memory Hierarchy: SPE ● SPE - Synergistic Processor Element ● Vector CPU ● Local Store: 256K, each, ● Contain Memory Management Units, rather than coherency mechanism

Memory Hierarchy: XDR Interface ● Cell’s DRAM interface ● Dual channel Rambus XIO macro which interfaces to Rambus XDR memory ● Each channel is 36-bits wide ● Cell’s XDR memory bus runs at 400MHz ● 3.2GHz data signaling rates => memory bandwidth of 25.6GB/s

PPE Microarchitecture ● Out of order execution ● Dynamic branch prediction – Specialized “branch processor” exposes control of branching and branch prediction to the software ● 32k L1 D$, 32k L1 I$, 512k L2 ● VMX unit – SIMD floating point (not part of the SPE!)

SPE Microarchitecture ● Static branch prediction ● Dual issue – one compute, one memory instruction per cycle ● No cache – not really needed ● 128 registers

Synergistic Processor Unit ISA SPU LS MFC SPE PPE Main Memory EIB SPU LS MFC SPU LS MFC... clgt cc, a, b brz cc, else then: a d, d, a br done else: ai d, d, 1 done: clgt cc, a, b a d_a, d, a ai d_1, d, 1 selb d, d_1, d_a, cc if (a > b) d += a else d += 1 [Jack Dongarra CS 594] conditional bitconditional mask – SPU ISA != PPE ISA (VMX) – Vector instructions – ½ the power, ½ the area, same performance ● no cache: local store ● no branch prediction: hints, less branching

Questions ?

References ● Cell Architecture Explained, Version 2: ● ● Wikipedia: ● ● SPU Instruction Set Architecture: ● 01.ibm.com/chips/techlib/techlib.nsf/techdocs/76CA6C F F2C44 ● Cell software development: ● ● The Cell project at IBM Research: ●