architectural overview

Slides:



Advertisements
Similar presentations
Introduction to Computers Section 4A. home Decimal Number System Called base 10 because 10 symbols are available.
Advertisements

Machine cycle.
DSPs Vs General Purpose Microprocessors
Lecture 4 Introduction to Digital Signal Processors (DSPs) Dr. Konstantinos Tatas.
AMD OPTERON ARCHITECTURE Omar Aragon Abdel Salam Sayyad This presentation is missing the references used.
Processor Overview Features Designed for consumer and wireless products RISC Processor with Harvard Architecture Vector Floating Point coprocessor Branch.
Fall EE 333 Lillevik 333f06-l20 University of Portland School of Engineering Computer Organization Lecture 20 Pipelining: “bucket brigade” MIPS.
Microprocessors. Von Neumann architecture Data and instructions in single read/write memory Contents of memory addressable by location, independent of.
Khaled A. Al-Utaibi  Computers are Every Where  What is Computer Engineering?  Design Levels  Computer Engineering Fields  What.
PlayStation2 as a General Purpose Computer (The Emotion Engine vs. general PC architectures)
1 Microprocessor-based Systems Course 4 - Microprocessors.
Chris Foster Brian Moore Scott Thibaudeau Overview I/O EE – Emotion Engine!! Graphics Synthesizer Comparison.
VIRAM-1 Architecture Update and Status Christoforos E. Kozyrakis IRAM Retreat January 2000.
PowerPC 601 Stephen Tam. To be tackled today Architecture Execution Units Fixed-Point (Integer) Unit Floating-Point Unit Branch Processing Unit Cache.
GCSE Computing - The CPU
PlayStation 2 Architecture Irin Jose Farid Momin Quy Ngo Olivia Wong.
What’s on the Motherboard? The two main parts of the CPU are the control unit and the arithmetic logic unit. The control unit retrieves instructions from.
Inside The CPU. Buses There are 3 Types of Buses There are 3 Types of Buses Address bus Address bus –between CPU and Main Memory –Carries address of where.
GPGPU overview. Graphics Processing Unit (GPU) GPU is the chip in computer video cards, PS3, Xbox, etc – Designed to realize the 3D graphics pipeline.
© 2007 Elsevier Lecture 6: Embedded Processors Embedded Computing Systems Mikko Lipasti, adapted from M. Schulte Based on slides and textbook from Wayne.
Prof. Milo Martin for CIS700
SUPERSCALAR EXECUTION. two-way superscalar The DLW-2 has two ALUs, so it’s able to execute two arithmetic instructions in parallel (hence the term two-way.
Emotion Engine A look at the microprocessor at the center of the PlayStation2 gaming console Charles Aldrich.
1 Copyright © 2011, Elsevier Inc. All rights Reserved. Appendix E Authors: John Hennessy & David Patterson.
Motivation Mobile embedded systems are present in: –Cell phones –PDA’s –MP3 players –GPS units.
© Paradigm Publishing Inc. 2-1 Chapter 2 Input and Processing.
Basic Microcomputer Design. Inside the CPU Registers – storage locations Control Unit (CU) – coordinates the sequencing of steps involved in executing.
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
Introduction to Computing: Lecture 4
3 1 3 C H A P T E R Hardware: Input, Processing, and Output Devices.
Microcontrollers Microcontroller (MCU) – An integrated electronic computing device that includes three major components on a single chip Microprocessor.
Topic:The Motorola M680X0 Family Team:Ulrike Eckardt Frederik Fleck André Kudra Jan Schuster Date:Thursday, 12/10/1998 CS-350 Computer Organization Term.
By Michael Butler, Leslie Barnes, Debjit Das Sarma, Bob Gelinas This paper appears in: Micro, IEEE March/April 2011 (vol. 31 no. 2) pp 마이크로 프로세서.
CLEMSON U N I V E R S I T Y AVR32 Micro Controller Unit Atmel has created the first processor architected specifically for 21st century applications that.
Computers organization & Assembly Language Chapter 0 INTRODUCTION TO COMPUTING Basic Concepts.
ARM for Wireless Applications ARM11 Microarchitecture On the ARMv6 Connie Wang.
What is µP? “An integrated circuit containing … a central processing unit (CPU) and a means to access external memory” -- (Ball 2000)
3/29: Processors Roll Call Lecture: CPU’s Other (?)
Chapter 2 Data Manipulation. © 2005 Pearson Addison-Wesley. All rights reserved 2-2 Chapter 2: Data Manipulation 2.1 Computer Architecture 2.2 Machine.
Sam Sandbote CSE 8383 Advanced Computer Architecture The IBM Cell Architecture Sam Sandbote CSE 8383 Advanced Computer Architecture April 18, 2006.
Department of Industrial Engineering Sharif University of Technology Session# 6.
Different Microprocessors Tamanna Haque Nipa Lecturer Dept. of Computer Science Stamford University Bangladesh.
12/13/ _01 1 Computer Organization EEC-213 Computer Organization Electrical and Computer Engineering.
Next Generation ISA Itanium / IA-64. Operating Environments IA-32 Protected Mode/Real Mode/Virtual Mode - if supported by the OS IA-64 Instruction Set.
Academic PowerPoint Computer System – Architecture.
The Intel 86 Family of Processors
P5 Micro architecture : Intel’s Fifth generation
Playstation2 Architecture Architecture Hardware Design.
Pentium Architecture Arithmetic/Logic Units (ALUs) : – There are two parallel integer instruction pipelines: u-pipeline and v-pipeline – The u-pipeline.
Chapter 2 Turning Data into Something You Can Use
The Central Processing Unit (CPU)
Different Microprocessors Tamanna Haque Nipa Lecturer Dept. of Computer Science Stamford University Bangladesh.
Case Study: Implementing the MPEG-4 AS Profile on a Multi-core System on Chip Architecture R 楊峰偉 R 張哲瑜 R 陳 宸.
Chao Han ELEC6200 Computer Architecture Fall 081ELEC : Han: PowerPC.
Emotion Engine™ AKA the “Playstation 2” Architecture Or The progeny of a MIPS and a DSP By Idan Gazit – June 2002.
Computer Architecture Lecture 24 Parallel Processing Ralph Grishman November 2015 NYU.
Sun Microsystems’ UltraSPARC-IIi a Stunt-Free Presentation by Christine Munson Amanda Peters Carl Sadler.
1 Basic Processor Architecture. 2 Building Blocks of Processor Systems CPU.
GPGPU introduction. Why is GPU in the picture Seeking exa-scale computing platform Minimize power per operation. – Power is directly correlated to the.
CPIT Program Execution. Today, general-purpose computers use a set of instructions called a program to process data. A computer executes the.
Protection in Virtual Mode
Visit for more Learning Resources
Microprocessor & Assembly Language
Overview Prof. Eric Rotenberg
CS 252 Spring 2000 Jeff Herman John Loo Xiaoyi Tang
COMPUTER ARCHITECTURES FOR PARALLEL ROCESSING
Microprocessor Architecture
CSE 502: Computer Architecture
Introduction to Computers
Presentation transcript:

architectural overview Sony Emotion Engine architectural overview 2002.5.20 Kim L. Vu

Acronyms ALU Arithmetic Logic Unit COP Coprocessor DMAC Direct Memory Access Controller DSP Digital Sound Processing EE Emotion Engine EFU Elementary Functional Unit GIF Graphics Interface IPU Image Processing Unit MAC Multiply-Accumulate RDRAM Rambus Dynamic RAM SPRAM Scratch-Pad RAM VPU Vector Processing Unit

Overview (ps2 architecture)

Emotion Engine VU0 thought simulation, AI, physics calculations SIMD, VLIW architecture VU1 fixed geometry calculations CPU + FPU program control IPU real-time image data decompression

Emotion Engine Features 300Mhz MIPS III CPU Two-issue superscalar,128-bit multimedia extensions 16k, 2-way instruction cache 8k, 2-way data cache 16k “scratch pad” RAM Vector Units Both have 4 FMACS + 1 FDIV EFP (Elementary function unit) in VU1 1 FMAC + 1 FDIV 128-bit data bus IPU – MPEG2 decoder unit 10-channel DMA Controller

CPU Core Features MIPS III Instruction Set architecture 6 stage pipeline PC Select | Fetch | Register | Exec | Cache Access | Write Back Two 64-bit integers ALUs ALUs can be combined in”lock step” to execute 128-bit SIMD operations Load/Store Unit Branch Execution Unit 64-entry two-branch prediction mechanism 32 128-bit registers

Vector unit performance Microarchitecturally identical 4 FMACS 1 FDIV 1 Load/Store Unit 1 ALU 1 random number generator 2 issue VLIW (64-bit bundle) Two operating modes VLIW and Coprocessor mode Throughput FMAC operation – 1 cycle FDIV operation – 7 cycles 4x4 matrix * vector – 4 cycles 4x4 matrix * matrix – 16 cycles

VU Features Features VU0 VU1 Job VLIW mode Coprocessor mode Flexible calculations Fixed 3D calculations VLIW mode Yes Coprocessor mode No VPU components 4k instruction RAM 4k data RAM VIF 16k instruction RAM 16k data RAM GIF EFU Performance 4 FMACS (2.4 Gflops) 1 FDIV (0.04 Gflops) 1 EFU (0.64 Gflops)    

VPU0 Design Strategy 2-modes : VLIW and coprocessor Runs mainly in coprocessor mode Lower opcode always NOP Controlled by CPU Executes 32-bit MIPS coprocessor instructions Processes 4 parallel FP instructions

VPU1 Design Strategy Can only run in VLIW mode Executes 64-bit VLIW bundle Accessed by 3D display list 3D display list contain boht instruction and data in same structure

VU Instruction Formats Instruction bundle has 2 parts “upper” (SIMD) + “lower” “Lower” execution unit “Upper” execution unit FP div/sqrt/reverse sqrt 4 parallel FP add/sub Load/store 4 parallel FP mul EFU(1 FMAC + 1FDIV) 4 parallel FP add/msub Jump/branch Random number generator

EE Teams

Team 1 Handles physics, program control, AI and behavior calculations Members work closely together with each other Ease of communication through 128-bit dedicated busses from CPU to FPU and VU0 SPRAM – acts as CPU and VU0’s shared workspace

Team 2 Handles simple geometric calculations Members act as equal partners Dedicated 128-bit bus from VPU1 to GIF

Team Interoperation Serial connection Parallel connection VPU0 acts as VPU1’s coprocessor SPRAM is used to transfer data to VPU1 VPU1 renders final image Parallel connection GIF monitors the status of the graphics synthesizer Both teams independently and asynchronously sends display lists

References Atsushi Kunimatsu, et. al., “Vector Unit Architecture for Emotion Synthesis”, IEEE Micro, Vol. 20, No. 2, March/April 2000, pp. 40-47 K. Kutaragie et. al., “A Microprocessor with 128b CPU, 10 Floating-Point MACS, 4 Floating-Point Dividers, and MPEG2 Decoder,” ISSCC (Int’l Solid-States Circuit Conf.) Digest Tech. Papers, IEEE Press, Piscatawey, New Jersey, Feb. 1999, pp. 256-257 F. Micheal Raam, et. al., “A High-Bandwidth Superscalar Microprocessor for Multimedia Applications,” ISSCC Digest Tech. Papers, IEEE Press, Feb. 1999, pp. 258-259 Sound and Vision: A Technical Overview of the Emotion Engine by John Stokes, Ars Technica http://arstechnica.com/reviews/1q00/playstation2/ee-1.html The Playstation2 vs. the PC: A System-level Comparison of Two 3D Platforms by John Stokes, Ars Technica http://arstechnica.com/cpu/2q00/ps2/ps2vspc-1.html