CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS41 CIS 501: Computer Architecture Unit 12: Putting it All Together: The Xbox One/PS4 Game Consoles.

Slides:



Advertisements
Similar presentations
Larrabee Eric Jogerst Cortlandt Schoonover Francis Tan.
Advertisements

CIS 501: Computer Architecture | Prof. Joe Devietti | Introduction 1 CIS 501: Computer Architecture Unit 1: Introduction Slides developed by Joe Devietti,
Lecture 38: Chapter 7: Multiprocessors Today’s topic –Vector processors –GPUs –An example 1.
Computer Architecture and Design Fall 2009 Indraneil Gokhale.
Introduction to Microprocessors and Microcomputers.
 $18 Billion Entertainment Industry  40% increase from 2006  Demographics  Fastest growing group: 50+ (25%)  Average age: 33  40% of all gamers.
Adam Kunk Anil John Pete Bohman.  Released by IBM in 2010 (~ February)  Successor of the POWER6  Implements IBM PowerPC architecture v2.06  Clock.
OS Case Study: The Xbox 360  Instructor: Rob Nash  Readings: See citations in the slides.
Room: E-3-31 Phone: Dr Masri Ayob TK 2123 COMPUTER ORGANISATION & ARCHITECTURE Lecture 4: Computer Performance.
1 ITCS 6/8010 CUDA Programming, UNC-Charlotte, B. Wilkinson, Jan 19, 2011 Emergence of GPU systems and clusters for general purpose High Performance Computing.
1 Pipelining for Multi- Core Architectures. 2 Multi-Core Technology Single Core Dual CoreMulti-Core + Cache + Cache Core 4 or more cores.
Adam Kunk Anil John Pete Bohman.  Released by IBM in 2010 (~ February)  Successor of the POWER6  Shift from high frequency to multi-core  Implements.
CS 7810 Lecture 24 The Cell Processor H. Peter Hofstee Proceedings of HPCA-11 February 2005.
Chapter 1 Sections 1.1 – 1.3 Dr. Iyad F. Jafar Introduction.
GPGPU overview. Graphics Processing Unit (GPU) GPU is the chip in computer video cards, PS3, Xbox, etc – Designed to realize the 3D graphics pipeline.
 WII U  Playstation 4  Xbox One  OUYA  Future Technologies.
Prof. Milo Martin for CIS700
Comparison of Next Generation Gaming Architectures Presented By Dela Tsiagbe Presented By Dela Tsiagbe.
Emotion Engine A look at the microprocessor at the center of the PlayStation2 gaming console Charles Aldrich.
Computer performance.
Cell Architecture. Introduction The Cell concept was originally thought up by Sony Computer Entertainment inc. of Japan, for the PlayStation 3 The architecture.
1 VLSI and Computer Architecture Trends ECE 25 Fall 2012.
Semiconductor Memory 1970 Fairchild Size of a single core –i.e. 1 bit of magnetic core storage Holds 256 bits Non-destructive read Much faster than core.
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
Next-Generation Consoles Brenden Schubert & Nathaniel Williams With a Special talk by John F. Rhoads on Video Game Ethics.
Practical PC, 7th Edition Chapter 17: Looking Under the Hood
Physics 413 Chapter 1 Computer Architecture What is a Digital Computer ? A computer is essentially a fast electronic calculating machine. What is a program.
Computer Graphics Graphics Hardware
Company LOGO High Performance Processors Miguel J. González Blanco Miguel A. Padilla Puig Felix Rivera Rivas.
Copyright © 2007 Heathkit Company, Inc. All Rights Reserved PC Fundamentals Presentation 27 – A Brief History of the Microprocessor.
3. April 2006Bernd Panzer-Steindel, CERN/IT1 HEPIX 2006 CPU technology session some ‘random walk’
CS/ECE 3330 Computer Architecture Kim Hazelwood Fall 2009.
Dragged, Kicking and Screaming: Multicore Architecture and Video Games.
High Performance Computing Processors Felix Noble Mirayma V. Rodriguez Agnes Velez Electric and Computer Engineer Department August 25, 2004.
NVIDIA Fermi Architecture Patrick Cozzi University of Pennsylvania CIS Spring 2011.
Computer Organization and Design Computer Abstractions and Technology
Hyper Threading Technology. Introduction Hyper-threading is a technology developed by Intel Corporation for it’s Xeon processors with a 533 MHz system.
Xbox MB system memory IBM 3-way symmetric core processor ATI GPU with embedded EDRAM 12x DVD Optional Hard disk.
Introduction.  This course is all about how computers work  But what do we mean by a computer?  Different types: desktop, servers, embedded devices.
What is gaming console & platform? A game console is a device which outputs video signal into TV screen to display the video game. A platform is in which.
THE BRIEF HISTORY OF 8085 MICROPROCESSOR & THEIR APPLICATIONS
CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS41 CIS 501: Computer Architecture Unit 12: Putting it All Together: The Xbox One/PS4 Game Consoles.
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
Computer Architecture Introduction Lynn Choi Korea University.
UltraSPARC III Hari P. Ananthanarayanan Anand S. Rajan.
Sony PlayStation 3 Sony also laid out the technical specs of the device. The PlayStation 3 will feature the much-vaunted Cell processor, which will run.
Chapter 5: Computer Systems Design and Organization Dr Mohamed Menacer Taibah University
Succeeding with Technology Chapter 2 Hardware Designed to Meet the Need The Digital Revolution Integrated Circuits and Processing Storage Input, Output,
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO CS 219 Computer Organization.
GPGPU introduction. Why is GPU in the picture Seeking exa-scale computing platform Minimize power per operation. – Power is directly correlated to the.
Lecture 1: Introduction CprE 585 Advanced Computer Architecture, Fall 2004 Zhao Zhang.
Lecture # 10 Processors Microcomputer Processors.
VU-Advanced Computer Architecture Lecture 1-Introduction 1 Advanced Computer Architecture CS 704 Advanced Computer Architecture Lecture 1.
Emergence of GPU systems for general purpose high performance computing ITCS 4145/5145 © Barry Wilkinson GPUIntro.ppt Oct 30, 2014.
Hardware Architecture
CPU (Central Processing Unit). The CPU is the brain of the computer. Sometimes referred to simply as the processor or central processor, the CPU is where.
Xbox 360 Architecture Presenter: Ataç Deniz Oral Date: 30/11/06.
Multi-Core CPUs Matt Kuehn. Roadmap ► Intel vs AMD ► Early multi-core processors ► Threads vs Physical Cores ► Multithreading and Multi-core processing.
ALPHA 21164PC. Alpha 21164PC High-performance alternative to a Windows NT Personal Computer.
SPRING 2012 Assembly Language. Definition 2 A microprocessor is a silicon chip which forms the core of a microcomputer the concept of what goes into a.
William Stallings Computer Organization and Architecture 6th Edition
Introduction to Microcontrollers
Cell Architecture.
What happens inside a CPU?
Basic Computer Organization
Unit 2 Computer Systems HND in Computing and Systems Development
BIC 10503: COMPUTER ARCHITECTURE
NVIDIA Fermi Architecture
Chip&Core Architecture
Presentation transcript:

CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS41 CIS 501: Computer Architecture Unit 12: Putting it All Together: The Xbox One/PS4 Game Consoles Slides originally developed by Joe Devietti, Milo Martin & Amir Roth at University of Pennsylvania

CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS42 This Unit: Putting It All Together Anatomy of a game console Microsoft Xbox One/Sony PS4 Focus mostly on CPU chip Briefly talk about system Graphics processing unit (GPU) I/O and other devices Application OS FirmwareCompiler I/O Memory Digital Circuits Gates & Transistors CPU

CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS43 Sources compared-to-playstation-4/ compared-to-playstation-4/ AMD HotChips 2012 presentation on Jaguar Wikipedia for sales numbers

CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS44 What is Computer Architecture? Plans The role of a computer architect: “Technology” Logic Gates SRAM DRAM Circuit Techniques Packaging Magnetic Storage Flash Memory Goals Function Performance Reliability Cost/Manufacturability Energy Efficiency Time to Market Computer PCs Servers PDAs Mobile Phones Supercomputers Game Consoles Embedded Design Manufacturing

CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS45 Microsoft XBox Game Console History Xbox First game console by Microsoft, released in 2001 Glorified PC: 733 Mhz x86 Intel CPU, 64MB DRAM, NVIDIA GPU Ran modified version of Windows OS ~25 million sold Xbox nd generation, released in 2005 Custom hardware: 3.2 Ghz 3-core PowerPC processor, ATI GPU 80M sold as of October 2013 [Source: Wikipedia] Xbox One 3 rd generation, released 2013 Semi-custom hardware: 1.6GHz 8-core AMD CPU, AMD GPU 1M sold in first 24 hours

Sony PlayStation History PlayStation 2 released in MHz MIPS CPU >155M sold as of March 2012 [Wikipedia] PlayStation 3 released late GHz IBM Cell Processor: 1 PowerPC core + 8 mini-cores 80M sold as of November 2013 [Wikipedia] PlayStation 4 released late 2013 Semi-custom hardware: 1.6GHz 8-core AMD CPU, AMD GPU basically same as Xbox One 2+M sold as of December 2013 CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS46

AMD Jaguar Core CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS47 AMD Jaguar core 2-way OoO 28nm transistors

CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS48

9

10

CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS411

CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS412

Xbox One SoC 13 c/o Chipworks

PlayStation 4 SoC 14 c/o Chipworks

CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS415 c/o Anandtech

Xbox One OS Runs two different virtualized Oses (on MS Hyper-V) Windows kernel Xbox OS each OS gets a hard partition of physical resources enables better multitasking? CIS 501: Comp. Arch. | Prof. Joe Devietti | Xbox1/PS416

Putting It All Together Unit 1: Introduction Unit 2: ISAs Unit 3: Technology Unit 4: Performance Unit 5: Pipelining & Branch Prediction Unit 6: Caches Unit 7: Virtual Memory Unit 8: Superscalar Unit 9: Scheduling Unit 10: Multicore Unit 11: Vectors CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox 36017

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox Microsoft Turns to IBM for XBox 360 Microsoft is mostly a software company Turned to IBM & ATI for XBox 360 design Sony & Nintendo also turned to IBM (for PS3 & Wii, respectively) Design principles of XBox 360 [Andrews & Baker, 2006] Value for 5-7 years  big performance increase over last generation Support anti-aliased high-definition video 30+ fps)  extremely high pixel fill rate (goal: 100+ million pixels/s) Flexible to suit dynamic range of games  balance hardware, homogenous resources Programmability (easy to program)  listened to software developers

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox More on Games Workload Graphics, graphics, graphics Special highly-parallel graphics processing unit (GPU) Much like on PCs today But general-purpose, too “The high-level game code is generally a database management problem, with plenty of object-oriented code and pointer manipulation. Such a workload needs a large L2 and high integer performance.” [Andrews & Baker, 2006] Wanted only a modest number of modest, fast cores Not one big core Not dozens of small cores (leave that to the GPU) Quote from Seymour Cray

XBox 360 System from 30,000 Feet CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox [Krewell, Microprocessor Report, Oct 21, 2005]

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox XBox 360 System [Andrews & Baker, IEEE Micro, Mar/Apr 2006]

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox XBox 360 “Xenon” Processor ISA: 64-bit PowerPC chip RISC ISA Like MIPS, but with condition codes Fixed-length 32-bit instructions bit general purpose registers (GPRs) ISA Extended with VMX-128 operations 128 registers, 128-bits each Packed “vector” operations Example: four 32-bit floating point numbers One instruction: VR1 * VR2  VR3 Four single-precision operations Also supports conversion to Microsoft DirectX data formats Similar to Altivec (and Intel’s MMX, SSE, SSE2, etc.) Works great for 3D graphics kernels and compression

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox XBox 360 “Xenon” Processor Peak performance: ~75 gigaflops Gigaflop = 1 billion floating points operations per second Pipelined superscalar processor 3.2 Ghz operation Superscalar: two-way issue VMX-128 instructions (four single-precision operations at a time) Hardware multithreading: two threads per processor Three processor cores per chip Result: 3.2 * 2 * 4 * 3 = ~77 gigaflops

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox [Andrews & Baker, IEEE Micro, Mar/Apr 2006] XBox 360 “Xenon” Chip (IBM) 165 million transistors IBM’s 90nm process Three cores 3.2 Ghz Two-way superscalar Two-way multithreaded Shared 1MB cache

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox “Xenon” Processor Pipeline [Brown, IBM, Dec 2005] Four-instruction fetch Two-instruction “dispatch” Five functional units “VMX128” execution “decoupled” from other units 14-cycle VMX dot-product Branch predictor: “4K” G-share predictor Unclear if 4KB or 4K 2-bit counters Per thread

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox XBox 360 Memory Hiearchy 128B cache blocks throughout 32KB 2-way set-associative instruction cache (per core) 32KB 4-way set-associative data cache (per core) Write-through, lots of store buffering Parity 1MB 8-way set-associative second-level cache (per chip) Special “skip L2” prefetch instruction MESI cache coherence Error Correcting Codes (ECC) 512MB GDDR3 DRAM, dual memory controllers Total of 22.4 GB/s of memory bandwidth Direct path to GPU

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox Xenon Multicore Interconnect [Brown, IBM, Dec 2005]

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox XBox 360 System [Andrews & Baker, IEEE Micro, Mar/Apr 2006]

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox XBox Graphics Subsystem [Andrews & Baker, IEEE Micro, Mar/Apr 2006] 28.8 GB/s link bandwidth 10.8 GB/s FSB bandwidth link each way 22.4 GB/s DRAM bandwidth

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox Graphics “Parent” Die (ATI) 232 million transistors 500 Mhz 48 unified shader ALUs Mini-cores for graphics [Andrews & Baker, IEEE Micro, Mar/Apr 2006]

CIS 501: Comp. Arch. | Prof. Joe Devietti | XBox GPU “daughter” die (NEC) 100 million transistors 10MB eDRAM “Embedded” NEC Electronics Anti-aliasing Render at 4x resolution, then sample Z-buffering Track the “depth” of pixels 256GB/s internal bandwidth [Andrews & Baker, IEEE Micro, Mar/Apr 2006]