ESE534: Computer Organization

Slides:



Advertisements
Similar presentations
Slides Prepared from the CI-Tutor Courses at NCSA By S. Masoud Sadjadi School of Computing and Information Sciences Florida.
Advertisements

Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day1: September 25, 2000 Introduction and Overview.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day20: November 29, 2000 Review.
CS294-6 Reconfigurable Computing Day 5 September 8, 1998 Comparing Computing Devices.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 15: March 12, 2007 Interconnect 3: Richness.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 9: February 7, 2007 Instruction Space Modeling.
EET 4250: Chapter 1 Performance Measurement, Instruction Count & CPI Acknowledgements: Some slides and lecture notes for this course adapted from Prof.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 1: January 8, 2007 Introduction and Overview.
Penn ESE Spring DeHon 1 FUTURE Timing seemed good However, only student to give feedback marked confusing (2 of 5 on clarity) and too fast.
Reconfigurable Computing. This Class is About Reconfigurable Computing Computer Architecture Coping with Change.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 11: March 3, 2014 Instruction Space Modeling 1.
CBSSS 2002: DeHon Architecture as Interface André DeHon Friday, June 21, 2002.
Caltech CS184 Winter DeHon CS184: Computer Architecture (Structure and Organization) Day 1: January 3, 2005 Introduction and Overview.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
Operating Systems Lecture 02: Computer System Overview Anda Iamnitchi
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 1: September 5, 2012 Introduction and.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 1: September 8, 2010 Introduction and.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 7: January 24, 2003 Instruction Space.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 1: August 27, 2014 Introduction and Overview.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 1: January 13, 2010 Introduction and Overview.
Processor Level Parallelism. Improving the Pipeline Pipelined processor – Ideal speedup = num stages – Branches / conflicts mean limited returns after.
Single-Chip Heterogeneous Computing: Does the Future Include Custom Logic, FPGAs, and GPGPUs? Wasim Shaikh Date: 10/29/2015.
Caltech CS184 Winter DeHon CS184: Computer Architecture (Structure and Organization) Day 1: January 6, 2003 Introduction and Overview.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 1: January 11, 2012 Introduction and Overview.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 11: February 20, 2012 Instruction Space Modeling.
Computer Organization IS F242. Course Objective It aims at understanding and appreciating the computing system’s functional components, their characteristics,
SPRING 2012 Assembly Language. Definition 2 A microprocessor is a silicon chip which forms the core of a microcomputer the concept of what goes into a.
ESE532: System-on-a-Chip Architecture
ESE532: System-on-a-Chip Architecture
ESE532: System-on-a-Chip Architecture
Computer Organization and Machine Language Programming CPTG 245
ESE534: Computer Organization
ESE532: System-on-a-Chip Architecture
ESE532: System-on-a-Chip Architecture
ESE532: System-on-a-Chip Architecture
ESE534: Computer Organization
ECE354 Embedded Systems Introduction C Andras Moritz.
Lecture 1: CS/ECE 3810 Introduction
ESE532: System-on-a-Chip Architecture
Stateless Combinational Logic and State Circuits
CSE 410, Spring 2006 Computer Systems
ESE534 Computer Organization
ESE532: System-on-a-Chip Architecture
Morgan Kaufmann Publishers
ESE532: System-on-a-Chip Architecture
Architecture & Organization 1
Day 1: August 28, 2013 Introduction and Overview
CS775: Computer Architecture
Lecture 1: CS/ECE 3810 Introduction
EEL 3705 / 3705L Digital Logic Design
Lecture 2: Performance Today’s topics: Technology wrap-up
Architecture & Organization 1
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
A High Performance SoC: PkunityTM
1.1 The Characteristics of Contemporary Processors, Input, Output and Storage Devices Types of Processors.
ESE534: Computer Organization
Computer Evolution and Performance
CS184a: Computer Architecture (Structures and Organization)
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE534: Computer Organization
ESE532: System-on-a-Chip Architecture
Reconfigurable Computing (EN2911X, Fall07)
Presentation transcript:

ESE534: Computer Organization Day 1: August 31, 2016 Introduction and Overview Penn ESE534 Fall 2016 -- DeHon

Today Matter Computes Architecture Matters This Course (short) Unique Nature of This Course ESE532 vs. ESE534 (vs. CIS501) Change More on this course Penn ESE534 Fall 2016 -- DeHon

Power of Computation Which set of gates is more powerful? Set 1: AND2, AND3, AND4 Set 2: AND2, OR2 Set 3: NAND2 Set 4: AND2, XOR2 (assume have unlimited number of gates in each set) Penn ESE534 Fall 2016 -- DeHon

Review (assert?): Two Universality Facts NAND gate Universality [Day 2, ESE170/CIS240] We can implement any computation by interconnecting a sufficiently large network of NAND gates Turing Machine is Universal [CIS262] We can implement any computable function with a TM We can build a single TM which can be programmed to implement any computable function Day 2 reading (on Canvas) SciAm-level review Penn ESE534 Fall 2016 -- DeHon

Matter Computes We can build NAND gates out of: transistors (semiconductor devices) physical laws of electron conduction mechanical switches basic physical mechanics protein binding / promotion / inhibition Basic biochemical reactions …many other things Weiss/ NSC 2001 Penn ESE534 Fall 2016 -- DeHon

LEGOTM Logic Gates http://goldfish.ikaruga.co.uk/logic.html Penn ESE534 Fall 2016 -- DeHon

Starting Point Given sufficient raw materials: can implement any computable function Our goal in computer architecture is not to figure out how to compute new things rather, it is an engineering problem Penn ESE534 Fall 2016 -- DeHon

Engineering Problem Implement a computation: Optimization problem with least resources (in fixed resources) with least cost in least time (in fixed time) with least energy With fixed energy budget Optimization problem how do we do it best? Penn ESE534 Fall 2016 -- DeHon

Quote “An Engineer can do for a dime what everyone else can do for a dollar.” Penn ESE534 Fall 2016 -- DeHon

Questions Which part should I buy? Processor, multicore, vector, FPGA, GPU, …. Which should I spend my time programming? Given a chip (100mm2 of Silicon) How should I fill it? Can do better than step-and-repeat RISC core? How turn area (transistors) into performance? When building a System-on-a-Chip (SoC) How much area should go into: Processor cores, GPUs, FPGA logic, memory, interconnect, custom functions (which) …. ? Penn ESE534 Fall 2016 -- DeHon

How much difference? Experience running things on multiple architectures? E.g. GPU, FPGA, Processor…. Preferably at same technology node. Same Silicon die area Penn ESE534 Fall 2016 -- DeHon

Architecture Matters? How much difference is there between architectures? How badly can I be wrong in implementing/picking the wrong architecture? How efficient is the IA-32, IA-64, GPGPU? Is there much room to do better? Why did Intel buy Altera? Is architecture done? A solved problem? Penn ESE534 Fall 2016 -- DeHon

Peak Computational Densities from Model Small slice of space only 2 parameters 100 density across Large difference in peak densities large design space! Penn ESE534 Fall 2016 -- DeHon

Yielded Efficiency Large variation in yielded density FPGA (c=w=1) “Processor” (c=1024, w=64) …now let’s look at one “real” example to ground this observation Large variation in yielded density large design space! Penn ESE534 Fall 2016 -- DeHon

Energy Efficiency Large variation  large design space Processor W=64, I=128 Multicontext FPGA2014 Large variation  large design space Penn ESE534 Fall 2016 -- DeHon

Architecture Not Done Many ways, not fully understood design space requirements of computation limits on requirements, density... …and the costs are changing optimal solutions change dominant constraints change creating new challenges and opportunities Penn ESE534 Fall 2016 -- DeHon

Personal Goal? Develop systematic design Parameterize design space Memory Develop systematic design Parameterize design space adapt to costs Understand/capture req. of computing Efficiency metrics (similar to information theory?) …we’ll see a start at these this term Compute Interconnect Penn ESE534 Fall 2016 -- DeHon

Architecture Not Done Not here to just teach you the forms which are already understood (though, will do that and give you a strong understanding of their strengths and weaknesses) Goal: enable you to design and synthesize new and better architectures Penn ESE534 Fall 2016 -- DeHon

Your Questions What questions are you hoping this course will help you answer? Penn ESE534 Fall 2016 -- DeHon

This Course (short) How to organize computations Requirements Design space Characteristics of computations Building blocks compute, interconnect, retiming, instructions, control Comparisons, limits, tradeoffs Penn ESE534 Fall 2016 -- DeHon

This Course Sort out: Basis for design and analysis Techniques Custom, RISC, SIMD, Vector, VLIW, Multithreaded, Superscalar, EPIC, MIMD, FPGA, GPGPUs, multicore Basis for design and analysis Techniques [more detail at end] Penn ESE534 Fall 2016 -- DeHon

Graduate Class Assume you are here to learn Reading Problems Motivated Mature Reading Not 1:1 with lecture and assignments Won’t be policing you You may need to follow some links beyond “required” reading Problems May not be fully, tightly specified Penn ESE534 Fall 2016 -- DeHon

Uniqueness of Class Penn ESE534 Fall 2016 -- DeHon

Not a Traditional Arch. Class Traditional class (240, 371, 501) focus RISC Processor history undergraduate class on mP internals then graduate class on details This class much broader in scope develop design space see RISC processors in context of alternatives Penn ESE534 Fall 2016 -- DeHon

Authority/History ``Science is the belief in the ignorance of experts.'' -- Richard Feynman Traditional Architecture has been too much about history and authority Should be more about engineering evaluation physical world is “final authority” Goal: Teach you to think critically and independently about computer design. Penn ESE534 Fall 2016 -- DeHon

Focus Focus on raw computing organization Not worry about nice abstractions, models 501, 371, 240 provide a few good models Instruction Set Architecture (ISA) Shared Memory Transactional… …and you should know others Dataflow, streaming, data parallel, … Penn ESE534 Fall 2016 -- DeHon

Relation to ESE532 534 – this course 532 – System-on-a-Chip Architecture Deep into design space and continuum How to build compute, interconnect, memory Analysis Fundamentals Theory Why X better than Y More relevant substrate designers Both Real-Time and Best-effort New course in Spring Probably 33% overlap Broader (all CMPE) HW/SW codesign More Hands-on Code in C Map to Zynq Accelerate an application More relevant to (P)SoC user Real-time focus Penn ESE534 Fall 2016 -- DeHon

Distinction CIS240, 371 ,501 ESE532 Best Effort Computing Run as fast as you can Binary compatible ISA separation Shared memory parallelism Hardware-Software codesign Willing to recompile, maybe rewrite code Define/refine hardware Real-Time Guarantee meet deadline Non shared-memory models Penn ESE534 Fall 2016 -- DeHon

Next Few Lectures Quick run through logic/arithmetic basics make sure everyone remembers (some see for first time?) get us ready to start with observations about the key components of computing devices Trivial/old hat for many But will be some observations couldn’t make in ESE170/CIS371 May be fast if seeing for first time Background quiz intended to help me tune Penn ESE534 Fall 2016 -- DeHon

Change A key feature of the computer industry has been rapid and continual change. We must be prepared to adapt. True of this course as well ….things are still changing… We’ll try to figure it out together… Penn ESE534 Fall 2016 -- DeHon

What has changed? Speed [Discuss] Capacity Joules/op Mfg cost Size Ratio of fast memory to dense memory Wire delay vs. Gate delay Onchip vs. inter-chip Joules/op Mfg cost Per transistor Per wafer NRE (Non-recurring engineering) Reliability Limited by Transistors, energy… [Discuss] Capacity Total Per die Size Applications Number Size/complexity of each Types/variety Use Environment Embedded Mission critical Penn ESE534 Fall 2016 -- DeHon

Intel’s Moore’s Law (Scaling) >1000x Penn ESE534 Fall 2016 -- DeHon

1983 (early VLSI) Early RISC processors Xilinx XC2064 RISC = Reduced Instruction Set Computer RISC-II, 40K transistors MIPS, 24K transistors ~10MHz clock cycle Xilinx XC2064 64 4-LUTs LUT = Look-Up Table 4-LUT – program to be any gate of 4 inputs Penn ESE534 Fall 2016 -- DeHon

Today CPUs FPGAs (e.g. Stratix 10) Billions of transistors Intel 18-core Xeon E5 CPUs Billions of transistors 22+ CPU per die 7B transistors Multi-issue, 64b processors GHz clock cycles MByte caches (50MB?+) FPGAs (e.g. Stratix 10) >5M bit processing elements >200 Mbits of on-chip RAM Altera Stratix IV Penn ESE534 Fall 2016 -- DeHon

Today SoC: Apple A9x Dual 64-bit ARM 2.3GHz 3MB L2 cache 12 GPU cores Custom accelerators Image Processor? 147mm2 16nm FinFET Chipworks Die Photo Penn ESE534 Fall 2016 -- DeHon

MOS Transistor Scaling (1974 to present) [0.5x per 2 nodes] Pitch Gate [from Andrew Kahng] Source: 2001 ITRS - Exec. Summary, ORTC Figure Penn ESE534 Fall 2016 -- DeHon

Will This Last Forever? Pitch Gate [Moore, ISSCC2003] Penn ESE534 Fall 2016 -- DeHon

More Chip Capacity? Cosmic Cube / CACM 1985 Should a 2014 single-chip multiprocessor look like a 1983 multiprocessor systems? Processorprocessor latency? Inter-processor bandwidth costs? Cost of customization? Memory CP SE I/O Program Memory MP Calisto™ BCM1500 Nichols/Microprocessor Forum 2001 Penn ESE534 Fall 2016 -- DeHon

Memory Levels Why do we have 5+ levels of memory today? Apple II, IBM PC had 2 MIPS-X had 3 Penn ESE534 Fall 2016 -- DeHon

Historical Power Scaling [Horowitz et al. / IEDM 2005] Penn ESE534 Fall 2016 -- DeHon

Interesting Times Challenges to continue scaling Power density Reliability What does the end-of-scaling mean to architecture? Penn ESE534 Fall 2016 -- DeHon

Class Components Penn ESE534 Fall 2016 -- DeHon

Class Components Lecture (incl. preclass exercise) Slides on web before class (you can print if want a follow-along copy) Reading [~1 required paper/lecture] No text (online: Canvas, IEEE, ACM) 9 assignments (roughly 1 per week) Final design/analysis exercise (~4 weeks) Note syllabus, course admin online Penn ESE534 Fall 2016 -- DeHon

Preclass Exercise Like Background Quiz but more focused Motivate the topic of the day Introduce a problem Introduce a design space, tradeoff, transform Work for ~5 minutes before start lecturing Penn ESE534 Fall 2016 -- DeHon

Feedback Will have anonymous feedback sheets for each lecture Clarity? Speed? Vocabulary? General comments Penn ESE534 Fall 2016 -- DeHon

Howard Roark’s Critique of the Parthenon -- Ayn Rand Fountainhead Quote Howard Roark’s Critique of the Parthenon -- Ayn Rand Penn ESE534 Fall 2016 -- DeHon

Fountainhead Parthenon Quote “Look,” said Roark. “The famous flutings on the famous columns---what are they there for? To hide the joints in wood---when columns were made of wood, only these aren’t, they’re marble. The triglyphs, what are they? Wood. Wooden beams, the way they had to be laid when people began to build wooden shacks. Your Greeks took marble and they made copies of their wooden structures out of it, because others had done it that way. Then your masters of the Renaissance came along and made copies in plaster of copies in marble of copies in wood. Now here we are making copies in steel and concrete of copies in plaster of copies in marble of copies in wood. Why?” Penn ESE534 Fall 2016 -- DeHon

Penn ESE534 Fall 2016 -- DeHon

Computer Architecture Parallel Are we making: copies in submicron CMOS of copies in early NMOS of copies in discrete TTL of vacuum tube computers? Penn ESE534 Fall 2016 -- DeHon

Admin Your action: Find course web page Read it, including the policies Find Syllabus Find assignment 1 Find lecture slides Will try to post before lecture Find reading assignments Find reading for lecture 2 on blackboard …for this lecture if you haven’t already Penn ESE534 Fall 2016 -- DeHon

Big Ideas Matter Computes Efficiency of architectures varies widely Computation design is an engineering discipline Costs change  Best solutions (architectures) change Learn to cut through hype analyze, think, critique, synthesize Penn ESE534 Fall 2016 -- DeHon