Lectures Slides and Figures from MKP and Sudhakar Yalamanchili

Slides:



Advertisements
Similar presentations
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Yaohang Li.
Advertisements

Computer Abstractions and Technology
Chapter 1 CSF 2009 Computer Performance. Defining Performance Which airplane has the best performance? Chapter 1 — Computer Abstractions and Technology.
Chapter 1. Introduction This course is all about how computers work But what do we mean by a computer? –Different types: desktop, servers, embedded devices.
Computer Organization and Design David. Paterson and John L. Hennessy
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
EET 4250: Chapter 1 Performance Measurement, Instruction Count & CPI Acknowledgements: Some slides and lecture notes for this course adapted from Prof.
Morgan Kaufmann Publishers Computer Abstractions and Technology
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Chapter 1 CSF 2009 Computer Abstractions and Technology.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology.
Last Time Performance Analysis It’s all relative
EET 4250: Chapter 1 Computer Abstractions and Technology Acknowledgements: Some slides and lecture notes for this course adapted from Prof. Mary Jane Irwin.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
(1) ECE 3056: Architecture, Concurrency and Energy in Computation Lecture Notes by MKP and Sudhakar Yalamanchili Sudhakar Yalamanchili (Some small modifications.
Chapter 1 — Computer Abstractions and Technology — 1 Understanding Performance Algorithm Determines number of operations executed Programming language,
Chapter 1 — Computer Abstractions and Technology — 1 The Computer Revolution Progress in computer technology – Underpinned by Moore’s Law Makes novel applications.
Chapter 1 Performance & Technology Trends Read Sections 1.5, 1.6, and 1.8.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Introduction.  This course is all about how computers work  But what do we mean by a computer?  Different types: desktop, servers, embedded devices.
Chapter 1 Technology Trends and Performance. Chapter 1 — Computer Abstractions and Technology — 2 Technology Trends Electronics technology continues to.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Chapter 1 — Computer Abstractions and Technology — 1 Below Your Program Application software – Written in high-level language System software – Compiler:
CPS3340 COMPUTER ARCHITECTURE Fall Semester, /03/2013 Lecture 3: Computer Performance Instructor: Ashraf Yaseen DEPARTMENT OF MATH & COMPUTER SCIENCE.
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO Session 2 Computer Organization.
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Yaohang Li.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology.
Chapter 1 — Computer Abstractions and Technology — 1 Uniprocessor Performance Constrained by power, instruction-level parallelism, memory latency.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Chapter 1 Performance & Technology Trends. Outline What is computer architecture? Performance What is performance: latency (response time), throughput.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface ARM Edition Chapter 1 Computer Abstractions and Technology.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology.
Computer Architecture & Operations I
Computer Architecture & Operations I
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Technology Trends and Performance
Morgan Kaufmann Publishers Computer Abstractions and Technology
Computer Architecture and Organization
ECE 3056: Architecture, Concurrency and Energy in Computation
Computer Architecture & Operations I
Morgan Kaufmann Publishers
CS161 – Design and Architecture of Computer Systems
ECE 3055: Computer Architecture and Operating Systems
Morgan Kaufmann Publishers Computer Abstractions and Technology
Computer Architecture & Operations I
Morgan Kaufmann Publishers Computer Architecture
Xiang Lian The University of Texas-Pan American
Lectures Slides and Figures from MKP and Sudhakar Yalamanchili
Morgan Kaufmann Publishers Computer Abstractions and Technology
Uniprocessor Performance
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers
Morgan Kaufmann Publishers Computer Abstractions and Technology
ECE 154A Introduction to Computer Architecture
COSC 3406: Computer Organization
Architecture & Organization 1
Morgan Kaufmann Publishers Computer Abstractions and Technology
COSC 3406: Computer Organization
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers
Architecture & Organization 1
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Abstractions and Technology
Morgan Kaufmann Publishers Computer Abstractions and Technology
Computer Abstractions and Technology
Presentation transcript:

Lectures Slides and Figures from MKP and Sudhakar Yalamanchili Introduction Lectures Slides and Figures from MKP and Sudhakar Yalamanchili

Reading Sections 1.1, 1.2, 1.3, 1.4, 1.5, 1.7, 1.8 Key Ideas Types of systems  implications for computer architecture The impact of technology New rules  the power wall and parallelism

Historical Perspective ENIAC built in World War II was the first general purpose computer Used for computing artillery firing tables 80 feet long by 8.5 feet high and several feet wide Each of the twenty 10 digit registers was 2 feet long Used 18,000 vacuum tubes Performed 1900 additions per second Since then Moore’s Law Transistor density doubles every 18-24 months Modern version #cores double every 18-24 months

The Modern Era Internet of Things? Hundred’s of dollars Millions Battery operated Internet capable Millions Contrast with warehouse scale computing, e.g., Google and Amazon Software as a service Backend for mobile devices Power consumption limited

Morgan Kaufmann Publishers November 3, 2017 Opening the Box Capacitive multitouch LCD screen 3.8 V, 25 Watt-hour battery Computer board Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers November 3, 2017 Inside the Processor Apple A5 Chapter 1 — Computer Abstractions and Technology

Warehouse Scale Computers SUN MD S20: Water cooled containers 187.5KW Google data center in Oregon Power densities of 10-20 KW/m2  footprint  cost From R. Katz, “Tech Titans Building Boom,” IEEE Spectrum, February 2009,http://www.spectrum.ieee.org/feb09/7327

Morgan Kaufmann Publishers November 3, 2017 Inside the Core (CPU) Datapath: performs operations on data Control: sequences datapath, memory, ... Cache memory Small fast SRAM memory for immediate access to data www.wikipedia.org (IBM HS20 blade server) Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers November 3, 2017 Inside the Processor AMD Barcelona: 4 processor cores Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers November 3, 2017 Reminder ECE 2035 High-level language Level of abstraction closer to problem domain Assembly language Textual representation of instructions Hardware representation Encoded instructions and data ECE 3056 How does this work? Chapter 1 — Computer Abstractions and Technology

Moore’s Law Goal: Sustain Performance Scaling From wikipedia.org

Uniprocessor Performance Morgan Kaufmann Publishers November 3, 2017 Uniprocessor Performance §1.8 The Sea Change: The Switch to Multiprocessors Constrained by power, instruction-level parallelism, memory latency Chapter 1 — Computer Abstractions and Technology

Feature Size We are currently at 0.032µm and moving towards 0.022µm Source: Courtesy H.H. Lee, ECE 3055

New Rules: The End of Dennard Scaling GATE SOURCE DRAIN tox L Voltage is no longer scaling at the same rate Slower scaling in power per transistor  increasing power densities From R. Dennard, et al., “Design of ion-implanted MOSFETs with very small physical dimensions,” IEEE Journal of Solid State Circuits, vol. SC-9, no. 5, pp. 256-268, Oct. 1974.

Post Dennard Performance Scaling W. J. Dally, Keynote IITC 2012 15

Morgan Kaufmann Publishers November 3, 2017 Power Wall In CMOS IC technology ×30 5V → 1V ×1000 Chapter 1 — Computer Abstractions and Technology

Memory Wall “Moore’s Law” µProc 60%/yr. 1000 100 Processor-Memory CPU “Moore’s Law” 100 Processor-Memory Performance Gap: (grows 50% / year) 10 DRAM 7%/yr. DRAM 1 Time

X2: 300mm wafer, 117 chips, 90nm technology Understanding Cost X2: 300mm wafer, 117 chips, 90nm technology X4: 45nm technology What happens if you simply port a design across technology generations? What about design costs? Hardware and software

Integrated Circuit Cost Morgan Kaufmann Publishers November 3, 2017 Integrated Circuit Cost Nonlinear relation to area and defect rate Wafer cost and area are fixed Defect rate determined by manufacturing process Die area determined by architecture and circuit design Chapter 1 — Computer Abstractions and Technology

Impact on Design From http://umairmohsin.wordpress.com/2009/12/23/beyond-the-core-intel-roadmap-2010/

Average Transistor Cost Per Year Source: Courtesy H.H. Lee, ECE 3055

Morgan Kaufmann Publishers November 3, 2017 A Safe Place for Data Volatile main memory Loses instructions and data when power off Non-volatile secondary memory Magnetic disk Flash memory Optical disk (CDROM, DVD) Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers November 3, 2017 Networks Communication and resource sharing Local area network (LAN): Ethernet Within a building Wide area network (WAN: the Internet Wireless network: WiFi, Bluetooth Chapter 1 — Computer Abstractions and Technology

Morgan Kaufmann Publishers November 3, 2017 Parallelism Multicore microprocessors More than one processor per chip Parallel programming Compare with instruction level parallelism Hardware executes multiple instructions at once Hidden from the programmer Hard to do Programming for performance Load balancing Optimizing communication and synchronization Chapter 1 — Computer Abstractions and Technology

Multicore, Many Core, and Heterogeneity NVIDIA Keplar Performance scaling via increasing core count The advent of heterogeneous computing AMD Trinity Intel Ivy Bridge Different instruction sets

Eight Great Ideas Design for Moore’s Law Use abstraction to simplify design Make the common case fast Performance via parallelism Performance via pipelining Performance via prediction Hierarchy of memories Dependability via redundancy

Morgan Kaufmann Publishers November 3, 2017 Concluding Remarks New Rules Power and energy efficiency are driving concerns Cost is an exercise in mass production Relationship to instruction set architecture (ISA)? Instruction set architecture (ISA) The hardware/software interface is the vehicle for portability and cost management Multicore Core scaling vs. frequency scaling Need for parallel programming  need to think parallel! Chapter 1 — Computer Abstractions and Technology

Study Guide Moore’s Law Technology Trends Understanding Cost What is it? What are the cost and performance consequences? Technology Trends Explain the reason for the shift to power and energy efficient computing Understanding Cost What are the major elements of cost? Multicore processor Distinguishing features Basic Components of a Modern Processor

Glossary Energy efficiency Dennard Scaling Die yield Feature size Heterogeneity Moore’s Law Multicore architecture Memory Wall Performance scaling Parallel programming Power efficiency Power Wall Tick-tock development model Wafer