CS203 – Advanced Computer Architecture

Slides:



Advertisements
Similar presentations
Computer Structure Power Management Lihu Rappoport and Adi Yoaz Thanks to Efi Rotem for many of the foils.
Advertisements

Power Reduction Techniques For Microprocessor Systems
Introduction to CMOS VLSI Design Lecture 21: Scaling and Economics
Lecture 2: Modern Trends 1. 2 Microprocessor Performance Only 7% improvement in memory performance every year! 50% improvement in microprocessor performance.
1 ECE 570– Advanced Computer Architecture Dr. Patrick Chiang Winter 2013 Tues/Thurs 2-4PMPM.
1 Introduction Background: CS 3810 or equivalent, based on Hennessy and Patterson’s Computer Organization and Design Text for CS/EE 6810: Hennessy and.
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 14: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
CS 7810 Lecture 12 Power-Aware Microarchitecture: Design and Modeling Challenges for Next-Generation Microprocessors D. Brooks et al. IEEE Micro, Nov/Dec.
Design and Implementation of VLSI Systems (EN0160)
8/18/05ELEC / Lecture 11 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 18: Scaling Theory Prof. Sherief Reda Division of Engineering, Brown University.
Institute of Digital and Computer Systems 1 Fabio Garzia / Finding Peak Performance in a Process23/06/2015 Chapter 5 Finding Peak Performance in a Process.
From Compaq, ASP- DAC00. Power Consumption Power consumption is on the rise due to: - Higher integration levels (more devices & wires) - Rising clock.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Power-Aware Computing 101 CS 771 – Optimizing Compilers Fall 2005 – Lecture 22.
Introduction to CMOS VLSI Design Lecture 21: Scaling and Economics Credits: David Harris Harvey Mudd College (Material taken/adapted from Harris’ lecture.
Power-aware Computing n Dramatic increases in computer power consumption: » Some processors now draw more than 100 watts » Memory power consumption is.
Power, Energy and Delay Static CMOS is an attractive design style because of its good noise margins, ideal voltage transfer characteristics, full logic.
Single-Chip Multi-Processors (CMP) PRADEEP DANDAMUDI 1 ELEC , Fall 08.
CS 423 – Operating Systems Design Lecture 22 – Power Management Klara Nahrstedt and Raoul Rivas Spring 2013 CS Spring 2013.
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
Lecture 2: Fundamentals of Computer Design Kai Bu
1 VLSI and Computer Architecture Trends ECE 25 Fall 2012.
EZ-COURSEWARE State-of-the-Art Teaching Tools From AMS Teaching Tomorrow’s Technology Today.
17 Sep 2002Embedded Seminar2 Outline The Big Picture Who’s got the Power? What’s in the bag of tricks?
Computer Architecture- 1 Ping-Liang Lai ( 賴秉樑 ) Chapter 1 Fundamentals of Computer Design Computer Architecture 計算機結構.
Lecture 03: Fundamentals of Computer Design - Trends and Performance Kai Bu
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
Last Time Performance Analysis It’s all relative
Multi-core Programming Introduction Topics. Topics General Ideas Moore’s Law Amdahl's Law Processes and Threads Concurrency vs. Parallelism.
Basics of Energy & Power Dissipation Lecture notes S. Yalamanchili, S. Mukhopadhyay. A. Chowdhary.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
® 1 VLSI Design Challenges for Gigascale Integration Shekhar Borkar Intel Corp. October 25, 2005.
The University of Adelaide, School of Computer Science
1 CS/EE 6810: Computer Architecture Class format:  Most lectures on YouTube *BEFORE* class  Use class time for discussions, clarifications, problem-solving,
MS108 Computer System I Lecture 2 Metrics Prof. Xiaoyao Liang 2014/2/28 1.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
[Tim Shattuck, 2006][1] Performance / Watt: The New Server Focus Improving Performance / Watt For Modern Processors Tim Shattuck April 19, 2006 From the.
Basics of Energy & Power Dissipation
Morgan Kaufmann Publishers
© Digital Integrated Circuits 2nd Inverter Digital Integrated Circuits A Design Perspective The Inverter Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.
By Nasir Mahmood.  The NoC solution brings a networking method to on-chip communication.
Computer Science and Engineering Power-Performance Considerations of Parallel Computing on Chip Multiprocessors Jian Li and Jose F. Martinez ACM Transactions.
FPGA-Based System Design: Chapter 6 Copyright  2004 Prentice Hall PTR Topics n Low power design. n Pipelining.
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO Session 3 Computer Evolution.
LOW POWER DESIGN METHODS
Power-Optimal Pipelining in Deep Submicron Technology
High Performance Computer Architecture:
What is *Computer Architecture*
CS203 – Advanced Computer Architecture
Green cloud computing 2 Cs 595 Lecture 15.
LOW POWER DESIGN METHODS V.ANANDI ASST.PROF,E&C MSRIT,BANGALORE.
Hot Chips, Slow Wires, Leaky Transistors
Morgan Kaufmann Publishers
Fundamentals of Computer Design - Trends and Performance
Architecture & Organization 1
The University of Adelaide, School of Computer Science
Lecture 2: Performance Today’s topics: Technology wrap-up
Architecture & Organization 1
Transistors on lead microprocessors double every 2 years Moore’s Law in Microprocessors Transistors on lead microprocessors double every 2 years.
Chapter 1 Fundamentals of Computer Design
Fundamentals of Computer Design - Trends and Performance
The University of Adelaide, School of Computer Science
Computer Evolution and Performance
The University of Adelaide, School of Computer Science
The University of Adelaide, School of Computer Science
Utsunomiya University
Presentation transcript:

CS203 – Advanced Computer Architecture Technology Trends

Trends in Technology - Density Integrated circuit technology Transistor density: 35%/year Die size: 10-20%/year Integration overall: 40-55%/year DRAM capacity: 25-40%/year (slowing) Flash capacity: 50-60%/year 15-20X cheaper/bit than DRAM Magnetic disk technology: 40%/year 15-25X cheaper/bit then Flash 300-500X cheaper/bit than DRAM

Bandwidth and Latency Trends The University of Adelaide, School of Computer Science 27 April 2017 Bandwidth and Latency Trends Bandwidth or throughput Total work done in a given time 10,000-25,000X improvement for processors 300-1200X improvement for memory and disks Latency or response time Time between start and completion of an event 30-80X improvement for processors 6-8X improvement for memory and disks Chapter 2 — Instructions: Language of the Computer

Log-log plot of bandwidth and latency milestones

Transistors

The University of Adelaide, School of Computer Science 27 April 2017 Transistors and Wires Feature size Minimum size of transistor or wire in x or y dimension 10 microns in 1971 to .032 microns in 2011 32nm  22nm (‘12)  14nm (‘14)  10nm (?) Transistor performance scales linearly Wire delay does not improve with feature size! Integration density scales quadratically Chapter 2 — Instructions: Language of the Computer

Technology Scaling Moore’s Law: All these features scale by 1/S, S=SQRT(2) every 2 years

Moore’s Law

Impact of Scaling on Characteristics Device Characteristics Feature Dependence Scaling Transistor Gain (β) W/(L. tox) S Current (Ids) β(Vdd-Vth)2 Resistance Vdd/Ids 1 Gate Capacitance (W.L)/tox 1/S Gate delay R.C Clock Frequency 1/(R.C) Circuit Area W.L 1/S2 Wire Resistance 1/(w.h) S2 Wire Capacitance h/s

Effects of Technology Scaling Scaling dimensions doubles device density Frequency increases by 41% Scaling voltage simultaneously keep the power constant If voltage is not scaled then clock frequency can scale even faster but dynamic power grows! Threshold voltage scaling causes gate leakage power growth!

Technology Trends Wire delays don’t scale like logic delays Processor structures must expand to support more instructions Thus wire delays dominate the cycle time; slow wires must be local Design complexity Processors are becoming so complex that a large fraction of the development of a processor or system is dedicated to verification Chip density is increasing much faster than the productivity of verification engineers (new tools, speed of systems) CMOS endpoint CMOS is rapidly reaching the limits of miniaturization Feature sizes will reach atomic dimensions in less than 15 years Options???? Quantum computing Nanotechnology Analog computing Performance remains a critical design factor

Power and energy

Pstatic = VIsub ≈ Ve-KVt/T Power Total power: dynamic + static (leakage) Pdynamic = αCV2f Pstatic = VIsub ≈ Ve-KVt/T Power/energy are critical problems Power (immediate energy dissipation) must be dissipated Otherwise temperature goes up (affects performance, correctness and may possibly destroy the circuit, short term or long term) Effect on the supply of power to the chip Energy (depends on power and speed) Costly; global problem Battery operated devices

The University of Adelaide, School of Computer Science 27 April 2017 Power and Energy Problem: Get power in, get power out Thermal Design Power (TDP) Characterizes sustained power consumption Used as target for power supply and cooling system Lower than peak power, higher than average power consumption Clock rate can be reduced dynamically to limit power consumption Energy per task is often a better measurement Chapter 2 — Instructions: Language of the Computer

Dynamic Energy and Power The University of Adelaide, School of Computer Science 27 April 2017 Dynamic Energy and Power Dynamic power Pdynamic = αCV2f α = fraction of clock period where output 0  1 (at most ½) Dynamic energy Transistor switch from 0  1 or 1  0 Edynamic = αCV2 Reducing clock rate reduces power, not energy Chapter 2 — Instructions: Language of the Computer

The University of Adelaide, School of Computer Science 27 April 2017 Power Trends Intel 80386 consumed ~ 2 W 3.3 GHz Intel Core i7 consumes 130 W Heat must be dissipated from 1.5 x 1.5 cm chip This is the limit of what can be cooled by air Chapter 2 — Instructions: Language of the Computer

Why multi-core? Pdynamic = αCV2f Dynamic power favors parallel processing over higher clock rate Dynamic power roughly proportional to f3 Take a proc. and replicate it 4 times: 4x speedup & 4x power Take a proc and clock it 4 times faster: 4x speedup but 64x dynamic power!

Example of quantifying power Suppose 15% reduction in voltage results in a 15% reduction in frequency. What is impact on dynamic power?

Static Power Because leakage current flows even when a transistor is off, now static power important too Leakage current increases in processors with smaller transistor sizes Increasing the number of transistors increases power even if they are turned off Very low power systems gate voltage to inactive modules to control loss due to leakage

The University of Adelaide, School of Computer Science 27 April 2017 Reducing Power Techniques for reducing power: Do nothing well (Idle low power modes) Dynamic Voltage-Frequency Scaling Low power state for DRAM, disks Overclocking, turning off cores Chapter 2 — Instructions: Language of the Computer

Cost

The University of Adelaide, School of Computer Science 27 April 2017 Trends in Cost Cost driven down by learning curve Yield DRAM: price closely tracks cost Microprocessors: price depends on volume 10% less for each doubling of volume Chapter 2 — Instructions: Language of the Computer

Integrated Circuit Cost The University of Adelaide, School of Computer Science 27 April 2017 Integrated Circuit Cost Integrated circuit Defects per unit area = 0.016-0.057 defects per square cm (2010) N = process-complexity factor = 11.5-15.5 (40 nm, 2010) Chapter 2 — Instructions: Language of the Computer