Keeping Hot Chips Cool Ruchir Puri, Leon Stok, Subhrajit Bhattacharya IBM T.J. Watson Research Center Yorktown Heights, NY Circuits R-US.

Slides:



Advertisements
Similar presentations
Feb. 17, 2011 Midterm overview Real life examples of built chips
Advertisements

Subthreshold SRAM Designs for Cryptography Security Computations Adnan Gutub The Second International Conference on Software Engineering and Computer Systems.
Leakage Energy Management in Cache Hierarchies L. Li, I. Kadayif, Y-F. Tsai, N. Vijaykrishnan, M. Kandemir, M. J. Irwin, and A. Sivasubramaniam Penn State.
Dynamic and Leakage Power Reduction in MTCMOS Circuits Using an Automated Efficient Gate Clustering Technique Mohab Anis, Shawki Areibi *, Mohamed Mahmoud.
University of Michigan Electrical Engineering and Computer Science University of Michigan Electrical Engineering and Computer Science University of Michigan.
Power Reduction Techniques For Microprocessor Systems
Dynamic Voltage Scaling Using Both Headers and Footers Kyle Craig and Roy Matthews ECE 632.
5/9/2015 A 32-bit ALU with Sleep Mode for Leakage Power Reduction Manish Kulkarni Department of Electrical and Computer Engineering Auburn University,
Adaptive Techniques for Leakage Power Management in L2 Cache Peripheral Circuits Houman Homayoun Alex Veidenbaum and Jean-Luc Gaudiot Dept. of Computer.
Mehdi Alimadadi, Samad Sheikhaei, Guy Lemieux, Shahriar Mirabbasi, Patrick Palmer University of British Columbia (UBC) Vancouver, BC, Canada A 3GHz Switching.
Introduction to CMOS VLSI Design Lecture 18: Design for Low Power David Harris Harvey Mudd College Spring 2004.
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 14: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
11/5/2004EE 42 fall 2004 lecture 281 Lecture #28 PMOS LAST TIME: NMOS Electrical Model – NMOS physical structure: W and L and d ox, TODAY: PMOS –Physical.
Low-Power CMOS SRAM By: Tony Lugo Nhan Tran Adviser: Dr. David Parent.
Micro-Architecture Techniques for Sensor Network Processors Amir Javidi EECS 598 Feb 25, 2010.
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
Static Memory Outline –Types of Static Memory –Static RAM –Battery Backup –EPROM –Flash Memory –EEPROM Goal –Understand types of static memory –Understand.
CSE477 L26 System Power.1Irwin&Vijay, PSU, 2002 Low Power Design in Microarchitectures and Memories [Adapted from Mary Jane Irwin (
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Lecture 5 – Power Prof. Luke Theogarajan
Lecture 7: Power.
Lecture 7: Power.
Low Power Design of Integrated Systems Assoc. Prof. Dimitrios Soudris
The CMOS Inverter Slides adapted from:
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
EE466: VLSI Design Power Dissipation. Outline Motivation to estimate power dissipation Sources of power dissipation Dynamic power dissipation Static power.
CSE477 L26 System Power.1Irwin&Vijay, PSU, 2002 TKT-1527 Digital System Design Issues Low Power Techniques in Microarchitectures and Memories Mary Jane.
ENGG 6090 Topic Review1 How to reduce the power dissipation? Switching Activity Switched Capacitance Voltage Scaling.
Low Power Techniques in Processor Design
Power Reduction for FPGA using Multiple Vdd/Vth
GREEN COMPUTING Power Consumption Basics in ICT Products
Low-Power Wireless Sensor Networks
On-chip power distribution in deep submicron technologies
Logic Synthesis For Low Power CMOS Digital Design.
Dept. of Computer Science, UC Irvine
Logic Synthesis for Low Power(CHAPTER 6) 6.1 Introduction 6.2 Power Estimation Techniques 6.3 Power Minimization Techniques 6.4 Summary.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 19: October 15, 2014 Energy and Power.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Washington State University
LOGO Ultralow-Power Design in Near-Threshold Region Prof. : M. Shams Name: Yiqi Chang Student #:
XIAOYU HU AANCHAL GUPTA Multi Threshold Technique for High Speed and Low Power Consumption CMOS Circuits.
Guy Lemieux, Mehdi Alimadadi, Samad Sheikhaei, Shahriar Mirabbasi University of British Columbia, Canada Patrick Palmer University of Cambridge, UK SoC.
Leakage reduction techniques Three major leakage current components 1. Gate leakage ; ~ Vdd 4 2. Subthreshold ; ~ Vdd 3 3. P/N junction.
경종민 Low-Power Design for Embedded Processor.
Basics of Energy & Power Dissipation
Dec 3, 2008Sheth: MS Thesis1 A Hardware-Software Processor Architecture Using Pipeline Stalls For Leakage Power Management Khushboo Sheth Master’s Thesis.
Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion Pietro Babighian, Luca Benini, Alberto Macii, Enrico Macii ISLPED’04.
Introduction to Clock Tree Synthesis
Sp09 CMPEN 411 L14 S.1 CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 14: Designing for Low Power [Adapted from Rabaey’s Digital Integrated Circuits,
Patricia Gonzalez Divya Akella VLSI Class Project.
FPGA-Based System Design: Chapter 6 Copyright  2004 Prentice Hall PTR Topics n Low power design. n Pipelining.
A Class presentation for VLSI course by : Maryam Homayouni
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
Click to edit Master title style Progress Update Energy-Performance Characterization of CMOS/MTJ Hybrid Circuits Fengbo Ren 05/28/2010.
1 Dual-V cc SRAM Class presentation for Advanced VLSIPresenter:A.Sammak Adopted from: M. Khellah,A 4.2GHz 0.3mm 2 256kb Dual-V CC SRAM Building Block in.
CS203 – Advanced Computer Architecture
PROCEED: Pareto Optimization-based Circuit-level Evaluation Methodology for Emerging Devices Shaodi Wang, Andrew Pan, Chi-On Chui and Puneet Gupta Department.
LOW POWER DESIGN METHODS
Copyright © 2009, Intel Corporation. All rights reserved. Power Gate Design Optimization and Analysis with Silicon Correlation Results Yong Lee-Kee, Intel.
Power-Optimal Pipelining in Deep Submicron Technology
YASHWANT SINGH, D. BOOLCHANDANI
LOW POWER DESIGN METHODS V.ANANDI ASST.PROF,E&C MSRIT,BANGALORE.
SECTIONS 1-7 By Astha Chawla
Hot Chips, Slow Wires, Leaky Transistors
Microarchitectural Techniques for Power Gating of Execution Units
Circuit Design Techniques for Low Power DSPs
Lecture 7: Power.
Power and Heat Power Power dissipation in CMOS logic arises from the following sources: Dynamic power due to switching current from charging and discharging.
Lecture 7: Power.
Presentation transcript:

Keeping Hot Chips Cool Ruchir Puri, Leon Stok, Subhrajit Bhattacharya IBM T.J. Watson Research Center Yorktown Heights, NY Circuits R-US

So, What’s Going On ? At 65nm node Static Power is equal to Active Power  Clock distribution accounts for half of active power

Why Can’t We Keep Scaling V t ?

Low Power Opportunities Most of the Power reduction techniques exploit this positive slack. Power4 Timing Histogram 5%10%15%20% Exploiting positive slacks

Low Power Levers Structural Techniques  Voltage Islands  Multi-threshold devices  Multi-oxide devices  Minimize capacitance by custom design  Power efficient circuits  Parallelism in micro-architecture Dynamic Techniques  Clock gating  Power gating  Variable frequency  Variable voltage supply  Variable device threshold

Outline Clock & Latch Optimization Clock Power Active Power Leakage Power Voltage Islands Power Gating

Outline Clock & Latch Optimization Clock Power Active Power Leakage Power Voltage Islands Power Gating

Minimizing Active Power: Coarse Grained Voltage Islands Trade off power for delay by running functional blocks at different voltages Can use mix of Low and High V t to balance performance and leakage Switch off inactive blocks to reduce leakage power  E.g.: Telecom ASIC 1.0/1.2 V islands saved: 16 % active power 50 % standby power  High VT

Fine-Grained Voltage Islands Secondary power drop V ddl = 1.2V V ddh = 1.5V PowerPC 405 No timing degrade, and no area increase for the core!

Outline Clock & Latch Optimization Clock Power Active Power Leakage Power Voltage Islands Power Gating

Minimizing Clock Power: Local Clock buffer - Latch clustering Clocks consume large amount of power in high-performance designs  Large portion of that power goes to the last stage of the clock tree Minimize the Capacitive loading on local clock buffers by clustering latches around them.  Tradeoff between latch placement flexibility and clock power savings  Reduction in clock skew between capturing and launching latch compensates for loss in latch placement flexibility.

Clock Power Savings Reduces total capacitance on the local clock buffer by 25% Direct savings in clock power in the Random Control Logic

Outline Clock & Latch Optimization Clock Power Active Power Leakage Power Voltage Islands Power Gating

Minimizing Leakage Power: Power Supply Gating Leakage power is now more than switching power  Limits the performance of microprocessors Power gating is one of the most effective ways of minimizing leakage power  Cut-off power to inactive units/components Dynamic/workload based power gating  Reduces both gate and sub-threshold leakage  Over x reduction in leakage with little or no cycle time penalty.

L2 P1P2 P3 P4 Dedicated Units L2 P1P2 P3 P4 offon More Power Available to Scalar Units Higher SPEC Performance Dedicated Units Available for Higher Application Performance Performance on Demand Power Gating Concept

Normal Operation Mode CORE I ACTIVE V DS,LINEAR VDDL GNDL V DS V GS = V DD I DS To reduce the performance degradation, the voltage drop across SLEEP transistor should be minimized to reduce active leakage current. Requires sizing up of footer device I DS,MAX V GS = 0 V VGND

Sleep Mode CORE VGND VDDL GNDL V DS V GS = V DD I DS During the sleep mode, all of the internal capacitive nodes and VGND node are charged up to near V DD. Requires sizing down of footer device to reduce standby leakage. I DS,MAX V GS = 0 V

Wake-Up Mode CORE I TURN_ON VDDL GNDL V DS V GS = V DD I DS When the SLEEP transistor is turned on, the maximum instant current can flow. Requires sizing up of footer device. I DS,MAX V GS = 0 V VGND Rs

Sleep / Wake / Run State Control enable fence deassert wake/run run Enter sleep state charge off assert wake off discharge run Exit sleep state assert run disable fence sleep run (idle) discharge cycle (wake) charge cycles & )

< 1% Frequency Loss 10x-20x Leakage Reduction Footer Selection and Sizing 100x 50x 25x 20x 15.5x 33x Leakage Reduction

Power vs Performance Tradeoff ~8% Performance Degradation Due to Sleep Transistor with 1% area overhead Target Specification: 250MHz at 0.9V ~ 500MHz at 1.4V 1% footer size is used for a 2-stage pipelined 40-bit ALU 130nm Hardware

More Than 8% Performance Degradation Less Than 2% Performance Degradation 130nm Hardware Sleep Transistor Sizing and Performance

Leakage Power Reduction ~2000 x ~8.4 x Leakage Suppression using Power Gating Structure with 1% area overhead Leakage Suppression Using VDD Scaling 130nm Hardware

Physical Design: External Footer Switch

Physical Design: Internal Footer Switch Internal fine-grained power gating is more efficient in addressing:  Electro-Migration and Current Delivery.

Ground Redistribution M2 V1 M1 Contact M3 V2 Footer Cell Logic Device The ‘real’ chip-level ground distribution is M4 and above. It is unchanged by power gating Global ground Virtual ground This part of the redistribution is electrically similar to an unmodified distribution

Without FootersWith Footers Footer Rows Physical Design: Footer Insertion

Gated and non-gated logic have identical width 5% total area overhead for power gating 20X leakage reduction <1% performance degradation Power Gating in High-Performance Non-gated LogicGated Logic

Power Gating: Footer area overhead 10.4% 5.7% 10mV Virtual Ground

Conclusions Power is the limiting factor in traditional CMOS scaling and must be dealt with aggressively  Controlling leakage is crucial for future scaling  Power gating and voltage islands are effective techniques to minimize leakage and active power  Special consideration to clock distribution must be given in high performance designs to minimize clock power In order to keep hot chips cool, a holistic power minimization approach across the whole design stack is required which must include :  Device level techniques  Circuit level techniques  System level power management