IAY 0600 Digital Systems Design

Slides:

Advertisements

Similar presentations

Topics Electrical properties of static combinational gates:

Advertisements

UNIT 4 BASIC CIRCUIT DESIGN CONCEPTS

Power Reduction Techniques For Microprocessor Systems

Introduction to CMOS VLSI Design Lecture 18: Design for Low Power David Harris Harvey Mudd College Spring 2004.

S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 14: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.

S. RossEECS 40 Spring 2003 Lecture 28 Today… Analyzing digital computation at a very low level! The Latch Pipelined Datapath Control Signals Concept of.

Mehdi Amirijoo1 Power estimation n General power dissipation in CMOS n High-level power estimation metrics n Power estimation of the HW part.

S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.

Lecture 5 – Power Prof. Luke Theogarajan

IAY 0600 Digitaalsüsteemide disain FSM Decomposition Synthesis Lab. 6 Alexander Sudnitson Tallinn University of Technology.

Lecture 7: Power.

Lecture 21, Slide 1EECS40, Fall 2004Prof. White Lecture #21 OUTLINE –Sequential logic circuits –Fan-out –Propagation delay –CMOS power consumption Reading:

Power, Energy and Delay Static CMOS is an attractive design style because of its good noise margins, ideal voltage transfer characteristics, full logic.

Digital Integrated Circuits for Communication

EE466: VLSI Design Power Dissipation. Outline Motivation to estimate power dissipation Sources of power dissipation Dynamic power dissipation Static power.

Power Reduction for FPGA using Multiple Vdd/Vth

Lecture 2 1 Computer Elements Transistors (computing) –How can they be connected to do something useful? –How do we evaluate how fast a logic block is?

Logic Synthesis For Low Power CMOS Digital Design.

Logic Synthesis for Low Power(CHAPTER 6) 6.1 Introduction 6.2 Power Estimation Techniques 6.3 Power Minimization Techniques 6.4 Summary.

Basics of Energy & Power Dissipation Lecture notes S. Yalamanchili, S. Mukhopadhyay. A. Chowdhary.

Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.

1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University

Modern VLSI Design 2e: Chapter 3 Copyright  1998 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;

Why Low Power Testing? 台大電子所李建模.

1 Interconnect/Via. 2 Delay of Devices and Interconnect.

Basics of Energy & Power Dissipation

Bi-CMOS Prakash B.

Introduction to Clock Tree Synthesis

FPGA-Based System Design: Chapter 6 Copyright  2004 Prentice Hall PTR Topics n Low power design. n Pipelining.

EE141 © Digital Integrated Circuits 2nd Combinational Circuits 1 A few notes for your design  Finger and multiplier in schematic design  Parametric analysis.

Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;

CDA 4253 FPGA System Design RTL Design Methodology 1 Hao Zheng Comp Sci & Eng USF.

Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.

EE 466/586 VLSI Design Partha Pande School of EECS Washington State University

CS203 – Advanced Computer Architecture

LOW POWER DESIGN METHODS

COE 360 Principles of VLSI Design Delay. 2 Definitions.

CMOS OUTLINE » Fan-out » Propagation delay » CMOS power consumption.

COMP541 Transistors and all that… a brief overview

Smruti R. Sarangi IIT Delhi

Digital Design - Sequential Logic Design

CS203 – Advanced Computer Architecture

Lecture 11: Sequential Circuit Design

Digital Integrated Circuits for Communication

COMP541 Transistors and all that… a brief overview

VLSI Testing Lecture 5: Logic Simulation

The Inverter EE4271 VLSI Design Professor Shiyan Hu Office: EERC 518

LOW POWER DESIGN METHODS V.ANANDI ASST.PROF,E&C MSRIT,BANGALORE.

VLSI Testing Lecture 5: Logic Simulation

Estimate power saving by clock slowdown for s5378 in 180nm and 32nm CMOS Chao Han ELEC 6270.

Vishwani D. Agrawal Department of ECE, Auburn University

Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2

Day 16: September 15, 2010 Energy and Power

COOLRUNNER II REAL DIGITAL CPLD

Created by Luis Chioye Presented by Cynthia Sosa

332:479 Concepts in VLSI Design Lecture 24 Power Estimation

OUTLINE » Fan-out » Propagation delay » CMOS power consumption

Design Technologies Custom Std Cell Performance Gate Array FPGA Cost.

Dual Mode Logic An approach for high speed and energy efficient design

CSE 370 – Winter Sequential Logic - 1

Instructor：Po-Yu Kuo 教師：郭柏佑

Day 21: October 29, 2010 Registers Dynamic Logic

The Inverter EE4271 VLSI Design Dr. Shiyan Hu Office: EERC 731

Lecture 7: Power.

Lecture 7: Power.

Reading: Hambley Ch. 7; Rabaey et al. Secs. 5.2, 5.5, 6.2.1

Low Power Digital Design

COMP541 Transistors and all that… a brief overview

Combinational Circuits

Lecture 4 Sequential units. Registers

Presentation transcript:

IAY 0600 Digital Systems Design Low Power Design Lab. 6 IAY 0600 Digital Systems Design Alexander Sudnitson Tallinn University of Technology

Motivation for low power design Low power design is important from different reasons Device temperature Failure rate, Cooling and packaging costs Life of the battery Meantime between charging, System cost Environment Overall energy consumption

Power and Energy Power is drawn from a voltage source attached to the VDD pin(s) of a chip. Instantaneous Power: Energy: Average Power:

Low Power or Low Energy design E(T) = ∫ P(t) dt Power Direct impact on instantaneous energy consumption and temperature Power consumption is critical for heat dissipation limited systems Energy Power integrated over time is energy and impact on battery shelf life and environment Energy consumption is critical for battery-powered systems

CMOS We will restrict our attention to CMOS devices, this technology being the most widely adopted in current VLSI systems. Static, complementary CMOS gates are remarkably efficient in their use of power to perform computation However, leakage increasingly threatens to drive up chip power consumption We consider inverter as circuit used for power consumption analysis

Power consumption analysis Vin Vout VDD GND Static dissipation due to leakage circuit Short-circuit dissipation Charge and discharge of a load capacitor P = Pdyn + Psc + Plk Pdyn is dynamic or switching power (is due to charging and discharging load capacitances); Psc is shirt-circuit power; Plk is leakage power (is static in nature)

Dynamic Energy Consumption Vin Vout CL Vdd Transition Power Energy/transition = 1/2*CL * VDD2 Total energy (both charge and discharge) = CL * VDD2 Power = CL * VDD2 * f

Consumption in CMOS Voltage (Volt, V) Water pressure (bar) Current (Ampere, A) Water quantity per second (liter/s) Energy Amount of Water CL 1 Energy consumption is proportional to capacitive load! Source: Frank Sill (Federal University of Minas Gerais)

Leakage energy Short-circuit Power Vin Vout CL Vdd Energy/transition = tsc * VDD * Ipeak * P 0/11/0 Power = tsc * VDD * Ipeak * f

Independent of switching Leakage energy Vout Sub-threshold current Drain junction leakage OFF Gate leakage Independent of switching

Define and quantity power For CMOS chips, traditional dominant energy consumption has been in switching transistors, called dynamic power For mobile devices, energy better metric For a fixed task, slowing clock rate (frequency switched) reduces power, but not energy Dropping voltage helps both

Energy and performance In some cases, energy can be saved by reducing performance. Speed decreases linearly, power decreases as V2. Power goes down faster than performance. Example of quantifying power Suppose 15% reduction in voltage results in a 15% reduction in frequency. What is impact on dynamic power?

Activity factor Suppose the system clock frequency = f Let fsw = af, where a = activity factor If the signal is a clock, a = 1 If the signal switches once per cycle, a = ½ Dynamic power:

Rules for reducing power consumption Turn it off. Eliminates leakage current. Slow it down, reduce voltage. Performance is linear with clock frequency. Power is V2. Don’t change its inputs. Activity-dependent.

Levels of abstraction Physical: Minimize capacitance. Gate: Use low leakage gates. Combinational: Avoid switches. Register-transfer: Avoid using units. Architecture: Slow things down, turn them off.

Logic/circuit optimizations Turn off gate where possible. Not an option in most FPGAs, but it should be. Operate gate at low voltage. Speed decreases linearly, power decreases as V2.

State assignment for low power State assignment for low power has also been explored. In general, the state assignment problem has targeted minimizing area, and this approach tends to reduce power as well. Low-power state assignment techniques assignment augment the state transition graph of the state machine with the state probabilities and transition probabilities between states, and use these probabilities to guide the state assignment. Adjacent binary encodings are assigned to states connected with high probability edges of the graph. This minimizes the number of state signal transitions, thus attempting to minimize transitions in the next state and output signal combinational logic. One approach attempts to minimize area in conjunction with switching activity by generating multiple sets of state encodings with similar switching energy costs from which a final assignment is chosen on the basis of area.

State assignment impact on power (counter encoding ex.) Gray Code Binary Code S0 000 S1 001 S2 011 010 S3 S4 110 100 S5 111 101 S6 S7 Total number of transitions 8 14 Max transitions per clock cycle 1 3 Table compares Gray and binary state assignments. Comparison shows that Gray technique reduces both the average number of logic transitions per clock and the overall number of transitions for a cycle of the state machine.

Register-transfer optimizations Hold inputs when a unit’s output will not be used. Put register at inputs. Turn off units when they won’t be used for several cycles. Can’t selectively turn off LEs in most FPGAs. Not an option in most FPGAs, but it should be.

Guard evaluation Guard Latches S Combinational Logic S Guarded evaluation relies on input blocking for transition reduction. Transparent latches are added to inputs of existing logic and are appropriately disabled when the logic output can be determined without new input values being driven from the disabled latches. This technique is common in the design of datapath functions in low-power processors.

Clock gating & Enable D Q C CLK

Circuit with clock drivers and clock gating CLK CLK GATING SIGNAL R1 R2 R3 CL3 R4 CL4 &

FSM stochastic analysis Given the FSM description and the input probabilities, the probabilistic behavior of a FSM can be studied by regarding its transition structure as a Markov chain. A Markov process is a stochastic process, where the past has no influence on the future. In other words, the future behavior depends only on the current state of the process (a “Markov property”). Markov process is called a Markov chain (MC) if its state space is discrete (either finite or countable). One example of MC is the process of playing a board game, where player's next action is determined entirely by rolling a dice. In order to make a move, one takes into account only the current state of the board. It doesn't really matter how the game progressed to that state. Alternatively, in a card game player's move is motivated not only by the cards he or she currently holds, but also the cards which have already been used during the course of the game. Using steady state probabilities, which are received in the result of such analysis, it is possible to build different kinds of quantitative estimations of FSM’s stochastic behavior.

A Case Study: Low-Power Design To demonstrate the use of applets in conjunction with FPGA-based development boards, the procedure of computational kernel extraction and implementation will be considered in Lab. Sequential circuits may have an extremely large number of reachable states, but probabilistic analysis show that during normal operation only a relatively small subset is actually being visited. A power optimization paradigm is based on the concept of computational kernel, a highly optimized logic block, which mimics the steady-state behaviour of the original specification.

Probability distribution of the FSM The first step of computational kernel extraction procedure is probabilistic analysis of the FSM. State Steady state probability init0 0.5000001408 init1 0.3346775136 init2 0.0877016376 init4 0.0584677584 IOwait 0.0161290368 read0 0.0006720432 write0 RMACK WMACK read1 0.0003360216 It is seen that FSM “opus”-benchmark spends 83% of its operation time in states “init0” and “init1”.

Decomposed FSM network After computational kernel is identified, it should be separated from the rest of the circuit. The applet of additive decomposition is used to divide the original circuit into two alternatively working sub-FSMs.

Decomposition applet

Implementation summary VHDL description for prototype FSM and decomposed network can be generated by decomposition applet. This descriptions are used to implement and verify both designs using FPGA-based development board. XPower Analyzer is a tool for power consumption estimation featured in Xilinx ISE. It is used to evaluate the quality of the decomposed design in comparison with the original. Design Area (LUTs) Power Consumptions (mW) Original 25 4.65 Decomposed 36 1.85 As it is seen from the table, the dynamic power consumption has been reduced by the factor of 2.5, while area overhead is 44%.