Power Estimation Dr. Elwin Chandra Monie.

Slides:



Advertisements
Similar presentations
Switching circuits Composed of switching elements called “gates” that implement logical blocks or switching expressions Positive logic convention (active.
Advertisements

10/4-6/05ELEC / Lecture 111 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
Copyright Agrawal & Srivaths, 2007 Low-Power Design and Test, Lecture 3 1 Low-Power Design and Test Logic-Level Power Estimation Vishwani D. Agrawal Auburn.
A Look at Chapter 4: Circuit Characterization and Performance Estimation Knowing the source of delays in CMOS gates and being able to estimate them efficiently.
Minimum Dynamic Power CMOS Circuit Design by a Reduced Constraint Set Linear Program Tezaswi Raja Vishwani Agrawal Michael L. Bushnell Rutgers University,
Copyright 2001, Agrawal & BushnellDay-1 PM Lecture 4a1 Design for Testability Theory and Practice Lecture 4a: Simulation n What is simulation? n Design.
Aug 23, ‘021Low-Power Design Minimum Dynamic Power Design of CMOS Circuits by Linear Program Using Reduced Constraint Set Vishwani D. Agrawal Agere Systems,
Spring 07, Feb 13 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Binary Decision Diagrams Vishwani D. Agrawal James.
Design of Variable Input Delay Gates for Low Dynamic Power Circuits
Fall 2006, Oct. 31, Nov. 2 ELEC / Lecture 10 1 ELEC / (Fall 2006) Low-Power Design of Electronic Circuits Power Analysis:
Dec. 6, 2005ELEC Glitch Power1 Low power design: Insert delays to eliminate glitches Yijing Chen Dec.6, 2005 Auburn university.
8/19/04ELEC / ELEC / Advanced Topics in Electrical Engineering Designing VLSI for Low-Power and Self-Test Fall 2004 Vishwani.
Enhanced Dual-Transition Probabilistic Power Estimation with Selective Supergate Analysis Fei Huand Vishwani D. Agrawal Department of ECE, Auburn University,
Lecture 6 Testability Measures
Copyright Agrawal, 2007 ELEC6270 Fall 07, Lecture 3 1 ELEC 5270/6270 Fall 2007 Low-Power Design of Electronic Circuits Logic-Level Power Estimation Vishwani.
Nov. 8, 001Low-Power Design Digital Circuit Design for Minimum Transient Energy Vishwani D. Agrawal Circuits and Systems Research Lab, Agere Systems (Bell.
Spring 07, Feb 8 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Logic Equivalence Vishwani D. Agrawal James J.
8/18/05ELEC / Lecture 11 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
Lecture #24 Gates to circuits
11/2-4/04ELEC / ELEC / (Fall 2004) Advanced Topics in Electrical Engineering Designing VLSI for Low-Power and Self-Test.
Fall 2006, Nov. 28 ELEC / Lecture 11 1 ELEC / (Fall 2006) Low-Power Design of Electronic Circuits Power Analysis: High-Level.
May 28, 2003Minimum Dynamic Power CMOS1 Minimum Dynamic Power CMOS Circuits Vishwani D. Agrawal Rutgers University, Dept. of ECE Piscataway, NJ 08854
10/11/05ELEC / Lecture 121 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
9/29/05ELEC / Lecture 101 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
Fall 2006, Oct. 17 ELEC / Lecture 9 1 ELEC / (Fall 2006) Low-Power Design of Electronic Circuits Power Analysis: Logic Level.
Copyright 2001, Agrawal & BushnellVLSI Test: Lecture 81 Lecture 8 Testability Measures n Origins n Controllability and observability n SCOAP measures 
Modern VLSI Design 2e: Chapter 4 Copyright  1998 Prentice Hall PTR Topics n Crosstalk. n Power optimization.
Digital Integrated Circuits© Prentice Hall 1995 Combinational Logic COMBINATIONAL LOGIC.
Charles Kime & Thomas Kaminski © 2004 Pearson Education, Inc. Terms of Use (Hyperlinks are active in View Show mode) Terms of Use Lecture 11 – Design Concepts.
1 Delay Estimation Most digital designs have multiple data paths some of which are not critical. The critical path is defined as the path the offers the.
Modern VLSI Design 4e: Chapter 4 Copyright  2008 Wayne Wolf Topics n Interconnect design. n Crosstalk. n Power optimization.
A comparison between different logic synthesis techniques from the digital switching noise viewpoint G. Boselli, V. Ciriani, V. Liberali G. Trucco Dept.
EE 5900 Advanced Algorithms for Robust VLSI CAD, Spring 2009 Static Timing Analysis and Gate Sizing.
Chapter 07 Electronic Analysis of CMOS Logic Gates
EEE2243 Digital System Design Chapter 7: Advanced Design Considerations by Muhazam Mustapha, extracted from Intel Training Slides, April 2012.
4. Electrons and electronics 4.5 Digital electronics.
Modern VLSI Design 3e: Chapter 4 Copyright  1998, 2002 Prentice Hall PTR Topics n Interconnect design. n Crosstalk. n Power optimization.
4. Combinational Logic Networks Layout Design Methods 4. 2
March 28, Glitch Reduction for Altera Stratix II devices Tomasz S. Czajkowski PhD Candidate University of Toronto Supervisor: Professor Stephen D.
Solid-State Devices & Circuits
Manufacture Testing of Digital Circuits
Copyright Agrawal, 2007Lectures 5, 6, 7: Power Analysis1 CSV881: Low-Power Design Gate-Level Power Analysis Vishwani D. Agrawal James J. Danaher Professor.
ELEC Digital Logic Circuits Fall 2015 Delay and Power Vishwani D. Agrawal James J. Danaher Professor Department of Electrical and Computer Engineering.
ELEC Digital Logic Circuits Fall 2014 Delay and Power Vishwani D. Agrawal James J. Danaher Professor Department of Electrical and Computer Engineering.
COE 360 Principles of VLSI Design Delay. 2 Definitions.
Copyright 2001, Agrawal & BushnellVLSI Test: Lecture 61 Lecture 6 Logic Simulation n What is simulation? n Design verification n Circuit modeling n True-value.
Digital Integrated Circuits for Communication
VLSI Testing Lecture 5: Logic Simulation
VLSI Testing Lecture 5: Logic Simulation
Vishwani D. Agrawal James J. Danaher Professor
VLSI Testing Lecture 4: Testability Analysis
Vishwani D. Agrawal Department of ECE, Auburn University
VLSI Testing Lecture 6: Fault Simulation
Algorithms and representations Structural vs. functional test
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTES
VLSI Testing Lecture 6: Fault Simulation
Chapter 2 Interconnect Analysis
ELEC Digital Logic Circuits Fall 2014 Logic Testing (Chapter 12)
ELEC 5270/6270 Spring 2015 Low-Power Design of Electronic Circuits Pseudo-nMOS, Dynamic CMOS and Domino CMOS Logic Vishwani D. Agrawal James J. Danaher.
COMBINATIONAL LOGIC.
Lecture 5 Fault Modeling
Chapter 2 Interconnect Analysis Delay Modeling
Vishwani D. Agrawal James J. Danaher Professor
Homework 2 [due Jan. 31] [1] Given the circuit as shown below and a unit step voltage source at the input node s, use SPICE to simulate the circuit and.
FPGA Glitch Power Analysis and Reduction
ENGR 1 Presentation Thomas Matthews.
VLSI Testing Lecture 4: Testability Analysis
VLSI Testing Lecture 3: Fault Modeling
ELEC 7770 Advanced VLSI Design Spring 2012 Timing Simulation and STA
Presentation transcript:

Power Estimation Dr. Elwin Chandra Monie

Abstraction, Complexity, Accuracy Abstraction level Computing resources Analysis accuracy Algorithm Least Worst Software and system Hardware behavior Register transfer Logic Circuit Device Most Best

Spice Simulation Circuit/device level analysis Circuit modeled as network of transistors, capacitors, resistors and voltage/current sources. Node current equations using Kirchhoff’s current law. Average and instantaneous power computed from supply voltage and device current. Analysis is accurate but expensive Used to characterize parts of a larger circuit.

Gate-Level Power Analysis Pre-simulation analysis: Partition circuit into channel connected gate components. Determine node capacitances from layout analysis (accurate) or from wire-load model* (approximate). Determine dynamic and static power from Spice for each gate. Determine gate delays using Spice or Elmore delay model. * Wire-load model estimates capacitance of a net by its pin-count. See Yeap, p. 39.

Gate-Level Power Analysis (Cont.) Run discrete-event (event-driven) logic simulation with a set of input vectors. Monitor the toggle count of each net and obtain capacitive power dissipation: Pcap = Σ Ck V 2 f all nodes k Where: Ck is the total node capacitance being switched, as determined by the simulator. V is the supply voltage. f is the clock frequency, i.e., the number of vectors applied per unit time

Gate-Level Power Analysis (Cont.) Monitor dynamic energy events at the input of each gate and obtain internal switching power dissipation: Pint = Σ Σ E(g,e) F(g,e) gates g events e Where E(g,e) = energy of event e of gate g, pre-computed from Spice. F(g,e) = occurrence frequency of the event e at gate g, observed by logic simulation.

Gate-Level Power Analysis (Cont.) Monitor the static power dissipation state of each gate and obtain the static power dissipation: Pstat = Σ Σ P(g,s) T(g,s)/ T gates g states s Where P(g,s) = static power dissipation of gate g for state s, obtained from Spice. T(g,s) = duration of state s at gate g, obtained from logic simulation. T = vector period.

Gate-Level Power Analysis Sum up all three components of power: P = Pcap + Pint + Pstat

Switching Frequency Number of transitions per unit time: N(t) T = ─── For a continuous signal: T = lim ─── t→∞ t T is defined as transition density.

Static Signal Probabilities Observe signal for interval t 0 + t 1 Signal is 1 for duration t 1 Signal is 0 for duration t 0 Signal probabilities: p 1 = t 1/(t 0 + t 1) p 0 = t 0/(t 0 + t 1) = 1 – p 1

Static Transition Probabilities T 01 = p 0 Prob{signal is 1 | signal was 0} = p 0 p1 T 10 = p 1 Prob{signal is 0 | signal was 1} = p 1 p 0 T = T 01 + T 10 = 2 p 0 p 1 = 2 p 1 (1 – p 1) Transition density: T = 2 p 1 (1 – p 1)

Static Transition Frequency 0.25 0.2 0.1 0.0 f = p1(1 – p1) 0 0.25 0.5 0.75 1.0 p1

Inaccuracy in Transition Density p1 = 0.5 T = 1.0 1/fck p1 = 0.5 T = 4/6 p1 = 0.5 T = 1/6 Observe that the formula, T = 2 p1 (1 – p1), is not correct.

Inaccuracy in Transition Density p1 = 0.5 T = 1.0 1/fck p1 = 0.5 T = 4/6 p1 = 0.5 T = 1/6 Observe that the formula, T = 2 p1 (1 – p1), is not correct.

Cause for Error and Correction Probability of transition is not independent of the present state of the signal. Determine probability p 01 of a 0→1 transition. Recognize p 01 ≠ p 0 × p 1 We obtain p 1 = (1 – p 1)p 01 + p 1 p 11 p 01 p 1 = ───────── 1 – p 11 + p 01

Correction (Cont.) Since p 11 + p 10 = 1, i.e., given that the signal was previously 1, its present value can be either 1 or 0. Therefore, p 01 p 1 = ────── p 10 + p 01 This uniquely gives signal probability as a function of transition probabilities.

Transition and Signal Probabilities p01 = p10 = 0.5 p1 = 0.5 1/fck p01 = p10 = 1/3 p1 = 0.5 p01 = p10 = 1/6 p1 = 0.5

Probabilities: p0, p1, p00, p01, p10, p11

Transition Density T = 2 p 1 (1 – p 1) = p 0 p 01 + p 1 p 10

Power Calculation Power can be estimated if transition density is known for all signals. Calculation of transition density requires Signal probabilities Transition densities for primary inputs; computed from vector statistics

Signal Probabilities x1 x2 x1 x2 x1 x2 x1 + x2 – x1x2 x1 1 - x1

Signal Probabilities 0.5 x1 x2 x3 x1 x2 0.5 0.25 0.625 0.5 y = 1 - (1 - x1x2) x3 = 1 - x3 + x1x2x3 = 0.625 X1 X2 X3 Y 0 0 0 1 0 0 1 0 0 1 0 1 0 1 1 0 1 0 0 1 1 0 1 0 1 1 0 1 1 1 1 1 Ref: K. P. Parker and E. J. McCluskey, “Probabilistic Treatment of General Combinational Networks,” IEEE Trans. on Computers, vol. C-24, no. 6, pp. 668-670, June 1975.

Correlated Signal Probabilities 0.5 x1 x2 x1 x2 0.5 0.25 0.625? y = 1 - (1 - x1x2) x2 = 1 – x2 + x1x2x2 = 1 – x2 + x1x2 = 0.75 (correct value) X1 X2 Y 0 0 1 0 1 0 1 0 1 1 1 1

Correlated Signal Probabilities 0.5 x1 + x2 – x1x2 x1 x2 0.5 0.75 0.375? y = (x1 + x2 – x1x2) x2 = x1x2 + x2x2 – x1x2x2 = x1x2 + x2 – x1x2 = x2 = 0.5 (correct value) X1 X2 Y 0 0 0 0 1 1 1 0 0 1 1 1

Observation Numerical computation of signal probabilities is accurate for fanout-free circuits.

Remedies Use Shannon’s expansion theorem to compute signal probabilities. Use Boolean difference formula to compute transition densities.

Shannon’s Expansion Theorem C. E. Shannon, “A Symbolic Analysis of Relay and Switching Circuits,” Trans. AIEE, vol. 57, pp. 713-723, 1938. Consider: Boolean variables, X1, X2, . . . , Xn Boolean function, F(X1, X2, . . . , Xn) Then F = Xi F(Xi=1) + Xi’ F(Xi=0) Where Xi’ is complement of X1 Cofactors, F(Xi=j) = F(X1, X2, . . , Xi=j, . . , Xn), j = 0 or 1

Expansion About Two Inputs F = XiXj F(Xi=1, Xj=1) + XiXj’ F(Xi=1, Xj=0) + Xi’Xj F(Xi=0, Xj=1) + Xi’Xj’ F(Xi=0, Xj=0) In general, a Boolean function can be expanded about any number of input variables. Expansion about k variables will have 2k terms.

Correlated Signal Probabilities X1 X2 X1 X2 Y = X1 X2 + X2’ X1 X2 Y 0 0 1 0 1 0 1 0 1 1 1 1 Shannon expansion about the reconverging input, X2: Y = X2 Y(X2 = 1) + X2’ Y(X2 = 0) = X2 (X1) + X2’ (1)

Correlated Signals When the output function is expanded about all reconverging input variables, All cofactors correspond to fanout-free circuits. Signal probabilities for cofactor outputs can be calculated without error. A weighted sum of cofactor probabilities gives the correct probability of the output. For two reconverging inputs: f = xixj f(Xi=1, Xj=1) + xi(1-xj) f(Xi=1, Xj=0) + (1-xi)xj f(Xi=0, Xj=1) + (1-xi)(1-xj) f(Xi=0, Xj=0)

Correlated Signal Probabilities X1 X2 X1 X2 Y = X1 X2 + X2’ Shannon expansion about the reconverging input, X2: Y = X2 Y(X2=1) + X2’ Y(X2=0) = X2 (X1) + X2’ (1) y = x2 (0.5) + (1-x2) (1) = 0.5 (0.5) + (1-0.5) (1) = 0.75 X1 X2 Y 0 0 1 0 1 0 1 0 1 1 1 1

Example 0.5 Supergate 0.25 Point of reconv. 0.5 0.0 0.5 1.0 0.5 1 0.0 0.0 1.0 0.5 0.375 0.5 Reconv. signal Signal probability for supergate output = 0.5 Prob{rec. signal = 1} + 1.0 Prob{rec. signal = 0} = 0.5 × 0.5 + 1.0 × 0.5 = 0.75 S. C. Seth and V. D. Agrawal, “A New Model for Computation of Probabilistic Testability in Combinational Circuits,” Integration, the VLSI Journal, vol. 7, no. 1, pp. 49-75, April 1989.

Probability Calculation Algorithm Partition circuit into supergates. Definition: A supergate is a circuit partition with a single output such that all fanouts that reconverge at the output are contained within the supergate. Identify reconverging and non-reconverging inputs of each supergate. Compute signal probabilities from PI to PO: For a supergate whose input probabilities are known Enumerate reconverging input states For each input state do gate by gate probability computation Sum up corresponding signal probabilities, weighted by state probabilities

Calculating Transition Density 1 Boolean function x1, T1 . xn, Tn y, T(Y) = ? n

Boolean Difference ∂Y Boolean diff(Y, Xi) = ── = Y(Xi=1) ⊕ Y(Xi=0) ∂Xi Boolean diff(Y, Xi) = 1 means that a path is sensitized from input Xi to output Y. Prob(Boolean diff(Y, Xi) = 1) is the probability of transmitting a toggle from Xi to Y. Probability of Boolean difference is determined from the probabilities of cofactors of Y with respect to Xi. F. F. Sellers, M. Y. Hsiao and L. W. Bearnson, “Analyzing Errors with the Boolean Difference,” IEEE Trans. on Computers, vol. C-17, no. 7, pp. 676-683, July 1968.

Transition Density n T(y) = Σ T(Xi) Prob(Boolean diff(Y, Xi) = 1) i=1 F. Najm, “Transition Density: A New Measure of Activity in Digital Circuits,” IEEE Trans. CAD, vol. 12, pp. 310-323, Feb. 1993.

Power Computation For each primary input, determine signal probability and transition density for given vectors. For each internal node and primary output Y, find the transition density T(Y), using supergate partitioning and the Boolean difference formula. Compute power, P = Σ 0.5CY V2 T(Y) all Y where CY is the capacitance of node Y and V is supply voltage.

Transition Density and Power 0.2, 1 X1 X2 X3 0.06, 0.7 0.3, 2 Ci 0.436, 3.24 Y 0.4, 3 CY Transition density Signal probability Power = 0.5 V 2 (0.7Ci + 3.24CY)

Prob. Method vs. Logic Sim. Circuit No. of gates Probability method Logic Simulation Error % Av. density CPU s* C432 160 3.46 0.52 3.39 63 +2.1 C499 202 11.36 0.58 8.57 241 +29.8 C880 383 2.78 1.06 3.25 132 -14.5 C1355 346 4.19 1.39 6.18 408 -32.2 C1908 880 2.97 2.00 5.01 464 -40.7 C2670 1193 3.50 3.45 4.00 619 -12.5 C3540 1669 4.47 3.77 4.49 1082 -0.4 C5315 2307 3.52 6.41 4.79 1616 -26.5 C6288 2406 25.10 5.67 34.17 31057 C7552 3512 3.83 9.85 5.08 2713 -24.2 * CONVEX c240

Problem 1 For equiprobable inputs analyze the 0→1 transition probabilities of all gates in the two implementations of a four-input AND gate shown below. Assuming that the gates have zero delays, which implementation will consume less average dynamic power? E E A B C D A B C D F G G F Chain structure Tree structure

Total transitions/vector Problem 1 Solution Given the primary input probabilities, P(A) = P(B) = P(C) = P(D) = 0.5, signal and transition (0→1) probabilities are as follows: Signal name Chain Tree Prob(sig.= 1) Prob(0→1) Prob(sig.=1) E 0.2500 0.1875 F 0.1250 0.1094 G 0.0625 0.0586 Total transitions/vector 0.3555 0.4336 The tree implementation consumes 100×(0.4336 – 0.3555)/0.3555 = 22% more average dynamic power. This advantage of the chain structure may be somewhat reduced because of glitches caused by unbalanced path delays.

Problem 2 Assume that the two-input AND gates in Problem 1 each has one unit of delay. Find input vector pairs for each implementation that will consume the peak dynamic power. Which implementation consumes less peak dynamic power? E E A B C D A B C D F G G F Chain structure Tree structure

Problem 2 Solution For the chain structure, a vector pair {A B C D} = {1110},{1011} will produce four gate transitions as shown below. A B C D E F G A=11 B=10 E=10 C=11 F=10 D=01 G=00 Time units 0 1 2 3

Problem 2 Solution (Cont.) The tree structure has balanced delay paths. So it cannot make more than 3 gate transitions. A vector pair {ABCD} = {1111},{1010} will produce three transitions as shown below. A B C D E F G A=11 B=10 E=10 C=11 D=10 F=10 G=10 Time units 0 1 2 3 Therefore, just counting the gate transitions, we find that the chain consumes 100(4 – 3)/3 = 33% higher peak power than the tree.