Load-Sensitive Flip-Flop Characterization

Slides:



Advertisements
Similar presentations
Digital System Clocking: High-Performance and Low-Power Aspects Vojin G. Oklobdzija, Vladimir M. Stojanovic, Dejan M. Markovic, Nikola M. Nedovic Wiley-Interscience.
Advertisements

Logic Gate Delay Modeling -1 Bishnu Prasad Das Research Scholar CEDT, IISc, Bangalore
Topics Electrical properties of static combinational gates:
Selected Design Topics. Integrated Circuits Integrated circuit (informally, a chip) is a semiconductor crystal (most often silicon) containing the electronic.
Introduction to CMOS VLSI Design Sequential Circuits.
VLSI Design EE 447/547 Sequential circuits 1 EE 447/547 VLSI Design Lecture 9: Sequential Circuits.
MICROELETTRONICA Sequential circuits Lection 7.
Logical Effort A Method to Optimize Circuit Topology Swarthmore College E77 VLSI Design Adem Kader David Luong Mark Piper December 6, 2005.
Fall 06, Sep 19, 21 ELEC / Lecture 6 1 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic.
Logical Effort.
Prof. John Nestor ECE Department Lafayette College Easton, Pennsylvania ECE VLSI Circuit Design Lecture 13 - More about.
1 MICROELETTRONICA Logical Effort and delay Lection 4.
EE141 © Digital Integrated Circuits 2nd Combinational Circuits 1 Logical Effort - sizing for speed.
EE 447 VLSI Design Lecture 5: Logical Effort. EE 447 VLSI Design 5: Logical Effort2 Outline Introduction Delay in a Logic Gate Multistage Logic Networks.
Introduction to CMOS VLSI Design Lecture 10: Sequential Circuits Credits: David Harris Harvey Mudd College (Material taken/adapted from Harris’ lecture.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 5.1 EE4800 CMOS Digital IC Design & Analysis Lecture 5 Logic Effort Zhuo Feng.
Author: D. Brooks, V.Tiwari and M. Martonosi Reviewer: Junxia Ma
The CMOS Inverter Slides adapted from:
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 3 ASIC.
Digital System Clocking:
© Digital Integrated Circuits 2nd Sequential Circuits Digital Integrated Circuits A Design Perspective Designing Sequential Logic Circuits Jan M. Rabaey.
Ronny Krashinsky Seongmoo Heo Michael Zhang Krste Asanovic MIT Laboratory for Computer Science SyCHOSys Synchronous.
Logical Effort: optimal CMOS device sizing Albert Chun (M.A.Sc. Candidate) Ottawa-Carleton Institute for Electrical & Computer Engineering (OCIECE) Ottawa,
Modern VLSI Design 2e: Chapter 3 Copyright  1998 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
Optimal digital circuit design Mohammad Sharifkhani.
Introduction to CMOS VLSI Design Lecture 5: Logical Effort GRECO-CIn-UFPE Harvey Mudd College Spring 2004.
XIAOYU HU AANCHAL GUPTA Multi Threshold Technique for High Speed and Low Power Consumption CMOS Circuits.
Linear Delay Model In general the propagation delay of a gate can be written as: d = f + p –p is the delay due to intrinsic capacitance. –f is the effort.
Physical Synthesis Buffer Insertion, Gate Sizing, Wire Sizing,
1 Energy-Efficient Register Access Jessica H. Tseng and Krste Asanović MIT Laboratory for Computer Science, Cambridge, MA 02139, USA SBCCI2000.
EE141 © Digital Integrated Circuits 2nd Combinational Circuits 1 A few notes for your design  Finger and multiplier in schematic design  Parametric analysis.
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
EE415 VLSI Design THE INVERTER [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Modern VLSI Design 3e: Chapter 4 Copyright  1998, 2002 Prentice Hall PTR Topics n Transistor sizing: –Spice analysis. –Logical effort.
Load-Sensitive Flip-Flop Characterization Seongmoo Heo and Krste Asanović MIT Laboratory for Computer Science WVLSI 2001.
Implementation of LFSR Counter Using CMOS VLSI Technology.
1 EE 382M VLSI 1 EE 360R Computer-Aided Integrated Circuit Design Lab 1 Demo Fall 2011 Whitney J. Wadlow.
COE 360 Principles of VLSI Design Delay. 2 Definitions.
Power-Optimal Pipelining in Deep Submicron Technology
Digital Integrated Circuits A Design Perspective
Lecture 11: Sequential Circuit Design
Copyright © 2004 The McGraw-Hill Companies, Inc. All rights reserved.
Digital Integrated Circuits A Design Perspective
Digital Fundamentals Floyd Chapter 7 Tenth Edition
Written by Whitney J. Wadlow
Timing issues.
Day 29: November 18, 2011 Dynamic Logic
SEQUENTIAL LOGIC -II.
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
Introduction to CMOS VLSI Design Lecture 10: Sequential Circuits
Timing Analysis 11/21/2018.
University of California Davis
Lecture 6: Logical Effort
Topics Circuit design for FPGAs: Logic elements. Interconnect.
Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003
Lecture 6: Logical Effort
Introduction to CMOS VLSI Design Lecture 5: Logical Effort
Estimating Delays Would be nice to have a “back of the envelope” method for sizing gates for speed Logical Effort Book by Sutherland, Sproull, Harris Chapter.
332:578 Deep Submicron VLSI Design Lecture 14 Design for Clock Skew
RC Modeling and Logical Effort Basics
Characterization of C2MOS Flip-Flop in Sub-Threshold Region
Lecture 6: Logical Effort
Activity-Sensitive Flip-Flop and Latch Selection for Reduced Energy
VLSI Testing Lecture 7: Delay Test
Day 16: October 12, 2012 Performance: Gates
Lecture 26 Logic BIST Architectures
Reading: Hambley Ch. 7; Rabaey et al. Secs. 5.2, 5.5, 6.2.1
Low Power Digital Design
Advanced Logical Effort
COMBINATIONAL LOGIC - 2.
Presentation transcript:

Load-Sensitive Flip-Flop Characterization Seongmoo Heo and Krste Asanović MIT Laboratory for Computer Science http://www.cag.lcs.mit.edu/scale WVLSI 2001 April 19, 2001

Motivation Flip-flops are one of the most important components in synchronous VLSI designs. Critical effect on cycle time Large fraction of total system power Previously published work has failed to consider the effect of circuit loading on the relative ranking of flip-flop structures. [Kawaguchi et al. ’98] [Ko and Balsara ’00] [Kong et al ’00] [Lang et al ’97] [Nikolic et al ’00] [Nogawa and Ohtomo ’98] [Stojanovic and Oklobdzija ’99] [Stollo et al ’00] [Yuan and Svensson ’91] [Zyuban and Kogge ’99] [H.P. et al ’96] [J.M. et al ’96] Fixed and usually overly large output load Large or non-specified input drive No output buffering

Observation Different flip-flop designs have different inherent parasitics and output drive strength. Different number and complexity of logic gates Different kinds of feedback

Observation Different flip-flop designs have different inherent parasitics and output drive strength. Different number and complexity of logic gates Different kinds of feedback D Q D Q D Q

Observation 2. Output loads in a circuit vary significantly. Flip-flop output load instances in a microprocessor datapath 120 (A custom-designed 32-bit MIPS CPU in 0.25mm process) 100 80 7.2fF (4 min inv gate cap) 60 115.2fF (64 min inv gate cap) # of instances 28.8fF (16 min inv gate cap) 40 20 1.8fF (min inv gate cap)

Our Proposal Load effects must be considered in flip-flop characterization to avoid sub-optimal selection. We will present energy and delay measurements for various flip-flops across a range of output loading conditions(EE and absolute load size) and show that the relative rankings of structures vary. We will show that output buffering at high load can lead to the better performance and energy consumption for some structures.

Related Work Traditional Buffer Sizing Logical Effort [Sutherland and Sproull] Logical Effort: drive strength of a circuit structure Electrical Effort: the ratio of output load to input load Delay = intrinsic parasitic delay + LE x EE

Overview Flip-Flop Designs Test Bench & Simulation Setup Delay and Energy Characterization Delay Analysis Energy-versus-Delay Analysis Summary

Flip-Flop Designs Fully static and single-ended [Nikolic et al ’00]

Test Bench Sized clock buffer to give equal rise/fall time Used a fixed, realistic input driver Varied output load from 4 min inv cap(7.2fF) to 64 min inv cap(115.2fF). 4 Load and Drive Configurations EE4-min: min input drive, 4 min inv load (7.2fF) EE16-min: min input drive, 16 min inv load (28.8fF) EE64-min: min input drive, 64 min inv load (115.2fF) EE4-big: 16x min input drive, 64 min inv load (115.2fF) 4 min inv cap 16 min inv cap 64 min inv cap FF

Simulation Setup 0.25μm TSMC CMOS process, Vdd=2.5V, T=25°C Hspice Levenberg-Marquardt method was used for transistor size optimization. Transistor widths optimized for each load and drive conf. to give min delay or min energy for a given delay (transistor lengths were fixed at minimum.) Parasitic capacitances included in the circuit netlists.

Delay and Energy Characterization Minimum D-Q delay [Stojanovic et al. ’99] (.Measure command) Total energy = input energy + internal energy + clock energy – output energy A single test waveform with ungated clock and data toggling every cycle For a full characterization of energy dissipation, more realistic activity patterns should be considered [Heo, Krashinsky, Asanovic ARVLSI’01]. FF 4 min inv load 16 min inv load 64 min inv load

Speed Ranking Without Buffering 3.5 (Transistors sized at each load point, but only for min delay) 3.0 Delay = const. intrinsic parasitic delay + output drive delay (= load size × driving capability) Driving Capability = f(# of stages, complexity) PPCFF SAFF 1.5 MSAFF HLFF 1.0 SSAPL 0.5

Influence of Buffering on Performance (Assuming no penalty for inverting output) 1.5 1 0.5 PPCFF SAFF MSAFF Delay (ns) 0 20 40 80 Load (min inv cap) : unbuffered : one inverter : two inverters (Min. input drive was used.) 1.5 1 0.5 HLFF SSAPL 0 20 40 80

Speed Ranking With Buffering Allowed 3.5 Less speed variation compared to original flip-flops 3.0 PPCFF SAFF MSAFF HLFF 1.5 SSAPL 1.0 0.5

Energy-Delay Curve : EE4-min Each point sized for min energy for a given delay Energy(fJ) Delay(ns) EE4-min: min. drive + 4 min inv load(7.2fF)

Energy-Delay Curve : EE4-min Energy(fJ) PPCFF-unbuf Delay(ns) EE4-min: min. drive + 4 min inv load(7.2fF)

Energy-Delay Curve : EE16-min SSAPL-unbuf Energy(fJ) Delay(ns) EE16-min: min. drive + 16 min inv load(28.8fF)

Energy-Delay Curve : EE16-min SSAPL-unbuf Energy(fJ) SSAPL-buf Delay(ns) EE16-min: min. drive + 16 min inv load(28.8fF)

Energy-Delay Curve : EE16-min HLFF-unbuf Energy(fJ) Delay(ns) EE16-min: min. drive + 16 min inv load(28.8fF)

Energy-Delay Curve : EE16-min HLFF-unbuf Energy(fJ) HLFF-buf Delay(ns) EE16-min: min. drive + 16 min inv load(28.8fF)

Energy-Delay Curve : EE16-min Energy(fJ) PPCFF-unbuf Delay(ns) EE16-min: min. drive + 16 min inv load(28.8fF)

Energy-Delay Curve : EE64-min Energy(fJ) MSAFF-unbuf Delay(ns) EE64-min: min. drive + 64 min inv load(115.2fF)

Energy-Delay Curve : EE64-min HLFF-unbuf HLFF-buf Energy(fJ) Delay(ns) EE64-min: min. drive + 64 min inv load(115.2fF)

Energy-Delay Curve : EE4-big Energy(fJ) Delay(ns) EE4-big: 16x min. drive + 64 min inv load(115.2fF)

Energy-Delay Curve : EE4-min vs EE4-big PPCFF-unbuf MSAFF-unbuf SAFF-unbuf

Energy-Delay Curve : EE4-min vs EE4-big PPCFF-unbuf MSAFF-unbuf MSAFF-unbuf SAFF-unbuf PPCFF-unbuf SAFF-unbuf

Summary Different flip-flops have different gains and parasitics. Real VLSI designs exhibit a variety of flip-flop output loads. The output load size affects the relative performance and energy consumption of different flip-flop designs. Therefore, output load effects should be accounted for when comparing flip-flops. Electrical effort Absolute output load size Output buffering