Load-Sensitive Flip-Flop Characterization Seongmoo Heo and Krste Asanović MIT Laboratory for Computer Science WVLSI 2001.

Slides:



Advertisements
Similar presentations
Logic Gate Delay Modeling -1 Bishnu Prasad Das Research Scholar CEDT, IISc, Bangalore
Advertisements

Topics Electrical properties of static combinational gates:
1 Lecture 16 Timing  Terminology  Timing issues  Asynchronous inputs.
Introduction to CMOS VLSI Design Sequential Circuits.
VLSI Design EE 447/547 Sequential circuits 1 EE 447/547 VLSI Design Lecture 9: Sequential Circuits.
Introduction to CMOS VLSI Design Sequential Circuits
MICROELETTRONICA Sequential circuits Lection 7.
Lecture 11: Sequential Circuit Design. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 11: Sequential Circuits2 Outline  Sequencing  Sequencing Element Design.
Introduction to CMOS VLSI Design Lecture 10: Sequential Circuits David Harris Harvey Mudd College Spring 2004.
Differential pass transistor pulsed latch Moo-Young Kim, Inhwa Jung, Young-Ho Kwak, Chulwoo Kim 指導老師 : 魏凱城 老師 學 生 : 蕭荃泰彰化師範大學積體電路設計研究所.
Sequential Circuits. Outline  Floorplanning  Sequencing  Sequencing Element Design  Max and Min-Delay  Clock Skew  Time Borrowing  Two-Phase Clocking.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis EE4800 CMOS Digital IC Design & Analysis Lecture 11 Sequential Circuit Design Zhuo Feng.
Logical Effort A Method to Optimize Circuit Topology Swarthmore College E77 VLSI Design Adem Kader David Luong Mark Piper December 6, 2005.
Fall 06, Sep 19, 21 ELEC / Lecture 6 1 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic.
 C. H. Ziesler etal., 2003 Energy Recovering ASIC Design Advanced Computer Architecture Laboratory Department of Electrical Engineering and Computer Science.
ECEN 248: INTRODUCTION TO DIGITAL SYSTEMS DESIGN Dr. Shi Dept. of Electrical and Computer Engineering.
ENGIN112 L28: Timing Analysis November 7, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 28 Timing Analysis.
Introduction to CMOS VLSI Design Lecture 5: Logical Effort David Harris Harvey Mudd College Spring 2004.
Logical Effort.
EE166 Final Presentation Patsapol Kriausakul Sung Min Park Dennis Won Howard Yuan.
Prof. John Nestor ECE Department Lafayette College Easton, Pennsylvania ECE VLSI Circuit Design Lecture 13 - More about.
FUNCTIONAL OVERVIEW Design a synchronous 4-bit up and down counter Operates at 25MHz on the positive edge of the clock Designed to drive a 10pF capacitive.
1 MICROELETTRONICA Logical Effort and delay Lection 4.
CMOS VLSI Design4: DC and Transient ResponseSlide 1 EE466: VLSI Design Lecture 05: DC and transient response – CMOS Inverters.
Introduction to CMOS VLSI Design Lecture 5: Logical Effort
EE141 © Digital Integrated Circuits 2nd Combinational Circuits 1 Logical Effort - sizing for speed.
EE 447 VLSI Design Lecture 5: Logical Effort. EE 447 VLSI Design 5: Logical Effort2 Outline Introduction Delay in a Logic Gate Multistage Logic Networks.
Introduction to CMOS VLSI Design Lecture 10: Sequential Circuits Credits: David Harris Harvey Mudd College (Material taken/adapted from Harris’ lecture.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 23: Sequential Circuit Design (1/3) Prof. Sherief Reda Division of Engineering,
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 5.1 EE4800 CMOS Digital IC Design & Analysis Lecture 5 Logic Effort Zhuo Feng.
Author: D. Brooks, V.Tiwari and M. Martonosi Reviewer: Junxia Ma
Lecture 21, Slide 1EECS40, Fall 2004Prof. White Lecture #21 OUTLINE –Sequential logic circuits –Fan-out –Propagation delay –CMOS power consumption Reading:
Cascading CMOS gates. Elettronica T A.A Digital Integrated Circuits © Prentice Hall 2003 Cascading CMOS Goal l Designing for minimum propagation.
The CMOS Inverter Slides adapted from:
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 3 ASIC.
Electronic Counters.
Logic Optimization Mohammad Sharifkhani. Reading Textbook II, Chapters 5 and 6 (parts related to power and speed.) Following Papers: –Nose, Sakurai, 2000.
Digital System Clocking:
© Digital Integrated Circuits 2nd Sequential Circuits Digital Integrated Circuits A Design Perspective Designing Sequential Logic Circuits Jan M. Rabaey.
Design of Robust, Energy-Efficient Full Adders for Deep-Submicrometer Design Using Hybrid-CMOS Logic Style Sumeer Goel, Ashok Kumar, and Magdy A. Bayoumi.
Ronny Krashinsky Seongmoo Heo Michael Zhang Krste Asanovic MIT Laboratory for Computer Science SyCHOSys Synchronous.
Elmore Delay, Logical Effort
Chapter 07 Electronic Analysis of CMOS Logic Gates
Logical Effort: optimal CMOS device sizing Albert Chun (M.A.Sc. Candidate) Ottawa-Carleton Institute for Electrical & Computer Engineering (OCIECE) Ottawa,
Optimal digital circuit design Mohammad Sharifkhani.
Lecture 6: Logical Effort
Introduction to CMOS VLSI Design Lecture 5: Logical Effort GRECO-CIn-UFPE Harvey Mudd College Spring 2004.
XIAOYU HU AANCHAL GUPTA Multi Threshold Technique for High Speed and Low Power Consumption CMOS Circuits.
Comparative Analysis of Ultra-Low Voltage Flip-Flops for Energy Efficiency Bo Fu and Paul Ampadu IEEE International Symposium on Circuits and Systems,pp ,
Linear Delay Model In general the propagation delay of a gate can be written as: d = f + p –p is the delay due to intrinsic capacitance. –f is the effort.
The George Washington University School of Engineering and Applied Science Department of Electrical and Computer Engineering ECE122 – 30 Lab 2: CMOS Design.
Bi-CMOS Prakash B.
Sp09 CMPEN 411 L18 S.1 CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 16: Static Sequential Circuits [Adapted from Rabaey’s Digital Integrated Circuits,
1 Energy-Efficient Register Access Jessica H. Tseng and Krste Asanović MIT Laboratory for Computer Science, Cambridge, MA 02139, USA SBCCI2000.
EE141 © Digital Integrated Circuits 2nd Combinational Circuits 1 A few notes for your design  Finger and multiplier in schematic design  Parametric analysis.
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
EE415 VLSI Design THE INVERTER [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
EE415 VLSI Design. Read 4.1, 4.2 COMBINATIONAL LOGIC.
Modern VLSI Design 3e: Chapter 4 Copyright  1998, 2002 Prentice Hall PTR Topics n Transistor sizing: –Spice analysis. –Logical effort.
1 Timing Closure and the constant delay paradigm Problem: (timing closure problem) It has been difficult to get a circuit that meets delay requirements.
The George Washington University School of Engineering and Applied Science Department of Electrical and Computer Engineering ECE122 – 30 Lab 2: NAND gate.
Implementation of LFSR Counter Using CMOS VLSI Technology.
Power-Optimal Pipelining in Deep Submicron Technology
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
Introduction to CMOS VLSI Design Lecture 5: Logical Effort
Estimating Delays Would be nice to have a “back of the envelope” method for sizing gates for speed Logical Effort Book by Sutherland, Sproull, Harris Chapter.
Activity-Sensitive Flip-Flop and Latch Selection for Reduced Energy
Load-Sensitive Flip-Flop Characterization
Advanced Logical Effort
COMBINATIONAL LOGIC - 2.
Presentation transcript:

Load-Sensitive Flip-Flop Characterization Seongmoo Heo and Krste Asanović MIT Laboratory for Computer Science WVLSI 2001 April 19, 2001

Motivation Flip-flops are one of the most important components in synchronous VLSI designs. o Critical effect on cycle time o Large fraction of total system power Previously published work has failed to consider the effect of circuit loading on the relative ranking of flip-flop structures. [Kawaguchi et al. ’98] [Ko and Balsara ’00] [Kong et al ’00] [Lang et al ’97] [Nikolic et al ’00] [Nogawa and Ohtomo ’98] [Stojanovic and Oklobdzija ’99] [Stollo et al ’00] [Yuan and Svensson ’91] [Zyuban and Kogge ’99] [H.P. et al ’96] [J.M. et al ’96] o Fixed and usually overly large output load o Large or non-specified input drive o No output buffering

Observation 1.Different flip-flop designs have different inherent parasitics and output drive strength. o Different number and complexity of logic gates o Different kinds of feedback

Observation 1.Different flip-flop designs have different inherent parasitics and output drive strength. o Different number and complexity of logic gates o Different kinds of feedback D Q D Q D Q

Observation 2. Output loads in a circuit vary significantly. Flip-flop output load instances in a microprocessor datapath (A custom-designed 32-bit MIPS CPU in 0.25  m process) # of instances fF (4 min inv gate cap) 28.8fF (16 min inv gate cap) 115.2fF (64 min inv gate cap) 1.8fF (min inv gate cap)

Our Proposal Load effects must be considered in flip-flop characterization to avoid sub-optimal selection. We will present energy and delay measurements for various flip- flops across a range of output loading conditions(EE and absolute load size) and show that the relative rankings of structures vary. We will show that output buffering at high load can lead to the better performance and energy consumption for some structures.

Related Work Traditional Buffer Sizing Logical Effort [Sutherland and Sproull] o Logical Effort: drive strength of a circuit structure o Electrical Effort: the ratio of output load to input load o Delay = intrinsic parasitic delay + LE x EE

Overview Flip-Flop Designs Test Bench & Simulation Setup Delay and Energy Characterization Delay Analysis Energy-versus-Delay Analysis Summary

Flip-Flop Designs Fully static and single-ended [Nikolic et al ’00]

Test Bench Sized clock buffer to give equal rise/fall time Used a fixed, realistic input driver Varied output load from 4 min inv cap(7.2fF) to 64 min inv cap(115.2fF). 4 Load and Drive Configurations o EE4-min: min input drive, 4 min inv load (7.2fF) o EE16-min: min input drive, 16 min inv load (28.8fF) o EE64-min: min input drive, 64 min inv load (115.2fF) o EE4-big: 16x min input drive, 64 min inv load (115.2fF) FF 4 min inv cap 16 min inv cap 64 min inv cap

Simulation Setup 0.25μm TSMC CMOS process, Vdd=2.5V, T=25°C Hspice Levenberg-Marquardt method was used for transistor size optimization. o Transistor widths optimized for each load and drive conf. to give min delay or min energy for a given delay (transistor lengths were fixed at minimum.) o Parasitic capacitances included in the circuit netlists.

Delay and Energy Characterization Minimum D-Q delay [Stojanovic et al. ’99] (.Measure command) Total energy = input energy + internal energy + clock energy – output energy A single test waveform with ungated clock and data toggling every cycle o For a full characterization of energy dissipation, more realistic activity patterns should be considered [Heo, Krashinsky, Asanovic ARVLSI’01]. FF 4 min inv load 16 min inv load 64 min inv load

Speed Ranking Without Buffering Delay = const. intrinsic parasitic delay + output drive delay (= load size × driving capability) Driving Capability = f(# of stages, complexity) PPCFF SAFF MSAFF HLFF SSAPL (Transistors sized at each load point, but only for min delay)

Influence of Buffering on Performance : unbuffered : one inverter : two inverters (Min. input drive was used.) (Assuming no penalty for inverting output) SSAPL HLFF MSAFF SAFF PPCFF Delay (ns) Load (min inv cap)

Speed Ranking With Buffering Allowed Less speed variation compared to original flip-flops PPCFF SAFF MSAFF HLFF SSAPL

Energy-Delay Curve : EE4-min EE4-min: min. drive + 4 min inv load(7.2fF) Delay(ns) Energy(fJ) Each point sized for min energy for a given delay

Energy-Delay Curve : EE4-min Energy(fJ) Delay(ns) PPCFF-unbuf EE4-min: min. drive + 4 min inv load(7.2fF)

Energy-Delay Curve : EE16-min EE16-min: min. drive + 16 min inv load(28.8fF) Delay(ns) Energy(fJ) SSAPL-unbuf

Energy(fJ) Delay(ns) SSAPL-buf EE16-min: min. drive + 16 min inv load(28.8fF) Energy-Delay Curve : EE16-min SSAPL-unbuf

Energy(fJ) Delay(ns) HLFF-unbuf EE16-min: min. drive + 16 min inv load(28.8fF) Energy-Delay Curve : EE16-min

Energy(fJ) Delay(ns) HLFF-buf EE16-min: min. drive + 16 min inv load(28.8fF) Energy-Delay Curve : EE16-min HLFF-unbuf

Energy(fJ) Delay(ns) PPCFF-unbuf EE16-min: min. drive + 16 min inv load(28.8fF) Energy-Delay Curve : EE16-min

EE64-min: min. drive + 64 min inv load(115.2fF) Energy(fJ) Delay(ns) MSAFF-unbuf Energy-Delay Curve : EE64-min

Energy(fJ) Delay(ns) HLFF-unbuf HLFF-buf EE64-min: min. drive + 64 min inv load(115.2fF) Energy-Delay Curve : EE64-min

EE4-big: 16x min. drive + 64 min inv load(115.2fF) Energy(fJ) Delay(ns) Energy-Delay Curve : EE4-big

Energy-Delay Curve : EE4-min vs EE4-big EE4-min MSAFF-unbuf SAFF-unbuf PPCFF-unbuf EE4-big

MSAFF-unbuf SAFF-unbuf PPCFF-unbuf MSAFF-unbuf SAFF-unbuf Energy-Delay Curve : EE4-min vs EE4-big EE4-min EE4-big

Summary Different flip-flops have different gains and parasitics. Real VLSI designs exhibit a variety of flip-flop output loads. The output load size affects the relative performance and energy consumption of different flip-flop designs. Therefore, output load effects should be accounted for when comparing flip-flops. 1.Electrical effort 2.Absolute output load size 3.Output buffering