Introduction to CMOS VLSI Design Clock Skew-tolerant circuits.

Slides:



Advertisements
Similar presentations
Transmission Gate Based Circuits
Advertisements

Sequential Circuit Design
CSET 4650 Field Programmable Logic Devices
Modern VLSI Design 4e: Chapter 5 Copyright  2008 Wayne Wolf Topics n Performance analysis of sequential machines.
Introduction to CMOS VLSI Design Sequential Circuits.
VLSI Design EE 447/547 Sequential circuits 1 EE 447/547 VLSI Design Lecture 9: Sequential Circuits.
Introduction to CMOS VLSI Design Sequential Circuits
MICROELETTRONICA Sequential circuits Lection 7.
Modern VLSI Design 4e: Chapter 5 Copyright  2008 Wayne Wolf Topics n Memory elements. n Basics of sequential machines.
ELEC 256 / Saif Zahir UBC / 2000 Timing Methodology Overview Set of rules for interconnecting components and clocks When followed, guarantee proper operation.
Lecture 11: Sequential Circuit Design. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 11: Sequential Circuits2 Outline  Sequencing  Sequencing Element Design.
Lecture: 1.6 Tri-states, Mux, Latches & Flip Flops
Introduction to CMOS VLSI Design Lecture 10: Sequential Circuits David Harris Harvey Mudd College Spring 2004.
Introduction to CMOS VLSI Design Lecture 1: Circuits & Layout
(Neil west - p: ). Finite-state machine (FSM) which is composed of a set of logic input feeding a block of combinational logic resulting in a set.
Sequential Circuits. Outline  Floorplanning  Sequencing  Sequencing Element Design  Max and Min-Delay  Clock Skew  Time Borrowing  Two-Phase Clocking.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis EE4800 CMOS Digital IC Design & Analysis Lecture 11 Sequential Circuit Design Zhuo Feng.
Introduction to CMOS VLSI Design Lecture 19: Design for Skew David Harris Harvey Mudd College Spring 2004.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN1600) Lecture 21: Dynamic Combinational Circuit Design Prof. Sherief Reda Division of.
EE141 © Digital Integrated Circuits 2nd Timing Issues 1 Digital Integrated Circuits A Design Perspective Timing Issues Jan M. Rabaey Anantha Chandrakasan.
CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 17: Dynamic Sequential Circuits And Timing Issues [Adapted from Rabaey’s Digital Integrated Circuits,
Combinational circuits Lection 6
Lecture 21: Packaging, Power, & Clock
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 19: Timing Issues; Introduction to Datapath.
Clock Design Adopted from David Harris of Harvey Mudd College.
Chapter 11 Timing Issues in Digital Systems Boonchuay Supmonchai Integrated Design Application Research (IDAR) Laboratory August 20, 2004; Revised - July.
A Novel Clock Distribution and Dynamic De-skewing Methodology Arjun Kapoor – University of Colorado at Boulder Nikhil Jayakumar – Texas A&M University,
Lecture 8: Clock Distribution, PLL & DLL
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 22: Sequential Circuit Design (1/2) Prof. Sherief Reda Division of Engineering,
Modern VLSI Design 2e: Chapter 5 Copyright  1998 Prentice Hall PTR Topics n Memory elements. n Basics of sequential machines.
Introduction to CMOS VLSI Design Circuit Families.
Circuit Families Adopted from David Harris of Harvey Mudd College.
Introduction to CMOS VLSI Design Lecture 10: Sequential Circuits
Chapter #6: Sequential Logic Design 6.2 Timing Methodologies
Introduction to CMOS VLSI Design Lecture 10: Sequential Circuits Credits: David Harris Harvey Mudd College (Material taken/adapted from Harris’ lecture.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 23: Sequential Circuit Design (1/3) Prof. Sherief Reda Division of Engineering,
Contemporary Logic Design Sequential Logic © R.H. Katz Transparency No Chapter #6: Sequential Logic Design Sequential Switching Networks.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 10.1 EE4800 CMOS Digital IC Design & Analysis Lecture 10 Combinational Circuit Design Zhuo Feng.
© Digital Integrated Circuits 2nd Sequential Circuits Digital Integrated Circuits A Design Perspective Designing Sequential Logic Circuits Jan M. Rabaey.
EE415 VLSI Design DYNAMIC LOGIC [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
EE 447 VLSI Design Lecture 8: Circuit Families.
Ratioed Circuits Ratioed circuits use weak pull-up and stronger pull-down networks. The input capacitance is reduced and hence logical effort. Correct.
Introduction to CMOS VLSI Design Lecture 25: Package, Power, Clock, and I/O David Harris Harvey Mudd College Spring 2007.
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Pseudo-nMOS gates. n DCVS gates. n Domino gates.
EE4800 CMOS Digital IC Design & Analysis
Digital System Clocking: High-Performance and Low-Power Aspects Vojin G. Oklobdzija, Vladimir M. Stojanovic, Dejan M. Markovic, Nikola M. Nedovic Wiley-Interscience.
Modern VLSI Design 4e: Chapter 3 Copyright  2008 Wayne Wolf Topics n Pseudo-nMOS gates. n DCVS logic. n Domino gates. n Design-for-yield. n Gates as IP.
Lecture 10: Circuit Families. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 10: Circuit Families2 Outline  Pseudo-nMOS Logic  Dynamic Logic  Pass Transistor.
Advanced VLSI Design Unit 04: Combinational and Sequential Circuits.
Introduction to CMOS VLSI Design Lecture 9: Circuit Families
Sp09 CMPEN 411 L18 S.1 CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 16: Static Sequential Circuits [Adapted from Rabaey’s Digital Integrated Circuits,
Introduction to Clock Tree Synthesis
Clocking System Design
EE 466/586 VLSI Design Partha Pande School of EECS Washington State University
Dynamic Logic.
EE141 Combinational Circuits 1 Chapter 6 (I) Designing Combinational Logic Circuits Dynamic CMOS LogicDynamic CMOS Logic V1.0 5/4/2003.
EE 466/586 VLSI Design Partha Pande School of EECS Washington State University
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 20: October 25, 2010 Pass Transistors.
1 Recap: Lecture 4 Logic Implementation Styles:  Static CMOS logic  Dynamic logic, or “domino” logic  Transmission gates, or “pass-transistor” logic.
EE141 Timing Issues 1 Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003 Rev /05/2003.
EE141 Timing Issues 1 Chapter 10 Timing Issues Rev /11/2003.
1 Clockless Logic Montek Singh Thu, Mar 2, Review: Logic Gate Families  Static CMOS logic  Dynamic logic, or “domino” logic  Transmission gates,
Lecture 11: Sequential Circuit Design
Introduction to CMOS VLSI Design Lecture 10: Sequential Circuits
CMOS VLSI Design Chapter 13 Clocks, DLLs, PLLs
Lecture 10: Circuit Families
Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003
CMOS VLSI Design Chapter 13 Clocks, DLLs, PLLs
332:578 Deep Submicron VLSI Design Lecture 14 Design for Clock Skew
Lecture 10: Circuit Families
Presentation transcript:

Introduction to CMOS VLSI Design Clock Skew-tolerant circuits

CMOS VLSI Design 2 Outline  Clock Distribution  Clock Skew  Skew-Tolerant Static Circuits  Traditional Domino Circuits  Skew-Tolerant Domino Circuits

CMOS VLSI Design 3 Review Timing Definitions

CMOS VLSI Design 4 Clocking  Synchronous systems use a clock to keep operations in sequence –Distinguish this from previous or next –Determine speed at which machine operates  Clock must be distributed to all the sequencing elements –Flip-flops and latches  Also distribute clock to other elements –Domino circuits and memories

CMOS VLSI Design 5 Clock Distribution  On a small chip, the clock distribution network is just a wire –And possibly an inverter for clkb  On practical chips, the RC delay of the wire resistance and gate load is very long –Variations in this delay cause clock to get to different elements at different times –This is called clock skew  Most chips use repeaters to buffer the clock and equalize the delay –Reduces but doesn’t eliminate skew

CMOS VLSI Design 6 Example  Skew comes from differences in gate and wire delay –With right buffer sizing, clk 1 and clk 2 could ideally arrive at the same time. –But power supply noise changes buffer delays –clk 2 and clk 3 will always see RC skew

CMOS VLSI Design 7 Skew Impact  Ideally full cycle is available for work  Skew adds sequencing overhead  Increases hold time too t pd

CMOS VLSI Design 8 Cycle Time Trends  Much of CPU performance comes from higher f –f is improving faster than simple process shrinks –Sequencing overhead is bigger part of cycle

CMOS VLSI Design 9 Solutions  Reduce clock skew –Careful clock distribution network design –Plenty of metal wiring resources  Analyze clock skew –Only budget actual, not worst case skews –Local vs. global skew budgets  Tolerate clock skew –Choose circuit structures insensitive to skew

CMOS VLSI Design 10 Clock Dist. Networks  Ad hoc  Grids  H-tree  Hybrid

CMOS VLSI Design 11 Clock Grids  Use grid on two or more levels to carry clock  Make wires wide to reduce RC delay  Ensures low skew between nearby points  But possibly large skew across die

CMOS VLSI Design 12 Alpha Clock Grids

CMOS VLSI Design 13 H-Trees  Fractal structure –Gets clock arbitrarily close to any point –Matched delay along all paths  Delay variations cause skew  A and B might see big skew

CMOS VLSI Design 14 Itanium 2 H-Tree  Four levels of buffering: –Primary driver –Repeater –Second-level clock buffer –Gater  Route around obstructions

CMOS VLSI Design 15 Hybrid Networks  Use H-tree to distribute clock to many points  Tie these points together with a grid  Ex: IBM Power4, PowerPC –H-tree drives sector buffers –Buffers drive total of 1024 points –All points shorted together with grid

CMOS VLSI Design 16 Skew Tolerance  Flip-flops are sensitive to skew because of hard edges –Data launches at latest rising edge of clock –Must setup before earliest next rising edge of clock –Overhead would shrink if we can soften edge  Latches tolerate moderate amounts of skew –Data can arrive anytime latch is transparent

CMOS VLSI Design 17 Skew: Latches 2-Phase Latches Pulsed Latches

CMOS VLSI Design 18 Dynamic Circuit Review  Static circuits are slow because fat pMOS load input  Dynamic gates use precharge to remove pMOS transistors from the inputs –Precharge:  = 0output forced high –Evaluate:  = 1output may pull low static dynamic

CMOS VLSI Design 19 Domino Circuits  Dynamic inputs must monotonically rise during evaluation –Place inverting stage between each dynamic gate –Dynamic / static pair called domino gate  Domino gates can be safely cascaded

CMOS VLSI Design 20 Domino Timing  Domino gates are 1.5 – 2x faster than static CMOS –Lower logical effort because of reduced C in  Challenge is to keep precharge off critical path  Look at clocking schemes for precharge and eval –Traditional schemes have severe overhead –Skew-tolerant domino hides this overhead

CMOS VLSI Design 21 Traditional Domino Ckts  have high sequencing overhead, hard edge in each half-cycle.  first domino gates does not evaluate until rising edge of the clock, but the results must set up at the latch before falling edge of the clock  If removing the latch, could soften the falling edge and cut the overhead.  The latch serves two functions: –prevent nonmonotonic signals from entering the next domino gate while it evaluates –hold the results of the half-cycle while it precharges and the next half-cycle evaluates.

CMOS VLSI Design 22 Traditional Domino Ckts  Hide precharge time by ping-ponging between half-cycles –When clk is high (low), the first half-cycle evaluates (precharges) and the second precharges (evaluates) –Latches hold results during precharge –Overhead of each latch is setup time and D-to-Q propa. delay. assume t pdq is larger, then time for compu. is t pd

CMOS VLSI Design 23 Clock Skew  Skew increases sequencing overhead –Evaluate at latest rising edge –Setup at latch by earliest falling edge –Assume skew and setup time > propa. delay t pdq time for computation t pd

CMOS VLSI Design 24 Time Borrowing  Logic may not exactly fit half-cycle –No flexibility to borrow time to balance logic between half cycles  Traditional domino sequencing overhead is about 25% of cycle time in fast systems!

CMOS VLSI Design 25 Relaxing the Timing  Sequencing overhead caused by hard edges –Data departs dynamic gate on late rising edge –Must setup at latch on early falling edge  Latch functions –Prevent glitches on inputs of domino gates –Holds results during precharge  Is the latch really necessary? –No glitches if inputs come from other domino –Can we hold the results in another way?

CMOS VLSI Design 26 Skew-Tolerant Domino  Use overlapping clocks to eliminate latches at phase boundaries. –Second phase evaluates using results of first

CMOS VLSI Design 27 Clks nonoverlapping, circuit fails 1.  1 falls, node a precharges high, node b low 2.  2 rises, the input to the first domino gate has fallen, i.e., b is low, node c will never discharge and the circuit loses information.

CMOS VLSI Design 28  1 and  2 overlap,  2 rises while b still holds correct value,  2 evaluates using the results of  1 Clks overlapping, circuit works  2 is evaluates, b is low node c is floating

CMOS VLSI Design 29 Full Keeper  After second phase evaluates, first phase precharges  Input to second phase falls –Violates monotonicity?  But we no longer need the value  Now the second gate has a floating output –Need full keeper to hold it either high or low

CMOS VLSI Design 30 Latch is unnecessary  As long as the clock overlap is long enough that the second phase can evaluate before the first precharges, the latch between phases is unnecessary

CMOS VLSI Design 31 Time Borrowing  Overlap can be used to –Tolerate clock skew –Permit time borrowing  No sequencing overhead

CMOS VLSI Design 32 Multiple Phases  With more clock phases, each phase overlaps more –Permits more skew tolerance and time borrowing

CMOS VLSI Design 33 Clock Generation

CMOS VLSI Design 34 Summary  Clock skew effectively increases setup and hold times in systems with hard edges  Managing skew –Reduce: good clock distribution network –Analyze: local vs. global skew –Tolerate: use systems with soft edges  Flip-flops and traditional domino are costly  Latches and skew-tolerant domino perform at full speed even with moderate clock skews.