Boosters for Driving Long On-chip Interconnects : Design Issues, Interconnect Synthesis and Comparison with Repeaters Ankireddy Nalamalpu Intel Corporation/Hillsboro.

Slides:



Advertisements
Similar presentations
EE 201A Modeling and Optimization for VLSI LayoutJeff Wong and Dan Vasquez EE 201A Noise Modeling Jeff Wong and Dan Vasquez Electrical Engineering Department.
Advertisements

CSET 4650 Field Programmable Logic Devices
2007 MURI Review The Effect of Voltage Fluctuations on the Single Event Transient Response of Deep Submicron Digital Circuits Matthew J. Gadlage 1,2, Ronald.
Elettronica T A.A Digital Integrated Circuits © Prentice Hall 2003 Inverter CMOS INVERTER.
1 Arkadiy Morgenshtein, Eby G. Friedman, Ran Ginosar, Avinoam Kolodny Technion – Israel Institute of Technology Timing Optimization in Logic with Interconnect.
Lecture 5: DC & Transient Response
Introduction to CMOS VLSI Design Lecture 19: Design for Skew David Harris Harvey Mudd College Spring 2004.
Noise Model for Multiple Segmented Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu †, Niranjan A. Pol ‡ and Devendra Vidhani* UCSD CSE and ECE.
Introduction to CMOS VLSI Design Clock Skew-tolerant circuits.
Sequential Definitions  Use two level sensitive latches of opposite type to build one master-slave flipflop that changes state on a clock edge (when the.
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 19: Timing Issues; Introduction to Datapath.
Clock Design Adopted from David Harris of Harvey Mudd College.
RTL Hardware Design by P. Chu Chapter 161 Clock and Synchronization.
Design of Variable Input Delay Gates for Low Dynamic Power Circuits
Designing Combinational Logic Circuits: Part2 Alternative Logic Forms:
EE4271 VLSI Design Interconnect Optimizations Buffer Insertion.
Introduction to CMOS VLSI Design Lecture 4: DC & Transient Response
Lecture 8: Clock Distribution, PLL & DLL
MICRO-MODEM RELIABILITY SOLUTION FOR NOC COMMUNICATIONS Arkadiy Morgenshtein, Evgeny Bolotin, Israel Cidon, Avinoam Kolodny, Ran Ginosar Technion – Israel.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 15: Interconnects & Wire Engineering Prof. Sherief Reda Division of Engineering,
Introduction to CMOS VLSI Design Lecture 4: DC & Transient Response Credits: David Harris Harvey Mudd College (Material taken/adapted from Harris’ lecture.
© Digital Integrated Circuits 2nd Inverter CMOS Inverter: Digital Workhorse  Best Figures of Merit in CMOS Family  Noise Immunity  Performance  Power/Buffer.
Evolution of implementation technologies
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 22: Material Review Prof. Sherief Reda Division of Engineering, Brown University.
Effects of Global Interconnect Optimizations on Performance Estimation of Deep Sub-Micron Design Yu (Kevin) Cao 1, Chenming Hu 1, Xuejue Huang 1, Andrew.
CMOS VLSI Design4: DC and Transient ResponseSlide 1 EE466: VLSI Design Lecture 05: DC and transient response – CMOS Inverters.
1 Modeling and Optimization of VLSI Interconnect Lecture 1: Introduction Avinoam Kolodny Konstantin Moiseev.
Analysis and Avoidance of Cross-talk in on-chip buses Chunjie Duan Ericsson Wireless Communications Anup Tirumala Jasmine Networks Sunil P Khatri University.
Effects of Global Interconnect Optimizations on Performance Estimation of Deep Sub-Micron Design Yu Cao, Chenming Hu, Xuejue Huang, Andrew B. Kahng, Sudhakar.
Noise and Delay Uncertainty Studies for Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu † and Devendra Vidhani ‡ UCLA Computer Science Department,
The CMOS Inverter Slides adapted from:
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Robust Low Power VLSI R obust L ow P ower VLSI Finding the Optimal Switch Box Topology for an FPGA Interconnect Seyi Ayorinde Pooja Paul Chaudhury.
EE466: VLSI Design Power Dissipation. Outline Motivation to estimate power dissipation Sources of power dissipation Dynamic power dissipation Static power.
Mary Jane Irwin ( ) Modified by Dr. George Engel (SIUE)
Modern VLSI Design 4e: Chapter 4 Copyright  2008 Wayne Wolf Topics n Interconnect design. n Crosstalk. n Power optimization.
Ch 10 MOSFETs and MOS Digital Circuits
Crosstalk Analysis in UDSM technologies
Power Reduction for FPGA using Multiple Vdd/Vth
Research on Analysis and Physical Synthesis Chung-Kuan Cheng CSE Department UC San Diego
EE415 VLSI Design DYNAMIC LOGIC [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Lecture 12 Review and Sample Exam Questions Professor Lei He EE 201A, Spring 2004
Chapter 07 Electronic Analysis of CMOS Logic Gates
ECE Advanced Digital Systems Design Lecture 12 – Timing Analysis Capt Michael Tanner Room 2F46A HQ U.S. Air Force Academy I n t e g r i.
DCSL & LVDCSL: A High Fan-in, High Performance Differential Current Switch Logic Families Dinesh Somasekhaar, Kaushik Roy Presented by Hazem Awad.
1 Interconnect and Packaging Lecture 8: Clock Meshes and Shunts Chung-Kuan Cheng UC San Diego.
Optimal digital circuit design Mohammad Sharifkhani.
Modern VLSI Design 3e: Chapter 4 Copyright  1998, 2002 Prentice Hall PTR Topics n Interconnect design. n Crosstalk. n Power optimization.
1 Bus Encoding for Total Power Reduction Using a Leakage-Aware Buffer Configuration 班級:積體所碩一 學生:林欣緯 指導教授:魏凱城 老師 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION.
Introduction to CMOS VLSI Design MOS devices: static and dynamic behavior.
CSE477 L07 Pass Transistor Logic.1Irwin&Vijay, PSU, 2003 CSE477 VLSI Digital Circuits Fall 2003 Lecture 07: Pass Transistor Logic Mary Jane Irwin (
Topics Combinational network delay.
Modern VLSI Design 4e: Chapter 3 Copyright  2008 Wayne Wolf Topics n Wire delay. n Buffer insertion. n Crosstalk. n Inductive interconnect. n Switch logic.
Solid-State Devices & Circuits
EE415 VLSI Design THE INVERTER [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Dynamic Logic.
EE141 Combinational Circuits 1 Chapter 6 (I) Designing Combinational Logic Circuits Dynamic CMOS LogicDynamic CMOS Logic V1.0 5/4/2003.
VADA Lab.SungKyunKwan Univ. 1 L5:Lower Power Architecture Design 성균관대학교 조 준 동 교수
-1- Delay Uncertainty and Signal Criticality Driven Routing Channel Optimization for Advanced DRAM Products Samyoung Bang #, Kwangsoo Han ‡, Andrew B.
Effects of Inductance on the Propagation Delay and Repeater Insertion in VLSI Circuits Yehea I. Ismail and Eby G. Friedman, Fellow, IEEE.
1 Modeling and Optimization of VLSI Interconnect Lecture 2: Interconnect Delay Modeling Avinoam Kolodny Konstantin Moiseev.
The Interconnect Delay Bottleneck.
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Multiple Drain Transistor-Based FPGA Architectures
332:578 Deep Submicron VLSI Design Lecture 14 Design for Clock Skew
A New Hybrid FPGA with Nanoscale Clusters and CMOS Routing Reza M. P
Applications of GTX Y. Cao, X. Huang, A.B. Kahng, F. Koushanfar, H. Lu, S. Muddu, D. Stroobandt and D. Sylvester Abstract The GTX (GSRC Technology Extrapolation)
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Presentation transcript:

Boosters for Driving Long On-chip Interconnects : Design Issues, Interconnect Synthesis and Comparison with Repeaters Ankireddy Nalamalpu Intel Corporation/Hillsboro Wayne Burleson UMASS/Amherst Partially Funded by SRC under research ID 766

Motivation Interconnect delay will dominate DSM Limited performance by using traditional techniques (Repeaters) for driving on-chip interconnects Repeaters are area and power hungry This study aims to provide New high-performance circuit technique (Booster) for driving interconnects Over-all Booster design methodology to integrate into automatic interconnect synthesis tools Study/comparison Boosters with Repeaters

Repeater Design Classical delay optimal repeater solution when delay of repeaters equals interconnect delay [Bakoglu85] Repeater design solutions model short-channel effects in DSM using Alpha Power MOSFET model [Friedman98b, Nalamalpu00b]

Repeater Design Limitations Limited performance with Repeaters in DSM due to non- negligible interconnect resistance Increasing Repeater Area and Power with technology scaling [Sylvester98, Sylvester99] 700,000 repeaters in 70nm CMOS [Cong99] Increased design problems with repeaters driving bi-directional and multi-source buses Inverting Polarity Technology Generation(  m) Power(W) 1x10 6 2x10 6 3x10 6 4x10 6 5x10 6 6x10 6 # of Repeaters Repeaters + Wire Wires Only # Repeaters [Plot from Sylvester99]

Review of Previous Work Regenerative Feed-back Repeaters for driving programmable interconnections[ Dobbelaere95 ] Extremely sensitive to Noise Meta-stability Two-sided Timing Constraints Limit in performance gain Driver Receiver 2,4…Inverters Interconnect

Review of Previous Work Differential, Small-swing and other design techniques[Lima95,Friedman98a] Requires more circuit design sophistication Cumbersome for automatic interconnect synthesis tools Require multiple Power Supply’s in some cases  We need simple and yet high-performance circuit technique that can be integrated into automatic interconnect synthesis tools

Proposed Design Our proposed circuit (Booster) differs from the existing designs in one or more of the following High Performance Simpler and requires fewer transistors Noise immunity Eliminates Meta-stability We formulate analytical design rules for Boosters to be part of automatic interconnect synthesis tool

Driver Receiver Driver Receiver Inputrin Interconnect bp tn Booster Circuit Skewed Inverters Driver Feed-back bout fout N=1.0 P=2.0 N=1.0 P=2.0 Full Keeper

Booster Simulations RLC 5 T-Interconnect model in 0.16  m CMOS Feedback path Improves the speed of driver Prevents turning-off booster prematurely thereby eliminating two-sided timing constraints Makes circuit glitch immune Inverter Outputs Firing Feed-back Path Input

Booster Design Skewed inverters respond to opposite ends of voltage transition Driving both the inverters to feed-back path improves noise immunity Full keeper helps noise cause Booster firing time depends on switching thresholds of inverters Boosters attach to the wire rather than interrupting it so can be used for bi-directional signals Boosters don’t impact the polarity of the signal

Booster Design Methodology Analytically determine number of boosters and their placements for driving given interconnect load Consider only delay optimality Minimize power/area impact without losing significant speed-up

Boosters no good for driving very short wires due to fast transients in small RC loads (how short?) In-order to place a booster Firing time  < Time Constant  Booster Transient variation  > 2.5, to minimize total number of boosters Booster Placement BR1 BR2BR3 BR1, BR2, BR3 = Boosters Booster Transient Variation  Delay with Booster(ps) Delay without Booster(ps)

Booster Analytical Model Using simple inverter model(which will suffice) Length = L 1 Node(a)Node(b) Length = L 3 Length = L 2 bp tn

Booster Analytical Model Using more accurate alpha-power law based inverter analytical model [Nalamalpu00b] Alpha power MOSFET law [Sakurai90] models short-channel effects Repeater model is within 5% error of SPICE

Rule for Number of Boosters When boosters(BR1, BR2, BR3) are initially off L1,L2 and L3 will be different for identical segment delays due to characteristics of signal propagation along RC line Unlike Repeaters, placing non-optimal number of boosters doesn’t impact performance as much as power Number of Boosters Interconnect Delay Short- circuit Power

To Minimize number of boosters Any down stream booster (e.g. BR2 ) should be fired only after improved upstream signal transient (e.g. A, BR1 is active) propagates downstream (e.g.B) L 1 <L 2 <L 3 <L 4 for identical segment delays L1L1 L2L2 Rule for Number of Boosters L3L3 L4L4 BR1, BR2, BR3, BR4 = Boosters BR1BR2BR3BR4

Booster Placement Sensitivity Realistic floor-plans will have several placement constraints Inter block routing Repeater staggering to reduce inductive and capacitive coupling To ensure the design is manageable (e.g. verifiable, reusable) To maintain datapath’s regularity Block ABlock B Block C No Glue Logic

Booster Placement Sensitivity Repeaters are shown to be sensitive to placement variation[Nalamalpu00a] Worst case placement scenario’s results in performance degradation by as much as 30% Boosters relatively insensitive to placement variation due to its dependence on transient

SPICE Simulations We used delay optimal repeater design solution obtained by using alpha-power MOSFET model[Nalamalpu00b] Booster design rules for finding number of boosters and their placements are used to minimize design cost without losing significant speed (<5%) CMOS 0.16  m process is used for SPICE simulations Interconnect load is represented using RLC 5 T-model RepeatersBoosters

SPICE Simulations C L (pF) R t (k  ) Speed- up (%) D booster D repeater W booster W repeater n booster (  m) (ps) n repeater

Boosters Vs Repeaters Boosters shown to out-perform Repeaters by 20% for all kinds of interconnect loads (both capacitive and resistive dominated) Boosters interconnect driving distance is 3x that of Repeaters resulting in fewer Boosters Significant reduction in Area over Repeaters (more than 100% depending on interconnect load) Boosters are insensitive to placement variation Boosters don’t impact the polarity of the signal

Booster Applications Uni/bi-directional interconnects Multi-source/sink buses Programmable Interconnections in FPGA’s Booster On Switches Off Switches

Booster Applications Long AND domino gates (e.g decoders) Precharge from top of the stack and discharge is from bottom of the stack Bi-directional signaling can be improved using boosters

Booster Limitations Boosters don’t break lines however for buffering, modularity and signal integrity reasons it is desirable to break long lines Boosters are not well understood by CAD tools and designers

Conclusions We presented analytical design solutions, both hard optimization and softer realistic design problems We propose to combine Boosters with Repeaters in some cases to handle both modularity and signal integrity issues Boosters find application in long dynamic ANDs, and multi-source interconnects in addition to conventional point-to-point long lines

Future Work Integration into interconnect synthesis tool with Repeaters Impact on bi-directional multi-source lines which could directly impact VLIW, FPGA, Routers, multi-processor, memory and other highly connected architectures

Acknowledgements Sriram Srinivasan for insightful comments Prof. Arnold Rosenberg for the initial theoretical motivations in exploring booster circuits SRC for partially supporting under Research ID 766 UMASS has filed for several patents related to Booster technology

References [Dobbelaere95] Dobbelaere et al, Regenerative Feed-back Repeaters for Programmable Interconnections, JSSC, 1995 [Lima95] T. Lima et al, Capacitance Coupling Immune Accelerator for Resistive Interconnects, IEEE Trans. on Electron Devices, 1995 [Friedman98a] Secareanu et al, Transparent Repeaters, GLSVLSI,1998 [Nalamalpu00a] A. Nalamalpu et al, Quantifying and Mitigating Placement Constraints, 2000 [Sakurai90] Sakurai et al, Alpha-Power MOSFET Model, JSSC,1990 [Nalamalpu00b] A. Nalamalpu et al, Repeater Ramp based Analytical Model, ISCAS,2000

References [Sylvester98] D.Sylvester et al, Getting to the bottom of deep sub-micron, ICCAD, 1998 [Sylvester99] D.Sylvester et al, Getting to the bottom of deep sub-micron II, ISPD, 1999 [Friedman98b] V.Adler et al, Repeater Design to Reduce Delay and Power, IEEE Trans. Circuits and System II, 1998 [Bakoglu85] Bakoglu et al, Optimal Interconnection Circuits for VLSI, IEEE Trans. Electron Devices, 1985