The Interconnect Delay Bottleneck.

Slides:



Advertisements
Similar presentations
by Alexander Glavtchev
Advertisements

Wires.
Metal Oxide Semiconductor Field Effect Transistors
1 Cleared for Open Publication July 30, S-2144 P148/MAPLD 2004 Rea MAPLD 148:"Is Scaling the Correct Approach for Radiation Hardened Conversions.
1 Worst-case Delay Analysis Considering the Variability of Transistors and Interconnects Takayuki Fukuoka, Tsuchiya Akira and Hidetoshi Onodera Kyoto University.
Adapted from Digital Integrated Circuits, 2nd Ed. 1 IC Layout.
EE466: VLSI Design Lecture 11: Wires
EE 447 VLSI Design Lecture 5: Wires. EE 447VLSI Design 6: Wires2 Outline Introduction Wire Resistance Wire Capacitance Wire RC Delay Crosstalk Wire Engineering.
04/11/02EECS 3121 Lecture 26: Interconnect Modeling, continued EECS 312 Reading: 8.2.2, (text) HW 8 is due now!
Design and Implementation of VLSI Systems (EN0160)
04/09/02EECS 3121 Lecture 25: Interconnect Modeling EECS 312 Reading: 8.3 (text), 4.3.2, (2 nd edition)
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 18: Scaling Theory Prof. Sherief Reda Division of Engineering, Brown University.
Interconnect and Packaging Lecture 2: Scalability
Lecture #25a OUTLINE Interconnect modeling
Introduction to CMOS VLSI Design Interconnect: wire.
Design of Integrated-Circuit Interconnects with Accurate Modeling of Chemical-Mechanical Planarization Lei He, Andrew B. Kahng* #, Kingho Tam, Jinjun Xiong.
1 A Novel Metric for Interconnect Architecture Performance Parthasarathi Dasgupta, Andrew B. Kahng, Swamy V. Muddu Dept. of CSE and ECE University of California,
ECE 424 – Introduction to VLSI Design
TLC: Transmission Line Caches Brad Beckmann David Wood Multifacet Project University of Wisconsin-Madison 12/3/03.
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 7 Programmable.
EZ-COURSEWARE State-of-the-Art Teaching Tools From AMS Teaching Tomorrow’s Technology Today.
Written By: Kris Tiri and Ingrid Verbauwhede Presented By: William Whitehouse.
CAD for Physical Design of VLSI Circuits
Limitations of Digital Computation William Trapanese Richard Wong.
The George Washington University School of Engineering and Applied Science Department of Electrical and Computer Engineering ECE122 – Lab 7 MOSFET Parameters.
1 ECE 556 Design Automation of Digital Systems By Prof. Charlie Chung-Ping Chen ECE Department UW-Madison.
High speed silicon Mach-Zehnder modulator
EE141 © Digital Integrated Circuits 2nd Wires 1 Digital Integrated Circuits A Design Perspective The Interconnect Jan M. Rabaey Anantha Chandrakasan Borivoje.
EE415 VLSI Design 1 The Wire [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
EE141 © Digital Integrated Circuits 2nd Wires 1 Digital Integrated Circuits A Design Perspective The Wire Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 33: November 20, 2013 Crosstalk.
VLSI Design Lecture 3: Parasitics of CMOS Wires Mohammad Arjomand CE Department Sharif Univ. of Tech. Adapted with modifications from Harris’s lecture.
1 Interconnect/Via. 2 Delay of Devices and Interconnect.
Lecture 14: Wires. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 14: Wires2 Outline  Introduction  Interconnect Modeling –Wire Resistance –Wire Capacitance.
VLSI INTERCONNECTS IN VLSI DESIGN - PROF. RAKESH K. JHA
ADVANCED HIGH DENSITY INTERCONNECT MATERIALS AND TECHNIQUES DIVYA CHALLA.
Trends in IC technology and design J. Christiansen CERN - EP/MIC
Interconnect/Via.
Chapter 4: Secs ; Chapter 5: pp
EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
14 February, 2004SLIP, 2004 Self-Consistent Power/Performance/Reliability Analysis for Copper Interconnects Bipin Rajendran, Pawan Kapur, Krishna C. Saraswat.
-1- Soft Core Viterbi Decoder EECS 290A Project Dave Chinnery, Rhett Davis, Chris Taylor, Ning Zhang.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 30: November 21, 2012 Crosstalk.
1 Interconnect and Packaging Lecture 2: Scalability Chung-Kuan Cheng UC San Diego.
Outline  Introduction  Wire Resistance  Wire Capacitance  Wire RC Delay  Wire Engineering  Repeaters  Summary.
University of Michigan Advanced Computer Architecture Lab. 2 CAD Tools for Variation Tolerance David Blaauw and Kaviraj Chopra University of Michigan.
Wires & wire delay Lecture 9 Tuesday September 27, 2016.
CS203 – Advanced Computer Architecture
Circuit characterization and Performance Estimation
THE CMOS INVERTER.
Crosstalk If both a wire and its neighbor are switching at the same time, the direction of the switching affects the amount of charge to be delivered and.
by Alexander Glavtchev
EE141 Chapter 4 The Wire March 20, 2003.
Chapter 4 Interconnect.
Architecture & Organization 1
SIDDAGANGA INSTITUTE OF TECHNOLOGY
Day 33: November 19, 2014 Crosstalk
Architecture & Organization 1
An Illustration of 0.1µm CMOS layout design on PC
Day 31: November 23, 2011 Crosstalk
Topics Circuit design for FPGAs: Logic elements. Interconnect.
An Automated Design Flow for 3D Microarchitecture Evaluation
Summary Current density in a signal line was estimated, based on the simple circuit shown in Fig.1. This circuit is scaled down according to ITRS 2003.
Puneet Gupta1 , Andrew B. Kahng1 , Youngmin Kim2, Dennis Sylvester2
Computer Evolution and Performance
Reading (Rabaey et al.): Sections 3.5, 5.6
Technology scaling Currently, technology scaling has a threefold objective: Reduce the gate delay by 30% (43% increase in frequency) Double the transistor.
A New Hybrid FPGA with Nanoscale Clusters and CMOS Routing Reza M. P
Interconnect and Packaging Lecture 2: Scalability
THE INTERCONNECT.
Presentation transcript:

The Interconnect Delay Bottleneck

Interconnect delay Wire delay does not scale with technology, Relative delay is growing even for optimized interconnects Wire delay does not scale with technology, literature says. Is it true?

Experimental Setup Switch 0 Switch 1 The link performance was explored with an experimental setup consisting of a 2-switch test architecture tuning the following physical synthesis parameters: The link length spanned from 1.5mm to 10mm Up to 9 pipeline stages inserted The channel width ranged from 250um to 10um by means of non-routable obstructions. Target frequency for synthesis: from 250MHz to 1Ghz. Two technology libraries utilized: Low-Power Low-Vth 65nm Low-Power Std-Vth 45nm (so that buffers have almost the same delay and the net impact of wire parasitics is pointed out) Switch 1 Switch 0 LINK LENGTH

Link Performance (1) SW 1 LINK LENGTH The performance of the link degrades by incrementing the inter-switch spacing In 65nm even a loose target of 250MHz is not achieved for 8mm links while 1GHz is hardly affordable at 1.5mm. In 45nm the synthesis tool does not achieve the 65nm performance even for the shorter links.

Link Buffer Distribution The place&route in 45nm has required a much higher number of buffer cells with high driving strength. Physical properties of on-chip interconnects in 45nm are responsible for the performance degradation!

Link Performance (2) Let us now use a more modern synthesis flow based on placement-aware logic synthesis (hereafter named the “topographical synthesis”). SW 1 LINK LENGTH A relevant perfomance speedup is achievable by utilizing a topographical approach: 45nm library outperforms 65nm library for long links and aggressive speeds The awereness of back-end information becomes a must in 45nm and beyond

Link Pipelining The required number of pipeline stages to meet the target speed of 1Ghz on the link was determined with incremental place&route steps: Library 1.5mm 3mm 8mm 10mm 45nm 1 2 7 9 65nm 8 => Pipeline stages are inserted manually so to break the link into segments of equal length. Interestingly the trend for both the 45nm and 65nm library is the same! Switch 1 Switch 0

Gate delay Let S be the scaling factor (S=0.7): Load capacitance Voltage swing of interest Device delay Drive current of the device

Gate delay Shrinking of geometries Power and delay reduction Constant power density

Ideal scaling of MOS transistors Smaller interconnect yields larger delays due to the decreasing cross-sectional area L Very high level model which neglects sidewall coupling and fringing capacitances W H tdi E dielectric substrate There are two interconnect scaling scenarios: Local interconnects (10-500 um at 0.18 um) (length scale set by the size of a gate) Global interconnects (length scale set by functional unit size and chip edge) Thickness permittivity resistivity

Interconnect scaling Let S be the scaling factor (S=0.7): Ideal scaling: Horizontal and vertical dimensions are equally scaled to preserve packing density To preserve packing density For process integration For process integration Driven by gate shrinking Bad degradation! Tolerable RC stays constant in spite of the scaling trend Reliability problems

Interconnect scaling Improvement by means of: Quasi ideal scaling: wires scaled more in the horizontal rather than the vertical direction, so that: RC delay tracks S closer!! packing density preserved To preserve packing density To reduce resistance To keep capacitance limited Better than ideal scaling Should scale slightly, but sidewall capacitance accounted for. Tracks s closer Better

Interconnect scaling Ideal scaling: Horizontal and vertical dimensions are equally scaled to preserve packing density Increases with die size Degradation not tolerable

Interconnect scaling Improvement by means of: Constant dimension scaling: By maintaining wide and thick wires at the higher metal levels, RC delay can be controlled routing density penalized! Interconnect size unaffected Thanks to constant cross-section area Thanks to constant width and ILD Just the impact of increased wirelength. Much better!! But still this is a reverse scaling!!

0.13 um Cu interconnect stack

A cross-layer concern The physical-layer tricks documented above are complemented by other techniques to tackle the interconnect delay bottleneck: Migration to new bus architectures Link pipelining Placement-aware logic synthesis