1 A Novel Metric for Interconnect Architecture Performance Parthasarathi Dasgupta, Andrew B. Kahng, Swamy V. Muddu Dept. of CSE and ECE University of California,

Slides:



Advertisements
Similar presentations
Caleb Serafy and Ankur Srivastava Dept. ECE, University of Maryland
Advertisements

Design Rule Generation for Interconnect Matching Andrew B. Kahng and Rasit Onur Topaloglu {abk | rtopalog University of California, San Diego.
OCV-Aware Top-Level Clock Tree Optimization
A Novel 3D Layer-Multiplexed On-Chip Network
Cadence Design Systems, Inc. Why Interconnect Prediction Doesn’t Work.
Native-Conflict-Aware Wire Perturbation for Double Patterning Technology Szu-Yu Chen, Yao-Wen Chang ICCAD 2010.
A Routing Technique for Structured Designs which Exploits Regularity Sabyasachi Das Intel Corporation Sunil P. Khatri Univ. of Colorado, Boulder.
Buffer and FF Insertion Slides from Charles J. Alpert IBM Corp.
Minimum Implant Area-Aware Gate Sizing and Placement
1 Interconnect Layout Optimization by Simultaneous Steiner Tree Construction and Buffer Insertion Presented By Cesare Ferri Takumi Okamoto, Jason Kong.
F.F. Dragan (Kent State) A.B. Kahng (UCSD) I. Mandoiu (UCLA) S. Muddu (Sanera Systems) A. Zelikovsky (Georgia State) Provably Good Global Buffering by.
Coupling-Aware Length-Ratio- Matching Routing for Capacitor Arrays in Analog Integrated Circuits Kuan-Hsien Ho, Hung-Chih Ou, Yao-Wen Chang and Hui-Fang.
Noise Model for Multiple Segmented Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu †, Niranjan A. Pol ‡ and Devendra Vidhani* UCSD CSE and ECE.
The Cost of Fixing Hold Time Violations in Sub-threshold Circuits Yanqing Zhang, Benton Calhoun University of Virginia Motivation and Background Power.
Minimum-Buffered Routing of Non- Critical Nets for Slew Rate and Reliability Control Supported by Cadence Design Systems, Inc. and the MARCO Gigascale.
Layer Assignment Algorithm for RLC Crosstalk Minimization Bin Liu, Yici Cai, Qiang Zhou, Xianlong Hong Tsinghua University.
Circuit Retiming with Interconnect Delay CUHK CSE CAD Group Meeting One Evangeline Young Aug 19, 2003.
Boosting: Min-Cut Placement with Improved Signal Delay Andrew B. KahngSherief Reda CSE & ECE Departments University of CA, San Diego La Jolla, CA
F.F. Dragan (Kent State) A.B. Kahng (UCSD) I. Mandoiu (Georgia Tech/UCLA) S. Muddu (Silicon Graphics) A. Zelikovsky (Georgia State) Provably Good Global.
Architectural-Level Prediction of Interconnect Wirelength and Fanout Kwangok Jeong, Andrew B. Kahng and Kambiz Samadi UCSD VLSI CAD Laboratory
Study of Floating Fill Impact on Interconnect Capacitance Andrew B. Kahng Kambiz Samadi Puneet Sharma CSE and ECE Departments University of California,
Local Unidirectional Bias for Smooth Cutsize-delay Tradeoff in Performance-driven Partitioning Andrew B. Kahng and Xu Xu UCSD CSE and ECE Depts. Work supported.
DPIMM-03 1 Performance-Impact Limited Area Fill Synthesis Yu Chen, Puneet Gupta, Andrew B. Kahng (UCLA, UCSD) Supported by Cadence.
A Global Minimum Clock Distribution Network Augmentation Algorithm for Guaranteed Clock Skew Yield A. B. Kahng, B. Liu, X. Xu, J. Hu* and G. Venkataraman*
Accurate Pseudo-Constructive Wirelength and Congestion Estimation Andrew B. Kahng, UCSD CSE and ECE Depts., La Jolla Xu Xu, UCSD CSE Dept., La Jolla Supported.
Ryan Kastner ASIC/SOC, September Coupling Aware Routing Ryan Kastner, Elaheh Bozorgzadeh and Majid Sarrafzadeh Department of Electrical and Computer.
Topography-Aware OPC for Better DOF margin and CD control Puneet Gupta*, Andrew B. Kahng*†‡, Chul-Hong Park†, Kambiz Samadi†, and Xu Xu‡ * Blaze-DFM Inc.
Metal Layer Planning for Silicon Interposers with Consideration of Routability and Manufacturing Cost W. Liu, T. Chien and T. Wang Department of CS, NTHU,
Design of Integrated-Circuit Interconnects with Accurate Modeling of Chemical-Mechanical Planarization Lei He, Andrew B. Kahng* #, Kingho Tam, Jinjun Xiong.
UC San Diego Computer Engineering. VLSI CAD Laboratory.. UC San Diego Computer EngineeringVLSI CAD Laboratory.. UC San Diego Computer EngineeringVLSI CAD.
SLIP 2000April 9, Wiring Layer Assignments with Consistent Stage Delays Andrew B. Kahng (UCLA) Dirk Stroobandt (Ghent University) Supported.
CSE 144 Project Part 2. Overview Multiple rows Routing channel between rows Components of identical height but various width Goal: Implement a placement.
Noise and Delay Uncertainty Studies for Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu † and Devendra Vidhani ‡ UCLA Computer Science Department,
A Methodology for Interconnect Dimension Determination By: Jeff Cobb Rajesh Garg Sunil P Khatri Department of Electrical and Computer Engineering, Texas.
Research on Analysis and Physical Synthesis Chung-Kuan Cheng CSE Department UC San Diego
1 Coupling Aware Timing Optimization and Antenna Avoidance in Layer Assignment Di Wu, Jiang Hu and Rabi Mahapatra Texas A&M University.
TSV-Aware Analytical Placement for 3D IC Designs Meng-Kai Hsu, Yao-Wen Chang, and Valerity Balabanov GIEE and EE department of NTU DAC 2011.
ASIC Design Flow – An Overview Ing. Pullini Antonio
A Polynomial Time Approximation Scheme For Timing Constrained Minimum Cost Layer Assignment Shiyan Hu*, Zhuo Li**, Charles J. Alpert** *Dept of Electrical.
1 Global Routing Method for 2-Layer Ball Grid Array Packages Yukiko Kubo*, Atsushi Takahashi** * The University of Kitakyushu ** Tokyo Institute of Technology.
« Pruning Policies for Two-Tiered Inverted Index with Correctness Guarantee » Proceedings of the 30th annual international ACM SIGIR, Amsterdam 2007) A.
Low-Power Gated Bus Synthesis for 3D IC via Rectilinear Shortest-Path Steiner Graph Chung-Kuan Cheng, Peng Du, Andrew B. Kahng, and Shih-Hung Weng UC San.
UC San Diego / VLSI CAD Laboratory Incremental Multiple-Scan Chain Ordering for ECO Flip-Flop Insertion Andrew B. Kahng, Ilgweon Kang and Siddhartha Nath.
An Efficient Clustering Algorithm For Low Power Clock Tree Synthesis Rupesh S. Shelar Enterprise Microprocessor Group Intel Corporation, Hillsboro, OR.
Bus-Pin-Aware Bus-Driven Floorplanning B. Wu and T. Ho Department of Computer Science and Information Engineering NCKU GLSVLSI 2010.
-1- UC San Diego / VLSI CAD Laboratory Construction of Realistic Gate Sizing Benchmarks With Known Optimal Solutions Andrew B. Kahng, Seokhyeong Kang VLSI.
Kwangsoo Han, Andrew B. Kahng, Hyein Lee and Lutong Wang
Kwangsoo Han‡, Andrew B. Kahng‡† and Hyein Lee‡
Impact of Interconnect Architecture on VPSAs (Via-Programmed Structured ASICs) Usman Ahmed Guy Lemieux Steve Wilton System-on-Chip Lab University of British.
A Faster Approximation Scheme for Timing Driven Minimum Cost Layer Assignment Shiyan Hu*, Zhuo Li**, and Charles J. Alpert** *Dept of ECE, Michigan Technological.
1 ε -Optimal Minimum-Delay/Area Zero-Skew Clock Tree Wire-Sizing in Pseudo-Polynomial Time Jeng-Liang Tsai Tsung-Hao Chen Charlie Chung-Ping Chen (National.
Fast Algorithms for Slew Constrained Minimum Cost Buffering S. Hu*, C. Alpert**, J. Hu*, S. Karandikar**, Z. Li*, W. Shi* and C. Sze** *Dept of ECE, Texas.
Simultaneous Analog Placement and Routing with Current Flow and Current Density Considerations H.C. Ou, H.C.C. Chien and Y.W. Chang Electronics Engineering,
ILP-Based Inter-Die Routing for 3D ICs Chia-Jen Chang, Pao-Jen Huang, Tai-Chen Chen, and Chien-Nan Jimmy Liu Department of Electrical Engineering, National.
Maze Routing Algorithms with Exact Matching Constraints for Analog and Mixed Signal Designs M. M. Ozdal and R. F. Hentschke Intel Corporation ICCAD 2012.
Routability-driven Floorplanning With Buffer Planning Chiu Wing Sham Evangeline F. Y. Young Department of Computer Science & Engineering The Chinese University.
-1- UC San Diego / VLSI CAD Laboratory Optimization of Overdrive Signoff Tuck-Boon Chan, Andrew B. Kahng, Jiajia Li and Siddhartha Nath Tuck-Boon Chan,
An Exact Algorithm for Difficult Detailed Routing Problems Kolja Sulimma Wolfgang Kunz J. W.-Goethe Universität Frankfurt.
-1- Delay Uncertainty and Signal Criticality Driven Routing Channel Optimization for Advanced DRAM Products Samyoung Bang #, Kwangsoo Han ‡, Andrew B.
14 February, 2004SLIP, 2004 Self-Consistent Power/Performance/Reliability Analysis for Copper Interconnects Bipin Rajendran, Pawan Kapur, Krishna C. Saraswat.
The Interconnect Delay Bottleneck.
Chapter 4 Interconnect.
RE-Tree: An Efficient Index Structure for Regular Expressions
Revisiting and Bounding the Benefit From 3D Integration
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Performance Optimization Global Routing with RLC Crosstalk Constraints
Buffered tree construction for timing optimization, slew rate, and reliability control Abstract: With the rapid scaling of IC technology, buffer insertion.
Puneet Gupta1 , Andrew B. Kahng1 , Youngmin Kim2, Dennis Sylvester2
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Under a Concurrent and Hierarchical Scheme
Presentation transcript:

1 A Novel Metric for Interconnect Architecture Performance Parthasarathi Dasgupta, Andrew B. Kahng, Swamy V. Muddu Dept. of CSE and ECE University of California, San Diego La Jolla, California 92093

2 Outline Interconnect Architecture (IA) Interconnect Architecture Metric - Rank Dynamic Programming for Exact Rank Computation Experimental Results and Conclusions

3 Interconnect Architecture An Interconnect Architecture (IA) is a collection of pairs of wiring layers, with all wires in a given layer pair having uniform width (w), height (h), spacing (s) and thickness (t) Repeater Layer-pair (j+1) Repeater Layer-pair j via Schematic of an IA

4 Interconnect Architecture Metrics Traditional IA metrics –Delay, bandwidth, crosstalk noise margin, #layers Conventional metrics do not effectively capture design constraints and process parameters –How does interconnect delay vary with ILD-k ? –How does #layers needed to route a design change with wire width? An evaluation metric should consider –Design parameters –Process parameters –Process constraints –Routability subject to performance constraints

5 [Bohr95], [Moll93] Study effects on delay, crosstalk of changing geometric parameters and technology constraints [Venkatesan01] Method to minimize delay, #layers and area Shows dependence of die area on repeater insertion Previous works do not explicitly consider semi-global, local layers These layers significantly impact routability due to via blockage  Motivates search for new IA quality metric Simple and efficiently computable Sensitive to design parameters such as frequency, WLD Sensitive to process parameters such as ILD permittivity Aware of layout issues such as repeaters, via blockage Previous Work

6 Our Contributions Novel metric for IA performance evaluation Efficient dynamic programming method for exactly computing the metric for given Interconnect Architecture for given Wirelength Distribution (WLD) Studies of effects of materials, geometry, frequency, design parameters, repeater resources on IA metric

7 Performance-, WLD-Aware Routing Model All connections (wires) are two-pin, L-shaped Each segment of an L is assigned to one layer of a tier For a given WLD, longer wires always routed on upper tiers; shorter wires always routed on lower tiers Every wire has a target delay (proportional to clock period) Repeaters inserted as needed to meet delay targets Starting from longer wires first All repeaters used in wires of a tier are of same size Repeater resource (maximum available repeater area) is specified as fraction of total die area Repeaters inserted until repeater area budget exhausted Layer-pair (j+1) Layer-pair j

8 Outline Interconnect Architecture (IA) Interconnect Architecture Metric - Rank Dynamic Programming for Exact Rank Computation Experimental Results and Conclusions

9 The Rank Metric for IA Determines IA quality in terms of how completely target performance is met while embedding all wires Definition 1. The rank of a wire is its index in the WLD, where wires have been arranged in order of non-increasing length Definition 2. The rank of an IA is the index of the highest-rank wire in the WLD that meets its target delay, subject to the constraints: –The given repeater area budget is not exceeded –All lower-rank (= longer) wires in the WLD meet target delays –All wires in the WLD can be assigned to the IA The rank of an IA is zero if not all the wires of the WLD can be assigned to the IA, even without meeting any delay targets

10 Rank of IA: Dependencies Via blockage IA: number of layer pairs W, H, S and T, tech node, gate count and gate parameters Repeater area budget A R Target Delays Rank of the IA TWirelength Number of wires WLD

11 The Rank Computation Problem Given –IA with fixed number of layer-pairs with fixed geometry –WLD W with n wires –Available repeater area A R –Upper bound d i = target delay for each wire Find –An assignment of wires from the WLD to the IA using repeater insertion within the repeater area budget to meet target delays of wires –such that rank of first wire failing to meet target delay is maximized

12 Heart of IA evaluation: meeting target delay Target delays of wires are met with repeater insertion Delay computation Length of wire i = l i ; Maximum wire-length = l max ; Clock frequency = f c, c p = Parasitic capacitance; C L = load capacitance r j, c j = resistance, capacitance per unit length of layer-pair j r o, c o = output resistance, capacitance of min-sized repeater ActualDelay of wire i = f(c p,c l,r i,c i,r o,c o ) TargetDelay of wire i = d i = (l i /l max ) x (1/f c ) OptimumRepeaterSize in wires of tier j = S j = Repeater insertion performed if ActualDelay < TargetDelay Delay Computation and Repeater Insertion

13 Rank Computation…(2) Rank of an IA is computed by assigning maximum number of wires from the WLD to tiers of the IA –by making ActualDelay  TargetDelay –within A R Maximizing the Rank requires optimum combination of –wires assigned to tiers –repeaters assigned to wires Exhaustive search over wires, tiers and repeaters is infeasible How to compute Rank efficiently? –Greedy approach or Dynamic Programming (DP)

14 Can Greed Correctly Compute Rank? Rank(IA) calculation: make ActualDelay  TargetDelay for max #wires, within A R budget –Assign wires to tiers; assign repeaters to wires –Example: Four wires, each requiring 4 repeaters to meet target delay in layer-pair j, but only 1 repeater in layer-pair j+1; repeater budget = 8 repeaters  Greed is not optimal Wires in layer-pair j meet target delay Wires in layer-pair j+1 fail to meet target delay Wire in layer-pair j meets target delay Wires in layer-pair j+1 meet target delay Greedy solution. Rank = 2 Optimum solution. Rank = 4

15 Outline Interconnect Architecture (IA) Interconnect Architecture Metric - Rank Dynamic Programming for Exact Rank Computation Experimental Results and Conclusions

16 DP formulation In DP formulation, Rank computation is considered as a collection of sub problems Sub problem characteristics –number of wires to be assigned (i) –number of tiers used for wire assignment (j) –repeater area used to satisfy delay requirements (r) –number of wires already assigned, that meet target delay (i’) A Boolean array M[i,j,r,i’] indicates the feasibility of wire assignment in successive stages of the DP M[i,j,r,i’] is 1 if –i wires can be assigned to –j tiers –with i’ wires meeting target delays –within r repeater area With all remaining n-i wires assigned to m-j tiers even without meeting target delay

17 DP Formulation…contd. The DP populates cells of M[i,j,r,i’] according to The DP has three main components –A means of preserving partial solutions of assignment –A method, testing feasibility of assigning some wires to tiers satisfying delay constraints within repeater budget –A method, testing feasibility of assigning all remaining wires to remaining tiers without delay considerations M[i, j+1, r, i’] = M[i 1 ’, j, r 1, i 1 ’] ^ …1 M’(i 1 ’, j+1, z r1, r-r 1, r 2, i 2 ’, i) ^…2 M’’(n, i, m, j+1, z r1 +z r2 )…3 s.t.r 1 + r 2 < r, i 1 ’ + i 2 ’ = i’ m = total # of tiers, n = total # of wires, i, i’, i 1 ’, i 2 ’ = wire indices, j = layer-pair index, r, r 1, r 2 = repeater area, z r1,z r2 = # repeaters

18 Components of the DP Algorithm Component (1) of the DP, M[i, j+1, r, i’] checks the feasibility of assigning wires 1,…,i in layer-pairs 1,…, j+1 using at most r repeater area s.t. i’ wires meet target delay Component (2) of the DP, M’(i 1 ’, j+1, z r1, r-r 1, r 2, i 2 ’, i) checks feasibility of assigning i 2 ’ (out of i’) wires in layer-pair j+1 using at most r-r 1 repeater area, where total set of wires assigned is 1,..i in layer-pairs 1,…,j+1 Component (3) of the DP, M’’(n, i, m, j+1, z r1 +z r2 ) checks feasibility of assigning wires i+1,…, n in layer-pairs j+2, …, m without any delay constraint Wire assignment for rank computation is performed by DP components (2) and (3) (2) performs wire assignment by meeting target delay (3) performs wire assignment greedily without delay considerations after repeater area is exhausted

19 Wire Assignment to the IA Wire assignment to the IA is performed by (2), (3) of the DP Wire assignment in (2) takes in to account –delays of wires and repeater insertion –area occupied by wires –via blockage due to wires from higher tiers and repeater vias In (3), greedy bottom-up wire assignment is performed to check: –feasibility of assigning n-i wires to m-j tiers –with via blockage from wires above –without any delay considerations

20 Outline Interconnect Architecture (IA) Interconnect Architecture Metric - Rank Dynamic Programming for Exact Rank Computation Experimental Results and Conclusions

21 Experimental Setup DP algorithm for Rank computation is implemented in C Rank computation is performed for –1M / 4M / 20M gate designs in the 180nm/130nm/90nm technology nodes respectively IAs are specified in tech files based on industry design rules Coarsening of WLD for speedup of rank computation –Binning –Bunching Study of variation of Rank with –ILD permittivity (K) –Miller Coupling Factor (MCF) –Repeater area fraction (R) –Target clock frequency (C)

22 Experimental Results Variation of Rank with K, MCF, C and R for 130nm, 1M gate design KRankMCFRankC R MHz MHz GHz GHz Rank increases with decreasing K – ILD permittivity increases with decreasing MCF – Miller Coupling Factor decreases with increasing C – Target Clock Frequency increases with increasing R – Repeater Area budget

23 Conclusions and Future Work Performance impact of ILD permittivity reduction can be matched by reduction in MCF For high-rank embeddings of future WLDs, material improvements alone are not sufficient Need to co-optimize design and process parameters On-going work includes –Alternative models of target delay requirements –Direct optimization of IA using Rank as the metric –Comparison of optimized IA’s with foundry, ITRS IA’s