SLIP 2008, Newcastle Revisiting Fidelity: A Case of Elmore-based Y-routing Trees Tuhina Samanta*, Prasun Ghosal*, Hafizur Rahaman* and Parthasarathi Dasgupta†

Slides:



Advertisements
Similar presentations
THERMAL-AWARE BUS-DRIVEN FLOORPLANNING PO-HSUN WU & TSUNG-YI HO Department of Computer Science and Information Engineering, National Cheng Kung University.
Advertisements

OCV-Aware Top-Level Clock Tree Optimization
Advanced Interconnect Optimizations. Buffers Improve Slack RAT = 300 Delay = 350 Slack = -50 RAT = 700 Delay = 600 Slack = 100 RAT = 300 Delay = 250 Slack.
Cadence Design Systems, Inc. Why Interconnect Prediction Doesn’t Work.
4/22/ Clock Network Synthesis Prof. Shiyan Hu Office: EREC 731.
ELEN 468 Lecture 261 ELEN 468 Advanced Logic Design Lecture 26 Interconnect Timing Optimization.
1 Advancing Supercomputer Performance Through Interconnection Topology Synthesis Yi Zhu, Michael Taylor, Scott B. Baden and Chung-Kuan Cheng Department.
Wen-Hao Liu1, Yih-Lang Li, and Cheng-Kok Koh Department of Computer Science, National Chiao-Tung University School of Electrical and Computer Engineering,
1 Interconnect Layout Optimization by Simultaneous Steiner Tree Construction and Buffer Insertion Presented By Cesare Ferri Takumi Okamoto, Jason Kong.
National Tsing Hua University Po-Yang Hsu,Hsien-Te Chen,
5/14/ Routing Prof. Shiyan Hu Office: EERC 731.
Constructing Minimal Spanning Steiner Trees with Bounded Path Length Presenter : Cheng-Yin Wu, NTUGIEE Some of the Slides in this Presentation are Referenced.
Coupling-Aware Length-Ratio- Matching Routing for Capacitor Arrays in Analog Integrated Circuits Kuan-Hsien Ho, Hung-Chih Ou, Yao-Wen Chang and Hui-Fang.
Chop-SPICE: An Efficient SPICE Simulation Technique For Buffered RC Trees Myung-Chul Kim, Dong-Jin Lee and Igor L. Markov Dept. of EECS, University of.
Toward Better Wireload Models in the Presence of Obstacles* Chung-Kuan Cheng, Andrew B. Kahng, Bao Liu and Dirk Stroobandt† UC San Diego CSE Dept. †Ghent.
Minimum-Buffered Routing of Non- Critical Nets for Slew Rate and Reliability Control Supported by Cadence Design Systems, Inc. and the MARCO Gigascale.
38 th Design Automation Conference, Las Vegas, June 19, 2001 Creating and Exploiting Flexibility in Steiner Trees Elaheh Bozorgzadeh, Ryan Kastner, Majid.
1 Internet Networking Spring 2006 Tutorial 6 Network Cost of Minimum Spanning Tree.
On Modeling and Sensitivity of Via Count in SOC Physical Implementation Kwangok Jeong Andrew B. Kahng.
University of Toronto Pre-Layout Estimation of Individual Wire Lengths Srinivas Bodapati (Univ. of Illinois) Farid N. Najm (Univ. of Toronto)
1 Internet Networking Spring 2004 Tutorial 6 Network Cost of Minimum Spanning Tree.
A Global Minimum Clock Distribution Network Augmentation Algorithm for Guaranteed Clock Skew Yield A. B. Kahng, B. Liu, X. Xu, J. Hu* and G. Venkataraman*
Processing Rate Optimization by Sequential System Floorplanning Jia Wang 1, Ping-Chih Wu 2, and Hai Zhou 1 1 Electrical Engineering & Computer Science.
ISPD 2000, San DiegoApr 10, Requirements for Models of Achievable Routing Andrew B. Kahng, UCLA Stefanus Mantik, UCLA Dirk Stroobandt, Ghent.
6/29/ Routing Prof. Shiyan Hu Office: EERC 731.
Fuzzy Evolutionary Algorithm for VLSI Placement Sadiq M. SaitHabib YoussefJunaid A. Khan Department of Computer Engineering King Fahd University of Petroleum.
A Resource-level Parallel Approach for Global-routing-based Routing Congestion Estimation and a Method to Quantify Estimation Accuracy Wen-Hao Liu, Zhen-Yu.
UNIVERSITY OF JYVÄSKYLÄ Resource Discovery in Unstructured P2P Networks Distributed Systems Research Seminar on Mikko Vapa, research student.
CDCTree: Novel Obstacle-Avoiding Routing Tree Construction based on Current Driven Circuit Model Speaker: Lei He.
Noise and Delay Uncertainty Studies for Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu † and Devendra Vidhani ‡ UCLA Computer Science Department,
VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig 1 FLUTE: Fast Lookup Table Based RSMT Algorithm.
A Methodology for Interconnect Dimension Determination By: Jeff Cobb Rajesh Garg Sunil P Khatri Department of Electrical and Computer Engineering, Texas.
Introduction to Routing. The Routing Problem Apply after placement Input: –Netlist –Timing budget for, typically, critical nets –Locations of blocks and.
Area-I/O Flip-Chip Routing for Chip-Package Co-Design Progress Report 方家偉、張耀文、何冠賢 The Electronic Design Automation Laboratory Graduate Institute of Electronics.
Escape Routing For Dense Pin Clusters In Integrated Circuits Mustafa Ozdal, Design Automation Conference, 2007 Mustafa Ozdal, IEEE Trans. on CAD, 2009.
Operations Research Assistant Professor Dr. Sana’a Wafa Al-Sayegh 2 nd Semester ITGD4207 University of Palestine.
On the Construction of Data Aggregation Tree with Minimum Energy Cost in Wireless Sensor Networks: NP-Completeness and Approximation Algorithms National.
CAFE router: A Fast Connectivity Aware Multiple Nets Routing Algorithm for Routing Grid with Obstacles Y. Kohira and A. Takahashi School of Computer Science.
Global Routing.
FPGA Switch Block Design Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
EE 5900 Advanced Algorithms for Robust VLSI CAD, Spring 2009 Static Timing Analysis and Gate Sizing.
1 Global Routing Method for 2-Layer Ball Grid Array Packages Yukiko Kubo*, Atsushi Takahashi** * The University of Kitakyushu ** Tokyo Institute of Technology.
PiCAP: A Parallel and Incremental Capacitance Extraction Considering Stochastic Process Variation Fang Gong 1, Hao Yu 2, and Lei He 1 1 Electrical Engineering.
Archer: A History-Driven Global Routing Algorithm Mustafa Ozdal Intel Corporation Martin D. F. Wong Univ. of Illinois at Urbana-Champaign Mustafa Ozdal.
An Efficient Clustering Algorithm For Low Power Clock Tree Synthesis Rupesh S. Shelar Enterprise Microprocessor Group Intel Corporation, Hillsboro, OR.
New Modeling Techniques for the Global Routing Problem Anthony Vannelli Department of Electrical and Computer Engineering University of Waterloo Waterloo,
Thermal-aware Steiner Routing for 3D Stacked ICs M. Pathak and S.K. Lim Georgia Institute of Technology ICCAD 07.
A Faster Approximation Scheme for Timing Driven Minimum Cost Layer Assignment Shiyan Hu*, Zhuo Li**, and Charles J. Alpert** *Dept of ECE, Michigan Technological.
Modern VLSI Design 3e: Chapter 10 Copyright  1998, 2002 Prentice Hall PTR Topics n CAD systems. n Simulation. n Placement and routing. n Layout analysis.
1 Efficient Obstacle-Avoiding Rectilinear Steiner Tree Construction Chung-Wei Lin, Szu-Yu Chen, Chi-Feng Li, Yao-Wen Chang, Chia-Lin Yang National Taiwan.
Chris Chu Iowa State University Yiu-Chung Wong Rio Design Automation
Radhamanjari Samanta *, Soumyendu Raha * and Adil I. Erzin # * Supercomputer Education and Research Centre, Indian Institute of Science, Bangalore, India.
Routing Tree Construction with Buffer Insertion under Obstacle Constraints Ying Rao, Tianxiang Yang Fall 2002.
Maze Routing Algorithms with Exact Matching Constraints for Analog and Mixed Signal Designs M. M. Ozdal and R. F. Hentschke Intel Corporation ICCAD 2012.
Prof. Shiyan Hu Office: EERC 518
Routing Topology Algorithms Mustafa Ozdal 1. Introduction How to connect nets with multiple terminals? Net topologies needed before point-to-point routing.
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Wire delay. n Buffer insertion. n Crosstalk. n Inductive interconnect.
A Novel Timing-Driven Global Routing Algorithm Considering Coupling Effects for High Performance Circuit Design Jingyu Xu, Xianlong Hong, Tong Jing, Yici.
Dirk Stroobandt Ghent University Electronics and Information Systems Department Multi-terminal Nets do Change Conventional Wire Length Distribution Models.
Proximity Optimization for Adaptive Circuit Design Ang Lu, Hao He, and Jiang Hu.
6/19/ VLSI Physical Design Automation Prof. David Pan Office: ACES Placement (3)
הטכניון - מ.ט.ל. הפקולטה להנדסת חשמל - אביב תשס"ה
Confidential & Proprietary – All Rights Reserved Internal Distribution, October Quality of Service in Multimedia Distribution G. Calinescu (Illinois.
VLSI Physical Design Automation
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Interconnect Architecture
Buffered tree construction for timing optimization, slew rate, and reliability control Abstract: With the rapid scaling of IC technology, buffer insertion.
EE4271 VLSI Design, Fall 2016 VLSI Channel Routing.
Clock Tree Routing With Obstacles
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Presentation transcript:

SLIP 2008, Newcastle Revisiting Fidelity: A Case of Elmore-based Y-routing Trees Tuhina Samanta*, Prasun Ghosal*, Hafizur Rahaman* and Parthasarathi Dasgupta† *Bengal Engineering & Science University, India †Indian Institute of Management Calcutta, India `

SLIP 2008, Newcastle Outline Y-Routing and Y-Routing Trees Fidelity of Delay Estimators Some Previous works Motivation of our work Construction of Elmore-based Y-Routing tree Fidelity and Rank Correlation Experimental Results Conclusions

SLIP 2008, Newcastle Routing – a critical step For a given set of terminals, connect different subsets (nets) of these terminals Cost components to be minimized wire length number of layers via delay Some recent Challenges in DSM Physical Design D ominance of interconnect length Congestion Parasitic effects Thermal effects

SLIP 2008, Newcastle Steiner trees for Routing Involves the addition of a set of Steiner points to a given set of demand points (terminals) to generate an interconnection (with minimum wire-length, etc.) Traditional Manhattan routing allows edges in only two directions viz. horizontal and vertical Objectives : –Minimizing total/maximum sink delay –Maximizing Required-Arrival-Time at the source for a given set of Required-Arrival-Times at sink terminals

SLIP 2008, Newcastle Generalized (λ-) Steiner trees λ = 2 for Manhattan Steiner trees, λ = 3 for Y-Steiner trees, λ = 4 for X- Steiner trees λ extends from 2 to 4 => number of metal layers increases from two to four More the number of layers => more the number of vias Cost to implement increases with increasing λ

SLIP 2008, Newcastle Y-routing Routing wires along 0-, 120- and 240-degree (hexagonal) orientations a) X-routing gridsb) Y-routing grids H. Chen et al., The Y-Architecture for On-Chip Interconnect: Analysis and Methodology, IEEE TCAD/ICAS, 2004

SLIP 2008, Newcastle Y-routing : Pros and Cons Pros:  number of layers not too large  uniform routing grid  more throughput / routability  simple DRC Cons:  more number of layers than in Manhattan routing  more wire-length than in X-routing  …..

SLIP 2008, Newcastle Y-routing Trees An Example Y-routing tree T. Samanta et al. A Heuristic Method for Constructing Hexagonal Steiner Minimal Trees for Routing in VLSI, ISCAS

SLIP 2008, Newcastle Outline Y-Routing and Y-Routing Trees Fidelity of Delay Estimators Some Previous works Motivation of our work Construction of Elmore-based Y-Routing tree Fidelity and Rank Correlation Experimental Results Conclusions

SLIP 2008, Newcastle Accuracy of Heuristics Heuristics - used to solve hard problems in reasonable time Quality measured by the degree of nearness to the Optimal Solution How to distinguish good heuristics from bad heuristics? What if the Optimal Solution Cost is Unknown?

SLIP 2008, Newcastle Relative Accuracy of Heuristics Measuring relative performances of different heuristics Generate all possible solutions Compute the costs of the generated solutions using each heuristic Find the deviations in the performances of the heuristics based on these costs

SLIP 2008, Newcastle Fidelity of a Heuristic h i, h j = cost of heuristic solution for instances i,j s i, s j = cost of optimal solution for instances i,j m = total number of instances considered Fidelity of heuristic (f) = |(i,j): 0 0 or (s i = s j )| ( ) m 2 J L Ganley, Accuracy and fidelity of fast net length estimates. Integration, the VLSI Journal, 1997.

SLIP 2008, Newcastle Why bother about Signal Delay? Global Routing trees typically constructed with an objective of minimizing circuit delay to increase speed of the circuit Accurate measurement of signal delay very important Exact signal delay measurement is too complex and time consuming Delay estimators should correctly discriminate the good solutions from the bad ones

SLIP 2008, Newcastle Estimating Signal Delay Linear Delay Elmore Delay Higher order moments – 2-Pole Others – Fitted Elmore delay SPICE

SLIP 2008, Newcastle Linear Delay Delay = d 1 + d 2 d1d1 d2d2

SLIP 2008, Newcastle Elmore Delay Delay through an on-path resistor = resistance  downstream capacitance Delay through a path (driver to a sink pin) = Sum of delays through individual edges on the path First moment of interconnect under impulse response Based on the 50% delay Rest of circuit r C1/2 C2 C1/2 Source Delay through r = r.(c1/2 + C2)

SLIP 2008, Newcastle Source, S AB Delay (S, B) = Delay(S, A) + Delay(A, B) Elmore Delay Characteristics Fairly accurate delay estimate at nodes far from source Expressible as a closed-form expression involving only resistors and capacitors Provable upper bound on actual delay for all inputs Additive P. Penfield, J. Rubinstein, M. A. Horowitz. Signal Delay in RC tree networks, IEEE TCAD/ICAS, July 1983.

SLIP 2008, Newcastle Elmore Delay Computation A possible scheme of traversing a RC tree: Pass 1: Compute the effective capacitance at each node of the RC tree Pass 2: At a node, compute the actual Elmore delay (from the source) from the sum of (a) delay up to the predecessor node, and (b) the product of the resistance between the predecessor node and the current node, and the effective capacitance at current node obtained in Pass 1.

SLIP 2008, Newcastle Degree to which an optimal or near-optimal solution according to a delay estimator will also be optimal or near-optimal according to the actual delay. For a set of possible solutions obtained using the estimator, how close are the ranks correlated to those for the solutions obtained by the actual delay measurement? Fidelity of Delay Estimators

SLIP 2008, Newcastle Computing Fidelity Enumerate all possible routing solutions. Rank all tree topologies using the estimator. Rank all tree topologies by SPICE delay model (actual). Find the average difference between the two sets of ranks for each topology.

SLIP 2008, Newcastle Outline Y-Routing and Y-Routing Trees Fidelity of Delay Estimators Some Previous works Motivation of our work Construction of Elmore-based Y-Routing tree Fidelity and Rank Correlation Experimental Results Conclusions

SLIP 2008, Newcastle Some Previous Works Construction of near-optimal routing trees based on Elmore delay (Boese et al, ICCAD. 1993) Optimum wire sizing in routing trees (Cong et al, ACMTODAES, 1996) P-tree – Cheng et al, Based on Permutation Trees Q-Tree – Kahng et al

SLIP 2008, Newcastle Outline Y-Routing and Y-Routing Trees Fidelity of Delay Estimators Some Previous works Motivation of our work Construction of Elmore-based Y-Routing tree Fidelity and Rank Correlation Experimental Results Conclusions

SLIP 2008, Newcastle Existing works: Caveats Fidelity definitions and experiments –Mostly Common-sense based (average, standard deviation, etc.) –Relevant data sets rarely considered –Tested only for Manhattan Architectures –Presence of tie solutions

SLIP 2008, Newcastle Objectives of our work Verifying the fidelity relations of Linear and Elmore Delay for Non-Manhattan routing architectures, specifically the Y-Routing architectures Studying the fidelity values for relevant routing topologies Defining some new fidelity metrics based on non- parametric statistical quantities, and studying the performances of Linear and Elmore Delay based on these new metrics

SLIP 2008, Newcastle Outline Y-Routing and Y-Routing Trees Fidelity of Delay Estimators Some Previous works Motivation of our work Construction of Elmore-based Y-Routing tree Fidelity and Rank Correlation Experimental Results Conclusions

SLIP 2008, Newcastle Proposed Algorithm: Overview Input: A set of given terminals P. Output: A Y-routed Elmore Routing Tree over P Step 1.Construct the Hanan grid Step 2.For each edge e Є E Find the sum (dtot(e)) of the perpendicular distances of all the terminals (in P) from e Step 3.ω(e) = √(length of e × dtot(e)) ω(e) a llows resolving tie cases during shortest path construction ω(e) en ables construction of trees of shorter lengths and thus of reduced Elmore delays

SLIP 2008, Newcastle Proposed Algorithm Step 4.Choose source vertex n 0 and arbitrarily any one sink vertex n Find the shortest (in total ω) path between n 0 and n This forms the partial Steiner Tree Spart Step 5.while not all terminals of P have been selected Step 6. for all the terminals n not yet selected Step 7.find the terminal which when connected to Spart with the shortest path yields minimum Elmore delay Step 8.augment Spart with this shortest path Step 9.return S part

SLIP 2008, Newcastle Proposed Algorithm - Example n0n0

SLIP 2008, Newcastle Proposed Algorithm - Example n0n0

SLIP 2008, Newcastle Proposed Algorithm - Example n0n0

SLIP 2008, Newcastle Proposed Algorithm - Example n0n0

SLIP 2008, Newcastle Proposed Algorithm - Example n0n0

SLIP 2008, Newcastle Proposed Algorithm - Example n0n0

SLIP 2008, Newcastle Outline Y-Routing and Y-Routing Trees Fidelity of Delay Estimators Some Previous works Motivation of our work Construction of Elmore-based Y-Routing tree Fidelity and Rank Correlation Experimental Results Conclusions

SLIP 2008, Newcastle Rank Correlation for Fidelity Does fidelity computation have anything to do with rank correlation? Should fidelity base on discrimination of ALL candidate solutions or only the good solutions? How would the tie cases be handled? Should fidelity be dimensionless?

SLIP 2008, Newcastle Spearman’s Rank Correlation Two variables x and y n pairs of observations on x and y are ordered in magnitude Define rank of ordered observation x j as rank(x j ) = j Spearman’s Rank Correlation is R R =  i=1 n d i 2 /(n(n 2 – 1)), where d i = |rank(x i ) – rank(y i )|

SLIP 2008, Newcastle Kendall’s Tau Two variables x and y n pairs of observations on x and y are ordered in magnitude Q = total number of interchanges required for the y values to order them as x values where 0 < Q < n(n-1)/2 Kendall’s Tau =   = 1 – 4Q / n(n-1)  = (# concordant pairs - # discordant pairs) / total # of pairs

SLIP 2008, Newcastle Kendall’s Tau: Properties If the agreement between the two rankings is perfect (i.e., the two rankings are the same) the coefficient has value 1. If the disagreement between the two rankings is perfect (i.e., one ranking is the reverse of the other) the coefficient has value -1. For all other arrangements the value lies between -1 and 1, and increasing values imply increasing agreement between the rankings. If the rankings are completely independent, the coefficient has value 0 on average.

SLIP 2008, Newcastle Kendall’s Tau: Example DelayTopo1Topo2Topo3Topo4Topo5Topo6Topo7Topo8 Elmore rank SPICE rank Total no. of pairs = 28 No. of concordant pairs = 22 No. of discordant pairs = 6  = 0.57

SLIP 2008, Newcastle Normalized Delay δ = absolute delay value for an estimator µ = the average of all absolute values for the estimator σ = the standard deviation Normalized value for delay is given by δ norm = (δ – μ)/σ Fidelity of Elmore (Linear) = ratio of the normalized delay of Elmore (Linear) to Spice

SLIP 2008, Newcastle Outline Y-Routing and Y-Routing Trees Fidelity of Delay Estimators Some Previous works Motivation of our work Construction of Elmore-based Y-Routing tree Fidelity and Rank Correlation Experimental Results Conclusions

SLIP 2008, Newcastle Technology Node Parameters B A McCoy K D Boese, A B Kahng and G Robins. Fidelity and near- optimality of elmore-based routing constructions. In Proc. of the IEEE International Conference on Computer Design, pp. 81–84, October 1993

SLIP 2008, Newcastle Spearman’s Correlation Coefficient

SLIP 2008, Newcastle Kendall’s Tau

SLIP 2008, Newcastle Normalized Delay Ratios

SLIP 2008, Newcastle Outline Y-Routing and Y-Routing Trees Fidelity of Delay Estimators Some Previous works Motivation of our work Construction of Elmore-based Y-Routing tree Fidelity and Rank Correlation Experimental Results Conclusions

SLIP 2008, Newcastle What did we achieve? An efficient algorithm for constructing –Y-routing tree of minimum Elmore delay Efficient schemes for –Computing fidelities of Elmore and Linear delay estimates against Spice delay values Confirmation that Fidelity of Elmore is better than fidelity of Linear for unbuffered routing trees

SLIP 2008, Newcastle Possible extensions Extending the algorithm for X- and M-routing trees and comparing fidelity results for these architectures Experiments with better delay metrics such as 2-Pole Experiments with larger nets with variations of several parameters