1 Physical Hierarchy Generation with Routing Congestion Control Chin-Chih Chang *, Jason Cong *, Zhigang (David) Pan +, and Xin Yuan * * UCLA Computer.

Slides:



Advertisements
Similar presentations
Porosity Aware Buffered Steiner Tree Construction C. Alpert G. Gandham S. Quay IBM Corp M. Hrkic Univ Illinois Chicago J. Hu Texas A&M Univ.
Advertisements

Group: Wilber L. Duran Duo (Steve) Liu
Optimization of Placement Solutions for Routability Wen-Hao Liu, Cheng-Kok Koh, and Yih-Lang Li DAC’13.
Wen-Hao Liu1, Yih-Lang Li, and Cheng-Kok Koh Department of Computer Science, National Chiao-Tung University School of Electrical and Computer Engineering,
3D-STAF: Scalable Temperature and Leakage Aware Floorplanning for Three-Dimensional Integrated Circuits Pingqiang Zhou, Yuchun Ma, Zhouyuan Li, Robert.
Meng-Kai Hsu, Sheng Chou, Tzu-Hen Lin, and Yao-Wen Chang Electronics Engineering, National Taiwan University Routability Driven Analytical Placement for.
A Size Scaling Approach for Mixed-size Placement Kalliopi Tsota, Cheng-Kok Koh, Venkataramanan Balakrishnan School of Electrical and Computer Engineering.
Ripple: An Effective Routability-Driven Placer by Iterative Cell Movement Xu He, Tao Huang, Linfu Xiao, Haitong Tian, Guxin Cui and Evangeline F.Y. Young.
Coupling-Aware Length-Ratio- Matching Routing for Capacitor Arrays in Analog Integrated Circuits Kuan-Hsien Ho, Hung-Chih Ou, Yao-Wen Chang and Hui-Fang.
FastPlace: Efficient Analytical Placement using Cell Shifting, Iterative Local Refinement and a Hybrid Net Model FastPlace: Efficient Analytical Placement.
Early Days of Circuit Placement Martin D. F. Wong Department of Electrical and Computer Engineering University of Illinois at Urbana-Champaign.
Congestion Driven Placement for VLSI Standard Cell Design Shawki Areibi and Zhen Yang School of Engineering, University of Guelph, Ontario, Canada December.
Layer Assignment Algorithm for RLC Crosstalk Minimization Bin Liu, Yici Cai, Qiang Zhou, Xianlong Hong Tsinghua University.
International Conference on Computer-Aided Design San Jose, CA Nov. 2001ER UCLA UCLA 1 Congestion Reduction During Placement Based on Integer Programming.
38 th Design Automation Conference, Las Vegas, June 19, 2001 Creating and Exploiting Flexibility in Steiner Trees Elaheh Bozorgzadeh, Ryan Kastner, Majid.
ER UCLA UCLA ICCAD: November 5, 2000 Predictable Routing Ryan Kastner, Elaheh Borzorgzadeh, and Majid Sarrafzadeh ER Group Dept. of Computer Science UCLA.
Placement 1 Outline Goal What is Placement? Why Placement?
An Algebraic Multigrid Solver for Analytical Placement With Layout Based Clustering Hongyu Chen, Chung-Kuan Cheng, Andrew B. Kahng, Bo Yao, Zhengyong Zhu.
Processing Rate Optimization by Sequential System Floorplanning Jia Wang 1, Ping-Chih Wu 2, and Hai Zhou 1 1 Electrical Engineering & Computer Science.
A Resource-level Parallel Approach for Global-routing-based Routing Congestion Estimation and a Method to Quantify Estimation Accuracy Wen-Hao Liu, Zhen-Yu.
Metal Layer Planning for Silicon Interposers with Consideration of Routability and Manufacturing Cost W. Liu, T. Chien and T. Wang Department of CS, NTHU,
POLAR 2.0: An Effective Routability-Driven Placer Chris Chu Tao Lin.
7/13/ EE4271 VLSI Design VLSI Routing. 2 7/13/2015 Routing Problem Routing to reduce the area.
Chip Planning 1. Introduction Chip Planning:  Deals with large modules with −known areas −fixed/changeable shapes −(possibly fixed locations for some.
General Routing Overview and Channel Routing
WISCAD – VLSI Design Automation GRIP: Scalable 3-D Global Routing using Integer Programming Tai-Hsuan Wu, Azadeh Davoodi Department of Electrical and Computer.
MGR: Multi-Level Global Router Yue Xu and Chris Chu Department of Electrical and Computer Engineering Iowa State University ICCAD
Power Reduction for FPGA using Multiple Vdd/Vth
1 Coupling Aware Timing Optimization and Antenna Avoidance in Layer Assignment Di Wu, Jiang Hu and Rabi Mahapatra Texas A&M University.
TSV-Aware Analytical Placement for 3D IC Designs Meng-Kai Hsu, Yao-Wen Chang, and Valerity Balabanov GIEE and EE department of NTU DAC 2011.
March 20, 2007 ISPD An Effective Clustering Algorithm for Mixed-size Placement Jianhua Li, Laleh Behjat, and Jie Huang Jianhua Li, Laleh Behjat,
Low-Power Gated Bus Synthesis for 3D IC via Rectilinear Shortest-Path Steiner Graph Chung-Kuan Cheng, Peng Du, Andrew B. Kahng, and Shih-Hung Weng UC San.
VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig 1 EECS 527 Paper Presentation High-Performance.
Archer: A History-Driven Global Routing Algorithm Mustafa Ozdal Intel Corporation Martin D. F. Wong Univ. of Illinois at Urbana-Champaign Mustafa Ozdal.
An Efficient Clustering Algorithm For Low Power Clock Tree Synthesis Rupesh S. Shelar Enterprise Microprocessor Group Intel Corporation, Hillsboro, OR.
New Modeling Techniques for the Global Routing Problem Anthony Vannelli Department of Electrical and Computer Engineering University of Waterloo Waterloo,
1 Wire Length Prediction-based Technology Mapping and Fanout Optimization Qinghua Liu Malgorzata Marek-Sadowska VLSI Design Automation Lab UC-Santa Barbara.
Efficient Multi-Layer Obstacle- Avoiding Rectilinear Steiner Tree Construction Chung-Wei Lin, Shih-Lun Huang, Kai-Chi Hsu,Meng-Xiang Li, Yao-Wen Chang.
Thermal-aware Steiner Routing for 3D Stacked ICs M. Pathak and S.K. Lim Georgia Institute of Technology ICCAD 07.
CSE 494: Electronic Design Automation Lecture 2 VLSI Design, Physical Design Automation, Design Styles.
Massachusetts Institute of Technology 1 L14 – Physical Design Spring 2007 Ajay Joshi.
Jason Cong‡†, Guojie Luo*†, Kalliopi Tsota‡, and Bingjun Xiao‡ ‡Computer Science Department, University of California, Los Angeles, USA *School of Electrical.
IO CONNECTION ASSIGNMENT AND RDL ROUTING FOR FLIP-CHIP DESIGNS Jin-Tai Yan, Zhi-Wei Chen 1 ASPDAC.2009.
VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 6: Detailed Routing © KLMH Lienig 1 What Makes a Design Difficult to Route Charles.
ARCHER:A HISTORY-DRIVEN GLOBAL ROUTING ALGORITHM Muhammet Mustafa Ozdal, Martin D. F. Wong ICCAD ’ 07.
Large Scale Circuit Placement: Gap and Promise Jason Cong UCLA VLSI CAD LAB 1 Joint work with Chin-Chih Chang, Tim Kong, Michail Romesis, Joseph R. Shinnerl,
Tao Lin Chris Chu TPL-Aware Displacement- driven Detailed Placement Refinement with Coloring Constraints ISPD ‘15.
1 ER UCLA ISPD: Sonoma County, CA, April, 2001 An Exact Algorithm for Coupling-Free Routing Ryan Kastner, Elaheh Bozorgzadeh,Majid Sarrafzadeh.
An Effective Congestion Driven Placement Framework André Rohe University of Bonn, Germany joint work with Ulrich Brenner.
Optimality, Scalability and Stability study of Partitioning and Placement Algorithms Jason Cong, Michail Romesis, Min Xie UCLA Computer Science Department.
Routability-driven Floorplanning With Buffer Planning Chiu Wing Sham Evangeline F. Y. Young Department of Computer Science & Engineering The Chinese University.
Unified Quadratic Programming Approach for Mixed Mode Placement Bo Yao, Hongyu Chen, Chung-Kuan Cheng, Nan-Chi Chou*, Lung-Tien Liu*, Peter Suaris* CSE.
International Symposium on Physical Design San Diego, CA April 2002ER UCLA UCLA 1 Routability Driven White Space Allocation for Fixed-Die Standard-Cell.
Congestion Analysis for Global Routing via Integer Programming Hamid Shojaei, Azadeh Davoodi, and Jeffrey Linderoth* Department of Electrical and Computer.
Global Clustering-Based Performance-Driven Circuit Partitioning Jason Cong University of California Los Angeles Chang Wu Aplus Design.
A Novel Timing-Driven Global Routing Algorithm Considering Coupling Effects for High Performance Circuit Design Jingyu Xu, Xianlong Hong, Tong Jing, Yici.
Interconnect Characteristics of 2.5-D System Integration Scheme Yangdong (Steven) Deng & Wojciech P. Maly
A Snap-On Placement Tool Israel Waldman. Introduction.
6/19/ VLSI Physical Design Automation Prof. David Pan Office: ACES Placement (3)
Placement and Routing Algorithms. 2 FPGA Placement & Routing.
VLSI Physical Design Automation
HeAP: Heterogeneous Analytical Placement for FPGAs
VLSI Physical Design Automation
Multilevel Full-Chip Routing for the X-Based Architecture
2 University of California, Los Angeles
Placement and Routing With Congestion Control
mPL 5 Overview ISPD 2005 Placement Contest Entry
EDA Lab., Tsinghua University
VLSI Physical Design Automation
Under a Concurrent and Hierarchical Scheme
Presentation transcript:

1 Physical Hierarchy Generation with Routing Congestion Control Chin-Chih Chang *, Jason Cong *, Zhigang (David) Pan +, and Xin Yuan * * UCLA Computer Science Department + IBM T.J. Watson Research Center This paper is supported in part by SRC, an IBM Faculty Partnership Award, a grant from Intel, and a grant from Fujitsu under the California MICRO program

2 Overview n Motivation and problem formulation for physical hierarchy generation n Algorithm and contributions u Multilevel coarse placement framework u Hierarchical area density control u Fast incremental global routing n Experimental results n Conclusions

3 Challenges in Deep Sub-micron VLSI Designs n Performance Problems – need to optimize the dominating factor, i.e. interconnect delays u See interconnects as early as possible u Optimize interconnects in almost all design stages n Design convergence problems – need to eliminate mismatches between early estimations and final layouts u Accurate estimation/optimization of interconnect delay in early design stages u Consider interconnect routability in early design stages u Consider crosstalk noise impacts in early design stages n Require accurate global interconnect estimation/optimization in early design stages

4 Physical Hierarchy Generation Problem Formulation Hard IPSoft module Same color for modules of the same logic hierarchy Logical Hierarchy Assign modules to physical hierarchy Defines global interconnects Optimization objectives of this work: wire length minimization routing congestion minimization Physical Hierarchy = Placement bins + module locations Other objectives could also be used (not a complete list): performance, noise, power, etc.

5 Discussions on Previous Work on Placement with Routability Considerations n Modeling methods: u Weighted BBOX [Cheng ICCAD’94], weighted BBOX with congestion region expansion: [Yang ICCAD’01] u Reconstruction of Steiner tree on each move: [Tsay Intl. Conf. Asic’92] n Optimization methods: u Recursive partition placement with pre-computed Steiner tree [Mayrhofer ICCAD’90] u Cell padding or region growing/shrinking: [Hou ASPDAC’01], [Sadakane CICC’97], [Parakh DAC’98], [Brenner ISPD’02], [Yang ISPD’02] n Most accurate routing estimation from global routing itself. Need to find tradeoff between accuracy and run time

6 Algorithm Overview: V-shape Multi- Level Coarse Placement Coarsening by clustering Refinement by placement Initial Placement n Congestion driven at the finest few placement levels n Fast global routing for congestion estimation

7 Algorithm Overview - Clustering Finest cluster levelCoarsest cluster level n Clustering: group clusters (or cells) together n Usually under certain area constraints n Clustering criteria: connectivity driven, performance driven, etc.

8 Algorithm Overview - Refinement by Placement Initial Coarsest Level Placement Declustering Placement Declustering Placement Final coarse placement solution n Use the same grid structure in each level of placement n Variable cluster size (may bigger than a bin): handled by hierarchical area density control n Use fast incremental routing for congestion estimation

9 Area Density Problems in Multi-level Coarse Placement n Traditional area density control: n Cell area in each bin < bin area utilization with a small percentage of overflow n Does not work when cluster sizes may have significant variations and may be bigger than a bin n How about use different grid sizes for different levels of clustering? n Hard to find fixed percentages that works n Significant placement cost jump when switch grid sizes

10 Hierarchical Area Density Control n Use the same grid structure for placement for all clustering levels n Impose hierarchy on bin structure for area density control n Each cluster move must satisfy the area constraints on each level in the bin hierarchy n Area constraint for moving a cell of size A n Allowed overflow on each level in the bin hierarchy = kA, k is a small constant (usually 1 or 2) n Work well in multi-level framework: n Area constraints gradually tightened during optimization

11 Fast Incremental A-tree Routing for Multi-pin Nets  Simple incremental A-tree  Recursively Quad-partition grids  Each pin recursively connects to lower left corner of each level of partition  For net with bounding box length B, at most 2 *log B edge updates for each pin move, except the root.  Each edge routed by LZ-router First Quadrant Root(source pin)

12 Fast LZ-routing for Two-pin Connections n Decide HVH or VHV: n Select the less congested layer n Binary search on V-stem (or H-stem) n Initial left region and right region to cover bounding box n Repeat n Query wire usage on both regions n Select region with less congestion n Wire usage query can be done in O(log grid_size) Left region Right region HVH VHV

13 Placement Cost Functions n Wire length driven: Summation of net bounding boxes of all nets n Congestion driven: u Wire usages estimated from the fast global router u Cost = Summation of square of wire usages in all bins u For fixed wire width F cost equivalent to summation of weighted wire length, weight on a bin = wire usage of the bin u For congestion driven run: only turns on congestion driven cost at the finest placement level W1W2 W3 Congestion cost = W1 2 + W2 2 + … + W9 2 W4W5 W6 W7W8 W9

14 Experimental Results on Wire Length Minimization n Multi-level simulated annealing coarse placement n Wire length comparison with GORDIAN-L: u Our engine only turns on wire length optimization u Legalized by DOMINO for wire length comparison Our multi-level engine performs well for big circuits 20k-50k test cases: avqlarge, avqsmall, ibm04, ibm07 50k-100k test cases: ibm09, ibm10 100k-210k test cases: ibm14, ibm15, ibm16, ibm17, ibm18

15 Experimental Results on Congestion Control BBOX WLRouted WLMax boundary congestion Total overflow CPU mPG11111 mPG-cg.rd mPG-cg Test cases: ibm01, ibm04, ibm07, ibm11, ibm13, ibm15 mPG: wire length driven mode mPG-cg: congestion driven at finest clustering level mPG-cg.rd: alternative congestion driven + wire length driven at fines clustering level

16 Conclusions n Multi-level simulated annealing coarse placement u Hierarchical area density control u Fast global routing estimation u Capable of wire length minimization with/without congestion minimization n Compare to GordianL, mPG generates comparable solution with 3-6 times speedup for test cases > 100K n Congestion driven mPG reduce estimated global routing overflows by 50%-80% with 6-19 times CPU time