F.F. Dragan (Kent State) A.B. Kahng (UCSD) I. Mandoiu (Georgia Tech/UCLA) S. Muddu (Silicon Graphics) A. Zelikovsky (Georgia State) Provably Good Global.

Slides:



Advertisements
Similar presentations
Truthful Spectrum Auction Design for Secondary Networks Yuefei Zhu ∗, Baochun Li ∗ and Zongpeng Li † ∗ Electrical and Computer Engineering, University.
Advertisements

OCV-Aware Top-Level Clock Tree Optimization
Linear Programming (LP) (Chap.29)
The Randomization Repertoire Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopThe Randomization Repertoire1.
Optimization of Linear Placements for Wirelength Minimization with Free Sites A. B. Kahng, P. Tucker, A. Zelikovsky (UCLA & UCSD) Supported by grants from.
1 Interconnect Layout Optimization by Simultaneous Steiner Tree Construction and Buffer Insertion Presented By Cesare Ferri Takumi Okamoto, Jason Kong.
F.F. Dragan (Kent State) A.B. Kahng (UCSD) I. Mandoiu (UCLA) S. Muddu (Sanera Systems) A. Zelikovsky (Georgia State) Provably Good Global Buffering by.
Lecture 10: Integer Programming & Branch-and-Bound
A.B. Kahng, Ion I. Mandoiu University of California at San Diego, USA A.Z. Zelikovsky Georgia State University, USA Supported in part by MARCO GSRC and.
1 EL736 Communications Networks II: Design and Algorithms Class8: Networks with Shortest-Path Routing Yong Liu 10/31/2007.
© Yamacraw, 2001 Minimum-Buffered Routing of Non-Critical Nets for Slew Rate and Reliability A. Zelikovsky GSU Joint work with C. Alpert.
Minimum-Buffered Routing of Non- Critical Nets for Slew Rate and Reliability Control Supported by Cadence Design Systems, Inc. and the MARCO Gigascale.
Interval packing problem Multicommodity demand flow in a line Jian Li Sep
38 th Design Automation Conference, Las Vegas, June 19, 2001 Creating and Exploiting Flexibility in Steiner Trees Elaheh Bozorgzadeh, Ryan Kastner, Majid.
F.F. Dragan (Kent State) A.B. Kahng (UCSD) I. Mandoiu (UCLA/UCSD) S. Muddu (Sanera Systems) A. Zelikovsky (Georgia State) Practical Approximation Algorithms.
Power-Aware Placement
Approximation Algorithms
ABSTRACT We consider the problem of buffering a given tree with the minimum number of buffers under load cap and buffer skew constraints. Our contributions.
Provably Good Global Buffering Using an Available Buffer Block Plan F. F. Dragan (Kent) A. B. Kahng (UCLA) I. Mandoiu (Gatech) S. Muddu (Silicon graphics)
1 A Shifting Strategy for Dynamic Channel Assignment under Spatially Varying Demand Harish Rathi Advisors: Prof. Karen Daniels, Prof. Kavitha Chandra Center.
Supply Voltage Degradation Aware Analytical Placement Andrew B. Kahng, Bao Liu and Qinke Wang UCSD CSE Department {abk, bliu,
On Legalization of Row-Based Placements Andrew B. KahngSherief Reda CSE & ECE Departments University of CA, San Diego La Jolla, CA 92093
Yield- and Cost-Driven Fracturing for Variable Shaped-Beam Mask Writing Andrew B. Kahng CSE and ECE Departments, UCSD Xu Xu CSE Department, UCSD Alex Zelikovsky.
1 UCSD VLSI CAD Laboratory ISQED-2009 Revisiting the Linear Programming Framework for Leakage Power vs. Performance Optimization Kwangok Jeong, Andrew.
Non-tree Routing for Reliability & Yield Improvement A.B. Kahng – UCSD B. Liu – Incentia I.I. Mandoiu – UCSD Work supported by Cadence, MARCO GSRC, and.
EDA (CS286.5b) Day 19 Covering and Retiming. “Final” Like Assignment #1 –longer –more breadth –focus since assignment #2 –…but ideas are cummulative –open.
Distributed Combinatorial Optimization
Floorplan Evaluation with Timing-Driven Global Wireplanning, Pin Assignment and Buffer / Wire Sizing Christoph Albrecht Synopsys, Inc., Mountain View formerly.
1 Introduction to Approximation Algorithms Lecture 15: Mar 5.
Ion I. Mandoiu Ph.D. Defense of Research August 11, 2000 Approximation Algorithms for VLSI Routing.
1 A Novel Metric for Interconnect Architecture Performance Parthasarathi Dasgupta, Andrew B. Kahng, Swamy V. Muddu Dept. of CSE and ECE University of California,
Approximation Algorithms: Bristol Summer School 2008 Seffi Naor Computer Science Dept. Technion Haifa, Israel TexPoint fonts used in EMF. Read the TexPoint.
Max-flow/min-cut theorem Theorem: For each network with one source and one sink, the maximum flow from the source to the destination is equal to the minimal.
Introduction to Routing. The Routing Problem Apply after placement Input: –Netlist –Timing budget for, typically, critical nets –Locations of blocks and.
Area-I/O Flip-Chip Routing for Chip-Package Co-Design Progress Report 方家偉、張耀文、何冠賢 The Electronic Design Automation Laboratory Graduate Institute of Electronics.
1 Using Composite Variable Modeling to Solve Integrated Freight Transportation Planning Problems Sarah Root University of Michigan IOE November 6, 2006.
Escape Routing For Dense Pin Clusters In Integrated Circuits Mustafa Ozdal, Design Automation Conference, 2007 Mustafa Ozdal, IEEE Trans. on CAD, 2009.
LP-Based Algorithms for Capacitated Facility Location Hyung-Chan An EPFL July 29, 2013 Joint work with Mohit Singh and Ola Svensson.
Group Construction for Cabin Crew Comparing Constraint Programming with Branch&Price Presentation at SweConsNet 2005 Jesper Hansen Carmen Systems AB
1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.
Solving Hard Instances of FPGA Routing with a Congestion-Optimal Restrained-Norm Path Search Space Keith So School of Computer Science and Engineering.
Low-Power Gated Bus Synthesis for 3D IC via Rectilinear Shortest-Path Steiner Graph Chung-Kuan Cheng, Peng Du, Andrew B. Kahng, and Shih-Hung Weng UC San.
Archer: A History-Driven Global Routing Algorithm Mustafa Ozdal Intel Corporation Martin D. F. Wong Univ. of Illinois at Urbana-Champaign Mustafa Ozdal.
New Modeling Techniques for the Global Routing Problem Anthony Vannelli Department of Electrical and Computer Engineering University of Waterloo Waterloo,
Thermal-aware Steiner Routing for 3D Stacked ICs M. Pathak and S.K. Lim Georgia Institute of Technology ICCAD 07.
LP-Based Algorithms for Capacitated Facility Location Hyung-Chan An EPFL July 29, 2013 Joint work with Mohit Singh and Ola Svensson.
IO CONNECTION ASSIGNMENT AND RDL ROUTING FOR FLIP-CHIP DESIGNS Jin-Tai Yan, Zhi-Wei Chen 1 ASPDAC.2009.
ARCHER:A HISTORY-DRIVEN GLOBAL ROUTING ALGORITHM Muhammet Mustafa Ozdal, Martin D. F. Wong ICCAD ’ 07.
Fujitsu Labs, January 20, 2003 Non-tree Routing for Reliability and Yield Improvement Ion Mandoiu CSE Department, UC San Diego Joint work with A.B. Kahng.
CSE 589 Part VI. Reading Skiena, Sections 5.5 and 6.8 CLR, chapter 37.
Fast Algorithms for Slew Constrained Minimum Cost Buffering S. Hu*, C. Alpert**, J. Hu*, S. Karandikar**, Z. Li*, W. Shi* and C. Sze** *Dept of ECE, Texas.
Column Generation By Soumitra Pal Under the guidance of Prof. A. G. Ranade.
1 A Min-Cost Flow Based Detailed Router for FPGAs Seokjin Lee *, Yongseok Cheon *, D. F. Wong + * The University of Texas at Austin + University of Illinois.
CS223 Advanced Data Structures and Algorithms 1 Maximum Flow Neil Tang 3/30/2010.
Implicit Hitting Set Problems Richard M. Karp Erick Moreno Centeno DIMACS 20 th Anniversary.
© Yamacraw, Fall 2002 Power Efficient Range Assignment in Ad-hoc Wireless Networks E. Althous (MPI) G. Calinescu (IL-IT) I.I. Mandoiu (UCSD) S. Prasad.
1 Slides by Yong Liu 1, Deep Medhi 2, and Michał Pióro 3 1 Polytechnic University, New York, USA 2 University of Missouri-Kansas City, USA 3 Warsaw University.
Linear Programming Piyush Kumar Welcome to CIS5930.
Ion I. Mandoiu, Vijay V. Vazirani Georgia Tech Joseph L. Ganley Simplex Solutions A New Heuristic for Rectilinear Steiner Trees.
Approximation Algorithms based on linear programming.
Confidential & Proprietary – All Rights Reserved Internal Distribution, October Quality of Service in Multimedia Distribution G. Calinescu (Illinois.
VLSI Physical Design Automation
Solving bucket-based large flow allocation problems
Data Driven Resource Allocation for Distributed Learning
1.3 Modeling with exponentially many constr.
Chapter 6. Large Scale Optimization
Buffered tree construction for timing optimization, slew rate, and reliability control Abstract: With the rapid scaling of IC technology, buffer insertion.
1.3 Modeling with exponentially many constr.
Fast Min-Register Retiming Through Binary Max-Flow
Presentation transcript:

F.F. Dragan (Kent State) A.B. Kahng (UCSD) I. Mandoiu (Georgia Tech/UCLA) S. Muddu (Silicon Graphics) A. Zelikovsky (Georgia State) Provably Good Global Buffering Using an Available Buffer Block Plan

Global Buffering via Buffer Blocks VDSM  buffer / inverter insertion for all global nets –50nm technology  10^6 buffers Buffer Block (BB) methodology –isolate buffer insertion from block implementations –improve routing area resources (RAR) utilization RAR(2k-buffer block) =   RAR(k-buffer block) For high-end designs   1.6 Buffer block planning [Cong+99] [TangW00] –given block placement + nets –find shape and location of BBs Global buffering via BBs –given nets + BB locations and capacities –find buffered routing for each net

Global Buffering via Buffer Blocks

Global Buffering Problem Given: Pin & BB locations, BB capacities list of 2-pin nets, each net has upper-bound on #buffers parity requirement on #buffers [non-negative weight (criticality coefficient)] L/U bounds on wirelength b/w consecutive buffers/pins Find: buffered routing of a maximum [weighted] number of nets subject to the given constraints

Global Buffering Problem Given: Pin & BB locations, BB capacities list of 2-pin nets, each net has upper-bound on #buffers new parity requirement on #buffers new [non-negative weight (criticality coefficient)] new L/U bounds on wirelength b/w consecutive buffers/pins Find: buffered routing of a maximum [weighted] number of nets subject to the given constraints Previous work: 1 buffer per connection, no weights

Outline of Results Provably good algorithm for the Global Buffering Problem –integer node-capacitated multi-commodity flow (MCF) formulation –approximation algorithm for solving fractional relaxation –provably good randomized rounding based on [RaghavanT87] –allows tradeoff between run-time and solution quality Fast heuristic based on ideas from the approximation algorithm superior to simpler greedy approaches almost matches the provably good algorithm for loosely constrained instances

Integer Program Formulation

High-Level Approach Solve fractional relaxation + rounding –first introduced for global routing [RaghavanT87] –fractional relaxation = node-capacitated multi-commodity flow (MCF) can be solved exactly using Linear Programming (LP) techniques exact LP algorithms are not practical for large instances Key idea: approximate solution to the relaxation –we generalize edge-capacitated MCF approximation of [GargK98, F99] –[GargK98] successfully applied to global routing by [Albrecht00]

Approximating the Fractional MCF  -MCF algorithm w(v) = , f = 0 For i = 1 to N do For k = 1, …, #nets do Find a shortest path p  P for net k While w(p) < min{ 1,  (1+2  )^I } do f(p)= f(p) + 1 For every v  p do w(v)  ( 1 +  /c(v) ) * w(v) End For End While End For Output f/N Run time for  -approximation =

Random walk algorithm [RaghavanT87] –probability of routing a net proportional to net’s flow –probability of choosing an arc proportional to fractional flow along arc –run time = O( #inserted buffers ) –To avoid BB overuse, scale-down fractional flow by 1-  before rounding Rounding to an Integer Solution Modifications approximate MCF underestimates optimum  few violations + unused BB capacity for large  resolve capacity violations by greedily deleting paths greedily route remaining nets using unused BB capacity

Implemented Heuristics  -MCF w/ greedy enhancement –solve fractional MCF with  approximation –round fractional solution via random walks –apply greedy deletion/addition to get feasible solution Greedy –sequentially route nets along shortest available paths 1-shot integer MCF –assign weight w=1 to each BB –repeat until total overused capacity does not decrease for each net find shortest path for each BB r increase weight by factor (1 +  usage(r) / cap(r)) – apply greedy deletion/addition to get feasible solution

Experimental Setup Test instances extracted from next-generation SGI microprocessor ~4,000 nets U=4,000  m, L=500-2,000  m 50 buffer blocks BB capacity –400 (fully routable instances) –50 (hard instances, 50-60% routable)

Fully Routable Instance (4212 nets)

Greedy1-Shot

Running Time vs. Solution Quality

~57% Routable Instance (4212 nets) Greedy1-Shot

Conclusions and Ongoing Work Provably good algorithm based on node- capacitated MCF approximation Extensions: –combine global buffering with BB planning combine with compaction

Combining with compaction

Sum-capacity constraints: cap(BB1) + cap(BB2)  const.

Conclusions and Ongoing Work Provably good algorithm based on node- capacitated MCF approximation Extensions: –combine global buffering with BB planning combine with compaction –enforce channel capacity constraints –multi-terminal nets (ASPDAC-01)