Are Floorplan Representations Important in Digital Design? H. H. Chan, S. N. Adya, I. L. Markov The University of Michigan.

Slides:



Advertisements
Similar presentations
Analysis of Floorplanning Algorithm in EDA Tools
Advertisements

Capo: Robust and Scalable Open-Source Min-cut Floorplacer Jarrod A. Roy, David A. Papa,Saurabh N. Adya, Hayward H. Chan, James F. Lu, Aaron N. Ng, Igor.
Linear Constraint Graph for Floorplan Optimization with Soft Blocks Jia Wang Electrical and Computer Engineering Illinois Institute of Technology Chicago,
Meng-Kai Hsu, Sheng Chou, Tzu-Hen Lin, and Yao-Wen Chang Electronics Engineering, National Taiwan University Routability Driven Analytical Placement for.
A Size Scaling Approach for Mixed-size Placement Kalliopi Tsota, Cheng-Kok Koh, Venkataramanan Balakrishnan School of Electrical and Computer Engineering.
Ripple: An Effective Routability-Driven Placer by Iterative Cell Movement Xu He, Tao Huang, Linfu Xiao, Haitong Tian, Guxin Cui and Evangeline F.Y. Young.
Consistent Placement of Macro-Blocks Using Floorplanning and Standard-Cell Placement Saurabh Adya Igor Markov (University of Michigan)
FastPlace: Efficient Analytical Placement using Cell Shifting, Iterative Local Refinement and a Hybrid Net Model FastPlace: Efficient Analytical Placement.
Placer Suboptimality Evaluation Using Zero-Change Transformations Andrew B. Kahng Sherief Reda VLSI CAD lab UCSD ECE and CSE Departments.
Routability-Driven Blockage-Aware Macro Placement Yi-Fang Chen, Chau-Chin Huang, Chien-Hsiung Chiou, Yao-Wen Chang, Chang-Jen Wang.
Intrinsic Shortest Path Length: A New, Accurate A Priori Wirelength Estimator Andrew B. KahngSherief Reda VLSI CAD Laboratory.
ISQED’2015: D. Seemuth, A. Davoodi, K. Morrow 1 Automatic Die Placement and Flexible I/O Assignment in 2.5D IC Design Daniel P. Seemuth Prof. Azadeh Davoodi.
Constructive Benchmarking for Placement David A. Papa EECS Department University of Michigan Ann Arbor, MI Igor L. Markov EECS.
Faster SAT and Smaller BDDs via Common Function Structure Fadi A. Aloul, Igor L. Markov, Karem A. Sakallah University of Michigan.
Fixed-outline Floorplanning Through Better Local Search
NuCAD ACG - Adjacent Constraint Graph for General Floorplans Hai Zhou and Jia Wang ICCD 2004, San Jose October 11-13, 2004.
An Analytic Placer for Mixed-Size Placement and Timing-Driven Placement Andrew B. Kahng and Qinke Wang UCSD CSE Department {abk, Work.
Placement Feedback: A Concept and Method for Better Min-Cut Placements Andrew B. KahngSherief Reda CSE & ECE Departments University of CA, San Diego La.
On Legalization of Row-Based Placements Andrew B. KahngSherief Reda CSE & ECE Departments University of CA, San Diego La Jolla, CA 92093
UNIVERSITY OF MASSACHUSETTS Dept
1 A Tale of Two Nets: Studies in Wirelength Progression in Physical Design Andrew B. Kahng Sherief Reda CSE Department University of CA, San Diego.
Can Recursive Bisection Alone Produce Routable Placements? Andrew E. Caldwell Andrew B. Kahng Igor L. Markov Supported by Cadence.
Fixed-outline Floorplanner (Parquet) A constraint satisfaction problem “Parquet” based on “floorplan slack” Use better local search to satisfy constraints.
Processing Rate Optimization by Sequential System Floorplanning Jia Wang 1, Ping-Chih Wu 2, and Hai Zhou 1 1 Electrical Engineering & Computer Science.
Placement-Centered Research Directions and New Problems Xiaojian Yang Amir Farrahi Synplicity Inc.
VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 3: Chip Planning © KLMH Lienig 1 Modern Floorplanning Based on B*-Tree and Fast.
Chip Planning 1. Introduction Chip Planning:  Deals with large modules with −known areas −fixed/changeable shapes −(possibly fixed locations for some.
8/15/ VLSI Physical Design Automation Prof. David Pan Office: ACES Lecture 8. Floorplanning (2)
High-Quality, Deterministic Parallel Placement for FPGAs on Commodity Hardware Adrian Ludwin, Vaughn Betz & Ketan Padalia FPGA Seminar Presentation Nov.
CSE 242A Integrated Circuit Layout Automation Lecture: Partitioning Winter 2009 Chung-Kuan Cheng.
SOFT COMPUTING (Optimization Techniques using GA) Dr. N.Uma Maheswari Professor/CSE PSNA CET.
1 Chapter 1 Analysis Basics. 2 Chapter Outline What is analysis? What to count and consider Mathematical background Rates of growth Tournament method.
1 Modern Floorplanning Based on Fast Simulated Annealing Tung-Chieh Chen* and Yao-Wen Chang* # Graduate Institute of Electronics Engineering* Department.
Horizontal Benchmark Extension for Improved Assessment of Physical CAD Research Andrew B. Kahng, Hyein Lee and Jiajia Li UC San Diego VLSI CAD Laboratory.
March 20, 2007 ISPD An Effective Clustering Algorithm for Mixed-size Placement Jianhua Li, Laleh Behjat, and Jie Huang Jianhua Li, Laleh Behjat,
Merge Sort. What Is Sorting? To arrange a collection of items in some specified order. Numerical order Lexicographical order Input: sequence of numbers.
Seeing the Forest and the Trees: Steiner Wirelength Optimization in Placement Jarrod A. Roy, James F. Lu and Igor L. Markov University of Michigan Ann.
An Efficient Clustering Algorithm For Low Power Clock Tree Synthesis Rupesh S. Shelar Enterprise Microprocessor Group Intel Corporation, Hillsboro, OR.
Bus-Driven Floorplanning Hua Xiang*, Xiaoping Tang +, Martin D. F. Wong* * Univ. Of Illinois at Urbana-Champaign + Cadence Design Systems Inc.
Regularity-Constrained Floorplanning for Multi-Core Processors Xi Chen and Jiang Hu (Department of ECE Texas A&M University), Ning Xu (College of CST Wuhan.
Massachusetts Institute of Technology 1 L14 – Physical Design Spring 2007 Ajay Joshi.
Evaluating FERMI features for Data Mining Applications Masters Thesis Presentation Sinduja Muralidharan Advised by: Dr. Gagan Agrawal.
Performance Prediction for Random Write Reductions: A Case Study in Modelling Shared Memory Programs Ruoming Jin Gagan Agrawal Department of Computer and.
Placement. Physical Design Cycle Partitioning Placement/ Floorplanning Placement/ Floorplanning Routing Break the circuit up into smaller segments Place.
Deferred Decision Making Enabled Fixed- Outline Floorplanner Jackey Z. Yan and Chris Chu DAC 2008.
Hyper-heuristics. 2 Outline Hyper-heuristics Hyper-heuristics for strip packing Hyper-heuristics for Stock forecasting Conclusion.
Congestion Estimation and Localization in FPGAs: A Visual Tool for Interconnect Prediction David Yeager Darius Chiu Guy Lemieux The University of British.
A Memetic Algorithm for VLSI Floorplanning Maolin Tang, Member, IEEE, and Xin Yao, Fellow, IEEE IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART.
Solving the Maximum Cardinality Bin Packing Problem with a Weight Annealing-Based Algorithm Kok-Hua Loh University of Maryland Bruce Golden University.
A Stable Fixed-outline Floorplanning Method Song Chen and Takeshi Yoshimura Graduate School of IPS, Waseda University March, 2007.
Chris Chu Iowa State University Yiu-Chung Wong Rio Design Automation
International Workshop on System-Level Interconnection Prediction, Sonoma County, CA March 2001ER UCLA UCLA 1 Wirelength Estimation based on Rent Exponents.
System-level power analysis and estimation September 20, 2006 Chong-Min Kyung.
Routability-driven Floorplanning With Buffer Planning Chiu Wing Sham Evangeline F. Y. Young Department of Computer Science & Engineering The Chinese University.
1 NTUplace: A Partitioning Based Placement Algorithm for Large-Scale Designs Tung-Chieh Chen 1, Tien-Chang Hsu 1, Zhe-Wei Jiang 1, and Yao-Wen Chang 1,2.
ISPD 2001, Sonoma County, April 3rd, Consistent Floorplanning with Super Hierarchical Constraints Yukiko KUBO, Shigetoshi NAKATAKE, and Yoji KAJITANI.
1 Twin Binary Sequences: A Non-Redundant Representation for General Non-Slicing Floorplan Evan Young Department of Computer Science and Engineering The.
Block Packing: From Puzzle-Solving to Chip Design
Design Automation Conference (DAC), June 6 th, Taming the Complexity of Coordinated Place and Route Jin Hu †, Myung-Chul Kim †† and Igor L. Markov.
Interconnect Characteristics of 2.5-D System Integration Scheme Yangdong (Steven) Deng & Wojciech P. Maly
Heuristic Search Planners. 2 USC INFORMATION SCIENCES INSTITUTE Planning as heuristic search Use standard search techniques, e.g. A*, best-first, hill-climbing.
The Early Days of Automatic Floorplan Design
Placement and Routing Algorithms. 2 FPGA Placement & Routing.
Prediction of Interconnect Net-Degree Distribution Based on Rent’s Rule Tao Wan and Malgorzata Chrzanowska- Jeske Department of Electrical and Computer.
Partial Reconfigurable Designs
HeAP: Heterogeneous Analytical Placement for FPGAs
Alternative Representations for Artificial Immune Systems
Revisiting and Bounding the Benefit From 3D Integration
Sheqin Dong, Song Chen, Xianlong Hong EDA Lab., Tsinghua Univ. Beijing
EDA Lab., Tsinghua University
Presentation transcript:

Are Floorplan Representations Important in Digital Design? H. H. Chan, S. N. Adya, I. L. Markov The University of Michigan

Motivation Many FP representations have been proposed since sequence-pair [Murata et.al. ‘96] Emphasize better area optimization results, with appealing properties based on math. results Interconnect is often more important in physical design, but rarely discussed in the literature Does the choice of FP reps matter in interconnect-driven floorplanning?

Outline of the Talk Common Practices in FP Research Background on Representations Evaluation Framework Results and Analysis Conclusions

Current Status Quo Prevailing algorithm: simulated annealing –Can optimize power, performance, etc Most work focuses on FP representations Many FP reps exist, compared in terms of –Size of solution space –Area optimality (capture area-optimal floorplan?) –Asymptotic complexity in “realizing” a floorplan –Complexity of incremental changes in annealing Are these properties relevant in interconnect-driven floorplanning?

No Room For Improvement Left? MCNC benchmark suite is almost always used The suite has only 5 benchmarks, with <50 blks –Area-optimal results on apte, xerox and hp –Extremely close results on ami33 and ami49 Area optimization results are emphasized –Interconnect optimization is often ignored Temperature schedules are rarely reported –Hard to reproduce results

Contexts for Floorplanning Outline-free floorplanning –Minimize a combination of area and HPWL –Many floorplanners are evaluated in this framework Fixed-outline floorplanning –Minimize HPWL subject to a bounding box –More relevant to modern designs Large scale floorplanning –Excellent area-packing results for >500 blks e.g., 10 – 20 copies of ami49 –Hardly interesting w/o interconnect optimization

Outline of the Talk Common Practices in FP Research Background on Representations Evaluation Framework Results and Analysis Conclusions

Families of FP Representations (1) Different FP reps can share the same solution space (“equivalent”) –Sequence pair, TCG, TCG-S –O-tree, B*-tree –Mosaic reps: CBL, Q-sequence, TBS etc Equivalent FP reps produce similar solutions, only differ in runtime We seek to compare solution spaces rather than specific representations

Families of FP Representations (2) We study sequence pair and B*-tree, since They are best studied in the literature –Broad extensions for various constraints e.g. pre-placed blks, rectilinear blks, soft blks –Hierarchical extensions for over 10K blocks (Parquet-in-Capo, MB*-tree) Two “extremes” of representations –Sequence pair: largest solution space –B*-tree: smallest solution space

Sequence Pair (SP) Proposed by Murata et al in 1996 Encodes a floorplan using two permutations O(n 2 ) time to realize a floorplan –O(n lg n), O(n lg lg n) algos exist –O(n 2 ) time algo is easy to implement and fast in practice Size of solution space: n! 2 Some area-optimal solutions Equivalent to TCG, TCG-S

B*-tree Proposed by Chang et al in 2000 Encodes a floorplan by a binary tree and a permutation O(n)-time to realize a floorplan Size of solution space: O(n!2 2n-2 / n 1.5 ) Some area-optimal floorplans All packings are compacted to the bottom Equivalent to O-tree

Sequence Pair vs. B*-tree SP captures low interconnect floorplans, that are not captured by B*-tree SP captures more floorplans than B*-tree –This packing is encoded by SP –Not captured by B*-tree

Outline of the Talk Common Practices in FP Research Background on Representations Evaluation Framework Results and Analysis Conclusions

Evaluation Framework (1) Floorplanner Parquet [Adya et.al ICCD 2001] –Simulated annealer based on sequence pair –Competitive results in Outline-free floorplanning Fixed-outline floorplanning Handling both hard and soft blocks –Embedded in placer Capo 9.0 for mixed-sized placement (standard cells + large macros) [Adya et.al ICCAD 2004]

Evaluation Framework (2) Replace the sequence pair annealer by B*-tree, with minimal changes –Identical temperature schedule –Probabilities of applying moves are identical, except for those specific to B*-trees –Both versions are open source Report averages over 50 independent starts rather than best results Runtime breakdown: Block-packing vs WL eval.

Evaluation Framework (3) Min-cut floorplacement with Capo Use min-cut partitioning whenever possible Otherwise, resort to annealing-based packing –Cluster standard cells into soft blocks –Fixed-outline floorplanning on clustered instances –Min-cut resumes on standard cells Outperforms competitive annealers on large FP instances (> 100 blocks) –Clustered FP instances have up to 300 blks, often <20 Generates thousands of very different FP benchmarks

What to Expect? Wirelength evaluation is very CPU-intesive Count # floating-point ops in floorplan and wirelength evaluation per move (for SP) –Instrument the code by adding counters # ops in HPWL evaluation –Only count arithmetic ops, not assignments # ops in FP evaluation –Worst-case complexity O(n 2 ) is too pessimistic –Runtime of SP eval fits to cn 1.3 t = cn 1.3

Outline of the Talk Common Practices in FP Research Background on Representations Evaluation Framework Results and Analysis Conclusions

Area Optimization ( average performance reported, not best ) B*-tree packs better than SP Similar for fixed-outline FP, B*-tree: higher success rate SP beats B*-tree in evaluation time up to 200 blks, despite their asymptotic complexities --- O(n 2 ) vs. O(n) ami33ami49n100n200n300 SP9.5% 4.8ms 9.1% 7.2ms 9.2% 17.6ms 10.2% 37.9ms 10.7% 64.3ms B*-tree5.6% 7.6ms 5.3% 10.7ms 5.5% 20.3ms 5.9% 39.4ms 6.3% 57.1ms Outline-free FP (deadspace % / time-per-move)

Area + HPWL Optimization Similar performance for SP and B*-tree in area, HPWL, and evaluation time Runtime dominated by HPWL evaluation Same trends for fixed-outline FP and soft blocks ami33ami49n100n200n300 SP14.3% 76e3 29.7ms 14.1% 823e3 48.5ms 11.6% 322e3 93.9ms 13.4% 589e3 219ms 14.1% 707e3 305ms B*-tree15.8% 75e3 31.5ms 16.2% 847e3 50.6ms 11.3% 320e3 99.5ms 12.0% 580e3 219ms 11.9% 700e3 313ms Outline-free FP (deadspace % / HPWL / time-per-move)

Min-Cut Floorplacement A wide range of FP benchmarks –Instances with both hard and soft blocks –Facilitate a rigorous empirical comparison Capo on the IBM-MSwPins benchmarks –Similar performance of SP and B*-tree (<1% diff in wirelength, <2.5% in runtime) Stand-alone Parquet on the generated instances –2000+ fixed-outline instances with both hard and soft blocks –Block counts range from 1 to ~300, often <20 –Similar performance of SP and B*-tree

Wirelength Evaluation Runtime (1) In both outline-free and fixed-outline FP, HPWL evaluation dominates runtime (~80%) Outline-free (TPM area-only / area+wire) ami33ami49n100n200n300 SP4.8ms 29.7ms 7.2ms 48.5ms 17.6ms 93.9ms 37.9ms 219ms 64.3ms 305ms B*-tree7.6ms 31.5ms 10.7ms 50.6ms 20.3ms 99.5ms 39.4ms 219ms 57.1ms 313ms Consistent with our analysis that WL evaluation should dominate runtime.

Wirelength Evaluation Runtime (2) Estimates less accurate in larger benchmarks –Larger netlists may not fit into processor cache FP evaluation is never a runtime bottleneck! ami33ami49n100n200n300 estimated89%88%85%80%76% actual86%87%84%85%82% % time-per-move spent on WL evaluation (analytical estimates versus actual measurements)

Must Improve WL Evaluation Rather than FP Representations! Ideas for speed-ups –Special-case the evaluation of 2-pin nets: no loops, 2 floating-point comparisons only –Remove inessential nets, whose bounding box contains the outline –Conglomerate 2-pin nets between the same blocks into heavy-weight nets These techniques lead to 10% speed-up without loss of quality in Parquet 3 Parquet 4: another 10% speed-up (2x speed-up per move) with improved wirelength (longer temp. schedule) –Better use of cache (float versus double) –Simultaneous computation of min and max (25% less comparisons)

Conclusions We compared the performance of SP and B*-tree in wirelength-driven floorplanning –Surprisingly similar performance! The main bottleneck is interconnect evaluation: >75% runtime in realistic FP instances –Parquet-4: Improved WL eval.  tangible speed-up Asymptotic complexity of FP eval. has little relevance –Realistic FP instances are small (<200 blks) –Worst-case analysis is too pessimistic New FP reps seem irrelevant unless they support incremental wirelength evaluation The status quo in FP literature needs to be changed t = cn 1.3