Standard-Cell Mapping Revisited

Slides:



Advertisements
Similar presentations
Xing Wei, Wai-Chung Tang, Yu-Liang Wu Department of Computer Science and Engineering The Chinese University of HongKong
Advertisements

Improving Placement under the Constant Delay Model Kolja Sulimma 1, Ingmar Neumann 1, Lukas Van Ginneken 2, Wolfgang Kunz 1 1 EE and IT Department University.
FPGA Technology Mapping Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
1 DAOmap: A Depth-optimal Area Optimization Mapping Algorithm for FPGA Designs Deming Chen, Jacon Cong ICCAD 2004 Presented by: Wei Chen.
Technology Mapping.
Continuous Retiming EECS 290A Sequential Logic Synthesis and Verification.
DAG-Aware AIG Rewriting Alan Mishchenko, Satrajit Chatterjee, Robert Brayton Department of EECS, University of California Berkeley Presented by Rozana.
Reinventing The Wheel: Developing a New Standard-Cell Synthesis Flow Alan Mishchenko University of California, Berkeley.
Electrical and Computer Engineering Archana Rengaraj ABC Logic Synthesis basics ECE 667 Synthesis and Verification of Digital Systems Spring 2011.
Good Programming Practices for Building Less Memory-Intensive EDA Applications Alan Mishchenko University of California, Berkeley.
05/04/06 1 Integrating Logic Synthesis, Tech mapping and Retiming Presented by Atchuthan Perinkulam Based on the above paper by A. Mishchenko et al, UCAL.
Combinational and Sequential Mapping with Priority Cuts Alan Mishchenko Sungmin Cho Satrajit Chatterjee Robert Brayton UC Berkeley.
ABC: A System for Sequential Synthesis and Verification BVSRC Berkeley Verification and Synthesis Research Center Robert Brayton, Niklas Een, Alan Mishchenko,
Technology Mapping. 2 Technology mapping is the phase of logic synthesis when gates are selected from a technology library to implement the circuit. Technology.
Logic synthesis flow Technology independent mapping –Two level or multilevel optimization to optimize a coarse metric related to area/delay Technology.
1 WireMap FPGA Technology Mapping for Improved Routability Stephen Jang, Xilinx Inc. Billy Chan, Xilinx Inc. Kevin Chung, Xilinx Inc. Alan Mishchenko,
A Semi-Canonical Form for Sequential Circuits Alan Mishchenko Niklas Een Robert Brayton UC Berkeley Michael Case Pankaj Chauhan Nikhil Sharma Calypto Design.
Enhancing Model Checking Engines for Multi-Output Problem Solving Alan Mishchenko Robert Brayton Berkeley Verification and Synthesis Research Center Department.
Global Delay Optimization using Structural Choices Alan Mishchenko Robert Brayton UC Berkeley Stephen Jang Xilinx Inc.
Sequential Equivalence Checking for Clock-Gated Circuits Hamid Savoj Robert Brayton Niklas Een Alan Mishchenko Department of EECS University of California,
A Toolbox for Counter-Example Analysis and Optimization
Reducing Structural Bias in Technology Mapping
Synthesis for Verification
Technology Mapping into General Programmable Cells
Power Optimization Toolbox for Logic Synthesis and Mapping
Mapping into LUT Structures
Delay Optimization using SOP Balancing
Enhancing PDR/IC3 with Localization Abstraction
Robert Brayton Alan Mishchenko Niklas Een
Alan Mishchenko Robert Brayton UC Berkeley
Alan Mishchenko Satrajit Chatterjee Robert Brayton UC Berkeley
Logic Synthesis Primer
Magic An Industrial-Strength Logic Optimization, Technology Mapping, and Formal Verification System Alan Mishchenko UC Berkeley.
A Semi-Canonical Form for Sequential AIGs
Applying Logic Synthesis for Speeding Up SAT
Versatile SAT-based Remapping for Standard Cells
SAT-based Methods: Logic Synthesis and Technology Mapping
Integrating an AIG Package, Simulator, and SAT Solver
A Boolean Paradigm in Multi-Valued Logic Synthesis
LUT Structure for Delay: Cluster or Cascade?
SAT-Based Area Recovery in Technology Mapping
Alan Mishchenko University of California, Berkeley
Canonical Computation without Canonical Data Structure
SAT-Based Optimization with Don’t-Cares Revisited
Canonical Computation Without Canonical Data Structure
Scalable and Scalably-Verifiable Sequential Synthesis
Mapping into LUT Structures
Improvements to Combinational Equivalence Checking
Sungho Kang Yonsei University
Integrating Logic Synthesis, Technology Mapping, and Retiming
Alan Mishchenko UC Berkeley
Alan Mishchenko UC Berkeley (With many thanks to Donald Knuth,
Reinventing The Wheel: Developing a New Standard-Cell Synthesis Flow
Alan Mishchenko UC Berkeley (With many thanks to Donald Knuth for
Integrating an AIG Package, Simulator, and SAT Solver
Improvements in FPGA Technology Mapping
Canonical Computation without Canonical Data Structure
Recording Synthesis History for Sequential Verification
Delay Optimization using SOP Balancing
Canonical Computation without Canonical Data Structure
Reinventing The Wheel: Developing a New Standard-Cell Synthesis Flow
Magic An Industrial-Strength Logic Optimization, Technology Mapping, and Formal Verification System Alan Mishchenko UC Berkeley.
A Practical Approach to Arithmetic Circuit Verification
Innovative Sequential Synthesis and Verification
Robert Brayton Alan Mishchenko Niklas Een
SAT-based Methods: Logic Synthesis and Technology Mapping
Fast Min-Register Retiming Through Binary Max-Flow
Robert Brayton Alan Mishchenko Niklas Een
Alan Mishchenko Department of EECS UC Berkeley
Integrating AIG Package, Simulator, and SAT Solver
Presentation transcript:

Standard-Cell Mapping Revisited Alan Mishchenko Robert Brayton (with special thanks to Satrajit Chatterjee and Niklas Een) Department of EECS UC Berkeley

Overview Introduction Technical part Wrap-up Why mapping is a fundamental problem History of technology mapping in ABC Motivation to continue working on mapping Technical part How one new idea led to three new mappers in one year Several other ideas Our most recent work Wrap-up Preliminary experimental results Conclusions and future work 2

Pros and Cons of Load-Independent Delay Model Approximations are inevitable An approximation is “proper” when it allows us to simplify a problem without missing essential points In our experience, load-independent delay model is a “proper” approximation It simplifies mappers, allows them to scale, and leads to next stages when more accurate models are used Gain-based approach, itself an approximation, enables this delay model From now on, we use load-independent model

Delay Optimality Computing best arrival time at each node in a topological order from inputs to outputs ensures that the earliest possible arrival time at the outputs is found

Area Recovery Two complementary heuristics are traditionally used for area recovery Global view heuristic (area flow) Combines area of a cone and fanout count by computing an “average” area per fanout Local view heuristic (exact area) Provides a detailed view of each gate and allows mapping to be locally optimized (These are somewhat similar to global and detailed placement)

Mapper Pseudo-Code Pre-compute functions implementable using the library Currently, we only look at single gates (no “super-gates”) Enumerate cuts for the subject graph In practice, we enumerate all K-feasible cuts but store only those that have matches with the library Iterate over the subject graph Forward passes First pass computes best delay Next few passes minimize area-flow under delay constraints Next few passes minimize exact-area under delay constraints Backward passes First backward pass produces a legal mapping to be incrementally improved All backward passes compute required times Write out the mapped network

New Idea Used Keep not one, but two cuts at each node A delay-oriented cut Guarantees that the node can meet required times An area-oriented cut Allows for area optimization to kick in, if possible How it impacts the implementation Different procedure to assign matches Different way of computing required times Different procedure to produce a legal mapping Consequences The QoR improves The implementation is more complex but not prohibitively so

Impact on Area Recovery Area flow computation remains roughly the same Exact area computation took some time to implement correctly and efficiently The main difficulty was Using exact area in the forward pass is prohibitive in terms of runtime Using exact area in the backward pass requires clever way of propagating required times After several failures, an efficient implementation was found

Several Other Ideas Used Integers instead of floating-point numbers can be used to represent timing information Makes implementation platform-independent Cuts with matches do not have to be recomputed by the mapper in each round We precompute and store them to reduce runtime Area recovery using the exact-area heuristic can be efficiently performed in a reverse topo order Reduces runtime while keeping the same quality

Comparison of ABC Mappers (delay, before synthesis)

Comparison of ABC Mappers (delay, after synthesis)

Comparison of ABC Mappers (area, before and after synthesis)

Conclusions Introduced technology mapping Elaborated on one interesting idea Reviewed current results and future work

Future Work Small ideas Big ideas Skip dominated matches (2x less runtime) Select better alternative cuts during area recovery (area 1-2% better) Big ideas Use load-dependent delay model Combine mapping with buffering and sizing

Abstract Technology mapping is one of the fundamental problems, along with such problems as circuit restructuring and SAT solving. One flavor of technology mapping looks into minimizing both area and delay, or rather area under delay constraints. A new approach to building delay/area-aware mappers for standard cells and FPGAs was recently proposed. The main idea of this approach is to store two cuts at each node (delay-oriented and area-oriented) rather than one cut, as in much of previous work. This presentation surveys our experience developing a standard-cell mapper based on these ideas