Standard-Cell Mapping Revisited

Presentation transcript:

Standard-Cell Mapping Revisited
Alan Mishchenko, Robert Brayton (with special thanks to Satrajit Chatterjee and Niklas Een)
Department of EECS, UC Berkeley

Overview
- Introduction
  - Why mapping is a fundamental problem
  - History of technology mapping in ABC
  - Motivation to continue working on mapping
- Technical part
  - How one new idea led to three new mappers in one year
  - Several other ideas
  - Our most recent work
- Wrap-up
  - Preliminary experimental results
  - Conclusions and future work

Pros and Cons of Load-Independent Delay Model
- Approximations are inevitable
- An approximation is “proper” when it allows us to simplify a problem without missing essential points
- In our experience, the load-independent delay model is a “proper” approximation
  - It simplifies mappers, allows them to scale, and leads to next stages when more accurate models are used
  - The gain-based approach, itself an approximation, enables this delay model
- From now on, we use the load-independent model

Delay Optimality
- Computing the best arrival time at each node, in topological order from inputs to outputs, ensures that the earliest possible arrival time at the outputs is found
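The topological computation described above can be sketched as follows. This is a minimal illustration, not the ABC implementation: the data layout (a dictionary of candidate matches per node, each a constant gate delay plus a list of cut leaves) and the function name `best_arrival_times` are assumptions made for the example.

```python
from typing import Dict, List, Tuple

# Hypothetical layout: a match is (gate_delay, [leaf nodes of its cut]).
# Nodes without an entry in `matches` are treated as primary inputs.
Match = Tuple[int, List[str]]


def best_arrival_times(topo_order: List[str],
                       matches: Dict[str, List[Match]]) -> Dict[str, int]:
    """Compute the earliest arrival time at every node, inputs to outputs."""
    arrival: Dict[str, int] = {}
    for node in topo_order:
        if node not in matches:
            arrival[node] = 0  # primary input arrives at time 0
            continue
        # Load-independent model: each match has a constant delay, so the
        # best arrival here depends only on the best arrivals of its leaves,
        # which are already final because we visit nodes in topological order.
        arrival[node] = min(
            delay + max(arrival[leaf] for leaf in leaves)
            for delay, leaves in matches[node]
        )
    return arrival


# Tiny example: y can be built from x and c, or directly from a, b, c.
order = ["a", "b", "c", "x", "y"]
matches = {
    "x": [(2, ["a", "b"])],
    "y": [(2, ["x", "c"]), (3, ["a", "b", "c"])],
}
arrival = best_arrival_times(order, matches)
```

In this example the wider match at `y` wins, because its constant delay of 3 beats the two-level path through `x`.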

Area Recovery
- Two complementary heuristics are traditionally used for area recovery
- Global view heuristic (area flow)
  - Combines the area of a cone and its fanout count by computing an “average” area per fanout
- Local view heuristic (exact area)
  - Provides a detailed view of each gate and allows mapping to be locally optimized
- (These are somewhat similar to global and detailed placement)
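As a rough illustration of the global-view heuristic, the sketch below uses the commonly cited area-flow recurrence: the area of the gate plus the area flow of its leaves, divided by the node's fanout count. The slides do not spell out the formula, so the recurrence, the data layout, and the name `area_flow` are assumptions for this example.

```python
from typing import Dict, List, Tuple

# Hypothetical layout: the chosen match at a node is (gate_area, [leaves]).
ChosenMatch = Tuple[float, List[str]]


def area_flow(topo_order: List[str],
              chosen: Dict[str, ChosenMatch],
              fanout: Dict[str, int]) -> Dict[str, float]:
    """Estimate the area 'charged' to each fanout edge of every node."""
    flow: Dict[str, float] = {}
    for node in topo_order:
        if node not in chosen:
            flow[node] = 0.0  # primary inputs cost no area
            continue
        gate_area, leaves = chosen[node]
        cone_area = gate_area + sum(flow[leaf] for leaf in leaves)
        # Share the cone's area equally among the node's fanouts.
        flow[node] = cone_area / max(1, fanout[node])
    return flow


# Node x (area 2) feeds both y and z, so each of them is charged half of it.
order = ["a", "b", "c", "x", "y", "z"]
chosen = {
    "x": (2.0, ["a", "b"]),
    "y": (1.0, ["x", "c"]),
    "z": (1.0, ["x", "b"]),
}
fanout = {"x": 2, "y": 1, "z": 1}
flow = area_flow(order, chosen, fanout)
```

Here the flows at the two outputs add up to 4.0, which matches the true total area of the three gates; in general area flow is only an estimate, which is why a local exact-area pass follows it.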

Mapper Pseudo-Code
- Pre-compute functions implementable using the library
  - Currently, we only look at single gates (no “super-gates”)
- Enumerate cuts for the subject graph
  - In practice, we enumerate all K-feasible cuts but store only those that have matches with the library
- Iterate over the subject graph
  - Forward passes
    - First pass computes best delay
    - Next few passes minimize area-flow under delay constraints
    - Next few passes minimize exact-area under delay constraints
  - Backward passes
    - First backward pass produces a legal mapping to be incrementally improved
    - All backward passes compute required times
- Write out the mapped network
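The cut-enumeration step can be sketched with the usual bottom-up merge for a subject graph of two-input nodes. This is a simplified illustration under stated assumptions: the function name `enumerate_cuts` and the data layout are hypothetical, and the sketch omits both the library-match filter mentioned above and any pruning of dominated cuts.

```python
from typing import Dict, FrozenSet, List, Set, Tuple

Cut = FrozenSet[str]


def enumerate_cuts(topo_order: List[str],
                   fanins: Dict[str, Tuple[str, str]],
                   k: int) -> Dict[str, Set[Cut]]:
    """Enumerate all K-feasible cuts of every node in a 2-input DAG."""
    cuts: Dict[str, Set[Cut]] = {}
    for node in topo_order:
        result: Set[Cut] = {frozenset([node])}  # the trivial cut
        if node in fanins:  # internal node; primary inputs have no fanins
            left, right = fanins[node]
            # Every cut of the node is the union of one cut from each fanin,
            # kept only if it has at most k leaves.
            for c0 in cuts[left]:
                for c1 in cuts[right]:
                    merged = c0 | c1
                    if len(merged) <= k:
                        result.add(merged)
        cuts[node] = result
    return cuts


# x = f(a, b), y = g(x, c)
order = ["a", "b", "c", "x", "y"]
fanins = {"x": ("a", "b"), "y": ("x", "c")}
cuts3 = enumerate_cuts(order, fanins, k=3)
cuts2 = enumerate_cuts(order, fanins, k=2)
```

With `k=3`, node `y` has the cuts `{y}`, `{x, c}`, and `{a, b, c}`; with `k=2` the three-leaf cut is dropped.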

New Idea Used
- Keep not one, but two cuts at each node
  - A delay-oriented cut
    - Guarantees that the node can meet required times
  - An area-oriented cut
    - Allows for area optimization to kick in, if possible
- How it impacts the implementation
  - Different procedure to assign matches
  - Different way of computing required times
  - Different procedure to produce a legal mapping
- Consequences
  - The QoR improves
  - The implementation is more complex but not prohibitively so
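One way to read the two-cut idea is sketched below: each node stores both cuts, and the match-assignment step uses the area-oriented cut whenever it still meets the node's required time, falling back to the delay-oriented cut otherwise. The slides do not give the actual selection procedure, so the rule in `pick_cut`, the `Cut` and `NodeCuts` types, and the fixed per-cut arrival times are all assumptions for illustration.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Cut:
    leaves: Tuple[str, ...]
    arrival: int  # arrival time at the node if this cut is used (integer units)
    area: int     # area estimate of implementing the node with this cut


@dataclass(frozen=True)
class NodeCuts:
    delay_cut: Cut  # best-arrival cut: can always meet a feasible required time
    area_cut: Cut   # cheapest cut: used only when timing allows


def pick_cut(cuts: NodeCuts, required: int) -> Cut:
    """Assumed rule: prefer the area cut if it meets the required time."""
    if cuts.area_cut.arrival <= required:
        return cuts.area_cut
    return cuts.delay_cut


node = NodeCuts(
    delay_cut=Cut(leaves=("a", "b", "c"), arrival=3, area=5),
    area_cut=Cut(leaves=("x", "c"), arrival=4, area=2),
)
relaxed = pick_cut(node, required=4)   # slack available: take the area cut
critical = pick_cut(node, required=3)  # no slack: fall back to the delay cut
```

A real mapper would recompute arrivals as leaf choices change; the fixed `arrival` field here only keeps the example short.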

Impact on Area Recovery
- Area flow computation remains roughly the same
- Exact area computation took some time to implement correctly and efficiently
- The main difficulties were
  - Using exact area in the forward pass is prohibitive in terms of runtime
  - Using exact area in the backward pass requires a clever way of propagating required times
- After several failures, an efficient implementation was found

Several Other Ideas Used
- Integers instead of floating-point numbers can be used to represent timing information
  - Makes the implementation platform-independent
- Cuts with matches do not have to be recomputed by the mapper in each round
  - We precompute and store them to reduce runtime
- Area recovery using the exact-area heuristic can be efficiently performed in a reverse topological order
  - Reduces runtime while keeping the same quality
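The point about integer timing can be illustrated with a small sketch. Floating-point sums depend on the order of evaluation, so timing comparisons can differ between builds, while sums of scaled integers are exact. The scale factor `SCALE` and the helper `to_ticks` are hypothetical; the slides do not say which integer unit is used.

```python
SCALE = 1000  # hypothetical: integer ticks per library delay unit


def to_ticks(delay: float) -> int:
    """Convert a library delay to integer ticks once, when the library is read."""
    return round(delay * SCALE)


# The same three gate delays, accumulated in two different orders.
float_forward = (0.1 + 0.2) + 0.3
float_backward = 0.1 + (0.2 + 0.3)

ticks = [to_ticks(d) for d in (0.1, 0.2, 0.3)]
int_forward = (ticks[0] + ticks[1]) + ticks[2]
int_backward = ticks[0] + (ticks[1] + ticks[2])

# Floating-point results disagree in the last bit; integer results are equal.
floats_agree = float_forward == float_backward
ints_agree = int_forward == int_backward
```

With integer ticks, comparisons such as "arrival is no later than required" give the same answer regardless of summation order.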

Comparison of ABC Mappers (delay, before synthesis)

Comparison of ABC Mappers (delay, after synthesis)

Comparison of ABC Mappers (area, before and after synthesis)

Conclusions
- Introduced technology mapping
- Elaborated on one interesting idea
- Reviewed current results and future work

Future Work
- Small ideas
  - Skip dominated matches (2x less runtime)
  - Select better alternative cuts during area recovery (area 1-2% better)
- Big ideas
  - Use load-dependent delay model
  - Combine mapping with buffering and sizing

Abstract
Technology mapping is one of the fundamental problems, along with such problems as circuit restructuring and SAT solving. One flavor of technology mapping looks into minimizing both area and delay, or rather area under delay constraints. A new approach to building delay/area-aware mappers for standard cells and FPGAs was recently proposed. The main idea of this approach is to store two cuts at each node (delay-oriented and area-oriented) rather than one cut, as in much of previous work. This presentation surveys our experience developing a standard-cell mapper based on these ideas.