NuCAD ELECTRICAL ENGINEERING AND COMPUTER SCIENCE McCormick Northwestern University Robert R. McCormick School of Engineering and Applied Science FA-STAC.

Slides:



Advertisements
Similar presentations
April 2004NUCAD Northwestern University1 Minimal Period Retiming Under Process Variations Jia Wang and Hai Zhou Electrical & Computer Engineering Northwestern.
Advertisements

Gate Sizing for Cell Library Based Designs Shiyan Hu*, Mahesh Ketkar**, Jiang Hu* *Dept of ECE, Texas A&M University **Intel Corporation.
EE 201A Modeling and Optimization for VLSI LayoutJeff Wong and Dan Vasquez EE 201A Noise Modeling Jeff Wong and Dan Vasquez Electrical Engineering Department.
-1- VLSI CAD Laboratory, UC San Diego Post-Routing BEOL Layout Optimization for Improved Time- Dependent Dielectric Breakdown (TDDB) Reliability Tuck-Boon.
EE141 © Digital Integrated Circuits 2nd Wires 1 The Wires Dr. Shiyan Hu Office: EERC 731 Adapted and modified from Digital Integrated Circuits: A Design.
NTHU-CS VLSI/CAD LAB TH EDA De-Shiuan Chiou Da-Cheng Juan Yu-Ting Chen Shih-Chieh Chang Department of CS, National Tsing Hua University, Taiwan Fine-Grained.
Improving Placement under the Constant Delay Model Kolja Sulimma 1, Ingmar Neumann 1, Lukas Van Ginneken 2, Wolfgang Kunz 1 1 EE and IT Department University.
Moon-Su Kim, Sunik Heo, DalHee Lee, DaeJoon Hyun, Byung Su Kim, Bonghyun Lee, Chul Rim, Hyosig Won, Keesup Kim Samsung Electronics Co., Ltd. System LSI.
1 On Convergence of Switching Windows Computation in Presence of Crosstalk Noise Pinhong Chen* +, Yuji Kukimoto +, Chin-Chi Teng +, Kurt Keutzer* *Dept.
Power-Aware Placement
ER UCLA UCLA ICCAD: November 5, 2000 Predictable Routing Ryan Kastner, Elaheh Borzorgzadeh, and Majid Sarrafzadeh ER Group Dept. of Computer Science UCLA.
Statistical Crosstalk Aggressor Alignment Aware Interconnect Delay Calculation Supported by NSF & MARCO GSRC Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego.
Enhanced Dual-Transition Probabilistic Power Estimation with Selective Supergate Analysis Fei Huand Vishwani D. Agrawal Department of ECE, Auburn University,
TH EDA NTHU-CS VLSI/CAD LAB 1 Re-synthesis for Reliability Design Shih-Chieh Chang Department of Computer Science National Tsing Hua University.
Constructing Current-Based Gate Models Based on Existing Timing Library Andrew Kahng, Bao Liu, Xu Xu UC San Diego
Local Unidirectional Bias for Smooth Cutsize-delay Tradeoff in Performance-driven Partitioning Andrew B. Kahng and Xu Xu UCSD CSE and ECE Depts. Work supported.
EE290A 1 Retiming of AND- INVERTER graphs with latches Juliet Holwill 290A Project 10 May 2005.
NTHU-CS VLSI/CAD LAB TH EDA Student : Da-Cheng Juan Advisor : Shih-Chieh Chang Fine-Grained Sleep Transistor Sizing Algorithm for Leakage Power Minimization.
Chung-Kuan Cheng†, Andrew B. Kahng†‡,
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
1 UCSD VLSI CAD Laboratory ISQED-2009 Revisiting the Linear Programming Framework for Leakage Power vs. Performance Optimization Kwangok Jeong, Andrew.
NuCAD ELECTRICAL ENGINEERING AND COMPUTER SCIENCE McCormick Northwestern University Robert R. McCormick School of Engineering and Applied Science Nostra-XTalk.
A Global Minimum Clock Distribution Network Augmentation Algorithm for Guaranteed Clock Skew Yield A. B. Kahng, B. Liu, X. Xu, J. Hu* and G. Venkataraman*
Gate Delay Calculation Considering the Crosstalk Capacitances Soroush Abbaspour and Massoud Pedram University of Southern California Los Angeles CA Asia.
Logic Simulation 2 Outline –Timing Models –Simulation Algorithms Goal –Understand timing models –Understand simulation algorithms Reading –Gate-Level Simulation.
A Probabilistic Method to Determine the Minimum Leakage Vector for Combinational Designs Kanupriya Gulati Nikhil Jayakumar Sunil P. Khatri Department of.
Statistical Gate Delay Calculation with Crosstalk Alignment Consideration Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego
1 A Method for Fast Delay/Area Estimation EE219b Semester Project Mike Sheets May 16, 2000.
Noise and Delay Uncertainty Studies for Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu † and Devendra Vidhani ‡ UCLA Computer Science Department,
Signal Integrity Methodology on 300 MHz SoC using ALF libraries and tools Wolfgang Roethig, Ramakrishna Nibhanupudi, Arun Balakrishnan, Gopal Dandu Steven.
Andrew B. Kahng‡†, Mulong Luo†, Siddhartha Nath†
Dose Map and Placement Co-Optimization for Timing Yield Enhancement and Leakage Power Reduction Kwangok Jeong, Andrew B. Kahng, Chul-Hong Park, Hailong.
Modern VLSI Design 4e: Chapter 4 Copyright  2008 Wayne Wolf Topics n Interconnect design. n Crosstalk. n Power optimization.
Discrete Gate Sizing CENG 5270 – Tutorial 9 WILLIAM CHOW.
Capturing Crosstalk-Induced Waveform for Accurate Static Timing Analysis Masanori Hashimoto, Yuji Yamada, Hidetoshi Onodera Kyoto University.
POWER-DRIVEN MAPPING K-LUT-BASED FPGA CIRCUITS I. Bucur, N. Cupcea, C. Stefanescu, A. Surpateanu Computer Science and Engineering Department, University.
March 20, 2007 ISPD An Effective Clustering Algorithm for Mixed-size Placement Jianhua Li, Laleh Behjat, and Jie Huang Jianhua Li, Laleh Behjat,
An Efficient Clustering Algorithm For Low Power Clock Tree Synthesis Rupesh S. Shelar Enterprise Microprocessor Group Intel Corporation, Hillsboro, OR.
1 Wire Length Prediction-based Technology Mapping and Fanout Optimization Qinghua Liu Malgorzata Marek-Sadowska VLSI Design Automation Lab UC-Santa Barbara.
Optimal digital circuit design Mohammad Sharifkhani.
Modern VLSI Design 3e: Chapter 4 Copyright  1998, 2002 Prentice Hall PTR Topics n Interconnect design. n Crosstalk. n Power optimization.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Fast Algorithms for Slew Constrained Minimum Cost Buffering S. Hu*, C. Alpert**, J. Hu*, S. Karandikar**, Z. Li*, W. Shi* and C. Sze** *Dept of ECE, Texas.
Crosstalk Noise Optimization by Post-Layout Transistor Sizing Masanori Hashimoto Masao Takahashi Hidetoshi Onodera Dept. CCE, Kyoto University.
Eyecharts: Constructive Benchmarking of Gate Sizing Heuristics Puneet Gupta, University of California, Los Angeles Andrew B. Kahng, University of California,
On the Assumption of Normality in Statistical Static Timing Analysis Halim Damerdji, Ali Dasdan, Santanu Kolay February 28, 2005 PrimeTime Synopsys, Inc.
Logic synthesis flow Technology independent mapping –Two level or multilevel optimization to optimize a coarse metric related to area/delay Technology.
Timing Model Reduction for Hierarchical Timing Analysis Shuo Zhou Synopsys November 7, 2006.
Static Timing Analysis
The George Washington University School of Engineering and Applied Science Department of Electrical and Computer Engineering ECE122 – 30 Lab 2: NAND gate.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
-1- Delay Uncertainty and Signal Criticality Driven Routing Channel Optimization for Advanced DRAM Products Samyoung Bang #, Kwangsoo Han ‡, Andrew B.
A Novel Timing-Driven Global Routing Algorithm Considering Coupling Effects for High Performance Circuit Design Jingyu Xu, Xianlong Hong, Tong Jing, Yici.
1 Clarinet: A noise analysis tool for deep submicron design Rafi Levy Gabi Bracha, David Blaauw, Aurobindo Dasgupta, Amir Grinshpon,
Global Delay Optimization using Structural Choices Alan Mishchenko Robert Brayton UC Berkeley Stephen Jang Xilinx Inc.
Reducing Structural Bias in Technology Mapping
Mohammad Gh. Alfailakawi, Imtiaz Ahmad, Suha Hamdan
Power Optimization Toolbox for Logic Synthesis and Mapping
Delay Optimization using SOP Balancing
Andrew B. Kahng and Xu Xu UCSD CSE and ECE Depts.
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Alan Mishchenko University of California, Berkeley
Reinventing The Wheel: Developing a New Standard-Cell Synthesis Flow
On the Improvement of Statistical Timing Analysis
Integrating an AIG Package, Simulator, and SAT Solver
Delay Optimization using SOP Balancing
Reinventing The Wheel: Developing a New Standard-Cell Synthesis Flow
Chirayu S. Amin†, Yehea I. Ismail†, and Florentin Dartu*
Fast Min-Register Retiming Through Binary Max-Flow
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Presentation transcript:

NuCAD ELECTRICAL ENGINEERING AND COMPUTER SCIENCE McCormick Northwestern University Robert R. McCormick School of Engineering and Applied Science FA-STAC : A framework for fast and accurate static timing analysis with coupling Debasish Das Electrical Engineering and Computer Science Northwestern University Evanston, IL International Conference on Computer Design, San Jose, CA October 2 nd, 2006

June 11, 2015(2) Co-authors Ahmed Shebaita, EECS, Northwestern University Hai Zhou, EECS, Northwestern University Yehea Ismail, EECS, Northwestern University Kip Killpack, Strategic CAD Lab, Intel Corporation Industry Support Cell Library Provider

June 11, 2015(3) Outline Previous Research Accurate Coupling Delay Computation Efficient Iteration Mechanism Experimental Setup Conclusions and future work

June 11, 2015(4) Previous Research (Coupling Model) Coupling cap dominates interconnect parasitics Miller coupling factor (MCF): switching dependent  Step transitions : (0,2) Sapatnekar et.al, ICCAD 2000  Ramp Models : (-1,3) Kahng et.al, DAC 2000 Chen et.al, ICCAD 2000  Exponetial Models : (-1.885,3.885) Ghoneima et.al, ISCAS 2005 Coupling Model Issues:  Models not extended to Timing Analysis

June 11, 2015(5) Previous Research (Static Timing) Timing Analysis with x-cap iterative Iterative analysis with continous models: Chen et.al ICCAD 2000 Iterative analysis with discrete models: Sapatnekar et.al ICCAD 2000, Chen et.al ICCAD 2000, Arunachalam et.al DAC 2000 Iterative analysis issues  Circuit/Coupling structure Ignored  No detailed study of convergence

June 11, 2015(6) Salient features  Waveform based accurate coupling model  Efficient iteration scheme (Chaotic Iteration)  Circuit and Coupling structure exploration  Speeding up iteration scheme using structure NuCAD Presents:

June 11, 2015(7) Outline Previous Research Accurate Coupling Delay Computation Efficient Iteration Mechanism Experimental Setup Conclusions and future work

June 11, 2015(8) Circuit Model Rise/Fall-Delay-Window : (rd l,rd h )/(fd l,fd h ) Rise/Fall-Slew-Window : (rs l,rs h )/(fs l,fs h ) Associated nodes with coupling edge : N1 and N2 NAND CC N1 N2 N3 CC N1 NAND I1 I2 Rise Arc Fall Arc Coupling Edge

June 11, 2015(9) Motivational Example Input Delay Rise  I1 : [2,4] I2: [3,5] Input Delay Fall  I1 : [2.5,3.5] I2: [3.5,4.5] Input Slew Rise/Fall  I1: [0.2,0.6] I2 : [0.4,0.8] Average input slew Rise/Fall  I1 : 0.4 I2 : 0.6 Compute initial switching windows: MCF = 1.0 Rise Window : [2.6,5.3] Rise Slew : [0.5,0.7] Rise Window : [3.0,5.8] Rise Slew : [0.6,0.8] MCF = 1.8

June 11, 2015(10) Coupling Factor Computation Associated Nodes with coupling edge  Victim Node (V)  Aggressor Node (A)  Static timing seeks for worst bounds Waveform generation on V and A  Overlap ratio (k) computation Overlap ratio is defined as the ratio of aggressor output waveform that overlap with victim threshold voltage  Choose waveforms to generate worst possible k  Effective coupling cap : (1+/- 2k)xCC

June 11, 2015(11) Waveform selection Aggressor Victim D o a D o a +t a s D o v D o v +0.5t v s D o v +t v s t t K = 1.0 Victim D o a D o a +t a s D o v D o v +0.5t v s D o v +t v s t t Aggressor K = (D o a +t a s -t v s )/t a s

June 11, 2015(12) Waveform selection Aggressor Victim D o a D o a +t a s D o v D o v +0.5t v s D o v +t v s t t Victim D o a D o a +t a s D o v D o v +0.5t v s D o v +t v s t t Aggressor K = (0.5t v s) /t a s K = (D o a +0.5t v s -D o v )/t a s

June 11, 2015(13) Waveform Selection Victim D o a D o a +t a s t t D o v D o v +0.5t v s D o v +t v s K = 0 Aggressor Victim D o a D o a +t a s t t D o v D o v +0.5t v s D o v +t v s Aggressor K = 0

June 11, 2015(14) Accurate Coupling Delay Computation The idea is ! Compute D and t s from Windows To get bounds (best/worst) on K

June 11, 2015(15) Parameter Selection for K computation: Examples

June 11, 2015(16) Outline Previous Research Accurate Coupling Delay Computation Efficient Iteration Mechanism Experimental Setup Conclusions and future work

June 11, 2015(17) Iteration basics Traditional static timing analysis  Topological order of the circuit Static timing analysis with coupling is ITERATIVE Iterative timing analysis converges to FixPoint  Under a given coupling model (Zhou, ICCAD 2003) Node ordering is important How to make Static Timing Analysis efficient ?  Explore circuit structure for node ordering  Decrease iterations

June 11, 2015(18) Clustering Problems in analysis based on topological order  Any update at d  Propagate to e, f, g, h  If update at d not permanent  Calculation wasted Solution: Clustering  Local cluster (B) : Change in e  Changes f  Global cluster (A) : Two interacting local clusters  Timing Analysis  Convergence on clusters Clustering Issues:  With coupling whole circuit can be one global cluster

June 11, 2015(19) How to use Clustering ideas ? Coupling edges are bidirectional on Timing Graph Select coupling edges  Timing Graph Acyclic G1 G2 G3 G4 G5 G6 G7 G8 Such coupling edges are called Feedback Edges  Example : Coupling edge with fan-out relation Carry out iterations based on feedback edges CC1 CC2

June 11, 2015(20) Feedback Edge Identification Local Coupling Edge  Any change on aggressor should be updated to victim  Update does not occur by fan-out Observation:  Choosing CC1 as local coupling edge Force CC2 to become feedback edge  Choosing CC2 as feedback edge Force CC1 related by fan-out Metric to identify local coupling edge  Coupling Weight = Overlap ratio (K) with 1xCC timing windows G1 G2 G3 G4 G5 G6 G7 G8 CC2 Coupling Edges with no fan-out relation (Local Coupling Edges) CC1

June 11, 2015(21) Coupling Partitioning Algorithm Coupling edges are partitioned into:  Feedback edges (Global Coupling Edges)  Local Coupling Edges Algorithm:  Using BFS identify “Easy” Global Edges  Sort remaining coupling edges by coupling weight  Do Identify highest weighted edge (e) as local Find global edges generated by e (g e ) Remove g e from sorted coupling edges  While (no more coupling edges left)

June 11, 2015(22) Coupling Partitioning Algorithm (Illustration) G1 G2 G3 G4 G5 G6 G7 G8 CC2 CC1 k CC1 = 0.6, k CC2 = 0.8 G1 G2 G3 G4 G6 G7 G8 CC2 CC1 G5 Local Coupling Edge= CC2 G1 G2 G3 G4 G6 G7 G8 CC1 G5 Super-Node formation G1 G2 G3 G4 G6 G7 CC1 G5 G8 CC1 identified as Global Edge

June 11, 2015(23) Coupling Structure Aware Iteration Algorithm Initialization  Add topological sorted nodes in queue  Update coupling capacitance with MCF = 1.0  Update windows on each node Modified Chaotic Iterations  While (queue is not empty) u  Pop a node from queue Update coupling capacitance with new MCFs Update timing windows on u If ( | u old – u new | > ε )  Add fan-out nodes of u to queue  Add nodes to queue coupled by local coupling edges

June 11, 2015(24) Outline Previous Research Accurate Coupling Delay Computation Efficient Iteration Mechanism Experimental Setup Conclusions and future work

June 11, 2015(25) Circuit Modeling Experiments done on ISCAS85 benchmarks Circuit modeled as DAG (Timing Graph) Nodes in Timing Graph are Gates Edges represent interconnect Nodes are mapped to ASIC logic gates  Faraday 90 nm experimental tech library used  Delay tables are used : f( output load, input slew ) Coupling graph generation  Extracted coupling capacitance values are used  Coupling graph is superimposed on timing graph  Each net is assumed to couple with 4 aggressors

June 11, 2015(26) Model Accuracy Results CE denotes number of coupling edges RT : Runtime in seconds, TA : Cell Table Lookup (rd l,rd h ) : Rise delay window 012 Model can be non-conservative !

June 11, 2015(27) Performance Enhancement Results CI : Iterative algorithm proposed by Chen et.al Fast-CI : Coupling structure aware algorithm Global : Number of global edges identified P-RT : Coupling partitioning runtime Max = 62.1% Min = 5.7% Average = 26.8%

June 11, 2015(28) Outline Previous Research Accurate Coupling Delay Computation Efficient Iteration Mechanism Experimental Setup Conclusions and future work

June 11, 2015(29) Conclusions and future work We present FA-STAC  Accurate static timing analysis with coupling  Efficient iteration mechanism to converge faster Novel coupling delay model developed Coupling structure exploited for fast iterations Experimental results on ISCAS benchmarks  Our algorithm give average speed-up of 26.8%  Negligible error in timing windows Future directions  Complex coupling model for local coupling edges Submitted to DATE 2007

June 11, 2015(30) THANK YOU Q & A