NuCAD ELECTRICAL ENGINEERING AND COMPUTER SCIENCE McCormick Northwestern University Robert R. McCormick School of Engineering and Applied Science Nostra-XTalk.

Slides:



Advertisements
Similar presentations
April 2004NUCAD Northwestern University1 Minimal Period Retiming Under Process Variations Jia Wang and Hai Zhou Electrical & Computer Engineering Northwestern.
Advertisements

Gate Sizing for Cell Library Based Designs Shiyan Hu*, Mahesh Ketkar**, Jiang Hu* *Dept of ECE, Texas A&M University **Intel Corporation.
Topics Electrical properties of static combinational gates:
EE 201A Modeling and Optimization for VLSI LayoutJeff Wong and Dan Vasquez EE 201A Noise Modeling Jeff Wong and Dan Vasquez Electrical Engineering Department.
EE141 © Digital Integrated Circuits 2nd Wires 1 The Wires Dr. Shiyan Hu Office: EERC 731 Adapted and modified from Digital Integrated Circuits: A Design.
NTHU-CS VLSI/CAD LAB TH EDA De-Shiuan Chiou Da-Cheng Juan Yu-Ting Chen Shih-Chieh Chang Department of CS, National Tsing Hua University, Taiwan Fine-Grained.
ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTES Logic Simulation.
1 Interconnect Layout Optimization by Simultaneous Steiner Tree Construction and Buffer Insertion Presented By Cesare Ferri Takumi Okamoto, Jason Kong.
Moon-Su Kim, Sunik Heo, DalHee Lee, DaeJoon Hyun, Byung Su Kim, Bonghyun Lee, Chul Rim, Hyosig Won, Keesup Kim Samsung Electronics Co., Ltd. System LSI.
A Timed-Automaton Based Method for Accurate Computation of Delay in the Presence of Cross-Talk Serdar Tasiran, Sunil P. Khatri, Sergio Yovine, Robert K.
1 On Convergence of Switching Windows Computation in Presence of Crosstalk Noise Pinhong Chen* +, Yuji Kukimoto +, Chin-Chi Teng +, Kurt Keutzer* *Dept.
Noise Model for Multiple Segmented Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu †, Niranjan A. Pol ‡ and Devendra Vidhani* UCSD CSE and ECE.
Copyright 2001, Agrawal & BushnellDay-1 PM Lecture 4a1 Design for Testability Theory and Practice Lecture 4a: Simulation n What is simulation? n Design.
VLSI/CAD Laboratory Department of Computer Science National Tsing Hua University TH EDA Estimation of Maximum Instantaneous Current for Sequential Circuits.
NuCAD ELECTRICAL ENGINEERING AND COMPUTER SCIENCE McCormick Northwestern University Robert R. McCormick School of Engineering and Applied Science FA-STAC.
Statistical Crosstalk Aggressor Alignment Aware Interconnect Delay Calculation Supported by NSF & MARCO GSRC Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego.
Enhanced Dual-Transition Probabilistic Power Estimation with Selective Supergate Analysis Fei Huand Vishwani D. Agrawal Department of ECE, Auburn University,
TH EDA NTHU-CS VLSI/CAD LAB 1 Re-synthesis for Reliability Design Shih-Chieh Chang Department of Computer Science National Tsing Hua University.
Constructing Current-Based Gate Models Based on Existing Timing Library Andrew Kahng, Bao Liu, Xu Xu UC San Diego
Computing Delay with Coupling Using Timed Automata Serdar Tasiran, Yuji Kukimoto, Robert K. Brayton Department of Electrical Engineering & Computer Sciences.
NTHU-CS VLSI/CAD LAB TH EDA Student : Da-Cheng Juan Advisor : Shih-Chieh Chang Fine-Grained Sleep Transistor Sizing Algorithm for Leakage Power Minimization.
Continuous Retiming EECS 290A Sequential Logic Synthesis and Verification.
Chung-Kuan Cheng†, Andrew B. Kahng†‡,
1 UCSD VLSI CAD Laboratory ISQED-2009 Revisiting the Linear Programming Framework for Leakage Power vs. Performance Optimization Kwangok Jeong, Andrew.
Gate Delay Calculation Considering the Crosstalk Capacitances Soroush Abbaspour and Massoud Pedram University of Southern California Los Angeles CA Asia.
Jieyi Long and Seda Ogrenci Memik Dept. of EECS, Northwestern Univ. Jieyi Long and Seda Ogrenci Memik Dept. of EECS, Northwestern Univ. Automated Design.
A Probabilistic Method to Determine the Minimum Leakage Vector for Combinational Designs Kanupriya Gulati Nikhil Jayakumar Sunil P. Khatri Department of.
Statistical Gate Delay Calculation with Crosstalk Alignment Consideration Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego
1 Enhancing Performance of Iterative Heuristics for VLSI Netlist Partitioning Dr. Sadiq M. Sait Dr. Aiman El-Maleh Mr. Raslan Al Abaji. Computer Engineering.
Noise and Delay Uncertainty Studies for Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu † and Devendra Vidhani ‡ UCLA Computer Science Department,
Signal Integrity Methodology on 300 MHz SoC using ALF libraries and tools Wolfgang Roethig, Ramakrishna Nibhanupudi, Arun Balakrishnan, Gopal Dandu Steven.
Andrew B. Kahng‡†, Mulong Luo†, Siddhartha Nath†
Modern VLSI Design 4e: Chapter 4 Copyright  2008 Wayne Wolf Topics n Interconnect design. n Crosstalk. n Power optimization.
ICCAD 2003 Algorithm for Achieving Minimum Energy Consumption in CMOS Circuits Using Multiple Supply and Threshold Voltages at the Module Level Yuvraj.
MASSOUD PEDRAM UNIVERSITY OF SOUTHERN CALIFORNIA Interconnect Length Estimation in VLSI Designs: A Retrospective.
Discrete Gate Sizing CENG 5270 – Tutorial 9 WILLIAM CHOW.
Capturing Crosstalk-Induced Waveform for Accurate Static Timing Analysis Masanori Hashimoto, Yuji Yamada, Hidetoshi Onodera Kyoto University.
POWER-DRIVEN MAPPING K-LUT-BASED FPGA CIRCUITS I. Bucur, N. Cupcea, C. Stefanescu, A. Surpateanu Computer Science and Engineering Department, University.
March 20, 2007 ISPD An Effective Clustering Algorithm for Mixed-size Placement Jianhua Li, Laleh Behjat, and Jie Huang Jianhua Li, Laleh Behjat,
Chapter 07 Electronic Analysis of CMOS Logic Gates
An Efficient Clustering Algorithm For Low Power Clock Tree Synthesis Rupesh S. Shelar Enterprise Microprocessor Group Intel Corporation, Hillsboro, OR.
1 Wire Length Prediction-based Technology Mapping and Fanout Optimization Qinghua Liu Malgorzata Marek-Sadowska VLSI Design Automation Lab UC-Santa Barbara.
05/04/06 1 Integrating Logic Synthesis, Tech mapping and Retiming Presented by Atchuthan Perinkulam Based on the above paper by A. Mishchenko et al, UCAL.
A Routing Approach to Reduce Glitches in Low Power FPGAs Quang Dinh, Deming Chen, Martin D. F. Wong Department of Electrical and Computer Engineering University.
EEE2243 Digital System Design Chapter 7: Advanced Design Considerations by Muhazam Mustapha, extracted from Intel Training Slides, April 2012.
Modern VLSI Design 3e: Chapter 4 Copyright  1998, 2002 Prentice Hall PTR Topics n Interconnect design. n Crosstalk. n Power optimization.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Combinational and Sequential Mapping with Priority Cuts Alan Mishchenko Sungmin Cho Satrajit Chatterjee Robert Brayton UC Berkeley.
Crosstalk Noise Optimization by Post-Layout Transistor Sizing Masanori Hashimoto Masao Takahashi Hidetoshi Onodera Dept. CCE, Kyoto University.
UC San Diego / VLSI CAD Laboratory Learning-Based Approximation of Interconnect Delay and Slew Modeling in Signoff Timing Tools Andrew B. Kahng, Seokhyeong.
Timing Model Reduction for Hierarchical Timing Analysis Shuo Zhou Synopsys November 7, 2006.
EE415 VLSI Design THE INVERTER [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Static Timing Analysis
-1- Delay Uncertainty and Signal Criticality Driven Routing Channel Optimization for Advanced DRAM Products Samyoung Bang #, Kwangsoo Han ‡, Andrew B.
A Novel Timing-Driven Global Routing Algorithm Considering Coupling Effects for High Performance Circuit Design Jingyu Xu, Xianlong Hong, Tong Jing, Yici.
1 Modeling and Optimization of VLSI Interconnect Lecture 2: Interconnect Delay Modeling Avinoam Kolodny Konstantin Moiseev.
1 Clarinet: A noise analysis tool for deep submicron design Rafi Levy Gabi Bracha, David Blaauw, Aurobindo Dasgupta, Amir Grinshpon,
Prediction of Interconnect Net-Degree Distribution Based on Rent’s Rule Tao Wan and Malgorzata Chrzanowska- Jeske Department of Electrical and Computer.
Worst Case Crosstalk Noise for Nonswitching Victims in High-Speed Buses Jun Chen and Lei He.
VLSI Testing Lecture 5: Logic Simulation
VLSI Testing Lecture 5: Logic Simulation
Vishwani D. Agrawal Department of ECE, Auburn University
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
SIDDAGANGA INSTITUTE OF TECHNOLOGY
Chapter 2 Interconnect Analysis Delay Modeling
Reinventing The Wheel: Developing a New Standard-Cell Synthesis Flow
On the Improvement of Statistical Timing Analysis
Reinventing The Wheel: Developing a New Standard-Cell Synthesis Flow
Fast Min-Register Retiming Through Binary Max-Flow
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Presentation transcript:

NuCAD ELECTRICAL ENGINEERING AND COMPUTER SCIENCE McCormick Northwestern University Robert R. McCormick School of Engineering and Applied Science Nostra-XTalk : A Predictive Framework for Accurate Static Timing Analysis in UDSM VLSI Circuits Debasish Das, Ahmed Shebaita Yehea Ismail, Hai Zhou EECS, Northwestern Kip Killpack Strategic CAD Lab, Intel

June 26, 2015 (2) Outline Motivation and Previous Research Directed Search Mechanism Static Timer Algorithm Experimental Setup Conclusions and future work

June 26, 2015 (3) Coupling Dominates Coupling cap dominates interconnect parasitics Graph shows ratio of coupling cap vs. ground cap of nets  Parasitics extracted from 65 nm logic block  Industrial Microprocessor design from Intel

June 26, 2015 (4) Previous Research (Coupling Model) Accurate computation of MCF needed  To model the effects of crosstalk Analytical models for MCF computation proposed  Step transitions : (0,2) Sapatnekar et.al, ICCAD 2000  Ramp Models : (-1,3) Kahng et.al, DAC 2000 Chen et.al, ICCAD 2000  Exponential Models : (-1.885,3.885) Ghoneima et.al, ISCAS 2005 Accurate models are applied to timing analysis  Extending Ramp model to Timing Analysis Das et. al, ICCD 2006

June 26, 2015 (5) Previous Research (Timing Analysis) Timing Analysis with coupling iterative Iterative analysis with continuous models: Chen et.al ICCAD 2000 Iterative analysis with discrete models: Sapatnekar et.al TCAD 2000, Chen et.al ICCAD 2000, Arunachalam et.al DAC 2000 Circuit and coupling structure explored to speed up iterative analysis: Das et. al ICCD 2006

June 26, 2015 (6) Issues in Previous Research Analytical models derived MCF based on  Output delay windows on victim and aggressor nets  Output slew windows on victim and aggressor nets Correlation between input timing windows and MCF ignored Correlation between input slew windows and MCF ignored Such assumptions may lead to pessimistic MCF computation

June 26, 2015 (7) Motivational Example Rise Delay Window at I1  [2,4] Fall Delay Window at I2  [3.5,4.5] Static timing assumption on slews  Max victim input slew 0.6  Min aggressor input slew 0.8 Consider 2 timing events from [2,4] (arrival time, slew)  T1 = (3.9,0.6)  T2 = (4.0,0.6) G1 G2 I1 I2 O1 O2 [2,4] [3.5,4.5]

June 26, 2015 (8) Motivational Example MCF due to T1 = 2.5 MCF due to T2 = 2.4 Previous approaches will consider MCF as 2.5  Update [2,4] with MCF 2.5 Ideally  Compute Delay push-out on T1 due to MCF 2.5: T1 po  Compute Delay push-out on T2 due to MCF 2.4: T2 po  Maximum bound of output window max(T1 po,T2 po ) G1 G2 I1 I2 O1 O2 [2,4] [3.5,4.5]

June 26, 2015 (9) Salient Features  Directed Search Mechanism Search for input timing event  Results in worst/best delay push-out  Use of input timing windows on victim and aggressor Accurate gate delay model employed  To take into account non-linearity of devices  Iterative static timer Using Directed Search on a victim cluster  Collection of all aggressor nets connected to victim net

June 26, 2015 (10) Outline Motivation and Previous Research Directed Search Approach Static Timer Algorithm Experimental Setup Conclusions and future work

June 26, 2015 (11) Circuit Model Rise/Fall-Delay-Window : (Di l, Di h ) Rise/Fall-Slew-Window : (si l,si h ) Associated nodes with coupling edge : N1 and N2 NAND CC N1 N2 N3 CC N1 NAND I1 I2 Rise Arc Fall Arc Coupling Edge

June 26, 2015 (12) Circuit Model for Directed Search Enumerate arcs on drivers  Both victim and aggressor net N1 arc1 arc2 N2 arc3 arc4 c Input delay (ID a ) =[Di a l, Di a h ] Input slew (ID a )=[si a l, si a h ] Aggressor Input delay (ID v )=[Di v l, Di v h ] Input slew (IS v )=[si v l, si v h ] output delay (OD v ) D 0 v =[Do v l, Do v h ] output slew (OS v ) t v s =[so v l, so v h ] output delay (OD a ), D 0 a =[Do a l, Do a h ] output slew (OD a ), t a s =[so a l, so a h ] arc1 ID v,IS v ID a,IS a arc3 Victim c arc2 ID v,IS v ID a,IS a arc3 c arc1 ID v,IS v ID a,IS a arc3 c arc2 ID v,IS v ID a,IS a arc4 VictimAggressor OD v,OS v OD a,OS a OD v,OS v OD a,OS a OD v,OS v OD a,OS a OD v,OS v OD a,OS a

June 26, 2015 (13) Circuit Model for Directed Search (contd.) Enumerate arcs on drivers  Both victim and aggressor net N1 arc1 arc2 N2 arc3 arc4 Aggressor Victim Apply Directed Search on 4 possibilities  Choose the one that results in worst delay push-out c ID v,IS v ID a,IS a arc3 c ID v,IS v ID a,IS a arc3 c arc1 ID v,IS v ID a,IS a arc3 c arc2 ID v,IS v ID a,IS a arc4 OD v,OS v OD a,OS a OD v,OS v OD a,OS a OD v,OS v arc1arc2

June 26, 2015 (14) Detailed Circuit Model for Directed Search Input delay (ID a ) =[Di a l, Di a h ] Input slew (ID a )=[si a l, si a h ] Input delay (ID v )=[Di v l, Di v h ] Input slew (IS v )=[si v l, si v h ] output delay (OD v ) D 0 v =[Do v l, Do v h ] output slew (OS v ) t v s =[so v l, so v h ] output delay (OD a ), D 0 a =[Do a l, Do a h ] output slew (OD a ), t a s =[so a l, so a h ] Victim Aggressor Coupled CircuitEquivalent Circuit c arc1 ID v,IS v ID a,IS a arc3 OD v,OS v OD a,OS a C v = C g + (Victim MCF)C c arc1 ID v,IS v ID a,IS a arc3 OD v,OS v OD a,OS a C a = C g + (Aggressor MCF)C c

June 26, 2015 (15) Gate Delay Model We use logic gates from Faraday’s 90 nm cell libraries  Figure (a) shows t v s = f1(C v,si v )  Figure (b) shows t a s = f2(C a,si a )  Figure (c) shows t v d = f3(C v, si v )  Figure (d) shows t a d = f4(C a, si a ) Assuming linear dependance is acceptable Output Waveform on victim  W v = G(D i v + t v d, t v s ) Output Waveform on aggressor  W a = G(D i a + t a d, t a s ) si v h si v l cvcv tvdtvd si v h si v l cvcv tvstvs si a h si a l tadtad si a h si a l caca tastas caca (a) (b) (c) (d)

June 26, 2015 (16) Coupling Model (Das et. al, ICCD 2006) Overlap ratio (k) computation  Overlap ratio is defined as the ratio of aggressor output waveform that overlap with victim threshold voltage Use waveforms W v and W a to compute k

June 26, 2015 (17) Coupling Model (Das et. al, ICCD 2006) Use waveforms W v and W a to compute k (a) Victim MCF = 1±2k, Aggressor MCF = (b) Victim MCF = 1±2k, Aggressor MCF = (c) Victim MCF = 1±2k, Aggressor MCF = (d) Victim MCF = 1±2k, Aggressor MCF =

June 26, 2015 (18) Worst Case Delay Computation Linearly span the domain of k Victim capacitance C v = C g + (1+2k)C c  Compute t v d and t v s Aggressor capacitance and slew are given by si v h si v l cvcv tvstvs si v h si v l cvcv tvdtvd c g +(1+2k)c c

June 26, 2015 (19) Worst Case Delay Computation (contd.) Using t a s we obtain aggressor capacitance [c1,c2] t a d is calculated using c1 and c2 caca si a h si a l tastas si a h si a l caca tadtad c 1 c 2 Worst case delay computation produces Construct Feasible set F Choose one element from F which has worst t v d

June 26, 2015 (20) Outline Motivation and Previous Research Directed Search Approach Static Timer Algorithm Experimental Setup Conclusions and future work

June 26, 2015 (21) Practical Application of Directed Search Victim net coupled with more than one aggressors We model our circuit as directed graph G = (V,E)  V : Gates in combinational circuit  E : F U C where F : Fan-out Edges C : Coupling Edges We give the following definition V is the victim node while A i are its aggressors

June 26, 2015 (22) Worst Case Coupling Capacitance as fix- points We define a switching point as an ordered pair of delay and slew Following set gives all possible switching points in a k-cluster Set formed by pairs of switching points (s i,s j ) is a totally ordered set Directed Search between an victim and aggressor is order preserving transformation Fix-point iteration to get worst case caps

June 26, 2015 (23) Iterative Static Timer Algorithm

June 26, 2015 (24) Outline Motivation and Previous Research Directed Search Approach Static Timer Algorithm Experimental Setup Conclusions and future work

June 26, 2015 (25) Circuit Modeling Experiments done on ISCAS85 benchmarks Circuit modeled as DAG (Timing Graph) Nodes in Timing Graph are Gates Edges represent interconnect Nodes are mapped to ASIC logic gates  Faraday 90 nm experimental tech library used  Delay tables are used : f( output load, input slew ) Coupling graph generation  Extracted coupling capacitance values are used  Coupling graph is superimposed on timing graph  Each net is assumed to couple with 6 aggressors

June 26, 2015 (26) Accuracy Enhancement Results IST-(0,1,2) : Iterative static timer MCFs 0,1,2 (Sapatnekar et. al) IST-DS : Proposed Iterative static timer RT : Runtime TA : Cell Table Accesses GA : Accuracy Gain, GT : Gain in Cell Table Accesses

June 26, 2015 (27) Accuracy Enhancement Results Hold time given by IST-(0,1,2) can be non-conservative Accuracy Gain by proposed algorithm  Average : 25.59%  Highest gain C880 : 45.5% Decrease in Cell Delay Table Lookup by proposed algorithm  Average : 40.1%  Maximum decrease c6288 : 64.8% Decrease in cell delay is not reflected in runtime  Search has high complexity  Search should be used judiciously

June 26, 2015 (28) Outline Motivation and Previous Research Directed Search Approach Static Timer Algorithm Experimental Setup Conclusions and future work

June 26, 2015 (29) Conclusions and future work We present Nostra-XTalk  Directed Search Approach for accurate timing analysis  Iterative static timer using directed search Directed Search is time consuming Directed Search should be selectively applied Can be used in a coupling partitioning based timer  As proposed by Das et. al ICCD 2006  Directed Search can be applied on local clusters Future Directions  Devise algorithms Selectively apply Directed Search Accurate as well as efficient analysis