Chapter 4b Statistical Static Timing Analysis: SSTA Prof. Lei He Electrical Engineering Department University of California, Los Angeles URL: eda.ee.ucla.edu Email: lhe@ee.ucla.edu
Statistic Static Timing Analysis (SSTA) Outline Motivation Statistic Static Timing Analysis (SSTA) Monte Carlo simulation Path-based and block-based SSTA
Variation-aware Timing Analysis How process variation would affect our STA? Min-Max approach would be too risky Corner-based STA is too expensive: 2^n corners To be accurate, analyze timing statistically. But how? Every label (delays) in the DAG is modeled as a R.V. with certain distribution Should use multivariate R.V. analysis Correlation is the Key!
Statistical Static Timing Analysis: SSTA Fairly new (hot) topic Many debates Many new ideas Not quite consistency across different ref. Unfortunately/Fortunately, live with it… In this lecture, cover some typical ones Monte Carlo simulation (Golden case) One path-based approach One block-based approach More for your own entertainment
Statistic Static Timing Analysis (SSTA) Outline Motivation Statistic Static Timing Analysis (SSTA) Monte Carlo simulation Path-based and block-based SSTA
Monte Carlo Simulation Definition: A technique involving the use of random numbers solving physical or mathematical problems Characteristics Physical process is simulated without explicitly knowing equations that describe the system output Only requirement is that the physical system be described by PDF (probabilistic density function)
Randomly sample each R.V. in accordance with its respective PDF Monte Carlo for SSTA Randomly sample each R.V. in accordance with its respective PDF Instantiate a specific DAG Solving STA using the technique we discussed before This is called one Monte Carlo run Run it many times until certain data statistics converge Stopping condition can be fairly sophisticated Finally, extract statistics from Monte Carlo runs PDF of RAT/AT/Slack Yield curve …
Monte Carlo Simulation Pros Conceptually easy Implementation not that difficult Make use of previous STA algorithm Accurate, used as golden case (benchmarking) Cons Computationally expensive No many diagnostic information if something is wrong No incremental computation possible Efficient solution Analytical Statistical static timing analysis (SSTA)
Statistic Static Timing Analysis (SSTA) Outline Motivation Statistic Static Timing Analysis (SSTA) Monte Carlo simulation Path-based and block-based SSTA
SSTA Algorithms Objective Path Based SSTA Block Based SSTA Find probability distribution of circuit delay Path Based SSTA Statistically calculate path delay distributions Find statistical maximum of these path delays Identify potential critical paths Block Based SSTA Traverse DAG to calculate the delay distribution for each node Widely used due to the incremental computation capability
Path-based SSTA [Orshansky DAC-02] Key operations Summation Path delay = sum(node delay) Maximum Critical path delay = max(path delay) Delay model First order approximation Obtained from SPICE simulation
Path-based SSTA: Key Operations Gate delay variance and covariance Path delay variance and covariance
Path-based SSTA: Approximation Maximum operation is approximated Closed form is not known yet Lower and upper bound for path delay mean Let D={D1...Dn } be an arbitrary path delay distribution with correlation Let X={X1...Xn } identical to D but WITHOUT correlation Can prove an upper bound for mean(D): Mean(D) < Mean(X) Similarly an lower bound can be established
Path-based SSTA: Approximation Lower and upper bound for path delay variance Result from theory of Gaussian process: Borell Inequality Variance of max{D1…Dn} around its mean is smaller than variance of a single Di with largest variance What are some of the pros and cons? Pros: Assume linear model of timing wrt variations Efficient for analysis of a single path Cons: May requires analysis of exponential number of paths Path not included in the analysis may become critical Not easily amenable to optimization
Path-based SSTA: Experiment Results Timing approximation is tighter Variation is smaller Mean clock frequency is smaller
Block-based SSTA: [Devgan ICCAD03] AT and gate delays are modeled as R.V. AT as CDFs Gate Delays as PDFs For easy computation Delay distributions can take any form Model CDFs as Piece-Wise Linear functions Model PDFs as constant step functions Cumulative Probability 1.0 A A1 A2 A3 P1 P2 P3
Block-based SSTA: Key Operations Addition AT2 = AT1 D1 AT2 = AT1+D1 1 2 D1 AT1 AT2 ( : convolution, Assuming independence for now) CDF PDF
Block-based SSTA: Key Operations Closed form for addition t2 u1 t1 s1 t1+t2 0.5s1u1(t1+t2-t)2 =
Block-based SSTA: Key Operations Maximum C = max (A, B) CDF of C = CDF of A x CDF of B Assume independence for now Closed form computation via PWL x t2 s2 t1 s1 t3=max(t1,t2) s1s2(t-t1)(t-t2) =
Block-based SSTA: Correlation Correlation due to path re-convergence AT5 and AT6 are correlated due to shared AT4 Exact handling this correlation would cause exponential complexity Utilize the structure of the circuits PI PO AT1 AT2 D1 D2 D3 D4 D6 D7 AT4 AT3 AT5 AT6 D8 D9
Block-based SSTA: Correlation A4 = max(A2+D24, A3+D34) A2 and A3 are related 1 3 2 4 A2 = A1 + D12 and A3 = A1 + D13 A4=max(A1+D12+D24, A1+ D13+D34) =A1+max(D12+D24, D13+D34) This formula works well for this simple case How does this work for general cases?
Block-based SSTA: General Cases General situation An input of a gate can depend on many preceding timing points There may be shared paths in the input cone A B C D 1 2 3 4 z G1 G2 G3 G4
Block-based SSTA: Heuristic Algorithm Create a dependency list Keep track of the re-convergence fanout nodes a particular node depends on Basically a list of pointers Compute the dominant common node For each pair wise max Determine that by statistical dominance and logic level Take out the common part contributed by the dominant node Perform max of the two CDFs An example follows
Block-based SSTA: Example 1 2 3 4 z G1 G2 G3 G4 Dependency list for G4: D 1 2 3 4 z G4 A B C Create the dependency list
Block-based SSTA: Example 1 2 3 4 z G4 A B C Dependency list for G4: Dominant common node D 1 2 3 4 z G4 B C
Block-based SSTA: Example Compute the pair wise max D 1 2 3 4 z G4 B C ATD = max(A1+D1z, A2+ D2z, A3+D3z, A4 + D4z) ATx = AB + max(A1-AB +D1z, A2-AB+ D2z) ATy = Ac + max(A3-Ac +D3z, A4-Ac+ D4z) ATD = max (ATx, ATy)
Block-based SSTA: Experiments Runtime comparison SSTA w/ correlation and w/o correlation Circuit w/ correlation w/o correlation C432 1.5 1.0 C499 1.26 C880 1.22 C1908 1.33 C2670 1.23 C3540 C6288 1.20 C7552 1.30
Block-based SSTA: Experiments Timing distribution SSTA w/ correlation and w/o correlation and Monte Carlo Simulation
Timing analysis is a key part of the design process Summary Timing analysis is a key part of the design process Statistical static timing analysis (SSTA) Arises due to process variation when technology continues to scale More to be done for SSTA Correlations matter Interconnect variability Slew propagation Gate delay models How to guide for optimal design?
References Michael Orshansky and Kurt Keutzer. 2002. A general probabilistic framework for worst case timing analysis. In Proceedings of the 39th annual Design Automation Conference (DAC '02). ACM, New York, NY, USA, 556-561. Anirudh Devgan and Chandramouli Kashyap. 2003. Block-based Static Timing Analysis with Uncertainty. In Proceedings of the 2003 IEEE/ACM international conference on Computer-aided design (ICCAD '03). IEEE Computer Society, Washington, DC, USA