Chapter 2 Interconnect Analysis Delay Modeling

Slides:



Advertisements
Similar presentations
Topics Electrical properties of static combinational gates:
Advertisements

EE 201A Modeling and Optimization for VLSI LayoutJeff Wong and Dan Vasquez EE 201A Noise Modeling Jeff Wong and Dan Vasquez Electrical Engineering Department.
Noise Model for Multiple Segmented Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu †, Niranjan A. Pol ‡ and Devendra Vidhani* UCSD CSE and ECE.
A Look at Chapter 4: Circuit Characterization and Performance Estimation Knowing the source of delays in CMOS gates and being able to estimate them efficiently.
Chapter 2: Static Timing Analysis
Chapter 7 Reading on Moment Calculation. Time Moments of Impulse Response h(t) Definition of moments i-th moment Note that m 1 = Elmore delay when h(t)
Constructing Current-Based Gate Models Based on Existing Timing Library Andrew Kahng, Bao Liu, Xu Xu UC San Diego
04/09/02EECS 3121 Lecture 25: Interconnect Modeling EECS 312 Reading: 8.3 (text), 4.3.2, (2 nd edition)
ECE53A RLC Circuits W. Ku 11/29/2007
EE 201A (Starting 2005, called EE 201B) Modeling and Optimization for VLSI Layout Instructor: Lei He
Gate Delay Calculation Considering the Crosstalk Capacitances Soroush Abbaspour and Massoud Pedram University of Southern California Los Angeles CA Asia.
Statistical Gate Delay Calculation with Crosstalk Alignment Consideration Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego
EE4800 CMOS Digital IC Design & Analysis
Noise and Delay Uncertainty Studies for Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu † and Devendra Vidhani ‡ UCLA Computer Science Department,
Network Functions Definition, examples , and general property
CSE245: Computer-Aided Circuit Simulation and Verification Lecture Note 2: State Equations Prof. Chung-Kuan Cheng.
Capturing Crosstalk-Induced Waveform for Accurate Static Timing Analysis Masanori Hashimoto, Yuji Yamada, Hidetoshi Onodera Kyoto University.
Chapter 4 Interconnect Analysis. Organization 4.1 Linear System 4.2 Elmore Delay 4.3 Moment Matching and Model Order Reduction –AWE –PRIMA 4.4 Recent.
Chapter 07 Electronic Analysis of CMOS Logic Gates
Fundamentals of Electric Circuits Chapter 16 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Modern VLSI Design 4e: Chapter 3 Copyright  2008 Wayne Wolf Topics n Wire delay. n Buffer insertion. n Crosstalk. n Inductive interconnect. n Switch logic.
Chapter 2 Interconnect Analysis Prof. Lei He Electrical Engineering Department University of California, Los Angeles URL: eda.ee.ucla.edu
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Wire delay. n Buffer insertion. n Crosstalk. n Inductive interconnect.
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
1 Modeling and Optimization of VLSI Interconnect Lecture 2: Interconnect Delay Modeling Avinoam Kolodny Konstantin Moiseev.
הטכניון - מ.ט.ל. הפקולטה להנדסת חשמל - אביב תשס"ה
הטכניון - מ.ט.ל. הפקולטה להנדסת חשמל - אביב תשס"ה
ELECTRIC CIRCUITS EIGHTH EDITION
ELECTRIC CIRCUITS EIGHTH EDITION JAMES W. NILSSON & SUSAN A. RIEDEL.
COE 360 Principles of VLSI Design Delay. 2 Definitions.
Worst Case Crosstalk Noise for Nonswitching Victims in High-Speed Buses Jun Chen and Lei He.
MECH 373 Instrumentation and Measurements
Static Timing Analysis and Gate Sizing Optimization
DEPT.:-ELECTRONICS AND COMMUNICATION SUB: - CIRCUIT & NETWORK
VLSI Testing Lecture 5: Logic Simulation
MECH 373 Instrumentation and Measurements
Vishwani D. Agrawal Department of ECE, Auburn University
Topics Driving long wires..
Crosstalk Overview and Modes.
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Chapter 2 Interconnect Analysis
Chapter 2 Interconnect Analysis
Fundamentals of Electric Circuits Chapter 16
Chapter 2 Interconnect Analysis Delay Modeling
EE 201C Homework 2 (due Feb 3 ) Wei Wu
Wei Yao Homework 3 Wei Yao
Static Timing Analysis and Gate Sizing Optimization
Chapter4 Bandpass Signalling Bandpass Filtering and Linear Distortion
DC & Transient Response
CSE245: Computer-Aided Circuit Simulation and Verification
Chapter 3b Static Noise Analysis
Crosstalk Overview and Modes.
Simon Lineykin and Sam Ben-Yaakov*
Homework 2 [due Jan. 31] [1] Given the circuit as shown below and a unit step voltage source at the input node s, use SPICE to simulate the circuit and.
EE 201C Homework 2 Fang Gong
Wire Indctance Consequences of on-chip inductance include:
Homework 1: Electrical System
Wei Yao Homework 4 Wei Yao
EE 201C Homework 2 (due Feb 3 ) Wei Wu
Lecture 5: DC & Transient Response
Homework 3 Given the circuit as shown below, and assume the RC tree is driven by a unit size inverter (BSIM3 model and default parameters), compute the.
Introduction to CMOS VLSI Design Lecture 4: DC & Transient Response
Chapter4 Bandpass Signalling Bandpass Filtering and Linear Distortion
Crosstalk Overview and Modes.
Mathematical Models of Control Systems
Power Estimation Dr. Elwin Chandra Monie.
CHAPTER 4 Laplace Transform. EMT Signal Analysis.
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Fang Gong Homework 3 Fang Gong
Presentation transcript:

Chapter 2 Interconnect Analysis Delay Modeling Prof. Lei He Electrical Engineering Department University of California, Los Angeles URL: eda.ee.ucla.edu Email: lhe@ee.ucla.edu

Outline Delay models RC tree Elmore delay Gate delay Homework 2 ECE902 VLSI Interconnect Outline Delay models RC tree Elmore delay Gate delay Homework 2 Prepared by Lei He

Input-to-Output Propagation Delay The circuit delay in VLSI circuits consists of two components: The 50% propagation delay of the driving gates (known as the gate delay) The delay of electrical signals through the wires (known as the interconnect delay)

Lumped vs Distributed Interconnect Model ECE902 VLSI Interconnect Lumped vs Distributed Interconnect Model Lumped Distributed R C r c How to analyze the delay for each model? Prepared by Lei He

Lumped RC Model R v(t) u(t) C Impulse response and step response of a lumped RC circuit

Analysis of Lumped RC Model ECE902 VLSI Interconnect R C S-domain ckt equation (current equation) Frequency domain response for step-input Frequency domain response for impulse match initial state: v0 v0(1-eRC/T) Time domain response for step-input: Time domain response for impulse: Prepared by Lei He

50% Delay for lumped RC model V(t) 1v 1 0.5 50% delay How about more complex circuits?

Distributed RC-Tree The network has a single input node ECE902 VLSI Interconnect Distributed RC-Tree R1 C1 s R2 C2 R4 C4 C3 R3 Ci Ri 1 2 3 4 i The network has a single input node All capacitors between node and ground The network does not contain any resistive loop Prepared by Lei He

ECE902 VLSI Interconnect RC-tree Property R1 C1 s R2 C2 R4 C4 C3 R3 Ci Ri 1 2 3 4 i Unique resistive path between the source node s and any other node i of the network  path resistance Rii Example: R44=R1+R3+R4 Prepared by Lei He

RC-tree Property Extended to shared path resistance Rik: ECE902 VLSI Interconnect RC-tree Property R1 C1 s R2 C2 R4 C4 C3 R3 Ci Ri 1 2 3 4 i Extended to shared path resistance Rik: Example: Ri4=R1+R3 Ri2=R1 Prepared by Lei He

The Elmore delay at node i is: ECE902 VLSI Interconnect Elmore Delay Assuming: Each node is initially discharged to ground A step input is applied at time t=0 at node s The Elmore delay at node i is: Theorem: The Elmore delay is equivalent to the first-order time constant of the network Proven acceptable approximation of the real delay Powerful mechanism for a quick estimate Prepared by Lei He

Example Elmore delay at node i is 2 R2 R1 C2 4 1 s R4 R3 C4 C1 3 Ri C3 ECE902 VLSI Interconnect Example R1 C1 s R2 C2 R4 C4 C3 R3 Ci Ri 1 2 3 4 i Elmore delay at node i is Prepared by Lei He

Interpretation of Elmore Delay ECE902 VLSI Interconnect Interpretation of Elmore Delay median of h(t) (T50%) h(t) = impulse response H(t) = step response Definition h(t) = impulse response TD = mean of h(t) = Interpretation H(t) = output response (step process) h(t) = rate of change of H(t) T50%= median of h(t) Elmore delay approximates the median of h(t) by the mean of h(t) Prepared by Lei He

Elmore Delay Approximation

RC-chain (or ladder) Special case: ECE902 VLSI Interconnect RC-chain (or ladder) Special case: Shared-path resistance path resistance R1 C1 R2 C2 RN CN Vin VN Prepared by Lei He

RC-Chain Delay Delay of wire is quadratic function of its length ECE902 VLSI Interconnect RC-Chain Delay R C R C R C Vin VN R=r · L/N C=c·L/N Delay of wire is quadratic function of its length Delay of distributed rc-line is half of lumped RC Prepared by Lei He

Outline Delay models RC tree Elmore delay Gate delay Noise models ECE902 VLSI Interconnect Outline Delay models RC tree Elmore delay Gate delay Noise models Prepared by Lei He

Gate Delay and Output Transition Time The gate delay and the output transition time are functions of both input slew and the output load

General Model of a Gate

Definitions Output Transition Time Gate Delay Vin Vout Time 90% 10% W p n C M out Cdiff Cload Vin Vout Cout

Output Response for Different Loads

Output Transition Time Input Transition Time (s) Output Transition time (s) 10-10 CLoad (F) 10-14 Output transition time as a function of input transition time and output load

ASIC Cell Delay Model Three approaches for gate propagation delay computation are based on: Delay look-up tables K-factor approximation Effective capacitance Delay look-up table is currently in wide use especially in the ASIC design flow Effective capacitance promises to be more accurate when the load is not purely capacitive

Table Look-Up Method 115pS What is the delay when Cload is 505f F and Tin is 90pS?

K-factor Approximation Input Transition Time (s) Output Transition time (s) 10-10 CLoad (F) 10-14 We can fit the output transition time v.s. input transition time and output load as a polynomial function, e.g. A similar equation gives the gate delay

One Dimensional Table Linear model

Two Dimensional Table D1 D4 D3 D2 Quadratic model

Second-order RC-p Model Using Taylor Expansion around s = 0

Second-order RC-p Model (Cont’d) This equation requires creation of a four-dimensional table to achieve high accuracy This is however costly in terms of memory space and computational requirements

Effective Capacitance Approach The “Effective Capacitance” approach attempts to find a single capacitance value that can be replaced instead of the RC-p load such that both circuits behave similarly during transition

Output Response for Effective Capacitance

Effective Capacitance (Cont’d)

Effective Capacitance (Cont’d) 0<k<1 Because of the shielding effect of the interconnect resistance , the driver will only “see” a portion of the far-end capacitance C2 Rp 0 k = 1 Rp ∞ k = 0

Effective Capacitance for Different Resistive Shielding

Macy’s Approach Assumption: If two circuits have the same loads and output transition times, then their effective capacitances are the same => the effective capacitance is only a function of the output transition time and the load

Macy’s Iterative Solution Compute a from C1 and C2 Choose an initial value for Ceff Compute Tout for the given Ceff and Tin Compute b Compute g from a and b Find new Ceff Go to step 3 until Ceff converges

Summary Delay model Elmore delay Gate delay: look-up table, k-factor approximation, effective capacitance 37

References R. Macys and S. McCormick, “A New Algorithm for Computing the “Effective Capacitance” in Deep Sub-micron Circuits”, Custom Integrated Circuits Conference 1998, pp. 313-316 J. Cong, Z. Pan and P. V. Srinivas, "Improved Crosstalk Modeling for Noise Constrained Interconnect Optimization", Asia and South Pacific Design Automation Conference 2001, pp. 373-378 L. H. Chen, M. M.-Sadowska, “Aggressor Alignment for Worst-case Coupling Noise”, International Symposium on Physical Design 2000, pp. 48-54 38

Homework [1] Given the circuit as shown below and a unit step voltage source at the input node s, use SPICE to simulate the circuit and obtain the accurate 50% delay at node n. Also analytically calculate the delay using Elmore method and S2P method. How do they compare with the result obtained by SPICE? R1 C1 s R2 C2 R4 C4 C3 R3 C5 R5 n R1 = 1mΩ R2 = 2mΩ R3 = 2mΩ R4 = 1mΩ R5 = 4mΩ C1 = 1nF C2 = 1nF C3 = 4nF C4 = 4nF C5 = 2nF 1v

Homework [2] Give the circuit as shown below and a unit step voltage source at node s, can we still use the “shared-path” formula to calculate the Elmore delay? Explain why or why not. Use DC analysis method via MATLAB or SPICE to get the 0th -3rd moments of C3 and C5. R1 C1 s R2 C2 R4 C4 C3 R3 C5 C6 n R1 = 1mΩ R2 = 2mΩ R3 = 2mΩ R4 = 1mΩ C1 = 1nF C2 = 1nF C3 = 4nF C4 = 4nF C5 = 2nF C6 = 1nF 1v 40

3. Use the Elmore delay formula to calculate the Elmore delay. Steps for Problem 1 1. Write the SPICE netlist of the circuit and probe the voltage response at node n. 2. Record the time when the voltage at node n reaches 0.5V. That time is the 50% delay. 3. Use the Elmore delay formula to calculate the Elmore delay. (find the shared path between each node and node n). 4. Write down the transfer function and driving point admittance of the circuit with input s and output n. 5. Expand the transfer function to get the moments m1* and m2*. Expand the driving point admittance to get m1, m2, m3, and m4. 41

6. Follow the S2P algorithm to get k1, k2, p1 and p2. Steps for Problem 1 6. Follow the S2P algorithm to get k1, k2, p1 and p2. 7. Use the frequency domain expression (h(s)) to derive the time domain expression (h(t)). 8. Plot the obtained time domain waveform to get the 50% delay for the S2P model. 9. Compare the results. 42

3. Get the voltage across the capacitance as the moment. Steps for Problem 2 1. Follow the DC analysis method to reconstruct the circuit (e.g. replace C with zero current source for 0th moment calculation, etc). 2. Stamp the G and C matrices for MATLAB analysis or write the corresponding netlist for SPICE analysis. 3. Get the voltage across the capacitance as the moment. 4. The above should be done repeatedly until all the desired moments are acquired. 43

Homework 2 [3] Modify the PRIMA code with single frequency expansion to multiple points expansion. You should use a vector fspan to pass the frequency expansion points. Compare the waveforms of the reduced model between the following two cases: 1. Single point expansion at s=1e4. 2. Four-point expansion at s=1e3, 1e5, 1e7, 1e9. 44

Format of the input matrices for test 1 3 -0.000542882 0.0 1 4 0.000152585 -7.5288e-15 1 5 0.000464074 -2.9702e-15 1 6 -0.000542801 0.0 1 68 -19.4595 0.0 2 1 0.0 -2.9702e-15 2 2 3.66672 2.44291e-13 2 3 0.0 -2.3594e-13 2 4 0.0 -5.3806e-15 2 72 -1.425 0.0 2 329 -2.06075 0.0 2 341 -0.091255 0.0 2 343 -0.0897199 0.0 3 1 -2.44188e-06 0.0 3 2 -0.000464141 -2.3594e-13 3 3 40.8898 2.42089e-13 ….. 45

G,C,B,U,L matrices have been generated. Prima begins: Elapsed time is 2.003787 seconds. Prima done! Calculate original time domain response: Elapsed time is 2.868078 seconds. Original time domain response done! Calculate reduced time domain response: Elapsed time is 0.553771 seconds. Reduced time domain response done! Calculate original frequency response: Elapsed time is 1.192908 seconds. Original frequency response done! Calculate reduced frequency response: Elapsed time is 0.359126 seconds. Reduced frequency response done! Calculate original impulse response: Elapsed time is 0.052804 seconds. Original impulse response done! Calculate reduced impulse response: Elapsed time is 0.052701 seconds. Reduced impulse response done!