Download presentation
Presentation is loading. Please wait.
Published byBridget Holmes Modified over 9 years ago
1
Power Network Distribution Chung-Kuan Cheng CSE Dept. University of California, San Diego 3/4/2010
2
Page 2 Agenda Background: power distribution networks (PDN’s) Analysis: worst-case PDN noise prediction –Motivation –Problem formulation –Proposed Algorithm –Case study Simulation: adaptive parallel flow using discrete Fourier transform (DFT) –Motivation –Adaptive parallel flow description –Experimental results Conclusions and future work
3
Page 3 Research on Power Distribution Networks Analysis –Stimulus, Noise Margin, Simulation Synthesis –VRM, Decap, ESR, Topology Integration –Sensors, Prediction, Stability, Robustness
4
Page 4 Publication List Power Distribution Network Simulation and Analysis [1] W. Zhang and C.K. Cheng, "Incremental Power Impedance Optimization Using Vector Fitting Modeling,“ IEEE Int. Symp. on Circuits and Systems, pp. 2439-2442, 2007. [2] W. Zhang, W. Yu, L. Zhang, R. Shi, H. Peng, Z. Zhu, L. Chua-Eoan, R. Murgai, T. Shibuya, N. Ito, and C.K. Cheng, "Efficient Power Network Analysis Considering Multi-Domain Clock Gating,“ IEEE Trans on CAD, pp. 1348-1358, Sept. 2009. [3] W.P. Zhang, L. Zhang, R. Shi, H. Peng, Z. Zhu, L. Chua-Eoan, R. Murgai, T. Shibuya, N. Ito, and C.K. Cheng, "Fast Power Network Analysis with Multiple Clock Domains,“ IEEE Int. Conf. on Computer Design, pp. 456-463, 2007. [4] W.P. Zhang, Y. Zhu, W. Yu, R. Shi, H. Peng, L. Chua-Eoan, R. Murgai, T. Shibuya, N. Ito, and C.K. Cheng, "Finding the Worst Case of Voltage Violation in Multi-Domain Clock Gated Power Network with an Optimization Method“ IEEE DATE, pp. 540-547, 2008. [5] X. Hu, W. Zhao, P. Du, A.Shayan, C.K.Cheng, “An Adaptive Parallel Flow for Power Distribution Network Simulation Using Discrete Fourier Transform,” accepted by IEEE/ACM Asia and South Pacific Design Automation Conference (ASP-DAC), 2010.
5
Page 5 Publication List Power Distribution Network Analysis and Synthesis [6] W. Zhang, Y. Zhu, W. Yu, A. Shayan, R. Wang, Z. Zhu, C.K. Cheng, "Noise Minimization During Power-Up Stage for a Multi-Domain Power Network,“ IEEE Asia and South Pacific Design Automation Conf., pp. 391-396, 2009. [7] W. Zhang, L. Zhang, A. Shayan, W. Yu, X. Hu, Z. Zhu, E. Engin, and C.K. Cheng, "On-Chip Power Network Optimization with Decoupling Capacitors and Controlled-ESRs,“ to appear at Asia and South Pacific Design Automation Conference, 2010. [8] X. Hu, W. Zhao, Y.Zhang, A.Shayan, C. Pan, A. E.Engin, and C.K. Cheng, “On the Bound of Time-Domain Power Supply Noise Based on Frequency-Domain Target Impedance,” in System Level Interconnect Prediction Workshop (SLIP), July 2009. [9] A. Shayan, X. Hu, H. Peng, W. Zhang, and C.K. Cheng, “Parallel Flow to Analyze the Impact of the Voltage Regulator Model in Nanoscale Power Distribution Network,” in 10 th International Symposium on Quality Electronic Design (ISQED), Mar. 2009.
6
Page 6 Publication List (Cont’) 3D Power Distribution Networks [10] A. Shayan, X. Hu, “Power Distribution Design for 3D Integration”, Jacob School of Engineering Research Expo, 2009 [Best Poster Award] [11] A. Shayan, X. Hu, M.l Popovich, A.E. Engin, C.K. Cheng, “Reliable 3D Stacked Power Distribution Considering Substrate Coupling”, in International Conference on Computre Design (ICCD), 2009. [12] A. Shayan, X. Hu, C.K. Cheng, “Reliability Aware Through Silicon Via Planning for Nanoscale 3D Stacked ICs,” in Design, Automation & Test in Europe Conference (DATE), 2009. [13] A. Shayan, X.g Hu, H. Peng, W. Zhang, C.K. Cheng, M. Popovich, and X. Chen, “3D Power Distribution Network Co-design for Nanoscale Stacked Silicon IC,” in 17 th Conference on Electrical Performance of Electronic Packaging (EPEP), Oct. 2008. [5] [14] W. Zhang, W. Yu, X. Hu, A.i Shayan, E. Engin, C.K. Cheng, "Predicting the Worst-Case Voltage Violation in a 3D Power Network", Proceeding of IEEE/ACM International Workshop on System Level Interconnect Prediction (SLIP), 2009.
7
Page 7 Agenda Background: power distribution networks (PDN’s) Analysis: worst-case PDN noise prediction –Motivation –Problem formulation –Proposed Algorithm –Case study Simulation: adaptive parallel flow using discrete Fourier transform (DFT) –Motivation –Adaptive parallel flow description –Experimental results Conclusions and future work
8
Page 8 What is a power distribution network (PDN) Power supply noise –Resistive IR drop –Inductive Ldi/dt noise [Popovich et al. 2008]
9
Page 9 PDN Roadmap V dd of high-performance microprocessors Currents of high-performance microprocessors [ITRS 2007]
10
Page 10 PDN Roadmap Target impedance [ITRS 2007]
11
Page 11 Agenda Background: power distribution networks (PDN’s) Analysis: worst-case PDN noise prediction –Motivation –Problem formulation –Proposed Algorithm –Case study Simulation: adaptive parallel flow using discrete Fourier transform (DFT) –Motivation –Adaptive parallel flow description –Experimental results Conclusions and future work
12
Page 12 Analysis Target Impedance vs. Worst Cases Noise vs. Rise Time of Stimulus Rogue Wave of Multiple Staged Network
13
Page 13 PDN Design Methodology: Target Impedance PDN design –Objective: low power supply noise –Popular methodology: “target impedance” [Smith ’99] Implication: if the target impedance is small, then the noise will also be small
14
Page 14 Worst-Case PDN Noise Prediction: Motivation Problems with “target impedance” design methodology –How to set the target impedance? Small target impedance may not lead to small noise –A PDN with smaller Z max may have larger noise Time-domain design methodology: worst-case PDN noise –If the worst-case noise is smaller than the requirement, then the PDN design is safe. Straightforward and guaranteed –How to generate the worst-case PDN noise FT : Fourier transform
15
Page 15 Worst-Case PDN Noise Prediction: Related Work At final design stages [Evmorfopoulos ’06] –Circuit design is fully or almost complete –Realistic current waveforms can be obtained by simulation –Problem: countless input patterns lead to countless current waveforms Sample the excitation space Statistically project the sample’s own worst-case excitations to their expected position in the excitation space At early design stages [Najm ’03 ’05 ’07 ’08 ’09] –Real current information is not available –“Current constraint” concept –Vectorless approach: no simulation needed –Problem: assume ideal current with zero transition time
16
Page 16 Ideal Worst-Case PDN Noise Problem formulation I PDN noise: Worst-case current [Xiang ’09]: Zero current transition time. Unrealistic!
17
Page 17 Ideal Case Study: One-Stage LC Tank w/ ESR’s Define: Note Under-damped condition:
18
Page 18 Ideal Case Study: One-Stage LC Tank w/ ESR’s (Cont’) Step response: where Normalized step response:
19
Page 19 Ideal Case Study: One-Stage LC Tank w/ ESR’s (Cont’) Local extreme points of the step response: Normalized magnitude of the first peak:
20
Page 20 Ideal Case Study: One-Stage LC Tank w/ ESR’s (Cont’) Normalized worst-case noise:
21
Page 21 Ideal Case Study: One-Stage LC Tank w/ ESR’s (Cont’) Impedance: When [Mikhail 08] Normalized peak impedance:
22
Page 22 Worst-Case Noise with Non-zero Current Transition Times Problem formulation II T: chosen to be such that h(t) has died down to some negligible value. * f(t) replaces i(T-τ)
23
Page 23 Proposed Algorithm Based on Dynamic Programming GetTransPos(j,k 1,k 2 ): find the smallest i such that F j (k 1,i)≤ F j (k 2,i) Q.GetMin(): return the minimum element in the priority queue Q Q.DeleteMin(): delete the minimum element in the priority queue Q Q.Add(e): insert the element e in the priority queue Q
24
Page 24 Proposed Algorithm: Initial Setup Divide the time range [0, T] into m intervals [t 0 =0, t 1 ], [t 1, t 2 ], …, [t m-1, t m =T]. h(t i ) = 0, i=1, 2, …, m-1 u 0 = 0, u 1, u 2, …, u n = b are a set of n+1 values within [0, b]. The value of f(t) is chosen from those values. A larger n gives more accurate results. h(t)
25
Page 25 Proposed Algorithm: f(t) within a time interval [ t j, t j+1 ] I j (k,i) : worst-case f(t) starting with u k at time t j and ending with u i at time t j+1 h(t) Theorem 1: The worst-case f(t) can be cons- tructed by determining the values at the zero- crossing points of the h(t)
26
Page 26 Proposed Algorithm: Dynamic Programming Formulation Define V j (k,i) : the corresponding output within time interval [t j, t j+1 ] Define the intermediate objective function OPT(j,i) : the maximum output generated by the f(t) ending at time t j with the value u i Recursive formula for the dynamic programming algorithm: Time complexity:
27
Page 27 Acceleration of the Dynamic Programming Algorithm Without loss of generality, consider the time interval [t j, t j+1 ] where h(t) is negative. Define W j (k,i) : the absolute value of V j (k,i) : Lemma 1 : W j (k 2,i 2 )- W j (k 1,i 2 )≤ W j (k 2,i 1 )- W j (k 1,i 1 ) for any 0 ≤ k 1 < k 2 ≤ n and 0 ≤ i 1 < i 2 ≤ n
28
Page 28 Acceleration of the Dynamic Programming Algorithm Define F j (k,i) : the candidate corresponding to k for OPT(j,i) Accelerated algorithm: –Based on Theorem 2 –Using binary search and priority queue Theorem 2: Suppose k 1 < k 2, i 1 ∈ [0,n] and F j (k 1,i 1 )≤ F j (k 2,i 1 ), then for any i 2 > i 1, we have F j (k 1,i 2 )≤ F j (k 2,i 2 ).
29
Page 29 Case Study 1: Impedance 2.09mΩ @ 19.8KHz 1.69mΩ @ 465KHz 3.23mΩ @ 166MHz
30
Page 30 Case Study 1: Impulse Response Impulse response: 100ns~10µs Impulse response: 10µs~100µs Impulse response: 0s~100ns High frequency oscillation at the beginning with large amplitude, but dies down very quickly Mid-frequency oscillation with relatively small amplitude. Low frequency oscillation with the smallest amplitude, but lasts the longest Amplitude = 1861 Amplitude = 29 Amplitude = 0.01
31
Page 31 Case Study 1: Worst-Case Current Current constraints: Zoom in The worst-case current also oscillates with the three resonant frequencies which matches the impulse response. Saw-tooth-like current waveform at large transition times
32
Page 32 Case Study 1: Worst-Case Noise Response
33
Page 33 Case Study 1: Worst-Case Noise vs. Transition Time The worst-case noise decreases with transition times. Previous approaches which assume zero current transition times result in pessimistic worst-case noise.
34
Page 34 Case Study 2: Impedance 224.3KHz 11.2MHz 98.1MHz 224.3KHz 10.9MHz 101.6MHz
35
Page 35 Case Study 2: Worst-Case Noise for both cases: meaning that the worst-case noise is larger than Z max. The worst-case noise can be larger even though its peak impedance is smaller.
36
Page 36 Case 3: “Rogue Wave” Phenomenon Worst-case noise response: The maximum noise is formed when a long and slow oscillation followed by a short and fast oscillation. Rogue wave: In oceanography, a large wave is formed when a long and slow wave hits a sudden quick wave. Low-frequency oscillation corresponds to the resonance of the 2 nd stage High-frequency oscillation corresponds to the resonance of the 1 st stage
37
Page 37 Case 3: “Rogue Wave” Phenomenon (Cont’) Equivalent input impedance of the 2 nd stage at high frequency
38
Page 38 Case 3: “Rogue Wave” Phenomenon (Cont’) Input current i(t): –Blue (I1): worst-case input stimulus –Red (I2): low frequency part of I1 –Green (I3): high frequency part of I1 I1=I2+I3
39
Page 39 Case 3: “Rogue Wave” Phenomenon (Cont’) Input current i(t) (zoom in):
40
Page 40 Case 3: “Rogue Wave” Phenomenon (Cont’) Noise response @ chip output –Blue (V1): response of I1 –Red (V2): response of I2 –Green (V3): response of I3
41
Page 41 Case 3: “Rogue Wave” Phenomenon (Cont’) Noise response (zoom in):
42
Page 42 Case 3: “Rogue Wave” Phenomenon (Cont’) Responses of I1 @ different locations: V1 Vpkg1 Vchipb1
43
Page 43 Case 3: “Rogue Wave” Phenomenon (Cont’) Response of I1 @ different locations (zoom in):
44
Page 44 Case 3: “Rogue Wave” Phenomenon (Cont’) Responses of I2 @ different locations: V2 Vpkg2 Vchipb2
45
Page 45 Case 3: “Rogue Wave” Phenomenon (Cont’) Response of I2 @ different locations (zoom in):
46
Page 46 Case 3: “Rogue Wave” Phenomenon (Cont’) Responses of I3 @ different locations: V3 Vpkg3 Vchipb3
47
Page 47 Case 3: “Rogue Wave” Phenomenon (Cont’) Response of I3 @ different locations (zoom in):
48
Page 48 Agenda Background: power distribution networks (PDN’s) Analysis: worst-case PDN noise prediction –Motivation –Problem formulation –Proposed Algorithm –Case study Simulation: adaptive parallel flow using discrete Fourier transform (DFT) –Motivation –Adaptive parallel flow description –Experimental results Conclusions and future work
49
Page 49 PDN Simulation: Why Frequency Domain? Huge PDN netlists –Time-domain simulation: serial - slow –Frequency-domain simulation: parallel – fast Frequency dependent parasitics Simulation results –Time-domain: voltage drops, simultaneous switching noise (SSN) – input dependent –Frequency-domain: impedance, anti-resonance peaks – input independent
50
Page 50 Transform Operations Laplace Transform [Wanping ’07] –Input: Series of ramp functions –Output: Rational expressing via vector fitting –Choice of frequency samples Discrete Fourier Transform (DFT) –Periodic signal assumption –Discrete frequency samples
51
Page 51 Basic DFT Simulation Flow
52
Page 52 Adaptive DFT Flow Period[i] : the input period at each iteration Interval[i] : the simulation time step at each iteration FreqUpBd[i] : the upper bound of the input frequency range at each iteration v i (t) : tentative time-domain output within the frequency range [0, FreqUpBd] at each iteration Iteration #1: obtain the main part of the output Iteration #2~k: capture the oscillations in the tail of the output (high, middle, and low resonant frequencies) For each iteration #i, i=k, k-1, …, 2, subtract the captured tail from the outputs at iteration #j, j<i to eliminate the wrap-around effect
53
Page 53 Problem with Basic DFT Flow “Wrap-around effect” requires long padding zeros at the end of the input –Periodicity nature of DFT Small uniform time steps are needed to cover the input frequency range T2T3T4T T DFT repetition output Large number of simulation points! Correct Distorted! T Wrap-around
54
Page 54 Adaptive DFT Simulation Basic ideas of the adaptive DFT flow: cancel out the wrap-around effect by subtracting the tail from the main part of the output –Main part of the output: obtained with small time step and small period; distorted by the wrap-around effect –Tail of the output: low frequency oscillation; can be captured with large time steps T2T3T4T - - - T CorrectDistorted Correct! Total number of simulation points is reduced significantly!
55
Page 55 Experimental Results: Test Case & Input Test case: 3D PDN –One resonant peak in the impedance profile Input current –Time step: ∆t = 20ps –Duration: T 0 = 16.88ns Impedance Original Input
56
Page 56 Experimental Results: Adaptive Flow Process Iteration #1: v 1 (t) –∆t 1 =20ps –T 1 =20.48ns Iteration #2: v 2 (t) –∆t 2 = 32∆t 1 = 640ps –T 2 = 4T 1 = 81.92ns Final output: –Main part: –Tail: T1T1 2T 1 4T 1 3T 1 v 1 (t), ∆t 1 =20ps, T 1 =20.48ns Final output v 2 (t), ∆t 2 =640ps, T 2 =81.92ns
57
Page 57 Experimental Results: DFT Flow vs. SPICE
58
Page 58 Error Analysis: Error Caused by Wrap-around Effect Theorem 1 : Let be the initial value of the output voltage. Suppose for some, then the mean square error, i.e., is bounded by. Relative error: 2.09% Relative error: 0.12% Output comparisonError relative to SPICE
59
Page 59 Error Analysis: Error Caused by Different Interpolation Methods SPICE: PWL interpolation DFT: sinusoidal interpolation Output comparisonError relative to SPICE
60
Page 60 Time Complexity Analysis: Adaptive vs. Non-adaptive Adaptive flow time complexity: –T i : simulation period at iteration #i, –∆t i : simulation time step at iteration #i, Non-adaptive flow time complexity:
61
Page 61 Parallel Processing Test case: 3D PDN –Case 1: 393930 nodes –Case 2: 1465206 nodes Simulation time (case 2) –DFT flow: ~3.5 hr (w/ 1 prc) 76 sec (w/ 256 prcs) –HSPICE: ~38 hr
62
Page 62 Agenda Background: power distribution networks (PDN’s) Analysis: worst-case PDN noise prediction –Motivation –Problem formulation –Proposed Algorithm –Case study Simulation: adaptive parallel flow using discrete Fourier transform (DFT) –Motivation –Adaptive parallel flow description –Experimental results Conclusions and future work
63
Page 63 Remarks Worst-case PDN noise prediction with non-zero current transition time –The worst-case PDN noise decreases with transition time –Small peak impedance may not lead to small worst-case noise –“Rogue wave” phenomenon Adaptive parallel flow for PDN simulation using DFT –0.093% relative error compared to SPICE –10x speed up with single processor. –Parallel processing reduces the simulation time even more significantly
64
Page 64 Summary 1.Throughput/power (instruction/energy) 2.Throughput 2 /power (f x instruction/energy) Power Distribution Network –VRMs, Switches, Decaps, ESRs, Topology, Analysis –Stimulus, Noise Tolerance, Simulation Control (smart grid) –High efficiency, Real time analysis, Stability, Reliability, Rapid recovery, and Self healing
65
Page 65
66
Back up
67
Page 67 Case 3: “Rogue Wave” Phenomenon (Cont’) I L2 ILIL
68
Page 68 Case 3: “Rogue Wave” Phenomenon (Cont’) V 2nd V 2nd_only
69
Page 69 Case 3: “Rogue Wave” Phenomenon (Cont’) I L2 I L1 ILIL
70
Page 70 Case 3: “Rogue Wave” Phenomenon (Cont’) I L2 I L1 ILIL Zoom in
71
Page 71 Case 3: “Rogue Wave” Phenomenon (Cont’) V 2nd V 1st -V 2nd V 1st_only
72
Page 72 Case 3: “Rogue Wave” Phenomenon (Cont’) V 2nd V 1st -V 2nd V 1st_only Zoom in
73
Page 73 Case 3: “Rogue Wave” Phenomenon (Cont’) V 2nd V 1st -V 2nd V 1st_only V 2nd_only V 1st max(V 1st )=37.34mV max(V 2nd_only ) + max(V 1st_only ) = 42.09mV ≈ max(V 1st )
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.