Thermal Via Allocation for 3D ICs Considering Temporally and Spatially Variant Thermal Power Hao Yu, Yiyu Shi and Lei He Electrical Engineering Dept. UCLA,

Slides:



Advertisements
Similar presentations
You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Advertisements

Cognitive Radio Communications and Networks: Principles and Practice By A. M. Wyglinski, M. Nekovee, Y. T. Hou (Elsevier, December 2009) 1 Chapter 12 Cross-Layer.
THE FINITE ELEMENT METHOD
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
Balanced Device Characterization. Page 2 Outline Characteristics of Differential Topologies Measurement Alternatives Unbalanced and Balanced Performance.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
On Sequential Experimental Design for Empirical Model-Building under Interval Error Sergei Zhilin, Altai State University, Barnaul, Russia.
1 Outline relationship among topics secrets LP with upper bounds by Simplex method basic feasible solution (BFS) by Simplex method for bounded variables.
Robust Window-based Multi-node Technology- Independent Logic Minimization Jeff L.Cobb Kanupriya Gulati Sunil P. Khatri Texas Instruments, Inc. Dept. of.
MICRO-BUMP ASSIGNMENT FOR 3D ICS USING ORDER RELATION TA-YU KUAN, YI-CHUN CHANG, TAI-CHEN CHEN DEPARTMENT OF ELECTRICAL ENGINEERING, NATIONAL CENTRAL UNIVERSITY,
Pole Placement.
Announcements Homework 6 is due on Thursday (Oct 18)
ECE 530 – Analysis Techniques for Large-Scale Electrical Systems
THERMAL-AWARE BUS-DRIVEN FLOORPLANNING PO-HSUN WU & TSUNG-YI HO Department of Computer Science and Information Engineering, National Cheng Kung University.
Utility Optimization for Event-Driven Distributed Infrastructures Cristian Lumezanu University of Maryland, College Park Sumeer BholaMark Astley IBM T.J.
Spoofing State Estimation
1 Column Generation. 2 Outline trim loss problem different formulations column generation the trim loss problem master problem and subproblem in column.
Shadow Prices vs. Vickrey Prices in Multipath Routing Parthasarathy Ramanujam, Zongpeng Li and Lisa Higham University of Calgary Presented by Ajay Gopinathan.
Chapter 9 -- Simplification of Sequential Circuits.
Chapter 4 Gates and Circuits.
Chapter 4 Gates and Circuits.
Gate Sizing for Cell Library Based Designs Shiyan Hu*, Mahesh Ketkar**, Jiang Hu* *Dept of ECE, Texas A&M University **Intel Corporation.
CS105 Introduction to Computer Concepts GATES and CIRCUITS
1 COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. On the Capacity of Wireless CSMA/CA Multihop Networks Rafael Laufer and Leonard Kleinrock Bell.
Chapter 4 Gates and Circuits.
1 Analysis of Random Mobility Models with PDE's Michele Garetto Emilio Leonardi Politecnico di Torino Italy MobiHoc Firenze.
1. 2 No lecture on Wed February 8th Thursday 9 th Feb 14: :00 Thursday 9 th Feb 14: :00.
Solving Equations How to Solve Them
1 Weiren Yu 1,2, Xuemin Lin 1, Wenjie Zhang 1 1 University of New South Wales 2 NICTA, Australia Towards Efficient SimRank Computation over Large Networks.
25 seconds left…...
Simultaneous Power and Thermal Integrity Driven Via Stapling in 3D ICs
Non-Gaussian Statistical Timing Analysis Using Second Order Polynomial Fitting Lerong Cheng 1, Jinjun Xiong 2, and Lei He 1 1 EE Department, UCLA *2 IBM.
Maria Ugryumova Direct Solution Techniques in Spectral Methods CASA Seminar, 13 December 2007.
The University of Michigan Georgia Institute of Technology
Introduction into Simulation Basic Simulation Modeling.
Ramprasad Yelchuru, Optimal controlled variable selection for Individual process units, 1 Optimal controlled variable selection for individual process.
ECE 530 – Analysis Techniques for Large-Scale Electrical Systems Prof. Hao Zhu Dept. of Electrical and Computer Engineering University of Illinois at Urbana-Champaign.
3D-STAF: Scalable Temperature and Leakage Aware Floorplanning for Three-Dimensional Integrated Circuits Pingqiang Zhou, Yuchun Ma, Zhouyuan Li, Robert.
Minimal Skew Clock Synthesis Considering Time-Variant Temperature Gradient Hao Yu, Yu Hu, Chun-Chen Liu and Lei He EE Department, UCLA Presented by Yu.
Minimal Skew Clock Embedding Considering Time-Variant Temperature Gradient Hao Yu, Yu Hu, Chun-Chen Liu and Lei He EE Department, UCLA Presented by Yu.
The continuous scaling trends of smaller devices, higher operating frequencies, lower power supply voltages, and more functionalities for integrated circuits.
1 Thermal Via Placement in 3D ICs Brent Goplen, Sachin Sapatnekar Department of Electrical and Computer Engineering University of Minnesota.
1 BSMOR: Block Structure-preserving Model Order Reduction http//:eda.ee.ucla.edu Hao Yu, Lei He Electrical Engineering Dept., UCLA Sheldon S.D. Tan Electrical.
Off-chip Decoupling Capacitor Allocation for Chip Package Co-Design Hao Yu Berkeley Design Chunta Chu and Lei He EE Department.
SAMSON: A Generalized Second-order Arnoldi Method for Reducing Multiple Source Linear Network with Susceptance Yiyu Shi, Hao Yu and Lei He EE Department,
Analytical Thermal Placement for VLSI Lifetime Improvement and Minimum Performance Variation Andrew B. Kahng †, Sung-Mo Kang ‡, Wei Li ‡, Bao Liu † † UC.
Efficient Decoupling Capacitance Budgeting Considering Operation and Process Variations Yiyu Shi*, Jinjun Xiong +, Chunchen Liu* and Lei He* *Electrical.
CDCTree: Novel Obstacle-Avoiding Routing Tree Construction based on Current Driven Circuit Model Speaker: Lei He.
Lecture 12 Review and Sample Exam Questions Professor Lei He EE 201A, Spring 2004
Fast Thermal Analysis on GPU for 3D-ICs with Integrated Microchannel Cooling Zhuo Fen and Peng Li Department of Electrical and Computer Engineering, {Michigan.
TSV-Aware Analytical Placement for 3D IC Designs Meng-Kai Hsu, Yao-Wen Chang, and Valerity Balabanov GIEE and EE department of NTU DAC 2011.
PiCAP: A Parallel and Incremental Capacitance Extraction Considering Stochastic Process Variation Fang Gong 1, Hao Yu 2, and Lei He 1 1 Electrical Engineering.
Thermal-aware Steiner Routing for 3D Stacked ICs M. Pathak and S.K. Lim Georgia Institute of Technology ICCAD 07.
Statistical Sampling-Based Parametric Analysis of Power Grids Dr. Peng Li Presented by Xueqian Zhao EE5970 Seminar.
A Power Grid Analysis and Verification Tool Based on a Statistical Prediction Engine M.K. Tsiampas, D. Bountas, P. Merakos, N.E. Evmorfopoulos, S. Bantas.
Scalable Symbolic Model Order Reduction Yiyu Shi*, Lei He* and C. J. Richard Shi + *Electrical Engineering Department, UCLA + Electrical Engineering Department,
Partition-Driven Standard Cell Thermal Placement Guoqiang Chen Synopsys Inc. Sachin Sapatnekar Univ of Minnesota For ISPD 2003.
QuickYield: An Efficient Global-Search Based Parametric Yield Estimation with Performance Constraints Fang Gong 1, Hao Yu 2, Yiyu Shi 1, Daesoo Kim 1,
EE 201C Modeling of VLSI Circuits and Systems
1 Hardware Reliability Margining for the Dark Silicon Era Liangzhen Lai and Puneet Gupta Department of Electrical Engineering University of California,
On-Chip Power Network Optimization with Decoupling Capacitors and Controlled-ESRs Wanping Zhang1,2, Ling Zhang2, Amirali Shayan2, Wenjian Yu3, Xiang Hu2,
Finite Element Method To be added later 9/18/2018 ELEN 689.
Chapter 2 Interconnect Analysis
Chapter 5b Stochastic Circuit Optimization
An Automated Design Flow for 3D Microarchitecture Evaluation
Simultaneous Power and Thermal Integrity Driven Via Stapling in 3D ICs
EE 201C Modeling of VLSI Circuits and Systems
Yiyu Shi*, Jinjun Xiong+, Chunchen Liu* and Lei He*
Yiyu Shi*, Jinjun Xiong+, Chunchen Liu* and Lei He*
Simultaneous Power and Thermal Integrity Driven Via Stapling in 3D ICs
Presentation transcript:

Thermal Via Allocation for 3D ICs Considering Temporally and Spatially Variant Thermal Power Hao Yu, Yiyu Shi and Lei He Electrical Engineering Dept. UCLA, USA Partially supported by NSF and UC-MICRO fund from Intel Tanay Karnik Circuit Research Lab Intel, USA

2 New Solution for High-performance Integration n 2D SoC design has limited density and interconnect performance n Potential solution: 3D Integration [Banerjee-Saraswart:IEEE’01] l Fabrication Technologies: Chip-level Wafer Bonding or Die-level Silicon Epitaxial Growth l Inter-layer via plays a crucial role in signaling, power delivery and heat- removal

3 Thermal Challenges in 3D ICs n Temperature increases along third dimension l Inter-layer dielectric layers are poor thermal conductors 150c 135c 100c 70c 40c n High temperature affects interconnect and device reliability and leads to variations to timing n Thermal analysis and thermal-aware design for 3D ICs becomes a need

4 Via Planning Problem n Motivation l Inter-layer vias are good thermal-conductor to remove heat l Inter-layer vias take additional chip area and routing resource n Previous work l Iterative via planning during placement [Goplen-Sapatnekar:ISPD’05] l Multilevel alternating direction via planning during routing [Zhang- Cong:ICCAD’05] l Both use steady-state analysis and assume a maximum-thermal power, and may lead to over-design n Primary contributions of our work l Minimize a thermal violation integral considering transient temperature l Develop an efficient sensitivity-driven sequential programming with use of macromodel

5 Outline n Background and Problem Formulation n Structured and Parameterized Macromodel n Sequential Optimization n Experimental Results n Conclusions

6 Thermal Model Overview Temperature Voltage state variables (x(t)) Thermal-Power Input Current sources (u(t)) Thermal conductance Electrical conductance (G) Thermal capacitanceElectrical capacitance (C) n Electric and thermal systems can be described in MNA (modified nodal analysis) equation n Via conductance gi and capacitance ci are both proportional to size Ai or density (Ai/a) (a is unit via area) l It can be parametrically added into MNA equation n Electric and thermal duality

7 Steady State Model and Analysis n Steady-state temperature can be obtained by directly solving a time-invariant linear equation R n Active-device and inter-dielectric layers are discretized into tiles l Tiles connected by thermal resistance l Heat sources modeled as time-invariant current sources

8 Transient Model and Analysis n Transient temperature can be obtained by solving a time- variant linear equation RC n Tiles connected by thermal resistance and thermal capacitance n Heat sources modeled as time-variant current sources

9 Thermal Power Variation and Analysis n Different workloads and dynamic power management introduces temporally and spatially power variations l Thermal power is the runtime averaging of cycle-accurate power, and is not a constant spatially and temporally n Steady-state analysis needs to assume a maximum thermal power simultaneously for all regions l It seldom happens that each part of the chip achieves their maximum simultaneously, and can result in an over-design n Transient analysis is accurate but time-consuming l It calls for more accurate yet efficient transient thermal simulation during the design automation

10 Thermal Violation Integral n Thermal violation is temperature overshoot for a long enough period, so maximum temperature is not a good Figure of Merit (FOM) Thermal-violation integral as FOM f k (A) is more accurate Time-domain transient temperature ( y ) integral over defined ceiling temperature ( T ceiling ) for a long enough period ( t 0 ~ t p ) at ith tile FOM f(A) for a group ( K ) of critical tiles A is a via density vector

11 Problem Formulation Find a via density vector A to minimize the thermal violation integral under global/local routing congestion constraints n Two keys to efficiently solve this problem l Efficient models to transient response, and its first-order and second-order sensitivity with respect to via density l Efficient yet effective mathematic programming Global constraint Local constraint

12 Outline n Background and Problem Formulation n Structured and Parameterized Macromodel n Sequential Optimization n Experimental Results n Conclusions

13 Macromodel by Moment Matching large linear network … small linear network n Krylov-subspace based projection can reduce model size and preserve accuracy by matching moments of inputs [Odabasioglu-Celik-Pileggi:TCAD’98] l Flat projection does not preserve block matrix structure such as sparsity l Reduced macromodel does not contain sensitivity information for design automation

14 Parameterization (I) The inserted location is described by adjacent matrix X The via density ( Ai ) is parameterized and added into MNA X(2,6)= Need to separate sensitivity from nominal response

15 Parameterization (II) Expand state variables x(A 1,…A K,s) by Taylor expansion w.r.t. A i (up to second order) l x^(0), x^(1), and x^(2) are nominal values, first-order and second-order sensitivities n Expanded system has lower-triangular structure n System size is enlarged and needs to be reduced by projection l Traditional flat projection can not separate the nominal state variables and their sensitivities [Li-Pileggi:ICCAD’04] l This can be solved by a structure-preserved projection [Yu-He-Tan:BMAS’05]

16 Structured Projection (I) n Block-diagonally partition the projection matrix by the size of nominal state- variable, first-order sensitivity, and second- order sensitivity n Use structured projection can result in a reduced triangular system with nominal value and sensitivities to be solved independently

17 Structured Projection (II) n Nominal response, and sensitivity can be solved separately and efficiently l The reduced model is sparse There is only one LU-factorization of the reduced diagonal block G 0 +(1/h)C 0 n Generated sensitivities can be used in any gradient based optimization n Time-domain transient response can be solved using Backward-Euler method

18 Outline n Background and Problem Formulation n Structured and Parameterized Macromodel n Sequential Optimization n Experimental Results n Conclusions

19 Sequential Approximation of Objective Function The objective function f(A) could be approximated Find ( ΔA ) to minimize f lp or f qp during each step n The objective function becomes semi-definite when integration is approximated by a discretized summation [Visweswariah:TCAD’00] l Sequential programming converges for convex-programming problems, and still has good convergence in semi-definite problems

20 Sensitivity Calculation n Direct sensitivity calculation for objective function n Structured and parameterized reduction provides an efficient calculation of both nominal value and sensitivity The via density vector A can be efficiently updated during each iteration n The computation cost could be further reduced when an adjoint Lagrangian method is used to calculate sensitivity [Visweswariah:TCAD’00]

21 Outline n Background and Problem Formulation n Structured and Parameterized Macromodel n Sequential Optimization n Experimental Results n Conclusions

22 Experiment Settings n A modest 3D stacking with 1-heat-sink, 2-die- layer, 2-dielectric-layer is assumed, each extracted as RC mesh interconnected by RC- pair for via n Clock gating is assumed with a period of 250ms n Reduction algorithm assumes SIMO (single- input-multiple-output) reduction when the number of inputs is large n Compare our method (SP-Macro) with Steady- state solution

23 Accuracy of Reduced Macromodel n Transient temperature responses of exact and SP-MACRO models at port 3, 18, and 58 of top layer with step-response input l The responses of macromodels are visually identical to those exact models

24 Optimization Profile by SQP n Temperature reduction at selected location during the procedure of via-allocation by SQP l The allocated via results in a transient temperature meeting the targeted ceiling temperature 52C

25 Temperature Map n Temperature maps before and after the via allocation at the top layer l The maximum temperature before allocation is about 150C l The temperature after allocation meets the targeted ceiling temperature 52C

26 Allocated-via and Runtime Comparison n Compared to steady-state solution l SP-MACRO has smaller simulation and planning time when increasing circuit size u It reduces the runtime by 126X l SP-MACRO is more accurate to predict the via insertion u It reduces the inserted via number by 2.04X Total/ critical tile Total via Constraint Original/ ceiling T Steady-state by direct solution Transient by SP-MACRO Solve- dc(s) Solve- tran(s) Allo-viaRedu- ckt(s) Solve- sens(s) Qp/lp- plan (s) Allo-via 256/ / / / / / / / NA / /60NA / /60NA

27 Conclusions n Via planning based on the transient thermal analysis reduces via umber by 2.04x compared to the steady-state thermal analysis n An efficient via planning algorithm is developed l Structured and parameterized model reduction provides both nominal values and sensitivities l Sequential linear/quadratic programming minimizes the thermal-violation integral n SP-MACRO is further extended for l Simultaneous power and thermal integrity driven via planning [Yu-Ho-He:ICCAD’06]