UC San Diego / VLSI CAD Laboratory Reliability-Constrained Die Stacking Order in 3DICs Under Manufacturing Variability Tuck-Boon Chan, Andrew B. Kahng,

Slides:



Advertisements
Similar presentations
THERMAL-AWARE BUS-DRIVEN FLOORPLANNING PO-HSUN WU & TSUNG-YI HO Department of Computer Science and Information Engineering, National Cheng Kung University.
Advertisements

International Symposium on Low Power Electronics and Design Qing Xie, Mohammad Javad Dousti, and Massoud Pedram University of Southern California ISLPED.
Tunable Sensors for Process-Aware Voltage Scaling
OCV-Aware Top-Level Clock Tree Optimization
-1- VLSI CAD Laboratory, UC San Diego Post-Routing BEOL Layout Optimization for Improved Time- Dependent Dielectric Breakdown (TDDB) Reliability Tuck-Boon.
Timing Margin Recovery With Flexible Flip-Flop Timing Model
UC San Diego / VLSI CAD Laboratory NOLO: A No-Loop, Predictive Useful Skew Methodology for Improved Timing in IC Implementation Tuck-Boon Chan, Andrew.
Toward Better Wireload Models in the Presence of Obstacles* Chung-Kuan Cheng, Andrew B. Kahng, Bao Liu and Dirk Stroobandt† UC San Diego CSE Dept. †Ghent.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 20: Circuit Design Pitfalls Prof. Sherief Reda Division of Engineering,
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Architectural-Level Prediction of Interconnect Wirelength and Fanout Kwangok Jeong, Andrew B. Kahng and Kambiz Samadi UCSD VLSI CAD Laboratory
Supply Voltage Degradation Aware Analytical Placement Andrew B. Kahng, Bao Liu and Qinke Wang UCSD CSE Department {abk, bliu,
Constructing Current-Based Gate Models Based on Existing Timing Library Andrew Kahng, Bao Liu, Xu Xu UC San Diego
Institute of Digital and Computer Systems 1 Fabio Garzia / Finding Peak Performance in a Process23/06/2015 Chapter 5 Finding Peak Performance in a Process.
From Compaq, ASP- DAC00. Power Consumption Power consumption is on the rise due to: - Higher integration levels (more devices & wires) - Rising clock.
Chung-Kuan Cheng†, Andrew B. Kahng†‡,
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
1 UCSD VLSI CAD Laboratory ISQED-2009 Revisiting the Linear Programming Framework for Leakage Power vs. Performance Optimization Kwangok Jeong, Andrew.
1 Temperature-Aware Resource Allocation and Binding in High Level Synthesis Authors: Rajarshi Mukherjee, Seda Ogrenci Memik, and Gokhan Memik Presented.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Analytical Thermal Placement for VLSI Lifetime Improvement and Minimum Performance Variation Andrew B. Kahng †, Sung-Mo Kang ‡, Wei Li ‡, Bao Liu † † UC.
Interconnect Implications of Growth-Based Structural Models for VLSI Circuits* Chung-Kuan Cheng, Andrew B. Kahng and Bao Liu UC San Diego CSE Dept.
SLIP 2000April 9, Wiring Layer Assignments with Consistent Stage Delays Andrew B. Kahng (UCLA) Dirk Stroobandt (Ghent University) Supported.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
University of Michigan Electrical Engineering and Computer Science 1 Online Timing Analysis for Wearout Detection Jason Blome, Shuguang Feng, Shantanu.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Noise and Delay Uncertainty Studies for Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu † and Devendra Vidhani ‡ UCLA Computer Science Department,
Decoupling Capacitance Allocation for Power Supply Noise Suppression Shiyou Zhao, Kaushik Roy, Cheng-Kok Koh School of Electrical & Computer Engineering.
DDRO: A Novel Performance Monitoring Methodology Based on Design-Dependent Ring Oscillators Tuck-Boon Chan †, Puneet Gupta §, Andrew B. Kahng †‡ and Liangzhen.
Enhanced Metamodeling Techniques for High-Dimensional IC Design Estimation Problems Andrew B. Kahng, Bill Lin and Siddhartha Nath VLSI CAD LABORATORY,
-1- UC San Diego / VLSI CAD Laboratory Methodology for Electromigration Signoff in the Presence of Adaptive Voltage Scaling Wei-Ting Jonas Chan, Andrew.
Analysis of Instruction-level Vulnerability to Dynamic Voltage and Temperature Variations ‡ Computer Science and Engineering, UC San Diego variability.org.
Advanced Computing and Information Systems laboratory Device Variability Impact on Logic Gate Failure Rates Erin Taylor and José Fortes Department of Electrical.
Accuracy-Configurable Adder for Approximate Arithmetic Designs
Chalmers University of Technology FlexSoC Seminar Series – Page 1 Power Estimation FlexSoc Seminar Series – Daniel Eckerbert
-1- UC San Diego / VLSI CAD Laboratory A Global-Local Optimization Framework for Simultaneous Multi-Mode Multi-Corner Clock Skew Variation Reduction Kwangsoo.
A New Methodology for Reduced Cost of Resilience Andrew B. Kahng, Seokhyeong Kang and Jiajia Li UC San Diego VLSI CAD Laboratory.
UC San Diego / VLSI CAD Laboratory Toward Quantifying the IC Design Value of Interconnect Technology Improvement Tuck-Boon Chan, Andrew B. Kahng, Jiajia.
Horizontal Benchmark Extension for Improved Assessment of Physical CAD Research Andrew B. Kahng, Hyein Lee and Jiajia Li UC San Diego VLSI CAD Laboratory.
Low-Power Gated Bus Synthesis for 3D IC via Rectilinear Shortest-Path Steiner Graph Chung-Kuan Cheng, Peng Du, Andrew B. Kahng, and Shih-Hung Weng UC San.
An evaluation of HotSpot-3.0 block-based temperature model
UC San Diego / VLSI CAD Laboratory Incremental Multiple-Scan Chain Ordering for ECO Flip-Flop Insertion Andrew B. Kahng, Ilgweon Kang and Siddhartha Nath.
Thermal-aware Steiner Routing for 3D Stacked ICs M. Pathak and S.K. Lim Georgia Institute of Technology ICCAD 07.
-1- UC San Diego / VLSI CAD Laboratory Construction of Realistic Gate Sizing Benchmarks With Known Optimal Solutions Andrew B. Kahng, Seokhyeong Kang VLSI.
The Fast Optimal Voltage Partitioning Algorithm For Peak Power Density Minimization Jia Wang, Shiyan Hu Department of Electrical and Computer Engineering.
PRAVEEN VENKATARAMANI VISHWANI D. AGRAWAL Auburn University, Dept. of ECE Auburn, AL 36849, USA 26 th International.
A High-Gain, Low-Noise, +6dBm PA in 90nm CMOS for 60-GHz Radio
Kwangsoo Han, Andrew B. Kahng, Hyein Lee and Lutong Wang
A Power Grid Analysis and Verification Tool Based on a Statistical Prediction Engine M.K. Tsiampas, D. Bountas, P. Merakos, N.E. Evmorfopoulos, S. Bantas.
Kwangsoo Han‡, Andrew B. Kahng‡† and Hyein Lee‡
Outline Introduction: BTI Aging and AVS Signoff Problem
Eyecharts: Constructive Benchmarking of Gate Sizing Heuristics Puneet Gupta, University of California, Los Angeles Andrew B. Kahng, University of California,
TSV-Constrained Micro- Channel Infrastructure Design for Cooling Stacked 3D-ICs Bing Shi and Ankur Srivastava, University of Maryland, College Park, MD,
Integrated Target Reflectivity Analysis
Computer Science and Engineering Power-Performance Considerations of Parallel Computing on Chip Multiprocessors Jian Li and Jose F. Martinez ACM Transactions.
Mixed Cell-Height Implementation for Improved Design Quality in Advanced Nodes Sorin Dobre +, Andrew B. Kahng * and Jiajia Li * * UC San Diego VLSI CAD.
Outline Motivation and Contributions Related Works ILP Formulation
-1- UC San Diego / VLSI CAD Laboratory On Potential Design Impacts of Electromigration Awareness Andrew B. Kahng, Siddhartha Nath and Tajana S. Rosing.
Department of Electrical and Computer Engineering University of Wisconsin - Madison Optimizing Total Power of Many-core Processors Considering Voltage.
-1- UC San Diego / VLSI CAD Laboratory Optimization of Overdrive Signoff Tuck-Boon Chan, Andrew B. Kahng, Jiajia Li and Siddhartha Nath Tuck-Boon Chan,
-1- UC San Diego / VLSI CAD Laboratory Optimal Reliability-Constrained Overdrive Frequency Selection in Multicore Systems Andrew B. Kahng and Siddhartha.
1 Hardware Reliability Margining for the Dark Silicon Era Liangzhen Lai and Puneet Gupta Department of Electrical Engineering University of California,
PROCEED: Pareto Optimization-based Circuit-level Evaluation Methodology for Emerging Devices Shaodi Wang, Andrew Pan, Chi-On Chui and Puneet Gupta Department.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
3D IC Reliability Xinxin Yu, Hao Wu.
Abbas Rahimi‡, Luca Benini†, and Rajesh Gupta‡ ‡CSE, UC San Diego
Kristof Blutman† , Hamed Fatemi† , Andrew B
Improved Performance of 3DIC Implementations Through Inherent Awareness of Mix-and-Match Die Stacking Kwangsoo Han, Andrew B. Kahng and Jiajia Li University.
Revisiting and Bounding the Benefit From 3D Integration
Mission Profile Aware IC Design - A Case Study -
Presentation transcript:

UC San Diego / VLSI CAD Laboratory Reliability-Constrained Die Stacking Order in 3DICs Under Manufacturing Variability Tuck-Boon Chan, Andrew B. Kahng, Jiajia Li VLSI CAD LABORATORY, UC San Diego

-2- Outline Motivation and Problem Statement Motivation and Problem Statement Modeling Modeling Our Methodologies Our Methodologies Experimental Setup and Results Experimental Setup and Results Conclusion Conclusion

-3- Outline Motivation and Problem Statement Motivation and Problem Statement Modeling Modeling Our Methodologies Our Methodologies Experimental Setup and Results Experimental Setup and Results Conclusion Conclusion

-4- Reliability Challenges for 3DICs Stacking of multiple dies increases power density  High power density  high temperature – –3DICs with four tiers increase peak temperature by 33°C Reliability (e.g., EM) highly depends on temperature Bottom tier Top tier (nearest to heat sink) 35°C Temperature range in a 5-tier 3DIC

-5- Context: Stacking of Identical Dies Identical dies in 3DIC stack   Can change stacking order Dies in stack can have different process corners, but must meet same performance spec  Adaptive Voltage Scaling (AVS)  each die has different V dd  Slower dies have higher V dd  power↑, temp↑, MTTF↓ Target frequency

-6- Motivation Stacking style: ordered selection of dies with particular process variations Heat sink  Letters S, T and F indicate the (slow, typical, fast) process corners  Strings over {S, T, F} indicate stacks (left-to-right corresponds to bottom-to-top) Stacking style “FTS” TSV MOSFET Fast-corner die Bottom tier MOSFET Slow-corner die Top tier TSV MOSFET Typical-corner die Middle tier

-7- Motivation Stacking style: ordered selection of dies with particular process variations  Different stacking style  different mean time to failure (MTTF)  Goal: find the optimal stacking style  improve reliability  Letters S, T and F indicate the (slow, typical, fast) process corners  Strings over {S, T, F} indicate stacks (left-to-right corresponds to bottom-to-top) Different stacking orders of {F, T, S} die  up to 44% ∆ MTTF

-8- Stacking Optimization Problem Given N dies with distinct process variation Such that frequency of each die in a stack = f req Objective to maximize summation of MTTFs of stacks

-9- Outline Motivation and Problem Statement Motivation and Problem Statement Modeling Modeling Our Methodologies Our Methodologies Experimental Setup and Results Experimental Setup and Results Conclusion Conclusion

-10- Reliability Model for 3DICs Electromigration is now a dominant reliability constraint   Our work focuses on EM We use Black’s equation to estimate MTTF of a die (MTTF die ) – –MTTF exponentially depends on temperature Failure rate ( λ ) is the number of units failing per unit time  During the useful-life period λ is constant  MTTF = 1 / λ (1)  Any failure of any die causes a stack to fail  λ stack = ∑ λ die (2)  (1) and (2)  MTTF stack = 1 / (∑1/MTTF die ) λ Time Useful-life period

-11- Bin-Based Model for Process Variation Each die exhibits distinct process variation   find the optimal stacking style is intractable We classify dies into constant number of process bins – –Dies with similar process variations are classified to one bin – –We assume same process variation for dies in one bin -3σ -1.5σ 0σ 1.5σ 3σ # of dies Bin 1Bin 2Bin 3

-12- Outline Motivation and Problem Statement Motivation and Problem Statement Modeling Modeling Our Methodologies Our Methodologies Experimental Setup and Results Experimental Setup and Results Conclusion Conclusion

-13- Determinants of 3DIC Reliability Peak temperature defines the MTTF of the 3DIC Two factors have significant impacts on temperature of 3DIC Process variation  Same performance requirement for all dies  Adaptive voltage scaling is deployed  Slower dies have higher V dd, power, higher temperatures Stacking order  Primary mechanism for thermal dissipation in a 3DIC is through heat sink  Vertical temperature gradient exists in 3DICs  Dies on bottom tiers have higher temperatures Worst-case peak temperature (= minimum MTTF) happens where slow dies are on bottom tiers (far from the heat sink)

-14- Rule-of-Thumb Rule-of-thumb: to optimize reliability of a 3DIC, the slowest dies should be located closest to the heat sink For a stack with particular composition of dies, the optimal stacking order is determined by rule-of-thumb  Letters {S, T, F} indicate process corners  Strings indicate stacking order Locating slow dies close to the heat sink helps improve MTTFs of 3DICs

-15- “Zig-zag” Heuristic Method Zig-zag heuristic method is based on rule-of-thumb Stack dies from slow to fast, from top tiers to bottom tiers Complexity of stacking optimization is NP-hard, but zig- zag is O(n·log(n)) (n = number of dies) Top tier (nearest to heat sink) Bottom tier

-16- ILP-Based Method ILP formulation – –Maximize ∑MTTF i ·C i – –Such that ∑C i ·Y q,i = X q // each input die should be used exactly once and consistent with its process bin C i ≥ 0 // number of output stacks implemented with i th stacking style cannot be negative Notations – –C i is the number of stacks implemented with i th stacking style – –MTTF i is the MTTF of stack implemented with i th stacking style – –Y q,i is the number of dies belong to q th bin contained in i th stacking style – –X q is the number of dies classified to q th bin

-17- Outline Motivation and Problem Statement Motivation and Problem Statement Modeling Modeling Our Methodologies Our Methodologies Experimental Setup and Results Experimental Setup and Results Conclusion Conclusion

-18- Experimental Setup Design: JPEG from OpenCores Technology: TSMC 65nm Libraries: characterized using Cadence Library Characterizer vEDI9.1 – –Process corner: SS, TT, FF – –Temperature: 45 ° C – 165 ° C – –Voltage: 0.9V – 1.2V LP solver: lp_solve 5.5 Thermal analysis: use Hotspot 5.02 – –Chip thickness = 50 μm – –Convection capacitance = 140.4J/K – –Ambient temperature = 60 ° C

-19- Improvement on MTTF Stacking optimization (ILP-based and zig-zag) increases the MTTFs of stacks Average MTTF of stacks

-20- Variation of MTTF Stacking optimization (ILP-based and zig-zag) increases the MTTFs of stacks Stacking optimization (ILP-based and zig-zag) reduces the variation in MTTFs ILP-based Zig-zag Greedy Random

-21- Variability Can Help ! Manufacturing variation can help improve MTTF of stacks

-22- Variability Can Help ! Manufacturing variation can help improve MTTF of stacks Supply voltage can exceed the maximum allowed value   Benefit from process variation disappears when the variation exceeds a particular amount Limited amount of process variation can help improve reliabilities of 3DICs with stacking optimization σ

-23- Outline Motivation Motivation Modeling Modeling Problem and Methodologies Problem and Methodologies Experimental Setups and Results Experimental Setups and Results Conclusion Conclusion

-24- Conclusion We study variability-reliability interactions and optimization in 3DICs We propose “rule-of-thumb” guideline for stacking optimization to reduce the peak temperature and increase MTTFs of 3DICs We propose ILP-based and zig-zag heuristic methods for stacking optimization We show that limited amount of manufacturing variation can help to improve reliabilities of 3DICs with stacking optimization Future Work – –Optimize on other objectives (power variation) – –Different performance requirements for dies

-25- Acknowledgments Work supported from Sandia National Labs, Qualcomm, Samsung, SRC and the IMPACT (UC Discovery) center

Thank You!