-1- UC San Diego / VLSI CAD Laboratory On Potential Design Impacts of Electromigration Awareness Andrew B. Kahng, Siddhartha Nath and Tajana S. Rosing.

Slides:



Advertisements
Similar presentations
Tunable Sensors for Process-Aware Voltage Scaling
Advertisements

OCV-Aware Top-Level Clock Tree Optimization
-1- VLSI CAD Laboratory, UC San Diego Post-Routing BEOL Layout Optimization for Improved Time- Dependent Dielectric Breakdown (TDDB) Reliability Tuck-Boon.
Kwangok Jeong and Andrew B. Kahng UCSD VLSI CAD Laboratory
Ispd-2007 Repeater Insertion for Concurrent Setup and Hold Time Violations with Power-Delay Trade-Off Salim Chowdhury John Lillis Sun Microsystems University.
NTHU-CS VLSI/CAD LAB TH EDA De-Shiuan Chiou Da-Cheng Juan Yu-Ting Chen Shih-Chieh Chang Department of CS, National Tsing Hua University, Taiwan Fine-Grained.
Timing Margin Recovery With Flexible Flip-Flop Timing Model
Minimum Implant Area-Aware Gate Sizing and Placement
UC San Diego / VLSI CAD Laboratory NOLO: A No-Loop, Predictive Useful Skew Methodology for Improved Timing in IC Implementation Tuck-Boon Chan, Andrew.
Power-Aware Placement
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Toward PDN Resource Estimation: A Law of General Power Density Kwangok Jeong and Andrew B. Kahng
Architectural-Level Prediction of Interconnect Wirelength and Fanout Kwangok Jeong, Andrew B. Kahng and Kambiz Samadi UCSD VLSI CAD Laboratory
Supply Voltage Degradation Aware Analytical Placement Andrew B. Kahng, Bao Liu and Qinke Wang UCSD CSE Department {abk, bliu,
04/09/02EECS 3121 Lecture 25: Interconnect Modeling EECS 312 Reading: 8.3 (text), 4.3.2, (2 nd edition)
On Modeling and Sensitivity of Via Count in SOC Physical Implementation Kwangok Jeong Andrew B. Kahng.
NTHU-CS VLSI/CAD LAB TH EDA Student : Da-Cheng Juan Advisor : Shih-Chieh Chang Fine-Grained Sleep Transistor Sizing Algorithm for Leakage Power Minimization.
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
1 UCSD VLSI CAD Laboratory ISQED-2009 Revisiting the Linear Programming Framework for Leakage Power vs. Performance Optimization Kwangok Jeong, Andrew.
Analytical Thermal Placement for VLSI Lifetime Improvement and Minimum Performance Variation Andrew B. Kahng †, Sung-Mo Kang ‡, Wei Li ‡, Bao Liu † † UC.
Temperature-Aware Design Presented by Mehul Shah 4/29/04.
Methodology from Chaos in IC Implementation Kwangok Jeong * and Andrew B. Kahng *,** * ECE Dept., UC San Diego ** CSE Dept., UC San Diego.
UC San Diego Computer Engineering. VLSI CAD Laboratory.. UC San Diego Computer EngineeringVLSI CAD Laboratory.. UC San Diego Computer EngineeringVLSI CAD.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Timing Analysis and Optimization Implications of Bimodal CD Distribution in Double Patterning Lithography Kwangok Jeong and Andrew B. Kahng VLSI CAD LABORATORY.
University of Michigan Electrical Engineering and Computer Science 1 Online Timing Analysis for Wearout Detection Jason Blome, Shuguang Feng, Shantanu.
1 A Novel Metric for Interconnect Architecture Performance Parthasarathi Dasgupta, Andrew B. Kahng, Swamy V. Muddu Dept. of CSE and ECE University of California,
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Signal Integrity Methodology on 300 MHz SoC using ALF libraries and tools Wolfgang Roethig, Ramakrishna Nibhanupudi, Arun Balakrishnan, Gopal Dandu Steven.
Enhanced Metamodeling Techniques for High-Dimensional IC Design Estimation Problems Andrew B. Kahng, Bill Lin and Siddhartha Nath VLSI CAD LABORATORY,
UC San Diego / VLSI CAD Laboratory Reliability-Constrained Die Stacking Order in 3DICs Under Manufacturing Variability Tuck-Boon Chan, Andrew B. Kahng,
Electromigration Analysis for MTTF Calculations
-1- UC San Diego / VLSI CAD Laboratory Methodology for Electromigration Signoff in the Presence of Adaptive Voltage Scaling Wei-Ting Jonas Chan, Andrew.
Andrew B. Kahng‡†, Mulong Luo†, Siddhartha Nath†
03/30/031 ECE 551: Digital System Design & Synthesis Lecture Set 9 9.1: Constraints and Timing 9.2: Optimization (In separate file)
-1- UC San Diego / VLSI CAD Laboratory A Global-Local Optimization Framework for Simultaneous Multi-Mode Multi-Corner Clock Skew Variation Reduction Kwangsoo.
A New Methodology for Reduced Cost of Resilience Andrew B. Kahng, Seokhyeong Kang and Jiajia Li UC San Diego VLSI CAD Laboratory.
Power Reduction for FPGA using Multiple Vdd/Vth
Are classical design flows suitable below 0.18  ? ISPD 2001 NEC Electronics Inc. WR0999.ppt-1 Wolfgang Roethig Senior Engineering Manager EDA R&D Group.
UC San Diego / VLSI CAD Laboratory Toward Quantifying the IC Design Value of Interconnect Technology Improvement Tuck-Boon Chan, Andrew B. Kahng, Jiajia.
Horizontal Benchmark Extension for Improved Assessment of Physical CAD Research Andrew B. Kahng, Hyein Lee and Jiajia Li UC San Diego VLSI CAD Laboratory.
UC San Diego / VLSI CAD Laboratory Incremental Multiple-Scan Chain Ordering for ECO Flip-Flop Insertion Andrew B. Kahng, Ilgweon Kang and Siddhartha Nath.
-1- UC San Diego / VLSI CAD Laboratory Construction of Realistic Gate Sizing Benchmarks With Known Optimal Solutions Andrew B. Kahng, Seokhyeong Kang VLSI.
Kwangsoo Han, Andrew B. Kahng, Hyein Lee and Lutong Wang
ECO Timing Optimization Using Spare Cells Yen-Pin Chen, Jia-Wei Fang, and Yao-Wen Chang ICCAD2007, Pages ICCAD2007, Pages
-1- UC San Diego / VLSI CAD Laboratory High-Dimensional Metamodeling for Prediction of Clock Tree Synthesis Outcomes Andrew B. Kahng, Bill Lin and Siddhartha.
Outline Introduction: BTI Aging and AVS Signoff Problem
-1- Statistical Analysis and Modeling for Error Composition in Approximate Computation Circuits Wei-Ting Jonas Chan 1, Andrew B. Kahng 1, Seokhyeong.
Physical Synthesis Buffer Insertion, Gate Sizing, Wire Sizing,
Explicit Modeling of Control and Data for Improved NoC Router Estimation Andrew B. Kahng +*, Bill Lin * and Siddhartha Nath + UCSD CSE + and ECE * Departments.
UC San Diego / VLSI CAD Laboratory Learning-Based Approximation of Interconnect Delay and Slew Modeling in Signoff Timing Tools Andrew B. Kahng, Seokhyeong.
University of Rostock Institute of Applied Microelectronics and Computer Engineering Monitoring and Control of Temperature in Networks-on- Chip Tim Wegner,
Mixed Cell-Height Implementation for Improved Design Quality in Advanced Nodes Sorin Dobre +, Andrew B. Kahng * and Jiajia Li * * UC San Diego VLSI CAD.
-1- UC San Diego / VLSI CAD Laboratory Optimization of Overdrive Signoff Tuck-Boon Chan, Andrew B. Kahng, Jiajia Li and Siddhartha Nath Tuck-Boon Chan,
-1- Delay Uncertainty and Signal Criticality Driven Routing Channel Optimization for Advanced DRAM Products Samyoung Bang #, Kwangsoo Han ‡, Andrew B.
-1- UC San Diego / VLSI CAD Laboratory Optimal Reliability-Constrained Overdrive Frequency Selection in Multicore Systems Andrew B. Kahng and Siddhartha.
1 Hardware Reliability Margining for the Dark Silicon Era Liangzhen Lai and Puneet Gupta Department of Electrical Engineering University of California,
14 February, 2004SLIP, 2004 Self-Consistent Power/Performance/Reliability Analysis for Copper Interconnects Bipin Rajendran, Pawan Kapur, Krishna C. Saraswat.
Improved Flop Tray-Based Design Implementation for Power Reduction
Kun Young Chung*, Andrew B. Kahng+ and Jiajia Li+
The Interconnect Delay Bottleneck.
Kristof Blutman† , Hamed Fatemi† , Andrew B
Revisiting and Bounding the Benefit From 3D Integration
Mission Profile Aware IC Design - A Case Study -
Timing Analysis 11/21/2018.
An Automated Design Flow for 3D Microarchitecture Evaluation
Puneet Gupta1 , Andrew B. Kahng1 , Youngmin Kim2, Dennis Sylvester2
FPGA Glitch Power Analysis and Reduction
Measuring the Gap between FPGAs and ASICs
HotAging — Impact of Power Dissipation on Hardware Degradation
Presentation transcript:

-1- UC San Diego / VLSI CAD Laboratory On Potential Design Impacts of Electromigration Awareness Andrew B. Kahng, Siddhartha Nath and Tajana S. Rosing VLSI CAD LABORATORY, UC San Diego

-2- Outline Motivation Motivation Previous Work Previous Work Our Work Our Work Preliminaries Preliminaries Study 1: MTTF vs. F max Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Insights on Conventional EM Fixes Conclusions Conclusions

-3- Electromigration in Interconnects Electromigration (EM) is the gradual displacement of metal atoms in an interconnect Electromigration (EM) is the gradual displacement of metal atoms in an interconnect I avg causes DC EM and affects power delivery networks I avg causes DC EM and affects power delivery networks I rms causes AC EM and affects clock and logic signals I rms causes AC EM and affects clock and logic signals

-4- EM Lifetime t 50 – median time to failure (= log e 2 x MTTF) t 50 – median time to failure (= log e 2 x MTTF) A* – geometry-dependent constant A* – geometry-dependent constant J – current density in interconnect segment J – current density in interconnect segment n – constant ( = 2) n – constant ( = 2) E a – activation energy of metal atoms E a – activation energy of metal atoms k – Boltzmann’s constant k – Boltzmann’s constant T – temperature of the interconnect T – temperature of the interconnect EM degrades interconnect lifetime EM degrades interconnect lifetime Black’s Equation calculates lifetime of interconnect segment due to EM degradation Black’s Equation calculates lifetime of interconnect segment due to EM degradation

-5- Parameters Affecting EM MTTF AB Inverse relation; if A increases then B decreases A B Direct relation; if A increases then B increases Design parameters α J rms Temp W wire MTTF Driver size V dd Fanout Freq Runtime parameters

-6- Why Is EM Important Now? ITRS 2011 data shows that EM will be a significant reliability issue ITRS 2011 data shows that EM will be a significant reliability issue Physical design teams trade off performance and/or resources to meet EM MTTF Physical design teams trade off performance and/or resources to meet EM MTTF What values of MTTF do we really need? What values of MTTF do we really need? – In the US, people replace Cell phones every 2 years Cell phones every 2 years Laptops every 3 – 5 years Laptops every 3 – 5 years Servers every 3 – 7 years Servers every 3 – 7 years Devices can be designed with small EM lifetimes J rms MTTF

-7- Examples of EM Guardband To meet EM MTTF margin at given wire width upper bound To meet EM MTTF margin at given wire width upper bound –Reduce J rms  reduce driver size  slower circuit To meet EM MTTF margin at given performance requirement To meet EM MTTF margin at given performance requirement –Increase W wire  increase capacitance, dynamic power α J rms Temp W wire MTTF Driver size V dd Fanout Freq

-8- Outline Motivation Motivation Previous Work Previous Work Our Work Our Work Preliminaries Preliminaries Study 1: MTTF vs. F max Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Insights on Conventional EM Fixes Conclusions Conclusions

-9- To Meet EM Lifetime Requirements Three major categories of prior work Three major categories of prior work EM MTTF modeling EM MTTF modeling –Black69 (Black’s Equation) –Liew89 (AC lifetime models) –Lu07 and Wu12 (Joule heating) Architecture changes to mitigate EM Architecture changes to mitigate EM – Srinivasan04 (RAMP) – Romanescu08 (core cannibalization) Synthesis and physical design (PD) techniques to reduce current density violations Synthesis and physical design (PD) techniques to reduce current density violations –Dasgupta96 (limit J rms violation at synthesis) –Jerke04 (limit J rms violation at PD) –Lienig03 (post-route J rms fixes)

-10- Outline Motivation Motivation Previous Work Previous Work Our Work Our Work Preliminaries Preliminaries Study 1: MTTF vs. F max Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Insights on Conventional EM Fixes Conclusions Conclusions

-11- Key Idea We quantify impact of EM guardband on performance (F max ), area and power We quantify impact of EM guardband on performance (F max ), area and power [Black’s Equation] F max Area / Power (I rms,limit ) 2 = (I rms,default ) 2 x MTTF default /MTTF reduced  Decrease MTTF reduced  increase I rms,limit  We study impacts on F max, area and power

-12- Approach We conduct two studies We conduct two studies 1. MTTF vs. F max tradeoffs with fixed resource budget 2. MTTF vs. resources tradeoffs with fixed performance requirement Assumptions Assumptions –10 years = example default EM MTTF –Six testcases Report three representative (AES, DMA, JPEG) Report three representative (AES, DMA, JPEG)

-13- Key Contributions We are the first to quantify impacts of EM guardband on performance and resources by using PD flows We are the first to quantify impacts of EM guardband on performance and resources by using PD flows We introduce EM slack as an accurate measure of potential performance improvements in different circuits at reduced MTTF requirements We introduce EM slack as an accurate measure of potential performance improvements in different circuits at reduced MTTF requirements –Black’s Equation cannot accurately quantify the impacts of EM-awareness in circuits We study how tightness vs. looseness of timing constraints determine area and power trends at reduced MTTF We study how tightness vs. looseness of timing constraints determine area and power trends at reduced MTTF Our study flow/methodology can potentially be used by Our study flow/methodology can potentially be used by –architects and front-end designers to improve performance at no area cost –physical designers whose levers are conventional SI and EM fixing methods

-14- Outline Motivation Motivation Previous Work Previous Work Our Work Our Work Preliminaries Preliminaries Study 1: MTTF vs. F max Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Insights on Conventional EM Fixes Conclusions Conclusions

-15- EM Slack When EM violations occur Black’s Equation Theoretical limit of I rms,net Basic Concept: EM slack of a net (units: mA)

-16- Significance of EM Slack Positive EM slack  potential for improved F max Positive EM slack  potential for improved F max If EM slack > 0, a part of it can be used to If EM slack > 0, a part of it can be used to increase I rms,limit by reducing MTTF (from Black’s Equation), and increase I rms,limit by reducing MTTF (from Black’s Equation), and improve F max by using SP&R knobs (e.g., gate sizing) without causing EM violations improve F max by using SP&R knobs (e.g., gate sizing) without causing EM violations

-17- Outline Motivation Motivation Previous Work Previous Work Our Work Our Work Preliminaries Preliminaries Study 1: MTTF vs. F max Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Insights on Conventional EM Fixes Conclusions Conclusions

-18- Study 1: MTTF vs. F max Study MTTF vs. F max tradeoffs given upper bounds on area, temperature and #EM violations Study MTTF vs. F max tradeoffs given upper bounds on area, temperature and #EM violations Setup Setup –Three testcases: AES, DMA and JPEG –Two technology libraries: TSMC 45GS and 65GPLUS –Upper bounds temperature = 378 K temperature = 378 K area = 66% utilization area = 66% utilization #EM violations = 25 #EM violations = 25 –Synopsys DesignCompiler and Cadence SOC Encounter flows –Thermal analysis using Hotspot

-19- Automated Flow to Determine F max Synthesis (DC) RTLLIBSDC NETLIST Place and Route (SOCE) LEF MTTF reduced Tech. LEF α PI UTIL tar AR Check constraints Met all constraints? Decrease frequency Y N Increase frequency Frequency tried before? N Y EXIT -- timing slack -- peak temperature -- UTIL eff -- #EM violations

-20- Automated Flow to Determine F max Synthesis (DC) RTLLIBSDC NETLIST Place and Route (SOCE) LEF MTTF reduced Tech. LEF α PI UTIL tar AR Check constraints Met all constraints? Decrease frequency Y N Increase frequency Frequency tried before? N Y EXIT -- timing slack -- peak temperature -- UTIL eff -- #EM violations

-21- Derating LEF We derate current density limits in technology Library Exchange Format (LEF) file Reduced lifetime (ratio > 1) increases J rms limit Reduced toggle rate (ratio > 1) increases J rms limit Increased maximum temperature increases J rms limit

-22- Automated Flow to Determine F max Synthesis (DC) RTLLIBSDC NETLIST Place and Route (SOCE) LEF MTTF reduced Tech. LEF α PI UTIL tar AR Check constraints Met all constraints? Decrease frequency Y N Increase frequency Frequency tried before? N Y EXIT -- timing slack -- peak temperature -- UTIL eff -- #EM violations

-23- Binary Search for F max Increase frequency by Increase frequency by  step until some constraint is violated Perform binary search between the current F and the last feasible F to find F max

-24- Automated Flow to Determine F max Synthesis (DC) RTLLIBSDC NETLIST Place and Route (SOCE) LEF MTTF reduced Tech. LEF α PI UTIL tar AR Check constraints Met all constraints? Decrease frequency Y N Increase frequency Frequency tried before? N Y EXIT -- timing slack -- peak temperature -- UTIL eff -- #EM violations

-25- Flow to Fix EM Violations Group nets depending on the extent of I rms,limit violations Create nondefault rules (NDR) for each net group All EM violations fixed? Downsize drivers Timing met? N Y return Y Perform ECO route Decrease fanout Perform timing analysis Y Netlist from F max determination flow N return N

-26- Automated Flow to Determine F max Synthesis (DC) RTLLIBSDC NETLIST Place and Route (SOCE) LEF MTTF reduced Tech. LEF α PI UTIL tar AR Check constraints Met all constraints? Decrease frequency Y N Increase frequency Frequency tried before? N Y EXIT -- timing slack -- peak temperature -- UTIL eff -- #EM violations

-27- Observation 1  F max scaling is not uniform across designs and at reduced MTTF as suggested by Black’s Equation  F max scaling is determined by the EM slack in each design at each MTTF requirement  Large F max improvements may be setup artifacts 45nm

-28- Observation 2  EM slack determines F max at fixed resources  % of positive EM slack is usable to improve F max by reducing MTTF requirement  EM violations in critical paths lead to positive EM slack EM slack (not timing slack) limits performance scaling due to AC EM

-29-  Area limits F max scaling for MTTF  7 years (DMA)  Area upper bounds are violated for MTTF  6 years; Temperature upper bounds are violated for MTTF  3 years EM AREATEMP Area and temperature can be dominating constraints at lower MTTF requirements Observation 3

-30- Outline Motivation Motivation Previous Work Previous Work Our Work Our Work Preliminaries Preliminaries Study 1: MTTF vs. F max Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Insights on Conventional EM Fixes Conclusions Conclusions

-31- Study 2: MTTF vs. Area, Power Study MTTF vs. area and power tradeoffs at a fixed performance requirement Setup – –DMA at 2000 MHz (2ps slack after SP&R at 45nm) – –AES at 1100 MHz (1.6ps slack after SP&R at 45nm) – –JPEG at 850 MHz (93ps slack after SP&R at 45nm) – –Two technology libraries: TSMC 45GS and 65GPLUS

-32- Observation 4  Large positive timing slack at MTTF = 10 years can lead to smaller area when MTTF requirement is reduced  Large positive timing slack at MTTF = 10 years can lead to smaller power when MTTF requirement is reduced

-33- Observation 5  Small positive timing slack at MTTF = 10 years can lead to increase in area as MTTF requirement is reduced  Small positive timing slack at MTTF = 10 years can lead to increase in power as MTTF requirement is reduced Area and power can decrease as MTTF requirement is reduced for designs with loose timing constraints

-34- Outline Motivation Motivation Previous Work Previous Work Our Work Our Work Preliminaries Preliminaries Study 1: MTTF vs. F max Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Insights on Conventional EM Fixes Conclusions Conclusions

-35- Conventional EM Fixes and MTTF Study how conventional SI and EM fixing methods affect area and performance at reduced MTTF requirements. Study how conventional SI and EM fixing methods affect area and performance at reduced MTTF requirements. Setup Setup –Sweep MTTF from 10 years down to 1 year –Apply per-net NDRs, driver downsizing and fanout reduction fixes –Study using AES, JPEG and DMA testcases –Two technology libraries: TSMC 45GS and 65GPLUS – –Insights are very instance-, technology/library- and flow-specific

-36- Observation 6  Fixing EM violations using NDRs can be effective in improving F max only till MTTF = 7 years  % increase in F max is less than 5%  % increase in area is ~2%

-37- Observation 7  Fanout reductions to fix EM can increase F max by 3% at the cost of 1.86% increase in area  Drive downsizing to fix EM can increase F max by 2.5% at the cost of 2% increase in area NDRs can be more effective knobs to increase F max with less increase in area

-38- Outline Motivation Motivation Previous Work Previous Work Our Work Our Work Preliminaries Preliminaries Study 1: MTTF vs. F max Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Insights on Conventional EM Fixes Conclusions Conclusions

-39- Conclusions We study and quantify potential impacts of improved EM-awareness in designs through two basic studies Our key observations – –Study 1: Available performance scaling (up to 80%) from MTTF reduction is dependent on EM slack – –Study 2: Area and power can decrease when MTTF is reduced in designs with loose timing constraints – –Additional studies: NDRs can be more effective in increasing performance ~5% at the cost of 2% increase in area for MTTF up to 7 years Ongoing work – –EM reliability requirements in multiple operating modes – –Combined impacts of EM and other back end of the line reliability mechanisms on interconnect lifetime

-40- Acknowledgments Work supported by IMPACT, SRC, NSF, Qualcomm Inc. and NXP Semiconductors

-41- Thank You!

-42- Backup

-43- Hotspot Setup We use Hotspot5.0 calibrated with thermal package from Qualcomm Inc. We perform two kinds of modeling – –Without heat spread and heat sink when profiling single block of AES, JPEG or DMA (area in µm 2 ) – –With heat spreader and heat sink when profiling 50x50 blocks of AES, JPEG, or DMA in an area of ~5mm 2 We get same values of temperature for a single block from both these methods