A New Methodology for Reduced Cost of Resilience Andrew B. Kahng, Seokhyeong Kang and Jiajia Li UC San Diego VLSI CAD Laboratory.

Slides:



Advertisements
Similar presentations
(1/25) UCSD VLSI CAD Laboratory - ISQED10, March. 23, 2010 Toward Effective Utilization of Timing Exceptions in Design Optimization Kwangok Jeong, Andrew.
Advertisements

Tunable Sensors for Process-Aware Voltage Scaling
OCV-Aware Top-Level Clock Tree Optimization
-1- VLSI CAD Laboratory, UC San Diego Post-Routing BEOL Layout Optimization for Improved Time- Dependent Dielectric Breakdown (TDDB) Reliability Tuck-Boon.
Designing a Processor from the Ground Up to Allow Voltage/Reliability Tradeoffs Andrew Kahng (UCSD) Seokhyeong Kang (UCSD) Rakesh Kumar (Illinois) John.
Timing Margin Recovery With Flexible Flip-Flop Timing Model
Xing Wei, Wai-Chung Tang, Yu-Liang Wu Department of Computer Science and Engineering The Chinese University of HongKong
Minimum Implant Area-Aware Gate Sizing and Placement
Sequential Timing Optimization. Long path timing constraints Data must not reach destination FF too late s i + d(i,j) + T setup  s j + P s i s j d(i,j)
CSE241 Formal Verification.1Cichy, UCSD ©2003 CSE241A VLSI Digital Circuits Winter 2003 Recitation 6: Formal Verification.
Post-Placement Voltage Island Generation for Timing-Speculative Circuits Rong Ye†, Feng Yuan†, Zelong Sun†, Wen-Ben Jone§ and Qiang Xu†‡
UC San Diego / VLSI CAD Laboratory NOLO: A No-Loop, Predictive Useful Skew Methodology for Improved Timing in IC Implementation Tuck-Boon Chan, Andrew.
The Cost of Fixing Hold Time Violations in Sub-threshold Circuits Yanqing Zhang, Benton Calhoun University of Virginia Motivation and Background Power.
Yuanlin Lu Intel Corporation, Folsom, CA Vishwani D. Agrawal
UCSD VLSI CAD Laboratory and UIUC PASSAT Group - ASPDAC, Jan. 21, 2010 Slack Redistribution for Graceful Degradation Under Voltage Overscaling Andrew B.
-1- UCSD VLSI CAD Laboratory and UIUC PASSAT Group Recovery-Driven Design: A Power Minimization Methodology for Error-Tolerant Processor Modules Andrew.
Background: Scan-Based Delay Fault Testing Sequentially apply initialization, launch test vector pairs that differ by 1-bit shift A vector pair induces.
Subthreshold Logic Energy Minimization with Application- Driven Performance EE241 Final Project Will Biederman Dan Yeager.
Power-Aware Placement
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Architectural-Level Prediction of Interconnect Wirelength and Fanout Kwangok Jeong, Andrew B. Kahng and Kambiz Samadi UCSD VLSI CAD Laboratory
Supply Voltage Degradation Aware Analytical Placement Andrew B. Kahng, Bao Liu and Qinke Wang UCSD CSE Department {abk, bliu,
Fast and Area-Efficient Phase Conflict Detection and Correction in Standard-Cell Layouts Charles Chiang, Synopsys Andrew B. Kahng, UC San Diego Subarna.
Chung-Kuan Cheng†, Andrew B. Kahng†‡,
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
1 UCSD VLSI CAD Laboratory ISQED-2009 Revisiting the Linear Programming Framework for Leakage Power vs. Performance Optimization Kwangok Jeong, Andrew.
Toward Performance-Driven Reduction of the Cost of RET-Based Lithography Control Dennis Sylvester Jie Yang (Univ. of Michigan,
Jieyi Long and Seda Ogrenci Memik Dept. of EECS, Northwestern Univ. Jieyi Long and Seda Ogrenci Memik Dept. of EECS, Northwestern Univ. Automated Design.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Timing Analysis and Optimization Implications of Bimodal CD Distribution in Double Patterning Lithography Kwangok Jeong and Andrew B. Kahng VLSI CAD LABORATORY.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Enhanced Metamodeling Techniques for High-Dimensional IC Design Estimation Problems Andrew B. Kahng, Bill Lin and Siddhartha Nath VLSI CAD LABORATORY,
UC San Diego / VLSI CAD Laboratory Reliability-Constrained Die Stacking Order in 3DICs Under Manufacturing Variability Tuck-Boon Chan, Andrew B. Kahng,
-1- UC San Diego / VLSI CAD Laboratory Methodology for Electromigration Signoff in the Presence of Adaptive Voltage Scaling Wei-Ting Jonas Chan, Andrew.
Andrew B. Kahng‡†, Mulong Luo†, Siddhartha Nath†
Dose Map and Placement Co-Optimization for Timing Yield Enhancement and Leakage Power Reduction Kwangok Jeong, Andrew B. Kahng, Chul-Hong Park, Hailong.
DELAY INSERTION METHOD IN CLOCK SKEW SCHEDULING BARIS TASKIN and IVAN S. KOURTEV ISPD 2005 High Performance Integrated Circuit Design Lab. Department of.
Accuracy-Configurable Adder for Approximate Arithmetic Designs
-1- UC San Diego / VLSI CAD Laboratory A Global-Local Optimization Framework for Simultaneous Multi-Mode Multi-Corner Clock Skew Variation Reduction Kwangsoo.
Power Reduction for FPGA using Multiple Vdd/Vth
UC San Diego / VLSI CAD Laboratory Toward Quantifying the IC Design Value of Interconnect Technology Improvement Tuck-Boon Chan, Andrew B. Kahng, Jiajia.
Horizontal Benchmark Extension for Improved Assessment of Physical CAD Research Andrew B. Kahng, Hyein Lee and Jiajia Li UC San Diego VLSI CAD Laboratory.
UC San Diego / VLSI CAD Laboratory Incremental Multiple-Scan Chain Ordering for ECO Flip-Flop Insertion Andrew B. Kahng, Ilgweon Kang and Siddhartha Nath.
-1- UC San Diego / VLSI CAD Laboratory Construction of Realistic Gate Sizing Benchmarks With Known Optimal Solutions Andrew B. Kahng, Seokhyeong Kang VLSI.
Kwangsoo Han, Andrew B. Kahng, Hyein Lee and Lutong Wang
Kwangsoo Han‡, Andrew B. Kahng‡† and Hyein Lee‡
Toward Holistic Modeling, Margining and Tolerance of IC Variability
MicroFix: Exploiting Path-grained Timing Adaptability for Improving Power-Performance Efficiency Guihai Yan, Yinhe Han, Hui Liu, Xiaoyao Liang, Xiaowei.
Outline Introduction: BTI Aging and AVS Signoff Problem
Improving Voltage Assignment by Outlier Detection and Incremental Placement Huaizhi Wu* and Martin D.F. Wong** * Atoptech, Inc. ** University of Illinois.
Low-Power BIST (Built-In Self Test) Overview 10/31/2014
UC San Diego / VLSI CAD Laboratory Learning-Based Approximation of Interconnect Delay and Slew Modeling in Signoff Timing Tools Andrew B. Kahng, Seokhyeong.
Mixed Cell-Height Implementation for Improved Design Quality in Advanced Nodes Sorin Dobre +, Andrew B. Kahng * and Jiajia Li * * UC San Diego VLSI CAD.
0 Optimizing Stochastic Circuits for Accuracy-Energy Tradeoffs Armin Alaghi 3, Wei-Ting J. Chan 1, John P. Hayes 3, Andrew B. Kahng 1,2 and Jiajia Li 1.
-1- UC San Diego / VLSI CAD Laboratory Optimization of Overdrive Signoff Tuck-Boon Chan, Andrew B. Kahng, Jiajia Li and Siddhartha Nath Tuck-Boon Chan,
-1- UC San Diego / VLSI CAD Laboratory Optimal Reliability-Constrained Overdrive Frequency Selection in Multicore Systems Andrew B. Kahng and Siddhartha.
Proximity Optimization for Adaptive Circuit Design Ang Lu, Hao He, and Jiang Hu.
CUHK Test and Fault-Tolerance for Timing Error Presenter: Feng Yuan.
Yuxi Liu The Chinese University of Hong Kong Circuit Timing Problem Driven Optimization.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
Improved Flop Tray-Based Design Implementation for Power Reduction
Kun Young Chung*, Andrew B. Kahng+ and Jiajia Li+
Architecture and Synthesis for Multi-Cycle Communication
Kristof Blutman† , Hamed Fatemi† , Andrew B
SIMD Lane Decoupling Improved Timing-Error Resilience
Improved Performance of 3DIC Implementations Through Inherent Awareness of Mix-and-Match Die Stacking Kwangsoo Han, Andrew B. Kahng and Jiajia Li University.
Revisiting and Bounding the Benefit From 3D Integration
Yu-Guang Chen1,2, Wan-Yu Wen1, Tao Wang2,
Circuit Design Techniques for Low Power DSPs
Measuring the Gap between FPGAs and ASICs
Presentation transcript:

A New Methodology for Reduced Cost of Resilience Andrew B. Kahng, Seokhyeong Kang and Jiajia Li UC San Diego VLSI CAD Laboratory

UCSD VLSI CAD Laboratory2 Outline Background and Motivation Problem Statement Related Work Our Methodology Experimental Setup and Results Conclusion

UCSD VLSI CAD Laboratory3 Outline Background and Motivation Problem Statement Related Work Our Methodology Experimental Setup and Results Conclusion

UCSD VLSI CAD Laboratory4 Background: Resilient Designs Detect and recover from timing errors  Ensure correct operation with dynamic variations (e.g., IR drop, temperature fluctuation, cross-coupling, etc.) Trade off design robustness vs. design quality  E.g., enable margin reduction Improve performance (i.e., timing speculation) Conventional design:  Worst-case signoff  No Vdd downscaling Resilient design:  Typical-case signoff  Vdd downscaling  reduced energy 15% reduction

UCSD VLSI CAD Laboratory5 Motivation Cost of resilience is high Additional circuits  area / power penalty Recovery from errors  throughput degradation Large hold margin  short-path padding cost Goal: benefits overweigh costs RazorRazor-Lite TIMBER RazorRazor-LiteTIMBER Power penalty30% [Das08]~0% [Kim13]100% [Choudhury09] Area penalty182% [Kim13]33% [Kim13]255% [Chen13] #recovery cycles5 [Wan09]11 [Kim13]0 [Choudhury09]

UCSD VLSI CAD Laboratory6 Outline Background and Motivation Problem Statement Related Work Our Methodology Experimental Setup and Results Conclusion

UCSD VLSI CAD Laboratory7 Resilience Cost Reduction Problem Given: RTL design, throughput requirement and error-tolerant registers Objective: implement design to minimize energy Estimation of design energy: #recovery cycles Clock period Error rate [Kahng10]

UCSD VLSI CAD Laboratory8 Outline Background and Motivation Problem Statement Related Work Our Methodology Experimental Setup and Results Conclusion

UCSD VLSI CAD Laboratory9 Related Works [Choudhury09] masks timing errors only on timing- critical paths to reduce resilience cost [Yuan13] uses a fine-grained redundant approximate circuits insertion for error masking [Kahng10] optimizes designs for a target error rate and reduces design energy by lowering supply voltage [Wan09] optimizes the most frequently-exercised gates for error-rate and energy reduction Exploration of tradeoffs between cost of resilience vs. cost of datapath optimization has been ignored

UCSD VLSI CAD Laboratory10 Focus of This Work #Razor FFs (resilience cost) Power/area of fanin circuits Tradeoff Our work minimizes total energy using the tradeoffs There is tradeoff between resilience cost vs. cost of datapath optimization …

UCSD VLSI CAD Laboratory11 Outline Background and Motivation Problem Statement Related Work Our Methodology Experimental Setup and Results Conclusion

UCSD VLSI CAD Laboratory12 Overview of Our Methodology Our flow: pure-resilience  datapath optimizations Low-cost margin insertion (selective-endpoint optimization) Selectively increase margin at endpoint with timing violation Slack redistribution (clock skew optimization) Migrate timing slacks to endpoint with timing violation  Replace error-tolerant FFs to normal FFs  Reduced resilience cost

UCSD VLSI CAD Laboratory13 Overall Optimization Flow Iteratively optimize with SEOpt and SkewOpt Initial placement (all FFs = error-tolerant FFs) Energy < min energy? Save current solution Margin insertion on K paths based on sensitivity function Replace error-tolerant FFs w/ normal FFs SEOpt Activity aware clock skew optimization SkewOpt

UCSD VLSI CAD Laboratory14 Selective-Endpoint Optimization Optimize fanin cone w/ tighter constraints  Allows replacement of Razor FF w/ normal FF Trade off cost of resilience vs. data path optimization Question 1: Which endpoint to be optimized? Question 2: How many endpoints to be optimized?

UCSD VLSI CAD Laboratory15 Sensitivity Function Which endpoint to be optimized?  Pick endpoints based on sensitivity functions Vary #endpoints  compare area/power penalty Candidate Sensitivity Functions p negative slack endpoint c cells within fanin cone Num cri number of negative slack cells

UCSD VLSI CAD Laboratory16 Iterative Optimization Question 2: How many endpoints to be optimized?  Vary #optimized endpoints  pick minimum-energy solution Optimization Procedure 1.Pick top-K endpoints with minimum sensitivity 2.Timing optimization on fanin cone of p if ( slack at p is positive) replace with normal FFs 3.Error rate estimation 4.Check design energy if ( energy is reduced ) store current solution 5.Update sensitivity functions; Goto 1

UCSD VLSI CAD Laboratory17 Clock Skew Optimization Increase slacks on timing-critical and/or frequently- exercised paths 1.Generate sequential graph 2.Find cycle of paths with minimum total weight  adjust clock latencies  contract the cycle into one vertex 3.Iterate Step 2 until all endpoints are optimized FF1 FF2 FF3 W 12 W 23 Clock Data path Clock tree W 31 Setup slack of path p-q Weighting factor Toggle rate of path p-q W’ W’ = average weight on cycle

UCSD VLSI CAD Laboratory18 Outline Background and Motivation Problem Statement Related Work Our Methodology Experimental Setup and Results Conclusion

UCSD VLSI CAD Laboratory19 Experimental Setup Design OpenSparc T1 Technology 28nm FDSOI, dual-VT {RVT, LVT} Tools Synthesis: Synopsys Design Compiler vH SP3 P&R: Cadence EDI System 13.1 Gate-level simulation: Cadence NC-Verilog v8.2 Liberty characterization: Synopsys SiliconSmart v SP1 Questions How do the benefits/costs of resilience vary with safety margin? How do the benefits/costs of resilience change in AVS context? ModuleDescription# of cells EXUInteger execution18K MULInteger multiplier13K

UCSD VLSI CAD Laboratory20 Methodology Comparison Reference flows Pure-margin (PM): conventional method w/ only margin insertion Brute-force (BF): use error-tolerant FFs for timing-critical endpoints Proposed method (CO) achieves up to 20% energy reduction compared to reference methods Resilience benefits increase with safety margin Large margin Medium margin Small margin MUL EXU Large margin Medium margin Small margin Small/medium/large margin  safety margin = 5%/10%/15% of clock period

UCSD VLSI CAD Laboratory21 Energy Reduction from AVS Adaptive voltage scaling allows a lower supply voltage for resilient designs, thus reduced power Proposed method trades off between timing-error penalty vs. reduced power at a lower supply voltage Proposed method achieves an average of 18% energy reduction compared to pure-margin designs  Resilience benefits increase in the context of AVS strategy MULEXU Minimum achievable energy

UCSD VLSI CAD Laboratory22 Outline Background and Motivation Problem Statement Related Work Our Methodology Experimental Setup and Results Conclusion

UCSD VLSI CAD Laboratory23 Conclusion New design flow for mixing of resilient and non- resilient circuits Combined selective-endpoint and clock skew optimizations reduce costs of resilience Up to 20% energy reduction compared to reference methods Future work Unified framework for data- and clock-path optimization Study impact of process variation on resilient design methodologies

UCSD VLSI CAD Laboratory24 THANK YOU!