A F AST AND A CCURATE M ULTI -C YCLE S OFT E RROR R ATE E STIMATION A PPROACH TO R ESILIENT E MBEDDED S YSTEMS D ESIGN Department of Computer Engineering.

Slides:



Advertisements
Similar presentations
CS370 – Spring 2003 Hazards/Glitches. Time Response in Combinational Networks Gate Delays and Timing Waveforms Hazards/Glitches and How To Avoid Them.
Advertisements

ECE C03 Lecture 71 Lecture 7 Delays and Timing in Multilevel Logic Synthesis Hai Zhou ECE 303 Advanced Digital Design Spring 2002.
1 Lecture 16 Timing  Terminology  Timing issues  Asynchronous inputs.
Single Event Upsets in Digital VLSI Circuits EYES Summer Internship Program 2007 University of New Mexico Vinay Jain Dr. Payman Zarkesh-Ha Final Year Undergraduate.
Single Event Upsets (SEUs) – Soft Errors By: Rajesh Garg Sunil P. Khatri Department of Electrical and Computer Engineering, Texas A&M University, College.
April 30, Cost efficient soft-error protection for ASICs Tuvia Liran; Ramon Chips Ltd.
Digital Logic Chapter 5 Presented by Prof Tim Johnson
June 6, Using Negative Edge Triggered FFs to Reduce Glitching Power in FPGA Circuits Tomasz S. Czajkowski and Stephen D. Brown Department of Electrical.
Fault Injection in Mixed-Signal Environment Using Behavioral Fault Modeling in Verilog-A Seyed‌ Nematollah Ahmadian, Seyed Ghassem Miremadi Behavioral.
Cross-layer Optimized Placement and Routing for FPGA Soft Error Mitigation Keheng Huang 1,2, Yu Hu 1, and Xiaowei Li 1 1 Key Laboratory of Computer System.
Sequential Logic 1 clock data in may changestable data out (Q) stable Registers  Sample data using clock  Hold data between clock cycles  Computation.
1 Closed-Loop Modeling of Power and Temperature Profiles of FPGAs Kanupriya Gulati Sunil P. Khatri Peng Li Department of ECE, Texas A&M University, College.
Aiman El-Maleh, Ali Alsuwaiyan King Fahd University of Petroleum & Minerals, Dept. of Computer Eng., Saudi Arabia Aiman El-Maleh, Ali Alsuwaiyan King Fahd.
May 14-16, 2008 NATW' Probabilistic Soft Error Rate Estimation from Statistical SEU Parameters Fan Wang* Vishwani D. Agrawal Department of Electrical.
An Efficient Test Relaxation Technique for Synchronous Sequential Circuits Aiman El-Maleh and Khaled Al-Utaibi King Fahd University of Petroleum & Minerals.
ECEN 248: INTRODUCTION TO DIGITAL SYSTEMS DESIGN Dr. Shi Dept. of Electrical and Computer Engineering.
March 16-18, 2008SSST'20081 Soft Error Rate Determination for Nanometer CMOS VLSI Circuits Fan Wang Vishwani D. Agrawal Department of Electrical and Computer.
ENGIN112 L28: Timing Analysis November 7, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 28 Timing Analysis.
A Delay-efficient Radiation-hard Digital Design Approach Using Code Word State Preserving (CWSP) Elements Charu Nagpal Rajesh Garg Sunil P. Khatri Department.
Give qualifications of instructors: DAP
1 A Fast, Analytical Estimator for the SEU-induced Pulse Width in Combinational Designs By: Rajesh Garg Charu Nagpal Sunil P. Khatri Department of Electrical.
Enhanced Dual-Transition Probabilistic Power Estimation with Selective Supergate Analysis Fei Huand Vishwani D. Agrawal Department of ECE, Auburn University,
ELEC 7250 Term Project Presentation Khushboo Sheth Department of Electrical and Computer Engineering Auburn University, Auburn, AL.
1 Oct 24-26, 2006 ITC'06 Fault Coverage Estimation for Non-Random Functional Input Sequences Soumitra Bose Intel Corporation, Design Technology, Folsom,
10/25/2007 ITC-07 Paper Delay Fault Simulation with Bounded Gate Delay Model Soumitra Bose Design Technology, Intel Corp. Folsom, CA Hillary.
1 Reconvergent Fanout Analysis of Bounded Gate Delay Faults Dept. of ECE, Auburn University Auburn, AL Master’s Defense Hillary Grimes Thesis Advisor:
March 12, 2008Fan's MS Defense1 Soft Error Rate Determination for Nanometer CMOS VLSI Circuits Master’s Defense Fan Wang Department of Electrical and Computer.
04/25/2006 ELEC 7250 Final Project: Jie Qin 1 Logic Simulator for Combinational Circuit Jie Qin Dept. of Electrical and Computer Engineering Auburn University,
A Probabilistic Method to Determine the Minimum Leakage Vector for Combinational Designs Kanupriya Gulati Nikhil Jayakumar Sunil P. Khatri Department of.
Barcelona, Spain November 13, 2005 WAR-1: Assessing SEU Vulnerability Via Circuit-Level Timing Analysis 1 Assessing SEU Vulnerability via Circuit-Level.
Chapter #6: Sequential Logic Design 6.2 Timing Methodologies
1 Enhancing Random Access Scan for Soft Error Tolerance Fan Wang* Vishwani D. Agrawal Department of Electrical and Computer Engineering, Auburn University,
1 Efficient Analytical Determination of the SEU- induced Pulse Shape Rajesh Garg Sunil P. Khatri Department of ECE Texas A&M University College Station,
Ronny Krashinsky Seongmoo Heo Michael Zhang Krste Asanovic MIT Laboratory for Computer Science SyCHOSys Synchronous.
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
SiLab presentation on Reliable Computing Combinational Logic Soft Error Analysis and Protection Ali Ahmadi May 2008.
Soft errors in adder circuits Rajaraman Ramanarayanan, Mary Jane Irwin, Vijaykrishnan Narayanan, Yuan Xie Penn State University Kerry Bernstein IBM.
Using co-design techniques to increase the reliability of the Electronic control System for a Multilevel Power Converter Javier C. Brook, Francisco J.
A Robust Pulse-triggered Flip-Flop and Enhanced Scan Cell Design
EEE2243 Digital System Design Chapter 7: Advanced Design Considerations by Muhazam Mustapha, extracted from Intel Training Slides, April 2012.
European Test Symposium, May 28, 2008 Nuno Alves, Jennifer Dworak, and R. Iris Bahar Division of Engineering Brown University Providence, RI Kundan.
1 Compacting Test Vector Sets via Strategic Use of Implications Kundan Nepal Electrical Engineering Bucknell University Lewisburg, PA Nuno Alves, Jennifer.
Skewed Flip-Flop Transformation for Minimizing Leakage in Sequential Circuits Jun Seomun, Jaehyun Kim, Youngsoo Shin Dept. of Electrical Engineering, KAIST,
Layali Rashid, Karthik Pattabiraman and Sathish Gopalakrishnan D EPARTMENT OF E LECTRICAL AND C OMPUTER E NGINEERING T HE U NIVERSITY OF B RITISH C OLUMBIA.
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs Ghazanfar (Hossein) Asadi and Mehdi B. Tahoori Why Soft Error Rate (SER) Estimation?
Detecting Errors Using Multi-Cycle Invariance Information Nuno Alves, Jennifer Dworak, and R. Iris Bahar Division of Engineering Brown University Providence,
11 Online Computing and Predicting Architectural Vulnerability Factor of Microprocessor Structures Songjun Pan Yu Hu Xiaowei Li {pansongjun, huyu,
Using Memory to Cope with Simultaneous Transient Faults Authors: Universidade Federal do Rio Grande do Sul Programa de Pós-Graduação em Engenharia Elétrica.
Lecture 11 Timing diagrams Hazards.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
Methodology to Compute Architectural Vulnerability Factors Chris Weaver 1, 2 Shubhendu S. Mukherjee 1 Joel Emer 1 Steven K. Reinhardt 1, 2 Todd Austin.
Copyright 2005, M. Tahoori1 Soft Error Modeling and Mitigation Mehdi B. Tahoori Northeastern University
Static Timing Analysis
Deterministic Diagnostic Pattern Generation (DDPG) for Compound Defects Fei Wang 1,2, Yu Hu 1, Huawei Li 1, Xiaowei Li 1, Jing Ye 1,2 1 Key Laboratory.
IPR: In-Place Reconfiguration for FPGA Fault Tolerance Zhe Feng 1, Yu Hu 1, Lei He 1 and Rupak Majumdar 2 1 Electrical Engineering Department 2 Computer.
W ORST -C ASE N OISE A REA P REDICTION OF O N -C HIP P OWER D ISTRIBUTION N ETWORK Xiang Zhang 1, Jingwei Lu 2, Yang Liu 3 and Chung-Kuan Cheng 1,2 1 ECE.
Gill 1 MAPLD 2005/234 Analysis and Reduction Soft Delay Errors in CMOS Circuits Balkaran Gill, Chris Papachristou, and Francis Wolff Department of Electrical.
A Novel, Highly SEU Tolerant Digital Circuit Design Approach By: Rajesh Garg Sunil P. Khatri Department of Electrical and Computer Engineering, Texas A&M.
ELEC 7950 – VLSI Design and Test Seminar
Chandrasekhar 1 MAPLD 2005/204 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan.
Fault-Tolerant Resynthesis for Dual-Output LUTs Roy Lee 1, Yu Hu 1, Rupak Majumdar 2, Lei He 1 and Minming Li 3 1 Electrical Engineering Dept., UCLA 2.
IPF: In-Place X-Filling to Mitigate Soft Errors in SRAM-based FPGAs
Soft Error Analysis of FPGA under ISO Standard
Soft Error Rates with Inertial and Logical Masking
Timing Analysis 11/21/2018.
Overview Last Lecture Conversion of two-level logic to NAND or NOR forms Multilevel logic AOI and OAI gates Today Timing and hazards Multiplexers and demultiplexers.
Topics Performance analysis..
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs
Advancement on the Analysis and Mitigation of
R.W. Mann and N. George ECE632 Dec. 2, 2008
Presentation transcript:

A F AST AND A CCURATE M ULTI -C YCLE S OFT E RROR R ATE E STIMATION A PPROACH TO R ESILIENT E MBEDDED S YSTEMS D ESIGN Department of Computer Engineering Sharif University of Technology Tehran, IRAN Mahdi Fazeli, Seyed Ghassem Miremadi, Hossein Asadi, Seyed Nematollah Ahmadian Presenter: Saman Aliari University of Illinois at Urbana Chamapign

S PEECH O UTLINES Soft Errors SER Modeling in Multi-Cycle Operation SER Modeling in Single Cycle Operation Proposed SER Modeling in Multi Cycle Operation Tool Overview Experimental Results and Discussions Conclusions 2

W HAT IS SOFT ERROR ? 3 Transient Faults Transient Faults Due to radiation events 1  0 or 0  1 1  0 or 0  1 Alpha particles or Neutrons Memory, Flip-flops, Combinational Logic Energetic Particle

E VIDENCES OF P ARTICLE S TRIKES 2000 [Forbes Magezine’00] SUN Enterprise servers crash, due to Cache problem 2001 [ITRS’01] Soft errors as a major issue in chip design 2003 [EE Times’04] Cisco routers failure, due to soft errors 2004 [Xilinx.com] Xilinx FPGAs highly sensitive to soft errors 2005 [Selse.org] Soft error workshop (70% industry attendees) 2011 [ZeroSoft’06] Expected 70% chips to fail in a year 4

M ULTI -C YCLE S OFT E RROR P ROPAGATION 5 First Cycle: The SET does not propagate to the Primary Output (PO) Second Cycle: The error propagates to the Primary Output (PO)

SER M ODELING IN S INGLE C YCLE Nominal FIT  Logic Derating  Timing Derating  Electrical Derating Nominal FIT: Occurrence rate of cosmic rays at error site Computed once for library characterization Logical Derating Timing Derating Electrical Derating 6 D BC E A D FF 1 1

L OGICAL D ERATING M ODELING 7 The Main Idea: Traversing structural paths from SEU site to POs and FFs Using Signal Probabilities (SP) for off-path signals SP A : probability of gate “A” having logic value “1” Effective techniques available for SP computation EPP(A  D) = SP B = 0.2 EPP: Error Propagation Probability EPP(A  D) = SP B = 0.2 EPP: Error Propagation Probability EPP(A  E) = EPP(A  D)  (1-SP C ) = 0.2  0.6 = 0.12 EPP(A  E) = EPP(A  D)  (1-SP C ) = 0.2  0.6 = 0.12 off-path signals SP B =0.2 SP C =0.4 D B C E A on-path signals

P ROPAGATION R ULES : O N -P ATH G ATES Reconvergent Paths Error propagated to two or more inputs of a gate Polarity of propagated error matters! Need of 4 logic values to represent state of each line 0, 1 : no error propagation (Error masked) a: error propagation with same polarity as error site ā : error propagation with opposite polarity as error site P a (U i ), P ā (U i ), P 1 (U i ), P 0 (U i ) Developed Error Propagation Probability (EPP) Rules For all logic gates 8

P ROPAGATION R ULES 9 On-path gates: P a (U i ) + P ā (U i ) + P 1 (U i ) + P 0 (U i ) = 1 On-path gates: P a (U i ) + P ā (U i ) + P 1 (U i ) + P 0 (U i ) = 1 Off-path gates: P 1 (U i ) + P 0 (U i ) = 1 Off-path gates: P 1 (U i ) + P 0 (U i ) = 1

T IMING D ERATING M ODELING Find all possible propagated waveforms Enhanced static timing analysis Record all possible transitions at each reachable gate Due to glitch at error site How? Create glitch of width w Represented by two events: (a,t), (ā,t+w) For both positive and negative glitches Inject two events (a,t), (ā,t+w) at error site Find all events at the outputs of all on-path gates Calculate the error propagation probabilities Pa, Pā for each event The propagation is done until reaching a PO or FF. Error propagation probabilities for all possible waveforms are computed For each waveform, Latching Probability is computed as follows: S: Setup Time, H: Hold Time, W: Glitch Width, T:Clock Period 10

11  Different Glitches may propagate to the POs or FFs due to re-convergent fan- out

E LECTRICAL D ERATING M ODELING 1. Algorithm:Computing electrical masking while propagating events 2. Vomin(Gj, inputk):Minimum voltage of input k of Gj 3. Vomax(Gj, inputk):Maximum voltage of input k of Gj 4. Vomin(Gj ): Minimum voltage of Gj output 5. Vomax(Gj ): Maximum voltage of Gj output 6. PWo: Output pulse width 7. For each gate Gj in List(Gi) do 8. For each valid waveform (Wl) in Event List(Gj) do 9. Vomin(inputs) = Max(V omin of gate inputs on waveform Wl); 10. Vomax(inputs) = Min(V omax of gate inputs on waveform Wl); 11. Compute Vomin(Gj ) 12. Compute Vomax(Gj ) 13. Compute Pwo using computed Vomin(Gj ) and Vomax(Gj ) 14. end 15. end 12

A C ASE S TUDY : E RROR P ROPAGATION FOR T WO C LOCK C YCLES 13 Only logical derating may occur All three deratings may occur

T HE T OOL : MLET M ULTI -C YCLE L OGICAL -E LECTRICAL -T IMING D ERATING 14

E XPERIMENTAL R ESULTS : R UN T IME 15 Execution times for MC simulation approach, SP computation, and MLET approach On average, 4 orders of magnitude faster than MC based simulation Time required to compute SPs is also 5 orders of magnitude less than MC based simulation

E XPERIMENTAL R ESULTS : A CCURACY 16 Difference of derating factors obtained by MLET using various SP variances compared to MC simulations (for an injected pulse width of 50 ps) The MLET have an accuracy of about 97% as compared to the MC fault injection approach

M ULTI -C YCLE SER S 17 Multi-cycle SER estimation of s820 and s832 ISCAS’89 circuits using MLET

C ONCLUSIONS & F UTURE W ORK SER Estimation is very challenging as it requires dynamic analysis of transients. The existing SER estimation approaches rely on investigation of error propagation probabilities for only single cycle resulting in inaccurate system failure rate. We have proposed a very fast and accurate analytical approach so called MLET which has four main features: 1. It runs very fast. 2. All three masking factors are considered. 3. The effects of error propagation in re-convergent fan-outs are modeled. 4. The effect of multi-cycle error propagation on overall circuit SER is considered. 18

C ONCLUSIONS & F UTURE W ORK C ONT ’ D Experimental results extracted for some ISCAS89 circuit benchmark show that MLET is: 4 orders of magnitude faster than the MC simulation based fault injection method It has an accuracy of about 97%. Future work: we are going to estimate the SER of a circuit in the presence of Multiple Event Transients (METs) as a reliability concern in ultra deep sub-micron technologies 19

T HANK YOU FOR YOUR ATTENTION 20

R ELATED W ORK : SER M ODELING Circuit/Logic-Level Approach Fault injection SERA by Zhang et. al. [ICCAD’04] SEAT-LA by Rajaraman et. al. [VLSID’06] Mohanram et. al. [ITC’03] Maheshwari et. al. [DFT’03] Asadi et. al. [DSN’03] [PRDC’04] Seifert et. al. [TDMR’04] Probabilistic Transfer Matrices (PTM) Krishnaswamy et. al. [DATE’05] Binary Decision Diagram (BDD) FASER by Zhang et. al. [ISQED’06] [SELSE’05] 21