Paper by F.L. Kastensmidt, G. Neuberger, L. Carro, R. Reis Talk by Nick Boyd 1.

Slides:



Advertisements
Similar presentations
Giuseppe De Robertis - INFN Sez. di Bari 1 SEU – SET test structures.
Advertisements

Survey of Detection, Diagnosis, and Fault Tolerance Methods in FPGAs
What is RAID Redundant Array of Independent Disks.
Baloch 1MAPLD 2005/1024-L Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan 1,2.
10/14/2005Caltech1 Reliable State Machines Dr. Gary R Burke California Institute of Technology Jet Propulsion Laboratory.
Scrubbing Approaches for Kintex-7 FPGAs
Fault-Tolerant Systems Design Part 1.
April 30, Cost efficient soft-error protection for ASICs Tuvia Liran; Ramon Chips Ltd.
ICAP CONTROLLER FOR HIGH-RELIABLE INTERNAL SCRUBBING Quinn Martin Steven Fingulin.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
3. Hardware Redundancy Reliable System Design 2010 by: Amir M. Rahmani.
DSD 2007 Concurrent Error Detection for FSMs Designed for Implementation with Embedded Memory Blocks of FPGAs Andrzej Krasniewski Institute of Telecommunications.
Self-Checking Carry-Select Adder Design Based on Two-Rail Encoding
1 A Design Approach for Radiation-hard Digital Electronics Rajesh Garg Nikhil Jayakumar Sunil P Khatri Gwan Choi Department of Electrical and Computer.
Carlos Arthur Lang Lisbôa, Luigi Carro, Erika Cota ETS 2005 RobOps - Arithmetic Operators for Future Technologies Future technologies, bellow 90nm, will.
TH EDA NTHU-CS VLSI/CAD LAB 1 Re-synthesis for Reliability Design Shih-Chieh Chang Department of Computer Science National Tsing Hua University.
2. Introduction to Redundancy Techniques Redundancy Implies the use of hardware, software, information, or time beyond what is needed for normal system.
Convolutional Code Based Concurrent Error Detection in Finite State Machines Konstantinos N. Rokas Advisor: Prof. Yiorgos Makris.
Carlos Arthur Lang Lisbôa, Luigi Carro DATE EDAA PhD Forum Dealing with Multiple Simultaneous Faults in Future Technologies INFORMÁTICA Universidade.
Embedded Systems Laboratory Informatics Institute Federal University of Rio Grande do Sul Porto Alegre – RS – Brazil SRC TechCon 2005 Portland, Oregon,
Carlos Arthur Lang Lisbôa, Erik Schüler, Luigi Carro SRC TechCon 2005 Dealing with Multiple Simultaneous Faults in Future Technologies in Future Technologies.
Spring 07, Apr 17, 19 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Soft Errors and Fault-Tolerant Design Vishwani.
Motivation Yang You 1, Jinghong Chen 1, Datao Gong 2, Deping Huang 1, Tiankuan Liu 2, Jingbo Ye 2 1 Department of Electrical Engineering, Southern Methodist.
Reduced Cost Reliability via Statistical Model Detection Jon-Paul Anderson- PhD Student Dr. Brent Nelson- Faculty Dr. Mike Wirthlin- Faculty Brigham Young.
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
IBM S/390 Parallel Enterprise Server G5 fault tolerance: A historical perspective by L. Spainhower & T.A. Gregg Presented by Mahmut Yilmaz.
Reconfiguration Based Fault-Tolerant Systems Design - Survey of Approaches Jan Balach, Jan Balach, Ondřej Novák FIT, CTU in Prague MEMICS 2010.
SiLab presentation on Reliable Computing Combinational Logic Soft Error Analysis and Protection Ali Ahmadi May 2008.
FORMAL VERIFICATION OF ADVANCED SYNTHESIS OPTIMIZATIONS Anant Kumar Jain Pradish Mathews Mike Mahar.
Fault-Tolerant Systems Design Part 1.
MAPLD 2005/202 Pratt1 Improving FPGA Design Robustness with Partial TMR Brian Pratt 1,2 Michael Caffrey, Paul Graham 2 Eric Johnson, Keith Morgan, Michael.
Synthesis Of Fault Tolerant Circuits For FSMs & RAMs Rajiv Garg Pradish Mathews Darren Zacher.
Yun-Chung Yang SimTag: Exploiting Tag Bits Similarity to Improve the Reliability of the Data Caches Jesung Kim, Soontae Kim, Yebin Lee 2010 DATE(The Design,
SET Fault Tolerant Combinational Circuits Based on Majority Logic
CprE 458/558: Real-Time Systems
High Performance Embedded Computing (HPEC) Workshop 23−25 September 2008 John Holland & Eliot Glaser Northrop Grumman Corporation P.O. Box 1693 Baltimore,
Fault-Tolerant Systems Design Part 1.
The concept of RAID in Databases By Junaid Ali Siddiqui.
2011/IX/27SEU protection insertion in Verilog for the ABCN project 1 Filipe Sousa Francis Anghinolfi.
D_160 / MAPLD Burke 1 Fault Tolerant State Machines Gary Burke, Stephanie Taft Jet Propulsion Laboratory, California Institute of Technology.
1/14 Merging BIST and Configurable Computing Technology to Improve Availability in Space Applications Eduardo Bezerra 1, Fabian Vargas 2, Michael Paul.
Using Memory to Cope with Simultaneous Transient Faults Authors: Universidade Federal do Rio Grande do Sul Programa de Pós-Graduação em Engenharia Elétrica.
Evaluating Logic Resources Utilization in an FPGA-Based TMR CPU
Varadarajan Srinivasan, Julian W. Farquharson,
Eduardo L. Rhod, Álisson Michels, Carlos A. L. Lisbôa, Luigi Carro ETS 2006 Fault Tolerance Against Multiple SEUs using Memory-Based Circuits to Improve.
1 Fundamentals of Computer Science Combinational Circuits.
1 Advanced Digital Design Reconfigurable Logic by A. Steininger and M. Delvai Vienna University of Technology.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
A4 1 Barto "Sequential Circuit Design for Space-borne and Critical Electronics" Dr. Rod L. Barto Spacecraft Digital Electronics Richard B. Katz NASA Goddard.
A Simplified Approach to Fault Tolerant State Machine Design for Single Event Upsets Melanie Berg.
03/30/031 ECE Digital System Design & Synthesis Lecture Design Partitioning for Synthesis Strategies  Partition for design reuse  Keep related.
12004 MAPLD Int’l Conference – Paper 118 Kumar Automated FSM Error Correction for Single Event Upsets Dr. Nand Kumar & Darren Zacher Design Creation and.
A Novel, Highly SEU Tolerant Digital Circuit Design Approach By: Rajesh Garg Sunil P. Khatri Department of Electrical and Computer Engineering, Texas A&M.
A Survey of Fault Tolerant Methodologies for FPGA’s Gökhan Kabukcu
Chandrasekhar 1 MAPLD 2005/204 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan.
MAPLD 2005/213Kakarla & Katkoori Partial Evaluation Based Redundancy for SEU Mitigation in Combinational Circuits MAPLD 2005 Sujana Kakarla Srinivas Katkoori.
Self-Checking Circuits
Floating-Point FPGA (FPFPGA)
CFTP ( Configurable Fault Tolerant Processor )
MAPLD 2005 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan Dr. V. Kamakoti.
SEU Mitigation Techniques for Virtex FPGAs in Space Applications
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
MAPLD 2005 BOF-L Mitigation Methods for
Sequential circuits and Digital System Reliability
Evaluation of Power Costs in Triplicated FPGA Designs
Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan1,2 Dr.Adrian Stoica3.
Dynamic High-Performance Multi-Mode Architectures for AES Encryption
TECHNICAL SEMINAR PRESENTATION
Hardware Assisted Fault Tolerance Using Reconfigurable Logic
Seminar on Enterprise Software
Presentation transcript:

Paper by F.L. Kastensmidt, G. Neuberger, L. Carro, R. Reis Talk by Nick Boyd 1

 Exploring techniques for detecting and dealing with radiation-induced faults in FPGAs  Why?  Drive to use commercial off-the-shelf to minimize cost and development time (for space apps)  As technology gets smaller, radiation becomes an issue even at ground level 2

 Incident radiation deposits energy  creation of electron-hole pairs and secondary ionizations produce transient current pulse  Can change a ‘0’ to a ‘1’ or vice versa, often called “bit flip”  In combinational logic: Single Event Transient (SET)  In sequential (or memory): Single Event Upset (SEU) 3

 In FPGAs there are further considerations  SEU in the configuration SRAM (logic, routing)  SET in combinational FPGA fabric  SEU in BlockRAM 4

 Effects of SEU in configuration fabric 5

 TMR = Triple Modular Redundancy  Logic is triplicated and results are accepted by majority vote  Everything is tripled; including combinational, sequential, routing and i/o 6

 Benefits  Able to detect and correct SEU\SET anywhere in the FPGA  No performance penalty  Drawbacks  Very large area/resource penalty (particularly problematic for i/o pads) 7

 A new technique proposed by the authors of this paper  DMR-CED: Double Modular Redundancy with Concurrent Error Detection  Motivation: Want to find a way that is as reliable as TMR in detecting/correcting errors with less area overhead 8

 CED = Concurrent Error Detection  Exploits some property of the logic block to find error  Time-redundant examples:  bit-wise inversion  re-computing with shifted operands (RESO)  re-computing with swapped operands (REWSO) 9

 Result calculated from direct input and stored  Input then encoded, new result calculated and decoded  Two outputs compared – should be equal 10

 How can we use CED?  Only duplicate combinational logic  Use CED to determine the faulty module only if there is disagreement 11

12

 Three sample sequential circuits tested  8-bit multiplier  8-bit ALU  FIR filter  Sample circuits generated then each node was replaced with a multiplexor which either passes ‘correct’, ‘0’, or ‘1’  Able to simulate every possible SEU fault 13

14

15

 Benefits  Reduces area required for combinational logic (by a significant amount in some cases)  Drawbacks  Significantly more complicated due to CED  CED circuit needs to be chosen to be optimized for each combinational circuit you protect  Speed reduced by as much as 50% 16

 Reasonably well written and complete  Necessary to read the references to understand the minutiae of underlying principles  DMR-CED probably only useful under very specific conditions 17