MAPLD 2005/202 Pratt1 Improving FPGA Design Robustness with Partial TMR Brian Pratt 1,2 Michael Caffrey, Paul Graham 2 Eric Johnson, Keith Morgan, Michael.

Slides:



Advertisements
Similar presentations
Giuseppe De Robertis - INFN Sez. di Bari 1 SEU – SET test structures.
Advertisements

Survey of Detection, Diagnosis, and Fault Tolerance Methods in FPGAs
Baloch 1MAPLD 2005/1024-L Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan 1,2.
Sana Rezgui 1, Jeffrey George 2, Gary Swift 3, Kevin Somervill 4, Carl Carmichael 1 and Gregory Allen 3, SEU Mitigation of a Soft Embedded Processor in.
Scrubbing Approaches for Kintex-7 FPGAs
Discussion of: “Terrestrial-based Radiation Upsets: A Cautionary Tale” CprE 583 Tony Kuker 12/06/05.
Multi-Bit Upsets in the Virtex Devices Heather Quinn, Paul Graham, Jim Krone, Michael Caffrey Los Alamos National Laboratory Gary Swift, Jeff George, Fayez.
Using FPGAs in a Radiation Environment ATLAS CMS Electronics Workshop for LHC upgrades (ACES2014) March, 2014, CERN Michael Wirthlin Brigham Young.
Radiation Effects on FPGA and Mitigation Strategies Bin Gui Experimental High Energy Physics Group 1Journal Club4/26/2015.
HPEC 2012 Scrubbing Optimization via Availability Prediction (SOAP) for Reconfigurable Space Computing Quinn Martin Alan George.
Complex Upset Mitigation Applied to a Re-Configurable Embedded Processor EEL 6935 Lu Hao Wenqian Wu.
1 Fault Tolerant FPGA Co-processing Toolkit Oral defense in partial fulfillment of the requirements for the degree of Master of Science 2006 Oral defense.
BL-TMR and Mitigation Approaches for FPGAs
ICAP CONTROLLER FOR HIGH-RELIABLE INTERNAL SCRUBBING Quinn Martin Steven Fingulin.
Nishinaga No. 1 MAPLD2005 Availability Analysis of Xilinx FPGA on Orbit Nozomu Nishinaga National Institute of Information and Communications Technology.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
DSD 2007 Concurrent Error Detection for FSMs Designed for Implementation with Embedded Memory Blocks of FPGAs Andrzej Krasniewski Institute of Telecommunications.
L189/MAPLD2004Carmichael 1 A Triple Module Redundancy Scheme for SEU Mitigation of Static Latch-Based FPGAs (“Birds-of-a-Feather”) Carl Carmichael 1, Brendan.
Micro-RDC Microelectronics Research Development Corporation A Programmable Scrubber for FPGAs ACKNOWLEDGMENT OF SUPPORT: This material is based upon work.
Fault-Tolerant Softcore Processors Part I: Fault-Tolerant Instruction Memory Nathaniel Rollins Brigham Young University.
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
Embedded Systems Laboratory Informatics Institute Federal University of Rio Grande do Sul Porto Alegre – RS – Brazil SRC TechCon 2005 Portland, Oregon,
Spring 07, Apr 17, 19 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Soft Errors and Fault-Tolerant Design Vishwani.
Fault Tolerant Techniques for System on a Chip Devices Matthew French and J.P. Walters University of Southern California’s Information Sciences Institute.
Radiation Effects and Mitigation Strategies for modern FPGAs 10 th annual workshop for LHC and Future experiments Los Alamos National Laboratory, USA.
Reduced Cost Reliability via Statistical Model Detection Jon-Paul Anderson- PhD Student Dr. Brent Nelson- Faculty Dr. Mike Wirthlin- Faculty Brigham Young.
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
SiLab presentation on Reliable Computing Combinational Logic Soft Error Analysis and Protection Ali Ahmadi May 2008.
Tools - Implementation Options - Chapter15 slide 1 FPGA Tools Course Implementation Options.
FORMAL VERIFICATION OF ADVANCED SYNTHESIS OPTIMIZATIONS Anant Kumar Jain Pradish Mathews Mike Mahar.
ATMEL ATF280E Rad Hard SRAM Based FPGA SEE test results Application oriented SEU Sensitiveness Bernard BANCELIN ATMEL Nantes SAS, Aerospace Business Unit.
FT-UNSHADES Analysis of SEU effects in Digital Designs for Space Gioacchino Giovanni Lucia TEC-EDM, MPD - 8 th March Phone: +31.
Vendor Independent SEE Mitigation Solution For FPGAs Kamesh Ramani Pravin Bhandakkar Darren Zacher Melanie Berg (MEI – NASA Goddard)
P173/MAPLD 2005 Swift1 Upset Susceptibility and Design Mitigation of PowerPC405 Processors Embedded in Virtex II-Pro FPGAs.
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
Synthesis Of Fault Tolerant Circuits For FSMs & RAMs Rajiv Garg Pradish Mathews Darren Zacher.
Experimental Evaluation of System-Level Supervisory Approach for SEFIs Mitigation Mrs. Shazia Maqbool and Dr. Craig I Underwood Maqbool 1 MAPLD 2005/P181.
P189/MAPLD2004Carmichael 1 A Triple Module Redundancy Scheme for SEU Mitigation of Static Latch-Based FPGAs Carl Carmichael 1, Brendan Bridgford 1, Gary.
1 The NSEU Sensitivity of Static Latch Based FPGAs and Flash Storage CPLDs Joseph Fabula Jason Moore Austin Lesea Saar Drimer MAPLD2004 This work has benefited.
A Physical Resource Management Approach to Minimizing FPGA Partial Reconfiguration Overhead Heng Tan and Ronald F. DeMara University of Central Florida.
2011/IX/27SEU protection insertion in Verilog for the ABCN project 1 Filipe Sousa Francis Anghinolfi.
French 207 MAPLD 2005 Slide 1 Integrated Tool Suite for Post Synthesis FPGA Power Consumption Analysis Matthew French, Li Wang University of Southern California,
LaRC MAPLD 2005 / A208 Ng 1 Radiation Tolerant Intelligent Memory Stack (RTIMS) Tak-kwong Ng, Jeffrey Herath Electronics Systems Branch Systems Engineering.
1/14 Merging BIST and Configurable Computing Technology to Improve Availability in Space Applications Eduardo Bezerra 1, Fabian Vargas 2, Michael Paul.
Evaluating Logic Resources Utilization in an FPGA-Based TMR CPU
Varadarajan Srinivasan, Julian W. Farquharson,
Greg Alkire/Brian Smith 197 MAPLD An Ultra Low Power Reconfigurable Task Processor for Space Brian Smith, Greg Alkire – PicoDyne Inc. Wes Powell.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
Paper by F.L. Kastensmidt, G. Neuberger, L. Carro, R. Reis Talk by Nick Boyd 1.
Nishinaga No. 1 MAPLD2005/1003-J Availability Analysis of Xilinx FPGA on Orbit Nozomu Nishinaga National Institute of Information and Communications Technology.
Gill 1 MAPLD 2005/234 Analysis and Reduction Soft Delay Errors in CMOS Circuits Balkaran Gill, Chris Papachristou, and Francis Wolff Department of Electrical.
Radiation Tolerance Studies using Fault Injection on the Readout Control FPGA Design of the ALICE TPC Detector Johan Alme Bergen University College, Norway.
Chandrasekhar 1 MAPLD 2005/204 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan.
P201-L/MAPLD SEE Validation of SEU Mitigation Methods for FPGAs Carl Carmichael 1, Sana Rezgui 1, Gary Swift 2, Jeff George 3, & Larry Edmonds 2.
MAPLD 2005/213Kakarla & Katkoori Partial Evaluation Based Redundancy for SEU Mitigation in Combinational Circuits MAPLD 2005 Sujana Kakarla Srinivas Katkoori.
Fault-Tolerant Resynthesis for Dual-Output LUTs Roy Lee 1, Yu Hu 1, Rupak Majumdar 2, Lei He 1 and Minming Li 3 1 Electrical Engineering Dept., UCLA 2.
Floating-Point FPGA (FPFPGA)
MAPLD 2005 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan Dr. V. Kamakoti.
SEU Mitigation Techniques for Virtex FPGAs in Space Applications
Radiation Tolerance of an Used in a Large Tracking Detector
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
MAPLD 2005 BOF-L Mitigation Methods for
M. Aguirre1, J. N. Tombs1, F. Muñoz1, V. Baena1, A. Torralba1, A
Sequential circuits and Digital System Reliability
Evaluation of Power Costs in Triplicated FPGA Designs
Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan1,2 Dr.Adrian Stoica3.
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs
Upset Susceptibility and Design Mitigation of
Hardware Assisted Fault Tolerance Using Reconfigurable Logic
Xilinx Kintex7 SRAM-based FPGA
Presentation transcript:

MAPLD 2005/202 Pratt1 Improving FPGA Design Robustness with Partial TMR Brian Pratt 1,2 Michael Caffrey, Paul Graham 2 Eric Johnson, Keith Morgan, Michael Wirthlin 1 1 Brigham Young University Department of Electrical Engineering 2 Los Alamos National Laboratory

MAPLD 2005/202 Pratt2 Motivation for Partial TMR Factors of fault-tolerant computing: –Availability –Reliability –Mitigation Cost Full TMR –Expensive in terms of power, speed, area, etc. –Worthwhile if affordable! Area Cost MTBF x Reliability constraint Area constraint

MAPLD 2005/202 Pratt3 Motivation for Partial TMR Partial TMR offers: –Mitigation of most sensitive design structures –Increased availability of a system by decreasing number of system resets –Decreased mitigation cost over full TMR Suitability of Partial TMR is application dependent –Reduced reliability compared to full TMR

MAPLD 2005/202 Pratt4 Scrubbing Must be included with Partial Mitigation Continuously ‘read’ and ‘clean’ configuration memory Single bit will be upset no longer than t s t s = time for one scrub

MAPLD 2005/202 Pratt5 Non-Persistent Errors An SEU in the non- persistent cross- section will cause a temporary interruption of service Requires partial reconfiguration to correct Scrubbing Repairs Configuration Correct Output time cycle error magnitude error = delta between outputs of a golden and DUT circuit

MAPLD 2005/202 Pratt6 Persistent Errors An SEU in the persistent cross- section will cause a permanent interruption of service Requires full system reset to correct Scrubbing Repairs Configuration Incorrect Output error = delta between outputs of a golden and DUT circuit time cycle error magnitude

MAPLD 2005/202 Pratt7 Non-Persistent Circuit Structures Generally consists of circuit components and routing in a feed- forward path Logic FF Logic

MAPLD 2005/202 Pratt8 Persistent Circuit Structures Generally consists of circuit components and routing in, or contributing to, a feed-back path Logic FF Logic

MAPLD 2005/202 Pratt9 Apply a mitigation technique to just the persistent cross section Logic FF Logic TMR Partial Mitigation

MAPLD 2005/202 Pratt10 Limitations of Partial Mitigation Does not prevent all errors –System must be corrected with configuration bitstream scrubbing –Circuit configuration can be incorrect between scrubbing Non-persistent errors remain

MAPLD 2005/202 Pratt11 Automated Partial TMR Analyze an EDIF source file for feedback structures –Protect these sections with TMR to reduce persistent cross section

MAPLD 2005/202 Pratt12 BLTmr Partial TMR Tool BYU-LANL Triple Modular Redundancy: Configurable Reliability –Limit mitigation to minimize: design resource requirements power consumption –Mitigation focused on persistent circuit structures

MAPLD 2005/202 Pratt13 BLTmr Partial TMR Tool Design Divided into three sections: – Feedback, Input to FB, Output Logic FF Logic

MAPLD 2005/202 Pratt14 BLTmr Partial TMR Tool Design Divided into three sections: – Feedback, Input to FB, Output Logic FF Logic

MAPLD 2005/202 Pratt15 BLTmr Partial TMR Tool Design Divided into three sections: – Feedback, Input to FB, Output Logic FF Logic

MAPLD 2005/202 Pratt16 BLTmr Partial TMR Tool Design Divided into three sections: – Feedback, Input to FB, Output Logic FF Logic

MAPLD 2005/202 Pratt17 BLTmr Tool Options BLTmr Tool applies TMR mitigation to subsections of the design: –Feedback Only –Feedback + Input to Feedback –FB + Input to FB + Output (Full TMR)

MAPLD 2005/202 Pratt18 BLTmr Tool Options BLTmr Tool applies TMR mitigation to subsections of the design: –Feedback Only –Feedback + Input to Feedback –FB + Input to FB + Output (Full TMR)

MAPLD 2005/202 Pratt19 BLTmr Tool Options BLTmr Tool applies TMR mitigation to subsections of the design: –Feedback Only –Feedback + Input to Feedback –FB + Input to FB + Output (Full TMR)

MAPLD 2005/202 Pratt20 BLTmr Tool Options BLTmr Tool applies TMR mitigation to subsections of the design: –Feedback Only –Feedback + Input to Feedback –FB + Input to FB + Output (Full TMR)

MAPLD 2005/202 Pratt21 BLTmr Tool Flow BYU EDIF development environment reads in user design Design organized into graph structure for analysis Parse EDIF Create Design Database User Constraints Analysis (Feedback, Input to FB, etc.) Cell Triplication Original Design Partially Mitigated Design Voter Insertion

MAPLD 2005/202 Pratt22 BLTmr Tool Flow User may direct mitigation Design analyzed to classify components as described Parse EDIF Create Design Database User Constraints Analysis (Feedback, Input to FB, etc.) Cell Triplication Original Design Partially Mitigated Design Voter Insertion

MAPLD 2005/202 Pratt23 BLTmr Tool Flow Circuit elements triplicated Voters inserted Mitigated design written in EDIF format Parse EDIF Create Design Database User Constraints Analysis (Feedback, Input to FB, etc.) Cell Triplication Original Design Partially Mitigated Design Voter Insertion

MAPLD 2005/202 Pratt24 Example Circuits Tests on two designs 1.DSP Kernel 2.Synthetic Design –LFSR modules feeding into an add-multiply tree

MAPLD 2005/202 Pratt25 FPGA Editor Layout Sensitivity MapPersistence Map DSP Kernel Unmitigated Fault Analysis 5,746 slices (46%) 575,448 bits (9.9%)13,841 bits (0.23%) Synthetic Design 2,538 slices (20%)189,835 bits (3.3%)77,159 bits (1.3%)

MAPLD 2005/202 Pratt26 FPGA Editor LayoutSensitivity MapPersistence Map Unmitigated Experimental Results – Design #1 DSP Kernel 5,746 slices (46%) 575,448 (9.90%)13,841 (0.24%) Partial TMR applied to Feedback & Input to FB 8,036 slices (65%) 569,700 (9.81%)152 (0.0026%)

MAPLD 2005/202 Pratt27 Unmitigated Experimental Results – Design #2 Synthetic (LFSR/Mult) 2,538 slices (20%) 189,835 (3.27%)77,159 (1.33%) Full TMR Applied 11,961 slices (97%) 20,256 (0.35%) 671 (0.012%) FPGA Editor LayoutSensitivity MapPersistence Map

MAPLD 2005/202 Pratt28 * Full TMR could not be applied to DSP Kernel due to FPGA resource constraints “Qpro Virtex 2.5V radiation hardened FPGAs”, Xilinx Inc., DS028 (v1.2), Nov. 5, Experimental Results

MAPLD 2005/202 Pratt29 Experimental Results GPS orbit (22,200 km altitude, 55° inclination) AP-8 Solar Minimum, JPL Solar Proton Quiet, CRÈME 96 Solar Minimum

MAPLD 2005/202 Pratt30 Summary of Results Design Size Increase Sensitivity Decrease Persistence Decrease Average MTBF Increase ‡‡ DSP Kernel* 40%3%99%90x Synthetic Design ‡ 370%89%99%114x * Unmitigated to Partial TMR of Feedback + Input to FB ‡ Unmitigated to Full TMR ‡‡ GPS orbit; AP-8 Solar Minimum, JPL Solar Proton Quiet, CRÈME 96 Solar Minimum

MAPLD 2005/202 Pratt31 Conclusions Pros: Partial TMR (BLTmr) as fault mitigation offers: –Increased system availability due to fewer system resets –More “affordable” fault mitigation than full TMR –Critical design areas are mitigated with an automated tool Cons: –Much of the design may be unmitigated, leaving sensitive sections May result in temporary errors