Radiation Effects on FPGA and Mitigation Strategies Bin Gui Experimental High Energy Physics Group 1Journal Club4/26/2015.

Slides:



Advertisements
Similar presentations
RAM (RANDOM ACCESS MEMORY)
Advertisements

Sana Rezgui 1, Jeffrey George 2, Gary Swift 3, Kevin Somervill 4, Carl Carmichael 1 and Gregory Allen 3, SEU Mitigation of a Soft Embedded Processor in.
Scrubbing Approaches for Kintex-7 FPGAs
Multi-Bit Upsets in the Virtex Devices Heather Quinn, Paul Graham, Jim Krone, Michael Caffrey Los Alamos National Laboratory Gary Swift, Jeff George, Fayez.
HPEC 2012 Scrubbing Optimization via Availability Prediction (SOAP) for Reconfigurable Space Computing Quinn Martin Alan George.
Complex Upset Mitigation Applied to a Re-Configurable Embedded Processor EEL 6935 Lu Hao Wenqian Wu.
1 Fault Tolerant FPGA Co-processing Toolkit Oral defense in partial fulfillment of the requirements for the degree of Master of Science 2006 Oral defense.
April 30, Cost efficient soft-error protection for ASICs Tuvia Liran; Ramon Chips Ltd.
ICAP CONTROLLER FOR HIGH-RELIABLE INTERNAL SCRUBBING Quinn Martin Steven Fingulin.
DC/DC Switching Power Converter with Radiation Hardened Digital Control Based on SRAM FPGAs F. Baronti 1, P.C. Adell 2, W.T. Holman 2, R.D. Schrimpf 2,
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
L189/MAPLD2004Carmichael 1 A Triple Module Redundancy Scheme for SEU Mitigation of Static Latch-Based FPGAs (“Birds-of-a-Feather”) Carl Carmichael 1, Brendan.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
Lecture 2: Field Programmable Gate Arrays I September 5, 2013 ECE 636 Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays I.
Programmable logic and FPGA
Khaled A. Al-Utaibi Memory Devices Khaled A. Al-Utaibi
Radiation Effects and Mitigation Strategies for modern FPGAs 10 th annual workshop for LHC and Future experiments Los Alamos National Laboratory, USA.
12004 MAPLD: 141Buchner Single Event Effects Testing of the Atmel IEEE1355 Protocol Chip Stephen Buchner 1, Mark Walter 2, Moses McCall 3 and Christian.
Memory and Programmable Logic
Rabie A. Ramadan Lecture 3
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
MICROPROCESSOR INPUT/OUTPUT
FPGA IRRADIATION and TESTING PLANS (Update) Ray Mountain, Marina Artuso, Bin Gui Syracuse University OUTLINE: 1.Core 2.Peripheral 3.Testing Procedures.
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 4 Programmable.
Section II Basic PLD Architecture. Section II Agenda  Basic PLD Architecture —XC9500 and XC4000 Hardware Architectures —Foundation and Alliance Series.
EKT 221 : Digital 2 Memory Basics
System Arch 2008 (Fire Tom Wada) /10/9 Field Programmable Gate Array.
Presented by Anthony B. Sanders NASA/GSFC at 2005 MAPLD Conference, Washington, DC #196 1 ALTERA STRATIX TM EP1S25 FIELD-PROGRAMMABLE GATE ARRAY (FPGA)
Memory and Programmable Logic Memory device: Device to which binary information is transferred for storage, and from which information is available for.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n Latches and flip-flops. n RAMs and ROMs.
J. Christiansen, CERN - EP/MIC
FPGA (Field Programmable Gate Array): CLBs, Slices, and LUTs Each configurable logic block (CLB) in Spartan-6 FPGAs consists of two slices, arranged side-by-side.
® SPARTAN Series High Volume System Solution. ® Spartan/XL Estimated design size (system gates) 30K 5K180K XC4000XL/A XC4000XV Virtex S05/XL.
Memory and Storage Dr. Rebhi S. Baraka
ATMEL ATF280E Rad Hard SRAM Based FPGA SEE test results Application oriented SEU Sensitiveness Bernard BANCELIN ATMEL Nantes SAS, Aerospace Business Unit.
2/2/2009 Marina Artuso LHCb Electronics Upgrade Meeting1 Front-end FPGAs in the LHCb upgrade The issues What is known Work plan.
Fault-Tolerant Systems Design Part 1.
Field Programmable Gate Arrays (FPGAs) An Enabling Technology.
MAPLD 2005/202 Pratt1 Improving FPGA Design Robustness with Partial TMR Brian Pratt 1,2 Michael Caffrey, Paul Graham 2 Eric Johnson, Keith Morgan, Michael.
Basic Sequential Components CT101 – Computing Systems Organization.
Computer Architecture Lecture 2 System Buses. Program Concept Hardwired systems are inflexible General purpose hardware can do different tasks, given.
P189/MAPLD2004Carmichael 1 A Triple Module Redundancy Scheme for SEU Mitigation of Static Latch-Based FPGAs Carl Carmichael 1, Brendan Bridgford 1, Gary.
Chapter 4 MARIE: An Introduction to a Simple Computer.
LaRC MAPLD 2005 / A208 Ng 1 Radiation Tolerant Intelligent Memory Stack (RTIMS) Tak-kwong Ng, Jeffrey Herath Electronics Systems Branch Systems Engineering.
ESS | FPGA for Dummies | | Maurizio Donna FPGA for Dummies Basic FPGA architecture.
1 Advanced Digital Design Reconfigurable Logic by A. Steininger and M. Delvai Vienna University of Technology.
07/11/2005 Register File Design and Memory Design Presentation E CSE : Introduction to Computer Architecture Slides by Gojko Babić.
Chapter 3 System Buses.  Hardwired systems are inflexible  General purpose hardware can do different tasks, given correct control signals  Instead.
REGISTER TRANSFER LANGUAGE (RTL) INTRODUCTION TO REGISTER Registers1.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Xilinx V4 Single Event Effects (SEE) High-Speed Testing Melanie D. Berg/MEI – Principal Investigator Hak Kim, Mark Friendlich/MEI.
Chandrasekhar 1 MAPLD 2005/204 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan.
P201-L/MAPLD SEE Validation of SEU Mitigation Methods for FPGAs Carl Carmichael 1, Sana Rezgui 1, Gary Swift 2, Jeff George 3, & Larry Edmonds 2.
MAPLD 2005/213Kakarla & Katkoori Partial Evaluation Based Redundancy for SEU Mitigation in Combinational Circuits MAPLD 2005 Sujana Kakarla Srinivas Katkoori.
Programmable Logic Devices
1 Introduction to Engineering Fall 2006 Lecture 17: Digital Tools 1.
Sequential Logic Design
Rad (radiation) Hard Devices used in Space, Military Applications, Nuclear Power in-situ Instrumentation Savanna Krassau 4/21/2017 Abstract: Environments.
REGISTER TRANSFER LANGUAGE (RTL)
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
SEU Mitigation Techniques for Virtex FPGAs in Space Applications
Introduction to Microprocessors
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
An Introduction to Microprocessor Architecture using intel 8085 as a classic processor
Interfacing Memory Interfacing.
EE345: Introduction to Microcontrollers Memory
The Xilinx Virtex Series FPGA
Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan1,2 Dr.Adrian Stoica3.
The Xilinx Virtex Series FPGA
Presentation transcript:

Radiation Effects on FPGA and Mitigation Strategies Bin Gui Experimental High Energy Physics Group 1Journal Club4/26/2015

Outline  General introduction of FPGA  Radiation Effects Single Event Upset Single Event Functional Interrupt  Mitigation  Measurements of SEU and SEFI Experiment Set-up Heavy Ion Test Results Proton Test Results  Summary 2Journal Club4/26/2015

Field Programmable Gate Array  The field programmable gate array (FPGA) is a semiconductor device that can be programmed after manufacturing. Instead of being restricted to any predetermined hardware function, an FPGA allows you to program product features and functions, adapt to new standards, and reconfigure hardware for specific applications even after the product has been installed in the field—hence the name "field-programmable".  These devices have always offered significant advantages in flexibility, and recent advances in fabrication have greatly increased logic capacity, substantially increasing the number of applications for this technology.  FPGAs have been an attractive choice in small volume instrumentation and control system electronics. 4/26/2015Journal Club3

The Uses of FPGA 4/26/2015Journal Club4 Sorry, no electricity

FPGA Architecture  The core of the Xilinx vertex series FPGA consists of: An array of configurable logic blocks (CLBs), each of which consists of two slices. Each slice contains two 4 input look up tables for logic generation, two flip flops, and arithmetic carry and clocking functions. Flanking the CLB matrix are two columns of dual port RAM, divided into 4Kbit blocks. The edges of the device are populated by input/output blocks, which support several I/O standards.  This FPGA is based on SRAM technology can be reconfigured at will, allowing unmatched flexibility in the face of changing requirements. 4/26/2015Journal Club5  Unfortunately, the increased density (and corresponding shrinkage of process geometry), has made these devices more susceptible to failure due to external radiation.

Single Event Effects  Single event effects (SEE) - Single Event Effects refer to the fact that it is not a cumulative effect but an effect related to single individual interactions in the silicon. Highly ionizing particles can directly deposit enough charge locally in the silicon to disturb the function of electronic circuits. Single event upset (SEU): The deposited charge is sufficient to flip the value of a digital signal. Single Event Upsets normally refer to bit flips in memory circuits (RAM, Latch, and flip-flop) but may also in some rare cases directly affect digital signals in logic circuits. This is usually reversible. Single event latchup (SEL): Latched change of state of a circuit due to radiation. May need to power cycle to reset. Single event burnout (SEB): Single event burnout refers to destructive failures of power MOSFET transistors in high power applications. Single event functional interrupt (SEFI): Typically, SEFIs are low in occurrence and are almost never seen while in orbit. However, in test environments where event rates are hugely accelerated in order to obtain statistical significance and accurate measurements of events even with negligible cross-sections, SEFIs may be observed. The criterion for a SEFI is that it requires either a complete reconfiguration or power-cycle of the device before returning to normal operation. 4/26/2015Journal Club6

Mitigation  Mitigation involves both repairing altered configuration and logic design that is resistant to failure. Scrubbing refers to the periodic readback of the FPGA’s configuration memory, comparing it to a known good copy, and writing back any corrections required. By periodically scrubbing a device, maximum limits may be placed on the period of time that a configuration error can be present in a device. Triple Module Redundancy, (the most widely used technique) is an effective technique creating fault tolerant logic. 4/26/2015Journal Club7

Triple Module Redundancy  In TMR, the logic of the design can simply be triplicated, with redundant voters on the output. In order to recover smoothly from logic upsets, the internal state of the design must be restored to the repaired logic.  In the feedback counter, the state of the counters is obtained from the output of the voters. This feature has the effect of always presenting the correct state to the counter logic, resulting in the logic being self restoring in the event of an upset and subsequent repair.  TMR does not come without a price. Obviously, designs are at least 3 times as large as a non TMR design, and suffer from speed degradation as well. In particular, feedback TMR degrades the speed of operation by introducing a longer feedback path including the voter. Power consumption is also tripled along with the logic.  The underlying assumption of TMR is that only one upset will occur within a given logic block. This is not always a good assumption to make. Recent testing resulted in approximately.3-.5% of upsets causing multiple bit upsets within the device. 4/26/2015Journal Club8 TMR counter Feedback counter with TMR in the feedback path

 Measurements of SEU and SEFI Experiment Set-up Heavy Ion Test Results Proton Test Results 4/26/2015Journal Club9

FPGAs Used in Measurement 4/26/2015Journal Club10

Test Setup in Vacuum 4/26/2015Journal Club11 The setup for in air testing was essentially the same as in vacuum, the main exception being that the adapted connections for getting through the bulkheads were discarded. Also, USB programming cables were used via high speed hubs for the in-air irradiations.

Latchup Testing – DUT FPGA  DUT – device under test  For the purpose of this experiment, the accepted definition of a latchup was any sudden high current modes resulting from the test run that required a power cycle of the DUT in order to recover.  Because the bottom of the silicon is solder “bumped” to a fully populated ball-grid package, it is difficult to heat the device enough for latchup testing with an external heating element. In order to obtain the target temperature (near 125°C junction temperature) in vacuum, the devices were configured with a “heater” (a long shift-register chain of CLB flip-flops) design meant to increase dynamic current consumption sufficient to heat the transistor junctions to a desired temperature. 4/26/2015Journal Club12

Latchup Testing – Results 4/26/2015Journal Club13

Heavy-Ion Test  The devices were tested at different incidences for an LET (linear energy transfer) range of 1.2–108.7 MeVc m² /mg. A combination of degraders and angles were used to achieve higher LET using the same ion. (How?) 4/26/2015Journal Club14

SEU Results  The data graphs shown in this report all have two sigma statistical error bars plotted.  The static heavy ion SEU response data set has been fit with a Weibull curve function to facilitate Orbital Rate Calculations. The equation below shows this function:  The absolute LET threshold extrapolates to about 1 MeV- cm²/mg (or lower) for both the configuration memory and the block memory. 4/26/2015Journal Club15

Single Event Functional Interrupt  Power-On-Reset (POR) SEFI results in a global reset of all internal storage cells and the loss of all program and state data.  SelectMAP (SMAP) SEFI is the loss of either read or write capabilities through the SelectMAP port.  Frame Address Register (FAR) SEFI results in the frame address register continuously incrementing uncontrollably.  Global Signal SEFI is separated from other design-disrupting SEFIs for the first time in these tests. These signals include GSR (Global Set/Reset), GWE_B (Global Write Enable), GHIGH_B (Global Drive High), and others. They can all be observed through the status (STAT) register or the control (CTL) register.  Readback SEFI occurs when a portion of the readback data has been upset and cannot be corrected.  Scrub SEFI seems to be the result of an upset causing corruption of the data stream being scrubbed into the DUT. 4/26/2015Journal Club16

SEFI Results 4/26/2015Journal Club17

Proton Test Results - SEU 4/26/2015Journal Club18

Proton Test Results - SEFI 4/26/2015Journal Club19

CREME96 Calculated Orbital Upset Rates 4/26/2015Journal Club20

CREME96 Calculated Orbital Upset Rates 4/26/2015Journal Club21

CREME96 Calculated Orbital Upset Rates 4/26/2015Journal Club22

CREME96 Calculated Orbital Upset Rates 4/26/2015Journal Club23

Summary  The SEFI cross sections are low enough to be almost academic.  The space upset rates given in Table 9 are sufficiently low.  Further study on orbital rate calculation.  Considering our experiment (Actel). 4/26/2015Journal Club24

Reference  Radiation effects and mitigation strategies for modern FPGAs  /26/2015Journal Club25