Using reconfigurable FPGAs in radioactive environments: challenges and possible solutions Massimo Violante Politecnico di Torino Dip. Automatica e Informatica.

Presentation transcript:

Using reconfigurable FPGAs in radioactive environments: challenges and possible solutions
Massimo Violante
Politecnico di Torino, Dip. Automatica e Informatica, Torino, Italy

FPGA structure/technology (before programming)
[Figure: logic blocks and interconnections; configuration elements: antifuse, Flash, SRAM]
M. Violante - TWEPP 2012

FPGA structure/technology (after programming)
[Figure: logic blocks and interconnections; configuration elements: antifuse, Flash, SRAM]

Why FPGAs?
- Antifuse FPGAs are used heavily because they allow a shorter time to market and lower costs for small volumes than ASICs
  - No versatility (one-time programmable)
- SRAM-/Flash-based FPGAs are reprogrammable
- The benefits of versatility:
  - Reconfigurable computing
  - Feature improvements over the years
  - Bug fixing (!)
Source: Microsemi

Bug fixing
[Figure: buggy chip]

Reconfigurable FPGAs vs radiation
- Most reconfigurable FPGAs are soft with respect to radiation
- To use them in radioactive environments it is necessary to:
  - Understand the effects from the designer's perspective
  - Understand if and why mitigation techniques may fail
  - Define validation flows

Outline
- Radiation effects in SRAM-/Flash-based FPGAs
- Design mitigation issues
- Design validation
- Conclusions


Effects relevant for FPGAs
- Single Event Effects (SEE)
  - Soft errors: Single Event Transient (SET), Single Event Upset (SEU), Single Event Functional Interrupt (SEFI)
  - Hard errors: Single Event Latchup (SEL), Single Event Gate Rupture (SEGR), Single Event Burnout (SEB)
- Total Ionizing Dose (TID)
- Displacement Damage (DD)
[Figure legend: addressed in this talk]

SRAM-based FPGA architecture
[Figure: Xilinx Virtex-4QV with BRAM, PowerPC, DSP, and CLB resources; Boolean function F(A,B,C,D)]

SEU in SRAM-based FPGAs: CLB slice
[Figure: CLB slice with inputs I1 to I4, LUT, routing, and configuration memory bits]
- Persistent effect (corrected by reconfiguration)
- Transient effect (corrected at the next flip-flop load)

SRAM-based FPGA: General Routing Matrix (GRM)
[Figure: Xilinx Virtex-4QV routing between CLBs: direct connections, hex connections, direct lines, double lines, hex lines, long lines, fast connect]

SEU in SRAM-based FPGAs: routing configuration cells
[Figure: Xilinx Virtex-4QV direct connections and hex connections; a flipped routing configuration bit creates a short or an open]
- Persistent effect (corrected by reconfiguration)

Flash-based FPGA
[Figure: Microsemi ProASIC3]

SEE sensitivity
- The configurable logic block is called VersaTile
- Effect 1: SET in the VersaTile logic

SEE sensitivity
- The configurable logic block is called VersaTile
- Effect 2: SEU in the VersaTile flip-flop

SEE sensitivity
- Floating Gate (FG) switch
- Effect 3: SET in the logic path, SET in the routing path

What to remember so far
- SRAM-based FPGAs are soft against radiation
  - User logic (SET)
  - User memory (SEU, MBU)
  - Control logic (SEU, SEFI)
  - Configuration memory (SEU, MBU)
- Flash-based FPGAs are soft against radiation
  - User logic (SET)
  - User memory (SEU, MBU)
  - Control logic (SEU, SEFI)

Outline
- Radiation effects in SRAM-/Flash-based FPGAs
- Design mitigation issues
- Design validation
- Conclusions

Problems and solutions
- The problems: SEU, SET, SEL, SEFI, TID
- The solutions:
  - Device-level solutions: make the device design rad tolerant
  - Design-level solutions: make your design rad tolerant
- Which is the best solution?

Which is the best solution?
- From the designer's perspective the answer is easy: device-level solutions
  - Problem solved at the root
  - No extra effort needed to design for SEE mitigation and to validate the resulting design
- However, few devices are ready (?) today
  - Atmel AT280 (SRAM-based, old concept, poor back-end tools)
  - Xilinx Virtex-5QV (SRAM-based, ITAR restricted, expensive)
  - No Flash-based device available

A pragmatic compromise
- Select, among commercial devices, those that are immune to TID and SEL
- Design your application for SEE mitigation using:
  - An appropriate system architecture for SEE removal
  - An appropriate circuit architecture for SEE masking

System architecture
- The payload FPGA on-chip configuration is refreshed periodically
  - SRAM-based FPGAs: to remove SEEs in the configuration memory
  - Flash-based FPGAs: to anneal TID effects
- The refresh period depends on the radiation environment
[Figure: payload FPGA, configuration memory backup, system controller, config bus]
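
A rough way to reason about the refresh period is to ask how likely it is that more than one upset accumulates between two refreshes. The sketch below assumes upsets arrive as a Poisson process, which is a modeling assumption and not something stated in the talk. The function name and the upset rate of one per day are hypothetical placeholders, not measured data.

```python
import math

def prob_multiple_upsets(upset_rate_per_s: float, scrub_period_s: float) -> float:
    """P(at least two upsets within one refresh period), Poisson model (assumption)."""
    mean = upset_rate_per_s * scrub_period_s
    # 1 - P(0 upsets) - P(1 upset)
    return 1.0 - math.exp(-mean) * (1.0 + mean)

# Hypothetical rate: one configuration upset per day (illustrative only).
rate = 1.0 / 86_400.0
for period in (1.0, 60.0, 3600.0):
    p = prob_multiple_upsets(rate, period)
    print(f"refresh period {period:6.0f} s -> P(>=2 upsets) = {p:.3e}")
```

A shorter period lowers the chance of accumulated upsets, at the cost of more traffic on the configuration bus.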

Architecture for SEE masking
[Figure: your design, made of blocks D1.1 and D1.2]

Architecture for SEE masking
[Figure: your design triplicated into TMR domains (D1.x, D2.x, D3.x), with voters V1, V2, V3 and voter partitions]
- In SRAM-based FPGAs the triplicated block is logic + FF
- In Flash-based FPGAs it is only FF

Architecture for SEE masking
- All masking techniques are based on the single-fault assumption (1 SEE = 1 fault in the design)
- But an SEE in the configuration memory may produce multiple faults

An example: original circuit
[Figure: the bitstream and the original netlist]

An example: single effect
[Figure: the bitstream (flipped bit marked *, pattern 010) and the corrupted netlist]
- An open circuit is created

An example: multiple effects
[Figure: the bitstream (flipped bit marked *, pattern 101) and the corrupted netlist]
- A short circuit is created

Why may TMR fail?
- The SEE modifies the same signal in two domains → the SEE produces multiple effects that are not masked by the voters
[Figure: Domain 1 and Domain 2 in the original netlist and in the SEE-corrupted netlist]
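
The failure mode can be illustrated with a bitwise 2-out-of-3 majority voter. This is a minimal sketch: the `majority` function and the 4-bit values are illustrative and do not come from the talk.

```python
def majority(a: int, b: int, c: int) -> int:
    """Bitwise 2-out-of-3 majority vote over the three TMR domains."""
    return (a & b) | (a & c) | (b & c)

golden = 0b1011      # fault-free value of the signal (illustrative)
fault = 0b0100       # bit position hit by the upset (illustrative)

# Single-fault assumption holds: only domain 1 is corrupted, the voter masks it.
single = majority(golden ^ fault, golden, golden)

# Domain crossing: the same signal is corrupted in domains 1 and 2,
# so the wrong value wins the vote.
double = majority(golden ^ fault, golden ^ fault, golden)

print(f"single fault: {single:04b}, masked: {single == golden}")
print(f"double fault: {double:04b}, masked: {double == golden}")
```

The voter masks any fault confined to one domain, but a single configuration bit that alters two domains at once defeats it.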

An example
- Design: TMR design (in theory any SEE should be mitigated)
- Fault injection in the configuration memory (about 20 Mbits)
Resource              Failures
LUT                         71
Global routing           3,503
CLB local routing           53
CLB configuration            1
Total                    3,628

What to remember so far
- SRAM-/Flash-based FPGAs may be OK for radioactive environments provided that:
  - A proper device is selected (TID, SEL)
  - Design mitigation is used
- SEE mitigation is needed → huge costs
  - 3x FFs, 3x IO, >4x user logic, >20% on clock frequency
- Mitigation may fail due to multiple effects of SEEs in the configuration memory → validation is needed

Outline
- Radiation effects in SRAM-/Flash-based FPGAs
- Design mitigation issues
- Design validation
- Conclusions

Validation approaches
- Qualitative validation via design inspection before place & route
- Quantitative validation after place & route
  - Simulation-based validation
  - Emulation-based validation
- Main issue in quantitative validation: the number of faults to be simulated
  - 20 Mbits in config. mem., 1 M functional input vectors at 100 MHz → about 2.3 days to perform exhaustive fault injection
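
The 2.3-day estimate can be reproduced with a few lines of arithmetic. The sketch assumes one injection run per configuration bit, one input vector applied per clock cycle, and 20 Mbits taken as 20e6 bits. The function name is illustrative.

```python
def exhaustive_injection_days(config_bits: int, input_vectors: int, clock_hz: float) -> float:
    """Days needed to replay the full workload once per configuration bit."""
    seconds = config_bits * input_vectors / clock_hz
    return seconds / 86_400

days = exhaustive_injection_days(
    config_bits=20_000_000,     # about 20 Mbits of configuration memory
    input_vectors=1_000_000,    # 1 M functional input vectors
    clock_hz=100e6,             # 100 MHz
)
print(f"{days:.1f} days")
```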

[Figure: # of SEU versus # of input vectors; labels: design-oriented configuration memory analysis, static analysis]

Config. mem. analysis
- Reverse engineer the configuration memory of the FPGA of choice
[Figure: configuration bitstream, FPGA resources, configuration memory bit layout]

Config. mem. analysis
1. Read the placed and routed design and build the netlist/bitstream association
2. For each bit of the bitstream:
   A. Flip the bit and update the netlist accordingly
   B. Is the original netlist corrupted (does the error reach the outputs or a memory element)?
      I. Yes → the bit is sensitive
      II. No → the bit is not sensitive
- The analysis looks at the error propagation path and does not consider the workload
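
The loop above can be sketched on a toy netlist. This is only an illustration under strong assumptions: the netlist is a plain fan-out graph, each configuration bit maps to at most one node, and a flipped bit is taken to corrupt that node. All names (`fanout`, `bit_to_node`, `sensitive_bits`) are hypothetical and unrelated to any real tool.

```python
from collections import deque

def reaches_observable(fanout, start, observable):
    """Breadth-first search from the fault site to any output or memory element."""
    seen, queue = {start}, deque([start])
    while queue:
        node = queue.popleft()
        if node in observable:
            return True
        for nxt in fanout.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return False

def sensitive_bits(bit_to_node, fanout, observable):
    """List the bits whose flip corrupts a node with a path to an observable point."""
    result = []
    for bit, node in sorted(bit_to_node.items()):
        # None marks a configuration bit that the design does not use.
        if node is not None and reaches_observable(fanout, node, observable):
            result.append(bit)
    return result

# Toy design: lut_a feeds a flip-flop, lut_b drives a dangling net.
fanout = {"lut_a": ["net_1"], "net_1": ["ff_q"], "lut_b": ["net_2"], "net_2": []}
observable = {"ff_q", "out_0"}            # outputs and memory elements
bit_to_node = {0: "lut_a", 1: "net_1", 2: "lut_b", 3: None}

print(sensitive_bits(bit_to_node, fanout, observable))
```

Because the check is purely structural, it needs no input vectors, which matches the remark that the analysis does not consider the workload.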

Operational modes
- Discovery mode: analyzes the bitstream while neglecting mitigation schemes
  - Lists sensitive bits
- TMR mode: analyzes the bitstream while automatically recognizing the (X)TMR mitigation scheme
  - Lists bits that violate the (X)TMR scheme (domain crossing events)
  - Lists bits that produce warnings (may lead to domain crossing events in case of accumulation)

Domain crossing events
[Figure: TMR domains D1.x, D2.x, D3.x, voters V1, V2, V3, and voter partitions]

Domain crossing events
- One Single Event Upset (SEU) in the configuration memory provokes two circuit modifications in two TMR domains in the same TMR partition → the fault propagates beyond the voter boundary

Warnings
- One SEE in the configuration memory provokes two circuit modifications in two voter partitions → the fault stops at the voter boundary

TMR-mode algorithm
- The algorithm automatically recognizes TMR domains, voters, and voter partitions
- Forward error propagation:
  1. Find all the paths from the fault site to the circuit outputs or memory elements
  2. Is the fault propagating to only one of the voter inputs?
     A. Yes → the bit is not sensitive
     B. No → the fault propagates to at least two inputs of a voter in the same partition → the bit is sensitive
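
The decision step can be sketched as a small classifier. It assumes that forward propagation has already produced, for each flipped bit, the list of voter inputs it reaches, expressed as (partition, domain) pairs. That representation, the function name, and the treatment of warnings as "two partitions reached" are one reading of the slides, not the tool's actual implementation.

```python
from collections import defaultdict

def classify_bit(hits):
    """Classify one configuration bit from the (partition, domain) voter inputs it reaches."""
    domains_per_partition = defaultdict(set)
    for partition, domain in hits:
        domains_per_partition[partition].add(domain)
    # Two or more domains hit in the same partition: the voter is outvoted.
    if any(len(domains) >= 2 for domains in domains_per_partition.values()):
        return "sensitive"
    # Several partitions hit, one domain each: every voter still masks the fault.
    if len(domains_per_partition) >= 2:
        return "warning"
    return "not sensitive"

print(classify_bit([(1, 1)]))            # one voter input only
print(classify_bit([(1, 1), (1, 2)]))    # two domains, same partition
print(classify_bit([(1, 1), (2, 2)]))    # two partitions, one domain each
```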

The report
- A detailed report is produced for Xilinx devices
    Resource: PIP
    Block Adr 0  Maj Add 6  Min Add 14  Bit 156
    Involved PIP: Y1 -- S2BEG2
    FAR: 0x000c1c00  Bit: 156
    Net = data_bus_IBUF_TR

Example
- X-TMR LEON3 processor on Xilinx xc2v Mbits in config. mem., 1 M functional input vectors at 100 MHz
- 2,603,950 bits are SEE-sensitive for the design (computed in about 2 hours vs 2.3 days)
- 3,628 SEUs lead to actual application failure for the considered workload (fault injection completes in about 7 hours)

Complete design flow
[Figure: design flow with input design, XST synthesis, TMR tool, output design, PAR, bitstream, STAR (list of sensitive bits), VPLACE (robust placement), RoRA/PAR (robust bitstream), FLIPPER (workload, fault coverage)]

Outline
- Radiation effects in SRAM-based FPGAs
- Design mitigation issues
- Design validation
- Conclusions

Conclusions
- SRAM-/Flash-based FPGAs are very attractive for bringing reconfiguration to radioactive environments
- Bullet-proof (i.e., rad-hard) devices are not ready
- Solutions are available based on rad-tolerant devices (no TID/no SEL), however:
  - It is the designer's responsibility to implement mitigation
  - It is the designer's responsibility to validate the mitigation
- Zero failures may not be possible, so estimating the residual error rate is mandatory

Acknowledgment
- Monica Alderighi, Niccolò Battezzati, Fabio Casini, Fernanda Lima Kastensmidt, David Merodio Codinachs, Luca Sterpone
- Atmel, France; Boeing Satellite Systems, USA; EADS-IW, France; European Space Agency, The Netherlands; Thales Alenia Space, Italy