144_C4 / MAPLD04Swift and Roosta1 Tradeoffs in Flight-Design Upset Mitigation in State-of-the-Art FPGAs Hardened By Design vs. Design-Level Hardening Gary.

Slides:



Advertisements
Similar presentations
1January 18, 2006irk Rich Katz, Grunt Engineer NASA Office of Logic Design Some SEE Testing Considerations for the RTAX-S Series Devices.
Advertisements

Melanie Berg MEI Technologies/NASA GSFC
Sana Rezgui 1, Jeffrey George 2, Gary Swift 3, Kevin Somervill 4, Carl Carmichael 1 and Gregory Allen 3, SEU Mitigation of a Soft Embedded Processor in.
Scrubbing Approaches for Kintex-7 FPGAs
Discussion of: “Terrestrial-based Radiation Upsets: A Cautionary Tale” CprE 583 Tony Kuker 12/06/05.
Multi-Bit Upsets in the Virtex Devices Heather Quinn, Paul Graham, Jim Krone, Michael Caffrey Los Alamos National Laboratory Gary Swift, Jeff George, Fayez.
Radiation Effects on FPGA and Mitigation Strategies Bin Gui Experimental High Energy Physics Group 1Journal Club4/26/2015.
HPEC 2012 Scrubbing Optimization via Availability Prediction (SOAP) for Reconfigurable Space Computing Quinn Martin Alan George.
Complex Upset Mitigation Applied to a Re-Configurable Embedded Processor EEL 6935 Lu Hao Wenqian Wu.
1 Fault Tolerant FPGA Co-processing Toolkit Oral defense in partial fulfillment of the requirements for the degree of Master of Science 2006 Oral defense.
ICAP CONTROLLER FOR HIGH-RELIABLE INTERNAL SCRUBBING Quinn Martin Steven Fingulin.
EELE 367 – Logic Design Module 2 – Modern Digital Design Flow Agenda 1.History of Digital Design Approach 2.HDLs 3.Design Abstraction 4.Modern Design Steps.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Programmable Logic Devices
Spartan II Features  Plentiful logic and memory resources –15K to 200K system gates (up to 5,292 logic cells) –Up to 57 Kb block RAM storage  Flexible.
FPGA Design Using the LEON3 Fault Tolerant Processor Core
L189/MAPLD2004Carmichael 1 A Triple Module Redundancy Scheme for SEU Mitigation of Static Latch-Based FPGAs (“Birds-of-a-Feather”) Carl Carmichael 1, Brendan.
Programmable logic and FPGA
Introduction to Field Programmable Gate Arrays (FPGAs) COE 203 Digital Logic Laboratory Dr. Aiman El-Maleh College of Computer Sciences and Engineering.
®. ® Rad Hard Products for Satellites and Space ® QPRO for Avionics  Standard QPRO products immune to upsets in avionics environment.
Radiation Effects and Mitigation Strategies for modern FPGAs 10 th annual workshop for LHC and Future experiments Los Alamos National Laboratory, USA.
FLASH Mitigation Strategies for Space Applications Charles Howard Southwest Research Institute.
BR 1/001 Implementation Technologies We can implement a design with many different implementation technologies - different implementation technologies.
12004 MAPLD: 141Buchner Single Event Effects Testing of the Atmel IEEE1355 Protocol Chip Stephen Buchner 1, Mark Walter 2, Moses McCall 3 and Christian.
Lecture 2: Field Programmable Gate Arrays September 13, 2004 ECE 697F Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays.
MAPLD 2005 / E134 Rockett A 0.15  M Radiation-Hardened Antifuse Field Programmable Gate Array Technology The RH AX250-S production installation effort.
I n t e g r a t e d D e s i g n C e n t e r / M I s s I o n D e s I g n L a b o r a t o r y N A S A G O D D A R D S P A C E F L I G H T C E N T E R Do.
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
FPGA IRRADIATION and TESTING PLANS (Update) Ray Mountain, Marina Artuso, Bin Gui Syracuse University OUTLINE: 1.Core 2.Peripheral 3.Testing Procedures.
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 4 Programmable.
2004 MAPLD, Paper 190 JJ Wang 1 SEU-Hardened Storage Devices in a 0.15 µm Antifuse FPGA – RTAX-S J. J. Wang 1, B. Cronquist 1, J. McCollum 1, R. Gorgis.
M. Adinolfi – University of Oxford – MAPMT Workshop – Imperial College 27 June Rad-Hard qualification for the LHCb RICH L0 electronics M. Adinolfi.
Presented by Anthony B. Sanders NASA/GSFC at 2005 MAPLD Conference, Washington, DC #196 1 ALTERA STRATIX TM EP1S25 FIELD-PROGRAMMABLE GATE ARRAY (FPGA)
Reconfiguration Based Fault-Tolerant Systems Design - Survey of Approaches Jan Balach, Jan Balach, Ondřej Novák FIT, CTU in Prague MEMICS 2010.
Leo Greiner IPHC meeting HFT PIXEL DAQ Prototype Testing.
J. Christiansen, CERN - EP/MIC
® SPARTAN Series High Volume System Solution. ® Spartan/XL Estimated design size (system gates) 30K 5K180K XC4000XL/A XC4000XV Virtex S05/XL.
ATMEL ATF280E Rad Hard SRAM Based FPGA SEE test results Application oriented SEU Sensitiveness Bernard BANCELIN ATMEL Nantes SAS, Aerospace Business Unit.
2/2/2009 Marina Artuso LHCb Electronics Upgrade Meeting1 Front-end FPGAs in the LHCb upgrade The issues What is known Work plan.
P173/MAPLD 2005 Swift1 Upset Susceptibility and Design Mitigation of PowerPC405 Processors Embedded in Virtex II-Pro FPGAs.
MAPLD 2005/202 Pratt1 Improving FPGA Design Robustness with Partial TMR Brian Pratt 1,2 Michael Caffrey, Paul Graham 2 Eric Johnson, Keith Morgan, Michael.
An Unobtrusive Debugging Methodology for Actel AX and RTAX-S FPGAs Jonathan Alexander Applications Consulting Manager Actel Corporation MAPLD 2004.
P189/MAPLD2004Carmichael 1 A Triple Module Redundancy Scheme for SEU Mitigation of Static Latch-Based FPGAs Carl Carmichael 1, Brendan Bridgford 1, Gary.
MooreC142/MAPLD Single Event Effects (SEE) Test Results on the Virtex-II Digital Clock Manager (DCM) Jason Moore 1, Carl Carmichael 1, Gary Swift.
High Performance Embedded Computing (HPEC) Workshop 23−25 September 2008 John Holland & Eliot Glaser Northrop Grumman Corporation P.O. Box 1693 Baltimore,
Field Programmable Port Extender (FPX) 1 Modular Design Techniques for the FPX.
29 May 2008 Exploration Technology Development Program’s Radiation Hardened Electronics for Space Environments (RHESE) Andrew S. Keys, James H. Adams,
Petrick_P2261 Virtex-II Pro SEE Test Methods and Results David Petrick 1, Wesley Powell 1, James Howard 2 1 NASA Goddard Space Flight Center, Greenbelt,
LaRC MAPLD 2005 / A208 Ng 1 Radiation Tolerant Intelligent Memory Stack (RTIMS) Tak-kwong Ng, Jeffrey Herath Electronics Systems Branch Systems Engineering.
Greg Alkire/Brian Smith 197 MAPLD An Ultra Low Power Reconfigurable Task Processor for Space Brian Smith, Greg Alkire – PicoDyne Inc. Wes Powell.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
A4 1 Barto "Sequential Circuit Design for Space-borne and Critical Electronics" Dr. Rod L. Barto Spacecraft Digital Electronics Richard B. Katz NASA Goddard.
2004 MAPLD, Paper 1008 Sanders 1 Radiation-Hardened re-programmable Field- Programmable Gate Array (RHrFPGA) A.B. Sanders 1, K.A. LaBel 1, J.F. McCabe.
Delivered by.. Love Jain p08ec907. Design Styles  Full-custom  Cell-based  Gate array  Programmable logic Field programmable gate array (FPGA)
Nishinaga No. 1 MAPLD2005/1003-J Availability Analysis of Xilinx FPGA on Orbit Nozomu Nishinaga National Institute of Information and Communications Technology.
Backside Device Irradiation for Single Event Upset Tests of Advanced Devices Gary M. Swift Jet Propulsion Laboratory/ California Institute of Technology.
Xilinx V4 Single Event Effects (SEE) High-Speed Testing Melanie D. Berg/MEI – Principal Investigator Hak Kim, Mark Friendlich/MEI.
Chandrasekhar 1 MAPLD 2005/204 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan.
P201-L/MAPLD SEE Validation of SEU Mitigation Methods for FPGAs Carl Carmichael 1, Sana Rezgui 1, Gary Swift 2, Jeff George 3, & Larry Edmonds 2.
MAPLD 2005/213Kakarla & Katkoori Partial Evaluation Based Redundancy for SEU Mitigation in Combinational Circuits MAPLD 2005 Sujana Kakarla Srinivas Katkoori.
Rad (radiation) Hard Devices used in Space, Military Applications, Nuclear Power in-situ Instrumentation Savanna Krassau 4/21/2017 Abstract: Environments.
CFTP ( Configurable Fault Tolerant Processor )
SEU Mitigation Techniques for Virtex FPGAs in Space Applications
Upgrade of the ATLAS MDT Front-End Electronics
Electronics for Physicists
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan1,2 Dr.Adrian Stoica3.
Upset Susceptibility and Design Mitigation of
Xilinx Kintex7 SRAM-based FPGA
Rad Hard Products for Satellites and Space
Presentation transcript:

144_C4 / MAPLD04Swift and Roosta1 Tradeoffs in Flight-Design Upset Mitigation in State-of-the-Art FPGAs Hardened By Design vs. Design-Level Hardening Gary M. Swift and Ramin Roosta Jet Propulsion Laboratory / California Institute of Technology The research done in this paper was carried out at the Jet Propulsion Laboratory, California Institute of Technology, under contract with the National Aeronautics and Space Administration (NASA) and was partially sponsored by the NASA Electronic Parts and Packaging Program. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise, does not constitute or imply its endorsement by the United States Government or the Jet Propulsion Laboratory, California Institute of Technology.

144_C4 / MAPLD04Swift and Roosta2 In the beginning was Actel … Leveraging from a commercial product line  ONO anti-fuse based  one-time programmable (OTP) “beginning” = 1993  Reference: Katz, R.; Barto, R.; McKerracher, P.; Carkhuff, B.; Koga, R.; “SEU hardening of field programmable gate arrays (FPGAs) for space applications and device characterization,” IEEE Transactions on Nuclear Science, Dec. 1994

144_C4 / MAPLD04Swift and Roosta3 Later, Xilinx Leveraging from a commercial product line  SRAM based  reconfigurable “later” = 1998  Reference: Guertin, S.M.; Swift, G.M.; Nguyen, D.; “Single-event upset test results for the Xilinx XQ1701L PROM”, Radiation Effects Data Workshop Record, 1999  Quote: (Xilinx SRAM-based FPGAs)… “do appear suited to a broad range of other (non-critical) applications, such as sensor and camera controllers.”

144_C4 / MAPLD04Swift and Roosta4 OUTLINE FPGAs: A key enabling technology for modern spacecraft Background in radiation testing of FPGAs ▫Earlier, Katz/Swift collaboration ▫Recently, Xilinx Consortium Feature Comparison Triple Modular Redundancy (TMR) - hardware approach vs. software approach Concluding Remarks

144_C4 / MAPLD04Swift and Roosta5 FPGAs: A key “enabling technology” Like custom ASICs, FPGAs can replace whole boards  Saving mass, volume, power  Achieving extra functionality FPGAs are much cheaper than ASICs  Design efforts can be later in the schedule  Design mistakes don’t require a re-spin through the foundry

144_C4 / MAPLD04Swift and Roosta6 MER Pyro-Controller Used self-checking of configuration to initiate a re- configuration after spotting an upset

144_C4 / MAPLD04Swift and Roosta7 MER Pyro-Controller Nearing Mars Xilinx XQR4062XL

144_C4 / MAPLD04Swift and Roosta8 My Background Actel experience is older  No direct involvement in radiation tests since the ONO anti-fuse was replaced  Results here are from others’ work Xilinx experience is recent  Active participant in Xilinx Rad Test Consortium  Currently, finishing two+ year test campaign targeting the Virtex II family

144_C4 / MAPLD04Swift and Roosta9 Currently Available Devices Actel RT54SX-S family vs. Xilinx Virtex II family (-SU) Note: both are essentially immune to single-event latchup and have good total ionizing dose tolerance, [ Actel > 135 krad(Si); Xilinx > 200 krad(Si) ]

144_C4 / MAPLD04Swift and Roosta10 Main Feature Comparison Actel Xilinx RT54SX72S XQR2V6000 Gates: 72,000~6M ( /~3.2 ) flip/flops: ,584 / 3.2 = ~20k I/O Pins: / 3 = 274 Speed external : 230 MHz 622 Mb/s (I-mode LVDS) Speed internal : 310 MHz360 MHz

144_C4 / MAPLD04Swift and Roosta11 Extra Features Comparison ActelXilinx RT54SX72S XQR2V6000 Block RAM:no2.5 Mb I/O Standards:manymany Others: hardwired TMR Clock Manager Multipliers

144_C4 / MAPLD04Swift and Roosta12 Actel: What bits can upset? User flip-flops only  Direct hits of same flip/flop in multiple domains ▫Very unlikely due to layout  Clock domain hits SEFI modes essentially eliminated

144_C4 / MAPLD04Swift and Roosta13 Xilinx: What bits can upset? Configuration Bits  Logical Function  Routing  User Options Block RAM User Flip-flops Control Registers × Type of I/O × Mode of Block RAM Access × Clock Manager × etc… × NAND × Ex-OR × Flip-Flop type × etc…

144_C4 / MAPLD04Swift and Roosta14 Xilinx: Heavy Ion Test Results Resulting in fairly low in-space rates: ~6 per day for 2V6000 in GCR min. Low Threshold (soft) Low Susceptibility (hard)

144_C4 / MAPLD04Swift and Roosta15 Actel: Heavy Ion Test Results Very low in-space rates (assume LET th > 40 achieved): ~1 per 6800 years for SX72-S in GCR min. Where’s Threshold ??? Low Susceptibility (~100x harder) Data for two RTAX2000S prototypes at 1 MHz using checkerboard pattern from Fig. 12, JJ Wang et al., NSREC 2003 [Ref. 1]

144_C4 / MAPLD04Swift and Roosta16 Actel-style TMR SX-A “R” cell triplicates to: RTSX-S “R” cell

144_C4 / MAPLD04Swift and Roosta17 Actel-style TMR Actel-style TMR is fairly straightforward:  Each flip-flop is replaced by three plus feedback voter  Triplicated elements spread out physically  Uses one clock/inverse-clock domain  No external parts needed

144_C4 / MAPLD04Swift and Roosta18 Xilinx-style TMR Xilinx-style TMR is more complicated:  First, it’s not too useful without configuration scrubbing  Whole functional blocks are triplicated, not individual flip-flops  Three voters are used  Three clock domains  Elimination of: ▫Weak keepers (aka half latches) ▫Use of configuration cells as part of the design -For example, SRL16  Needs some external circuitry (at least, a watchdog timer + PROMs)

144_C4 / MAPLD04Swift and Roosta19 Xilinx-style TMR

144_C4 / MAPLD04Swift and Roosta20 Xilinx-style TMR In Xilinx-style TMR, I/O’s use three pins tied externally : Minority Voter P P P D0 D1 D2 D Pins Board Traces

144_C4 / MAPLD04Swift and Roosta21 Xilinx TMRtool Xilinx-style TMR done by hand is difficult and tedious An automated tool which integrates into the design flow has been developed (“now” available) In-beam testing shows tool is very effective

144_C4 / MAPLD04Swift and Roosta22 Upset Comparison ATMR now has eliminated:  Upsets of static storage elements, and  SEFIs ATMR upsets from:  Transients that are clocked into storage  Clock tree hits Xilinx FPGAs have a small susceptibility to two types of SEFIs  Reset (sometimes only partial)  Disable scrub port XTMR in combination with scrubbing can lower system upset rates below the SEFI rate

144_C4 / MAPLD04Swift and Roosta23 Rate Comparison GCR = Galactic Cosmic Ray background (interplanetary space) almost identical to geosynchronous orbit Actel Dominated by transients Roughly one system error per thousand years (GCR min ) Xilinx Dominated by SEFI rate Expect one SEFI per ~65 years in GCR min Expect one system error ~5-20x less often

144_C4 / MAPLD04Swift and Roosta24 CONCLUSIONS For the present – Both can achieve very acceptable radiation tolerance Actel wins on: ▫Less burden on the designer ▫No auxiliary components ▫Lower SEFI susceptibility Xilinx wins on: ▫Designer control of the resources vs. hardness tradeoff ▫On-chip feature set ▫Re-configurability Competition is good.

144_C4 / MAPLD04Swift and Roosta25 Acronyms FPGA - Field Programmable Gate Array ASIC - Application Specific Integrated Circuit SEU - Single Event Upset SEFI - Single Event Functionality Interrupt TMR - Triple Modular Redundancy ATMR - Actel-style TMR XTMR - Xilinx-style TMR LET - Linear Energy Transfer (proportional to deposited charge per micron for a heavy ion strike on an active node) GCR min - Galactic Cosmic Ray background (highest during “solar minimum” period of ~11-yr cycle of sunspots) MER - Mars Exploration Rovers (i.e., Spirit and Opportunity)

144_C4 / MAPLD04Swift and Roosta26 Additional References [1] J.J. Wang, W. Wong, S. Wolday, B. Cronquist, J. McCollum, R. Katz, I. Kleyner, “Single event upset and hardening in 0.15  antifuse-based field programmable gate array,” IEEE Transactions on Nuclear Science, Dec [2] Jih-Jong Wang, R.B. Katz, F. Dhaoui, J.L. McCollum, W. Wong, B.E. Cronquist, R.T. Lambertson, E. Hamdy, I. Kleyner, W. Parker, “Clock buffer circuit soft errors in antifuse-based field programmable gate arrays,” IEEE Transactions on Nuclear Science, Dec [3] R. Katz, J.J. Wang, R. Koga, K.A. LaBel, J. McCollum, R. Brown, R.A. Reed, B. Cronquist, S. Crain, T. Scott, W. Paolini, B. Sin, “Current radiation issues for programmable elements and devices,” IEEE Transactions on Nuclear Science, Dec. 1998