March 21, 2014 Workshop on FPGAs for High-Energy Physics Single-Event Effects Testing and the Xilinx Radiation Test Consortium presenter: Gary Swift.

Slides:



Advertisements
Similar presentations
IHP Im Technologiepark Frankfurt (Oder) Germany IHP Im Technologiepark Frankfurt (Oder) Germany ©
Advertisements

FPGA Configuration. Introduction What is configuration? – Process for loading data into the FPGA Configuration Data Source Configuration Data Source FPGA.
Sana Rezgui 1, Jeffrey George 2, Gary Swift 3, Kevin Somervill 4, Carl Carmichael 1 and Gregory Allen 3, SEU Mitigation of a Soft Embedded Processor in.
Scrubbing Approaches for Kintex-7 FPGAs
Discussion of: “Terrestrial-based Radiation Upsets: A Cautionary Tale” CprE 583 Tony Kuker 12/06/05.
Multi-Bit Upsets in the Virtex Devices Heather Quinn, Paul Graham, Jim Krone, Michael Caffrey Los Alamos National Laboratory Gary Swift, Jeff George, Fayez.
Radiation Effects on FPGA and Mitigation Strategies Bin Gui Experimental High Energy Physics Group 1Journal Club4/26/2015.
HPEC 2012 Scrubbing Optimization via Availability Prediction (SOAP) for Reconfigurable Space Computing Quinn Martin Alan George.
Complex Upset Mitigation Applied to a Re-Configurable Embedded Processor EEL 6935 Lu Hao Wenqian Wu.
1 Fault Tolerant FPGA Co-processing Toolkit Oral defense in partial fulfillment of the requirements for the degree of Master of Science 2006 Oral defense.
ICAP CONTROLLER FOR HIGH-RELIABLE INTERNAL SCRUBBING Quinn Martin Steven Fingulin.
Nishinaga No. 1 MAPLD2005 Availability Analysis of Xilinx FPGA on Orbit Nozomu Nishinaga National Institute of Information and Communications Technology.
DC/DC Switching Power Converter with Radiation Hardened Digital Control Based on SRAM FPGAs F. Baronti 1, P.C. Adell 2, W.T. Holman 2, R.D. Schrimpf 2,
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
L189/MAPLD2004Carmichael 1 A Triple Module Redundancy Scheme for SEU Mitigation of Static Latch-Based FPGAs (“Birds-of-a-Feather”) Carl Carmichael 1, Brendan.
Fault-Tolerant Softcore Processors Part I: Fault-Tolerant Instruction Memory Nathaniel Rollins Brigham Young University.
6 June 2002UK/HCAL common issues1 Paul Dauncey Imperial College Outline: UK commitments Trigger issues DAQ issues Readout electronics issues Many more.
Configuration. Mirjana Stojanovic Process of loading bitstream of a design into the configuration memory. Bitstream is the transmission.
1. 2 FPGAs Historically, FPGA architectures and companies began around the same time as CPLDs FPGAs are closer to “programmable ASICs” -- large emphasis.
Using Programmable Logic to Accelerate DSP Functions 1 Using Programmable Logic to Accelerate DSP Functions “An Overview“ Greg Goslin Digital Signal Processing.
1 DIGITAL DESIGN I DR. M. MAROUF FPGAs AUTHOR J. WAKERLY.
®. ® Rad Hard Products for Satellites and Space ® QPRO for Avionics  Standard QPRO products immune to upsets in avionics environment.
Radiation Effects and Mitigation Strategies for modern FPGAs 10 th annual workshop for LHC and Future experiments Los Alamos National Laboratory, USA.
144_C4 / MAPLD04Swift and Roosta1 Tradeoffs in Flight-Design Upset Mitigation in State-of-the-Art FPGAs Hardened By Design vs. Design-Level Hardening Gary.
Lecture #3 Page 1 ECE 4110– Sequential Logic Design Lecture #3 Agenda 1.FPGA's 2.Lab Setup Announcements 1.No Class Monday, Labor Day Holiday 2.HW#2 assigned.
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
FPGA IRRADIATION and TESTING PLANS (Update) Ray Mountain, Marina Artuso, Bin Gui Syracuse University OUTLINE: 1.Core 2.Peripheral 3.Testing Procedures.
2004 MAPLD, Paper 190 JJ Wang 1 SEU-Hardened Storage Devices in a 0.15 µm Antifuse FPGA – RTAX-S J. J. Wang 1, B. Cronquist 1, J. McCollum 1, R. Gorgis.
M. Adinolfi – University of Oxford – MAPMT Workshop – Imperial College 27 June Rad-Hard qualification for the LHCb RICH L0 electronics M. Adinolfi.
Presented by Anthony B. Sanders NASA/GSFC at 2005 MAPLD Conference, Washington, DC #196 1 ALTERA STRATIX TM EP1S25 FIELD-PROGRAMMABLE GATE ARRAY (FPGA)
Lessons Learned The Hard Way: FPGA  PCB Integration Challenges Dave Brady & Bruce Riggins.
NTD/xNTD Signal Processing Presented by: John Bunton Signal Processing team: Joseph Pathikulangara, Jayasri Joseph, Ludi de Souza and John Bunton Plus.
J. Christiansen, CERN - EP/MIC
PetrickMAPLD05/P1461 Virtex-II Pro PowerPC SEE Characterization Test Methods and Results David Petrick 1, Wesley Powell 1, Ken LaBel 1, James Howard 2.
PetrickMAPLD05/BOFL1461 Virtex-II Pro PowerPC SEE Characterization Test Methods and Results Session L: Birds of a Feather David Petrick 1, Wesley Powell.
® SPARTAN Series High Volume System Solution. ® Spartan/XL Estimated design size (system gates) 30K 5K180K XC4000XL/A XC4000XV Virtex S05/XL.
ATMEL ATF280E Rad Hard SRAM Based FPGA SEE test results Application oriented SEU Sensitiveness Bernard BANCELIN ATMEL Nantes SAS, Aerospace Business Unit.
FT-UNSHADES Analysis of SEU effects in Digital Designs for Space Gioacchino Giovanni Lucia TEC-EDM, MPD - 8 th March Phone: +31.
Radiation Effects Digital Tester: Why is this needed? Kenneth A. LaBel Co-Manager, NASA Electronic Parts and Packaging (NEPP) Program.
2/2/2009 Marina Artuso LHCb Electronics Upgrade Meeting1 Front-end FPGAs in the LHCb upgrade The issues What is known Work plan.
P173/MAPLD 2005 Swift1 Upset Susceptibility and Design Mitigation of PowerPC405 Processors Embedded in Virtex II-Pro FPGAs.
MAPLD 2005/202 Pratt1 Improving FPGA Design Robustness with Partial TMR Brian Pratt 1,2 Michael Caffrey, Paul Graham 2 Eric Johnson, Keith Morgan, Michael.
Experimental Evaluation of System-Level Supervisory Approach for SEFIs Mitigation Mrs. Shazia Maqbool and Dr. Craig I Underwood Maqbool 1 MAPLD 2005/P181.
Date processing and conversion Please use speaker notes for additional information!
25 April 2000 SEESCOASEESCOA STWW - Programma Evaluation of on-chip debugging techniques Deliverable D5.1 Michiel Ronsse.
P189/MAPLD2004Carmichael 1 A Triple Module Redundancy Scheme for SEU Mitigation of Static Latch-Based FPGAs Carl Carmichael 1, Brendan Bridgford 1, Gary.
MooreC142/MAPLD Single Event Effects (SEE) Test Results on the Virtex-II Digital Clock Manager (DCM) Jason Moore 1, Carl Carmichael 1, Gary Swift.
1 The NSEU Sensitivity of Static Latch Based FPGAs and Flash Storage CPLDs Joseph Fabula Jason Moore Austin Lesea Saar Drimer MAPLD2004 This work has benefited.
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs Ghazanfar (Hossein) Asadi and Mehdi B. Tahoori Why Soft Error Rate (SER) Estimation?
Field Programmable Port Extender (FPX) 1 Modular Design Techniques for the FPX.
Petrick_P2261 Virtex-II Pro SEE Test Methods and Results David Petrick 1, Wesley Powell 1, James Howard 2 1 NASA Goddard Space Flight Center, Greenbelt,
LaRC MAPLD 2005 / A208 Ng 1 Radiation Tolerant Intelligent Memory Stack (RTIMS) Tak-kwong Ng, Jeffrey Herath Electronics Systems Branch Systems Engineering.
Evaluating Logic Resources Utilization in an FPGA-Based TMR CPU
1 Level 1 Pre Processor and Interface L1PPI Guido Haefeli L1 Review 14. June 2002.
EE121 John Wakerly Lecture #15
2004 MAPLD, Paper 1008 Sanders 1 Radiation-Hardened re-programmable Field- Programmable Gate Array (RHrFPGA) A.B. Sanders 1, K.A. LaBel 1, J.F. McCabe.
Presented by Anthony B. Sanders NASA/GSFC at 2004 MAPLD Conference, Washington, DC April 27-29, Radiation-Hardened re-programmable Field- Programmable.
SEU WG, TWEPP SEU mitigation in GBT On behalf of GBT team Circuit design: TMR Full-custom/High Speed Configuration Registers Protocol Some results.
Nishinaga No. 1 MAPLD2005/1003-J Availability Analysis of Xilinx FPGA on Orbit Nozomu Nishinaga National Institute of Information and Communications Technology.
Readout controller Block Diagram S. Hansen - CD-1 Lehman Review1 VXO Ø Det Links to 24 SiPM Front End Boards Clock Event Data USB ARM uC A D Rd Wrt 100Mbit.
Actel Antifuse FPGA Information – Radiation Tests Actel Antifuse FPGA – A54SX72A 72K gates 208 pqfp package 2.5v to 5.0v I/O tolerant $62 each for tested.
Xilinx V4 Single Event Effects (SEE) High-Speed Testing Melanie D. Berg/MEI – Principal Investigator Hak Kim, Mark Friendlich/MEI.
P201-L/MAPLD SEE Validation of SEU Mitigation Methods for FPGAs Carl Carmichael 1, Sana Rezgui 1, Gary Swift 2, Jeff George 3, & Larry Edmonds 2.
6th WorldFIP Insourcing Meeting
SEU Mitigation Techniques for Virtex FPGAs in Space Applications
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Sequential circuits and Digital System Reliability
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs
Xilinx Kintex7 SRAM-based FPGA
Presentation transcript:

March 21, 2014 Workshop on FPGAs for High-Energy Physics Single-Event Effects Testing and the Xilinx Radiation Test Consortium presenter: Gary Swift

SEE Testing & the XRTC Slide 2 Page 2 Overview SEE Testing for Space –Accelerated AND Accelerator-based Testing –Purpose On-orbit rate Fail signatures and design level mitigation XRTC (Xilinx Radiation Test Consortium) –Voluntary membership –Maximize Leveraging –Test Campaign Phases: Static Dynamic Mitigation –Beam and/or Fault Injection Common Setup

SEE Testing & the XRTC Slide 3 MAPLD Seminar – Sept. 25, 2006 SEE testing is extremely easy. 1.It’s simply counting 2. Only two things to count: Number of upsets Total beam fluence 3. The facility counts the beam for you. SEE Testing’s Basic Principle

SEE Testing & the XRTC Slide 4 Short Course – April 22, 2010 FPGAs: What bits can upset? Configuration Bits –Logical Function –Routing –User Options Block RAM User Flip-flops Control Registers × Type of I/O × Mode of Block RAM Access × Clock Manager × etc… × NAND × Ex-OR × Flip-Flop type × etc…

SEE Testing & the XRTC Slide 5 MAPLD Seminar – Sept. 25, 2006 SEE Testing Is Hard Difficult to do High-Energy Accelerator Beam costs ~ $1000/hr Travel & shipping add extra costs Portable test fixtures and control systems Test Development Special Problems Part De-lidding In Vacuum Operation

SEE Testing & the XRTC Slide 6 MAPLD Seminar – Sept. 25, 2006 Acid Etch Plastic to Expose Die De-lidded plastic package of A54SX32

SEE Testing & the XRTC Slide 7 MAPLD Seminar – Sept. 25, 2006 Preparation: Thinning Flip-Chip Technician thins DUT backside

SEE Testing & the XRTC Slide 8 MAPLD Seminar – Sept. 25, 2006 Space Upset Rates – Three Inputs Measure  vs. LET –Testing done at high-energy accelerator –Cross-section determined from circuit response Determine Sensitive Volume –Requires assumptions about device construction –Used to determine effect of ions that strike the device at an angle Integrate with LET Spectrum LET Measured Cross Section LET dN dE Particle LET Spectrum ERROR RATE Sensitive Volume σ

SEE Testing & the XRTC Slide 9 Short Course – April 22, 2010 Prediction vs. Actual Example: Rovers Going to Mars - Pyro Control Board Xilinx XQR4062XL (four in each)

SEE Testing & the XRTC Slide 10 MAPLD Seminar – Sept. 25, 2006 Lots of Interest Lots of Leveraging Xilinx, Inc. and JPL started partnering on tests and soon had to form the Radiation Test Consortium Boeing Satellite Systems SEAKR Engineering

SEE Testing & the XRTC Slide 11 Short Course – April 22, 2010 XRTC Test Infrastructure Capturing detailed strip charts allows experiment to be “re-played” for smarter signature identification and later analysis and re-analysis.

SEE Testing & the XRTC Slide 12 Example – Strip Chart of Irradiation … …

SEE Testing & the XRTC Slide 13 Page 13 0 th generation motherboard Virtex-2 Virtex-IIpro Virtex-4 Virtex-5 XRTC Motherboard History 2000 Xilinx Radiation Test Consortium JPL & Xilinx partner up Consortium formation 1 st Annual Meeting 2 nd generation motherboard 3 rd generation motherboard 1 st generation motherboard

SEE Testing & the XRTC Slide 14 MAPLD Seminar – Sept. 25, 2006 BOTH Fault Injection & Irradiation Dynamic Test Setup at Texas A&M Cyclotron Inside target room Configuration Monitor/ Strip Chart Functional Monitor/ Strip Chart SEAKR-built DUT Board - XQR2V6000 in beam BEAM Thinned DUT

SEE Testing & the XRTC Slide 15 MAPLD Seminar – Sept. 25, 2006 Backside Lots of Signals = Messy Looking

SEE Testing & the XRTC Slide 16 MAPLD Seminar – Sept. 25, 2006 Virtex 4 FX pins SMPX MGTs Reset Switches USB RS-232 Temp I/F Osc DIFF_BUS JTAG/SelectMAP SNGL_BUS[0:63],[152:191] SNGL_BUS[64:151] JTAG/SelectMAP MGT_BUS 1.2V2.5V3.0VVAR Teradyne Conn B Teradyne Conn A MGTCLK FPGA Core MGT Bias VAR SNGL Bank Diff Bank MGT Analog Reg 3.0V MGT Term Reg Key MGT Regulators 2 nd Gen+ Setup – DUT Daughter Card

SEE Testing & the XRTC Slide 17 MAPLD Seminar – Sept. 25, nd Gen. Consortium Test Board

SEE Testing & the XRTC Slide 18 In-Vacuum Test at Texas A&M

SEE Testing & the XRTC Slide 19 Page 19 Some significant events Nov ’08 - First silicon V-5QV into beam followed by many tests, approx. monthly May ’09 - SpaceCube flies on Hubble Repair Mission with the first V-4’s in space Nov ’09 - First XRTC space experiment launched with first V-4QV in space (& SpaceCube again) July ’10 - Official V-5QV Product Announcement (at NSREC) plus shipping engineering samples to EA customers May ‘ nd XRTC space experiment launched with first V-5QV (ES) in space July ’11 - Official V-5QV Release Announcement (at NSREC) plus shipping flight parts Nov ’11 - First production V-5QV launched Feb ’ th XRTC Annual Meeting

SEE Testing & the XRTC Slide 20 Page 20 XRTC Mission: Beam Testing Feb ‘09 thru Jan ‘10 = 451 hrs of beam Feb ‘10 thru Jan ‘11 = 552 hrs of beam Feb’ 11 thru Dec ‘11 Tests = 392 hrs of beam Main Proton Test Campaign: –UCD, Nov –UCD, Jan –UCD, June –UCD, October –UCD, November –UCD, December “Cleanup” Heavy Ion Tests: –LBL, April –LBL, May –LBL, August –TAM, August 2012 (BYU/SEAKR)12 –TAM, Sept Total 371 beam hrs

SEE Testing & the XRTC Slide 21 XRTC Mission: Test Results Static Results on Virtex- 4QV –- Config cells –- User BRAM & FFs –- Functional Upsets – (aka SEFIs) -Both heavy ions & proton Virtex-5QV Static Report -Release 3 May 2013 Architectural Features Report -Release 2 Aug Reports hosted on JPL website:

SEE Testing & the XRTC Slide 22 XRTC Test Reports FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 23 XRTC Annual Meeting – 2.5 Days FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 24 Basics of Upset Mitigation Redundancy - Extra information (bits) prevents all upsets from yielding system errors. Scrubbing required – Accumulation of errors rapidly kills mitigation effectiveness. Effective – Most spacecraft now fly large arrays of upset-soft memories with few or no errors. Typically, uncorrectable errors are detectable. FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 25 Basics of Upset Mitigation - cont’d Common sense says - At some point, upsets will occur too rapidly and the mitigation will be “overwhelmed.” In fact, Edmonds approx. equation says – There’s not really a “cliff.” The relationships are known; the error rate: (1) increases with the square of upset rate (2) decreases linearly with faster scrub rates (3) is directly proportional to EDAC word size † † EDAC word size = data bits + check bits ; EDAC=error detection and correction FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 26 Basics of Upset Mitigation - Examples 32 data bits + 7 check bits - Cassini Solid State Recorders with 2+ Gb DRAM array is working well, in spite of architecture “flaw.” 128 data bits + 9 check bits – This hidden EDAC word inside IBM Luna-C 16Mb DRAMs used on RAD6000 boards on many missions requires external accesses to prevent accumulation of upsets. 64 data bits + 16 check bits – A specially design cyclical parity scheme on the RAD750 board corrects up to 4 upsets, if confined to a nibble, allowing correct operation with a bad DRAM chip. If U is the underlying upset rate, then the EDAC word error rate is approximately: T scrub U 2 N EDAC 0.5 x FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 27 Mitigation – Chip-Level TMR Maxwell’s SCS750 prototype at the Texas A&M Cyclotron Facility: RunuP-AuP-BuP-C Upsets by Processor Quite Acceptable Uniformity FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 28 TMR Basics TMR = triple-module redundancy Three independent “legs” or domains performing identical functions Voters are inserted – typically at feedback points Voters are triplicated also – they are not a single point of failure Error-free operation with any single upset Two upsets might cause system failure Scrubbing is again required to reduce the chance of co-resident upsets. FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 29 Model of TMR System Functional Block, i Voter, i N i = # of “care’ bits in each block and voter NiNi NiNi NiNi N i-1 N i+1 …N M N 1 … …N M M = # of modules T C = scrub time Feedback, i+1 Feedback from the voters corrects state errors inside blocks FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 30 TMR stops error propagation Single upsets cannot cause errors Error propagation requires upsets in two parallel modules. X X X X X X Even multiple upsets may not cause errors FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 31 Edmonds TMR Equation – small r approx. Scrub Time System Error Rate Underlying Upset Rate Total Modules “Fitting” Parameter FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 32 Edmonds TMR Equation – small r approx. Scrub Time System Error Rate Underlying Upset Rate Total Modules Parameter is really the second moment of the distribution of N’s. This is a “cousin” of the standard deviation. FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 33 Example Application - BRAM Scrubber Given parameters: T=2 ms, M=48000 Fit parameter: M2=M3=M4= 250 FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 34 In Review … New Edmonds Equation for TMR is –General (for TMR-ed systems) –Powerful Works over many orders-of-magnitude –Based on moments which are Statistically meaningful Of rapidly diminishing importance so only one (or two) adjustable parameters are enough Calculable, in theory anyway; in practice, probably not. –Useful In predicting system error rates in space In designing appropriate in-beam testing –Consortium uses “three-flux” test for all mitigated experiments using spacing of an order of magnitude or more FPGA Workshop – March 21, 2014

SEE Testing & the XRTC Slide 35 The End Thank you! Any questions? FPGA Workshop – March 21, 2014