First Results from Radiation Tolerance Tests on Xilinx Virtex 5

Slides:



Advertisements
Similar presentations
Reconfigurable Computing (EN2911X, Fall07) Lecture 04: Programmable Logic Technology (2/3) Prof. Sherief Reda Division of Engineering, Brown University.
Advertisements

Sana Rezgui 1, Jeffrey George 2, Gary Swift 3, Kevin Somervill 4, Carl Carmichael 1 and Gregory Allen 3, SEU Mitigation of a Soft Embedded Processor in.
10/14/2005Caltech1 Reliable State Machines Dr. Gary R Burke California Institute of Technology Jet Propulsion Laboratory.
Scrubbing Approaches for Kintex-7 FPGAs
Xilinx CPLDs and FPGAs Module F2-1. CPLDs and FPGAs XC9500 CPLD XC4000 FPGA Spartan FPGA Spartan II FPGA Virtex FPGA.
Multi-Bit Upsets in the Virtex Devices Heather Quinn, Paul Graham, Jim Krone, Michael Caffrey Los Alamos National Laboratory Gary Swift, Jeff George, Fayez.
Radiation Effects on FPGA and Mitigation Strategies Bin Gui Experimental High Energy Physics Group 1Journal Club4/26/2015.
HPEC 2012 Scrubbing Optimization via Availability Prediction (SOAP) for Reconfigurable Space Computing Quinn Martin Alan George.
Complex Upset Mitigation Applied to a Re-Configurable Embedded Processor EEL 6935 Lu Hao Wenqian Wu.
1 Fault Tolerant FPGA Co-processing Toolkit Oral defense in partial fulfillment of the requirements for the degree of Master of Science 2006 Oral defense.
ICAP CONTROLLER FOR HIGH-RELIABLE INTERNAL SCRUBBING Quinn Martin Steven Fingulin.
VirtexII 3000 FPGA Dynamic Burn-In Test For Military And Aerospace Applications Sponsored By NASA Electronic Parts and Packaging Program (NEPP) Electronic.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Fault-Tolerant Softcore Processors Part I: Fault-Tolerant Instruction Memory Nathaniel Rollins Brigham Young University.
BIST for Logic and Memory Resources in Virtex-4 FPGAs Sachin Dhingra, Daniel Milton, and Charles Stroud Electrical and Computer Engineering Auburn University.
Data Partitioning for Reconfigurable Architectures with Distributed Block RAM Wenrui Gong Gang Wang Ryan Kastner Department of Electrical and Computer.
Storage Assignment during High-level Synthesis for Configurable Architectures Wenrui Gong Gang Wang Ryan Kastner Department of Electrical and Computer.
Configuration. Mirjana Stojanovic Process of loading bitstream of a design into the configuration memory. Bitstream is the transmission.
Juanjo Noguera Xilinx Research Labs Dublin, Ireland Ahmed Al-Wattar Irwin O. Irwin O. Kennedy Alcatel-Lucent Dublin, Ireland.
Radiation Effects and Mitigation Strategies for modern FPGAs 10 th annual workshop for LHC and Future experiments Los Alamos National Laboratory, USA.
A Serializer ASIC for High Speed Data Transmission in Cryogenic and HiRel Environment Tiankuan Liu On behalf of the ATLAS Liquid Argon Calorimeter Group.
12004 MAPLD: 141Buchner Single Event Effects Testing of the Atmel IEEE1355 Protocol Chip Stephen Buchner 1, Mark Walter 2, Moses McCall 3 and Christian.
Virtex-6 Radiation Studies & SEU Mitigation Tests
1 FLIPPER SEU Fault Injection in Xilinx FPGAs Monica Alderighi National Institute for Astrophysics, IASF Milano, Italy Computing.
Versatile Link FPGA-based Bit-Error-Ratio Tester for SEU-hardened Optical Links Csaba SOOS, Stéphane DETRAZ, Sérgio SILVA, Paulo MOREIRA, Spyridon PAPADOPOULOS,
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
FPGA IRRADIATION and TESTING PLANS (Update) Ray Mountain, Marina Artuso, Bin Gui Syracuse University OUTLINE: 1.Core 2.Peripheral 3.Testing Procedures.
DLS Digital Controller Tony Dobbing Head of Power Supplies Group.
J. Christiansen, CERN - EP/MIC
PetrickMAPLD05/P1461 Virtex-II Pro PowerPC SEE Characterization Test Methods and Results David Petrick 1, Wesley Powell 1, Ken LaBel 1, James Howard 2.
ATMEL ATF280E Rad Hard SRAM Based FPGA SEE test results Application oriented SEU Sensitiveness Bernard BANCELIN ATMEL Nantes SAS, Aerospace Business Unit.
FT-UNSHADES Analysis of SEU effects in Digital Designs for Space Gioacchino Giovanni Lucia TEC-EDM, MPD - 8 th March Phone: +31.
Radiation Effects Digital Tester: Why is this needed? Kenneth A. LaBel Co-Manager, NASA Electronic Parts and Packaging (NEPP) Program.
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
MAPLD 2005/202 Pratt1 Improving FPGA Design Robustness with Partial TMR Brian Pratt 1,2 Michael Caffrey, Paul Graham 2 Eric Johnson, Keith Morgan, Michael.
23 February 2004 Christos Zamantzas 1 LHC Beam Loss Monitor Design Considerations: Digital Parts at the Tunnel Internal Review.
Petrick_P2261 Virtex-II Pro SEE Test Methods and Results David Petrick 1, Wesley Powell 1, James Howard 2 1 NASA Goddard Space Flight Center, Greenbelt,
LaRC MAPLD 2005 / A208 Ng 1 Radiation Tolerant Intelligent Memory Stack (RTIMS) Tak-kwong Ng, Jeffrey Herath Electronics Systems Branch Systems Engineering.
ESS | FPGA for Dummies | | Maurizio Donna FPGA for Dummies Basic FPGA architecture.
CERN, 18 december 2003Coincidence Matrix ASIC PRR Coincidence ASIC modifications E.Petrolo, R.Vari, S.Veneziano INFN-Rome.
Greg Alkire/Brian Smith 197 MAPLD An Ultra Low Power Reconfigurable Task Processor for Space Brian Smith, Greg Alkire – PicoDyne Inc. Wes Powell.
1 Advanced Digital Design Reconfigurable Logic by A. Steininger and M. Delvai Vienna University of Technology.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
FPGAs in ATLAS Front-End Electronics Henrik Åkerstedt, Steffen Muschter and Christian Bohm Stockholm University.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
COTS for on-detector timing distribution. Status report and preliminary tests. A. Aloisio, R. Giordano University of Naples ‘Federico II’ and INFN
GBT protocol implementation on Xilinx FPGAs Csaba SOOS PH-ESE-BE-OT.
Radiation Tolerance Studies using Fault Injection on the Readout Control FPGA Design of the ALICE TPC Detector Johan Alme Bergen University College, Norway.
October 12th 2005 ICALEPCS 2005D.Charlet The SPECS field bus  Global description  Module description Master Slave Mezzanine  Implementation  Link development.
Status and Plans for Xilinx Development
Xilinx V4 Single Event Effects (SEE) High-Speed Testing Melanie D. Berg/MEI – Principal Investigator Hak Kim, Mark Friendlich/MEI.
DHH progress report Igor Konorov TUM, Physics Department, E18 DEPFET workshop, Bonn February 7-9, 2011 Outline:  Implementation synchronous clock distribution.
E. Hazen1 MicroTCA for HCAL and CMS Review / Status E. Hazen - Boston University for the CMS Collaboration.
Error Correcting Codes for Serial links : an update
Presenter: Darshika G. Perera Assistant Professor
CLUster TIMing Electronics Part II
Rad-tolerance Tests of FPGAs and SerDeses for SuperB
Results of the TLK1711-A Radiation Tolerance Tests
Defining serial links for SuperB
Production Firmware - status Components TOTFED - status
FPGA IRRADIATION and TESTING PLANS (Update)
SEU Mitigation Techniques for Virtex FPGAs in Space Applications
Radiation Tolerance of an Used in a Large Tracking Detector
A microTCA Based DAQ System for the CMS GEM Upgrade
Links & Clock distribution: toward the TDR
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Impact of Serializer/Deserializer Architecture on ETD High-Speed Links
Test Bench for Serdes Radiation Qualification
Xilinx Kintex7 SRAM-based FPGA
Fixed Latency Serial Links with FPGA-embedded SerDes for SuperB
Presentation transcript:

First Results from Radiation Tolerance Tests on Xilinx Virtex 5 A. Aloisio1, V. Bocci2, R. Giordano1 In this talk 1 Physics Dept. - University of Napoli “Federico II” and INFN Sezione di Napoli, Italy 2 INFN Sezione di Roma 1, Rome, Italy email: aloisio@na.infn.it, bocci@roma1.infn.it, rgiordano@na.infn.it

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Outline FPGAs on-detector? Benefits of homogeneous FPGA<->FPGA links Xilinx Virtex-5 LXT50T facts Test bench architecture Test facility and conditions Results Conclusions 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Can We Use FPGAs On-Detector ? S-RAM FPGAs traditionally excluded from radiation areas Configuration (stored in static RAM) is sensitive to single event upsets (SEUs) => bit-flips A bit flip in the configuration memory can change design functionality Mid-range Xilinx FPGAs include tools for configuration error detection and correction 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

What if FPGAs Could be Used The fast link sub-system will be dramatically simplified: we would only have symmetrical links! (FPGA<->FPGA) No constraints imposed by line encodings of stand-alone SerDeses (i.e. start/stop bits of DS92LV18 and 8b10b coding TLK2711-A) Just one type of links, protocol and line coding customized to ETD requirements Fixed-latency proof and thoroughly tested Artix-7: cheap Xilinx FPGAs ( 40$) with  8 embedded SerDeses (GTPs) More than one link per chip Data-rates up to 3 Gbps on all links (including FCTS) (actually SerDes could go even faster : 6.6 Gbps) # of links could be halved or better (when not driven by topology) 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Device Xilinx Virtex 5 LX50T Includes high-speed SerDeses Mid-range size, yet large enough to fit our needs Tested by Xilinx, NASA, Lawrence Berkeley Lab. and Los Alamos Lab. Embeds configuration CRC with ECC blocks and partial reconfiguration capabilities Two versions: rad-hard (70k$) and industrial (400$) (we focus on the latter, guess why) 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Device Facts Configuration size 11.37 Mbits CLBs&routing  9 Mbit IOB, DSP, BRAM-interconnect  2.37 Mbit Clock cycles per CRC Readback 355,190 Readback time 7.1 ms (at 50 MHz) 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Benchmark Design Serial Link running at 2 Gbps (compatible with the TLK2711-A) 16-bit parallel words (18-bit including control bits) 100 MHz parallel clock 8b10b encoding Explicit lock flag Dummy Logic on Tx and Rx parallel data-path to observe realistic SEU effects 2x 16 levels of 18-bit registers and combinational logic Implemented with Precision Hi-Rel synthesizer Two firmware versions: one with and one without Triple Module Redundancy moderation techniques Resource Occupation GTPs : 1 (10%) Slices: 376 (5%) FFs: 926 (3%) LUTs: 980 (3%) BRAM: 1 (1%) PLLs: 1 (16%) Clock Buffers: 6 (18%) IOBs: 47 (10%) 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Mentor Precision Hi-Rel Synthesizer Aimed at applications in aerospace, medical, high-reliability and safety-critical FPGAs Several grades/options for Triple Module Redundancy Local TMR, registers are tripled and voted are inserted Distributed TMR, registers, combinatorial logic and even voters are tripled Global TMR, for highest reliability, see next slide Can use power posts of V5 instead of half-latches as a source for logic constants (logic ‘0’ and ‘1’) Local TMR Distributed TMR 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Global Triple Module Redundancy We adopted the GTMR option (recommended for S-RAM FPGAs) Combinatorial elements, registers and the pertaining clock routes are tripled and voted out But, in our design, IOs have not been tripled, unfeasible Embedded SerDes (GTP) & PLL not tripled GTP replication would have required Serial IO triplication, unpractical PLL triplication not supported by the tool We also used V5 dedicated powerposts 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 V5LX50T Test Bench Ethernet link Ethernet link BRIDGE RS-232 link XILINX ML505 Tektronix DTG5334 USB link 18 2 PC logging all the data on the hard-drive (errors, static and dynamic currents…) and driving configuration read-back 18 RX Data out Recovered Clk 8 Controls Clock Generator FPGA Clk JTAG Programmer TX Clk Ethernet link TX Data in FPGA V5 LX50T JTAG Dire che e’ lo stesso setup dell’altra volta. Power Analyzer & Logger Controls RX Serdes TX Serdes Configuration and Readback 28/04/2017 AGILENT N6705A 4 Power inputs: 1.0V, 1.2V, 2.5V, 3.3V Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Tester Board Console I/O Same FPGA as DUT board Data pattern stored in the FPGA firmware TX and RX sections of the benchmark link design are tested independently and simultaneuosly Console IO Status and errors are logged on a console handle by an embedded microprocessor Parallel I/O XILINX ML505 Virtex5 – XC5VLX50T Clock input GTPdiff. I/O 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

FPGA Board for Beam Test Power supply inputs Compatible with the Xilinx ML505 board used for the presented lab. tests (implements sub-set of features) Same firmware used in ML505 for SEU emulation tests (only synthesizer changed) SMAs for clocking and serial IO No active components 4-wires connectors for current sensing power supply SFP cage for optional testing of opto-electronics Parallel IO Parallel IO FPGA Virtex 5 LX50T Embedded SerDes diff. IO Differentialclock input and output 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 LNS Facility Superconducting cyclotron at LNS - Catania 62-MeV proton beam CATANA beam line Current tunable up to 300 pA Uniform beam intensity on our sample Experimental setup LNS accelerator PLAN Beam spot 25 mm 2D Beam profile Gafcromico per beamspot 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Test-setup Test conditions All Vcc set to maximum allowed value (1.05V, 1.32V, 2.63V, 3.45V) fclock = 100 MHz Data rate during test=2 Gb/s Configuration cross section measurement and link failure test Tests split into many runs (~ 50) Configuration x-section, 6 runs, several dose rates from 1 to 20 Gy/min (Si) Tested 2 link firmwares: w/ TMR and w/out TMR (will refer to them as TMR and noTMR), with different dose rates also Closer Front view Beam spot WiderBackview FPGA Test board Parallel & Serial Data cables Tester board 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

NoTMR Firmware: SEU Tolerance Total of 19 runs 18 runs @ 6 Gy/min (Si) 1 @ 23 Gy/min (Si) Avg. duration 146 s => total dose per run 14 Gy (Si) We measured # of configuration bit-flips accumulated before link failure (error burst) (Nflips) On average Nflips =2070 During the SEU emulation tests performed in lab. in May, Nflips=2250 noTMR project Good match (<10%) between emulated SEU and real SEUs 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

TMR Firmware: SEU Tolerance Total of 24 runs 20 runs @ 6 Gy/min (Si) 4 @ at 23 Gy/min Avg. duration 171 s => total dose per run 19 Gy (Si) On average Nflips =2170 Results compatible with the noTMR firmware Looks like TMR did not improve SEU-tolerance due to non-tripled modules? (GTP, PLL?) TMR project 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Configuration Memory Test Methodology Start a new run Program the FPGA Read-back the configuration via JTAG and compare it to the initial one Log number of differences (errored bits) Go back to 3, until total desired dose reached The test loop (pts 3,4,5) has been executed at the maximum speed permitted by JTAG, i.e. one readback every 15 seconds 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Configuration Memory Error Results Accumulation of bit-flips into configuration memory, FPGA working like a «dosimeter» # of Config Errors Irradiation Stopped @ 71 Gy (Si) Experienced a few SEFIs, a SEU causes failure of readback circuitry => 1 SEU generates many (thousands) configuration errors Abrupt jumps of config errors Time (s) SEFI causes ‘jump’ of config erros # of Config Errors 9225 errors Time (s) 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Configuration Bit-flip Cross-section V5LX50T device Die size = 1.4 cm2 p @ 62MeV Measured s = 3,510-14 cm2/bit Published s = 6.410-14 cm2/bit, see [1] Good agreement between measured and published s (different device, facility, test conditions…) X-ray image Package 12 mm Die Reference: [1] Quinn et al., Proceedings of 2007 IEEE Radiation Effetcs Data Workshop, Page(s): 177-184 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Link Failure Rates Courtesy of Riccardo Cenci, Elba Meeting, May-Jun. 2011 Effective x-section to failure is design-dependent (critical bits) 0.5 kGy(Si)/year (a) => 4.1104 configuration bit-flips on our design /year (measured) .# of link failures/year = 20 (on average, without ANY recovery strategy) Expected one link failure every 18 days (includes both configuration and SEUs in configured logic) Die Note: (a) Estimated by R. Cenci, INFN Pisa 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Conclusions Very good agreement (within 10%) between test with emulated SEUs and test on beam On average  2100 bit-flips needed to have a link failure Measured configuration error x-section in agreement with the expected one (within factor 2) Test beam data analysis to be completed: Analysis of currents More detailed analysis of link failures, how did it fail? Investigate causes of failures, why did it fail ? Why did the TMR and noTMR designs perform the same? Need further testing and updated info on expected radiation in detector area as soon as possible Next proton test beam (@LNS, 62-MeV protons) scheduled for Dec. 10th 2011 Will test Xilinx Virtex-5 and Virtex-6 FPGA families Still unclear if FPGAs are suitable to be used on-detector We thank all the LNS staff for the their help and support during the beam test 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Back-up Slides 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Quick Facts rSi = 2.3 g/cm3 (dE/dx)p@60MeV in Si = 1.8 MeV/mm (or 600 keV in 300 mm) s = (1 / F) * nerrors NSEU = (Npbeam*sbit*Nbit(eff.)) / * Sbeam Nsysfail = NSEU * NSEU->failure 71 Gy(Si) => 5891 config. errors 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Single Event Functional Interrupts Experienced a few BRAM SEFIs Run 30 - This is a known SEFI we call "BRAM SEFI" where the configuration bits (one) get flipped, and the entire BRAM gets its data inverted. Another BRAM SEFI is a replacement of a column with the spare column (256 bits in error). 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011 Recovery Strategies Scheduled maintenance Reconfigure the FPGA at regular intervals, e.g. once a day, no matter what Emergency maintenance Reconfigure as soon as the service can be interrupted, e.g. exploit any reset or power-cycle of the link to reconfigure Repair-while-running Partial reconfigure as soon as the error is detected, even if the link is working => interruption of service, i.e. dead time 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Emulating Configuration SEUs Investigate impact of configuration SEUs on design functionality Flip configuration bits by means of internal configuration access port (ICAP) Optional error correction thank to integrated CRC calculator and ECC (FRAME_ECC) Programmable integrated controller: SEU generation without correction (error accumulation) SEU generation and on-the-fly correction Custom design derivative of a Xilinx core (so called SEU Controller) 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

Need to Estimate Rad-Tolerance Two kind of issues : Total Ionizing Dose (common to all ICs) SEUs However, configuration SEUs have been largely over-estimated in the past: they rarely impact the design functionality In this talk we will not cover logic errors, they are common to every digital device Logic errors Lab test (SEU Injector) Configuration errors Beam test 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

SEU Generation&Correction Results SEUs generated at 1 Hz (due to limitations of the original Xilinx SEU controller) Measured # of (generated&corrected) SEUs before failure On average  400 SEUs needed to observe BERT errors Very likely the difference is due to reconfiguration=>even correctly working blocks are affected % 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011

SEU Accumulation Test Results SEU generated at 20 Hz We measured # of SEUs before failure On average 2250 SEUs needed to observe BERT errors % 28/04/2017 Raffaele Giordano 1st SuperB Collaboration Meeting, London Sept. 2011