Reconfigurable Computing in Space with Radiation-Hardened Xilinx FPGAs

Slides:



Advertisements
Similar presentations
Sana Rezgui 1, Jeffrey George 2, Gary Swift 3, Kevin Somervill 4, Carl Carmichael 1 and Gregory Allen 3, SEU Mitigation of a Soft Embedded Processor in.
Advertisements

EEL6686 Guest Lecture February 25, 2014 A Framework to Analyze Processor Architectures for Next-Generation On-Board Space Computing Tyler M. Lovelly Ph.D.
HPEC 2012 Scrubbing Optimization via Availability Prediction (SOAP) for Reconfigurable Space Computing Quinn Martin Alan George.
Complex Upset Mitigation Applied to a Re-Configurable Embedded Processor EEL 6935 Lu Hao Wenqian Wu.
April 30, Cost efficient soft-error protection for ASICs Tuvia Liran; Ramon Chips Ltd.
ICAP CONTROLLER FOR HIGH-RELIABLE INTERNAL SCRUBBING Quinn Martin Steven Fingulin.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
CHREC F5/F6 W EEKLY M EETING F5-11: Device Performance Metrics and Mission-Critical Processing Nicholas Wulf Justin Richardson Quinn Martin Steven Fingulin.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
Introduction to Reconfigurable Computing CS61c sp06 Lecture (5/5/06) Hayden So.
Surrey Space Centre, University of Surrey, Guildford, Surrey, GU2 7XH ESA Wireless Sensor Motes Study George Prassinos, SSC, University of Surrey.
Danish Space Research Institute Danish Small Satellite Programme FH Space_Environment.ppt Slide # 1 Flemming Hansen MScEE, PhD Technology Manager.
Configurable System-on-Chip: Xilinx EDK
A Performance and Energy Comparison of FPGAs, GPUs, and Multicores for Sliding-Window Applications From J. Fowers, G. Brown, P. Cooke, and G. Stitt, University.
Introduction to FPGA and DSPs Joe College, Chris Doyle, Ann Marie Rynning.
Dynamic Power Consumption In Large FPGAs WILLIAM GARCIA, ANDREW MORTELLARO.
GPGPU platforms GP - General Purpose computation using GPU
Synergy.cs.vt.edu Power and Performance Characterization of Computational Kernels on the GPU Yang Jiao, Heshan Lin, Pavan Balaji (ANL), Wu-chun Feng.
2014 IEEE Aerospace Conference March 2, 2014 A Framework to Analyze Processor Architectures for Next-Generation On-Board Space Computing Tyler M. Lovelly.
Juanjo Noguera Xilinx Research Labs Dublin, Ireland Ahmed Al-Wattar Irwin O. Irwin O. Kennedy Alcatel-Lucent Dublin, Ireland.
Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
Lecture 2: Field Programmable Gate Arrays September 13, 2004 ECE 697F Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays.
Lecture#14. Last Lecture Summary Memory Address, size What memory stores OS, Application programs, Data, Instructions Types of Memory Non Volatile and.
Shared memory systems. What is a shared memory system Single memory space accessible to the programmer Processor communicate through the network to the.
1 3-General Purpose Processors: Altera Nios II 2 Altera Nios II processor A 32-bit soft core processor from Altera Comes in three cores: Fast, Standard,
Floating Point vs. Fixed Point for FPGA 1. Applications Digital Signal Processing -Encoders/Decoders -Compression -Encryption Control -Automotive/Aerospace.
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
COMPUTER SCIENCE &ENGINEERING Compiled code acceleration on FPGAs W. Najjar, B.Buyukkurt, Z.Guo, J. Villareal, J. Cortes, A. Mitra Computer Science & Engineering.
Paper Review: XiSystem - A Reconfigurable Processor and System
Reconfiguration Based Fault-Tolerant Systems Design - Survey of Approaches Jan Balach, Jan Balach, Ondřej Novák FIT, CTU in Prague MEMICS 2010.
Research on Reconfigurable Computing Using Impulse C Carmen Li Shen Mentor: Dr. Russell Duren February 1, 2008.
Advanced Computer Architecture, CSE 520 Generating FPGA-Accelerated DFT Libraries Chi-Li Yu Nov. 13, 2007.
SHA-3 Candidate Evaluation 1. FPGA Benchmarking - Phase Round-2 SHA-3 Candidates implemented by 33 graduate students following the same design.
Space Radiation and Fox Satellites 2011 Space Symposium AMSAT Fox.
Using Cycle Efficiency as a System Designer Metric to Characterize an Embedded DSP and Compare Hard Core vs. Soft Core Advisor Dr. Vishwani D. Agrawal.
J. Christiansen, CERN - EP/MIC
FPGA (Field Programmable Gate Array): CLBs, Slices, and LUTs Each configurable logic block (CLB) in Spartan-6 FPGAs consists of two slices, arranged side-by-side.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR FPGA Fabric n Elements of an FPGA fabric –Logic element –Placement –Wiring –I/O.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Lecture 16: Reconfigurable Computing Applications November 3, 2004 ECE 697F Reconfigurable Computing Lecture 16 Reconfigurable Computing Applications.
An FPGA Implementation of the Ewald Direct Space and Lennard-Jones Compute Engines By: David Chui Supervisor: Professor P. Chow.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) Reconfigurable Architectures Forces that drive.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Wang Chen, Dr. Miriam Leeser, Dr. Carey Rappaport Goal Speedup 3D Finite-Difference Time-Domain.
29 May 2008 Exploration Technology Development Program’s Radiation Hardened Electronics for Space Environments (RHESE) Andrew S. Keys, James H. Adams,
Hardware Benchmark Results for An Ultra-High Performance Architecture for Embedded Defense Signal and Image Processing Applications September 29, 2004.
A Programmable Single Chip Digital Signal Processing Engine MAPLD 2005 Paul Chiang, MathStar Inc. Pius Ng, Apache Design Solutions.
FPGA-Based System Design: Chapter 1 Copyright  2004 Prentice Hall PTR Moore’s Law n Gordon Moore: co-founder of Intel. n Predicted that number of transistors.
Aerospace Conference ‘12 A Framework to Analyze, Compare, and Optimize High-Performance, On-Board Processing Systems Nicholas Wulf Alan D. George Ann Gordon-Ross.
FTCA Lecture November 14, 2012 FTCA: On-Board Processing Design Optimization Framework Dr. Alan D. George Professor of ECE University of Florida Dr. Herman.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Philipp Gysel ECE Department University of California, Davis
Gill 1 MAPLD 2005/234 Analysis and Reduction Soft Delay Errors in CMOS Circuits Balkaran Gill, Chris Papachristou, and Francis Wolff Department of Electrical.
1 Scaling Soft Processor Systems Martin Labrecque Peter Yiannacouras and Gregory Steffan University of Toronto FCCM 4/14/2008.
Xilinx V4 Single Event Effects (SEE) High-Speed Testing Melanie D. Berg/MEI – Principal Investigator Hak Kim, Mark Friendlich/MEI.
Heterogeneous Processing KYLE ADAMSKI. Overview What is heterogeneous processing? Why it is necessary Issues with heterogeneity CPU’s vs. GPU’s Heterogeneous.
Comparative Analysis of Parallel OPIR Compression on Space Processors
SEU Mitigation Techniques for Virtex FPGAs in Space Applications
Instructor: Dr. Phillip Jones
Electronics for Physicists
FPGAs in AWS and First Use Cases, Kees Vissers
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Lecture 41: Introduction to Reconfigurable Computing
Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan1,2 Dr.Adrian Stoica3.
Electronics for Physicists
Presentation transcript:

Reconfigurable Computing in Space with Radiation-Hardened Xilinx FPGAs Dr. Greg Stitt Associate Professor of ECE University of Florida Tyler M. Lovelly Research Student University of Florida December 10th, 2014

Introduction Space computing presents unique challenges Harsh and inaccessible operating environment Severe resource constraints – power, size, weight Stringent requirements for performance and reliability Increasing need for high-performance space computing Escalating demands for real-time sensor and autonomous processing Limited communication bandwidth to ground stations Legacy radiation-hardened (RadHard) processors cannot meet demands Generations behind commercial-off-the-shelf (COTS) processors Based upon architectures not particularly suited to needs of space computing Quantitative and objective analysis of processor architectures Device metrics analysis based on architectural capabilities Broad and diverse set of architectures under consideration Targeting space processors and low-power COTS processors (≤ 30 W)

Lecture Overview Intro to space computing Intro to device metrics Radiation hazards in space environment Radiation effects on electronics and FPGAs Radiation-hardening for techniques and outcomes Intro to device metrics Analyzing performance, power, memory, and IO Calculations with fixed and reconfigurable architectures Radiation-hardened Xilinx FPGAs Analysis of Virtex-5QV with device metrics Comparisons with COTS counterpart Comparisons with other RadHard processors 3

Earth magnetic field [8] Measured electron flux [8] Space Environment Radiation hazards Cosmic rays Low flux of high-energy charged particles Originate from sun (solar wind) and from outside solar system (galactic) Solar particle events Solar flares Energy bursts from coronal magnetic field Rich in electrons; last for hours Coronal mass ejections Eruption of plasma Rich in protons; last for days Radiation belts Charged particles trapped in magnetosphere Van Allen belts are mostly protons and electrons GCR Elements [8] 11-year solar cycle [8] Earth magnetic field [8] Measured electron flux [8] 4

Spacecraft Electronics Radiation effects on devices Electrostatic discharge Creates transient that couples into electronics Cumulative effects Total ionizing dose (TID) Silicon lattice damage, charge buildup within gate oxide Displacement damage (DD) Causes nucleus to move from normal lattice position Transient effects Single-event effects (SEEs) Particles pass thru lattice, cause soft errors SEU – transient pulses or bit flips SEFI – disrupts system functionality SEL – possible damage to device SEU in FF [7] SEU: single-event upset SEFI: single-event functional interrupt SEL: single-event latchup 5

Space Computing with FPGAs Xilinx FPGAs for space missions High computational capabilities; low power Enables parallelization and reconfiguration SRAM-based configuration memory Configures LUTs, FFs, BRAMs, DSPs, routing Memory sizes continually increasing SRAM vulnerable to SEEs Effects of SEEs on FPGAs [6] Configuration memory faults Routing faults - broken connections or short circuits LUT and DSP faults - change in logical functions BRAM and FF faults - data errors in the running design Can lead to data errors or disrupt functionality TID limits component lifetime Silicon lattice damage Charge buildup within gate oxide Xilinx CLB Architecture 6

Space Processors Radiation-hardened (RadHard) processors for high reliability Techniques for radiation-hardening Radiation-hardening by process Insulating oxide layer used in process Radiation-hardening by design Specialized circuit-layout techniques Radiation-hardening by architecture Fault-tolerant computing strategies Outcomes of radiation-hardening Cumulative effects Total-Ionizing Dose (TID) ≥ 300 krad(Si) Single-Event Effects (SEEs) Immunity to Single-Event Latchup (SEL), Upset (SEU), Functional Interrupt (SEFI) Performance and power Slower operating frequency, reduction in cores or execution units, increased power

Device Metrics Suite of quantitative and objective metrics developed by NSF CHREC Center at University of Florida [3-4] For comparative analysis of broad and diverse set of processors Central processing units (CPUs) Digital signal processors (DSPs) Field-programmable gate arrays (FPGAs) Graphics processing units (GPUs) Hybrid combinations of above (often SoCs) Highly useful for first-order analyses and comparisons Study broad range of devices with metrics to determine best candidates Later, study best candidates more deeply with selected, optimized benchmarking Different methods used for fixed- and reconfigurable-logic devices Metrics data collected from architectural features of device Determined from vendor-provided information and tools Experimental testbed in lab is not required for metrics analysis

Device Metrics Analysis Analyzing performance (GOPS) and power (GOPS/W) Computational Density (CD) measures theoretical performance Reported in giga-operations per second (GOPS) Calculated separately for varying data types 8-bit, 16-bit, and 32-bit Integer (Int8, Int16, and Int32) Single-precision and double-precision floating point (SPFP and DPFP) Determine operations mix (additions, multiplications, etc.) based on target apps CD per Watt (CD/W) measures performance scaled by power Analyzing memory and input-output bandwidth (GB/s) Internal Memory Bandwidth (IMB) measures throughput between processor and on-chip memory (cache or BRAM) External Memory Bandwidth (EMB) measures throughput between processor and off-chip memory (DDR2, DDR3, etc) Input-Output Bandwidth (IOB) measures throughput between processor and all off-chip resources (DDR, GigE, PCIe, GPIO, etc.)

Device Metrics: CPU Analysis (1/2) Example: Freescale P5040 COTS counterpart of RadHard RAD5545 CPU from BAE Systems Fixed-logic CPU: 2.2 GHz, 49 W, quad-core, no SIMD engine Calculating CD and CD/W Each core contains 3 integer execution units 1 floating-point execution unit Can issue 2 instructions each cycle Calculate operations/cycle for each data type Int8: 2 ops/cycle Int16: 2 ops/cycle Int32: 2 ops/cycle SPFP: 1 op/cycle DPFP: 1 op/cycle CDInt8,Int16,Int32 = 4 cores × 2.2 GHz × 2 ops/cycle = 17.6 GOPS CD/WInt8,Int16,Int32 = 17.6 GOPS / 49 W = 0.36 GOPS/W CDSPFP,DPFP = 4 cores × 2.2 GHz × 1 ops/cycle = 8.8 GOPS CD/WSPFP,DPFP = 8.8 GOPS / 49 W = 0.18 GOPS/W Operations mix of 50% add, 50% mult

Device Metrics: CPU Analysis (2/2) Calculating IMB, EMB, and IOB Each core contains L1 data cache: 8-byte bus L1 instr cache: 16-byte bus L2 cache: 64-byte bus Total of 2 DDR3 controllers: 8-byte bus; 1600 MT/s IMBL1data = 4 cores × 2.2 GHz × 8 bytes = 70.4 GB/s IMBL1inst = 4 cores × 2.2 GHz × 16 bytes = 140.8 GB/s IMBL2 = 4 cores × 2.2 GHz × 64 bytes / 11 cycles = 51.2 GB/s EMB = 2 DDR3 × 8 bytes × 1600 MT/s = 25.6 GB/s IOB = DDR3 + 10GigE + 1GigE + PCIe + SATA2.0 + GPIO + USB2.0 + SPI + UART + I2C = 48.76 GB/s Assumes 100% cache hit rate Based on optimal SerDes lane config.

Device Metrics: FPGA Analysis (1/2) Example: Xilinx Virtex-5 FX130T COTS counterpart of RadHard Virtex-5QV FX130 FPGA from Xilinx Reconfigurable-logic FPGA: different methods required for metrics [5] Calculating CD and CD/W FPGA logic resources Look-up tables (LUTs), Flip-flops (FFs), Multiply-accumulate units (DSPs) Generate and implement compute cores on FPGA with vendor tools All combinations of operation and data types: with and without DSP resources Collect data on resource usage and max. operating frequencies Linear-programming algorithm optimally packs cores onto FPGA Max. cores = max. ops/cycle (with pipelined cores) Use vendor-provided tools for power estimation Calculate dynamic power based on resource usage for cores CDInt8 = 2358 ops/cycle × 0.353 GHz = 833.2 GOPS CD/WInt8 = 833.2 GOPS / 15.87 W = 52.5 GOPS/W Operations mix of 50% add, 50% mult Same process used for all data types

Device Metrics: FPGA Analysis (2/2) Calculating IMB, EMB, and IOB 298 Block RAM units (BRAMs): 9-byte bus, 2 ports, 0.450 GHz operating frequency 5 DDR2 controllers: 8-byte bus, double data rate, 0.266 GHz operating frequency 840 GPIO pins: 0.8 Gb/s data rate 20 RocketIO GTX transceivers: 6.5 Gb/s data rate IMBBRAM = 298 BRAMs × 0.450 GHz × 9 bytes × 2 ports = 2413.8 GB/s EMB = 5 DDR2 × 0.266 GHz × 8 bytes × 2 (double data rate) = 21.33 GB/s IOB = DDR2 + GPIO + RocketIO GTX transceivers = 21.33 GB/s + (840 pins × 0.8 Gb/s) + (20 transceivers × 6.5 Gb/s) = 121.58 GB/s Based on max. packing of memory controllers

Metrics: Virtex-5 vs. Virtex-5QV (1/3) Resource usage of compute cores Data generated with Xilinx ISE tools + Tcl scripts Same for both devices Total resources FFs LUTs DSPs 81920 320 Xilinx Virtex-5 (XC5VFX130T_FF1738-1) Operation Data type Use DSPs? FFs DSPs LUTs Frequency (MHz) Add Int8 No 8 638.57 Yes 1 274.50 Int16 16 492.37 306.84 Int32 32 349.04 288.77 SPFP 547 416 376.08 327 2 230 418.24 DPFP 1035 777 301.30 945 3 720 343.05 Mult 82 76 353.36 488.04 302 293 377.79 445.83 1125 1133 303.31 113 4 485.91 681 619 338.07 106 91 400.96 2434 2286 210.70 484 11 315 309.98 Xilinx Virtex-5QV (XQR5VFX130_CF1752-1) Operation Data type Use DSPs? FFs DSPs LUTs Added LUTs Frequency (MHz) % of COTS frequency Add Int8 No 8 14 6 465.33 72.87 Yes 1 156.57 57.04 Int16 16 28 12 337.84 68.61 203.79 66.42 Int32 32 56 24 282.41 80.91 190.73 66.05 SPFP 547 468 52 222.57 59.18 327 2 265 35 259.27 61.99 DPFP 1035 890 113 236.57 78.52 945 3 808 88 210.84 61.46 Mult 82 84 301.30 85.27 220.41 45.16 302 309 283.69 75.09 205.80 46.16 1125 1163 30 215.47 71.04 4 77 45 414.42 85.29 681 640 21 268.82 79.52 106 99 249.44 62.21 2434 2331 187.20 88.84 484 11 362 47 270.93 87.40 FF and DSP usage same for both devices Uses more LUTs Average of ~70%

Metrics: Virtex-5 vs. Virtex-5QV (2/3) Performance and power calculations CD affected by reduction in operating frequencies and additional LUTs Calculated with linear-programming algorithm for optimal packing of cores CD/W calculated with resource usage data and Xilinx Power Estimator Xilinx Virtex-5 (XC5VFX130T_FF1738-1) Computational Density (CD) 𝐶𝐷 ­𝐼𝑛𝑡8 =𝟖𝟑𝟑.𝟐𝟎 𝑮𝑶𝑷𝑺 𝐶𝐷 ­𝐼𝑛𝑡16 ­=𝟒𝟏𝟔.𝟑𝟎 𝑮𝑶𝑷𝑺 𝐶𝐷 ­𝐼𝑛𝑡32 ­=𝟖𝟗.𝟔𝟖 𝑮𝑶𝑷𝑺 𝐶𝐷 ­𝑆𝑃𝐹𝑃 ­=𝟖𝟎.𝟐𝟑 𝑮𝑶𝑷𝑺 𝐶𝐷 ­𝐷𝑃𝐹𝑃 ­=𝟏𝟕.𝟓𝟑 𝑮𝑶𝑷𝑺 Computational Density per Watt (CD/W) 𝐶𝐷/𝑊 ­𝐼𝑛𝑡8 =833.2 𝐺𝑂𝑃𝑆 / 15.87 𝑊=𝟓𝟐.𝟓𝟎 𝑮𝑶𝑷𝑺/𝑾 𝐶𝐷/𝑊 ­𝐼𝑛𝑡8 =416.3 𝐺𝑂𝑃𝑆 / 16.83 𝑊=𝟐𝟒.𝟕𝟒 𝑮𝑶𝑷𝑺/𝑾 𝐶𝐷/𝑊 ­𝐼𝑛𝑡8 =89.680 𝐺𝑂𝑃𝑆 / 14.07 𝑊=𝟔.𝟑𝟕 𝑮𝑶𝑷𝑺/𝑾 𝐶𝐷/𝑊 ­𝐼𝑛𝑡8 ­=80.23 𝐺𝑂𝑃𝑆 / 13.28 𝑊=𝟔.𝟎𝟒 𝑮𝑶𝑷𝑺/𝑾 𝐶𝐷/𝑊 ­𝐼𝑛𝑡8 ­=17.53 𝐺𝑂𝑃𝑆 / 8.001 𝑊=𝟐.𝟏𝟗 𝑮𝑶𝑷𝑺/𝑾 Xilinx Virtex-5QV (XQR5VFX130_CF1752-1) Computational Density (CD) 𝐶𝐷 ­𝐼𝑛𝑡8 ­=𝟓𝟎𝟑.𝟕𝟐 𝑮𝑶𝑷𝑺 𝐶𝐷 ­𝐼𝑛𝑡16 ­­=𝟐𝟏𝟒.𝟓𝟕 𝑮𝑶𝑷𝑺 𝐶𝐷 ­𝐼𝑛𝑡32 ­=𝟓𝟗.𝟔𝟕 𝑮𝑶𝑷𝑺 𝐶𝐷 ­𝑆𝑃𝐹𝑃 ­=𝟓𝟏.𝟗𝟑 𝑮𝑶𝑷𝑺 𝐶𝐷 ­𝐷𝑃𝐹𝑃 ­=𝟏𝟒.𝟗𝟔 𝑮𝑶𝑷𝑺 Computational Density per Watt (CD/W) 𝐶𝐷/𝑊 ­𝐼𝑛𝑡8 =503.72 𝐺𝑂𝑃𝑆 / 11.37 𝑊=𝟒𝟒.𝟑𝟎 𝑮𝑶𝑷𝑺/𝑾 𝐶𝐷/𝑊 ­𝐼𝑛𝑡16 ­=214.57 𝐺𝑂𝑃𝑆 / 9.486 𝑊=𝟐𝟐.𝟔𝟐 𝑮𝑶𝑷𝑺/𝑾 𝐶𝐷/𝑊 ­𝐼𝑛𝑡32 ­=59.67 𝐺𝑂𝑃𝑆 / 10.09 𝑊=𝟓.𝟗𝟏 𝑮𝑶𝑷𝑺/𝑾 𝐶𝐷/𝑊 ­𝑆𝑃𝐹𝑃 ­=51.93 𝐺𝑂𝑃𝑆 / 10.1 𝑊=𝟓.𝟏𝟒 𝑮𝑶𝑷𝑺/𝑾 𝐶𝐷/𝑊 ­𝐷𝑃𝐹𝑃 =14.96 𝐺𝑂𝑃𝑆 / 8.781 𝑊=𝟏.𝟕𝟎 𝑮𝑶𝑷𝑺/𝑾 RadHard gives ~51-85% of original CD Performance hit not as significant as RadHard CPUs RadHard consumes lower power, but gives worse CD/W

Metrics: Virtex-5 vs. Virtex-5QV (3/3) Memory and input/output bandwidth calculations IMB affected by reduction in BRAM operating frequencies EMB calculated with DDR2 controller data from Xilinx ISE Roadblock: DDR2 controller not supported for Virtex-5QV tools IOB affected by reduction in data rates and pins for GPIO and RocketIO GTX transceivers Xilinx Virtex-5 (XC5VFX130T_FF1738-1) Internal Memory Bandwidth (IMB) 𝐼𝑀𝐵 𝐵𝑅𝐴𝑀 =298 𝐵𝑅𝐴𝑀𝑠×0.45 𝐺𝐻𝑧×9 𝑏𝑦𝑡𝑒𝑠×2 𝑝𝑜𝑟𝑡𝑠 =𝟐𝟒𝟏𝟑.𝟖𝟎 𝑮𝑩/𝒔 External Memory Bandwidth (EMB) 𝐸𝑀𝐵 𝐷𝐷𝑅2 =5 𝑐𝑜𝑛𝑡𝑟𝑜𝑙𝑙𝑒𝑟𝑠× 0.26667 𝐺𝐻𝑧× 8 𝑏𝑦𝑡𝑒𝑠× 2 𝑑𝑜𝑢𝑏𝑙𝑒 𝑑𝑎𝑡𝑎 =𝟐𝟏.𝟑𝟑 𝑮𝑩/𝒔 Input/Output Bandwidth (IOB) 𝐼𝑂𝐵 𝐷𝐷𝑅2 = 𝐸𝑀𝐵 𝐷𝐷𝑅2 = 𝟐𝟏.𝟑𝟑 𝑮𝑩/𝒔 𝐼𝑂𝐵 𝑅𝑜𝑐𝑘𝑒𝑡𝐼𝑂 𝐺𝑇𝑋 = 20 𝑡𝑟𝑎𝑛𝑠𝑐𝑒𝑖𝑣𝑒𝑟𝑠×6.5 𝐺𝑏/𝑠 = 𝟏𝟔.𝟐𝟓 𝑮𝑩/𝒔 𝐼𝑂𝐵 𝐺𝑃𝐼𝑂 = 840 𝑝𝑖𝑛𝑠×0.8 𝐺𝑏/𝑠 =𝟖𝟒.𝟎𝟎 𝑮𝑩/𝒔 Xilinx Virtex-5QV (XQR5VFX130_CF1752-1) Internal Memory Bandwidth (IMB) 𝐼𝑀𝐵 𝐵𝑅𝐴𝑀 =298 𝐵𝑅𝐴𝑀𝑠×0.36 𝐺𝐻𝑧×9 𝑏𝑦𝑡𝑒𝑠×2 𝑝𝑜𝑟𝑡𝑠=𝟏𝟗𝟑𝟏.𝟎𝟒 𝑮𝑩/𝒔 External Memory Bandwidth (EMB) 𝐸𝑀𝐵 𝐷𝐷𝑅2 =𝑻𝑩𝑫 Input/Output Bandwidth (IOB) 𝐼𝑂𝐵 𝐷𝐷𝑅2 = 𝐸𝑀𝐵 𝐷𝐷𝑅2 = 𝑻𝑩𝑫 𝐼𝑂𝐵 𝑅𝑜𝑐𝑘𝑒𝑡𝐼𝑂 𝐺𝑇𝑋 = 18 𝑡𝑟𝑎𝑛𝑠𝑐𝑒𝑖𝑣𝑒𝑟𝑠×4.25 𝐺𝑏/𝑠 =𝟗.𝟓𝟔 𝑮𝑩/𝒔 𝐼𝑂𝐵 𝐺𝑃𝐼𝑂 = 836 𝑝𝑖𝑛𝑠×0.8 𝐺𝑏/𝑠 =𝟖𝟑.𝟔𝟎 𝑮𝑩/𝒔 80% of COTS IMB Investigate alternate source for DDR2 controller Total IOB: 121.58 GB/s Total IOB: 𝑻𝑩𝑫

Metrics: RadHard Processors (1/2) Best integer CD; 2nd best floating-point CD 2nd best integer CD Best floating-point CD Older RadHard CPUs greatly outperformed Best integer and floating-point CD/W 2nd best floating-point CD/W 2nd best integer CD/W Older RadHard CPUs greatly outperformed Results displayed in logarithmic scale

Metrics: RadHard Processors (2/2) BRAMs in FPGA give much higher IMB than caches Older RadHard CPUs greatly outperformed Highest IOB by far, even without including DDR2 Highest EMB EMB based on controller for external L2 cache EMB still TBD No controllers for external memory Results displayed in logarithmic scale

Conclusions SRAM-based FPGAs in space are subject to radiation hazards, including errors to configuration memory Xilinx Virtex-5QV supports high-performance, high-reliability, low-power computing for next-generation space missions Comparisons with COTS counterpart Compute cores use same # of FFs and DSPs, but more LUTs Compute cores achieve average of ~70% of COTS operating frequencies RadHard achieves ~51-85% of COTS CD; lower power, but worse CD/W RadHard achieves 80% of COTS IMB; EMB and final IOB are TBD Comparisons with other RadHard processors Virtex-5QV achieves best integer CD, 2nd best floating-point CD, best CD/W Virtex-5QV achieves highest IMB and IOB; EMB and final IOB are TBD 19

CHREC Research Opportunities Next-Generation Space Processors Analysis with device metrics and benchmarking Investigation of theoretical vs. experimental performance On-Board Data Compression Exploration of pre-processing to reduce entropy Investigation of region-of-interest encoding Space Networking Analysis Investigation of key networking protocols for space Quantitative analysis with models and hardware testbeds 20

References T. M. Lovelly, K. Cheng, W. Garcia, and A. D. George, “Comparative Analysis of Present and Future Space Processors with Device Metrics," Proc. of Military and Aerospace Programmable Logic Devices Conference (MAPLD), San Diego, CA, May 19-22, 2014 T. M. Lovelly, D. Bryan, K. Cheng, R. Kreynin, A. D. George, A. Gordon-Ross, and G. Mounce, “A Framework to Analyze Processor Architectures for Next-Generation On-Board Space Computing,“ Proc. of IEEE Aerospace Conference (AERO), Big Sky, MT, Mar. 1-8, 2014 J. Williams, A. George, J. Richardson, K. Gosrani, C. Massie, H. Lam, “Characterization of Fixed and Reconfigurable Multi-Core Devices for Application Acceleration,” ACM Transactions on Reconfigurable Technology and Systems (TRETS), Vol. 3, No. 4, Nov. 2010, pp. 19:1-19:29 J. Richardson, S. Fingulin, D. Raghunathan, C. Massie, A. George, and H. Lam, “Comparative Analysis of HPC and Accelerator Devices: Computation, Memory, I/O, and Power,” Proc. of High-Performance Reconfigurable Computing Technology and Applications Workshop (HPRCTA), at SC’10, New Orleans, LA, Nov 14, 2010 N. Wulf, J. Richardson, and A. George, “Optimizing FPGA Performance, Power, and Dependability with Linear Programming,” Proc. of Military and Aerospace Programmable-Logic Devices Conference (MAPLD), San Diego, CA, April 9 - 12, 2013 Adam Jacobs, “Reconfigurable Fault Tolerance for Space Systems,” PhD Dissertation Defense, NSF Center for High-Performance Reconfigurable Computing (CHREC), ECE Department, University of Florida, March 22, 2013 Brock J. LaMeres, "FPGA-Based Radiation Tolerant Computing", University of Florida Research Colloquium, Gainesville, FL, November 9, 2012 Bourdarie, S.; Xapsos, M., "The Near-Earth Space Radiation Environment,“ IEEE Transactions on Nuclear Science, vol.55, no.4, pp.1810,1832, Aug. 2008 Lakshminarayana, V.; Karthikeyan, B.; Hariharan, V. K.; Ghatpande, N.D.; Danabalan, T.L., "Impact of Space weather on spacecraft," 10th International Conference on Electromagnetic Interference & Compatibility, pp.481,486, 26-27 Nov. 2008 Johnston, A.H., "Space Radiation Effects and Reliability Considerations for Micro- and Optoelectronic Devices," IEEE Transactions on Device and Materials Reliability, vol.10, no.4, pp.449,459, Dec. 2010 21