Evaluation of Power Costs in Triplicated FPGA Designs

Slides:



Advertisements
Similar presentations
What are FPGA Power Management HDL Coding Techniques Xilinx Training.
Advertisements

Spartan-3 FPGA HDL Coding Techniques
Sana Rezgui 1, Jeffrey George 2, Gary Swift 3, Kevin Somervill 4, Carl Carmichael 1 and Gregory Allen 3, SEU Mitigation of a Soft Embedded Processor in.
10/14/2005Caltech1 Reliable State Machines Dr. Gary R Burke California Institute of Technology Jet Propulsion Laboratory.
Scrubbing Approaches for Kintex-7 FPGAs
Multi-Bit Upsets in the Virtex Devices Heather Quinn, Paul Graham, Jim Krone, Michael Caffrey Los Alamos National Laboratory Gary Swift, Jeff George, Fayez.
1 Fault Tolerant FPGA Co-processing Toolkit Oral defense in partial fulfillment of the requirements for the degree of Master of Science 2006 Oral defense.
ICAP CONTROLLER FOR HIGH-RELIABLE INTERNAL SCRUBBING Quinn Martin Steven Fingulin.
Fast FPGA Resource Estimation Paul Schumacher & Pradip Jha Xilinx, Inc.
Power Visualization, Analysis, and Optimization Tools for FPGAs Matthew French, Li Wang and Michael Wirthlin USC, Information Sciences Institute, BYU.
XPower for CoolRunner™-II CPLDs
Floating-Point FPGA (FPFPGA) Architecture and Modeling (A paper review) Jason Luu ECE University of Toronto Oct 27, 2009.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Fault-Tolerant Softcore Processors Part I: Fault-Tolerant Instruction Memory Nathaniel Rollins Brigham Young University.
FPGA Implementation of Closed-Loop Control System for Small-Scale Robot.
12004 MAPLD/1002??? When Should You and When Should You Not Use VHDL? Richard B. Katz NASA Office of Logic Design 2004 MAPLD International Conference September.
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
The Spartan 3e FPGA. CS/EE 3710 The Spartan 3e FPGA  What’s inside the chip? How does it implement random logic? What other features can you use?  What.
The Memory/Logic Interface in FPGA’s with Large Embedded Memory Arrays The Memory/Logic Interface in FPGA’s with Large Embedded Memory Arrays Steven J.
CS 151 Digital Systems Design Lecture 38 Programmable Logic.
© 2003 Xilinx, Inc. All Rights Reserved Power Estimation.
156 / MAPLD 2005 Rollins 1 Reducing Energy in FPGA Multipliers Through Glitch Reduction Nathan Rollins and Michael J. Wirthlin Department of Electrical.
French 207 MAPLD 2005 Slide 1 Integrated Tool Suite for Post Synthesis FPGA Power Consumption Analysis Matthew French, Li Wang University of Southern California,
Radiation Effects and Mitigation Strategies for modern FPGAs 10 th annual workshop for LHC and Future experiments Los Alamos National Laboratory, USA.
Introduction to FPGA AVI SINGH. Prerequisites Digital Circuit Design - Logic Gates, FlipFlops, Counters, Mux-Demux Familiarity with a procedural programming.
Reduced Cost Reliability via Statistical Model Detection Jon-Paul Anderson- PhD Student Dr. Brent Nelson- Faculty Dr. Mike Wirthlin- Faculty Brigham Young.
Power Reduction for FPGA using Multiple Vdd/Vth
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
Spring Introduction  Today’s tutorial focuses on introducing you to Xilinx ISE and Modelsim.  These tools are used for Verilog Coding Simulation.
XPower for CoolRunner™ XPLA3 CPLDs. Quick Start Training Overview Design power considerations Power consumption basics of CMOS devices Calculating power.
Department of Communication Engineering, NCTU 1 Unit 5 Programmable Logic and Storage Devices – RAMs and FPGAs.
FORMAL VERIFICATION OF ADVANCED SYNTHESIS OPTIMIZATIONS Anant Kumar Jain Pradish Mathews Mike Mahar.
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
MAPLD 2005/202 Pratt1 Improving FPGA Design Robustness with Partial TMR Brian Pratt 1,2 Michael Caffrey, Paul Graham 2 Eric Johnson, Keith Morgan, Michael.
Swankoski MAPLD 2005 / B103 1 Dynamic High-Performance Multi-Mode Architectures for AES Encryption Eric Swankoski Naval Research Lab Vijay Narayanan Penn.
Design of a Novel Bridge to Interface High Speed Image Sensors In Embedded Systems Tareq Hasan Khan ID: ECE, U of S Term Project (EE 800)
© 2003 Xilinx, Inc. All Rights Reserved Global Timing Constraints FPGA Design Flow Workshop.
Introduction to FPGA Tools
1 Leakage Power Analysis of a 90nm FPGA Authors: Tim Tuan (Xilinx), Bocheng Lai (UCLA) Presenter: Sang-Kyo Han (ECE, University of Maryland) Published.
2011/IX/27SEU protection insertion in Verilog for the ABCN project 1 Filipe Sousa Francis Anghinolfi.
Evaluating Logic Resources Utilization in an FPGA-Based TMR CPU
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
DDRIII BASED GENERAL PURPOSE FIFO ON VIRTEX-6 FPGA ML605 BOARD PART B PRESENTATION STUDENTS: OLEG KORENEV EUGENE REZNIK SUPERVISOR: ROLF HILGENDORF 1 Semester:
Paper by F.L. Kastensmidt, G. Neuberger, L. Carro, R. Reis Talk by Nick Boyd 1.
Xilinx V4 Single Event Effects (SEE) High-Speed Testing Melanie D. Berg/MEI – Principal Investigator Hak Kim, Mark Friendlich/MEI.
Chandrasekhar 1 MAPLD 2005/204 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan.
MAPLD 2005/213Kakarla & Katkoori Partial Evaluation Based Redundancy for SEU Mitigation in Combinational Circuits MAPLD 2005 Sujana Kakarla Srinivas Katkoori.
Introduction to the FPGA and Labs
Presenter: Darshika G. Perera Assistant Professor
Floating-Point FPGA (FPFPGA)
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
An FPGA Implementation of a Brushless DC Motor Speed Controller
Xin Fang, Pei Luo, Yunsi Fei, and Miriam Leeser
MAPLD 2005 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan Dr. V. Kamakoti.
ECE 4110–5110 Digital System Design
SEU Mitigation Techniques for Virtex FPGAs in Space Applications
Instructor: Dr. Phillip Jones
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
MAPLD 2005 BOF-L Mitigation Methods for
LOW POWER DIGITAL VIDEO COMPRESSION HARDWARE DESIGN
We will be studying the architecture of XC3000.
The Xilinx Virtex Series FPGA
Timing Analysis 11/21/2018.
A 100 µW, 16-Channel, Spike-Sorting ASIC with On-the-Fly Clustering
FPGA Tools Course Answers
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs
Dynamic High-Performance Multi-Mode Architectures for AES Encryption
FPGA Glitch Power Analysis and Reduction
The Xilinx Virtex Series FPGA
Hardware Assisted Fault Tolerance Using Reconfigurable Logic
Presentation transcript:

Evaluation of Power Costs in Triplicated FPGA Designs Nathan Rollins1, Michael J. Wirthlin1, and Paul Graham2 1 2 Department of Electrical and Computer Engineering Brigham Young University Provo, UT Los Alamos National Laboratory Los Alamos, NM This work was supported by Los Alamos National Laboratory, U.S. Department of Energy (LA-UR #04-5285)

Single-Event Effects of SRAM FPGAs Static memory sensitive to single-event upsets (SEU) Large amount of static memory in FPGAs Configuration memory User memory and flip-flops Upsets within the configuration memory may modify the behavior of the design Internal Logic and Interconnect Global clocking and configuration modes Operation of Input/Output pads Design redundancy necessary for proper operation

SEU Mitigation Through TMR Triple Modular Redundancy (TMR) Triplicate circuit of interest Vote on circuit output (majority voter) TMR masks all single failures in design Design Voter

Area and Speed Costs of TMR Bulletproof design is possible with 3-voter TMR or feedback TMR and triplicated clocks Design Voter OBUF BUFG Area Cost: 3x – 6x Speed Cost: up to 50% slower

Power Costs of TMR FPGAs consume relatively more power than ASICs TMR may cause a triplication of an already large power consumption Goal: Investigate the cost of TMR in terms of power consumption

Power Analysis Tools Xilinx’s XPower tool JPower tool Estimates power consumption JPower tool Measures power consumption JPower power measurement tool can be used to validate XPower power estimation tool

XPower Xilinx power measurement tool Requires: *.ncd and *.pcf files or *.par.ncd file Activity rates for EVERY net in the design 3 ways to assign net activity rates: Manually in the XPower tool (GOOD LUCK!) Import *.xml file Import *.vcd or *.xad files

XPower Importing *.xml files: XPower creates XML files as a way of saving session settings – why not create them ourselves ? JHDL used to get activity rates 100% nets assigned activity rate XPower reports “accurate” confidence level JHDL cannot simulate glitches This method underestimates true power consumption

XPower Importing *.vcd files: Use back-annotation (ngd2vhdl) & ModelSim to create *.vcd files Accurate timing information available (*.sdf file) provides most accurate results

JPower Tool to measure total current on SLAAC1V board (in mA) Measurement functions added to SLAAC1V C API by LANL - summer 2003 Tool named after LANL intern Jason Zimmerman

JPower Current Measurments: 2.5V channel sampled at 120 kHz Current stored in a register as a 10-bit number Value in register multiplied by 4.8828125 mA and rounded to the nearest mA Max current reading: 4990mA Averaged current measurements are repeatable to within a few mA

Calibrating JPower and XPower Blank Design – run JPower with no designs on SLAAC1V board (default X0 design) Run JPower tool with and without clock running at frequencies from 10MHz – 100MHz Results from this blank design provides an equation which allows us to isolate the power measurements to a single FPGA on the SLAAC1V board

Test Calibration Four 8-bit Designs (non-TMR & TMR): Incrementer 72 incrementers – each output to IOBs XOR’ed Incrementer 416 incrementers – groups of outputs XOR’ed Up/Down Loadable Counter 416 counters – final counter bits output to IOBs 3 CLK Implementations of above designs

Calibration Results

Frequency vs. Power Slopes Calibration Results “Close” match between JPower & XPower for non-TMR designs and for TMR designs with triplicated clock Large slope difference in counter TMR design Non - TMR TMR 3 CLK (TMR) INC XOR CNT Frequency vs. Power Slopes JPower 1.54 7.85 11.08 7.37 31.13 47.53 5.37 30.67 48.14 XPower 7.95 9.26 5.23 27.06 39.03 5.28 30.62 49.94 Area Costs LUTs 576 3250 3328 1728 9750 19968

TMR Power Studies Effects of design placement on power consumption Power costs of realistic designs Hitachi 8-bit CPU QPSK demodulator Power costs of different Xilinx architectures

Effects of Design Placement on Power Consumption Incrementer TMR design used to test effects of placement on power 3 different placements used: Placement 1 Placement 2 Placement 3 Cnt0-0 Cnt0-1 Cnt0-2 Cnt1-0 Cnt1-1 Cnt1-2 Cnt2-0 Cnt2-1 Cnt2-2 …

Frequency vs. Power Slopes Effects of Placement Comparison of frequency vs. power slopes for auto-placed and hand-placed designs shows the effects of placement “JP/XP” row ratio indicates how closely JPower and XPower match Incrementer XOR Incrementer Up/Down Counter Auto-Place Hand-Place Frequency vs. Power Slopes JPower 7.37 4.78 31.13 22.18 47.53 41.22 XPower 5.23 4.76 27.06 25.10 39.03 36.40 JP / XP 1.41 1.00 1.15 0.88 1.22 1.13

Effects of Placement JPower is much more sensitive to placement than Xpower JPower and XPower closest with placement 3 (optimized hand-placement) Auto-Place Place 1 Place 2 Place 3 Frequency vs. power slopes (TMR) JPower 7.37 10.65 6.15 4.76 XPower 5.23 6.20 5.21 4.78 Power increase due to TMR (TMR / non-TMR) 4.79x 7.04x 4.06x 3.15x 3.40x 4.04x 3.39x 3.10x

Realistic Designs: Area and Speed Costs QPSK Demodulator Hitachi CPU Virtex TMR Area Cost 3.03x 3.01x TMR Speed Cost 95.2 % 71.1 % Virtex2 3.00x 84.6 % 100.0 % Virtex2Pro 77.9 % 80.8 % Spartan 3 3.02x 97.2% 87.0 % Area and speed costs due to TMR

Realistic Designs: Power Costs

Realistic Designs: Power Costs Non - TMR TMR JPower Virtex Virtex2 Virtex2Pro Spartan3 Dynamic Power (mW) QPSK 40.50 45.71 8.60 8.16 1.97 93.75 150.64 30.17 24.98 6.68 Hitachi 2.06 2.34 0.79 0.48 0.12 5.48 7.30 2.10 1.39 0.30 Static 28.57 22.14 150 336.86 179.83 26.43 37.86 139.5 334.71 182.23 27.17 337.07 180.00 28.25 27.50 337.50 180.34 NOTE: JPower measures power on Virtex parts only JPower Virtex Virtex2 Virtex2Pro Spartan3 Dynamic Power Increase Due to TMR (TMR / non-TMR) QPSK 2.53x 3.30x 3.51x 3.06x 3.39x Hitachi 2.66x 3.12x 2.88x 2.50x

Power Study Results Power consumption increase due to TMR: ~3x Poor design mapping can cause higher power consumption Static power can be a significant contribution to total power consumption FPGA architecture is significant in determining power consumption