Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &

Slides:



Advertisements
Similar presentations
FPGA (Field Programmable Gate Array)
Advertisements

Defect Tolerance for Yield Enhancement of FPGA Interconnect Using Fine-grain and Coarse-grain Redundancy Anthony J. YuGuy G.F. Lemieux September 15, 2005.
Computer Organization and Architecture
Commercial FPGAs: Altera Stratix Family Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
Sana Rezgui 1, Jeffrey George 2, Gary Swift 3, Kevin Somervill 4, Carl Carmichael 1 and Gregory Allen 3, SEU Mitigation of a Soft Embedded Processor in.
10/14/2005Caltech1 Reliable State Machines Dr. Gary R Burke California Institute of Technology Jet Propulsion Laboratory.
+ CS 325: CS Hardware and Software Organization and Architecture Internal Memory.
Scrubbing Approaches for Kintex-7 FPGAs
Discussion of: “Terrestrial-based Radiation Upsets: A Cautionary Tale” CprE 583 Tony Kuker 12/06/05.
Fault-Tolerant Systems Design Part 1.
1 SECURE-PARTIAL RECONFIGURATION OF FPGAs MSc.Fisnik KRAJA Computer Engineering Department, Faculty Of Information Technology, Polytechnic University of.
HPEC 2012 Scrubbing Optimization via Availability Prediction (SOAP) for Reconfigurable Space Computing Quinn Martin Alan George.
Complex Upset Mitigation Applied to a Re-Configurable Embedded Processor EEL 6935 Lu Hao Wenqian Wu.
1 Fault Tolerant FPGA Co-processing Toolkit Oral defense in partial fulfillment of the requirements for the degree of Master of Science 2006 Oral defense.
April 30, Cost efficient soft-error protection for ASICs Tuvia Liran; Ramon Chips Ltd.
ICAP CONTROLLER FOR HIGH-RELIABLE INTERNAL SCRUBBING Quinn Martin Steven Fingulin.
Microprocessor Reliability
120/MAPLD 2004 Maintaining Data Integrity in EEPROM’s Ed Patnaude Maxwell Technologies San Diego, Ca.
Programmable Logic Devices
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
Evolution of implementation technologies
Programmable logic and FPGA
Configuration. Mirjana Stojanovic Process of loading bitstream of a design into the configuration memory. Bitstream is the transmission.
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Hao Ji.
Spring 07, Apr 17, 19 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Soft Errors and Fault-Tolerant Design Vishwani.
1. 2 FPGAs Historically, FPGA architectures and companies began around the same time as CPLDs FPGAs are closer to “programmable ASICs” -- large emphasis.
EE 261 – Introduction to Logic Circuits Module #8 Page 1 EE 261 – Introduction to Logic Circuits Module #8 – Programmable Logic & Memory Topics A.Programmable.
1 Fault-Tolerant Computing Systems #2 Hardware Fault Tolerance Pattara Leelaprute Computer Engineering Department Kasetsart University
Radiation Effects and Mitigation Strategies for modern FPGAs 10 th annual workshop for LHC and Future experiments Los Alamos National Laboratory, USA.
Lecture 2: Field Programmable Gate Arrays September 13, 2004 ECE 697F Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays.
A comprehensive method for the evaluation of the sensitivity to SEUs of FPGA-based applications A comprehensive method for the evaluation of the sensitivity.
Presented by Anthony B. Sanders NASA/GSFC at 2005 MAPLD Conference, Washington, DC #196 1 ALTERA STRATIX TM EP1S25 FIELD-PROGRAMMABLE GATE ARRAY (FPGA)
J. Christiansen, CERN - EP/MIC
ATMEL ATF280E Rad Hard SRAM Based FPGA SEE test results Application oriented SEU Sensitiveness Bernard BANCELIN ATMEL Nantes SAS, Aerospace Business Unit.
2/2/2009 Marina Artuso LHCb Electronics Upgrade Meeting1 Front-end FPGAs in the LHCb upgrade The issues What is known Work plan.
P173/MAPLD 2005 Swift1 Upset Susceptibility and Design Mitigation of PowerPC405 Processors Embedded in Virtex II-Pro FPGAs.
Sept. 2005EE37E Adv. Digital Electronics Lesson 1 CPLDs and FPGAs: Technology and Design Features.
MAPLD 2005/202 Pratt1 Improving FPGA Design Robustness with Partial TMR Brian Pratt 1,2 Michael Caffrey, Paul Graham 2 Eric Johnson, Keith Morgan, Michael.
Swankoski MAPLD 2005 / B103 1 Dynamic High-Performance Multi-Mode Architectures for AES Encryption Eric Swankoski Naval Research Lab Vijay Narayanan Penn.
Basic Sequential Components CT101 – Computing Systems Organization.
Synthesis Of Fault Tolerant Circuits For FSMs & RAMs Rajiv Garg Pradish Mathews Darren Zacher.
EE3A1 Computer Hardware and Digital Design
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs Ghazanfar (Hossein) Asadi and Mehdi B. Tahoori Why Soft Error Rate (SER) Estimation?
Fault-Tolerant Systems Design Part 1.
CPS3340 COMPUTER ARCHITECTURE Fall Semester, /3/2013 Lecture 9: Memory Unit Instructor: Ashraf Yaseen DEPARTMENT OF MATH & COMPUTER SCIENCE CENTRAL.
Evaluating Logic Resources Utilization in an FPGA-Based TMR CPU
1 CzajkowskiMAPLD 2005/138 Radiation Hardened, Ultra Low Power, High Performance Space Computer Leveraging COTS Microelectronics With SEE Mitigation D.
Greg Alkire/Brian Smith 197 MAPLD An Ultra Low Power Reconfigurable Task Processor for Space Brian Smith, Greg Alkire – PicoDyne Inc. Wes Powell.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
Digital Circuits Introduction Memory information storage a collection of cells store binary information RAM – Random-Access Memory read operation.
EE121 John Wakerly Lecture #15
A Simplified Approach to Fault Tolerant State Machine Design for Single Event Upsets Melanie Berg.
Delivered by.. Love Jain p08ec907. Design Styles  Full-custom  Cell-based  Gate array  Programmable logic Field programmable gate array (FPGA)
Reconfigurable Architectures Greg Stitt ECE Department University of Florida.
FPGA Technology Overview Carl Lebsack * Some slides are from the “Programmable Logic” lecture slides by Dr. Morris Chang.
1 Chapter 1 Basic Structures Of Computers. Computer : Introduction A computer is an electronic machine,devised for performing calculations and controlling.
Xilinx V4 Single Event Effects (SEE) High-Speed Testing Melanie D. Berg/MEI – Principal Investigator Hak Kim, Mark Friendlich/MEI.
Chandrasekhar 1 MAPLD 2005/204 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan.
MAPLD 2005/213Kakarla & Katkoori Partial Evaluation Based Redundancy for SEU Mitigation in Combinational Circuits MAPLD 2005 Sujana Kakarla Srinivas Katkoori.
1 Introduction to Engineering Fall 2006 Lecture 17: Digital Tools 1.
Chapter 5 - Internal Memory 5.1 Semiconductor Main Memory 5.2 Error Correction 5.3 Advanced DRAM Organization.
Programmable Logic Devices
Sequential Logic Design
SEU Mitigation Techniques for Virtex FPGAs in Space Applications
Electronics for Physicists
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Computer Architecture & Operations I
Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan1,2 Dr.Adrian Stoica3.
Electronics for Physicists
Programmable logic and FPGA
Presentation transcript:

Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military & Aerospace Business Unit

Single Event Upset (SEU) Overview for SRAM-Based FPGAs

Copyright © 2004 Altera Corporation Definitions SEU: Single Event Upset  Unwanted Change in State of a Latch or a Memory Cell SER: Soft Error Rate  SEU Rate SEFI: Single Event Functional Interrupt  Functional Failure by SEU  Not All SEUs are SEFIs  Generally Takes 5-10 SEUs to Cause SEFI

Copyright © 2004 Altera Corporation Circuit Components of SRAM-Based FPGAs I/O Registers & I/O Configuration  No Issue, Very Robust Registers, < 1 FIT Logic Registers (LEs)  No Issues, Very Robust Registers, < Hard Error Rate User Memory  Typically On-Chip Memories are “By 9” for Parity Checking  IP Available for ECC Configuration RAM (CRAM) for LUTs & Routing  Area of Focus

Copyright © 2004 Altera Corporation Upset of a CRAM Cell Data In Add Vcc Vss Clear Data Out Time Voltage 6 Transistor Cell Noise Current for 10fC Collected Charge Time (ps) Current (µA)

Copyright © 2004 Altera Corporation SEU Induced Failure Rate* DeviceLE CountSEU Rate (FIT) SEFI Rate (FIT) MTBF** (Years) EP1C66K250601,900 Years EP1C2020K Years EP1S2526K Years EP1S8079K Years * Data at Sea Level **MTBF: Mean Time Between Functional Interrupt

Copyright © 2004 Altera Corporation Number of CRAM Bit Upsets for Each Occurrence of Functional Upset Median ~6 Median 5

Addressing System-Level Issues

Copyright © 2004 Altera Corporation SER Improvements/Mitigation Chip Design Enhancements  New Materials & Process Enhancements  Larger CRAM Structure  Increase in Capacitance on Critical Node  Smaller Process => Smaller Die => Lower SEU Probability  Built-In Error Detection/Correction Circuitry

Copyright © 2004 Altera Corporation SER Per SRAM Bit Trend Process Technology Year 0.5 µm µm 2002 SER per SRAM MBit 100 FITS 1,000 FITS 90 nm Projection

Copyright © 2004 Altera Corporation System Level Improvements Mitigation ECC for User Memory Use Detection/Correction Feature Triple Module Redundancy (TMR) To Achieve Lower Error Rate & Less Downtime Migrate to Structured ASIC

Copyright © 2004 Altera Corporation Soft Error Detection Methods Configuration RAM Readout  Read-Out Full Bitstream  Compare with Stored Bitstream  Can Determine where in Configuration Error Occurred Caveat: Security Issues with Reading Out Bitstream Stored CRAM Data Stored CRAM Data FPGA Microprocessor or CPLD Microprocessor or CPLD Same or Different?

Copyright © 2004 Altera Corporation Soft Error Detection Methods On-Chip SEU Detection  Dedicated Comparison Circuitry e.g. CRC Engine Comparing Stored CRC with That Calculated from Configuration RAM  Detection Circuitry Running Continuously  Error Detection Rate Variable Based on Implementation of Hardware, Number of CRAM Bits & Input Clock Frequency  Error Signal Available Internally or Externally Caveat: Cannot Determine Where in Configuration Error Occurred Computed Value Stored Value To Core = FPGA

Copyright © 2004 Altera Corporation On-Chip Detection Example Dedicated CRC Circuit  Configuration RAM Verification Capability 32-Bit Cyclic Redundancy Code Check Verified Against Internally Stored Value Runs in the Background Without Impacting Device Performance  Close to Real-Time Detection Variable Clock Frequency Depends on Number of CRAM Bits  Multi-Event Detection Up to 3-Bit for 32-Bit CRC  Result Output to Either Core or Pin Use with Either Internal or External Hardware for Error Correction

Copyright © 2004 Altera Corporation Correction Methods FPGA Detection, System-Level Correction  Lower Total Cost  Downtime Is Limited & Manageable  Used in Non-Critical Applications Triple Module Redundancy  Two Flavors All On-Chip in FPGA Separate Chips & Voter  Correction Can Be Real-Time  Used in Critical Applications

Copyright © 2004 Altera Corporation Single System Detection & Correction Step One: Detect the Soft Error  75% of Reported Errors Are “Don’t Care” Errors Step Two: Alert the System Step Three: Fix the Error  In Some Cases, Re-Program the FPGA  In Some Cases, Reboot the Sub-System  In Some Cases, Reboot the System Need to Focus on System “Downtime”  Each System Has Unique Requirements  Re-Programming FPGA Takes < 250 ms  Rebooting Time Varies & Can Be Fast “by Design”

Copyright © 2004 Altera Corporation TMR Method 1 Identical Hardware in FPGAs Use Voter Implemented in FPGA or CPLD Utilize Either Hardware Output or CRC Error Pin Voter Also Used to Signal Reconfiguration on Difference or Error FPGA Hardware1 FPGA Hardware1 FPGA Hardware3 FPGA Hardware3 FPGA Hardware 2 FPGA Hardware 2 FPGA or CPLD (Voting) FPGA or CPLD (Voting)

Copyright © 2004 Altera Corporation TMR Method 2 Multiple Instantiations of Hardware in Single FPGA For Low-Rate SEUs SEU Events May Occur Much More Frequently than Functional Error (De-Rating) Voter Signals Reconfiguration of FPGA FPGA Must be Reconfigured Voting Circuit Voting Circuit FPGA Hardware 1 Hardware 2 Hardware 3

Copyright © 2004 Altera Corporation De-Rating Methodology Only a Fraction of Configuration Bits Are Actually Programmed  e.g. Using Only Two Inputs of 4-Input LUT Leaves 75% of LUT as “Don’t Care”  Only About 20% of Routing Is Used  Depends on Utilization & Application Some Un-Programmed Bits Still Matter  Flipping Could Change Function of the Device Extensive Experimentation Shows a Range From 1/8 to 1/3 of the Bits Matter

Copyright © 2004 Altera Corporation Structured ASIC: Ultimate SEU Protection No Configuration Memory = Estimated SER is below Hard Failure Rate for the Device FPGA Structured ASIC PLD Architecture with ASIC Routing

Copyright © 2004 Altera Corporation Summary SEU is a Well Understood Phenomena Many Chip Level Enhancements Mitigate SEUs  Process  Design  Manufacturing Techniques Easy Detection of SEU Events is Key After Detection, Other Methods Must be Employed to Deal with the Event  Critical Nature of Application Determines Level of SEU Response Structured ASICs from FPGA Designs Offer a Much More Robust Solution Due to Removal of All CRAM