IPR: In-Place Reconfiguration for FPGA Fault Tolerance Zhe Feng 1, Yu Hu 1, Lei He 1 and Rupak Majumdar 2 1 Electrical Engineering Department 2 Computer.

Slides:



Advertisements
Similar presentations
Survey of Detection, Diagnosis, and Fault Tolerance Methods in FPGAs
Advertisements

Match and Replace — A Functional ECO Engine for Multi-Error Circuit Rectification Shao-Lun Huangy, Wei-Hsun Linz, Chung-Yang (Ric) Huangyz ICCAD’11.
Abdullah Aldahami ( ) Jan 29,  This paper propose a new resynthesis algorithm for FPGA area reduction.  The existing resynthesis techniques.
Address comments to FPGA Area Reduction by Multi-Output Sequential Resynthesis Yu Hu 1, Victor Shih 2, Rupak Majumdar 2 and Lei He 1 1.
Exploiting Symmetry in SAT-Based Boolean Matching for Heterogeneous FPGA Technology Mapping Yu Hu 1, Victor Shih 2, Rupak Majumdar 2 and Lei He 1 1 Electrical.
ECE 667 Synthesis & Verificatioin - FPGA Mapping 1 ECE 667 Synthesis and Verification of Digital Systems Technology Mapping for FPGAs D.Chen, J.Cong, DAOMap.
Cross-layer Optimized Placement and Routing for FPGA Soft Error Mitigation Keheng Huang 1,2, Yu Hu 1, and Xiaowei Li 1 1 Key Laboratory of Computer System.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
FPGA Latency Optimization Using System-level Transformations and DFG Restructuring Daniel Gomez-Prado, Maciej Ciesielski, and Russell Tessier Department.
Logic Simulation 4 Outline –Fault Simulation –Fault Models –Parallel Fault Simulation –Concurrent Fault Simulation Goal –Understand fault simulation problem.
Combining Technology Mapping and Retiming EECS 290A Sequential Logic Synthesis and Verification.
Exploiting Symmetry in SAT-Based Boolean Matching for Heterogeneous FPGA Technology Mapping Yu Hu 1, Victor Shih 2, Rupak Majumdar 2 and Lei He 1 1 Electrical.
Address comments to Robust FPGA Resynthesis Based on Fault-Tolerant Boolean Matching Yu Hu 1, Zhe Feng 1, Lei He 1 and Rupak Majumdar 2.
Yu Hu1, Satyaki Das2 Steve Trimberger2, and Lei He1
1 FRAIGs: Functionally Reduced And-Inverter Graphs Adapted from the paper “FRAIGs: A Unifying Representation for Logic Synthesis and Verification”, by.
Logic Design Outline –Logic Design –Schematic Capture –Logic Simulation –Logic Synthesis –Technology Mapping –Logic Verification Goal –Understand logic.
DAG-Aware AIG Rewriting Alan Mishchenko, Satrajit Chatterjee, Robert Brayton Department of EECS, University of California Berkeley Presented by Rozana.
 Y. Hu, V. Shih, R. Majumdar and L. He, “Exploiting Symmetries to Speedup SAT-based Boolean Matching for Logic Synthesis of FPGAs”, TCAD  Y. Hu,
1 Application Specific Integrated Circuits. 2 What is an ASIC? An application-specific integrated circuit (ASIC) is an integrated circuit (IC) customized.
مرتضي صاحب الزماني  The registers are master-slave flip-flops (a.k.a. edge-triggered) –At the beginning of each cycle, propagate values from primary inputs.
USING SAT-BASED CRAIG INTERPOLATION TO ENLARGE CLOCK GATING FUNCTIONS Ting-Hao Lin, Chung-Yang (Ric) Huang Graduate Institute of Electrical Engineering,
Accuracy-Configurable Adder for Approximate Arithmetic Designs
Power Reduction for FPGA using Multiple Vdd/Vth
SiLab presentation on Reliable Computing Combinational Logic Soft Error Analysis and Protection Ali Ahmadi May 2008.
05/04/06 1 Integrating Logic Synthesis, Tech mapping and Retiming Presented by Atchuthan Perinkulam Based on the above paper by A. Mishchenko et al, UCAL.
FORMAL VERIFICATION OF ADVANCED SYNTHESIS OPTIMIZATIONS Anant Kumar Jain Pradish Mathews Mike Mahar.
THE TESTING APPROACH FOR FPGA LOGIC CELLS E. Bareiša, V. Jusas, K. Motiejūnas, R. Šeinauskas Kaunas University of Technology LITHUANIA EWDTW'04.
Combinational and Sequential Mapping with Priority Cuts Alan Mishchenko Sungmin Cho Satrajit Chatterjee Robert Brayton UC Berkeley.
DAOmap: A Depth-optimal Area Optimization Mapping Algorithm for FPGA Designs Deming Chen and Jason Cong Computer Science Department University of California,
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs Ghazanfar (Hossein) Asadi and Mehdi B. Tahoori Why Soft Error Rate (SER) Estimation?
ATS Exploiting Free LUT Entries to Mitigate Soft Errors in SRAM- based FPGAs Keheng Huang, Yu Hu, Xiaowei Li Institute of Computing Technology Chinese.
Detecting Errors Using Multi-Cycle Invariance Information Nuno Alves, Jennifer Dworak, and R. Iris Bahar Division of Engineering Brown University Providence,
1 Stephen Jang Kevin Chung Xilinx Inc. Alan Mishchenko Robert Brayton UC Berkeley Power Optimization Toolbox for Logic Synthesis and Mapping.
Using Memory to Cope with Simultaneous Transient Faults Authors: Universidade Federal do Rio Grande do Sul Programa de Pós-Graduação em Engenharia Elétrica.
Optimality FPGA Technology Mapping: A Study of Optimality Andrew C. Ling M.A.Sc. Candidate University of Toronto Deshanand P. Singh Ph.D. Altera Corporation.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
DAOmap: A Depth-optimal Area Optimization Mapping Algorithm for FPGA Designs Deming Chen, Jason Cong , Computer Science Department , UCLA Presented.
Fast Synthesis of Clock Gating from Existing Logic Aaron P. Hurst Univ. of California, Berkeley Portions In Collaboration with… Artur Quiring and Andreas.
Gill 1 MAPLD 2005/234 Analysis and Reduction Soft Delay Errors in CMOS Circuits Balkaran Gill, Chris Papachristou, and Francis Wolff Department of Electrical.
Chandrasekhar 1 MAPLD 2005/204 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan.
Kandemir224/MAPLD Reliability-Aware OS Support for FPGA-Based Systems M. Kandemir, G. Chen, and F. Li Department of Computer Science & Engineering.
1 Architecture of Datapath- oriented Coarse-grain Logic and Routing for FPGAs Andy Ye, Jonathan Rose, David Lewis Department of Electrical and Computer.
Fault-Tolerant Resynthesis for Dual-Output LUTs Roy Lee 1, Yu Hu 1, Rupak Majumdar 2, Lei He 1 and Minming Li 3 1 Electrical Engineering Dept., UCLA 2.
IPF: In-Place X-Filling to Mitigate Soft Errors in SRAM-based FPGAs
Robust FPGA Resynthesis Based on Fault-Tolerant Boolean Matching
MAPLD 2005 Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan Dr. V. Kamakoti.
Power Optimization Toolbox for Logic Synthesis and Mapping
Architecture Synthesis for Cost Constrained Fault Tolerant Biochips
Mapping into LUT Structures
Delay Optimization using SOP Balancing
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Samuel Luckenbill1, Ju-Yueh Lee2, Yu Hu3, Rupak Majumdar1, and Lei He2
Applying Logic Synthesis for Speeding Up SAT
Integrating an AIG Package, Simulator, and SAT Solver
Verilog to Routing CAD Tool Optimization
Standard-Cell Mapping Revisited
SAT-Based Area Recovery in Technology Mapping
Alan Mishchenko University of California, Berkeley
Soft Error Detection for Iterative Applications Using Offline Training
SAT-Based Optimization with Don’t-Cares Revisited
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs
FPGA Logic Synthesis using Quantified Boolean Satisfiability
FPGA Glitch Power Analysis and Reduction
On the Improvement of Statistical Timing Analysis
Integrating an AIG Package, Simulator, and SAT Solver
Improvements in FPGA Technology Mapping
Delay Optimization using SOP Balancing
Chapter 3b Leakage Efficient Chip-Level Dual-Vdd Assignment with Time Slack Allocation for FPGA Power Reduction Prof. Lei He Electrical Engineering Department.
Alan Mishchenko Department of EECS UC Berkeley
Integrating AIG Package, Simulator, and SAT Solver
Presentation transcript:

IPR: In-Place Reconfiguration for FPGA Fault Tolerance Zhe Feng 1, Yu Hu 1, Lei He 1 and Rupak Majumdar 2 1 Electrical Engineering Department 2 Computer Science Department University of California, Los Angeles Present by Zhe Feng Address comments to

Outline Introduction and motivation Introduction and motivation Algorithms Algorithms Experimental Results Experimental Results Conclusions Conclusions

Soft Error Soft errors could be caused by cosmic rays or noise upsets Future devices more vulnerable due to scaling Special session 1E “Resilient Computing” Two types of soft errors in FPGA Single Event Upset (SEU): Modification of the content of memory bits Single Event Transient (SET): Glitches latched by registers

SEU for FPGA SEU of block memory can be detected and corrected by row-based CRC and ECC SEU of configuration memory can be fixed by Periodical memory scrubbing. Scan-based CRC and ECC Both may be too late, as the circuit function may have been changed.

SER (Soft Error Rate) SER is calculated by Monte Carlo simulation under single fault model. SER is calculated by Monte Carlo simulation under single fault model. In each run, SER is the percentage of clock cycles with observable errors at primary output for given test bench In each run, SER is the percentage of clock cycles with observable errors at primary output for given test bench The overall SER is the average of all runs. The overall SER is the average of all runs. SER  1/ MTTF (mean time to failures) SER  1/ MTTF (mean time to failures)

Impact of SEU for FPGA FGPA has 10x bigger SER compared to ASIC Due to large configuration memory SEU is one of biggest challenges for FPGA-based applications Most FPGAs are used in systems but not prototypes One of the biggest application is internet routers FPGA boards returned after two crashes

FPGA Resynthesis Resynthesis Resynthesis Rewrites the circuit in logic or physical netlist Rewrites the circuit in logic or physical netlist Reconfigures the LUTs Reconfigures the LUTs (Source: Andrew Ling, University of Toronto, DAC'05) RTL Synthesis Logic Synthesis Technology Mapping ResynthesisPackingP&R

ROSE performs iterative logic transformations with explicit stochastic yield rate evaluation ROSE performs iterative logic transformations with explicit stochastic yield rate evaluation Logic transformation by fault tolerance Boolean Matching Boolean Matching Inputs Template H and Boolean function F for logic block Fault rates for the inputs and the SRAM bits of the template Outputs Either that F cannot be implemented by template H Or the configuration of H to obtain function F ROSE: RObust REsynthesis [ICCAD08’] Fault-Tolerant Boolean Matching minimizes the observable faults at the output of the template

Need of In-place Logic Optimization ROSE, same as most existing logic optimization techniques, does not preserve the layout (topology) of a circuit design. ROSE, same as most existing logic optimization techniques, does not preserve the layout (topology) of a circuit design. Interconnect dominates in FPGA In-place resynthesis (IPR) leads to a faster design closure. In-place resynthesis (IPR) leads to a faster design closure. Minimal or no impact on the physical design Minimal or no impact on the physical design IPR ROSE

Our Major Contributions Propose an in-place resynthesis algorithm, IPR Propose an in-place resynthesis algorithm, IPR Maximize the yield rate for FPGAs Maximize the yield rate for FPGAs Preserve the topology of the logic network Preserve the topology of the logic network Reduce the runtime complexity compared to other SAT-based approaches Reduce the runtime complexity compared to other SAT-based approaches IPR reduces the fault rate by 48% and increases MTTF by 1.94X. IPR reduces the fault rate by 48% and increases MTTF by 1.94X. Compared to the state-of-the-art academic technology mapper Berkeley ABC. Compared to the state-of-the-art academic technology mapper Berkeley ABC. With the same area and performance. With the same area and performance.

Outline Background Background Algorithms Algorithms Experimental Results Experimental Results Conclusions Conclusions

IPR: In-place Reconfiguration (0 -> 1) Fault rate = 37.5% Fault rate = 12.5% Maximize identical configuration bits for complementary inputs of an LUT. Maximize identical configuration bits for complementary inputs of an LUT. Change the functions of multiple LUTs to guarantee the function of the circuit unchanged. Change the functions of multiple LUTs to guarantee the function of the circuit unchanged.

IPR algorithm Initial Full-chip Functional Simulation Initial Full-chip ODC Mask Calculation Node Criticality Analysis Cone Construction In-place LUT Reconfiguration and Boolean Matching Localize Truth Table Update Localize ODC Mask Update Circuit Analysis Localize Update

IPR algorithm Initial Full-chip Functional Simulation Initial Full-chip ODC Mask Calculation Node Criticality Analysis Cone Construction In-place LUT Reconfiguration and Boolean Matching Localize Truth Table Update Localize ODC Mask Update Circuit Analysis Localize Update

ODC Mask based Node Criticality LogicNetwork … Primary outputs ODC mask: 1010 (I. Markov, ICCAD’07) The ODC mask quantifies the impact of a node on the primary outputs. The ODC mask quantifies the impact of a node on the primary outputs. The criticality of a node is defined as the percentage of one’s in the ODC mask, and decides the priority of reconfiguration in IPR. The criticality of a node is defined as the percentage of one’s in the ODC mask, and decides the priority of reconfiguration in IPR.

IPR algorithm Initial Full-chip Functional Simulation Initial Full-chip ODC Mask Calculation Node Criticality Analysis Cone Construction In-place LUT Reconfiguration and Boolean Matching Localize Truth Table Update Localize ODC Mask Update Circuit Analysis Localize Update

Cone Construction Select a subset S N of first-order fanout LUTs of n Construct a cone for a selected root LUT Root LUT is a fanout of S N Include S N but not its first-order fanins Cut size of the cone is limited a n d c b e Root

In-place LUT Reconfiguration The functions of LUTs in the cone are changed to increase # of identical configuration pairs But function of input/out nets and topology of internal nets are kept unchanged  No change of circuit function and layout a n d c b e Root

In-place Boolean Matching Conjunctive Normal Form (CNF) Truth table can be encoded as follows Truth table can be encoded as follows The cone can be encoded as follows The cone can be encoded as follows To make a pair of configuration bits (ci, cj) To make a pair of configuration bits (ci, cj) in LUT L symmetric, we have in LUT L symmetric, we have Combining all the three, we have CNF Combining all the three, we have CNF formulation for in-place Boolean matching (IP-BM). formulation for in-place Boolean matching (IP-BM). IP-BM preserves both the logic function and topology of the cone. IP-BM preserves both the logic function and topology of the cone.

Outline Background Background Algorithms Experimental Results Experimental Results Conclusions Conclusions

Experimental Settings and CAD Flows Implemented in C++ and use miniSAT2.0 as the SAT solver Implemented in C++ and use miniSAT2.0 as the SAT solver Results collected on a Ubuntu workstation with 2.6GHz Xeon CPU and 2GB memory Results collected on a Ubuntu workstation with 2.6GHz Xeon CPU and 2GB memory QUIP benchmarks are tested QUIP benchmarks are tested Mapped with 4-LUTs by Berkeley ABC Mapped with 4-LUTs by Berkeley ABC Perform and compare the following synthesis flows: ABC, IPR, ROSE+IPR Perform and compare the following synthesis flows: ABC, IPR, ROSE+IPR

Experimental Settings and CAD Flows (Cont’) Fault model Fault model Uniform soft error rate for all configuration bits in LUT but ignore interconnect configuration bits during IPR. Uniform soft error rate for all configuration bits in LUT but ignore interconnect configuration bits during IPR. Uniform soft error rate for all configuration bits in LUT and interconnect during validation. Uniform soft error rate for all configuration bits in LUT and interconnect during validation. The fault rate of the chip is calculated by Monte Carlo simulation The fault rate of the chip is calculated by Monte Carlo simulation Single fault injection for all configuration bits in LUT and interconnect Single fault injection for all configuration bits in LUT and interconnect 32k random inputs 32k random inputs

Full-chip Fault Rate by Monte Carlo Simulation 59% fault rate reduction! ABC vs. IPR vs. ROSE+IPR: 1:0.52:0.51 ABC vs. IPR vs. ROSE+IPR: 1:0.52:0.51

Area (LUT#) ABC vs. IPR vs. ROSE+IPR: 1: 1 : 0.81 ABC vs. IPR vs. ROSE+IPR: 1: 1 : 0.81

Estimation of Mean Time To Failure The best flow in terms of the robustness and area is ROSE+IPR The best flow in terms of the robustness and area is ROSE+IPR 50x faster!

Conclusions We develop an in-place resynthesis algorithm, IPR. We develop an in-place resynthesis algorithm, IPR. Increases MTTF by 2X over ABC; Increases MTTF by 2X over ABC; Preserves the topology of the logic network for a faster design closure; Preserves the topology of the logic network for a faster design closure; Complementary to existing fault-tolerant resynthesis algorithms. Complementary to existing fault-tolerant resynthesis algorithms. In the future, we will consider In the future, we will consider Experiments assume multiple uncorrelated faults and given correlations between faults; Experiments assume multiple uncorrelated faults and given correlations between faults; Extend IPR with criticality considering interconnects explicitly. Extend IPR with criticality considering interconnects explicitly.

Thank You! IPR: In-Place Reconfiguration for FPGA Fault Tolerance Zhe Feng, Yu Hu, Lei He and Rupak Majumdar

Backup Slides

Criticality for Configuration Bit Depends on two criteria: Depends on two criteria: One is a sequence of input vectors for the LUT. One is a sequence of input vectors for the LUT. The other is the ODC mask of the LUT. The other is the ODC mask of the LUT. The criticality of a configuration bit c : The criticality of a configuration bit c :

In-place Boolean Matching Conjunctive Normal Form (CNF) Truth table can be encoded as follows Truth table can be encoded as follows The cone can be encoded as follows The cone can be encoded as follows To make a pair of configuration bits (ci, cj) To make a pair of configuration bits (ci, cj) in LUT L symmetric, we have in LUT L symmetric, we have Combining all the three, we have CNF Combining all the three, we have CNF formulation for in-place Boolean matching (IP-BM). formulation for in-place Boolean matching (IP-BM). IP-BM preserves both the logic function and topology of the cone. IP-BM preserves both the logic function and topology of the cone.

IPR algorithm Initial Full-chip Functional Simulation Initial Full-chip ODC Mask Calculation Node Criticality Analysis Cone Construction In-place LUT Reconfiguration and Boolean Matching Localize Truth Table Update Localize ODC Mask Update Circuit Analysis Localize Update

Localized Update Localized update of ODC mask reduces runtime Localized update of ODC mask reduces runtime Reconfigured ConeC R Maximum Fanin Cone C MFI Maximum Fanout Cone C MFO C MFI is affected, but the ODC mask is not updated to save time. ODC mask updated for C R. C MFO is not affected, so the ODC mask does not need to be updated.

Defects are created equally but not propagated equally Defects are created equally but not propagated equally Logic don’t-cares may mask the propagation of defects Logic don’t-cares may mask the propagation of defects Key to stochastic synthesis: Logic Masking defect 11 Not affected by defects! Observability Don’t-cares with a=1&b=1 We can maximize don’t-cares while keeps the logic function. We can maximize don’t-cares while keeps the logic function.

IPR Enhancement Iterative (i.e., random) algorithm without greedy procedure based on criticality Iterative (i.e., random) algorithm without greedy procedure based on criticality Provide different ordering for optimization of gates Provide different ordering for optimization of gates Without periodic yield rate evaluation Without periodic yield rate evaluation With periodic yield rate evaluation With periodic yield rate evaluation Large cut size Large cut size Increase the opportunity to find the feasible cone. Increase the opportunity to find the feasible cone.

IPR Enhancement (Cont’) Extend to MIMO Extend to MIMO MISOMIMO MISOMIMO Increase the opportunity to try more LUTs Increase the opportunity to try more LUTs