Online and Operand-Aware Detection of Failures Utilizing False Alarm Vectors Amir Yazdanbakhsh, David Palframan, Azadeh Davoodi, Mikko Lipasti, Nam Sung.

Slides:



Advertisements
Similar presentations
Survey of Detection, Diagnosis, and Fault Tolerance Methods in FPGAs
Advertisements

Digital Integrated Circuits© Prentice Hall 1995 Design Methodologies Design for Test.
University of Michigan Electrical Engineering and Computer Science University of Michigan Electrical Engineering and Computer Science University of Michigan.
10/14/2005Caltech1 Reliable State Machines Dr. Gary R Burke California Institute of Technology Jet Propulsion Laboratory.
1 KU College of Engineering Elec 204: Digital Systems Design Lecture 9 Programmable Configurations Read Only Memory (ROM) – –a fixed array of AND gates.
NATW 2008 Using Implications for Online Error Detection Nuno Alves, Jennifer Dworak, R. Iris Bahar Division of Engineering Brown University Providence,
Self-Checking Carry-Select Adder Design Based on Two-Rail Encoding
1/28 ECE th May 2014 H ardware Implementation of Self-checking circuits on FPGA Project Team #1 Chandru Loganathan Sakshi Gupta Vignesh Chandrasekaran.
Mitigating the Performance Degradation due to Faults in Non-Architectural Structures Constantinos Kourouyiannis Veerle Desmet Nikolas Ladas Yiannakis Sazeides.
Copyright 2001, Agrawal & BushnellVLSI Test: Lecture 71 Lecture 7 Fault Simulation n Problem and motivation n Fault simulation algorithms n Serial n Parallel.
Aiman El-Maleh, Ali Alsuwaiyan King Fahd University of Petroleum & Minerals, Dept. of Computer Eng., Saudi Arabia Aiman El-Maleh, Ali Alsuwaiyan King Fahd.
A Diagnostic Test Generation System Yu Zhang Vishwani D. Agrawal Auburn University, Auburn, Alabama USA Nov. 3rdITC
Copyright 2001, Agrawal & BushnellDay-1 PM Lecture 4b1 Design for Testability Theory and Practice Lecture 4b: Fault Simulation n Problem and motivation.
Dec. 19, 2005ATS05: Agrawal and Doshi1 Concurrent Test Generation Auburn University, Department of Electrical and Computer Engineering Auburn, AL 36849,
Concurrent Test Generation Auburn University, Department of Electrical and Computer Engineering Auburn, AL 36849, USA Vishwani D. Agrawal Alok S. Doshi.
Spring 07, Feb 8 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Logic Equivalence Vishwani D. Agrawal James J.
Efficient Test Compaction for Combinational Circuits Based on Fault Detection Count- Directed Clustering Aiman El-Maleh and Saqib Khurshid King Fahd University.
Dec. 29, 2005Texas Instruments (India)1 Concurrent Test Generation Auburn University, Department of Electrical and Computer Engineering Auburn, AL 36849,
1 Introduction VLSI Testing. 2 Overview First digital products (mid 1940's) Complexity:low MTTF:hours Cost:high Present day products (mid 1980's) Complexity:high.
Spring 08, Mar 27 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2008 Fault Simulation Vishwani D. Agrawal James J.
1 EECS Components and Design Techniques for Digital Systems Lec 21 – RTL Design Optimization 11/16/2004 David Culler Electrical Engineering and Computer.
An Efficient IP Lookup Architecture with Fast Update Using Single-Match TCAMs Author: Jinsoo Kim, Junghwan Kim Publisher: WWIC 2008 Presenter: Chen-Yu.
Benefits of Early Cache Miss Determination Memik G., Reinman G., Mangione-Smith, W.H. Proceedings of High Performance Computer Architecture Pages: 307.
A Probabilistic Method to Determine the Minimum Leakage Vector for Combinational Designs Kanupriya Gulati Nikhil Jayakumar Sunil P. Khatri Department of.
Oct. 5, 2001Agrawal, Kim and Saluja1 Partial Scan Design With Guaranteed Combinational ATPG Vishwani D. Agrawal Agere Systems Processor Architectures and.
Jan. 11, '02Kim, et al., VLSI Design'021 Mutiple Faults: Modeling, Simulation and Test Yong C. Kim University of Wisconsin, Dept. of ECE, Madison, WI 53706,
ELEN 468 Lecture 231 ELEN 468 Advanced Logic Design Lecture 23 Testing.
黃錫瑜 Shi-Yu Huang National Tsing-Hua University, Taiwan Speeding Up Byzantine Fault Diagnosis Using Symbolic Simulation.
Software-Based Online Detection of Hardware Defects: Mechanisms, Architectural Support, and Evaluation Kypros Constantinides University of Michigan Onur.
Sub-Nyquist Sampling DSP & SCD Modules Presented by: Omer Kiselov, Daniel Primor Supervised by: Ina Rivkin, Moshe Mishali Winter 2010High Speed Digital.
Technical University Tallinn, ESTONIA Overview: Fault Simulation Overview about methods Low (gate) level methods Parallel fault simulation Deductive fault.
Secure Embedded Processing through Hardware-assisted Run-time Monitoring Zubin Kumar.
ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTES Fault Modeling.
Technical University Tallinn, ESTONIA 1 Boolean derivatives Calculation of the Boolean derivative: Given:
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.
An Iterative Heuristic for State Justification in Sequential Automatic Test Pattern Generation Aiman H. El-MalehSadiq M. SaitSyed Z. Shazli Department.
SafetyNet: improving the availability of shared memory multiprocessors with global checkpoint/recovery Daniel J. Sorin, Milo M. K. Martin, Mark D. Hill,
THE TESTING APPROACH FOR FPGA LOGIC CELLS E. Bareiša, V. Jusas, K. Motiejūnas, R. Šeinauskas Kaunas University of Technology LITHUANIA EWDTW'04.
A Robust Pulse-triggered Flip-Flop and Enhanced Scan Cell Design
Reconfigurable Computing Using Content Addressable Memory (CAM) for Improved Performance and Resource Usage Group Members: Anderson Raid Marie Beltrao.
European Test Symposium, May 28, 2008 Nuno Alves, Jennifer Dworak, and R. Iris Bahar Division of Engineering Brown University Providence, RI Kundan.
Reducing Test Application Time Through Test Data Mutation Encoding Sherief Reda and Alex Orailoglu Computer Science Engineering Dept. University of California,
LA-LRU: A Latency-Aware Replacement Policy for Variation Tolerant Caches Aarul Jain, Cambridge Silicon Radio, Phoenix Aviral Shrivastava, Arizona State.
1 Compacting Test Vector Sets via Strategic Use of Implications Kundan Nepal Electrical Engineering Bucknell University Lewisburg, PA Nuno Alves, Jennifer.
By: Daniel BarskyNatalie Pistunovich Supervisors: Rolf HilgendorfInna Rivkin 10/06/2010.
2-1 Introduction Gate Logic: Two-Level Simplification Design Example: Two Bit Comparator Block Diagram and Truth Table A 4-Variable K-map for each of the.
CS/EE 3700 : Fundamentals of Digital System Design Chris J. Myers Lecture 4: Logic Optimization Chapter 4.
Adaptive Multi-Threading for Dynamic Workloads in Embedded Multiprocessors 林鼎原 Department of Electrical Engineering National Cheng Kung University Tainan,
CAS 721 Course Project Minimum Weighted Clique Cover of Test Set By Wei He ( )
Section 1  Quickly identify faulty components  Design new, efficient testing methodologies to offset the complexity of FPGA testing as compared to.
11 Online Computing and Predicting Architectural Vulnerability Factor of Microprocessor Structures Songjun Pan Yu Hu Xiaowei Li {pansongjun, huyu,
Copyright © 2010 Houman Homayoun Houman Homayoun National Science Foundation Computing Innovation Fellow Department of Computer Science University of California.
VLSI Test Symposium, 2011 Nuno Alves, Yiwen Shi, and R. Iris Bahar School of Engineering, Brown University, Providence, RI Jennifer Dworak Department of.
Online and Operand-Aware Detection of Failures Utilizing False Alarm Vectors Amir Yazdanbakhsh, David Palframan, Azadeh Davoodi, Nam Sung Kim, Mikko Lipasti.
Custom Computing Machines for the Set Covering Problem Paper Written By: Christian Plessl and Marco Platzner Swiss Federal Institute of Technology, 2002.
Deterministic Diagnostic Pattern Generation (DDPG) for Compound Defects Fei Wang 1,2, Yu Hu 1, Huawei Li 1, Xiaowei Li 1, Jing Ye 1,2 1 Key Laboratory.
1 IP Routing table compaction and sampling schemes to enhance TCAM cache performance Author: Ruirui Guo, Jose G. Delgado-Frias Publisher: Journal of Systems.
Copyright 2001, Agrawal & BushnellLecture 6:Fault Simulation1 VLSI Testing Lecture 6: Fault Simulation Dr. Vishwani D. Agrawal James J. Danaher Professor.
Fault Models, Fault Simulation and Test Generation Vishwani D. Agrawal Department of ECE, Auburn University Auburn, AL 36849, USA
A Survey of Fault Tolerant Methodologies for FPGA’s Gökhan Kabukcu
Lecture 5: Design for Testability. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 12: Design for Testability2 Outline  Testing –Logic Verification –Silicon.
1 Architecture of Datapath- oriented Coarse-grain Logic and Routing for FPGAs Andy Ye, Jonathan Rose, David Lewis Department of Electrical and Computer.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTES
VLSI Testing Lecture 6: Fault Simulation
ELEC Digital Logic Circuits Fall 2014 Logic Testing (Chapter 12)
FPGA Glitch Power Analysis and Reduction
Aiman H. El-Maleh Sadiq M. Sait Syed Z. Shazli
Automatic Test Pattern Generation
Presentation transcript:

Online and Operand-Aware Detection of Failures Utilizing False Alarm Vectors Amir Yazdanbakhsh, David Palframan, Azadeh Davoodi, Mikko Lipasti, Nam Sung Kim Department of Electrical and Computer Engineering

2 Introduction Technology scaling beyond 32nm degrades the manufacturing yield o Can be addressed by imposing restrictive design rules or just using regular fabrics [T. Jhaveri, SPIE’06] o Can be addressed by using configurable logic blocks to make post-silicon corrections [Y. Ran, TVLSI’06] o Redundancy based techniques can also be used o Exploiting existing redundancy in high performance processors [P. Shivakumar, ICCD’12][S. Shyam, ASPLOS’06][J. Srinivasan, ISCA’05] o Incorporate redundancy at the granularity of a bit slice [K. Namba, PRDC’05]

3 Motivation C7C7 C4C Defective prefix node impacts C 4 and C …11010…0110 Minimized vectors Recover Checker The checker detects a match with the faulty vectors and a small number of False Alarm vectors at runtime.

4 Contributions Checker Unit ModuleFalse Alarm Vectors Flexible option for online and operand-level fault detection Update faulty vectors over the time TCAM-based implementation which can store cubes with don’t care No extra logic on the critical paths Efficient use of false alarm vectors to reduce the number of vectors to be checked, thus reducing the TCAM area Integrate the false alarm insertion into ESPRESSO 2-level logic minimization tool The recovery flag is not falsely activated too frequently

5 Checker Unit: Comparison with A Redundancy-based Alternative Checker UnitRedundancy-based Recover 0xxx11x0 TCAM Operand Checker Does not affect the critical path Flexible checker unit (can update faulty vectors) Online and operand-aware detection of failures  Affects the critical path (large muxes)  Fixed design approach (can not be updated)  Two out of three adders should always be fault-free

6 Overview of TCAM TCAM can store test cubes which have don’t care bits Conventional TCAM needs to support random access to a specific entry to update the key value at runtime –Requires a log N-to-N decoder for a TCAM with N entries –The checker unit does not need such a decoder Each entry must be updated only once, every time the chip is turned on Supporting a sequential access to write the test cubes to the TCAM is sufficient In our framework the size of TCAM can get impractically large if all the faulty are individually stored –We propose to a few false alarm vectors to reduce number of entries in the TCAM and therefore reduce the TCAM size

7 False Alarm Insertion to Minimize the TCAM Size: Example A B C D E ABCDE x10xx 0110x x00x1 x0101 V1V1 V2V2 V3V3 V4V4 V5V5 V6V6 Identify cubes which excite fault

8 ABCDE x10xx 0110x x00x1 x0101 V1V1 V2V2 V3V3 V4V4 V5V5 V6V6 ABCDE xx0x1 x10xx xxx01 x1x0x V1V1 V2V2 V3V3 V4V4 Test cube minimization False Alarm Insertion to Minimize the TCAM Size: Example

9 ABCDE xx0x1 x10xx xxx0x V1V1 V2V2 V3V3 ABCDE xx0x1 x10xx xxx01 x1x0x V1V1 V2V2 V3V3 V4V4 We reduce the number of test cubes from 6 to 3 Identify cubes which excite fault Test cube minimization Further minimization with False Alarm Insertion False Alarm Insertion to Minimize the TCAM Size: Example

10 False Alarm Insertion Problem Definition Reduce the number of cubes beneath the given threshold by adding as few false alarm vectors as possible Why we need False Alarms? Due to area budget, number of entries in TCAM is limited The number of test cubes translates to the number of entries in the TCAM

11 Using Two-Level Logic Minimization Two-level logic minimization can be used to minimize the number of test cubes We expand the ESPRESSO* tool by inserting false alarm vectors to achieve higher minimization *ESPRESSO.

12 False Alarm Insertion by Extending ESPRESSO F = IRREDUNDANT (F ON, F DC ) F = REDUCE (F ON, F DC ) F = EXPAND (F ON, F OFF ) F = IRREDUNDANT (F ON, F DC ) F = REDUCE (F ON, F DC ) F = EXPAND (F ON, F OFF ) Stop Minimization? Test cubes Minimized test cubes Overview of the main loop of ESPRESSO F = EXPAND-FA (F ON, F OFF ) F = IRREDUNDANT (F ON, F DC ) F = REDUCE (F ON, F DC ) F = EXPAND (F ON, F OFF ) F = EXPAND-FA (F ON, F OFF ) F = IRREDUNDANT (F ON, F DC ) F = REDUCE (F ON, F DC ) F = EXPAND (F ON, F OFF ) # vectors < threshold Minimized cubes Minimized cubes with false alarm Extension with False Alarm insertion

13 False Alarm Insertion Example EXPAND-FA IRREDUNDANT REDUCE EXPAND

14 False Alarm Insertion Procedure Each call EXPAND-FA function expands multiple test cubes –How I sequentially go through the on-set? –Look at the paper –Which cube is selected to be expanded? –Same section – stopping criteria (when you reach the target number of cubes)

15 False Alarm Insertion for One Cube A0A1 A2 A3 000x ON 1 xx1x x1xx 1xx1 OFF 1 OFF 2 OFF 3 Offset Matrix False Alarm Matrix B1B1 B2B2 B3B OFF 1 OFF 2 OFF 3 ON 1 ON 2 ON’ 2 False Alarm Matrix (i, j) –Entry (i, j) indicates false alarms between the off-set cube i and (the expanded) cube when literal j is dropped A0A1A0A1 A2A3A2A3

16 Simulation Configuration Single-failure scenarios in various nodes of 32-bit Brent-Kung adder (prefix adder) Generate all the test vectors for two failing cases modeled by a stuck-at-0 and stuck-at-1 using ATALANTA* ATPG toolset Using SPEC2006 suite for workload-dependent case Record the input arguments to the adder by running each benchmark on an X86 simulator Analyzing area overhead in 2-issue and 4-issue microprocessors *H.K. Lee and D.S. Ha. Atalanta: an efficient ATPG for combinational circuits. Technical Report; Department of Electrical Engineering, Virginia Polytechnic Institute and State University, pages 93 12, 1993.

17 Comparison of Probability of Detection Probability of detection: percentage of times that the checker unit activates the recovery signal (could be false alarm or true positive) Average PoD degrades with decrease in the number of test cubes Average PoD after inserting false alarms does not degrade significantly in FA-128 or FA-64 or FA-32 compared to W/O FA This behavior is true for both workload-dependent and random cases

18 Comparison of False Alarm Insertion Algorithms FA-Ag Algorithm –At each iteration, all the cubes are expanded using the expand-FA procedure. Each entry indicates the fraction of false alarms from the total number of detection ( ) –FA denotes the number fo false alarm minterms and TP the number of true positive when a fault is truly happening. On-average FA-Ag results in more overhead with increase in the number of test cubes compared to FA

19 Area Overhead Implemented approaches –Baseline k+1 (for k=2, 4) K-issue processor with 1 redundant component –K+TCAM K-issue processor with checker implemented as TCAM –K+FPGA K-issue processor with checker implemented as FPGA 2+TCAM has better area than 2+1 for 32 and 48 cubes 2+FPGA always has more area than baseline Similar behavior for 4+TCAM and 4+FPGA

20 Conclusion A new framework for online detection of failures at operand level of granularity Design a flexible TCAM-based checker unit Propose a false alarm insertion algorithm to reduce the number of vectors below the given threshold Incorporate the false alarm insertion algorithm into ESPRESSO 2-level logic minimization tool Future works: Use checker unit for other existing modules inside the processor Utilizing the online and operand-aware detection for other type of faults such as delay path fault

21 Questions?