ELE 523E COMPUTATIONAL NANOELECTRONICS

Slides:



Advertisements
Similar presentations
NanoFabric Chang Seok Bae. nanoFabric nanoFabric : an array of connect nanoBlocks nanoBlock : logic block that can be progammed to implement Boolean function.
Advertisements

Computer Engineering II
Fault-Tolerant Techniques and Nanoelectronic Devices Andy Hill CH E
ELE 523E COMPUTATIONAL NANOELECTRONICS W7-W8: Probabilistic Computing, 20/10/ /10/2014 FALL 2014 Mustafa Altun Electronics & Communication Engineering.
Self-Checking Carry-Select Adder Design Based on Two-Rail Encoding
Weikang Qian Ph.D. Candidate Electrical & Computer Engineering
Slide 1/20 Fault Tolerant Approaches to Nanoelectronic Programmable Logic Arrays Authors: Wenjing Rao, Alex Orailoglu, Ramesh Karri Conference: DSN 2007.
Quantum Error Correction SOURCES: Michele Mosca Daniel Gottesman Richard Spillman Andrew Landahl.
A Probabilistic Approach to Nano- computing J. Chen, J. Mundy, Y. Bai, S.-M. C. Chan, P. Petrica and R. I. Bahar Division of Engineering Brown University.
Array-Based Architecture for FET-Based, Nanoscale Electronics André DeHon 2003 Presented By Mahmoud Ben Naser.
ELE 523E COMPUTATIONAL NANOELECTRONICS
ELE 523E COMPUTATIONAL NANOELECTRONICS W1: Introduction, 8/9/2014 FALL 2014 Mustafa Altun Electronics & Communication Engineering Istanbul Technical University.
1 Fault-Tolerant Computing Systems #2 Hardware Fault Tolerance Pattara Leelaprute Computer Engineering Department Kasetsart University
1 Stochastic Logic Beyond CMOS... Prof. Mingjie Lin.
Nanoscale Digital Computation Through Percolation Mustafa Altun Electrical and Computer Engineering DAC, “Wild and Crazy Ideas” Session ─ San Francisco,
5/31/07IWLS Computing Beyond CMOS Intense research into novel materials and devices: Carbon Nanotubes… Molecular Switches… Biological Processes…
Defect Tolerance in Diode, FET, and Four- Terminal Switch based Nano-Crossbar Arrays IEEE/ACM International Symposium on Nanoscale Architectures, NANOARCH.
Fault-Tolerant Systems Design Part 1.
EE5393, Circuits, Computation, and Biology Computing with Probabilities 1,1,0,0,0,0,1,0 1,1,0,1,0,1,1,1 1,1,0,0,1,0,1,0 a = 6/8 c = 3/8 b = 4/8.
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 8: February 4, 2004 Fault Detection.
Computing with Defects
Marc Riedel – EE5393 The Synthesis of Robust Polynomial Arithmetic with Stochastic Logic Electrical & Computer Engineering University of Minnesota.
ELE 523E COMPUTATIONAL NANOELECTRONICS W8-W9: Probabilistic Computing, 2/11/ /11/2015 FALL 2015 Mustafa Altun Electronics & Communication Engineering.
ELE 523E COMPUTATIONAL NANOELECTRONICS W10: Defects and Reliability, 16/11/2015 FALL 2015 Mustafa Altun Electronics & Communication Engineering Istanbul.
Marc Riedel Associate Professor, Electrical and Computer Engineering University of Minnesota ITA – Feb. 14, 2014 (“Singles’ Awareness Day”) Probability.
MAPLD 2005/213Kakarla & Katkoori Partial Evaluation Based Redundancy for SEU Mitigation in Combinational Circuits MAPLD 2005 Sujana Kakarla Srinivas Katkoori.
Memory and Programmable Logic
Self-Checking Circuits
ELE 523E COMPUTATIONAL NANOELECTRONICS
ELE 523E COMPUTATIONAL NANOELECTRONICS
Muhammed Ceylan Morgul, Furkan Peker and Mustafa Altun
ELE 523E COMPUTATIONAL NANOELECTRONICS
VLSI Testing Lecture 5: Logic Simulation
VLSI Testing Lecture 5: Logic Simulation
Instructor:Po-Yu Kuo 教師:郭柏佑
NanoMadeo Introduction of Error Correcting Schemes in the Design Process of Self-Healing Circuits for Nanoscale Fabrics.
Vishwani D. Agrawal Department of ECE, Auburn University
Logic and Computer Design Fundamentals
Fault Tolerance In Operating System
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Presentation Title Greg Snider QSR, Hewlett-Packard Laboratories
Biological Processes…
Basics Combinational Circuits Sequential Circuits
Basics Combinational Circuits Sequential Circuits Ahmad Jawdat
Chapter 4 Gates and Circuits.
CS105 Introduction to Computer Concepts GATES and CIRCUITS
Introduction to Verilog
Sequential circuits and Digital System Reliability
Lecture 5 Fault Modeling
ELE 523E COMPUTATIONAL NANOELECTRONICS
5-8 December, Batumi - Georgia
Software testing.
Instructor:Po-Yu Kuo 教師:郭柏佑
Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan1,2 Dr.Adrian Stoica3.
ELE 523E COMPUTATIONAL NANOELECTRONICS
Presentation Title Greg Snider QSR, HP Laboratories
Transmission Errors Error Detection and Correction
Chapter 10.3 and 10.4: Combinatorial Circuits
Lattice-Based Computation of Boolean Functions
Topics Switch networks. Combinational testing..
Hardware Assisted Fault Tolerance Using Reconfigurable Logic
Parity Generator and Checkers
Digital System Design Combinational Logic
A New Hybrid FPGA with Nanoscale Clusters and CMOS Routing Reza M. P
ELE 523E COMPUTATIONAL NANOELECTRONICS
FAULT-TOLERANT TECHNIQUES FOR NANOCOMPUTERS
FIGURE 5-1 MOS Transistor, Symbols, and Switch Models
ELE 523E COMPUTATIONAL NANOELECTRONICS
Fault Mitigation of Switching Lattices under the Stuck-At Model
Presentation transcript:

ELE 523E COMPUTATIONAL NANOELECTRONICS Mustafa Altun Electronics & Communication Engineering Istanbul Technical University Web: http://www.ecc.itu.edu.tr/ FALL 2018 WW11: Fault Tolerance, 26/11/2018

Outline Faults in Nano-crossbar arrays Fault Tolerance Stages Diode-based FET-based Four-terminal switch based Fault Tolerance Stages Fabrication Post-fabrication In-field Post-fabrication and Defects in Nano-crossbar arrays Reconfiguration of a circuit Mapping with defects Defect-aware Defect-unaware Analysis of in-field Transient Faults in Nano-crossbar arrays General Transient Fault Tolerance Techniques Multiplexing and stochastic computing Dual modular redundancy (DMR) and triple modular redundant (TMR) Parity bits and Hamming codes

Faults in Nano-Crossbar Arrays Ideally f = A B + C D With a fault f = A B + B C D With a fault f = A + C D How to tolerate faults? Each crosspoint is either closed (diode connected) or open. What if a crosspoint is closed when it is supposed to be open? What if a crosspoint is open when it is supposed to be closed?

Faults in Nano-Crossbar Arrays Ideally f = (A B + C D)ꞌ With a fault f = 0 How to tolerate faults? Each crosspoint is either closed (FET or shorted) or open. What if a crosspoint is closed when it is supposed to be open?

Faults in Nano-Crossbar Arrays Ideally f = x1 x2ꞌ x3+ x1 x4ꞌ + x2 x3 x4ꞌ + x2 x4 x5 + x3 x5 With a fault f = x1 x2ꞌ x3+ x1 x4ꞌ + x2 x3 x4ꞌ + x2 x4 x5 + x3 x5 1 With a fault f = x1 x2ꞌ x3+ x1 x4ꞌ + x2 x3 x4ꞌ + x2 x4 x5 + x3 x5 How to tolerate faults? Each crosspoint is either closed or open depending on the applied literal. What if a crosspoint is always closed when it is supposed to switch? What if a crosspoint is always open when it is supposed to switch?

Fault Tolerance Stages Stages: Fabrication Post-fabrication In-field/ Service Stakeholder: Chip Manufacturer Application Designer End User Mitigation Methods- Adding Redundancy: Error-correcting codes, TMR, NAND demultiplexing Mitigation Methods: Configuring around defects Mitigation Methods: Self-testing, reconfiguring Permanent Faults Permanent+ Transient Faults Design Nanomaterials: carbon nanotube, nanowires Fabrication, verification and test Final Product Test and verification

Post-fabrication and Defects Nano-array fabricated with bottom-up methods In post-fabrication, the ciruit is configured

Configuration of a circuit A logic function is implemented with configuration A full-adder with activating and deactivating the switches Activated Deactivated

Configuration of a circuit In a defect-free array, straitghtforward process Input Lines I1 I2 I3 I4 I5 I6 A B C A B C Mapping O1 O2 O3 O4 O5 O6 O7 A B B C A C Output Lines A B C Activated switch Deactivated switch F = A B + B C + A C + A B C F = A B + B C + A C + A B C (1) Given function (2) Realized function

Defects Stuck-at deactivated, switch cannot be activated Stuck-at activated, switch cannot be deactivated : Stuck-at deactivated switch : Stuck-at activated switch : Configurable switch : Defective switch

Mapping with Defects F F’ In a defective array, every mapping is not valid Input Lines A B C A B C I1 I2 I3 I4 I5 I6 Mapping A B O1 O2 O3 O4 O5 O6 O7 A B C A C Output Lines A B C Activated switch C Deactivated switch F’ = A B + A B C + A C + A B C + C F = A B + B C + A C + A B C (2) Realized function (1) Given function F F’

Defect-aware mapping F F’ Mapping is performed with employing defects Previous mapping A B C A B C Input Lines B A C A B C I1 I2 I3 I4 I5 I6 Mapping A B A B O1 O2 O3 O4 O5 O6 O7 B C A B C A C A C Output Lines A B C Activated switch A B C C Deactivated switch F = A B + B C + A C + A B C F = A B + B C + A C + A B C (2) Realized function (1) Given function F F’

Defect-unaware mapping First, a defect-free sub-array is found Input Lines I1 I2 I3 I4 I5 I6 I7 I1 I2 I3 I4 I5 I6 I7 O1 O2 O3 O4 O5 O6 O7 O1 O2 O3 O4 O5 O6 O7 Defect-free sub-aray Output Lines F = A B + B C + A C + A B C (1) Given function I7 and O5 discarded I7

Defect-unaware mapping Second, configuration is starightforward Input Lines A B C A B C I1 I2 I3 I4 I5 I6 I7 A B O1 O2 O3 O4 O5 O6 O7 B C Mapping A C Output Lines A B C F’ = A B + B C + A C + A B C F = A B + B C + A C + A B C (2) Realized function (1) Given function F F’

In-field Transient Faults Transient faults occur according to a time-domain They are predicted with probability analysis Diode and FET Components show different behaviour regarding to the fault type Stuck-at OFF: switch is not capable of conducting current, infinite resistance Stuck-at ON: switch is constantly conducting current, zero resistance Diode Stuck-at OFF only switch Stuck-at ON entire output line FET Stuck-at OFF entire output line Stuck-at ON only switch

Diode-based Nanoarray Stuck-at OFF, no connection between terminals Only faulty switch is affected Stuck-at ON, terminals always connected Entire line is affected Terminals Gnd Vdd : Stuck-at OFF switch : Stuck-at ON switch : Functional switch : Unusable switch

FET-based Nanoarray Stuck-at OFF, no connection between terminals Entire line is affected Stuck-at ON, terminals always connected Only faulty switch is affected : Stuck-at OFF switch : Stuck-at ON switch : Functional switch : Unusable switch

In-field Transient Faults OFF-to-ON transition fault: The switch is ON when it is supposed to be OFF; x1=0. ON-to-OFF transition fault: The switch is OFF when it is supposed to be ON; x1=1. Each switch of the lattice has independent fault rates.

In-field Transient Faults Ideally, if x1=0 then all the switches are OFF. Ideally, if x1=1 then all the switches are ON. We use redundancy in tolerating faults powered by percolation.

Broadbent & Hammersley (1957). Percolation Theory Rich mathematical topic that forms the basis of explanations of physical phenomena such as diffusion and phase changes in materials. Broadbent & Hammersley (1957).

Percolation Theory Sharp non-linearity in global connectivity as a function of random local connectivity.

Percolation Theory p2 versus p1 for 1×1, 2×2, 6×6, 24×24, 120×120, and infinite size lattices. Each square in the lattice is colored black with independent probability p1. p2 is the probability that a connected path exists between the top and bottom plates.

Margins correlate with the degree of fault tolerance. One-margin: Tolerable p1 ranges for which we interpret p2 as logical one. Zero-margin: Tolerable p1 ranges for which we interpret p2 as logical zero. Margins correlate with the degree of fault tolerance.

Implementing Boolean Functions signals in: xi’s signals out: connectivity top-to-bottom / left-to-right.

An Example with 16 Boolean Inputs A path exists between top and bottom, fL = 1

Margin Performance with a 2×2 Lattice fL=x1x3+x2x4 gL =x1x2+x3x4 Different assignments of input variables to the regions of the network affect the margins.

One-margins (always good) fL =0 fL =1 Fault probabilities exceeding the one-margin would likely cause an (1→0) error.

Good Zero-margins fL =1 fL =0 Fault probabilities exceeding zero-margin would likely cause an (0→1) error.

Poor Zero-margins fL =1 fL =0 Assignments that evaluate to 0 but have diagonally adjacent assignments of blocks of 1's result in poor zero-margins

Lattice Duality A necessary and sufficient condition for good error margins is that the Boolean functions fL and gL are dual functions.

Lattice Duality fL=x1x3+x2x4 gL =x1x2+x3x4 fL ≠ gLD

Transient Fault Tolerance Von Neumann’s multiplexing unit, 1956 Randomly shuffled N number of inputs and outputs Values are calculated as the number of 1 valued input/output lines over N Parallel operation Stochastic computing Values are calculated as the number of 1 valued input/output lines over N Serial operation

Multiplexing for Transition Faults Error probability ϵ : a gate evaluates the incorrect result, the complement of the correct Boolean value, with ϵ. Calculate z with and without error ϵ ⟶ ϵ(1-2z)

Multiplexing for Stuck-at 1 Faults Error/fault probability ϵ : each gate constantly evaluates logic 1 with ϵ. Calculate z with and without error ϵ ⟶ ϵ(1-z)

Multiplexing for Stuck-at 0 Faults Error/fault probability ϵ : each gate constantly evaluates logic 0 with ϵ. Calculate z with and without error ϵ ⟶ ϵ(z)

Transient Fault Tolerance Dual modular redundancy (DMR) Increase area 2 times plus an XOR gate For only a single output fault For only detection Triple modular redundancy (TMR) Increase area 3 times plus XOR gates For only a single output fault For both detection and correction

Transient Fault Tolerance Extra parity bit Applicable for large circuits For only odd number of output faults For only detection Satisfying Hamming distance Practical for large circuits For multiple output faults For both detection and correction

Suggested Readings DeHon, A. (2003). Array-based architecture for FET-based, nanoscale electronics. Nanotechnology, IEEE Transactions on, 2(1), 23-32. Han, J., & Jonker, P. (2003). A defect and fault-tolerant architecture for nanocomputers. Nanotechnology, 14(2), 224. Rao, W., Orailoglu, A., & Karri, R. (2007, April). Logic level fault tolerance approaches targeting nanoelectronics plas. In 2007 Design, Automation & Test in Europe Conference & Exhibition (pp. 1-5). IEEE. Altun, M., & Riedel, M. D. (2011). Robust Computation through Percolation: Synthesizing Logic with Percolation in Nanoscale Lattices. International Journal of Nanotechnology and Molecular Computation (IJNMC), 3(2), 12-30. Tunali, O., & Altun, M. (2016) Permanent and Transient Fault Tolerance for Reconfigurable Nano-Crossbar Arrays. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.