1. NATURE: Non-Volatile Nanotube RAM based Field-Programmable Gate Arrays Wei Zhang†, Niraj K. Jha† and Li Shang ‡ †Dept. of Electrical Engineering Princeton.

Slides:



Advertisements
Similar presentations
Survey of Detection, Diagnosis, and Fault Tolerance Methods in FPGAs
Advertisements

FPGA (Field Programmable Gate Array)
NanoFabric Chang Seok Bae. nanoFabric nanoFabric : an array of connect nanoBlocks nanoBlock : logic block that can be progammed to implement Boolean function.
Spartan II Features  Plentiful logic and memory resources –15K to 200K system gates (up to 5,292 logic cells) –Up to 57 Kb block RAM storage  Flexible.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
Lecture 26: Reconfigurable Computing May 11, 2004 ECE 669 Parallel Computer Architecture Reconfigurable Computing.
CMOL: Device, Circuits, and Architectures Konstantin K.Likharev and Dmitri B. Strukov Stony Brook University 697GG Nano Computering Fall 2005 Prepared.
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day10: October 25, 2000 Computing Elements 2: Cascades, ALUs,
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day8: October 18, 2000 Computing Elements 1: LUTs.
Nanotechnology: Spatial Computing Using Molecular Electronics Mihai Budiu joint work with Seth Copen Goldstein Dan Rosewater.
Silicon Programming--Altera Tools1 “Silicon Programming“ programmable logic Altera devices and the Altera tools major tasks in the silicon programming.
Evolution of implementation technologies
Build-In Self-Test of FPGA Interconnect Delay Faults Laboratory for Reliable Computing (LaRC) Electrical Engineering Department National Tsing Hua University.
Programmable logic and FPGA
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 11: February 14, 2007 Compute 1: LUTs.
Array-Based Architecture for FET-Based, Nanoscale Electronics André DeHon 2003 Presented By Mahmoud Ben Naser.
NanoMap: An Integrated Design Optimization Flow for a Hybrid Nanotube/CMOS Dynamically Reconfigurable Architecture Wei Zhang†, Li Shang‡ and Niraj K. Jha†
CS294-6 Reconfigurable Computing Day 2 August 27, 1998 FPGA Introduction.
FPGA Defect Tolerance: Impact of Granularity Anthony YuGuy Lemieux December 14, 2005.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 33: Array Subsystems (PLAs/FPGAs) Prof. Sherief Reda Division of Engineering,
CS294-6 Reconfigurable Computing Day 14 October 7/8, 1998 Computing with Lookup Tables.
Programmable Array Logic (PAL) Fixed OR array programmable AND array Fixed OR array programmable AND array Easy to program Easy to program Poor flexibility.
CS 151 Digital Systems Design Lecture 38 Programmable Logic.
Dynamic Power Consumption In Large FPGAs WILLIAM GARCIA, ANDREW MORTELLARO.
GPGPU platforms GP - General Purpose computation using GPU
EE 261 – Introduction to Logic Circuits Module #8 Page 1 EE 261 – Introduction to Logic Circuits Module #8 – Programmable Logic & Memory Topics A.Programmable.
February 12, 1998 Aman Sareen DPGA-Coupled Microprocessors Commodity IC’s for the Early 21st Century by Aman Sareen School of Electrical Engineering and.
Lecture 2: Field Programmable Gate Arrays September 13, 2004 ECE 697F Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays.
EE4OI4 Engineering Design Programmable Logic Technology.
Power Reduction for FPGA using Multiple Vdd/Vth
FPGA Switch Block Design Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
PROGRAMMABLE LOGIC DEVICES (PLD)
Amalgam: a Reconfigurable Processor for Future Fabrication Processes Nicholas P. Carter University of Illinois at Urbana-Champaign.
Modern VLSI Design 4e: Chapter 6 Copyright  2008 Wayne Wolf Topics Memories: –ROM; –SRAM; –DRAM; –Flash. Image sensors. FPGAs. PLAs.
J. Christiansen, CERN - EP/MIC
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR FPGA Fabric n Elements of an FPGA fabric –Logic element –Placement –Wiring –I/O.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Programmable Logic Devices
Reconfigurable Computing Using Content Addressable Memory (CAM) for Improved Performance and Resource Usage Group Members: Anderson Raid Marie Beltrao.
Field Programmable Gate Arrays (FPGAs) An Enabling Technology.
Basic Sequential Components CT101 – Computing Systems Organization.
Design Space Exploration for Application Specific FPGAs in System-on-a-Chip Designs Mark Hammerquist, Roman Lysecky Department of Electrical and Computer.
A Reconfigurable Low-power High-Performance Matrix Multiplier Architecture With Borrow Parallel Counters Counters : Rong Lin SUNY at Geneseo
EE3A1 Computer Hardware and Digital Design
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #4 – FPGA.
Topics Architecture of FPGA: Logic elements. Interconnect. Pins.
1 Leakage Power Analysis of a 90nm FPGA Authors: Tim Tuan (Xilinx), Bocheng Lai (UCLA) Presenter: Sang-Kyo Han (ECE, University of Maryland) Published.
1 Carnegie Mellon University Center for Silicon System Implementation An Architectural Exploration of Via Patterned Gate Arrays Chetan Patel, Anthony Cozzie,
M.Mohajjel. Why? TTM (Time-to-market) Prototyping Reconfigurable and Custom Computing 2Digital System Design.
ESS | FPGA for Dummies | | Maurizio Donna FPGA for Dummies Basic FPGA architecture.
FPGA-Based System Design: Chapter 1 Copyright  2004 Prentice Hall PTR Moore’s Law n Gordon Moore: co-founder of Intel. n Predicted that number of transistors.
1 Advanced Digital Design Reconfigurable Logic by A. Steininger and M. Delvai Vienna University of Technology.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
Logic and Computer Design Fundamentals, Fifth Edition Mano | Kime | Martin Copyright ©2016, 2008, 2004 by Pearson Education, Inc. All rights reserved.
Lecture 17: Dynamic Reconfiguration I November 10, 2004 ECE 697F Reconfigurable Computing Lecture 17 Dynamic Reconfiguration I Acknowledgement: Andre DeHon.
Delivered by.. Love Jain p08ec907. Design Styles  Full-custom  Cell-based  Gate array  Programmable logic Field programmable gate array (FPGA)
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 11: January 31, 2005 Compute 1: LUTs.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Programmable Logic Devices
Field Programmable Gate Arrays
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
Instructor: Dr. Phillip Jones
From Silicon to Microelectronics Yahya Lakys EE & CE 200 Fall 2014
Electronics for Physicists
Anne Pratoomtong ECE734, Spring2002
Electronics for Physicists
A New Hybrid FPGA with Nanoscale Clusters and CMOS Routing Reza M. P
Programmable logic and FPGA
Reconfigurable Computing (EN2911X, Fall07)
Presentation transcript:

1

NATURE: Non-Volatile Nanotube RAM based Field-Programmable Gate Arrays Wei Zhang†, Niraj K. Jha† and Li Shang ‡ †Dept. of Electrical Engineering Princeton University ‡ Dept. of Electrical and Computer Engineering Queen’s University

3 A Hybrid CMOS/NAnoTUbe REconfigurable Architecture Motivation Background on CNT and NRAM Architecture of NATURE Logic Folding Experimental Results Conclusions

4 Motivation Moore’s Law: What’s Next? Carbon nanotubes (CNTs) Nanowires Single electron devices... Challenges in nano-circuits/architectures Lack of a mature fabrication process Defects and run-time failures Reconfigurable architectures, such as an FPGA, favored Regular structures ease fabrication Fault tolerance through reconfiguration

5 Motivation (Contd.) Problems of existing reconfigurable architectures High reconfiguration time overhead Low area efficiency Some recent works on programmable nanofabrics Molecular logic array (Goldstein et al. [ICCAD 2002]) Molecular logic array (Goldstein et al. [ICCAD 2002]) Nanowire PLA (Dehon et al. [FPGA 2004]) Nanowire PLA (Dehon et al. [FPGA 2004]) CMOS/nanowire hybrid architecture CMOL (Strukov et al. [Nanotechnology 2005]) CMOS/nanowire hybrid architecture CMOL (Strukov et al. [Nanotechnology 2005]) Fabrication problem not yet solved

6 NATURE CMOS fabrication compatible CMOS fabrication compatible NRAM-based Run-time reconfiguration Run-time reconfiguration Temporal logic folding Temporal logic folding Design flexibility Design flexibility Logic density Logic density Advantages of NATURE Hybrid design leverages beneficial aspects of both CMOS and CNT technologies NRAMs are distributed in NATURE to store multi- context reconfiguration bits Fine-grain reconfiguration (even cycle-by-cycle) Enables temporal logic folding Flexibility to perform area-performance trade- offs One-to-two orders of magnitude increase in logic density

7 Background Carbon nanotube (CNT) Metallic or semiconducting Single-wall or multi-wall Diameter: 1-100nm Length: up to millimeters Ballistic transport Excellent thermal conductivity Very high current density High chemical stability Robust to environment Source: Euronanotrade

8 Background (Contd.) Non-volatile nanotube random-access memory (NRAM) Mechanically bent or not: determines bistable on/off states Fully CMOS-compatible manufacturing process Prototype chip: 10 Gbit NRAM Will be ready for the market in the near future Source: Nantero

9 NRAMs Properties of NRAMs Non-volatile Similar speed to SRAM Similar density to DRAM Chemically and mechanically stable NATURE not tied to NRAMs Phase change RAM Magnetoresistive RAM Ferroelectric RAM

10 Architecture of NATURE Island-style logic blocks (LBs) connected by various levels of interconnects An LB contains a super macroblock (SMB) and a local switch matrix

11 Architecture of a Super Macroblock (SMB) n 1 macroblocks (MBs) comprise an SMB, here n 1 = 4

12 Architecture of a Macroblock (MB) n 2 logic elements (LEs) comprise an MB, here n 2 = 4

13 Logic Element and Interconnect An LE implements a computation and contains: An m-input look-up table (LUT) A flip-flop A pass transistor Interconnect Mixed wire segment scheme 25%, 50% and 25% distribution for length-1, length-4 and long wires Direct links from one LB to its 4 neighbors

14 Support for Reconfiguration Reconfiguration time short: 160ps Area overhead of NRAMs k: no. of reconfiguration sets per NRAM, assume k = 16 Area overhead: 20.5% per LB, assuming 100nm technology for CMOS logic and nanotube length Logic density = k (conf. copies) x area per configuration = 16*( )=12.75 Appropriate value for k obtained through design space exploration

15 Temporal Logic Folding Basic idea: one can use NRAM-enabled run-time reconfiguration to realize different Boolean functions in the same logic element (LE) every few cycles

16 Example Without logic folding Num of LEs = 6 Delay = 4 LE delays +Interconnect delay Num of LEs = 2 Delay =4*clock_period With logic folding Clock period =LE delay +Reconfiguration +Interconnect delay

17 Folding Levels Logic folding can be performed at different levels of granularity, providing flexibility to perform area-performance trade-offs A level-p folding implies reconfiguration of the LE after the execution of p LUT computations (a) level-1 folding (b) level-2 folding

18 Choosing the Folding Level Advantages of logic folding Significant flexibility for performing area-performance trade-offs Ability to map much larger circuits using the same number of LEs Significant improvement in the area/circuit delay product Reduction in the need for global routing Folding level Clock period increases: Routing delay increases Number of clock cycles decreases Reconfiguration time decreases Total delay typically decreases Number of LEs increases Area increases

19 Experimental Setup Instance of architecture: 4 MBs in an SMB, 4 LEs in an MB, and LEs contain a 4-input LUT Number of reconfiguration copies k varied in order to compare implementations corresponding to selected folding levels: level-1, level-2, level-4 and no logic folding Results based on 100nm CMOS technology parameters

20 Experimental Results Average area-time product advantage = 2X Maximum area-time product advantage = 3X

21 16-RCA: 16-bit ripple carry adder 16-CLA: 16-bit carry lookahead adder 16-CSA: 16-bit carry select adder 8-MUL: 8-bit multiplier Experimental Results (Contd.) Average area-time product advantage = 13X Maximum area-time product advantage = 35X

22 Experimental Results (Contd.) Flexibility in performing area-performance trade-off For area-time (AT) product, larger the circuit depth, more the advantages of level-1 folding relative to no folding For the 64-bit ripple-carry adder, this advantage is about 35X LE utilization and logic density very high, with a reduced need for a deep interconnect hierarchy

23 Conclusions NATURE: A novel high-performance run-time reconfigurable architecture Introduction of NRAMs into the architecture enables cycle-by-cycle reconfiguration and logic folding Choice of different folding levels allows the flexibility of performing area-performance trade-offs Logic density and area-time product improved significantly Can be very useful for cost-conscious embedded systems and future FPGA improvement