Institut des Nanotechnologies de Lyon UMR CNRS 5270 ICECS 2010 – Athens, Greece Logic cells and interconnect strategies for nanoscale.

Institut des Nanotechnologies de Lyon, UMR CNRS 5270
ICECS 2010 – Athens, Greece
Logic cells and interconnect strategies for nanoscale reconfigurable computing fabrics
I. O'Connor, K. Jabeur, D. Navarro, N. Yakymets (Lyon Institute of Nanotechnology, Lyon, France)
P.E. Gaillardon, M.H. Ben Jamaa, F. Clermidy (CEA-LETI-MINATEC, Grenoble, France)
nano.grain

Outline
Some technology fabric considerations
Logic cells
  – Reduced-complexity dynamic standard cells
  – Reconfigurable logic cells and design methods
Interconnect strategies
  – Matrix topologies
  – Island-style architecture
  – Metrics and comparisons
Conclusions

Explaining the jargon
Nanoscale computing fabric (nanoFabric):
  – nanoFabric: an array of connected nanoscale logic blocks (nanoBlocks)
  – nanoBlock: a circuit block containing programmable devices to compute Boolean logic functions, and means to route data
Usually a hybrid approach (silicon die, or CMOS compatible):
  – bottom-up structure: chemical self-assembly for dense and regular arrangement of elements
  – top-down structure: conventional process options for interconnect or for computation
  – and memory …

Double-gate ambipolarity
In DG-CNTFETs, the I_d-V_g characteristic demonstrates ambipolarity:
  – V_bg > 0 V: device behavior = n-type FET
  – V_bg < 0 V: device behavior = p-type FET
  – V_bg floating / 0 V: device is in the off state
Verilog-A model developed (IMS)
Reduced-complexity logic cells
Ultra fine-grain reconfigurable logic cells
Y.-M. Lin et al., IEEE Trans. Nanotechnology, 4(5), 2005
[Table: device state versus gate (G) and polarity gate (PG) voltages in {-V, +V}. States listed: on (n), off (n), on (p), off (p); for G = X and PG = 0 the state is off (n/p).]
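
The three polarity-gate cases above can be written as an idealised switch model. This is only a sketch: the function name `device_state`, the use of `None` for a floating polarity gate, and the 0 V switching threshold on the gate are assumptions made for illustration, not values taken from the slides or from the Verilog-A model.

```python
from typing import Optional


def device_state(v_pg: Optional[float], v_g: float) -> str:
    """Idealised state of an ambipolar double-gate device.

    v_pg: polarity-gate voltage (None models a floating gate).
    v_g:  gate voltage.
    Assumption: an n-type device conducts for v_g > 0 and a p-type
    device conducts for v_g < 0 (ideal switch, threshold at 0 V).
    """
    if v_pg is None or v_pg == 0:
        # Floating or grounded polarity gate: device is off.
        return "off (n/p)"
    if v_pg > 0:
        # Positive polarity gate selects n-type behaviour.
        return "on (n)" if v_g > 0 else "off (n)"
    # Negative polarity gate selects p-type behaviour.
    return "on (p)" if v_g < 0 else "off (p)"


if __name__ == "__main__":
    for v_pg in (+1.0, -1.0, 0.0, None):
        for v_g in (+1.0, -1.0):
            print(f"PG={v_pg}, G={v_g}: {device_state(v_pg, v_g)}")
```

A real device has a finite threshold and leakage, so a model like this is only useful for reasoning about logic function, not about timing or power.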

Hybrid technology
L. Ding et al., "Selective Growth of Well-Aligned Semiconducting Single-Walled Carbon Nanotubes", Nano Lett., 9(2), 800 (2009)
D. Akinwande et al., "Monolithic integration of CMOS VLSI and carbon nanotubes for hybrid nanotechnology applications", IEEE Trans. Nanotechnology, 7(5), 636 (2008)

Dynamic logic standard cells
Use the extra gate (PG) to reduce complexity: the function path includes the EV phase
Transistor count:
  – 2n (static logic)
  – n+2 (conventional DL)
  – n+1 (this work)
Clocking strategy:
  – Double clock (DCK)
  – Multiple clock (MCK)
  – Single clock (SCK)
Layout flipping: rich set of operators
[Figure: dynamic cell schematic with PC transistor, function path (inputs In_1, In_2, ..., In_n) and EV phase, output Out; example cell with inputs A and B, polarity inputs V_bA and V_bB, output Y, signals Pc and Ev, supplies V_dd and gnd.]
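
The transistor-count expressions above are easy to tabulate for a given number of inputs n. The helper name `transistor_counts` and the dictionary keys are hypothetical; the three formulas are the ones listed on the slide.

```python
def transistor_counts(n: int) -> dict:
    """Transistor count of an n-input cell for the three styles on the slide."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return {
        "static": 2 * n,            # static logic: 2n
        "conventional_dl": n + 2,   # conventional dynamic logic: n+2
        "this_work": n + 1,         # cell presented in this work: n+1
    }


if __name__ == "__main__":
    for n in (2, 3, 4):
        print(n, transistor_counts(n))
```

The saving relative to conventional dynamic logic is a single transistor per cell, so its relative weight is largest for cells with few inputs.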

Clocking strategies and cell variants
DCK:
  – PUN
  – EV {0,+V}, PC {0,+V}
  – Precharge: (PC=+V, EV=0)
  – Evaluation: (PC=0, EV=+V)
MCK:
  – PUN
  – EV+ {0,+V}, EV- {0,-V}, PC {0,+V}
  – mixed N- and P- function path: more complex functions
SCK:
  – PDN
  – Clk {0,+V}
  – Precharge: (Clk=0)
  – Evaluation: (Clk=+V)
  – complementary functions
[Figure: one schematic per variant, each with a function path on inputs In_1, In_2, ..., In_n and output Out; clock signals PC and EV (DCK), PC, EV+ and EV- (MCK), Clk (SCK).]
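
The precharge/evaluation sequence of the single-clock (SCK) variant can be traced with a small behavioural model. This is a sketch under stated assumptions, not the circuit from the slides: it assumes the output is precharged to logic 1 while Clk=0 and is discharged during Clk=+V only if the pull-down network conducts, which is the usual behaviour of a dynamic gate with a PDN. The names `sck_cycle` and `series_pdn` are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple


def series_pdn(inputs: Sequence[int]) -> bool:
    """Hypothetical pull-down network: devices in series, conducts if all inputs are 1."""
    return all(inputs)


def sck_cycle(
    inputs: Sequence[int],
    pdn_conducts: Callable[[Sequence[int]], bool],
) -> List[Tuple[int, int]]:
    """Return (clk, out) pairs for one precharge phase followed by one evaluation phase."""
    trace = []
    out = 1
    for clk in (0, 1):
        if clk == 0:
            out = 1          # precharge phase (Clk=0): output assumed pulled to logic 1
        elif pdn_conducts(inputs):
            out = 0          # evaluation phase (Clk=+V): PDN discharges the output
        trace.append((clk, out))
    return trace


if __name__ == "__main__":
    for a in (0, 1):
        for b in (0, 1):
            print((a, b), sck_cycle((a, b), series_pdn))
```

With the series network used here, the value left on the output after evaluation is the NAND of the inputs.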

Comparison (simulation)
Conditions: V_dd = 1 V, f_clk = 3 GHz, t_r = t_f = 20 ps, C_L = 150 aF
Average power: +(0-20)%
Worst-case delay: -(30-50)%
  – no EV transistor, lower branch resistance
Power-delay product (PDP): -(25-40)%
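
Since the power-delay product is the product of power and delay, the quoted power and delay ranges bound the possible PDP change. The sketch below evaluates the four corner combinations; the function name `pdp_change_bounds` is hypothetical, and treating the two ranges as independent is an assumption (the slide reports ranges over several cells, so the extreme corners need not occur for any single cell).

```python
from itertools import product
from typing import Tuple


def pdp_change_bounds(
    power_change: Tuple[float, float],
    delay_change: Tuple[float, float],
) -> Tuple[float, float]:
    """Min and max relative PDP change over all corner combinations.

    Changes are fractions, e.g. +20% is 0.20 and -50% is -0.50.
    PDP_new / PDP_old = (1 + power_change) * (1 + delay_change).
    """
    corners = [(1 + p) * (1 + d) - 1 for p, d in product(power_change, delay_change)]
    return min(corners), max(corners)


if __name__ == "__main__":
    low, high = pdp_change_bounds((0.0, 0.20), (-0.50, -0.30))
    print(f"PDP change between {low:+.0%} and {high:+.0%}")
```

The reported PDP improvement of 25% to 40% lies inside the interval these corners allow (16% to 50%), so the three figures on the slide are mutually consistent.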

Reconfigurable logic cell CNT-DR7T
  – Boolean data inputs A and B, data output Y {0,+V}
  – four-phase non-overlapping clock signals PC_1, PC_2, EV_1, EV_2 {0,+V}
  – ternary configuration inputs V_bgA, V_bgB, V_bgC {-V,0,+V}
Device parameters: diameter = 1.5 nm, I_on/I_off = 10^5 (the I_off value, in A, is missing from the transcript)
J. Liu, I. O'Connor, D. Navarro, F. Gaffiot, El. Lett., 43(9), April 2007
[Figure: cell schematic with inputs A and B, polarity inputs V_bA, V_bB and V_bC, internal node C, output Y, supply V_dd; timing diagram of PC_1, EV_1, PC_2, EV_2, with C = f(A,B,V_bA,V_bB) and Y = f(C,V_bC).]
[Table: output function Y versus configuration inputs V_bgA, V_bgB, V_bgC. Functions as extracted: A+B, A.B, A+B, A.B, A+B, A, A, B, B, 1, 0.]

Towards complete operator sets
1.5X-2X decrease in power consumption
More functions, fewer transistors, one extra configuration signal
DRLC-6T: 15 functions
DRLC-9T: 16 functions

Physical view: clusters of matrices

Directed matrix interconnect topologies
Topologies: Mod_Omega_4d4w, Baseline_4d4w, Flip_4d4w, Banyan_4d4w
[Figure: logic cell with data inputs A and B, logic function f, configuration inputs rc, and data output Y (x2).]
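
The slide names omega-type topologies without giving their wiring. As background, the textbook omega network connects consecutive stages with a perfect shuffle, in which line i goes to the line whose binary address is i rotated left by one bit. The sketch below shows that permutation only; it is an assumption that it resembles the Mod_Omega variant used in this work, and the function name `perfect_shuffle` and the width of 4 are illustrative choices.

```python
from typing import List


def perfect_shuffle(i: int, width: int) -> int:
    """Destination line of source line i under a perfect shuffle.

    width must be a power of two; the address of i is rotated left by one bit.
    """
    bits = width.bit_length() - 1
    if width < 2 or width != 1 << bits:
        raise ValueError("width must be a power of two, at least 2")
    if not 0 <= i < width:
        raise ValueError("line index out of range")
    msb = (i >> (bits - 1)) & 1                  # bit that wraps around
    return ((i << 1) & (width - 1)) | msb        # rotate left within 'bits' bits


def shuffle_stage(width: int) -> List[int]:
    """Destination of every line for one inter-stage connection."""
    return [perfect_shuffle(i, width) for i in range(width)]


if __name__ == "__main__":
    print(shuffle_stage(4))   # [0, 2, 1, 3]
    print(shuffle_stage(8))   # [0, 2, 4, 6, 1, 3, 5, 7]
```

Baseline, flip and banyan networks differ from the omega network in the permutation applied between stages, which is why they can map different sets of functions onto the same cells.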

Mapping success rate for matrices
Omega topology can achieve up to 25% more functions
[Plot: % exploitable cases for the 0-fault baseline, omega, flip and banyan topologies.]

Towards undirected topologies
Metrics (columns: Banyan, Systolic array, Cross-cap):
  – Max. I/O data width / side: entries as extracted "---+"
  – Intra-matrix connectivity: entries as extracted "--+"
  – Total wire length: entries as extracted "wa+2a(w-1)"
  – Max. primary I/O path length: entries as extracted "wa+2a(w-1)"
  – Av. mapping success rate (2x2): Banyan 61%, Systolic array 58%, Cross-cap 66%
[Figure: 2x2 matrices of cells f_11, f_12, f_21, f_22 for the cross-cap, banyan and systolic array topologies.]

When to move to island-style?
Application: 1-bit FA
Metrics (columns: Island-style, Cell matrix):
  – No. transistors involved in mapping, T: values missing from the transcript
  – % mapped matrices in a cluster: 75, 100
  – No. of switches added to connect matrices: entries as extracted "1680"
[Figure: island-style and cell-matrix arrangements.]

Wrap-up
Logic with ambipolar DG-CNTFETs:
  – reduced-complexity dynamic-logic standard cells with -(25-40)% PDP
  – complete operator set dynamic-logic reconfigurable cells with low transistor count and power consumption
Interconnect strategies:
  – directed matrix interconnect topology exploration
  – cross-cap topology proposed to relieve latency and data-directivity issues
  – matrices within islands allow efficient packing
  – routing between islands to be explored …

Ambipolar double-gate FET BDD (A-BDD) for reconfigurable logic cells
1. Define output functions
2. Combine their BDDs in a common A-BDD
3. Label every edge connecting two different nodes
4. Define implementation rules to implement the A-BDD at the circuit level
5. Pass-transistor logic circuit implementation
Edge types: different edge value (reconfigurable); shared edge value (non-reconfigurable)
PTL reconfigurable cell (RSL-12T):
  – 6X decrease in power consumption
  – Slightly improved time delay
  – Full functionality
  – More transistors
  – High number of config signals
[Figure: A-BDD of a 2-input reconfigurable cell, with inputs A and B, output F, terminal nodes 0 and 1, and labelled edges a to L.]

Methodology for mapping applications onto matrix-based nanocomputer architectures
Aims: mapping applications onto a selected architecture; obtaining diverse solutions with required area, power and delay; comparing different architectures
Inputs:
  – Architecture (logic cell, matrix, cluster): cell characteristics, matrix size, interconnection type, cluster architecture
  – Application: truth table
Notation: pi, number of primary inputs; po, number of primary outputs; n, number of inputs; m, number of outputs
Flow:
  – Test n ≤ pi, m ≤ po
  – yes: 1. Mapping the application onto a matrix using GA
  – no: partitioning (2.1. partitioning outputs; 2.2. partitioning inputs / Shannon's expansion)
  – Application is mapped? If no, repeat; if yes, produce the configuration for mapping matrices onto clusters
  – Mapping matrices onto clusters (e.g. VPR tool)
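
The input-partitioning step relies on Shannon's expansion, f = x'·f(x=0) + x·f(x=1), which splits a function into two cofactors with one fewer input each. The sketch below applies it recursively to a truth table until each piece fits a given input limit. The truth-table encoding (a list indexed by the input word, most significant variable first), the names `shannon_cofactors` and `partition_inputs`, and the reading of pi as the matrix input limit are assumptions for illustration; the GA mapping and the output partitioning of the methodology are not modelled.

```python
from typing import List, Tuple

TruthTable = List[int]
Piece = Tuple[Tuple[int, ...], TruthTable]


def shannon_cofactors(table: TruthTable) -> Tuple[TruthTable, TruthTable]:
    """Cofactors with respect to the most significant input (x=0, then x=1)."""
    half = len(table) // 2
    return table[:half], table[half:]


def partition_inputs(table: TruthTable, max_inputs: int, prefix: Tuple[int, ...] = ()) -> List[Piece]:
    """Split a single-output truth table until every piece has at most max_inputs inputs.

    Each piece is (values fixed for the leading inputs, truth table of the cofactor).
    """
    n = len(table).bit_length() - 1
    if len(table) != 1 << n:
        raise ValueError("truth table length must be a power of two")
    if n <= max_inputs:
        return [(prefix, table)]
    f0, f1 = shannon_cofactors(table)
    return (partition_inputs(f0, max_inputs, prefix + (0,))
            + partition_inputs(f1, max_inputs, prefix + (1,)))


if __name__ == "__main__":
    # Sum output of a 1-bit full adder, inputs ordered (a, b, cin).
    fa_sum = [0, 1, 1, 0, 1, 0, 0, 1]
    for fixed, piece in partition_inputs(fa_sum, max_inputs=2):
        print(fixed, piece)
```

Each resulting piece can then be mapped independently, with the fixed leading inputs selecting between the mapped pieces.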

Methodology evaluation
Normalized area, delay and track count
  – Flow_1: ABC → T-VPack → VPR
  – Flow_2: new methodology → VPR
Two versions of a 1-bit full adder:
  – Version 1: 12 cells
  – Version 2: 10 cells