Nanotechnology: Spatial Computing Using Molecular Electronics Mihai Budiu joint work with Seth Copen Goldstein Dan Rosewater.

Slides:



Advertisements
Similar presentations
Spatial Computation Thesis committee: Seth Goldstein Peter Lee Todd Mowry Babak Falsafi Nevin Heintze Ph.D. Thesis defense, December 8, 2003 SCS Mihai.
Advertisements

Fast Compilation for Reconfigurable Hardware Mihai Budiu and Seth Copen Goldstein Carnegie Mellon University Computer Science Department Joint work with.
Mihai Budiu Microsoft Research – Silicon Valley joint work with Girish Venkataramani, Tiberiu Chelcea, Seth Copen Goldstein Carnegie Mellon University.
Introduction to Programmable Logic John Coughlan RAL Technology Department Electronics Division.
NanoFabric Chang Seok Bae. nanoFabric nanoFabric : an array of connect nanoBlocks nanoBlock : logic block that can be progammed to implement Boolean function.
Kameshwar K. Yadavalli, Alexei O. Orlov, Ravi K. Kummamuru, John Timler, Craig Lent, Gary Bernstein, and Gregory Snider Department of Electrical Engineering.
+ CS 325: CS Hardware and Software Organization and Architecture Internal Memory.
NATIONAL INSTITUTE OF SCIENCE & TECHNOLOGY Presented by: Susman Das Technical Seminar Presentation FPAA for Analog Circuit Design Presented by Susman.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
PipeRench: A Coprocessor for Streaming Multimedia Acceleration Seth Goldstein, Herman Schmit et al. Carnegie Mellon University.
CMOL vs NASICs T. Wang University of Massachusetts, Amherst September 29, 2005.
Introduction to Reconfigurable Computing CS61c sp06 Lecture (5/5/06) Hayden So.
Processor Technology and Architecture
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
Spring 08, Jan 15 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Spring 07, Jan 16 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Compiling Application-Specific Hardware Mihai Budiu Seth Copen Goldstein Carnegie Mellon University.
Build-In Self-Test of FPGA Interconnect Delay Faults Laboratory for Reliable Computing (LaRC) Electrical Engineering Department National Tsing Hua University.
Application-Specific Hardware Computing Without Processors Mihai Budiu October 6, 2001 SOCS-4.
Array-Based Architecture for FET-Based, Nanoscale Electronics André DeHon 2003 Presented By Mahmoud Ben Naser.
ECE 232 L2 Basics.1 Adapted from Patterson 97 ©UCBCopyright 1998 Morgan Kaufmann Publishers ECE 232 Hardware Organization and Design Lecture 2 Computer.
Prospects for Terabit-scale nano electronic memories Venkata R.Malladi Instructor : Dr.Damian.
SSS 4/9/99CMU Reconfigurable Computing1 The CMU Reconfigurable Computing Project April 9, 1999 Mihai Budiu
Computing Without Processors Thesis Proposal Mihai Budiu July 30, 2001 This presentation uses TeXPoint by George Necula Thesis Committee: Seth Goldstein,
Nanoscale Self-Assembly A Computational View Philip Kuekes Quantum Science Research HP Labs.
3.1Introduction to CPU Central processing unit etched on silicon chip called microprocessor Contain tens of millions of tiny transistors Key components:
ASH: A Substrate for Scalable Architectures Mihai Budiu Seth Copen Goldstein CALCM Seminar, March 19, 2002.
1. 2 Electronics Beyond Nano-scale CMOS Shekhar Borkar Intel Corp. July 27, 2006.
February 12, 1998 Aman Sareen DPGA-Coupled Microprocessors Commodity IC’s for the Early 21st Century by Aman Sareen School of Electrical Engineering and.
A Reconfigurable Processor Architecture and Software Development Environment for Embedded Systems Andrea Cappelli F. Campi, R.Guerrieri, A.Lodi, M.Toma,
A RISC ARCHITECTURE EXTENDED BY AN EFFICIENT TIGHTLY COUPLED RECONFIGURABLE UNIT Nikolaos Vassiliadis N. Kavvadias, G. Theodoridis, S. Nikolaidis Section.
System Arch 2008 (Fire Tom Wada) /10/9 Field Programmable Gate Array.
1. NATURE: Non-Volatile Nanotube RAM based Field-Programmable Gate Arrays Wei Zhang†, Niraj K. Jha† and Li Shang ‡ †Dept. of Electrical Engineering Princeton.
1 Embedded Systems Computer Architecture. Embedded Systems2 Memory Hierarchy Registers Cache RAM Disk L2 Cache Speed (faster) Cost (cheaper per-byte)
Amalgam: a Reconfigurable Processor for Future Fabrication Processes Nicholas P. Carter University of Illinois at Urbana-Champaign.
1 Moore’s Law in Microprocessors Pentium® proc P Year Transistors.
J. Christiansen, CERN - EP/MIC
Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,
Programmable Logic Devices
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
Field Programmable Gate Arrays (FPGAs) An Enabling Technology.
Basic Sequential Components CT101 – Computing Systems Organization.
Radix-2 2 Based Low Power Reconfigurable FFT Processor Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Gin-Der Wu and Yi-Ming Liu Department.
Datta1 Routing for Reliability in Molecular Diode-based Programmable Nanofabrics Kushal Datta, Arindam Mukherjee and Arun Ravindran Department of Electrical.
EE3A1 Computer Hardware and Digital Design
Computer Organization & Assembly Language © by DR. M. Amer.
QCAdesigner – CUDA HPPS project
1 Interconnect/Via. 2 Delay of Devices and Interconnect.
Northeastern U N I V E R S I T Y 1 Design and Test of Fault Tolerant Quantum Dot Cellular Automata Electrical and Computer Department.
Institut des Nanotechnologies de Lyon UMR CNRS 5270 ICECS 2010 – Athens, Greece Logic cells and interconnect strategies for nanoscale.
Advanced Computing and Information Systems laboratory Nanocomputing technologies José A. B. Fortes Dpt. of Electrical and Computer Eng. and Dpt. of Computer.
Integrated Microsystems Lab. EE372 VLSI SYSTEM DESIGNE. Yoon 1-1 Panorama of VLSI Design Fabrication (Chem, physics) Technology (EE) Systems (CS) Matel.
VADA Lab.SungKyunKwan Univ. 1 L5:Lower Power Architecture Design 성균관대학교 조 준 동 교수
Altera Technical Solutions Seminar Schedule OpeningIntroduction FLEX ® 10KE Devices APEX ™ 20K & Quartus ™ Overview Design Integration EDA Integration.
Hardware Trends CSE451 Andrew Whitaker. Motivation Hardware moves quickly OS code tends to stick around for a while “System building” extends way beyond.
Introduction to Field Programmable Gate Arrays (FPGAs) EDL Spring 2016 Johns Hopkins University Electrical and Computer Engineering March 2, 2016.
Sequential Logic Design
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
Architecture & Organization 1
From Silicon to Microelectronics Yahya Lakys EE & CE 200 Fall 2014
Introduction to Reconfigurable Computing
Field Programmable Gate Array
Field Programmable Gate Array
Field Programmable Gate Array
Architecture & Organization 1
Topics Antifuse-based FPGA fabrics: Flash-based FPGAs Actel.
Semiconductor Memories
A High Performance SoC: PkunityTM
Computer Evolution and Performance
A New Hybrid FPGA with Nanoscale Clusters and CMOS Routing Reza M. P
Presentation transcript:

Nanotechnology: Spatial Computing Using Molecular Electronics Mihai Budiu joint work with Seth Copen Goldstein Dan Rosewater

SSS April 20, Intersection of Three Areas Reconfigurable computing Nanotechnology Computer architecture

SSS April 20, Prophecies, A Risky Endeavor I think there is a world market for maybe five computers. --- T. J. Watson 640K ought to be enough for everybody. --- Bill Gates There is no reason anyone would want a computer in their home. --- Ken Olson I will propose this semester. --- Anonymous There is not the slightest indication that nuclear energy will ever be obtainable. --- Albert Einstein

SSS April 20, Moore’s Law

SSS April 20, Moore’s Second Law Plant costMask cost generation X 1000$

SSS April 20, Our Proposal Nanotechnology + cheap + high-density + low-power – unreliable Computer architecture + vast body of knowledge – expensive – high-power Reconfigurable Computing + defect tolerant + high performance – low density _ _ _ _

SSS April 20, Paradigm Shift Executable Configuration Complex fixed chip + Program Dense, regular structure + Configuration

SSS April 20, Outline Introduction Reconfigurable computing Nanotechnology Nano-architecture proposal Preliminary results Conclusions and Future Work

SSS April 20, Reconfigurable Computing Back to ENIAC-style computing Synthesize one machine to solve one problem

SSS April 20, Island-Style RC Architecture Universal gates and/or storage elements Interconnection network Programmable Switches

SSS April 20, Switch controlled by a 1-bit RAM cell Universal gate = RAM a0 a1 a0 a1 data a1 & a2 0 data in control Main RC Ingredient: RAM Cell

SSS April 20, Place and Route int reverse(int x) { int k,r=0; for (k=0; k<64; k++) r |= x&1; x = x >> 1; r = r << 1; } int func(int* a,int *b) { int j,sum=0; for (j=0; *a>0; j++) sum+=reverse(*b

SSS April 20, Kernel Speedup Using PipeRench ATR Cordic DCT DCT-2D FIR IDEA Nqueens Over PopCount Times Over 300Mhz UltraSparc-II

SSS April 20, Defect Tolerance Despite having >70% of the chips defective, Teramac works flawlessly. Compilation has two phases: defect detection through self-testing placement for defect-avoidance

SSS April 20, Outline Introduction Reconfigurable computing Nanotechnology Nano-architecture proposal Preliminary results Conclusions and Future work

SSS April 20, Nanotechnology

SSS April 20, Predicted Features Low Power: gates use less than 2 W (compare to 3x10 7 transistors using 100 W in CMOS) Low cost (nanocents/gate) Small size (10 5 factor area gain) Nano-RAM cell In yellow: a CMOS RAM cell.

SSS April 20, Nano-wires carbon nanotubues, Si, metal >2nm diameter, up to mm length excellent electrical properties A carbon nanotube: one molecule

SSS April 20, Nano-switch

SSS April 20, Nano-switch Between Nano-wires

SSS April 20, Self-assembly

SSS April 20, No Complex Irregular Structures

SSS April 20, No Three-Terminal Devices

SSS April 20, Diode-resistor Logic V DD Output Input 1 Input 2 A * B V AND B A A ^ B VVV AND A B A B A * B Nano-implementationElectrical equivalent

SSS April 20, Nanoscale Latches D clock dataout Provide: signal restoration (amplification) clocking (synchronization) memory

SSS April 20, High Defect Rate

SSS April 20, Outline Introduction Reconfigurable computing Nanotechnology Nano-architecture proposal Preliminary results Conclusions and future work

SSS April 20, The nanoBlock (3-in to 3-out Logic) +Vdd Gnd clk Inputs Outputs CMOS clk

SSS April 20, Interconnecting nanoBlocks Switch block

SSS April 20, Global View

SSS April 20, Control cluster long-lines Many Clusters = nanoFabric

SSS April 20, Compilation 1.Program 2.Split-phase Abstract Machines 3.Configurations placed independently 4.Placement on chip int reverse(int x) { int k,r=0; for (k=0; k > 1; r = r << 1; } } Computations & local storage Unknown latency ops.

SSS April 20, Outline Introduction Reconfigurable Hardware Nanotechnology Nano-architecture proposal Preliminary results Conclusions and Future work

SSS April 20, A graph of the whole program execution: A Limit Study of Performance Memory word Basic block Memory write Memory read Control-flow transfer

SSS April 20, Area (10 6 units/cm 2 available)

SSS April 20, Typical Program Graph (g721_e) Control flow transfer 100% memory cluster Memory reads 100% code cluster

SSS April 20, Typical Program Graph (g721_e) Control flow transfer memory Memory reads code memcpy

SSS April 20, Program Graph After Inlining memcpy memcpy

SSS April 20, Application Slowdown

SSS April 20, How Time Is Spent No caches: reads expensive No speculation

SSS April 20, Future Work Better nano-devices More accurate hardware models in simulations Compilation technology

SSS April 20, Conclusions Electronic nanotechnology promises to transcend the limitations of CMOS Nanofabrics are very well suited to reconfigurable computation gate designs can be managed through hierarchies of abstract machines