IRDS Emerging Research Devices and Architectures NanoCrossbar Workshop

Slides:



Advertisements
Similar presentations
ITRS Design ITWG Design and System Drivers Worldwide Design ITWG Key messages: 1.- Software is now part of semiconductor technology roadmap 2.-
Advertisements

Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
From Essentials of Computer Architecture by Douglas E. Comer. ISBN © 2005 Pearson Education, Inc. All rights reserved.
CMOL: Device, Circuits, and Architectures Konstantin K.Likharev and Dmitri B. Strukov Stony Brook University 697GG Nano Computering Fall 2005 Prepared.
OCIN Workshop Wrapup Bill Dally. Thanks To Funding –NSF - Timothy Pinkston, Federica Darema, Mike Foster –UC Discovery Program Organization –Jane Klickman,
University of Kansas Research Interests David Andrews Rm. 324 Nichols
1 A survey on Reconfigurable Computing for Signal Processing Applications Anne Pratoomtong Spring2002.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
ERD and Memory Architectures Paul Franzon Department of Electrical and Computer Engineering
Work in Progress --- Not for Publication 1 Beyond CMOS – April 12, 2010 ERD – Emerging Research Architectures Paul Franzon,
Parallel Computing Basic Concepts Computational Models Synchronous vs. Asynchronous The Flynn Taxonomy Shared versus Distributed Memory Interconnection.
1 ERD Memory Review Paul Franzon, NCSU, u Strengths u Areas for future focus u Opportunities Franzon.
1 Computer Systems. 2 Introduction – What is a Computer? This course is all about how computers work What do computer and computer system mean to you?
Paper Review I Coarse Grained Reconfigurable Arrays Presented By: Matthew Mayhew I.D.# ENG*6530 Tues, June, 10,
NIMIA October 2001, Crema, Italy - Vincenzo Piuri, University of Milan, Italy NEURAL NETWORKS FOR SENSORS AND MEASUREMENT SYSTEMS Part II Vincenzo.
Chapter 6 Multiprocessor System. Introduction  Each processor in a multiprocessor system can be executing a different instruction at any time.  The.
NVIDIA Tesla GPU Zhuting Xue EE126. GPU Graphics Processing Unit The "brain" of graphics, which determines the quality of performance of the graphics.
Artificial Neural Networks An Introduction. What is a Neural Network? A human Brain A porpoise brain The brain in a living creature A computer program.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture.
IBM T. J. Watson Research © 2004 IBM Corporation On Scalable Storage Area Network(SAN) Fabric Design Algorithm Bong-Jun Ko (Columbia University) Kang-Won.
Lecture 16: Reconfigurable Computing Applications November 3, 2004 ECE 697F Reconfigurable Computing Lecture 16 Reconfigurable Computing Applications.
MAPLD 2005/254C. Papachristou 1 Reconfigurable and Evolvable Hardware Fabric Chris Papachristou, Frank Wolff Robert Ewing Electrical Engineering & Computer.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) Reconfigurable Architectures Forces that drive.
Computer Architecture Lecture 26 Past and Future Ralph Grishman November 2015 NYU.
Emerging Non-volatile Memories: Opportunities and Challenges
System on a Programmable Chip (System on a Reprogrammable Chip)
Chapter 5 - Internal Memory 5.1 Semiconductor Main Memory 5.2 Error Correction 5.3 Advanced DRAM Organization.
Brief Intro to Machine Learning CS539
Computer Organization and Architecture Lecture 1 : Introduction
These slides are based on the book:
ELE 523E COMPUTATIONAL NANOELECTRONICS
18-447: Computer Architecture Lecture 30B: Multiprocessors
Learning Deep Generative Models by Ruslan Salakhutdinov
Web: Parallel Computing Rabie A. Ramadan , PhD Web:
COMPUTATIONAL MODELS.
A High-Speed and High-Capacity Single-Chip Copper Crossbar
Large-scale Machine Learning
James Coole PhD student, University of Florida Aaron Landy Greg Stitt
Hyperdimensional Computing with 3D VRRAM In-Memory Kernels: Device-Architecture Co-Design for Energy-Efficient, Error-Resilient Language Recognition H.
Chapter 1: Introduction
NanoMadeo Introduction of Error Correcting Schemes in the Design Process of Self-Healing Circuits for Nanoscale Fabrics.
Grid Computing.
Energy Efficient Computing in Nanoscale CMOS
Organizers Adwait Jog EJ Kim
Dr. Unnikrishnan P.C. Professor, EEE
Linchuan Chen, Xin Huo and Gagan Agrawal
CS775: Computer Architecture
Anne Pratoomtong ECE734, Spring2002
Counter propagation network (CPN) (§ 5.3)
Zhen Dong1,2, Haitong Li1, and H.-S. Philip Wong1
Conventional Silicon Computers Ovonic Cognitive Computer
The OvonicTM Cognitive Computer
Organizers Adwait Jog Assistant Professor (William & Mary)
Hyunchul Park, Kevin Fan, Manjunath Kudlur,Scott Mahlke
Neuromorphic Algorithm Acceleration with
3.1 Introduction to CPU Central processing unit etched on silicon chip called microprocessor Contain tens of millions of tiny transistors Key components:
Optimizing MapReduce for GPUs with Effective Shared Memory Usage
Characteristics of Reconfigurable Hardware
URECA: A Compiler Solution to Manage Unified Register File for CGRAs
Neuro-RAM Unit in Spiking Neural Networks with Applications
†UCSD, ‡UCSB, EHTZ*, UNIBO*
Programmable Logic- How do they do that?
Towards Neuromorphic Complexity Analysis
A New Hybrid FPGA with Nanoscale Clusters and CMOS Routing Reza M. P
Chapter 01: Introduction
ARTIFICIAL NEURAL networks.
Introduction to Neural Network
ELE 523E COMPUTATIONAL NANOELECTRONICS
Fault Mitigation of Switching Lattices under the Stuck-At Model
Patterson: Chap 1 A Review of Machine Learning
Presentation transcript:

IRDS Emerging Research Devices and Architectures NanoCrossbar Workshop Paul Franzon North Carolina State University Raleigh, NC paulf@ncsu.edu 919.515.7351 http://www.ece.ncsu.edu/erl/faculty/paulf.html

Thanks Thanks to Rambus for hosting this event Spherically Gary Edge, VP for Research and Jaimie Stuart for Logistics

Background International Roadmap for Semiconductors (ITRS) used to sponsored by SIA Now International Roadmap for Devices and Systems (IRDS) sponsored by IEEE Emerging Research Devices (ERD) Chapter had a subsection called Emerging Research Architectures (ERA). Both Edited chapter in odd years Help workshops in even years Held a workshop on Storage Class Memory in 2012 Been wanting to do workshops in other areas but lacked good definition For 2016 two workshops Nanocrossbar Approximate/ Stochastic/ Probabilistic computing

Core Group - ERA Paul Franzon, NCSU (editor) An Chen, IBM (ERA chair) Shamik Das, Mitre Matthew Marinella, Sandia Erik DeBenidictis, Sandia Geoff Burr , IBM

New learning algorithms Probabilistic Learning Program-Centric (performance and components dictated by designer) Data-Centric (performance and/or components influenced by the data that is passed through the system) Good old-fashioned Von Neumann Non-Von Neumann Memory Processor Non-VN Processor (including less-than-reliable VN) Trained off-line Trained in-line CMOS Non-CMOS Active Interconnect Analog computing (w/ Flash) Execution of pre-trained ANN FPGA SRAM CMOS Coarse-Grained Reconfigurable Architectures New learning algorithms (unsupervised, reinforcement) True North Ohmic Weave DRAM “Next switch” Coupled oscillators Flash Analog computing Automata Processing HTM GPUs Crossbars for STDP Deterministic/reliable NV computing Accelerators (multimedia, etc.) ML Acceler-ators (Convolution, SVM, ML) TCAM NVM- based FPGA Supervised ANN learning NVM crossbars for S-SCM, M-SCM Logic-in- memory Crossbars for backprop Probabilistic computing Probabilistic Learning CMOS beyond the design envelope deterministic Non- Quantum computing Bayesian Approximate computing RBM

2015 Chapter Started tracking NanoCrossbars explicitly

Goals of Workshop Identify and quantify the state of the art in devices, design, modeling, fabrication, and employment of Nano-enabled Crossbars for computing. Identify the research barriers impeding the use of NanoCrossbars for computing.

Questions asked to presenters General, including memories What is the status of achieving linear repeatable response, low power, sufficiently long retention, fast writes, sufficiently distinguishable resistances in different states, and long write endurance in one nanoscale device? Is the access device issue solved? What are the remaining issues?

… Questions Neuromorphic computing What are the requirements on device linearity, scalability and dynamic range? What is achieved today? What are the tradeoffs exposed in achieving this? What style of non-traditional computing is best suited to nanodevice arrays? E.g. spiking neuron, deep network, full logic map, etc. Why? What are the specific gaps in device properties that are preventing us from achieving this paradigm?

… Questions Analog computing What are the requirements on device linearity, scalability and dynamic range? What is achieved today? What are the tradeoffs exposed in achieving this? What is the required device yield? What mechanisms are available for implementing working arrays in the presence of <100% yield? What levels of noise during readout can be tolerated? To what degree could closed-loop control (e.g., iterative resistance-tuning for higher accuracy) be available during device write?

Agenda 0900– 0930 : Introduction: Paul Franzon, NC State University 0930 – 1020 : Matt Marinella, Sandia 1020 – 1040 : Break 1040 : 1120 : Geoff Burr IBM 1120 – 1200 : Catchup 1200 – 1300 : Lunch 1340 : Dmitri Strukov, UCSB 1420 : Kevin Cao, ASU 1420– 1440 : Break 1440 - 1520: Miao Hu, HPE 520 – 1600 : Wei Lu, UMich 1600 – 1640 : Gert Cauwenberghs, UCSD (via webex)