ERD and Memory Architectures Paul Franzon Department of Electrical and Computer Engineering 919.515.7351.

Slides:



Advertisements
Similar presentations
Automatic Data Movement and Computation Mapping for Multi-level Parallel Architectures with Explicitly Managed Memories Muthu Baskaran 1 Uday Bondhugula.
Advertisements

MEMORY popo.
Subthreshold SRAM Designs for Cryptography Security Computations Adnan Gutub The Second International Conference on Software Engineering and Computer Systems.
A Search Memory Substrate for High Throughput and Low Power Packet Processing Sangyeun Cho, Michel Hanna and Rami Melhem Dept. of Computer Science University.
Outline Memory characteristics SRAM Content-addressable memory details DRAM © Derek Chiou & Mattan Erez 1.
A Scalable and Reconfigurable Search Memory Substrate for High Throughput Packet Processing Sangyeun Cho and Rami Melhem Dept. of Computer Science University.
+ CS 325: CS Hardware and Software Organization and Architecture Internal Memory.
TM This document is strictly confidential and proprietary of SMIC. It must not be copied or used for any purpose other than for reference only, and SMIC.
System Design Tricks for Low-Power Video Processing Jonah Probell, Director of Multimedia Solutions, ARC International.
Priority Research Direction (I/O Models, Abstractions and Software) Key challenges What will you do to address the challenges? – Develop newer I/O models.
S CRATCHPAD M EMORIES : A D ESIGN A LTERNATIVE FOR C ACHE O N - CHIP M EMORY IN E MBEDDED S YSTEMS - Nalini Kumar Gaurav Chitroda Komal Kasat.
ENGIN112 L38: Programmable Logic December 5, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 38 Programmable Logic.
Computational Astrophysics: Methodology 1.Identify astrophysical problem 2.Write down corresponding equations 3.Identify numerical algorithm 4.Find a computer.
Overview Memory definitions Random Access Memory (RAM)
1 Chapter Seven Large and Fast: Exploiting Memory Hierarchy.
Memory Hierarchy.1 Review: Major Components of a Computer Processor Control Datapath Memory Devices Input Output.
Improving the Efficiency of Memory Partitioning by Address Clustering Alberto MaciiEnrico MaciiMassimo Poncino Proceedings of the Design,Automation and.
Modern VLSI Design 2e: Chapter 6 Copyright  1998 Prentice Hall PTR Topics n Memories: –ROM; –SRAM; –DRAM. n PLAs.
Memory Key component of a computer system is its memory system to store programs and data. ITCS 3181 Logic and Computer Systems 2014 B. Wilkinson Slides12.ppt.
Memory and Storage - Sheetal Gosrani. Overview Memory Hierarchy RAM Memory Chip Organization ROM Flash Memory.
1: IntroductionData Management & Engineering1 Course Overview: CS 395T Semantic Web, Ontologies and Cloud Databases Daniel P. Miranker Objectives: Get.
Presenter MaxAcademy Lecture Series – V1.0, September 2011 Introduction and Motivation.
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative.
Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.
Physical Memory By Gregory Marshall. MEMORY HIERARCHY.
156 / MAPLD 2005 Rollins 1 Reducing Energy in FPGA Multipliers Through Glitch Reduction Nathan Rollins and Michael J. Wirthlin Department of Electrical.
Lecture on Electronic Memories. What Is Electronic Memory? Electronic device that stores digital information Types –Volatile v. non-volatile –Static v.
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
Work in Progress --- Not for Publication 1 Beyond CMOS – April 12, 2010 ERD – Emerging Research Architectures Paul Franzon,
Introduction to Interconnection Networks. Introduction to Interconnection network Digital systems(DS) are pervasive in modern society. Digital computers.
TOWARDS AN EARLY DESIGN SPACE EXPLORATION TOOL SET FOR STT-RAM DESIGN Philip Asare and Ben Melton.
February 12, 1998 Aman Sareen DPGA-Coupled Microprocessors Commodity IC’s for the Early 21st Century by Aman Sareen School of Electrical Engineering and.
1 ERD Memory Review Paul Franzon, NCSU, u Strengths u Areas for future focus u Opportunities Franzon.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
Chapter 1 Basic Structure of Computers. Chapter Outline computer types, structure, and operation instructions and programs numbers, arithmetic operations,
Lecture 03: Fundamentals of Computer Design - Trends and Performance Kai Bu
Last Time Performance Analysis It’s all relative
CS1104: Computer Organisation School of Computing National University of Singapore.
Dong Hyuk Woo Nak Hee Seong Hsien-Hsin S. Lee
Chapter 8 Memory Interface
CPEN Digital System Design
Ralph K. Cavin, III March 18, 2009 Brussels.  Is there a Carnot-like theorem for computation? ◦ e.g., a limit on rate of information throughput/power.
+ CS 325: CS Hardware and Software Organization and Architecture Memory Organization.
King Fahd University of Petroleum and Minerals King Fahd University of Petroleum and Minerals Computer Engineering Department Computer Engineering Department.
Routing Prefix Caching in Network Processor Design Huan Liu Department of Electrical Engineering Stanford University
Computer Organization & Assembly Language © by DR. M. Amer.
Energy Reduction for STT-RAM Using Early Write Termination Ping Zhou, Bo Zhao, Jun Yang, *Youtao Zhang Electrical and Computer Engineering Department *Department.
Outline Motivation Simulation Framework Experimental methodology STT-RAM (Spin Torque Transference) ReRAM (Resistive RAM) PCRAM (Phase change) Comparison.
Authors – Jeahyuk huh, Doug Burger, and Stephen W.Keckler Presenter – Sushma Myneni Exploring the Design Space of Future CMPs.
Computer Architecture Lecture 24 Fasih ur Rehman.
A High-Speed & High-Capacity Single-Chip Copper Crossbar John Damiano, Bruce Duewer, Alan Glaser, Toby Schaffer, John Wilson, and Paul Franzon North Carolina.
By: Eric Moreno.  What is it?  What does it do?  What impact will it have?  When will it be available?
Lecture 17: Dynamic Reconfiguration I November 10, 2004 ECE 697F Reconfigurable Computing Lecture 17 Dynamic Reconfiguration I Acknowledgement: Andre DeHon.
Chapter 11 System Performance Enhancement. Basic Operation of a Computer l Program is loaded into memory l Instruction is fetched from memory l Operands.
CS203 – Advanced Computer Architecture
VU-Advanced Computer Architecture Lecture 1-Introduction 1 Advanced Computer Architecture CS 704 Advanced Computer Architecture Lecture 1.
1 Paolo Bianco Storage Architect Sun Microsystems An overview on Hybrid Storage Technologies.
CS 704 Advanced Computer Architecture
CS203 – Advanced Computer Architecture
Architecture & Organization 1
Welcome.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Scalable High Performance Main Memory System Using PCM Technology
Architecture & Organization 1
Morgan Kaufmann Publishers Memory Hierarchy: Introduction
Electronics for Physicists
ECE 463/563 Fall `18 Memory Hierarchies, Cache Memories H&P: Appendix B and Chapter 2 Prof. Eric Rotenberg Fall 2018 ECE 463/563, Microprocessor Architecture,
The University of Adelaide, School of Computer Science
Utsunomiya University
Presentation transcript:

ERD and Memory Architectures Paul Franzon Department of Electrical and Computer Engineering

2 High Level Overview Challenges for Memories  Power consumption  Bandwidth  Resiliency  Scaling (density, speed, power) Opportunities  Increased use of memory in logic and routing  Using ERD devices to help ERD memories scale  Intersection with 3DIC Impact on Memory Architectures  Scope for new and more specialized memory architectures  Mobile; (Cloud) Computing; SSD; Structured ASIC;

3 Challenge: Bandwidth  Soon to exceed 1 TBps  Key challenge: Power: DDR3 would consume 600 W at this bandwidth

4 Challenge: Power Consumption  Memory hierarchy is becoming the largest single consumer of the power in a computer  Consumes 30%+ of power in today’s servers  E.g. DARPA Exascale computing study Note: This assumed DRAM power was reduced from 600 pJ/bit to 3 pJ/bit

5 Challenge: Resiliency Issues:  SEU of SRAM  Checkpointing and resiliency of entire processor  Future scaled systems could spend 80% of their time checkpointing Note: DRAM Failures almost all due to packaging

6 Opportunity: Power  Reduced capacitance associated with scalable 4F 2 ERD memory technologies  Change RAM architecture  Smaller sub-arrays  Less energy/access  Low-swing interconnect  Reduced overhead  Revisit memory hierarchy to optimize energy, not latency  Exploit 3DIC for energy reduction

7 Challenge: Scaling Scaling  Peripheral Circuits  Smaller memory sub-arrays increases overhead of sense amps and row/column decoders  4F 2 cells increases overhead of sense amps and row/column decoders  Need:  Technologies and techniques to reduce overhead of peripheral circuits  Circuit Solutions  Use of ERDs  Use of 3DIC to implement peripheral circuits in a circuit- optimized technology

8 Challenge: Scaling 1R1D cells needed for resistive memories to ensure scalability (1R cells limits array size) Impact of rectification ratio on scalability with 1R1D cell On:Off Ratio Rectifcation  Memory Size

9 Opportunity: Memory as Logic 4F 2 programmable ERD memory cells permit high density Look Up Tables and programmable logic  Nanocrossbar  CMOL  Potential for high-speed (low C) and high density  Need to compare to logic ITRS to establish benchmark

10 Opportunity: Memory as Interconnect  Use programmable persistent memory elements as routable switchbox interconnect  Use high-density, low-overhead memory in NOC store/forward routing architectures  Example: Using a low-voltage nanocrystal flash device as an interconnect switch

11 Other Comments ERD as a Solid State Disk?  Definitely an opportunity but it looks like magnetic disk will always outscale solid state disk in density and cost/bit.  Instead use inside memory hierarchy, e.g. Disk cache, checkpoint store  ERD as Associative Memory, Bayesian inference calculator, etc.?  Some ERD have analog behaviors that have yet to be exploited or explored  E.g. Memristor, Memcapacitor, Meminductor

12 Other Comments Memory in Logic  3DIC and 4F 2 memory cells permits memory-heavy architectures to be revisited  But energy/bit-accessed has to better than energy/FLOP for computation  Real need are algorithms that increase spatial locality

13 Outline of ERD Chapter Memory Challenges  Table with metrics Solution paths to these challenges  Table with metrics New Opportunities for ERD  Architectural and circuit opportunities  Review of current memory enhanced architectures???  Nanocrossbar, CMOL, Synaptic  Tables, largely qualitative, somewhat quantitative

14 Feedback from Group 1.Quantify overhead of peripheral circuits as a function of core size 2.Latency matters too, both memory and interconnect. Typically traded for bandwidth. Quantify.