Yi-Lin, Tu 2013 IEE5011 –Fall 2013 Memory Systems Wide I/O High Bandwidth DRAM Yi-Lin, Tu Department of Electronics Engineering National Chiao Tung University.

Slides:



Advertisements
Similar presentations
Zehan Cui, Yan Zhu, Yungang Bao, Mingyu Chen Institute of Computing Technology, Chinese Academy of Sciences July 28, 2011.
Advertisements

Packaging.
DRAM background Fully-Buffered DIMM Memory Architectures: Understanding Mechanisms, Overheads and Scaling, Garnesh, HPCA'07 CS 8501, Mario D. Marino, 02/08.
Dawei Huang, IEEE Journal of Selected Topics in Quantum Electronics, March/April 2003 Optical Interconnects: Out of the Box Forever? Jeong-Min Lee
An International Technology Roadmap for Semiconductors
Arjun Suresh S7, R College of Engineering Trivandrum.
Smart Refresh: An Enhanced Memory Controller Design for Reducing Energy in Conventional and 3D Die-Stacked DRAMs Mrinmoy Ghosh Hsien-Hsin S. Lee School.
COEN 180 DRAM. Dynamic Random Access Memory Dynamic: Periodically refresh information in a bit cell. Else it is lost. Small footprint: transistor + capacitor.
Tiered-Latency DRAM: A Low Latency and A Low Cost DRAM Architecture
3D-MAPS: 3D Massively Parallel Processor with Stacked Memory Dae Hyun Kim, Krit Athikulwongse, Michael Healy, Mohammad Hossain, Moongon Jung, et al. Georgia.
CS.305 Computer Architecture Memory: Structures Adapted from Computer Organization and Design, Patterson & Hennessy, © 2005, and from slides kindly made.
11/29/2004EE 42 fall 2004 lecture 371 Lecture #37: Memory Last lecture: –Transmission line equations –Reflections and termination –High frequency measurements.
Lecture 12: DRAM Basics Today: DRAM terminology and basics, energy innovations.
1 Lecture 15: DRAM Design Today: DRAM basics, DRAM innovations (Section 5.3)
Scaling the Bandwidth Wall: Challenges in and Avenues for CMP Scalability 36th International Symposium on Computer Architecture Brian Rogers †‡, Anil Krishna.
Die-Hard SRAM Design Using Per-Column Timing Tracking
1 COMP 206: Computer Architecture and Implementation Montek Singh Wed., Nov. 13, 2002 Topic: Main Memory (DRAM) Organization.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 19: April 9, 2008 Routing 1.
Microwave Interference Effects on Device,
1 COMP 206: Computer Architecture and Implementation Montek Singh Mon., Nov. 18, 2002 Topic: Main Memory (DRAM) Organization – contd.
Physical Memory and Physical Addressing By: Preeti Mudda Prof: Dr. Sin-Min Lee CS147 Computer Organization and Architecture.
©Wen-mei W. Hwu and David Kirk/NVIDIA, ECE408/CS483/ECE498AL, University of Illinois, ECE408/CS483 Applied Parallel Programming Lecture 7: DRAM.
MGR: Multi-Level Global Router Yue Xu and Chris Chu Department of Electrical and Computer Engineering Iowa State University ICCAD
TLC: Transmission Line Caches Brad Beckmann David Wood Multifacet Project University of Wisconsin-Madison 12/3/03.
L i a b l eh kC o m p u t i n gL a b o r a t o r y Yield Enhancement for 3D-Stacked Memory by Redundancy Sharing across Dies Li Jiang, Rong Ye and Qiang.
CDMA WIRELESS DATA TRANSMITTER By Vijay kumar kintali B.tech 7 th semester Regd no:
Global Routing.
CSIE30300 Computer Architecture Unit 07: Main Memory Hsin-Chou Chi [Adapted from material by and
Survey of Existing Memory Devices Renee Gayle M. Chua.
Dong Hyuk Woo Nak Hee Seong Hsien-Hsin S. Lee
Kuang-Yu,Li 2013 IEE5011 –Autumn 2013 Memory Systems Duty Cycle Correctors (DCC) In GDDR5 SDRAM Kuang-Yu, Li Department of Electronics Engineering National.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 7: February 6, 2012 Memories.
A.SATHEESH Department of Software Engineering Periyar Maniammai University Tamil Nadu.
1 Memory Design EE 208 – Logic Design Chapter 7 Sohaib Majzoub.
Wei-Shen, Hsu 2013 IEE5011 –Autumn 2013 Memory Systems Solid State Drive with Flash Memory Wei-Shen, Hsu Department of Electronics Engineering National.
1 Lecture 14: DRAM Main Memory Systems Today: cache/TLB wrap-up, DRAM basics (Section 2.3)
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 13: February 20, 2002 Routing 1.
Computer Architecture Lecture 24 Fasih ur Rehman.
Modern VLSI Design 3e: Chapter 7 Copyright  1998, 2002 Prentice Hall PTR Topics n Power/ground routing. n Clock routing. n Floorplanning tips. n Off-chip.
Shih-Fan, Peng 2013 IEE5008 –Autumn 2013 Memory Systems DRAM Controller for Video Application Shih-Fan, Peng Department of Electronics Engineering National.
CS/EE 5810 CS/EE 6810 F00: 1 Main Memory. CS/EE 5810 CS/EE 6810 F00: 2 Main Memory Bottom Rung of the Memory Hierarchy 3 important issues –capacity »BellÕs.
High Performance Interconnect and Packaging Chung-Kuan Cheng CSE Department UC San Diego
Simultaneous Multi-Layer Access Improving 3D-Stacked Memory Bandwidth at Low Cost Donghyuk Lee, Saugata Ghose, Gennady Pekhimenko, Samira Khan, Onur Mutlu.
1 Lecture: Memory Technology Innovations Topics: memory schedulers, refresh, state-of-the-art and upcoming changes: buffer chips, 3D stacking, non-volatile.
1 Memory Hierarchy (I). 2 Outline Random-Access Memory (RAM) Nonvolatile Memory Disk Storage Suggested Reading: 6.1.
1 Lecture 3: Memory Energy and Buffers Topics: Refresh, floorplan, buffers (SMB, FB-DIMM, BOOM), memory blades, HMC.
Contemporary DRAM memories and optimization of their usage Nebojša Milenković and Vladimir Stanković, Faculty of Electronic Engineering, Niš.
Implementing Tile-based Chip Multiprocessors with GALS Clocking Styles Zhiyi Yu, Bevan Baas VLSI Computation Lab, ECE Department University of California,
© Wen-mei Hwu and S. J. Patel, 2005 ECE 511, University of Illinois Lecture 5-6: Memory System.
1 Lecture: DRAM Main Memory Topics: DRAM intro and basics (Section 2.3)
CS35101 Computer Architecture Spring 2006 Lecture 18: Memory Hierarchy Paul Durand ( ) [Adapted from M Irwin (
Computer Architecture Chapter (5): Internal Memory
Dept. of Electronics Engineering & Institute of Electronics National Chiao Tung University Hsinchu, Taiwan ISPD’16 Generating Routing-Driven Power Distribution.
DRAM Tutorial Lecture Vivek Seshadri. Vivek Seshadri – Thesis Proposal DRAM Module and Chip 2.
CS203 – Advanced Computer Architecture Main Memory Slides adapted from Onur Mutlu (CMU)
15-740/ Computer Architecture Lecture 25: Main Memory
CALTECH CS137 Fall DeHon 1 CS137: Electronic Design Automation Day 21: November 28, 2005 Routing 1.
1 Lecture 16: Main Memory Innovations Today: DRAM basics, innovations, trends HW5 due on Thursday; simulations can take a few hours Midterm: 32 scores.
1 Lecture: Memory Basics and Innovations Topics: memory organization basics, schedulers, refresh,
CS161 – Design and Architecture of Computer Main Memory Slides adapted from Onur Mutlu (CMU)
ESE532: System-on-a-Chip Architecture
Architecture & Organization 1
Lecture 15: DRAM Main Memory Systems
Lecture: Memory, Multiprocessors
The Main Memory system: DRAM organization
Architecture & Organization 1
Lecture: Memory Technology Innovations
EE4271 VLSI Design, Fall 2016 VLSI Channel Routing.
15-740/ Computer Architecture Lecture 19: Main Memory
DRAM Hwansoo Han.
Presentation transcript:

Yi-Lin, Tu 2013 IEE5011 –Fall 2013 Memory Systems Wide I/O High Bandwidth DRAM Yi-Lin, Tu Department of Electronics Engineering National Chiao Tung University

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Outline Introduction Proximity Communication A Wide I/O DRAM Architecture Conclusion Reference 2

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Introduction Memory gap How to solve this problem? Other techniques? 3

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Proximity Communication A wireless chip-to-chip communication technology. Two chips are placed face to face and their bonding pads are allowed to come within close proximity of each other without touching. 4

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Proximity Communication 5

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Proximity Communication Advantages Increase I/O density Remove the on/off chip wires Remove the on-die termination Ease of testability Remove the ESD structures 6

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Proximity Communication 7

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Proximity Communication Challenges Mechanical misalignment Supplying power to chips Thermal removal 8

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Proximity Communication Mechanical misalignment Six axis Multiple source 9

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Proximity Communication Electronic sensor Chip to chip separation sensor Vernier scale(translation) 10

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Proximity Communication Electronic re-alignment Use receiver and transmitter array. This array has the ability to electrically reposition the transmitter pads to align the transmitter and receiver pads. 11

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture Using 4Gb DRAM as the starting point for developing a wide I/O DRAM architecture. 12

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture Pad moving Moving the I/O channel to the edge. Data and comment signals will need to be buffered at the center. Allows the local column circuitry to be moved to the center. 13

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture Centralizing Limit the bandwidth of the column and row path. It’s possible by using proximity communication. Increase the array efficiency. 14

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture Conventional DRAM chips operate with eight internal memory bank. Enable eight wordlines to be active at once, one of each bank. Possible to perform sequential column access to each bank. Remove the large row access latency. 15

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture 512Mb bank structure 4 possible arrangements for creating a 512Mb memory bank. Keep the global I/O metal lines short allows for a higher bandwidth on an open page. C and D are preferred. 16

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture 17

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture Challenges Number of metal layers Global I/O routing Local I/O routing 18

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture Number of metal layers and global I/O routing A wide I/O architecture with 64 data pins operating with burst length of eight, and therefore a pre-fetch of 8n, requires 512 bits to be accessed in parallel. The highest level of metal is used for global I/O routing and metal one is for global wordlines. Increase the parasitic of each wordline. 19

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture Divide the 8k page 256 bits per half-bank This enables a possibility of increasing the number of global I/O tracks from 512 to 1024 or higher. 20

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture Local I/O routing The large number of global I/O tracks requires 32 data signals from each 256 kb memory array. Moving 32 data signals from the bitline sense amplifiers to the global I/O track is a major challenge due to the limited routing space above the bitline sense amplifiers. Signals can be routed to the top and bottom of each 256kb memory segment. 21

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu A Wide I/O DRAM Architecture Summary Developing a wide I/O DRAM architecture that is suitable for Proximity Communication requires the communication channel to be moved to the side of the DRAM chip. A distributed page and bank structure was developed to enable the possibility of using Proximity Communication with 32 data pins. Reaching the use of 64 data pins required architectural changes that would not increase the manufacturing cost compared to current DRAM architectures. 22

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Conclusion 23 A DRAM architecture that uses proximity communication to increase the off-chip bandwidth while scaling the number of data pins. Proximity communication allows for an increase of I/O density, ease of testability, removal of ESD structures and resistive termination. Electrical sensors and electrical re-alignment techniques has enabled proximity communication to become a viable I/O technology. The challenges of creating a wide I/O architecture were found to be in the global and local I/O routing.

NCTU IEE5011 Memory Systems 2013Yi-Lin, Tu Reference Q. Harvard, “Wide I/O DRAM architecture utilizing proximity communication,” Master’s thesis, Boise State University, December Q. Harvard, R. J. Baker, and R. Drost, “Main memory with proximity communication: A wide I/O DRAM architecture,” in Proc. IEEE Workshop Microelectron. Electron Devices, Apr. 2010, pp. 1–4. Harvard, Q., Baker, R.J., “A scalable I/O architecture for wide I/O DRAM,” 54th International Midwest Symposium on Circuits and Systems (MWSCAS), 7-10 Aug. 2011, Seoul,

Yi-Lin, Tu 2013 Thank you 25