Shih-Fan, Peng 2013 IEE5008 –Autumn 2013 Memory Systems DRAM Controller for Video Application Shih-Fan, Peng Department of Electronics Engineering National.

Slides:



Advertisements
Similar presentations
Sungjun Kim Columbia University Edward A. Lee UC Berkeley
Advertisements

Outline Memory characteristics SRAM Content-addressable memory details DRAM © Derek Chiou & Mattan Erez 1.
AMD OPTERON ARCHITECTURE Omar Aragon Abdel Salam Sayyad This presentation is missing the references used.
Lecture Objectives: 1)Explain the limitations of flash memory. 2)Define wear leveling. 3)Define the term IO Transaction 4)Define the terms synchronous.
Low Power Memory. Quick Start Training Agenda What constitutes low power memory Variations & vendors of low power memory How to interface using CoolRunner-II.
Main Mem.. CSE 471 Autumn 011 Main Memory The last level in the cache – main memory hierarchy is the main memory made of DRAM chips DRAM parameters (memory.
CS.305 Computer Architecture Memory: Structures Adapted from Computer Organization and Design, Patterson & Hennessy, © 2005, and from slides kindly made.
Memories and the Memory Subsystem; The Memory Hierarchy; Caching; ROM.
Chapter 9 Memory Basics Henry Hexmoor1. 2 Memory Definitions  Memory ─ A collection of storage cells together with the necessary circuits to transfer.
Lecture 12: DRAM Basics Today: DRAM terminology and basics, energy innovations.
1 Lecture 15: DRAM Design Today: DRAM basics, DRAM innovations (Section 5.3)
Memory Hierarchy.1 Review: Major Components of a Computer Processor Control Datapath Memory Devices Input Output.
Array Allocation Taking into Account SDRAM Characteristics Hong-Kai Chang Youn-Long Lin Department of Computer Science National Tsing Hua University HsinChu,
Optimization Of Power Consumption For An ARM7- BASED Multimedia Handheld Device Hoseok Chang; Wonchul Lee; Wonyong Sung Circuits and Systems, ISCAS.
1 Lecture 14: Virtual Memory Today: DRAM and Virtual memory basics (Sections )
Chapter 8: Part II Storage, Network and Other Peripherals.
Overview Booth’s Algorithm revisited Computer Internal Memory Cache memory.
1 Lecture 1: Introduction and Memory Systems CS 7810 Course organization:  5 lectures on memory systems  5 lectures on cache coherence and consistency.
CSIT 301 (Blum)1 Memory. CSIT 301 (Blum)2 Types of DRAM Asynchronous –The processor timing and the memory timing (refreshing schedule) were independent.
Memory Technology “Non-so-random” Access Technology:
Charles Kime & Thomas Kaminski © 2008 Pearson Education, Inc. (Hyperlinks are active in View Show mode) Chapter 8 – Memory Basics Logic and Computer Design.
Basic concepts Maximum size of the memory depends on the addressing scheme: 16-bit computer generates 16-bit addresses and can address up to 216 memory.
CSIE30300 Computer Architecture Unit 07: Main Memory Hsin-Chou Chi [Adapted from material by and
Survey of Existing Memory Devices Renee Gayle M. Chua.
Kuang-Yu,Li 2013 IEE5011 –Autumn 2013 Memory Systems Duty Cycle Correctors (DCC) In GDDR5 SDRAM Kuang-Yu, Li Department of Electronics Engineering National.
A Mixed Time-Criticality SDRAM Controller MeAOW Sven Goossens, Benny Akesson, Kees Goossens COBRA – CA104 NEST.
Main Memory CS448.
CPEN Digital System Design
Computer Architecture System Interface Units Iolanthe II approaches Coromandel Harbour.
Chapter 8 CPU and Memory: Design, Implementation, and Enhancement The Architecture of Computer Hardware and Systems Software: An Information Technology.
I/O Computer Organization II 1 Interconnecting Components Need interconnections between – CPU, memory, I/O controllers Bus: shared communication channel.
Wei-Shen, Hsu 2013 IEE5011 –Autumn 2013 Memory Systems Solid State Drive with Flash Memory Wei-Shen, Hsu Department of Electronics Engineering National.
Yi-Lin, Tu 2013 IEE5011 –Fall 2013 Memory Systems Wide I/O High Bandwidth DRAM Yi-Lin, Tu Department of Electronics Engineering National Chiao Tung University.
1 Lecture 14: DRAM Main Memory Systems Today: cache/TLB wrap-up, DRAM basics (Section 2.3)
Computer Architecture Lecture 32 Fasih ur Rehman.
1 Presented By: Michael Bieniek. Embedded systems are increasingly using chip multiprocessors (CMPs) due to their low power and high performance capabilities.
By Edward A. Lee, J.Reineke, I.Liu, H.D.Patel, S.Kim
Modern DRAM Memory Architectures Sam Miller Tam Chantem Jon Lucas CprE 585 Fall 2003.
Computer Architecture Lecture 24 Fasih ur Rehman.
Jason Li Jeremy Fowers 1. Speedups and Energy Reductions From Mapping DSP Applications on an Embedded Reconfigurable System Michalis D. Galanis, Gregory.
SYEN 3330 Digital SystemsJung H. Kim 1 SYEN 3330 Digital Systems Chapter 9 – Part 2.
Chapter 4 Memory Design: SOC and Board-Based Systems
-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.
How do you model a RAM in Verilog. Basic Memory Model.
Computer Architecture System Interface Units Iolanthe II in the Bay of Islands.
High-Performance DRAM System Design Constraints and Considerations by: Joseph Gross August 2, 2010.
Chapter 6 Storage and Other I/O Topics. Chapter 6 — Storage and Other I/O Topics — 2 Introduction I/O devices can be characterized by Behaviour: input,
1 Lecture 2: Memory Energy Topics: energy breakdowns, handling overfetch, LPDRAM, row buffer management, channel energy, refresh energy.
Processor Memory Processor-memory bus I/O Device Bus Adapter I/O Device I/O Device Bus Adapter I/O Device I/O Device Expansion bus I/O Bus.
Digital Circuits Introduction Memory information storage a collection of cells store binary information RAM – Random-Access Memory read operation.
1 Adapted from UC Berkeley CS252 S01 Lecture 18: Reducing Cache Hit Time and Main Memory Design Virtucal Cache, pipelined cache, cache summary, main memory.
Charles Kime & Thomas Kaminski © 2008 Pearson Education, Inc. (Hyperlinks are active in View Show mode) Chapter 8 – Memory Basics Logic and Computer Design.
1 Lecture: Memory Technology Innovations Topics: memory schedulers, refresh, state-of-the-art and upcoming changes: buffer chips, 3D stacking, non-volatile.
Contemporary DRAM memories and optimization of their usage Nebojša Milenković and Vladimir Stanković, Faculty of Electronic Engineering, Niš.
1 Lecture: DRAM Main Memory Topics: DRAM intro and basics (Section 2.3)
Chapter 11 System Performance Enhancement. Basic Operation of a Computer l Program is loaded into memory l Instruction is fetched from memory l Operands.
CS35101 Computer Architecture Spring 2006 Lecture 18: Memory Hierarchy Paul Durand ( ) [Adapted from M Irwin (
Gunjeet Kaur Dronacharya Group of Institutions. Outline I Random-Access Memory Memory Decoding Error Detection and Correction Read-Only Memory Programmable.
15-740/ Computer Architecture Lecture 25: Main Memory
1 Lecture: Memory Basics and Innovations Topics: memory organization basics, schedulers, refresh,
CS 704 Advanced Computer Architecture
CSE 502: Computer Architecture
Reducing Hit Time Small and simple caches Way prediction Trace caches
Lecture 15: DRAM Main Memory Systems
Lecture: DRAM Main Memory
Lecture: Memory Technology Innovations
15-740/ Computer Architecture Lecture 19: Main Memory
DRAM Hwansoo Han.
Bob Reese Micro II ECE, MSU
Presentation transcript:

Shih-Fan, Peng 2013 IEE5008 –Autumn 2013 Memory Systems DRAM Controller for Video Application Shih-Fan, Peng Department of Electronics Engineering National Chiao Tung University

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Outline Introduction Traditional Controller DRAM Memory Controller for Video Application Conclusion Reference 2

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng DRAM WORK 3

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng DRAM WORK(cont.) 4

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng DRAM Organization 5

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Introduction What is DRAM Memory Controller? Manage the movement of data in and out of DRAM devices Error detection and correction 6

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Abstract DRAM Memory Controller 7

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Address Mapping (Translation) Translation a given physical address in to indices in a DRAM memory system Channel ID, rank ID, bank ID, row ID, column ID Mapping scheme is often coupled to the row-buffer- management policy Ex: memory request sequence Mapping to different rows of the same bank Mapping to different rows of different bank 8

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Write caching Write requests are typically non-critical in terms of latency Defer write requests and allow read requests to proceed ahead But sometime it is not good Read requests must be checked against the address of cached writes Controller must ensure the correctness of memory- access order 9

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Request Queue Organizations Place the commands into single queue or multiple queue Prioritize commands on many different factor Priority of the request, availability of resources to a given queue, bank address of the request, age of request….. Ex: per-bank queuing organization Base on bank address 10

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Refresh Management DRAM must be refreshed to ensure the integrity of the data Ex: pseudo-static DRAM Temperature-compensated self-refresh is used If request collides with refresh action, assert a wait signal used in low frequency Separate refresh to each bank 11

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Method: Optimized the data arrangement Minimize the number of overhead cycles needed for row-activations Consider the feature of SDRAM and memory- access patterns of video-processing application Video-processing algorithms usually uses multi- dimensional arrays and nested loops 12

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng 13 Timing diagram of two consecutive read cycle

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Row changes for logical array with (1,128) window Consider a 2-D array of 8X8 data Cause seven row changes→more latency and power consumption 14

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Row changes for logical array with (8,16) window A logical array is portioned into a set of rectangles called windows Each window is stored in a row of SDRAM 15

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Condition for Minimal Overhead Cycles Condition: two banks, the horizontal size of windows is greater than the loop bound 16

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Method: History-based predictive approach(HDTV) What if the row address differs from the previous one? Page miss Additional cycles cannot be hidden What can do for it? Statically schedule the address sequence Control the memory operation mode But if it is irregular? 17

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng History-based predictive approach If t rp can be overlapped to burst accesses or data transfer between the memory controller and the processor Effective latency=t RCD + t CL Use the past history of memory reference Predict next access (built a two-bit counter for each row) If history predicts to same row  Bank remain in row active state If not  Change to idle state 18

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Two-bit saturated up/down counter(per- row) SH,WH→auto-precharge is not issued SM,WM→auto-precharge is issued 19

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Method: Burst Terminates Burst and Anticipative Row Activation In video application, most bandwidth is consumed by 2-D block data Block data may consist of several non-sequential accesses Data length is inconstant 20

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Organization of Address Bus Allocate and address space larger than the physical address Seamless transfer Mode Control Information (MCI) to memory controller Burst access operation is pre-calculated by embedded address generator(EAG) MCI=Mode Select (MS)& Stride control (SC) 21

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Organization of Address Bus 22

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng BTB and ARA 23

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Conclusion Optimize data arrangement Select the suitable window size History-based predictive approach Predict the successive memory access BTB and ARA Provide the parallel access of different banks 24

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng Reference [1] H. Kim and I. C. Park, “High Performance and Low Power Memory Interface Architecture for Video Processing Applications”, IEEE Transaction on Circuits and Systems for Video Technology, vol.11, pp – 1170, [2] J. Zhu, L. Hou, W. Wu, R. Wang, C. Huang, and J. Li, “High performance synchronous DRAMs controller in H.264 HDTV decoder,” in Proc. International Conference on Solid State and Integrated Circuits Technology, vol. 3, pp – 1624, [3] S. I. Park, Y. Yi, and I. C Park, “High Performance Memory Mode Control for HDTV Decoders,” IEEE Transactions on Consumer Electronics, vol. 49, no. 4, pp – 1353, November, [4] H-Y Kang, K-A Jeong, J-Y Bae, Y-S Lee, and S-H Lee, “MPEG4 AVC/H.264 Decoder with Scalable Bus Architecture and Dual Memory Controller,” in Proc. ISCAS, vol. 2, pp. 145 – 148, May, [5] K. B. Lee, T. C. Lin, and C. W Jen, “An Efficient Qual ity- Aware Memory Controller for Multimedia Platform SoC,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no. 5, pp. 620 – 633, May [6]G. Hyun, Y. Jin, J. Jung, S. Kim, “A Synchronous DRAM Controller of an H.264/AVC Encoder” International SoC Design Conference, Vol. 02, pp. II-113- II-116,2008 [7]Textbook: Memory Systems: Cache, DRAM, Disk chapter 13 25

NCTU IEE5008 Memory Systems 2013Shih-Fan, Peng 26 Thank you for your listening