CSE477 L25 Memory Peripheral.1Irwin&Vijay, PSU, 2003 CSE477 VLSI Digital Circuits Fall 2003 Lecture 25: Peripheral Memory Circuits Mary Jane Irwin ( www.cse.psu.edu/~mji.

Slides:



Advertisements
Similar presentations
Semiconductor Memory Design. Organization of Memory Systems Driven only from outside Data flow in and out A cell is accessed for reading by selecting.
Advertisements

+ CS 325: CS Hardware and Software Organization and Architecture Internal Memory.
Robust Low Power VLSI R obust L ow P ower VLSI Sub-threshold Sense Amplifier (SA) Compensation Using Auto-zeroing Circuitry 01/21/2014 Peter Beshay Department.
5-1 Memory System. Logical Memory Map. Each location size is one byte (Byte Addressable) Logical Memory Map. Each location size is one byte (Byte Addressable)
Sistemi Elettronici Programmabili1 Progettazione di circuiti e sistemi VLSI Anno Accademico Lezione Memorie (vedi anche i file pcs1_memorie.pdf.
COEN 180 DRAM. Dynamic Random Access Memory Dynamic: Periodically refresh information in a bit cell. Else it is lost. Small footprint: transistor + capacitor.
Digital Integrated Circuits A Design Perspective
Elettronica T AA Digital Integrated Circuits © Prentice Hall 2003 SRAM & DRAM.
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 19: Timing Issues; Introduction to Datapath.
Introduction to CMOS VLSI Design Lecture 13: SRAM
Chapter 9 Memory Basics Henry Hexmoor1. 2 Memory Definitions  Memory ─ A collection of storage cells together with the necessary circuits to transfer.
11/29/2004EE 42 fall 2004 lecture 371 Lecture #37: Memory Last lecture: –Transmission line equations –Reflections and termination –High frequency measurements.
Digital Integrated Circuits© Prentice Hall 1995 Memory SEMICONDUCTOR MEMORIES.
11/03/05ELEC / Lecture 181 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
Introduction to CMOS VLSI Design SRAM/DRAM
Digital Integrated Circuits© Prentice Hall 1995 Memory SEMICONDUCTOR MEMORIES.
Spring 07, Feb 27 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Power Consumption in a Memory Vishwani D. Agrawal.
Die-Hard SRAM Design Using Per-Column Timing Tracking
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 31: Array Subsystems (SRAM) Prof. Sherief Reda Division of Engineering,
Lecture 13, Slide 1EECS40, Fall 2004Prof. White Lecture #13 Announcements You should now purchase the reader EECS 40: Introduction to Microelectronics,
Modern VLSI Design 2e: Chapter 6 Copyright  1998 Prentice Hall PTR Topics n Memories: –ROM; –SRAM; –DRAM. n PLAs.
Lecture 19: SRAM.
Parts from Lecture 9: SRAM Parts from
Digital Integrated Circuits for Communication
55:035 Computer Architecture and Organization
Semiconductor Memories Lecture 1: May 10, 2006 EE Summer Camp Abhinav Agarwal.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 28: November 15, 2013 Memory Periphery.
EE415 VLSI Design DYNAMIC LOGIC [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Review: Basic Building Blocks  Datapath l Execution units -Adder, multiplier, divider, shifter, etc. l Register file and pipeline registers l Multiplexers,
Digital Integrated Circuits Chpt. 5Lec /29/2006 CSE477 VLSI Digital Circuits Fall 2002 Lecture 21: Multiplier Design Mary Jane Irwin (
Washington State University
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 28: November 16, 2012 Memory Periphery.
Digital Integrated Circuits© Prentice Hall 1995 Memory SEMICONDUCTOR MEMORIES Adapted from Jan Rabaey's IC Design. Copyright 1996 UCB.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n Latches and flip-flops. n RAMs and ROMs.
Modern VLSI Design 4e: Chapter 6 Copyright  2008 Wayne Wolf Topics Memories: –ROM; –SRAM; –DRAM; –Flash. Image sensors. FPGAs. PLAs.
Sp09 CMPEN 411 L23 S.1 CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 23: Memory Cell Designs SRAM, DRAM [Adapted from Rabaey’s Digital Integrated.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 12.1 EE4800 CMOS Digital IC Design & Analysis Lecture 12 SRAM Zhuo Feng.
Memory System Unit-IV 4/24/2017 Unit-4 : Memory System.
CPEN Digital System Design
Digital Logic Design Instructor: Kasım Sinan YILDIRIM
Advanced VLSI Design Unit 06: SRAM
CSE477 L24 RAM Cores.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 24: RAM Cores Mary Jane Irwin ( )
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 24: November 5, 2010 Memory Overview.
ECE 300 Advanced VLSI Design Fall 2006 Lecture 19: Memories
CSE477 L23 Memories.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 23: Semiconductor Memories Mary Jane Irwin (
CSE477 L07 Pass Transistor Logic.1Irwin&Vijay, PSU, 2003 CSE477 VLSI Digital Circuits Fall 2003 Lecture 07: Pass Transistor Logic Mary Jane Irwin (
Computer Memory Storage Decoding Addressing 1. Memories We've Seen SIMM = Single Inline Memory Module DIMM = Dual IMM SODIMM = Small Outline DIMM RAM.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 28: November 16, 2011 Memory Periphery.
Washington State University
CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 22: Memery, ROM
CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 24: Peripheral Memory Circuits [Adapted from Rabaey’s Digital Integrated Circuits, Second Edition,
Introduction to Computer Organization and Architecture Lecture 7 By Juthawut Chantharamalee wut_cha/home.htm.
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 28: November 7, 2014 Memory Overview.
EE 466/586 VLSI Design Partha Pande School of EECS Washington State University
Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 28: November 8, 2013 Memory Overview.
Sp09 CMPEN 411 L21 S.1 CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 21: Shifters, Decoders, Muxes [Adapted from Rabaey’s Digital Integrated Circuits,
Low Power SRAM VLSI Final Presentation Stephen Durant Ryan Kruba Matt Restivo Voravit Vorapitat.
CSE477 L21 Multiplier Design.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 21: Multiplier Design Mary Jane Irwin (
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2003 CSE477 VLSI Digital Circuits Fall 2003 Lecture 19: Timing Issues; Introduction to Datapath.
CSE477 L06 Static CMOS Logic.1Irwin&Vijay, PSU, 2003 CSE477 VLSI Digital Circuits Fall 2003 Lecture 06: Static CMOS Logic Mary Jane Irwin (
CSE477 L27 System Interconnect.1Irwin&Vijay, PSU, 2003 CSE477 VLSI Digital Circuits Fall 2003 Lecture 27: System Level Interconnect Mary Jane Irwin (
EE586 VLSI Design Partha Pande School of EECS Washington State University
Lecture 19: SRAM.
CSE477 VLSI Digital Circuits Fall 2003 Lecture 21: Multiplier Design
CSE477 VLSI Digital Circuits Fall 2003 Lecture 24: Memory Cell Designs
Mary Jane Irwin ( ) CSE477 VLSI Digital Circuits Fall 2002 Lecture 25: Peripheral Memory Circuits Mary Jane.
Mary Jane Irwin ( ) CSE477 VLSI Digital Circuits Fall 2002 Lecture 22: Shifters, Decoders, Muxes Mary Jane.
Mary Jane Irwin ( ) CSE477 VLSI Digital Circuits Fall 2003 Lecture 22: Shifters, Decoders, Muxes Mary Jane.
Semiconductor Memories
Day 26: November 10, 2010 Memory Periphery
Presentation transcript:

CSE477 L25 Memory Peripheral.1Irwin&Vijay, PSU, 2003 CSE477 VLSI Digital Circuits Fall 2003 Lecture 25: Peripheral Memory Circuits Mary Jane Irwin ( ) [Adapted from Rabaey’s Digital Integrated Circuits, Second Edition, ©2003 J. Rabaey, A. Chandrakasan, B. Nikolic]

CSE477 L25 Memory Peripheral.2Irwin&Vijay, PSU, 2003 Review: Read-Write Memories (RAMs)  Static – SRAM l data is stored as long as supply is applied l large cells (6 fets/cell) – so fewer bits/chip l fast – so used where speed is important (e.g., caches) l differential outputs (output BL and !BL) l use sense amps for performance l compatible with CMOS technology  Dynamic – DRAM l periodic refresh required (every 1 to 4 ms) to compensate for the charge loss caused by leakage l small cells (1 to 3 fets/cell) – so more bits/chip l slower – so used for main memories l single ended output (output BL only) l need sense amps for correct operation l not typically compatible with CMOS technology

CSE477 L25 Memory Peripheral.3Irwin&Vijay, PSU, 2003 Review: 2D Memory Bank Row Decoder Read Precharge Precharge Circuit Column Decoder Sense Amps Column Decoder A N-1 … A i A i-1 … A 0 Write Circuitry Sense Amps Write Circuitry data

CSE477 L25 Memory Peripheral.4Irwin&Vijay, PSU, 2003 Peripheral Memory Circuitry  Row and column decoders  Read bit line precharge logic  Sense amplifiers  Read/write circuitry  Timing and control  Speed  Power consumption  Area – pitch matching

CSE477 L25 Memory Peripheral.5Irwin&Vijay, PSU, 2003 Row Decoders  Collection of 2 M complex logic gates organized in a regular, dense fashion  (N)AND decoder for 8 address bits WL(0) = !A 7 & !A 6 & !A 5 & !A 4 & !A 3 & !A 2 & !A 1 & !A 0 … WL(255) = A 7 & A 6 & A 5 & A 4 & A 3 & A 2 & A 1 & A 0  NOR decoder for 8 address bits WL(0) = !(A 7 | A 6 | A 5 | A 4 | A 3 | A 2 | A 1 | A 0 ) … WL(255) = !(!A 7 | !A 6 | !A 5 | !A 4 | !A 3 | !A 2 | !A 1 | !A 0 )  Goals: Pitch matched, fast, low power

CSE477 L25 Memory Peripheral.6Irwin&Vijay, PSU, 2003 Implementing a Wide NOR Function  Single stage 8x256 bit decoder (as in Lecture 22) l One 8 input NOR gate per row x 256 rows = 256 x (8+8) = 4,096 l Pitch match and speed/power issues  Decompose logic into multiple levels !WL(0) = !(!(A 7 | A 6 ) & !(A 5 | A 4 ) & !(A 3 | A 2 ) & !(A 1 | A 0 )) l First level is the predecoder (for each pair of address bits, form A i |A i-1, A i |!A i-1, !A i |A i-1, and !A i |!A i-1 ) l Second level is the word line driver  Predecoders reduce the number of transistors required l Four sets of four 2-bit NOR predecoders = 4 x 4 x (2+2) = 64 l 256 word line drivers, each a four input NAND – 256 x (4+4) = 2,048 -4,096 vs 2,112 = almost a 50% savings  Number of inputs to the gates driving the WLs is halved, so the propagation delay is reduced by a factor of ~4

CSE477 L25 Memory Peripheral.7Irwin&Vijay, PSU, 2003 Split Row Two-Level 8x256 Decoder !(!A 0 & !A 1 & !A 2 )... !(A 0 & A 1 & A 2 ) !(!(!A 0 &!A 1 &!A 2 ) | !(!A 3 &!A 4 &!A 5 ) | !(!A 6 &!A 7 )) *256 WL 255 WL 0 *256 WL 255 WL 0 Address *8 *4  Pitch matched  Buffered word line drivers

CSE477 L25 Memory Peripheral.8Irwin&Vijay, PSU, 2003 Pass Transistor Based Column Decoder BL 3 BL 2 BL 1 BL 0 data_out 2 input NOR decoder A1A1 A0A0 S3S3 S2S2 S1S1 S0S0  Read: connect BLs to the Sense Amps (SA) Writes: drive one of the BLs low to write a 0 into the cell l Fast since there is only one transistor in the signal path. However, there is a large transistor count ( (K+1)2 K + 2 x 2 K ) l For K = 2  3 x 2 2 (decoder) + 2 x 2 2 (PTs) = = 20 !BL 3 !BL 2 !BL 1 !BL 0 !data_out

CSE477 L25 Memory Peripheral.9Irwin&Vijay, PSU, 2003 Tree Based Column Decoder BL 3 BL 2 BL 1 BL 0 A0A0 !A 0 A1A1 !A 1 data_out  Number of transistors reduced to (2 x 2 x (2 K -1)) for K = 2  2 x 2 x (2 2 – 1) = 4 x 3 = 12  Delay increases quadratically with the number of sections (K) (so prohibitive for large decoders) can fix with buffers, progressive sizing, combination of tree and pass transistor approaches !BL 3 !BL 2 !BL 1 !BL 0 !data_out

CSE477 L25 Memory Peripheral.10Irwin&Vijay, PSU, 2003 Decoder Complexity Comparisons  Consider a memory with 10b address and 8b data Conf.Data/RowRow DecoderColumn Decoder 1D8b10b = a 10x2 10 decoder Single stage = 20,480 Two stage = 10,320 2D32b (32x256 core) 8b = 8x2 8 decoder Single stage = 4,096 T Two stage = 2,112 T 2b = 2x2 2 decoder PT = 76 T Tree = 96 T 2D64b (64x128 core) 7b = 7x2 7 decoder Single stage = 1,792 T Two stage = 1,072 T 3b = 3x2 3 decoder PT = 160 T Tree = 224 T 2D128b (128x64 core) 6b = 6x2 6 decoder Single stage = 768 T Two stage = 432 T 4b = 4x2 4 decoder PT = 336 T Tree = 480 T

CSE477 L25 Memory Peripheral.11Irwin&Vijay, PSU, 2003 Bit Line Precharge Logic equalization transistor - speeds up equalization of the two bit lines by allowing the capacitance and pull-up device of the nondischarged bit line to assist in precharging the discharged line !PC !BLBL  First step of a Read cycle is to precharge (PC) the bit lines to V DD l every differential signal in the memory must be equalized to the same voltage level before Read  Turn off PC and enable the WL l the grounded PMOS load limits the bit line swing (speeding up the next precharge cycle)

CSE477 L25 Memory Peripheral.12Irwin&Vijay, PSU, 2003 Sense Amplifiers  Amplification – resolves data with small bit line swings (in some DRAMs required for proper functionality)  Delay reduction – compensates for the limited drive capability of the memory cell to accelerate BL transition SA input output t p = ( C *  V ) / I av large small make  V as small as possible  Power reduction – eliminates a large part of the power dissipation due to charging and discharging bit lines  Signal restoration – for DRAMs, need to drive the bit lines full swing after sensing (read) to do data refresh

CSE477 L25 Memory Peripheral.13Irwin&Vijay, PSU, 2003 Classes of Sense Amplifiers  Differential SA – takes small signal differential inputs (BL and !BL) and amplifies them to a large signal single- ended output l common-mode rejection – rejects noise that is equally injected to both inputs  Only suitable for SRAMs (with BL and !BL)  Types l Current mirroring l Two-stage l Latch based  Single-ended SA – needed for DRAMs

CSE477 L25 Memory Peripheral.14Irwin&Vijay, PSU, 2003 Latch Based Sense Amplifier SE BL isolate  V = 0.1V DD  V = V DD !BL data_out!data_out bit line inputs sense amplifier outputs

CSE477 L25 Memory Peripheral.15Irwin&Vijay, PSU, 2003 Alpha Differential Amplifier/Latch SE data_out  V = V DD !data_out S3S3 S2S2 S1S1 S0S0 column decoder sense amplifier N2 N3 N4 N5 P1P2 P3P4 N1 mux_out sense PC !mux_out 0101 0101 off  on !sense

CSE477 L25 Memory Peripheral.16Irwin&Vijay, PSU, 2003 Read/Write Circuitry D: data (write) bus R: read bus W: write signal CS: column select (column decoder) Local W (write): BL = D, !BL = !D enabled by W & CS Local R (read): R = BL, !R = !BL enabled by !W & CS !BLBL SA D W !R R Precharge Local R/WCS

CSE477 L25 Memory Peripheral.17Irwin&Vijay, PSU, 2003 Approaches to Memory Timing DRAM Timing Multiplexed Addressing RAS CAS RAS-CAS timing Address Bus msb’slsb’s Row Addr. Column Addr. Address Bus SRAM Timing Self-Timed Address Address transition initiates memory operation

CSE477 L25 Memory Peripheral.18Irwin&Vijay, PSU, 2003 Reliability and Yield  Memories operate under low signal-to-noise conditions l word line to bit line coupling can vary substantially over the memory array -folded bit line architecture (routing BL and !BL next to each other ensures a closer match between parasitics and bit line capacitances) l interwire bit line to bit line coupling -transposed (or twisted) bit line architecture (turn the noise into a common-mode signal for the SA) l leakage (in DRAMs) requiring refresh operation  suffer from low yield due to high density and structural defects l increase yield by using error correction (e.g., parity bits) and redundancy  and are susceptible to soft errors due to alpha particles and cosmic rays

CSE477 L25 Memory Peripheral.19Irwin&Vijay, PSU, 2003 Redundancy in the Memory Structure Row address Column address Redundant row Redundant columns Fuse bank

CSE477 L25 Memory Peripheral.20Irwin&Vijay, PSU, 2003 Redundancy and Error Correction

CSE477 L25 Memory Peripheral.21Irwin&Vijay, PSU, 2003 Soft Errors  Nonrecurrent and nonpermanent errors from l alpha particles (from the packaging materials) l neutrons from cosmic rays  As feature size decreases, the charge stored at each node decreases (due to a lower node capacitance and lower V DD ) and thus Q critical (the charge necessary to cause a bit flip) decreases leading to an increase in the soft error rate (SER ) From Semico Research Corp. MTBF (hours).13  m.09  m Ground-based Civilian Avionics System Military Avionics System189 From Actel

CSE477 L25 Memory Peripheral.22Irwin&Vijay, PSU, 2003 Next Lecture and Reminders  Next lecture l Power consumption in datapaths and memories -Reading assignment – Rabaey, et al, 11.7; 12.5  Reminders l HW#5 due today l Project final reports due on-line by 5:00pm on Friday, December 5 th l Final grading negotiations/correction (except for the final exam) must be concluded by December 10 th l Final exam scheduled -Tuesday, December 16 th from 10:10 to noon in 118 and 113 Thomas