V-IRAM Register File Iakovos Mavroidis

Slides:



Advertisements
Similar presentations
1 The 2-to-4 decoder is a block which decodes the 2-bit binary inputs and produces four output All but one outputs are zero One output corresponding to.
Advertisements

PROGRAMMABLE PERIPHERAL INTERFACE -8255
Semiconductor Memory Design. Organization of Memory Systems Driven only from outside Data flow in and out A cell is accessed for reading by selecting.
Experiment 2 PIC Program Execution & Built-In Self-Test.
Digital Integrated Circuits A Design Perspective
“A Configurable Radiation Tolerant Dual-Ported Static RAM macro, designed in a 0.25 μm CMOS technology for applications in the LHC environment.” 8th Workshop.
Digital Integrated Circuits© Prentice Hall 1995 Memory SEMICONDUCTOR MEMORIES.
[M2] Traffic Control Group 2 Chun Han Chen Timothy Kwan Tom Bolds Shang Yi Lin Manager Randal Hong Wed. Oct. 27 Overall Project Objective : Dynamic Control.
S. Reda EN160 SP’07 8-bit MIPS Processor EN160 Class Project May 2007.
EECS 470 Cache and Memory Systems Lecture 14 Coverage: Chapter 5.
Memory Technology “Non-so-random” Access Technology:
Review: Basic Building Blocks  Datapath l Execution units -Adder, multiplier, divider, shifter, etc. l Register file and pipeline registers l Multiplexers,
Figure A flip-flop with an enable input. D Q Q Q R Clock E 0 1.
High Speed Cache For: PICo Board Proposal By: Team XOR NOTE TO FUTURE VIEWERS OF THESE SLIDES: ALL YELLOW TEXT BOXES ACCOMPANIED BY ARROWS IN THE DIRECT.
Cache Physical Implementation Panayiotis Charalambous Xi Research Group Panayiotis Charalambous Xi Research Group.
SRAM DESIGN PROJECT PHASE 2 Nirav Desai VLSI DESIGN 2: Prof. Kia Bazargan Dept. of ECE College of Science and Engineering University of Minnesota,
CSE477 L24 RAM Cores.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 24: RAM Cores Mary Jane Irwin ( )
ECE 300 Advanced VLSI Design Fall 2006 Lecture 19: Memories
Power-Aware RAM Processing for FPGAs December 9, 2005 Power-aware RAM Processing for FPGA Embedded Memory Blocks Russell Tessier University of Massachusetts.
CSE477 L23 Memories.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 23: Semiconductor Memories Mary Jane Irwin (
ARM MIPS.  32 registers, each 32-bit wide. 30 are general purpose, R30(Hi) and R31(Low) are reserved for the results of long multiplication (64-bit.
CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 22: Memery, ROM
1 ALU ports can get input from any of the two registers ALU can perform one of the eight operations Shift unit can perform one of the four possible types.
8254 Timer.
SRAM Design for SPEED GROUP 2 Billy Chantree Daniel Sosa Justin Ferrante.
Bit Cell Ratio Testing. Thin Cell Advantages: Smallest possible area of 6T Bit Cell, Can be mirrored (saves area = can reduce distance between n-wells.
The System Bus. Conceptual CPU Block Diagram Datapath Regs Buses ALU Control Unit Bus Interface IR etc. PC etc. Control Signals Status Signals PSR System.
Low Power SRAM VLSI Final Presentation Stephen Durant Ryan Kruba Matt Restivo Voravit Vorapitat.
1 To write any register, we need register address and a write signal A 3-bit write address is decoded if write signal is present One of the eight registers.
EE586 VLSI Design Partha Pande School of EECS Washington State University
STUDY OF PIC MICROCONTROLLERS.. Design Flow C CODE Hex File Assembly Code Compiler Assembler Chip Programming.
Homework Reading Tokheim, Chapter 12-1 through 12-4.
Class Exercise 1B.
COMP211 Computer Logic Design
Lecture 19: SRAM.
Low Power Very Fast Dynamic Logic Circuits
מבנה מחשב תרגול 3.
Low-Power SRAM Using 0.6 um Technology
Mary Jane Irwin ( ) CSE477 VLSI Digital Circuits Fall 2002 Lecture 25: Peripheral Memory Circuits Mary Jane.
Each I/O pin may be configured as either input or output.
MOS Memory and Storage Circuits
Dr. Michael Nasief Lecture 2
SEQUENTIAL LOGIC -II.
Chapter 11 Sequential Circuits.
Latches and Flip-flops
Introduction to DIGITAL CIRCUITS MODELING & VERIFICATION using VERILOG [Part-I]
Interfacing Memory Interfacing.
Figure 13.1 MIPS Single Clock Cycle Implementation.
Instructor: Alexander Stoytchev
Mary Jane Irwin ( ) CSE477 VLSI Digital Circuits Fall 2002 Lecture 22: Shifters, Decoders, Muxes Mary Jane.
Lecture 17: Case Studies Topics: case studies for virtual memory and cache hierarchies (Sections )
Digital Integrated Circuits A Design Perspective
Programmable Electrically Erasable Logic Devices (PEEL)
Memory.
Design of Combinational Logic
8085 Microprocessor Architecture
Implementation Technology
April 3 Fun with MUXes Implementing arbitrary logical functions
Enhancing Data Path M 4-bit Reg X A L U
A register design with parallel load input
8051 Micro Controller.
Chapter 8 MOS Memory and Storage Circuits
Chapter 4 Introduction to Computer Organization
Sequential Logic.
Team Awesome += 5 PICo Design Presentation
8085 Microprocessor Architecture
Levels in Processor Design
Day 26: November 10, 2010 Memory Periphery
DIICD Class 13 Memories.
Implementing Logic Gates and Circuits
Presentation transcript:

V-IRAM Register File Iakovos Mavroidis iakovos@cs.berkeley.edu January 9, 2001

Block Diagram eight select signals Bank0 eight external external four read addresses 64-bit word eight external read addresses external to internal mapping 64-bit word 4x7 128 8x7 four read addresses four data-out buses 4x7 internal to external mapping eight external data-out buses 4x64 8x64 three external write addresses Bank1 3x7 four data-out buses 64-bit word 64-bit word 4x64 128 three data-in buses 4-bit mask 3x64 256 64-bit words with 8-read, 3-write ports Area constraints => Split RF to two 128 64-bit words banks. clock at 200MHz

Write operation D E C MEM ARRAY address write WL enable data-in mask 7 enable 128 1 16 16 16 16 data-in 64 mask 4 phi enable/address decoder prech eval prech eval prech eval prech eval prech eval prech mask/datain mem write RF write RF write RF write RF write RF write RF write RF write RF Two cycle pipelined write

Read operation select D E C MEM ARRAY read WL address external to 3 D E C MEM ARRAY read WL address external to internal mapping 7 128 enable 1 64 read BL decoded select internal to external mapping Data-out 8 64 phi enable/address/sel map dec prech map dec prech map dec prech map dec prech map dec prech map dec ext2int map and decode data-out and int2ext map prech rd map prech rd map prech rd map prech rd map prech rd map prech Throughput = 1 cycle, Latency = 1 cycle

Write through phi WRITE en/address decoder prech eval prech eval prech mask/datain RF write RF write RF write RF write RF write RF write RF write RF write RF READ enable/address/map map dec prech map dec prech map dec prech map dec prech map dec prech map dec mapping and decode data-out and mapping prech rd map prech rd map prech rd map prech rd map prech rd map prech

Floorplan output regs input regs MEM ARRAY 0 8-bit input decoders regs clock drivers 0.78m ext 2int map prech read bit lines regs routing regs int2ext mapping 0.26m 1.92m prech read bit lines MEM ARRAY 1 8-bit input decoders regs 0.78m input regs 0.05m output regs 0.1m 0.11m 0.19m 0.61m 1.02m

Mapping of external to internal read ports Eight 3-bit decoders 3 1 3 2 3 read port mapping 3 3 4 3 5 3 6 3 7 3 decoded select signals 8 8 8 8 8 8 8 8 8 external read addresses 1 8 2 8 3 8 4 8 5 8 6 8 7 8 1 2 3 4 5 6 7 internal read addresses

} ... ... ... Dynamic NAND decoder read WL 127 rwline126 rwline2 VDD VDD read WL 127 VDD VDD rwline126 ... ... ... rwline2 VDD VDD rwline1 VDD VDD rwline0 gnd phi phi CMOS 2-bit decoder CMOS 2-bit decoder CMOS 3-bit decoder } Predecoders enable(prech to gnd) addr[6:5] addr[4:3] addr[2:0]

Decoder Interleaving precharge to VDD PMOS PMOS NMOS NMOS G F E D C B 4bit dec 4bit dec 8bit dec 4bit dec 4bit dec 8bit dec 4bit dec 4bit dec 8bit dec 4bit dec 4bit dec 8bit dec 4bit dec 4bit dec 8bit dec 4bit dec 4bit dec 8bit dec 4bit dec 4bit dec 8bit dec

Cell : 4 read/3 write ports wwl0 wwl1 wwl2 rwl0 rwl1 rwl2 rwl3 rbl2 rbl3 wbl0 wbl1 wbl2 wbl2_ wbl1_ wbl0_ rbl1 rbl0

Precharging of read bit lines precharge to VM1 (inverter thershold) + s BL0 BL1 VM2 < VM1 data-out VM1 phi vdd

Mapping of internal data-outs to external data-outs BANK0 port 0 port 1 port 2 port 3 read BL’s 64 64 64 64 64 data-out 0 8 decoded select signals from external to internal mapping 64 data-out 1 8 64 data-out 2 8 64 data-out 3 8 64 data-out 4 8 64 data-out 5 8 64 data-out 6 8 64 data-out 7 8 read BL’s 64 64 64 64 port 0 port 1 port 2 port 3 BANK1

Backup slides

Registers Posedge trigger FF with output prech to gnd output prech to vdd phi phi phi out in phi out in phi phi Negedge trigger FF with output prech to gnd Negedge trigger FF with output prech to vdd phi phi phi out in phi out in phi phi

Buffers 128bit regs decoders BANK0 ext 2int map prech rblines routing int2ext mapping BANK1

Spice Simulations - read at 1.1 volts

Spice Simulations - write at 1.1 volts

Spice Simulations - read at 1.3 volts

Spice Simulations - write at 1.3 volts