Jennifer Winikus Computer Engineering Seminar Michigan Technological University February 10,2011 2/10/2011J Winikus EE5900 1.

Slides:



Advertisements
Similar presentations
Barcelona Forum on Ph.D. Research in Communications, Electronics and Signal Processing 21st October 2010 Soft Errors Hardening Techniques in Nanometer.
Advertisements

COEN 180 Flash Memory.
Subthreshold SRAM Designs for Cryptography Security Computations Adnan Gutub The Second International Conference on Software Engineering and Computer Systems.
Computer Organization and Architecture
Semiconductor Memory Design. Organization of Memory Systems Driven only from outside Data flow in and out A cell is accessed for reading by selecting.
University of Michigan Electrical Engineering and Computer Science University of Michigan Electrical Engineering and Computer Science University of Michigan.
+ CS 325: CS Hardware and Software Organization and Architecture Internal Memory.
Robust Low Power VLSI R obust L ow P ower VLSI Sub-threshold Sense Amplifier (SA) Compensation Using Auto-zeroing Circuitry 01/21/2014 Peter Beshay Department.
5-1 Memory System. Logical Memory Map. Each location size is one byte (Byte Addressable) Logical Memory Map. Each location size is one byte (Byte Addressable)
Managing Static (Leakage) Power S. Kaxiras, M Martonosi, “Computer Architecture Techniques for Power Effecience”, Chapter 5.
COEN 180 DRAM. Dynamic Random Access Memory Dynamic: Periodically refresh information in a bit cell. Else it is lost. Small footprint: transistor + capacitor.
2007 MURI Review The Effect of Voltage Fluctuations on the Single Event Transient Response of Deep Submicron Digital Circuits Matthew J. Gadlage 1,2, Ronald.
0 1 Width-dependent Statistical Leakage Modeling for Random Dopant Induced Threshold Voltage Shift Jie Gu, Sachin Sapatnekar, Chris Kim Department of Electrical.
Introduction to CMOS VLSI Design Lecture 13: SRAM
University of Michigan Electrical Engineering and Computer Science University of Michigan Electrical Engineering and Computer Science August 20, 2009 Enabling.
Chuanjun Zhang, UC Riverside 1 Low Static-Power Frequent-Value Data Caches Chuanjun Zhang*, Jun Yang, and Frank Vahid** *Dept. of Electrical Engineering.
11/29/2004EE 42 fall 2004 lecture 371 Lecture #37: Memory Last lecture: –Transmission line equations –Reflections and termination –High frequency measurements.
Super-Drowsy Caches Single-V DD and Single-V T Super-Drowsy Techniques for Low- Leakage High-Performance Instruction Caches Nam Sung Kim, Krisztián Flautner,
Introduction to CMOS VLSI Design SRAM/DRAM
Overview Memory definitions Random Access Memory (RAM)
1 Razor: A Low Power Processor Design Presented By: - Murali Dharan.
Lecture 13, Slide 1EECS40, Fall 2004Prof. White Lecture #13 Announcements You should now purchase the reader EECS 40: Introduction to Microelectronics,
Architectural Power Management for High Leakage Technologies Department of Electrical and Computer Engineering Auburn University, Auburn, AL /15/2011.
Lecture 19: SRAM.
Lecture 7: Power.
Parts from Lecture 9: SRAM Parts from
Integrated Regulation for Energy- Efficient Digital Circuits Elad Alon 1 and Mark Horowitz 2 1 UC Berkeley 2 Stanford University.
Physical Memory and Physical Addressing By: Preeti Mudda Prof: Dr. Sin-Min Lee CS147 Computer Organization and Architecture.
12/1/2004EE 42 fall 2004 lecture 381 Lecture #38: Memory (2) Last lecture: –Memory Architecture –Static Ram This lecture –Dynamic Ram –E 2 memory.
ECE 342 Electronic Circuits 2. MOS Transistors
“ Near-Threshold Computing: Reclaiming Moore’s Law Through Energy Efficient Integrated Circuits ” By Ronald G. Dreslinski, Michael Wieckowski, David Blaauw,
Drowsy Caches: Simple Techniques for Reducing Leakage Power Authors: ARM Ltd Krisztián Flautner, Advanced Computer Architecture Lab, The University of.
הפקולטה למדעי ההנדסה Faculty of Engineering Sciences.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 12.1 EE4800 CMOS Digital IC Design & Analysis Lecture 12 SRAM Zhuo Feng.
CPEN Digital System Design
Advanced VLSI Design Unit 06: SRAM
CSE477 L24 RAM Cores.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 24: RAM Cores Mary Jane Irwin ( )
A 256kb Sub-threshold SRAM in 65nm CMOS
XIAOYU HU AANCHAL GUPTA Multi Threshold Technique for High Speed and Low Power Consumption CMOS Circuits.
1 Bus Encoding for Total Power Reduction Using a Leakage-Aware Buffer Configuration 班級:積體所碩一 學生:林欣緯 指導教授:魏凱城 老師 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION.
Basics of Energy & Power Dissipation
Physical Memory and Physical Addressing By Alex Ames.
11. 9/15 2 Figure A 2 M+N -bit memory chip organized as an array of 2 M rows  2 N columns. Memory SRAM organization organized as an array of 2.
Bi-CMOS Prakash B.
Low-Power BIST (Built-In Self Test) Overview 10/31/2014
Copyright © 2010 Houman Homayoun Houman Homayoun National Science Foundation Computing Innovation Fellow Department of Computer Science University of California.
Patricia Gonzalez Divya Akella VLSI Class Project.
EE201C : Stochastic Modeling of FinFET LER and Circuits Optimization based on Stochastic Modeling Shaodi Wang
EE 466/586 VLSI Design Partha Pande School of EECS Washington State University
Tae- Hyoung Kim, Hanyong Eom, John Keane Presented by Mandeep Singh
EE415 VLSI Design THE INVERTER [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
Seok-jae, Lee VLSI Signal Processing Lab. Korea University
07/11/2005 Register File Design and Memory Design Presentation E CSE : Introduction to Computer Architecture Slides by Gojko Babić.
Lecture 3. Lateches, Flip Flops, and Memory
Digital Integrated Circuits A Design Perspective
Floating-Gate Devices / Circuits
Alireza Shafaei, Shuang Chen, Yanzhi Wang, and Massoud Pedram
LOW POWER DESIGN METHODS V.ANANDI ASST.PROF,E&C MSRIT,BANGALORE.
VLSI Design MOSFET Scaling and CMOS Latch Up
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Analytical Delay and Variation Modeling for Subthreshold Circuits
Analytical Delay and Variation Modeling for Subthreshold Circuits
William Stallings Computer Organization and Architecture 7th Edition
Introduction to Computing
Electronics for Physicists
2.C Memory GCSE Computing Langley Park School for Boys.
Characterization of C2MOS Flip-Flop in Sub-Threshold Region
Presentation transcript:

Jennifer Winikus Computer Engineering Seminar Michigan Technological University February 10,2011 2/10/2011J Winikus EE5900 1

Presentation based on : 2/10/2011J Winikus EE Gregory Chen, Dennis Sylvester, David Blaauw, and Trevor Mudge IEEE Transactions on Very Large Scale Integration(VLSI) Systems, Vol. 18 No. 11, November 2010

 SRAM Performance Background  SRAM 6T and 8T designs  Experiment Methodology  Experimental Results  Conclusions 2/10/2011J Winikus EE5900 3

 Reduction in energy consumption is a high priority objective for electronics  RAM cells are optimized by utilizing transistors instead of flip flops to improve density  Transistors have increase in “failure” as V DD is minimized near and below V TH  Simulations of 8T and 6T SRAM at various sizes, V DD, V TH performed to analyze for future applications 2/10/2011J Winikus EE5900 4

 RAM is the acronym for Random Access Memory  Utilized by computers and digital devices as a temporary memory storage to store data as it is processed  RAM and Processor speeds are the main components in the speed of a computer except for saving functions that the hard drive capabilities factor in 2/10/2011J Winikus EE5900 5

 Acronym for Static Random Access Memory  Denser then flip flops  Faster then DRAM  Holds the data as long as power is applied  Form arrays on the RAM chips 2/10/2011J Winikus EE5900 6

 Three power regions exist for Transistors: triode, saturation and cut-off  Near Threshold region is classified as between 400 and 700 mV 2/10/2011J Winikus EE Power Regions in MOSFET

 The shifting of V TH for each transistor independently  Reduces Static Noise Margin  Causes mismatches within cells  More predominate influence on system at smaller V DD  Robustness is reduced with Random Dopant Fluctuation 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 Researchers at MIT have established that in general the relationship between total energy, energy leakage and Voltage in terms of the energy per ALU cycle  ALU cycles are computer architecture’s actions  ALU’s include read and write 2/10/2011J Winikus EE Advances in Ultra-Low-Voltage Design Joyce Kwong, Anantha P. Chandrakasan, Massachusetts Institute of Technology

 Designs are based around desired read and write speeds and stability  Cross coupled invertors are responsible for holding the state 2/10/2011J Winikus EE

 The voltage threshold in the MOSFET in which below the threshold the transistor is essentially off, only leakage current passes between the drain and source 2/10/2011J Winikus EE

 Read is performed by prechargin the cell and floating the bitline  Write is performed by driving opposite values to the bitline overriding the wordline 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 Read Upset  The voltage applied to the node holding the zero value is flipped due to noise  Dependent on the pass gate strength  Write  Timing  Hold  Static Noise Margin over powers the cross- coupled invertors causing the state being stored to flip 2/10/2011J Winikus EE

 Useful for independent read-write ports, done by having adding two additional stacked negative channel FETS to the 6 cell to isolate the read and write ports from each other 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 The addition of a separate Read Bitline eliminates the read failure that occurs in the 6T cell  Benefit of having the separate ports is that the remaining 6 transistors can be sized and doped to get the desired write stability  Failure mechanisms are otherwise the same as the 6T 2/10/2011J Winikus EE

 An alternative to sizing modification  The premise is the modification of voltage thresholds for behaviors such that failures are prevented  In a duel port structure the word line voltage can be reduced when read process is performed  The implementation is done through addition of diodes  More complex drivers and decoding is needed with this methodology 2/10/2011J Winikus EE

 Modification of the threshold voltage of the transistors for the design specific goals  In application the tuning of MOSFETs are done by exciting dopants in the drain and source with laser pulses to achieve desired threshold voltages 2/10/2011J Winikus EE

 8T SRAM is typically 33% larger then a 6T cell  V TH is optimized separately for the SRAM and the Logic to improve robustness and performance  Sizing alterations can be minimized by utilizing Assist Circuits 2/10/2011J Winikus EE

 The experiment methodology is an incremental conditioning structure  Only Word Line Drivers, Bit Line Drivers and bit cells are considered 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near- Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 Monte Carlo  To account for high yeild systems like 8-kb SRAM with a 99% yeild, the failure rate is on the order of 10 -7, which requires atleast 10 million simulations  SRAM with caches larger then 8-kb have smaller failure rates in which more simulations would be needed  Definition: Monte Carlo is the art of approximating an expectation by the sample mean of a function of simulated random variables(UC Berkley)  Importance Sampling  A unique probability density function is chosen for each transistor  Choosing a relative range to choose the samples from 2/10/2011J Winikus EE

 Based on the percent error seen over many iterations, using the importance sampling 20,000 samples is sufficient for accurate results  Monte Carlo sampling would require samples for accurate results 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 The selected threshold voltages are chosen by choosing random values from within the sample probability distribution in which the normal distribution is shifted to induce a higher probability of failure  Selectively choosing the region speeds up analysis by eliminating samples from regions that are not of interest of the experiment 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 Measured using Static Noise Margin or Corner Cases  Determines if optimization is complete 2/10/2011J Winikus EE

2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 Activity Factor is the average fraction of bit cells accessed per cycle  V min is the voltage in which the energy needed per operation is minimized 2/10/2011J Winikus EE

 Robustness is controlled to be constant through the modification of the size of the Bit Cells.  This allows for V TH tuning 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 L1- small in size, high in activity, total energy is mostly dynamic  L2- larger in size then L1, lower activity 2/10/2011J Winikus EE

2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010): The energy reduction capability by tuning is most significantly seen in the 8T, a 61% reduction, including sizing as well results in an 83% energy reduction for the 8T cell

 The activity behavior of the circuit is a substantial controlling factor in V min and E min 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 The relationship produced between activity factor and V min demonstrates that the less activity the higher the V min 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 Of the components that assist circuit has been applied the only is beneficial in the sub threshold region for the overdriven word line 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 Performance is compromised with voltage reduction, but is minimized with the use of assists 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 The parabolic response of the energy cost per bit access for the write line displays the correspondence with the V min as below the approximate value the energy increases again  Below about 650 mV assist drooping is no longer considered to assist the circuit  Leakage increases in a near exponential behavior below V min 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 Assist circuit design benefits in this result for minimum voltage capabilities  Energy cost increases with activity 2/10/2011J Winikus EE Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):

 With low failure rates, Importance Sampling is more efficient then Monte Carlo Sampling  Scaling to 300mV for 8T cells achieves 83% energy reduction  In effort to reduce power and maintain function-ability size is the compromise or through the addition of assist circuit  Assist modification of certain functional components is beneficial in failure prevention as voltage decreases 2/10/2011J Winikus EE

 In the computer based 21 st Century there is the drive for faster computing, smaller devices and less power consumption  The results display that at this point tuning threshold voltages is the most promising advancement to forward the consumer desires  The results express the trade off that exists, size or power, speed or power 2/10/2011J Winikus EE

 Chen, Gregory, et al. "Yield-Driven Near-Threshold SRAM Design." IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2010):  R.P. Lu, A.D. Ramirez, B.W. Offord, and S.D. Russell of SPAWAR Systems Center Pacific for the Office of Naval Research.“Threshold Voltage Tuning of Metal-Gate MOSFETs Using an Excimer Laser”.  Advances in Ultra-Low-Voltage Design.Joyce Kwong, Anantha P. Chandrakasan, Massachusetts Institute of Technology bac26c8/index.jsp?&pName=sscs_level1_article&path=sscs/08Fall&f ile=Kwong.xml&xsl=article.xsl bac26c8/index.jsp?&pName=sscs_level1_article&path=sscs/08Fall&f ile=Kwong.xml&xsl=article.xsl  Eric C. Anderson, 1999, UC Berkeley. “Lecture Notes for Stat 578C” tes.pdf 2/10/2011J Winikus EE

2/10/2011J Winikus EE