1 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.Copyright © 2011 SuVolta, Inc. All rights reserved. 1 Device Considerations.

Slides:



Advertisements
Similar presentations
18 July 2001 Work In Progress – Not for Publication 2001 ITRS Test Chapter ITRS Test ITWG Mike Rodgers Don Edenfeld.
Advertisements

ITRS December 2003, Hsin-Chu Taiwan How Much Variability Can Designers Tolerate? Andrew B. Kahng ITRS Design ITWG December 1, 2003.
Chapter 10 Digital CMOS Logic Circuits
Topics Electrical properties of static combinational gates:
Introduction to the TRAMS project objectives and results in Y1 Antonio Rubio, Ramon Canal UPC, Project coordinator CASTNESS’11 WORKSHOP ON TERACOMP FET.
Tunable Sensors for Process-Aware Voltage Scaling
Robust Low Power VLSI R obust L ow P ower VLSI Sub-threshold Sense Amplifier (SA) Compensation Using Auto-zeroing Circuitry 01/21/2014 Peter Beshay Department.
Slide 1 Bayesian Model Fusion: Large-Scale Performance Modeling of Analog and Mixed- Signal Circuits by Reusing Early-Stage Data Fa Wang*, Wangyang Zhang*,
1 Cleared for Open Publication July 30, S-2144 P148/MAPLD 2004 Rea MAPLD 148:"Is Scaling the Correct Approach for Radiation Hardened Conversions.
Keeping Hot Chips Cool Ruchir Puri, Leon Stok, Subhrajit Bhattacharya IBM T.J. Watson Research Center Yorktown Heights, NY Circuits R-US.
Introduction to CMOS VLSI Design Lecture 21: Scaling and Economics
0 1 Width-dependent Statistical Leakage Modeling for Random Dopant Induced Threshold Voltage Shift Jie Gu, Sachin Sapatnekar, Chris Kim Department of Electrical.
The Cost of Fixing Hold Time Violations in Sub-threshold Circuits Yanqing Zhang, Benton Calhoun University of Virginia Motivation and Background Power.
Lateral Asymmetric Channel (LAC) Transistors
SRAM Mohammad Sharifkhani. Effect of Mismatch.
EE/MAtE1671 Front-End-Of-Line Variability Considerations EE/MatE 167 David Wahlgren Parent.
EE/MAtE1671 Process Variability EE/MatE 167 David Wahlgren Parent.
1 A Variation-tolerant Sub- threshold Design Approach Nikhil Jayakumar Sunil P. Khatri. Texas A&M University, College Station, TX.
Institute of Digital and Computer Systems 1 Fabio Garzia / Finding Peak Performance in a Process23/06/2015 Chapter 5 Finding Peak Performance in a Process.
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
PED Roadmapping Issues Vijaykrishnan Narayanan Dept. of CSE Penn State University GSRC Workshop, March 20-21, 2003.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
1 adaptive body bias for reducing process variations nuno alves 19 / october / 2006.
Lecture 7: Power.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
1 Computing with Leakage Currents Nikhil Jayakumar, Kanupriya Gulati, Rajesh Garg and Sunil P. Khatri ECE Department Texas A&M University.
Integrated Regulation for Energy- Efficient Digital Circuits Elad Alon 1 and Mark Horowitz 2 1 UC Berkeley 2 Stanford University.
© Digital Integrated Circuits 2nd Devices VLSI Devices  Intuitive understanding of device operation  Fundamental analytic models  Manual Models  Spice.
The new E-port interface circuits Filip Tavernier CERN.
Advanced Computing and Information Systems laboratory Device Variability Impact on Logic Gate Failure Rates Erin Taylor and José Fortes Department of Electrical.
Logic Optimization Mohammad Sharifkhani. Reading Textbook II, Chapters 5 and 6 (parts related to power and speed.) Following Papers: –Nose, Sakurai, 2000.
EE466: VLSI Design Power Dissipation. Outline Motivation to estimate power dissipation Sources of power dissipation Dynamic power dissipation Static power.
EE414 VLSI Design Design Metrics in Design Metrics in VLSI Design [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Chalmers University of Technology FlexSoC Seminar Series – Page 1 Power Estimation FlexSoc Seminar Series – Daniel Eckerbert
Fast & Furious: Taming the Challenges of Advanced-Node Design Anirudh Devgan, Senior Vice President, Digital & Signoff Group.
Lecture 03: Fundamentals of Computer Design - Trends and Performance Kai Bu
A 30-GS/sec Track and Hold Amplifier in 0.13-µm CMOS Technology
Introduction to FinFet
MICAS Department of Electrical Engineering (ESAT) Design-In for EMC on digital circuit October 27th, 2005 AID–EMC: Low Emission Digital Circuit Design.
Interconnect Focus Center e¯e¯ e¯e¯ e¯e¯ e¯e¯ IWSM 2001Sam, Chandrakasan, and Boning – MIT Variation Issues in On-Chip Optical Clock Distribution S. L.
Low Power – High Speed MCML Circuits (II)
XIAOYU HU AANCHAL GUPTA Multi Threshold Technique for High Speed and Low Power Consumption CMOS Circuits.
Process Variation Mohammad Sharifkhani. Reading Textbook, Chapter 6 A paper in the reference.
Leakage reduction techniques Three major leakage current components 1. Gate leakage ; ~ Vdd 4 2. Subthreshold ; ~ Vdd 3 3. P/N junction.
W E L C O M E. T R I G A T E T R A N S I S T O R.
Trends in IC technology and design J. Christiansen CERN - EP/MIC
64 bit Kogge-Stone Adders in different logic styles – A study Rob McNish Satyanand Nalam.
Patricia Gonzalez Divya Akella VLSI Class Project.
FPGA-Based System Design: Chapter 2 Copyright  2004 Prentice Hall PTR Topics n Logic gate delay. n Logic gate power consumption. n Driving large loads.
EE201C : Stochastic Modeling of FinFET LER and Circuits Optimization based on Stochastic Modeling Shaodi Wang
An Improved “Soft” eFPGA Design and Implementation Strategy
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
Tae- Hyoung Kim, Hanyong Eom, John Keane Presented by Mandeep Singh
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
Seok-jae, Lee VLSI Signal Processing Lab. Korea University
UTB SOI for LER/RDF EECS Min Hee Cho. Outline  Introduction  LER (Line Edge Roughness)  RDF (Random Dopant Fluctuation)  Variation  Solution – UTB.
1 Dual-V cc SRAM Class presentation for Advanced VLSIPresenter:A.Sammak Adopted from: M. Khellah,A 4.2GHz 0.3mm 2 256kb Dual-V CC SRAM Building Block in.
FaridehShiran Department of Electronics Carleton University, Ottawa, ON, Canada SmartReflex Power and Performance Management Technologies.
CS203 – Advanced Computer Architecture
Alireza Shafaei, Shuang Chen, Yanzhi Wang, and Massoud Pedram
20-NM CMOS DESIGN.
Device Structure & Simulation
VLSI Design MOSFET Scaling and CMOS Latch Up
INTRODUCTION: MD. SHAFIQUL ISLAM ROLL: REGI:
Challenges in Nanoelectronics: Process Variability
Overview of VLSI 魏凱城 彰化師範大學資工系.
Circuit Design Techniques for Low Power DSPs
Impact of Parameter Variations on Multi-core chips
Post-Silicon Calibration for Large-Volume Products
Beyond Si MOSFETs Part IV.
Presentation transcript:

1 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.Copyright © 2011 SuVolta, Inc. All rights reserved. 1 Device Considerations for Low Power VLSI Circuits EDS Silicon Valley Chapter David Kidd Senior Director of Digital Design

2 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Transistor Variability Limits Chips  Impact on Mobile System on Chip (SOC)  Limited Low Power Design Techniques  Where does Variability come from?  New Transistor Alternatives to Reduce Variability  Deeply Depleted Channel (DDC) technology  Silicon Impact  Outlook  Taking advantage of Deeply Depleted Channel (DDC) technology in Mobile SOC Overview

3 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Multiple blocks with different performance requirements  Integrated on the same die  Different power modes – would like to run at different supplies  Multiple V T transistors used to control leakage  Single chip solution requires analog integration  Need co-design of architecture, circuits and transistor technology for best solution What is needed in Mobile System on Chip?

4 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Limited benefit using voltage scaling (DVFS)  Cannot overdrive much due to reliability and power restrictions  Dynamically lowering voltage limited to mV  Only lowering frequency leaves large leakage power  “Run to hold” beats DVFS despite overhead  Finicky SRAM memories  High SRAM V MIN leaves no room for memory voltage scaling  Many circuit tricks to improve V MIN and noise margins  Design teams moved to dedicated power rail for SRAM  Works for CPU – difficult in GPU  Impacts power network integrity – more fluctuations  Transistor variability limits chips Variability Limits Design & Architecture

5 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Global/Systematic/Manufacturing Variation  Shifts all the transistors similarly  Longer/shorter transistor lengths  More (or less) implant energy and dose  Will result in speed/power distribution  Local/Random Variation  Transistor next to each other vary widely  Small number of dopants in transistor channel  Random Dopant Fluctuation (RDF)  Apparent in threshold voltage mismatch (σV T )  Impacts speed, leakage, SRAM & Analog  Industry solution: Remove RDF using Undoped Channel  What is the right silicon roadmap going forward? Transistor Variation Source of Chip Variation [#Transistors] Useable Yield Too slowToo hot

6 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  FinFET or TriGate  Promises high drive current  Manufacturing, cost, and IP challenge  Doped channel to enable multi V T  FDSOI  Showing off undoped channel benefits  Good body effect, but lack of multi V T capability  Restricted supply chain  DDC – Deeply Depleted Channel transistor  Straight forward insertion into Bulk Planar CMOS  Undoped channel to reduce random variability  Good body effect and multi V T transistors Transistor Alternatives Source: IMEC Source: Fujitsu Textbook FinFET Source: GSS, Chipworks Intel TriGate

7 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Undoped or very lightly doped region  Significantly reduced transistor random variability  V T  Lower leakage  Better SRAM (I READ, lower V min & V ret )  Tighter corners  Smaller area analog design  Higher channel mobility (increased I eff, lower DIBL)  Higher speed, improved voltage scaling  V T setting offset region  Enables multiple threshold voltages  Screening region  Strong body coefficient  Bias bodies to tighten manufacturing distribution  Body biasing to compensate for temperature and aging Deeply Depleted Channel™ (DDC) Transistor Benefits similar to FinFET in planar bulk CMOS *Example implementation

8 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Wells and VT Poly Spacers and LDDs Salicide Gate Metal Layers Blanket Epi STI Poly Spacers and LDDs Salicide Gate Metal Layers Wells and VT Foundry Standard Flow SuVolta Flow (example) New  No new materials / No new tools STI Bulk Planar Foundry-Compatible Process

9 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Presented at IEDM 2011 TEM of DDC Transistor and STI 43.1nm S D

10 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved. Lower Transistor Variability Reduces Leakage  Transistor variability is reflected in threshold voltage (V T ) distribution  Leakage current is exponentially dependent on V T  Lower V T variability (  V T ) reduces number of leaky low V T devices  Power dissipation is dominated by low V T edge of distribution  Smaller  V T  Less leakage power for digital and memory/SRAM High leakage tail High leakage tail dominates power 2.7x higher power (Model using 85mV subV T slope) High V T tail Slows down ICs [#Transistors] [Leakage Power] 65nm Silicon SRAM V T

11 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Nominal (TT) ring oscillator speed expected to be 400ps (A)  Equivalent to having many similar critical paths in a chip  V T variation will randomly affect paths within the same die limiting speed to 470ps  Undoped channel reduces variability and increases mobility (B)  25% faster mean, 30% faster tail due to tighter distribution  To match performance lower V DD until tails have same speed (C)  Large impact on power due square dependence P=CV 2 f +IV Lower Transistor Variability Improves Speed 65nm Silicon Measurement (A) (B) (C)

12 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  SRAM memories built using 6-T SRAM cell  Smallest transistors on every chip, worst V T mismatch  Higher V DD is required to avoid failures  SRAM blocks limit V DD scaling  V min – lowest operating voltage limited by transistor mismatch  Demonstrated SRAM to V min of 0.425V  Standard SRAM macros  No circuit “tricks” for low voltage operation  Demonstrates potential for 50% voltage scaling Sub 0.5V V Min is Possible 300 mV Tester limit Industry Norm DDC

13 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  40-60% improved matching for SRAM transistors Improved V T Matching Key for Low V MIN Baseline DDC Baseline DDC Baseline DDC Baseline DDC Presented at IEDM 2011

14 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  In analog circuits, matching is key  Large transistors used to improve relative variability in current mirrors, differential pairs, etc.  Better transistor matching allows for  Area savings  Higher performance  Lower power  Undoped channel improves R OUT  higher gain  OpAmp stage  Matched bandwidth, gain, slewrate  Over 50% smaller area (84 vs 190μm 2 )  45% better input noise (176 vs 327 μV)  Bandgap reference circuit  Same accuracy achieved at half the size Design Examples Analog 450um 420um 210um 450um Baseline DDC Baseline DDC 10.9x17.4um 6.9x12.2um SuVolta sample layout

15 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Body Bias to fix systematic variation  Speed-up (forward bias - FBB) slow parts  Cool down (reverse bias - RBB) hot parts  Increase manufacturing yield  Body bias enables multiple modes of operation  Active  minimize power at every performance  Standby  leakage reduction, power gating  DDC transistor provides 2-4x larger body factor Better Chips with Body Biasing Useable Yield Too slowToo hot RBBFBB TCAD prediction

16 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Inverter ring-oscillators (RO) fabricated at process corners  1.2V V DD and 0.9V V DD  For each corner, DDC transistor RO is faster and lower power  Using strong body coefficient to pull in corners  Half the power (50% less power) while matching speed Half the Power at Matched Performance 65nm Silicon Measurement Baseline speed 50% power 100% power SS TT FF TT

17 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Better process control leads to tighter corners  Manufacturing flow further reduces layout effects  1 sigma tighter wafer to wafer and within wafer variation for DDC  Less overdesign as max paths and min (hold) paths are closer  Faster design closure  earlier tapeout  shorter TTM Tighter Manufacturing Corners w/ DDC 11 22 33 V DD =1.2V V DD =0.9V POR 65nm Silicon Measurement

18 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Achieve half the speed at 1/6 the V DD  Use body bias to compensate for temperature and aging  Critical for low V DD operation  Enable workable design window – avoid overdesign Voltage Scaling to 0.6V V DD 65nm Silicon Measurement Baseline DDC FF TT SS -83%

19 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Turbo Mode: DDC transistor achieves over 50% 1.2V V DD  All corners for DDC run at 580MHz vs 370MHz for baseline This is HotChips – Go Faster! DVFSBaselineDDC V DD 1.2V0.6V0.9V1.05V1.2V Speed Power nm Silicon Measurement

20 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Same performance at 0.75V V DD as baseline at 0.9V V DD  30% lower power  Alternatively 25% faster at same voltage  Even better when using body bias to pull in corners 28nm and Beyond (silicon calibrated SPICE simulations)

21 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  CPU: Single thread performance critical  Push frequency by temporarily raising voltage in turbo mode  DVFS with body biasing becomes DVBFS  GPU: High number of cores using small transistors  Less overdesign due to lower delay variability  Increase parallelism, lower voltage, body bias dynamically for more pixels/Watt  Lower frequency blocks  In addition to high V T transistors also run at lower voltage and optimal body bias  Whole chip: Use body bias to adjust for manufacturing variation  Take advantage of improved memory and analog performance  Lowering variability while compatible with existing bulk planar silicon IP Applying DDC to Lower Variability in Mobile SOC

22 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.  Variability limits chips  DDC transistor reduces random variability through its undoped channel  DDC transistor’s strong body factor can be used to fix systematic variation and compensate for temperature variation  DDC technology provides performance kicker from 90nm to 20nm  Straight forward integration into existing nodes  Compatible with existing bulk planar CMOS silicon IP  Use existing CAD flow  DDC technology brings back low power tools  Large range DVFS  Body biasing  Low voltage operation  Taking advantage of reduced variability DDC transistor in design and architecture will lead to next level in mobile SOC Conclusions

23 EDS Silicon Valley October 2012Copyright © 2012 SuVolta, Inc. All rights reserved.Copyright © 2011 SuVolta, Inc. All rights reserved. 23