Circuit Design with Alternative Energy-Efficient Devices

Slides:



Advertisements
Similar presentations
Semiconductor Memory Design. Organization of Memory Systems Driven only from outside Data flow in and out A cell is accessed for reading by selecting.
Advertisements

Keeping Hot Chips Cool Ruchir Puri, Leon Stok, Subhrajit Bhattacharya IBM T.J. Watson Research Center Yorktown Heights, NY Circuits R-US.
CS 7810 Lecture 12 Power-Aware Microarchitecture: Design and Modeling Challenges for Next-Generation Microprocessors D. Brooks et al. IEEE Micro, Nov/Dec.
Lecture 7: Power.
Integrated Regulation for Energy- Efficient Digital Circuits Elad Alon 1 and Mark Horowitz 2 1 UC Berkeley 2 Stanford University.
Lecture 21, Slide 1EECS40, Fall 2004Prof. White Lecture #21 OUTLINE –Sequential logic circuits –Fan-out –Propagation delay –CMOS power consumption Reading:
Logic Optimization Mohammad Sharifkhani. Reading Textbook II, Chapters 5 and 6 (parts related to power and speed.) Following Papers: –Nose, Sakurai, 2000.
Effects of Variation on Emerging Devices for Use in SRAM
17 Sep 2002Embedded Seminar2 Outline The Big Picture Who’s got the Power? What’s in the bag of tricks?
Low Power Techniques in Processor Design
Low-Power Wireless Sensor Networks
Jia Yao and Vishwani D. Agrawal Department of Electrical and Computer Engineering Auburn University Auburn, AL 36830, USA Dual-Threshold Design of Sub-Threshold.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Low Power – High Speed MCML Circuits (II)
Circuit Design with Alternative Energy-Efficient Devices Elad Alon Collaborators: Hei Kam, Fred Chen (MIT), Tsu-Jae King-Liu, Vladimir Stojanovic (MIT),
Basics of Energy & Power Dissipation
HO #3: ELEN Review MOS TransistorsPage 1S. Saha Long Channel MOS Transistors The theory developed for MOS capacitor (HO #2) can be directly extended.
Patricia Gonzalez Divya Akella VLSI Class Project.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
Seok-jae, Lee VLSI Signal Processing Lab. Korea University
Yanqing Zhang University of Virginia On Clock Network Design for Sub- threshold Circuitry 1.
Power-Optimal Pipelining in Deep Submicron Technology
Smruti R. Sarangi IIT Delhi
Tunnel FETs Peng Wu Mar 30, 2017.
YASHWANT SINGH, D. BOOLCHANDANI
CS203 – Advanced Computer Architecture
Lecture 19: SRAM.
Lecture 21 OUTLINE The MOSFET (cont’d) P-channel MOSFET
Basics of Energy & Power Dissipation
OMEN: a Quantum Transport Modeling Tool for Nanoelectronic Devices
Circuits and Interconnects In Aggressively Scaled CMOS
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
ELEC 5270/6270 Spring 2013 Low-Power Design of Electronic Circuits Pseudo-nMOS, Dynamic CMOS and Domino CMOS Logic Vishwani D. Agrawal James J. Danaher.
Day 16: September 15, 2010 Energy and Power
Low Power and High Speed Multi Threshold Voltage Interface Circuits
Challenges in Nanoelectronics: Process Variability
Analytical Delay and Variation Modeling for Subthreshold Circuits
Analytical Delay and Variation Modeling for Subthreshold Circuits
Tunnel field-effect transistor
Lecture 19 OUTLINE The MOSFET: Structure and operation
ELEC 5270/6270 Spring 2015 Low-Power Design of Electronic Circuits Pseudo-nMOS, Dynamic CMOS and Domino CMOS Logic Vishwani D. Agrawal James J. Danaher.
Ratioed Logic.
M.S. Thesis Defense Murali Dharan Advisor: Dr. Vishwani D. Agrawal
Notes on Diodes 1. Diode saturation current:  
Week 9a OUTLINE MOSFET ID vs. VGS characteristic
Day 17: October 18, 2010 (Energy) Ratioed Logic
ELEC 6970: Low Power Design Class Project By: Sachin Dhingra
Dual Mode Logic An approach for high speed and energy efficient design
Vishwani D. Agrawal James J. Danaher Professor
Circuit Design Techniques for Low Power DSPs
Lecture 21 OUTLINE The MOSFET (cont’d) P-channel MOSFET
Lecture 21 OUTLINE The MOSFET (cont’d) P-channel MOSFET
Week 9a OUTLINE MOSFET ID vs. VGS characteristic
Day 2: September 10, 2010 Transistor Introduction
Lecture 21 OUTLINE The MOSFET (cont’d) P-channel MOSFET
Lecture #17 (cont’d from #16)
Day 18: October 20, 2010 Ratioed Logic Pass Transistor Logic
Characterization of C2MOS Flip-Flop in Sub-Threshold Region
Literature Review Demonstration of Integrated Micro-Electro-Mechanical Switch Circuits for VLSI Applications Fred Chen, Matthew Spencer, Rhesa Nathanael,
Lecture 7: Power.
Lecture #22 ANNOUNCEMENTS OUTLINE Reading (Rabaey et al.)
Lecture 7: Power.
Reading: Hambley Ch. 7; Rabaey et al. Secs. 5.2, 5.5, 6.2.1
EE216A – Fall 2010 Design of VLSI Circuits and Systems
A 200MHz <insert E #>pJ 6-bit Absolute-Value Detector
Arithmetic Building Blocks
COMBINATIONAL LOGIC - 2.
Arithmetic Circuits.
Lecture #16 OUTLINE MOSFET ID vs. VGS characteristic
ELEC 5270/6270 Spring 2009 Low-Power Design of Electronic Circuits Pseudo-nMOS, Dynamic CMOS and Domino CMOS Logic Vishwani D. Agrawal James J. Danaher.
Presentation transcript:

Circuit Design with Alternative Energy-Efficient Devices Elad Alon Collaborators: Hei Kam, Fred Chen (MIT), Tsu-Jae King-Liu, Vladimir Stojanovic (MIT), Dejan Markovic (UCLA), Mark Horowitz (Stanford) Dept. of EECS, UC Berkeley

CMOS is Scaling, Power Can Not 1000 Predictions (ca. 2000) Reality (Core 2) Itanium II 100 Itanium Pentium 4 Pentium III Pentium Pro Power (W) 10 Pentium Pentium II 486DX 80286 8086 8088 1 8080 386DX 8008 S. Borkar, Intel 4004 0.1 1970 1975 1980 1985 1990 1995 2000 2005 2010

Supply and Threshold Voltages Ed Nowak, IBM Drain Current Id Scaling Vth, Vdd kT/q is fixed, so lowering Vth increases leakage (independent of technology) Since reached Vth that optimally balances leakage and dynamic energy at roughly 90nm node, can no longer scale Vth Means that Vdd is fixed in order to achieve certain performance, and hence power doesn’t scale well with technology. Gate Voltage Vg kT/q doesn’t scale, so lowering Vth increases leakage Fixed Vth, Vdd  power density doesn’t scale well 3

Alternative Devices to the Rescue? Many new devices with S-1<60mV/dec proposed But, many of these are slow (low Ion) And/or have other “weird” characteristics Can these devices reduce energy? If so, at what performance? Need to look at the circuits MOSFET Drain Current Id New Device Slope=S-1 Gate Voltage Vg

Outline Energy-Performance Analysis Circuit Design with Relays Conclusions 5

Processor Power Breakdown Most components track performance vs. energy curves of logic Control, Datapath, Clock Use proxy circuit to examine tradeoffs

Proxy Circuit for Static Logic 0V Vdd Vdd Input Output Ld stages Switching activity factor = , Gate capacitance per stage = C tdelay = LdCVdd/(2Ion) Edyn+Eleak = αLdCVdd2 + LdIoffVddtdelay

Simple Optimization Rule Optimal Ion/Ioff  Ld/α Derived in CMOS But holds for nearly all switching devices Pleak/Pdyn ~constant ~30-50% across wide range of parameters Nose and Sakurai

Using the Rule to Compare MOSFET “New Device” Energy Drain Current Id “New Device” MOSFET Vddx Vddx Gate Voltage Vg Performance Match Ioff by adjusting “VT” New device wins if: Ion,new(Vdd) > Ion,MOS(Vdd)

What Else Matters: Variability Delay: Finite Ld Cycle time set by worst-case Leakage: E(Ioff) vs. E(Vth)

What Else Matters: Wires & Area 0V Vdd Vdd Input Output Cw Cw Cw Cw Devices don’t drive just other devices Need to look at extrinsic cap (wires) too Especially if device has area overhead

Parallelism Serial: Perf.  f Parallel: Perf.  2f, E/op ~const Energy “New Device” Parallel: Perf.  2f, E/op ~const MOSFET Performance If available, parallelism allows slower devices Extends energy benefit to higher performance

Minimum Energy Seff-1 At low performance or high parallelism: Drain Current Id Seff-1 Normalized Energy/cycle Lower Seff Gate Voltage Vg Vdd(V) At low performance or high parallelism: Lowest Vdd for required Ion/Ioff wins Vdd,min  Seff, Emin  Seff2

Example: Tunneling FET Gate Source Drain Drain Current Id (A/mm) Ion ≈A(Vgs+VT)exp[-B/(Vgs+VT)] [1] [1]J. Chen et al., IEEE Electron Device Lett., vol. EDL-8, no. 11, pp. 515–517, Nov. 1987. Gate Voltage Vg (V) Band-to-band tunneling device Steep transition (<60mV/dec) at low current Low Ion(<~100μA) Assume work function can be tuned

Energy-Performance Tradeoff 30 stages α=0.01 TFET Energy (J) MOSFET Performance (GHz) Competitive with subthreshold CMOS TFETs promising below ~100MHz

Outline Energy-Performance Analysis Circuit Design with Relays Conclusions 16

Nano-Electro-Mechanical Relay Gon Conductance Vrl Vpi Gate Voltage Vg [V] Based on mechanically making and breaking contact No leakage, perfectly abrupt transition Reliability is the key challenge

Circuit Design with Relays CMOS: Relay: CMOS delay set by electrical time constant Distribute logical/electrical effort over many stages Relay: mechanical delay (~10ns) >> electrical t (~1ps) Implement logic as a single complex gate Spend more time discussing stages vs. pass transistor logic. This characteristic means that we should take a design style that is a departure from the one we’re used to w/ CMOS Fundamental difference between CMOS & Relays - CMOS delay set by electrical time constant - more (delay) efficient to buffer in stages - Relay delay set mechanically - can sacrifice electrical (logical) complexity for mechanical simplicity Add # of relays & # of transistors to title of slides… 18

Relay Energy-Perf. Tradeoff TFET Stack of 30 series relays No leakage Vdd,min set only by functionality (surface force) How about real logic circuits? MOSFET Energy (J) Relay Performance (GHz)

Relay-Based Adder Manchester carry chain Ripple carry Cascade full adder cells N-bit adder still 1 mechanical delay

Adder Energy-Delay Compare vs. optimal CMOS adder ~10-40x slower Low Rcont not critical ~10-100x lower E/op Lower Cg Fewer devices, all minimum size Lower Vdd,min

Parallelism and Area If parallelism available, can trade area for throughput Competing with sub-threshold CMOS Area-overhead bounded

Power Breakdown Revisited Better logic  “uncore” power dominant Need to analyze (and leverage) devices for entire system… Relay DRAM or NVM (not SRAM)? Relay ADC/DACs?

Outline Simple Energy-Performance Analysis Circuit Design with Relays Conclusions 24

Summary New devices need circuit level analysis Ion/Ioff set by logic depth, activity factor Don’t forget about variability, wires Tailor circuit style to the device If available, parallelism may allow slower (low Ion) devices Don’t forget about the rest of the system

Today: Parallelism lowers E/op Future: Parallelism doesn’t help Good News/Bad News Parallelism still available in CMOS But eventually limited by Emin Opportunity for new devices… At least in sub-100MHz applications Today: Parallelism lowers E/op Future: Parallelism doesn’t help -1

Acknowledgements Berkeley Wireless Research Center NSF DARPA FCRP