EVALUATION OF A CIRCUIT PATH DELAY TUNING TECHNIQUE FOR NANOMETER CMOS Advisor: Dr. Adit D. Singh Committee members: Dr. Vishwani D. Agrawal and Dr. Victor.

Slides:



Advertisements
Similar presentations
COMBINATIONAL LOGIC [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Advertisements

ECE 424 – Introduction to VLSI
Algorithm for Fast Statistical Timing Analysis Jakob Salzmann, Frank Sill, Dirk Timmermann SOC 2007, Nov ‘07, Tampere, Finland Institute of Applied Microelectronics.
Tunable Sensors for Process-Aware Voltage Scaling
CSET 4650 Field Programmable Logic Devices
Robust Low Power VLSI R obust L ow P ower VLSI Sub-threshold Sense Amplifier (SA) Compensation Using Auto-zeroing Circuitry 01/21/2014 Peter Beshay Department.
October 2nd Karthik’s MS Defense DVF4: A Dual Vth Feedback Type 4-Transistor Level Converter Master’s Defense Karthik Naishathrala Jayaraman Department.
ELEC Digital Logic Circuits Fall 2014 Logic Synthesis (Chapters 2-5) Vishwani D. Agrawal James J. Danaher Professor Department of Electrical and.
Elettronica T A.A Digital Integrated Circuits © Prentice Hall 2003 Inverter CMOS INVERTER.
Introduction to Digital Systems By Dr. John Abraham UT-Panam.
Leakage and Dynamic Glitch Power Minimization Using MIP for V th Assignment and Path Balancing Yuanlin Lu and Vishwani D. Agrawal Auburn University ECE.
1 Dual Threshold Voltage Domino Logic Synthesis for High Performance with Noise and Power Constraint Seong-Ook Jung, Ki-Wook Kim and Sung-Mo (Steve) Kang.
Statistical Full-Chip Leakage Analysis Considering Junction Tunneling Leakage Tao Li Zhiping Yu Institute of Microelectronics Tsinghua University.
Yuanlin Lu Intel Corporation, Folsom, CA Vishwani D. Agrawal
Fall 06, Sep 19, 21 ELEC / Lecture 6 1 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic.
Polynomial-Time Algorithms for Designing Dual-Voltage Energy Efficient Circuits Master’s Thesis Defense Mridula Allani Advisor : Dr. Vishwani D. Agrawal.
Design of Variable Input Delay Gates for Low Dynamic Power Circuits
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 14: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Process-Variation-Resistant Dynamic Power Optimization for VLSI Circuits Fei Hu Department of ECE Auburn University, AL Ph.D. Dissertation Committee:
Spring 07, Feb 20 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Reducing Power through Multicore Parallelism Vishwani.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Spring 07, Jan 16 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Computer Engineering 222. VLSI Digital System Design Introduction.
Spring 07, Jan 23 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Moore’s Law Vishwani D. Agrawal James J. Danaher.
Institute of Digital and Computer Systems 1 Fabio Garzia / Finding Peak Performance in a Process23/06/2015 Chapter 5 Finding Peak Performance in a Process.
A Defect Tolerant and Performance Tunable Gate Architecture for End-of-Roadmap CMOS Adit D. Singh Electrical and Computer Engineering, Auburn University.
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
Jan. 2007VLSI Design '071 Statistical Leakage and Timing Optimization for Submicron Process Variation Yuanlin Lu and Vishwani D. Agrawal ECE Dept. Auburn.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Fall 06, Sep 14 ELEC / Lecture 5 1 ELEC / (Fall 2006) Low-Power Design of Electronic Circuits (Formerly ELEC / )
Lecture 7: Power.
Spring 10, Jan 13ELEC 7770: Advanced VLSI Design (Agrawal)1 ELEC 7770 Advanced VLSI Design Spring 2010 VLSI Yield and Moore’s Law Vishwani D. Agrawal James.
Fall 2006: Dec. 5 ELEC / Lecture 13 1 ELEC / (Fall 2006) Low-Power Design of Electronic Circuits Adiabatic Logic Vishwani.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Advanced Computing and Information Systems laboratory Device Variability Impact on Logic Gate Failure Rates Erin Taylor and José Fortes Department of Electrical.
EE466: VLSI Design Power Dissipation. Outline Motivation to estimate power dissipation Sources of power dissipation Dynamic power dissipation Static power.
04/26/05 Anthony Singh, Carleton University, MCML - Fixed Point - Integer Divider Presentation #2 High-Speed Low Power VLSI – Prof. Shams By Anthony.
MOS Capacitors MOS capacitors are the basic building blocks of CMOS transistors MOS capacitors distill the basic physics of MOS transistors MOS capacitors.
1 Delay Estimation Most digital designs have multiple data paths some of which are not critical. The critical path is defined as the path the offers the.
ENGG 6090 Topic Review1 How to reduce the power dissipation? Switching Activity Switched Capacitance Voltage Scaling.
32-BIT ADDER FOR LOW VOLTAGE OPERATION WITH LEVEL CONVERTERS PRIYADHARSHINI S.
TEMPLATE DESIGN © Gate-Diffusion Input (GDI) Technique for Low Power CMOS Logic Circuits Design Yerkebulan Saparov, Aktanberdi.
SoC TAM Design to Minimize Test Application Time Advisor Dr. Vishwani D. Agrawal Committee Members Dr. Victor P. Nelson, Dr. Adit D. Singh Apr 9, 2015.
An Efficient Algorithm for Dual-Voltage Design Without Need for Level-Conversion SSST 2012 Mridula Allani Intel Corporation, Austin, TX (Formerly.
Jia Yao and Vishwani D. Agrawal Department of Electrical and Computer Engineering Auburn University Auburn, AL 36830, USA Dual-Threshold Design of Sub-Threshold.
Low Power – High Speed MCML Circuits (II)
26 th International Conference on VLSI January 2013 Pune,India Optimum Test Schedule for SoC with Specified Clock Frequencies and Supply Voltages Vijay.
Robust Low Power VLSI ECE 7502 S2015 Minimum Supply Voltage and Very- Low-Voltage Testing ECE 7502 Class Discussion Elena Weinberg Thursday, April 16,
Modern VLSI Design 4e: Chapter 3 Copyright  2008 Wayne Wolf Topics n Pseudo-nMOS gates. n DCVS logic. n Domino gates. n Design-for-yield. n Gates as IP.
Distributed Computation: Circuit Simulation CK Cheng UC San Diego
IMPLEMENTATION OF MIPS 64 WITH VERILOG HARDWARE DESIGN LANGUAGE BY PRAMOD MENON CET520 S’03.
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA Project Guide: Smt. Latha Dept of E & C JSSATE, Bangalore. From: N GURURAJ M-Tech,
Bi-CMOS Prakash B.
An Introduction to VLSI (Very Large Scale Integrated) Circuit Design
Patricia Gonzalez Divya Akella VLSI Class Project.
A Class presentation for VLSI course by : Maryam Homayouni
Solid-State Devices & Circuits
Introduction to CMOS Transistor and Transistor Fundamental
Seok-jae, Lee VLSI Signal Processing Lab. Korea University
ELEC Digital Logic Circuits Fall 2015 Delay and Power Vishwani D. Agrawal James J. Danaher Professor Department of Electrical and Computer Engineering.
CS203 – Advanced Computer Architecture
GATE DIFFUSION INPUT: A low power digital circuit design
ELEC 7950 – VLSI Design and Test Seminar
YASHWANT SINGH, D. BOOLCHANDANI
Device Structure & Simulation
Challenges in Nanoelectronics: Process Variability
M.S. Thesis Defense Murali Dharan Advisor: Dr. Vishwani D. Agrawal
Energy Efficient Power Distribution on Many-Core SoC
ELEC 7770 Advanced VLSI Design Spring 2014 VLSI Yield and Moore’s Law
A Low-Power Analog Bus for On-Chip Digital Communication
Presentation transcript:

EVALUATION OF A CIRCUIT PATH DELAY TUNING TECHNIQUE FOR NANOMETER CMOS Advisor: Dr. Adit D. Singh Committee members: Dr. Vishwani D. Agrawal and Dr. Victor P. Nelson Department of Electrical and Computer Engineering Masters Project Defense Ahmed Faraz

MOORES LAW International Technology Roadmap for Semiconductors (ITRS) is expected to continue for another decade at least Performance gains measured in terms of processor speeds has started to saturate or even fall back a little Transistor integration density per die[1]

PROCESSOR CLOCK RATES Power dissipation/cooling problems Process variability Clock Rate

PROCESS VARIATION Natural variation occurring in the parameters of transistors during the fabrication of integrated circuits Critical sources are Random Dopant Fluctuations, Line-edge and Line-width roughness and variations in gate oxide thickness[2,3] All of the sources mentioned above primarily degrade Threshold Voltage(V th )

PROCESS VARIATION(contd..) THRESHOLD VOLTAGE(Vth) NUMBEROFTRANSISTORSNUMBEROFTRANSISTORS 5 out of 10 million elements lie beyond in a normal distribution For, worst case delay = 10 X average gate delay NUMBEROFTRANSISTORSNUMBEROFTRANSISTORS Larger technologySmaller technology

Post manufacture paths are NOT balanced Pre manufacture paths are optimized and balanced Clock period = Worst case delay CLOCK RATES IN PIPELINED PROCESSORS

MOTIVATION Statistical chance of every chip to have a few hundred exceptionally slow outlier devices Such outliers (10 X gate delays) make the overall path delay much much greater than the average delays Post manufacture tuning is proposed to bring down the worst case path delay close to the average case delays[4]

CMOS GATE Every CMOS gate has a pull-up network with PMOS transistors, and a pull-down network with NMOS transistor The presence of a parallel path can speed up the charge time or the discharge time respectively

POST MANUFACTURING TUNING A parallel PMOS transistor with tuning capability is introduced in to the pull-up network A parallel NMOS transistor with tuning capability is introduced in to the pull-down network The gate terminals of the tuning transistors are connected to a switch which can be turned ON or OFF

ASSUMPTIONS Outlier gates can be diagnosed Programming capability exists to control the tuning transistors TUNABLE CMOS GATE

SLOW TRANSISTOR IN PULL- UP NETWORK Switching on the tuning PMOS transistor Parallel path in pull- up network will speed up the charge time

SLOW TRANSISTOR IN PULL- DOWN NETWORK Switching on the tuning NMOS transistor Parallel path in pull- down network will speed up the discharge time

SIZING TUNING TRANSISTORS

SIZING (CONTD..)

EVALUATION OF TUNING TECHNIQUE Need SPICE simulation studies to evaluate the effectiveness of tuning technique

GENERATING SPICE NETLIST (USING 45 nm NANGATE OPEN CELL LIBRARY)

Modelsim (functional verification) Structural model (Verilog) RTL Compiler Perl code SPICE netlist Cadence SOC encounter 45 nm tech.

EXAMPLE SMALL CIRCUIT NAND tree with 64 inputs and 1 output Each NAND gate has tuning capabilty Every circuit has 381 PMOS and 381 NMOS transistor (including tuning transistor) Each transistor is assigned a different V th V th drawn from Gaussian distribution

64 INPUTS OUTPUT

SIMULATION OF EXAMPLE CIRCUIT Worst path delay simulated for three cases Circuit without tuning circuitry added Circuit with tuning circuitry added Tuned Circuit

SIMULATING COPIES OF SMALL CIRCUIT Circuit simulated 10,000 times New random set of V th values for transistors in every simulation Worst path delays are stored for every copy of the circuit

CONSTRUCTING LARGER CIRCUITS Need larger circuits to see impact of parameter variations Pick N copies of standard small circuit to make a large circuit Largest delays among N subcircuit is the worst path delays of large circuit N ranges from 1 to 5000 Pick of N sub-circuits is done 1000 times each to get an average case for every size N

10,000 copies of basic small circuit Worst case path delay Larger circuit with 10 randomly chosen copies Pick of 10 sub-circuits is done 1000 times to get an average case for size 10 N=10

Speed up 45 nm technology V dd =1V V th =0.464V Sigma=0.116V SIMULATION RESULTS

CONCLUSION This tuning technique speeds up worst case path delays close to the average case delay by adding redundant tuning transistors to the circuit Only few hundred out of millions of gates are tuned Simulation results indicate that significant performance gain can be achieved

FUTURE WORK Adding tunable gates to libraries so that its extraction can be automated Study of definite diagnosis strategy that would enable the detection of outlier gates in a chip Simulations with standard circuits to make the case more authentic and applicable

REFERENCES [1] G. E. Moore et al., Cramming more components onto integrated circuits," Proceedings of the IEEE, vol. 86, no. 1, pp , [2] K. S. Saha, Modeling Process Variability in Scaled CMOS Technology," IEEE Design and Test of Computers, Vol. 27, Issue 2, pp , March/April [3] C. Kenyon, A. Kornfeld, and et al., Managing Process Variation in Intel's 45nm CMOS Technology. Intel Technology Journal, Vol. 12, Issue 2, pp , June [4] A. D. Singh, K. Mishra, and et al., Path Delay Tuning for Performance Gain in the face of Random Manufacturing Variations," in proc. International Conference on VLSI Design, Bangalore, India, January 2011

THANK YOU