Ring Oscillator Clocks and Margins

Slides:



Advertisements
Similar presentations
Asynchronous Circuits Jordi Cortadella Universitat Politècnica de Catalunya, Barcelona Collège de France May 14 th, 2013.
Advertisements

Data Synchronization Issues in GALS SoCs Rostislav (Reuven) Dobkin and Ran Ginosar Technion Christos P. Sotiriou FORTH ICS- FORTH.
Tunable Sensors for Process-Aware Voltage Scaling
OCV-Aware Top-Level Clock Tree Optimization
NTHU-CS VLSI/CAD LAB TH EDA De-Shiuan Chiou Da-Cheng Juan Yu-Ting Chen Shih-Chieh Chang Department of CS, National Tsing Hua University, Taiwan Fine-Grained.
Variability-Driven Formulation for Simultaneous Gate Sizing and Post-Silicon Tunability Allocation Vishal Khandelwal and Ankur Srivastava Department of.
0 1 Width-dependent Statistical Leakage Modeling for Random Dopant Induced Threshold Voltage Shift Jie Gu, Sachin Sapatnekar, Chris Kim Department of Electrical.
The Cost of Fixing Hold Time Violations in Sub-threshold Circuits Yanqing Zhang, Benton Calhoun University of Virginia Motivation and Background Power.
Subthreshold Logic Energy Minimization with Application- Driven Performance EE241 Final Project Will Biederman Dan Yeager.
Handshake protocols for de-synchronization I. Blunno, J. Cortadella, A. Kondratyev, L. Lavagno, K. Lwin and C. Sotiriou Politecnico di Torino, Italy Universitat.
1 Variability Characterization in FPGAs Brendan Hargreaves 10/05/2006.
Institute of Digital and Computer Systems 1 Fabio Garzia / Finding Peak Performance in a Process23/06/2015 Chapter 5 Finding Peak Performance in a Process.
NTHU-CS VLSI/CAD LAB TH EDA Student : Da-Cheng Juan Advisor : Shih-Chieh Chang Fine-Grained Sleep Transistor Sizing Algorithm for Leakage Power Minimization.
A Defect Tolerant and Performance Tunable Gate Architecture for End-of-Roadmap CMOS Adit D. Singh Electrical and Computer Engineering, Auburn University.
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
Focus group: Statistical synthesis. Top reasons to go for statistical Often cited - worst case is way off - exact SI and IR drop analysis is too complex.
DDRO: A Novel Performance Monitoring Methodology Based on Design-Dependent Ring Oscillators Tuck-Boon Chan †, Puneet Gupta §, Andrew B. Kahng †‡ and Liangzhen.
Signal Integrity Methodology on 300 MHz SoC using ALF libraries and tools Wolfgang Roethig, Ramakrishna Nibhanupudi, Arun Balakrishnan, Gopal Dandu Steven.
-1- UC San Diego / VLSI CAD Laboratory Methodology for Electromigration Signoff in the Presence of Adaptive Voltage Scaling Wei-Ting Jonas Chan, Andrew.
Self-Biased, High-Bandwidth, Low- Jitter 1-to-4096 Multiplier Clock Generator PLL Based on a presentation by: John G. Maneatis 1, Jaeha Kim 1, Iain McClatchie.
11 1 Process Variation in Near-threshold Wide SIMD Architectures Sangwon Seo 1, Ronald G. Dreslinski 1, Mark Woh 1, Yongjun Park 1, Chaitali Chakrabarti.
VOLTAGE SCHEDULING HEURISTIC for REAL-TIME TASK GRAPHS D. Roychowdhury, I. Koren, C. M. Krishna University of Massachusetts, Amherst Y.-H. Lee Arizona.
1 IN THE NAME GOD Advanced VLSI Class Presentation A 1.1GHz Charge Recovery Logic Insructor : Dr. Fakhrayi Presented by : Mahdiyeh Mehran.
Power Reduction for FPGA using Multiple Vdd/Vth
Logic Synthesis for Low Power(CHAPTER 6) 6.1 Introduction 6.2 Power Estimation Techniques 6.3 Power Minimization Techniques 6.4 Summary.
Low Power – High Speed MCML Circuits (II)
EEE2243 Digital System Design Chapter 7: Advanced Design Considerations by Muhazam Mustapha, extracted from Intel Training Slides, April 2012.
Minimum Energy Sub-Threshold CMOS Operation Given Yield Constraints Max Dreo Vincent Luu Julian Warchall.
Outline Introduction: BTI Aging and AVS Signoff Problem
Variation-Tolerant Circuits: Circuit Solutions and Techniques Jim Tschanz, Keith Bowman, and Vivek De Microprocessor Technology Lab Intel Corporation,
REALITY STATISTICAL CHARACTERIZATION OF A HIGH-K METAL GATE 32NM ARM926 CORE UNDER PROCESS VARIABILITY IMPACT Paul Zuber Petr Dobrovolny Miguel Miranda.
Patricia Gonzalez Divya Akella VLSI Class Project.
1 Bridging the gap between asynchronous design and designers Peter A. BeerelFulcrum Microsystems, Calabasas Hills, CA, USA Jordi CortadellaUniversitat.
Static Timing Analysis
VADA Lab.SungKyunKwan Univ. 1 L5:Lower Power Architecture Design 성균관대학교 조 준 동 교수
EE141 Timing Issues 1 Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003 Rev /05/2003.
EE141 Timing Issues 1 Chapter 10 Timing Issues Rev /11/2003.
-1- Soft Core Viterbi Decoder EECS 290A Project Dave Chinnery, Rhett Davis, Chris Taylor, Ning Zhang.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
Copyright © 2009, Intel Corporation. All rights reserved. Power Gate Design Optimization and Analysis with Silicon Correlation Results Yong Lee-Kee, Intel.
University of Michigan Advanced Computer Architecture Lab. 2 CAD Tools for Variation Tolerance David Blaauw and Kaviraj Chopra University of Michigan.
Gopakumar.G Hardware Design Group
Lecture 11: Sequential Circuit Design
Time-borrowing platform in the Xilinx UltraScale+ family of FPGAs and MPSoCs Ilya Ganusov, Benjamin Devlin.
Timing issues.
Reactive Clocks with Variability-Tracking Jitter
Defining Statistical Sensitivity for Timing Optimization of Logic Circuits with Large-Scale Process and Environmental Variations Xin Li, Jiayong Le, Mustafa.
Circuits and Interconnects In Aggressively Scaled CMOS
Revisiting and Bounding the Benefit From 3D Integration
Introduction to CMOS VLSI Design Lecture 10: Sequential Circuits
Fine-Grain CAM-Tag Cache Resizing Using Miss Tags
Challenges in Nanoelectronics: Process Variability
Limitations of STA, Slew of a waveform, Skew between Signals
CLOCK DOMAIN AND OPERATING CONDITIONS
Introduction to Static Timing Analysis:
CMOS VLSI Design Chapter 13 Clocks, DLLs, PLLs
Clocking in High-Performance and Low-Power Systems Presentation given at: EPFL Lausanne, Switzerland June 23th, 2003 Vojin G. Oklobdzija Advanced.
Dual Mode Logic An approach for high speed and energy efficient design
Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003
Circuit Design Techniques for Low Power DSPs
CMOS VLSI Design Chapter 13 Clocks, DLLs, PLLs
Pre-Computed Asynchronous Scan Invited Talk
Jordi Cortadella and Jordi Petit
Post-Silicon Calibration for Large-Volume Products
De-synchronization: from synchronous to asynchronous
Towards a Fully Digital State-of-the-art Analog SiPM
Low Power Digital Design
Measuring the Gap between FPGAs and ASICs
Parametric Yield Estimation Considering Leakage Variability Rajeev Rao, Anirudh Devgan, David Blaauw, Dennis Sylvester Present by Fengbo Ren Apr. 30.
Research Topics Embedded, Real-time, Sensor Systems Frank Mueller moss
Presentation transcript:

Ring Oscillator Clocks and Margins Jordi Cortadella, Marc Lupon, Alberto Moreno, Antoni Roca and Sachin S. Sapatnekar Universitat Politècnica de Catalunya University of Minnesota

Matched delay: is it long enough? Combinational logic ? req req delay C ? C ack ack RO Clocks and Margins Async 2016

Static Timing Analysis RO Clocks and Margins Async 2016

Static Timing Analysis Flip Flops Combinational Logic Flip Flops PLL RO Clocks and Margins Async 2016

Static Timing Analysis: setup constraint Comb. Logic PVT variability Two competing paths: Launching path Capturing path Launching path < Capturing path + Period No variability (or very little) PLL 1 2 1 1 2 2 + Margins RO Clocks and Margins Async 2016

Correlated! Ring Oscillator PVT variability + Margins Comb. Logic Launching path < Capturing path + Period PLL + Margins RO Clocks and Margins Async 2016

Matched delays: setup constraint Launching Path Capturing Path Delay delay C C Launching path < Capturing path + Delay (Margins?) RO Clocks and Margins Async 2016

Handshaking Ring Oscillators Combinational Logic delay C C RO Clocks and Margins Async 2016

Understanding variability RO Clocks and Margins Async 2016

Process variability Fast Typical Slow RO Clocks and Margins Async 2016

Dynamic variability (Vdd-Vss) Raj Nair (Anasim Corp.), IC floorplanning and Power Integrity, SOCcentral, Aug 2, 2010. http://www.soccentral.com/results.asp?entryID=31901. From SOCcentral.com, Copyright 2002 - 2014 Tech Pro Communications RO Clocks and Margins Async 2016

Variability during STA Within-die (local) Derating factors (OCV) Die-to-die (global) Library corners Source: H. Masuda et al. (ICICC 2005) RO Clocks and Margins Async 2016

Timing sign-off RO Clocks and Margins Async 2016

Local (OCV) variability Timing sign-off Global variability Corner: Best  Local (OCV) variability Corner: Typical  Corner: Worst Jitter 3 Clock Period (1+)  PathDelay < Period - Jitter OCV derating factor typical   0.05 … 0.15 RO Clocks and Margins Async 2016

Timing sign-off with ROs Best Ring Oscillator Critical Path Typical Critical Path Ring Oscillator Worst Critical Path Ring Oscillator RO Clocks and Margins Async 2016

Timing sign-off with ROs Goal: doing classical timing sign-off using conventional STA tools designer’s library corners and OCV derating factors In the paper you will find A more sophisticated statistical delay model (1st-order Taylor expansions for variability) This presentation: A simpler and complementary variability model Similar conclusions as the ones in the paper RO Clocks and Margins Async 2016

Timing sign-off delay model Critical Path yield 3 derating factor (OCV) RO Clocks and Margins Async 2016

Timing sign-off delay model with ROs Critical Path Same corner Ring Oscillator RO Clocks and Margins Async 2016

Timing sign-off model with ROs Ring Oscillator Critical Path Assumption: similar index of dispersion (variance-to-mean ratio) new derating factor RO Clocks and Margins Async 2016

Timing sign-off model with ROs Ring Oscillator Critical Path Assumption: similar index of dispersion (variance-to-mean ratio) new derating factor For small values of   RO Clocks and Margins Async 2016

Apple-to-apple comparison Clock domain Ring oscillator PLL RO Clocks and Margins Async 2016

Apple-to-apple comparison Clock domain Ring oscillator Fixed PLL 1.15 1.21 Adaptive !!! RO Clocks and Margins Async 2016

Variability: PLL vs. Ring Oscillator Q D time clock tree critical path launching path capturing path clock tree PLL period RO Clocks and Margins Async 2016

Variability: PLL vs. Ring Oscillator Q D PLL period time clock tree critical path launching path capturing path clock tree Voltage droop Q D clock-data compensation PLL period margin Q D Ring Oscillator only margins for OCV RO Clocks and Margins Async 2016

PLL vs. Ring Oscillator Adaptive frequency Fixed frequency Ring Osc. Clk (1.2V) Adaptive frequency 30% PLL (1.2V) Fixed frequency Ring Osc. Clk (0.85V) Vdd = 1.2V  30% RO Clocks and Margins Async 2016

PLL vs. Ring Oscillator PLL computation time Ring oscillator wasted time computation time Cycle period (just in case) Ring oscillator Avg period computation time Cycle period (just-in-time) RO Clocks and Margins Async 2016

Synthesis of the Ring Oscillator PLL period jitter Delay OCV (1+) Benefits 1+𝛿 2 Critical paths Corners FF Vdd+10% 0oC TT Vdd 25oC SS Vdd-10% 125oC RO Clocks and Margins Async 2016

Experiment RO Clocks and Margins Async 2016

10 AES modules (65nm) 1.75 mm 0.7 mm 350x350 m2 Every AES module can be activated/deactivated by clock gating Switching activity generated by VCS IR drop (static and dynamic) estimated by PrimeRail RO Clocks and Margins Async 2016

Voltage droop analysis All modules active Nominal voltage: 1V Max voltage droop: 168mV Max voltage difference: 25mV RO Clocks and Margins Async 2016

Max voltage difference RO Clocks and Margins Async 2016

Performance & Energy benefits Sign-off (WC) 0.9V, 125oC 1.6x voltage scaling -40% Energy Perfect binning Ring oscillator 1.2x 0.9V, 125oC 1.0V, 75oC RO Clocks and Margins Async 2016

Reactive Clocks J. Cortadella, L. Lavagno, P. López, M. Lupon, A. Moreno, A. Roca and S.S. Sapatnekar Reactive Clocks with variability-tracking jitter ICCD 2015. Algorithms and EDA scripts for timing analysis: Automatic synthesis of Ring Oscillators Technology independent (any .lib file) Adaptable to different STA and P&R tools Ring Oscillators + monitors for post-silicon calibration: Non-intrusive monitors automatically synthesized Patented technology RO Clocks and Margins Async 2016

Conclusions Moore’s law is (economically) over. It’s time to exploit what is left in established nodes. Ring Oscillators: a zero-risk scheme to boost performance (1.6x speedup) or reduce energy (40%) Margins for Ring Oscillators (or matched delays) Use the same library corners Derating factor: 1+𝛿 → (1+𝛿 2 ) RO Clocks and Margins Async 2016