Construction of Latency-Bounded Clock Trees
Rickard Ewetz, Chuan Yean Tan, Cheng-Kok Koh
Purdue University

Presentation transcript:

On-Chip Variations
Process variations
Voltage variations
Temperature variations
(Figure: sinks i and j, annotated with 175 ps, 225 ps, 75 ps, and 125 ps.)

Outline
Motivation
Problem Formulation
Proposed Latency Constraint Graph (LCG)
Proposed Tree Construction Framework
Experimental Results and Future Work

Skew Constraints
A setup-time constraint and a hold-time constraint apply between flip-flops FF i and FF j, which are connected through combinational logic.
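
A minimal sketch of how a setup and a hold constraint bound the skew between FF i and FF j. It assumes the common textbook form of both constraints; the class `TimingArc`, the function `skew_range`, and all field names and numbers are hypothetical and do not come from the slides.

```python
from dataclasses import dataclass


@dataclass
class TimingArc:
    """Hypothetical timing data for one launch/capture pair (all values in ps)."""
    t_cq: float     # clock-to-Q delay of the launching flip-flop FF i
    d_max: float    # longest combinational delay from FF i to FF j
    d_min: float    # shortest combinational delay from FF i to FF j
    t_setup: float  # setup time of the capturing flip-flop FF j
    t_hold: float   # hold time of the capturing flip-flop FF j


def skew_range(arc, period):
    """Return (lo, hi) such that lo <= t_i - t_j <= hi meets both constraints.

    Assumed setup constraint: t_i + t_cq + d_max + t_setup <= t_j + period
    Assumed hold constraint:  t_i + t_cq + d_min >= t_j + t_hold
    """
    hi = period - arc.t_cq - arc.d_max - arc.t_setup
    lo = arc.t_hold - arc.t_cq - arc.d_min
    return lo, hi


arc = TimingArc(t_cq=50, d_max=600, d_min=100, t_setup=30, t_hold=20)
print(skew_range(arc, period=1000))
```

Each flip-flop pair therefore contributes one lower and one upper bound on the difference of two clock arrival times, which is the kind of pairwise constraint a clock tree has to satisfy.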

Clock Tree Synthesis
Objective: Connect source to sinks
– Buffers
– Wires
Constraints:
– Transition time
– Skew
(Figure: clock source driving clock sinks a, b, c, d through wires and buffers.)

Clock Tree Synthesis

On-Chip Variations
Estimate OCV
Delay variations by OCV and safety margin
(Figure: clock sinks a, b, c, d with CCA(a,b) and CCA(b,c) marked in the tree.)
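
A small sketch of the tree query behind the CCA(a,b) labels, assuming CCA denotes the closest common ancestor of two sinks. It also sums the delay on the two paths below the CCA, on the assumption that only this unshared part of the tree contributes to skew variation between the two sinks. The tree, the delays, and all function names are hypothetical.

```python
def path_to_root(parent, node):
    """Return the list of nodes from `node` up to the root (inclusive)."""
    path = [node]
    while node in parent:
        node = parent[node]
        path.append(node)
    return path


def closest_common_ancestor(parent, u, v):
    """Return the deepest node that is an ancestor of both u and v."""
    ancestors_u = set(path_to_root(parent, u))
    for node in path_to_root(parent, v):
        if node in ancestors_u:
            return node
    raise ValueError("nodes are not in the same tree")


def unshared_delay(parent, delay, u, v):
    """Sum the edge delays on the paths CCA -> u and CCA -> v."""
    cca = closest_common_ancestor(parent, u, v)
    total = 0
    for node in (u, v):
        while node != cca:
            total += delay[node]  # delay of the edge parent[node] -> node
            node = parent[node]
    return total


# Hypothetical tree: r -> x, y; x -> a, b; y -> c, d (delays in ps).
parent = {"a": "x", "b": "x", "c": "y", "d": "y", "x": "r", "y": "r"}
delay = {"a": 10, "b": 12, "c": 8, "d": 9, "x": 20, "y": 25}
print(closest_common_ancestor(parent, "a", "b"), unshared_delay(parent, delay, "a", "b"))
print(closest_common_ancestor(parent, "b", "c"), unshared_delay(parent, delay, "b", "c"))
```

In this toy tree the pair (b, c) shares no delay below the root, so under the stated assumption it would be more exposed to variation than the pair (a, b).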

Problem Formulation (1) Estimate and. (2) Construct clock trees with and. Construct a clock tree with and !

SCG and Safety Margins
Find: a b c d
With safety margin to OCV
(Figure: SCG on sinks a, b, c, d; the surviving edge weights include -30 and 10.)
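
A hedged sketch of one way to find arrival times under a uniform safety margin. It assumes that SCG stands for a skew constraint graph in which an edge (u, v, w) encodes t_v - t_u <= w, and that a margin M is modeled by tightening every edge to w - M. The function name and the example graph are hypothetical; the feasibility test is a plain Bellman-Ford negative-cycle check.

```python
def schedule_with_margin(nodes, edges, margin):
    """Return arrival times with t_v - t_u <= w - margin for every edge, or None.

    Starting every distance at 0 is equivalent to adding a virtual source
    with zero-weight edges to all nodes.
    """
    dist = {n: 0.0 for n in nodes}
    for _ in range(len(nodes)):
        changed = False
        for u, v, w in edges:
            if dist[u] + w - margin < dist[v]:
                dist[v] = dist[u] + w - margin
                changed = True
        if not changed:
            return dist  # converged: all tightened constraints hold
    return None  # still relaxing after |V| passes: negative cycle, infeasible


# Hypothetical two-sink example: -30 <= t_b - t_a <= 10.
nodes = ["a", "b"]
edges = [("a", "b", 10), ("b", "a", 30)]
print(schedule_with_margin(nodes, edges, margin=15))
print(schedule_with_margin(nodes, edges, margin=25))
```

In this example the cycle a -> b -> a has total weight 40 and two edges, so the tightened graph stays free of negative cycles only for margins up to 20.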

Greedy-UST/DME
$\mathrm{FSR}_{ab} = [-d_{ab}, d_{ba}]$
(Figure: source and sinks a, b, c, d; the surviving labels include 20 and 30.)
[17] C.-W. A. Tsao and C.-K. Koh. UST/DME: a clock tree router for general skew constraints. ACM TODAES, pages 359–379.
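
The feasible skew range can be sketched with all-pairs shortest paths. The sketch assumes that d_ab is the shortest-path distance from a to b in the SCG, that an edge (u, v, w) encodes t_v - t_u <= w, and that the skew of the pair (a, b) is t_a - t_b. The function names and the example graph are hypothetical.

```python
import math


def all_pairs_shortest(nodes, edges):
    """Floyd-Warshall distances; assumes the graph has no negative cycle."""
    d = {u: {v: (0 if u == v else math.inf) for v in nodes} for u in nodes}
    for u, v, w in edges:
        d[u][v] = min(d[u][v], w)
    for k in nodes:
        for i in nodes:
            for j in nodes:
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    return d


def feasible_skew_range(d, a, b):
    """Return FSR_ab = [-d_ab, d_ba] for the skew t_a - t_b."""
    return -d[a][b], d[b][a]


# Hypothetical SCG: -30 <= t_b - t_a <= 10 and -15 <= t_c - t_b <= 5.
nodes = ["a", "b", "c"]
edges = [("a", "b", 10), ("b", "a", 30), ("b", "c", 5), ("c", "b", 15)]
d = all_pairs_shortest(nodes, edges)
print(feasible_skew_range(d, "a", "b"))
print(feasible_skew_range(d, "a", "c"))
```

The pair (a, c) has no direct constraint in this example, yet it still receives a finite range because the constraints through b imply one.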

Construction of Latency-Bounded Clock Trees
(Figure: sinks i and j, annotated with 175 ps, 225 ps, 75 ps, and 125 ps.)

Latency
The latency depends on the subtree latencies and the skew constraints.
(Figure: two diagrams of sinks a, b, c.)

Proposed Latency Constraint Graph (LCG)
(Figure: SCG on a, b, c extended with a virtual source and a virtual sink to form the LCG; latency path; negative edge weights.)
Bottom-up shortest path to vertex i: =[65, 55, 40] - (0 +(-10) +(-15)+(-40) = [65, 55, 40]

Root Construction
Compute delay insertions
Minimum latency and maximal sharing of delay insertions
(Figure: LCG on a, b, c and the resulting tree over c, a, b.)

Root Construction
Sort based on
Topology selection
Inexact delay realization
Maximal sharing of delay insertions
(Figure: sinks a, b, c and a tree over c, a, b; the surviving label is 25.)

Tree Construction
Delay insertions
Skew commitments
(Figure: graph on a, b, c with a virtual source; edges marked as delay insertions and skew commitments; the surviving weights are -(40 + X) and -15.)

Latency-Bounded Tree Construction
Feasible latency range
(Figure: graph on a, b, c with a virtual source and a virtual sink; labels FLSR, FSR, FLR; the surviving weight is -5.)

Virtual latency
Root location
Virtual latency = $d_{root} \cdot c_{delay}$
(Figure: graph on a, b, c with a virtual source and a virtual sink.)
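
A minimal sketch of the virtual-latency product. It assumes that d_root is the Manhattan distance from a subtree root to the clock root location and that c_delay is a delay per unit distance; neither definition is given in the transcript, and the function names, coordinates, and units are hypothetical.

```python
def manhattan(p, q):
    """Manhattan (rectilinear) distance between two (x, y) points."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


def virtual_latency(subtree_root, root_location, c_delay):
    """Estimate the latency still to be added above a subtree: d_root * c_delay."""
    return manhattan(subtree_root, root_location) * c_delay


# Hypothetical values: coordinates in um, c_delay in ps per um.
print(virtual_latency((0, 0), (300, 400), c_delay=0.5))
```

Under these assumptions the estimate shrinks as merging moves a subtree root closer to the clock root, so it has to be updated after each merge.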

Flow
(Flowchart labels: CTS, CTO, Input, Output, Merging, Buffer insertion, Input to CTS, Output from CTS, Subtree dragging, Latency-aware merging, Latency locking, Update of virtual latency, Root construction.)

Experimental Setup
Arbitrary skew constraints
Monte Carlo framework with on-chip variations
– Process variations
– Voltage variations
– Temperature variations
Benchmarks (table columns: Name, Sinks, Skew constraints): scaled_s, ecg, aes
[14] C. N. Sze. ISPD 2010 high performance clock network synthesis contest: benchmark suite and results. ISPD’10.
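
A rough sketch of a Monte Carlo evaluation that reports TNS, WNS, and yield. It is not the framework used in the experiments: it assumes independent Gaussian noise on every sink arrival time, a constraint format (i, j, lo, hi) meaning lo <= t_i - t_j <= hi, TNS averaged over samples, and WNS taken as the worst value over all samples. All names and numbers are hypothetical.

```python
import random


def monte_carlo(constraints, arrival, sigma, samples=1000, seed=0):
    """Return yield (%), mean TNS, and worst WNS over perturbed samples."""
    rng = random.Random(seed)
    passed, tns_sum, wns = 0, 0.0, 0.0
    for _ in range(samples):
        # Perturb every nominal arrival time (assumed variation model).
        t = {s: a + rng.gauss(0.0, sigma) for s, a in arrival.items()}
        negative = []
        for i, j, lo, hi in constraints:
            skew = t[i] - t[j]
            slack = min(hi - skew, skew - lo)
            if slack < 0:
                negative.append(slack)
        tns_sum += sum(negative)
        wns = min(wns, min(negative, default=0.0))
        passed += not negative  # sample passes if no constraint is violated
    return {"yield": 100.0 * passed / samples, "tns": tns_sum / samples, "wns": wns}


constraints = [("a", "b", -30, 10)]
print(monte_carlo(constraints, {"a": 100, "b": 105}, sigma=5.0))
```

With sigma set to 0 the function reduces to a nominal timing check, which is a convenient sanity test before adding variation.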

Various Safety Margins on ecg
Table columns: Circuit, Structure, M_user, Latency (ps), Cap (pF); before CTO: TNS (ps), WNS (ps), Yield (%); after CTO: TNS (ps), WNS (ps), Yield (%).
Table rows: three groups for ecg, each with the structures Tree in [8], Tree, and R-Tree.
[8] R. Ewetz and C.-K. Koh. A Useful Skew Tree Framework for Inserting Large Safety Margins. ISPD’15.

Latency-Bounded Clock Trees
Table columns: Circuit, Structure, M_user (ps), L_user (ps), Latency (ps), Cap (pF); before CTO: TNS (ps), WNS (ps), Yield (%); after CTO: TNS (ps), WNS (ps), Yield (%).
Table rows:
– s15: R-Tree (M_user = 25, L_user = ∞), L-R-Tree, L-R-Tree
– ecg: R-Tree (M_user = 30, L_user = ∞), L-R-Tree, L-R-Tree
– aes: R-Tree (M_user = 50, L_user = ∞), L-R-Tree, L-R-Tree

Summary and Future Work
Proposed a latency constraint graph
Tree construction based on the LCG
Estimate and.
Questions?