Research on Analysis and Physical Synthesis Chung-Kuan Cheng CSE Department UC San Diego

Slides:

Advertisements

Similar presentations

Design Rule Generation for Interconnect Matching Andrew B. Kahng and Rasit Onur Topaloglu {abk | rtopalog University of California, San Diego.

Advertisements

OCV-Aware Top-Level Clock Tree Optimization

A Novel 3D Layer-Multiplexed On-Chip Network

4/22/ Clock Network Synthesis Prof. Shiyan Hu Office: EREC 731.

Transmission Line Network For Multi-GHz Clock Distribution Hongyu Chen and Chung-Kuan Cheng Department of Computer Science and Engineering, University.

Computer Science & Engineering Department University of California, San Diego SPICE Diego A Transistor Level Full System Simulator Chung-Kuan Cheng May.

Boosters for Driving Long On-chip Interconnects : Design Issues, Interconnect Synthesis and Comparison with Repeaters Ankireddy Nalamalpu Intel Corporation/Hillsboro.

Noise Model for Multiple Segmented Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu †, Niranjan A. Pol ‡ and Devendra Vidhani* UCSD CSE and ECE.

Sequential Definitions  Use two level sensitive latches of opposite type to build one master-slave flipflop that changes state on a clock edge (when the.

CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 19: Timing Issues; Introduction to Datapath.

Clock Design Adopted from David Harris of Harvey Mudd College.

Chapter 11 Timing Issues in Digital Systems Boonchuay Supmonchai Integrated Design Application Research (IDAR) Laboratory August 20, 2004; Revised - July.

Chapter 5 Interconnect RLC Model n Efficient capacitance model Efficient inductance model Efficient inductance model RC and RLC circuit model generation.

SoC Interconnect Modeling Venkata Krishna N. Dhulipala 11/20/2008.

Supply Voltage Degradation Aware Analytical Placement Andrew B. Kahng, Bao Liu and Qinke Wang UCSD CSE Department {abk, bliu,

04/09/02EECS 3121 Lecture 25: Interconnect Modeling EECS 312 Reading: 8.3 (text), 4.3.2, (2 nd edition)

Lecture #25a OUTLINE Interconnect modeling

Interconnessioni e parassiti1 Progettazione di circuiti e sistemi VLSI Anno Accademico Lezione Interconnessioni e parassiti.

Effects of Global Interconnect Optimizations on Performance Estimation of Deep Sub-Micron Design Yu (Kevin) Cao 1, Chenming Hu 1, Xuejue Huang 1, Andrew.

UCSD CSE245 Notes -- Spring 2006 CSE245: Computer-Aided Circuit Simulation and Verification Lecture Notes Spring 2006 Prof. Chung-Kuan Cheng.

A Global Minimum Clock Distribution Network Augmentation Algorithm for Guaranteed Clock Skew Yield A. B. Kahng, B. Liu, X. Xu, J. Hu* and G. Venkataraman*

Lecture 24: Interconnect parasitics

Circuit Performance Variability Decomposition Michael Orshansky, Costas Spanos, and Chenming Hu Department of Electrical Engineering and Computer Sciences,

Statistical Gate Delay Calculation with Crosstalk Alignment Consideration Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego

Lecture 7: Power.

Effects of Global Interconnect Optimizations on Performance Estimation of Deep Sub-Micron Design Yu Cao, Chenming Hu, Xuejue Huang, Andrew B. Kahng, Sudhakar.

THEORETICAL LIMITS FOR SIGNAL REFLECTIONS DUE TO INDUCTANCE FOR ON-CHIP INTERCONNECTIONS F. Huret, E. Paleczny, P. Kennis F. Huret, E. Paleczny, P. Kennis.

Noise and Delay Uncertainty Studies for Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu † and Devendra Vidhani ‡ UCLA Computer Science Department,

CSE245: Computer-Aided Circuit Simulation and Verification Lecture Note 2: State Equations Prof. Chung-Kuan Cheng.

-1- UC San Diego / VLSI CAD Laboratory A Global-Local Optimization Framework for Simultaneous Multi-Mode Multi-Corner Clock Skew Variation Reduction Kwangsoo.

CAD for Physical Design of VLSI Circuits

Research in IC Packaging Electrical and Physical Perspectives

1 ECE 556 Design Automation of Digital Systems By Prof. Charlie Chung-Ping Chen ECE Department UW-Madison.

1 Interconnect and Packaging Lecture 8: Clock Meshes and Shunts Chung-Kuan Cheng UC San Diego.

EE141 © Digital Integrated Circuits 2nd Wires 1 Digital Integrated Circuits A Design Perspective The Interconnect Jan M. Rabaey Anantha Chandrakasan Borivoje.

© Digital Integrated Circuits 2nd Interconnect ECE 558/658 : Lecture 20 Interconnect Design (Chapter 9) Clock distribution (Chapter ) Atul Maheshwari.

1 ε -Optimal Minimum-Delay/Area Zero-Skew Clock Tree Wire-Sizing in Pseudo-Polynomial Time Jeng-Liang Tsai Tsung-Hao Chen Charlie Chung-Ping Chen (National.

Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 33: November 20, 2013 Crosstalk.

1 Interconnect/Via. 2 Delay of Devices and Interconnect.

Modern VLSI Design 4e: Chapter 3 Copyright  2008 Wayne Wolf Topics n Wire delay. n Buffer insertion. n Crosstalk. n Inductive interconnect. n Switch logic.

Distributed Computation: Circuit Simulation CK Cheng UC San Diego

Modern VLSI Design 3e: Chapter 7 Copyright  1998, 2002 Prentice Hall PTR Topics n Power/ground routing. n Clock routing. n Floorplanning tips. n Off-chip.

Power Integrity Test and Verification CK Cheng UC San Diego 1.

By Nasir Mahmood.  The NoC solution brings a networking method to on-chip communication.

CSE245: Computer-Aided Circuit Simulation and Verification Lecture Note 2: State Equations Spring 2010 Prof. Chung-Kuan Cheng.

Interconnect/Via.

Surfliner: Distortion-less Electrical Signaling for Speed of Light On- chip Communication Hongyu Chen, Rui Shi, Chung-Kuan Cheng Computer Science and Engineering.

Chapter 4: Secs ; Chapter 5: pp

Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Wire delay. n Buffer insertion. n Crosstalk. n Inductive interconnect.

Inductance Screening and Inductance Matrix Sparsification 1.

High Performance Interconnect and Packaging Chung-Kuan Cheng CSE Department UC San Diego

1 Revamping Electronic Design Process to Embrace Interconnect Dominance Chung-Kuan Cheng CSE Department UC San Diego La Jolla, CA

-1- Delay Uncertainty and Signal Criticality Driven Routing Channel Optimization for Advanced DRAM Products Samyoung Bang #, Kwangsoo Han ‡, Andrew B.

Penn ESE370 Fall DeHon 1 ESE370: Circuit-Level Modeling, Design, and Optimization for Digital Systems Day 30: November 21, 2012 Crosstalk.

MICROPROCESSOR DESIGN1 IR/Inductive Drop Introduction One component of every chip is the network of wires used to distribute power from the input power.

Piero Belforte, HDT 1999: PRESTO POWER by Alessandro Arnulfo.

Worst Case Crosstalk Noise for Nonswitching Victims in High-Speed Buses Jun Chen and Lei He.

Crosstalk If both a wire and its neighbor are switching at the same time, the direction of the switching affects the amount of charge to be delivered and.

High Performance Interconnect and Packaging

Jason Cong, David Zhigang Pan & Prasanna V. Srinivas

EE201C Chapter 3 Interconnect RLC Modeling

Interconnect Dominated Design and Analysis

Interconnect Architecture

Inductance Screening and Inductance Matrix Sparsification

Wire Indctance Consequences of on-chip inductance include:

Multiport, Multichannel Transmission Line: Modeling and Synthesis

COPING WITH INTERCONNECT

Applications of GTX Y. Cao, X. Huang, A.B. Kahng, F. Koushanfar, H. Lu, S. Muddu, D. Stroobandt and D. Sylvester Abstract The GTX (GSRC Technology Extrapolation)

Jason Cong, David Zhigang Pan & Prasanna V. Srinivas

Presentation transcript:

Research on Analysis and Physical Synthesis Chung-Kuan Cheng CSE Department UC San Diego

Outlines Analysis (Signal Integrity) SPICE Diego RLC Reduction Synthesis (Interconnect Dominant) Networks on Chip Clock Distribution Floorplanning Datapath Packaging (High Performance)

Analysis: SPICE Large netlist, e.g. 100M transistors, 5G Hz Strong Coupling: interconnect delay, crosstalk, voltage drop, ground bounce Process Variations Short Channel Devices

Why SPICE Diego is better? SPICE Diego : fast accurate transistor level circuit simulator Powerful Matrix Solver Engine Transistor devices. Capable of capturing coupling effects. Device Model including Miller’s effect Less Memory Requirement (no LU factorization, dose not save matrix for transistors) Application interconnect delay Crosstalk voltage drop, ground bounce simultaneous switching noise

Experimental Results chip board Power Supply Test Case Board / Packaging / Chip Power Network Fully coupled packaging inductance 60k elements, 5000 nodes. Spice failed Our tool Less than 10 minutes

Synthesis: Clock Distribution Process variations causes significant amount of clock skew Working frequency keeps increasing, skew accounts for large portion of clock period Mesh is effective to reduce skew There is no theoretical design guide line for mesh structure

State-of-the-art In Engineering practice, very deep balanced buffer tree + mesh is widely adopted for global clock distribution IBM Power 4: 64 by 64 grid at the bottom of an H- tree Intel IA: clock stripe at the bottom of a buffer tree. “Skew Averaging”: shunt at different levels “Skew Averaging Factor” determined by simulation. No guideline for routing resource planning known yet

Clock Mesh Example (1) DEC Alpha 21264

Clock Mesh Example (2) IBM Power4 H-tree drives one domain clock mesh 8x8 area buffers

Clock Mesh Example (3) Intel Pentium 4 Tree drives three spines

Our Contributions and On-going Efforts Contribution: Analytical skew expression using R,C model Proposed generalized multi-level mesh network structure for skew reduction Optimal allocation of routing resources among meshes On-going Study: More accurate R,L,C delay model Signal propagation on a uniform mesh

Multi-level mesh structure

Skew on mesh Skew expression

Optimization Skew function Multi level skew function

Die size 1cm by 1cm 100nm copper technology Ground Shielded Differential Signal Wires for Global Clock Distribution Routing area is normalized to the area of a 16 by 16 mesh with minimal wire width Clock Design Settings +- GND

Delay Surfaces

Robustness Against Supply Voltage Variations

Y Architecture Chip-Package Breakaway Packaging

Grids of X and Y Architectures ( X-Architecture Y-Architecture

Clock Tree on Square Mesh N-level clock tree: path length 21% less than H- tree total wire length 9% less than H tree, 3% less than X tree No self-overlapping between parallel wire segments

Chip to Package Breakaway Manhattan Architecture

Y Architecture

Row by rowComparison IndentTwo sides Chip-Package Breakaways

Conclusion Analysis: Signal Integrity Synthesis: Interconnect Dominant Packaging: Performance Goals: Performance, Cost Resources: Physical Space Constraints: Yield, Signal Integrity