Analysis and Avoidance of Cross-talk in on-chip buses Chunjie Duan Ericsson Wireless Communications Anup Tirumala Jasmine Networks Sunil P Khatri University.

Slides:



Advertisements
Similar presentations
Exploiting Crosstalk to Speed up On-chip Buses Chunjie Duan Ericsson Wireless, Boulder Sunil P Khatri University of Colorado, Boulder.
Advertisements

Design Rule Generation for Interconnect Matching Andrew B. Kahng and Rasit Onur Topaloglu {abk | rtopalog University of California, San Diego.
1 Modeling and Optimization of VLSI Interconnect Lecture 9: Multi-net optimization Avinoam Kolodny Konstantin Moiseev.
Net-Ordering for Optimal Circuit Timing in Nanometer Interconnect Design M. Sc. work by Moiseev Konstantin Supervisors: Dr. Shmuel Wimer, Dr. Avinoam Kolodny.
Leakage and Dynamic Glitch Power Minimization Using MIP for V th Assignment and Path Balancing Yuanlin Lu and Vishwani D. Agrawal Auburn University ECE.
NETWORKING CONCEPTS. ERROR DETECTION Error occures when a bit is altered between transmission& reception ie. Binary 1 is transmitted but received is binary.
March 8, 2006“Bus Stuttering”1 Bus Stuttering : An Encoding Technique To Reduce Inductive Noise In Off-Chip Data Transmission DATE 2006 Session 5B: Timing.
Sang-Chun Han Hwangjun Song Jun Heo International Conference on Intelligent Hiding and Multimedia Signal Processing (IIH-MSP), Feb, /05 Feb 2009.
University of Illinois, Urbana-Champaign
Chapter 5 Interconnect RLC Model n Efficient capacitance model Efficient inductance model Efficient inductance model RC and RLC circuit model generation.
Design of Variable Input Delay Gates for Low Dynamic Power Circuits
Turbo Codes – Decoding and Applications Bob Wall EE 548.
1 Modular Arithmetic Logic Unit By Salvador Sandoval & Lucas Morales Advisor: Dave Parent December 6, 2004.
A Novel Clock Distribution and Dynamic De-skewing Methodology Arjun Kapoor – University of Colorado at Boulder Nikhil Jayakumar – Texas A&M University,
1 Simple FPGA David, Ronald and Sudha Advisor: Dave Parent 12/05/2005.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 15: Interconnects & Wire Engineering Prof. Sherief Reda Division of Engineering,
TH EDA NTHU-CS VLSI/CAD LAB 1 Re-synthesis for Reliability Design Shih-Chieh Chang Department of Computer Science National Tsing Hua University.
Energy Efficient and High Speed On-Chip Ternary Bus Chunjie Duan Mitsubishi Electric Research Labs, Cambridge, MA, USA Sunil P. Khatri Texas A&M University,
Performance Driven Crosstalk Elimination at Compiler Level TingTing Hwang Department of Computer Science Tsing Hua University, Taiwan.
Effects of Global Interconnect Optimizations on Performance Estimation of Deep Sub-Micron Design Yu (Kevin) Cao 1, Chenming Hu 1, Xuejue Huang 1, Andrew.
A Graph-based Framework for Transmission of Correlated Sources over Multiuser Channels Suhan Choi May 2006.
1 A Deep Sub-Micron VLSI Design Flow using Layout Fabrics Sunil P. Khatri University of Colorado, Boulder Amit Mehrotra University of Illinois, Urbana-Champaign.
October 5, 2005“Broadband Impedance Matching”1 Broadband Impedance Matching for Inductive Interconnect in VLSI Packages ICCD 2005 Authors: Brock J. LaMeres,
Effects of Global Interconnect Optimizations on Performance Estimation of Deep Sub-Micron Design Yu Cao, Chenming Hu, Xuejue Huang, Andrew B. Kahng, Sudhakar.
Electronic Counters.
1 Encoding-based Minimization of Inductive Cross-talk for Off-Chip Data Transmission Brock J. LaMeres Agilent Technologies, Inc. Sunil P. Khatri Dept.
Yulei Zhang1, James F. Buckwalter1, and Chung-Kuan Cheng2
Worst-Case Timing Jitter and Amplitude Noise in Differential Signaling Wei Yao, Yiyu Shi, Lei He, Sudhakar Pamarti, and Yu Hu Electrical Engineering Dept.,
Pipelined Two Step Iterative Matching Algorithms for CIOQ Crossbar Switches Deng Pan and Yuanyuan Yang State University of New York, Stony Brook.
A Methodology for Interconnect Dimension Determination By: Jeff Cobb Rajesh Garg Sunil P Khatri Department of Electrical and Computer Engineering, Texas.
Super-Orthogonal Space- Time BPSK Trellis Code Design for 4 Transmit Antennas in Fast Fading Channels Asli Birol Yildiz Technical University,Istanbul,Turkey.
Capturing Crosstalk-Induced Waveform for Accurate Static Timing Analysis Masanori Hashimoto, Yuji Yamada, Hidetoshi Onodera Kyoto University.
Crosstalk Calculation and SLEM. 2 Crosstalk Calculation Topics  Crosstalk and Impedance  Superposition  Examples  SLEM.
TSV-Aware Analytical Placement for 3D IC Designs Meng-Kai Hsu, Yao-Wen Chang, and Valerity Balabanov GIEE and EE department of NTU DAC 2011.
1/ , Graz, Austria Power Spectral Density of Convolutional Coded Pulse Interval Modulation Z. Ghassemlooy, S. K. Hashemi and M. Amiri Optical Communications.
1 Sign Bit Reduction Encoding for Low Power Applications Hsin-Wei Lin Saneei, M. Afzali-Kusha, A. and Navabi, Z. Sign Bit Reduction Encoding for Low Power.
INTRODUCTION Interconnection wiring is gaining a significant importance in speed of modern VLSI circuits. Since the wiring may cover up to eighty percent.
1/20 A Novel Technique for Input Vector Compression in System-on-Chip Testing Student: Chien Nan Lin Satyendra Biswas, Sunil Das, and Altaf Hossain,” Information.
Error-Correction &Crosstalk Avoidance in DSM Busses Ketan Patel and Igor Markov University of Michigan Electrical Engineering & Computer Science 2003 ACM.
Prof. Amr Goneid, AUC1 Analysis & Design of Algorithms (CSCE 321) Prof. Amr Goneid Department of Computer Science, AUC Part 8. Greedy Algorithms.
RF network in SoC1 SoC Test Architecture with RF/Wireless Connectivity 1. D. Zhao, S. Upadhyaya, M. Margala, “A new SoC test architecture with RF/wireless.
Forbidden Transition Free Crosstalk Avoidance CODEC Design Chunjie Duan Mitsubishi Electric Research Labs, Cambridge, MA, USA Chengyu Zhu Polaris Microelectronic.
1 Bus Encoding for Total Power Reduction Using a Leakage-Aware Buffer Configuration 班級:積體所碩一 學生:林欣緯 指導教授:魏凱城 老師 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION.
Introduction of Low Density Parity Check Codes Mong-kai Ku.
Coding Theory. 2 Communication System Channel encoder Source encoder Modulator Demodulator Channel Voice Image Data CRC encoder Interleaver Deinterleaver.
StrideBV: Single chip 400G+ packet classification Author: Thilan Ganegedara, Viktor K. Prasanna Publisher: HPSR 2012 Presenter: Chun-Sheng Hsueh Date:
1 Interconnect/Via. 2 Delay of Devices and Interconnect.
Unconventional Fixed-Radix Number Systems
Timo O. Korhonen, HUT Communication Laboratory 1 Convolutional encoding u Convolutional codes are applied in applications that require good performance.
EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Inductance Screening and Inductance Matrix Sparsification 1.
Investigation of crosstalk in the readout structure of the Beamcal K.Afanaciev.
Bounds on Redundancy in Constrained Delay Arithmetic Coding Ofer ShayevitzEado Meron Meir Feder Ram Zamir Tel Aviv University.
Evaluating and Optimizing IP Lookup on Many Core Processors Author: Peng He, Hongtao Guan, Gaogang Xie and Kav´e Salamatian Publisher: International Conference.
-1- Delay Uncertainty and Signal Criticality Driven Routing Channel Optimization for Advanced DRAM Products Samyoung Bang #, Kwangsoo Han ‡, Andrew B.
Fast VLSI Implementation of Sorting Algorithm for Standard Median Filters Hyeong-Seok Yu SungKyunKwan Univ. Dept. of ECE, Vada Lab.
Effects of Inductance on the Propagation Delay and Repeater Insertion in VLSI Circuits Yehea I. Ismail and Eby G. Friedman, Fellow, IEEE.
Bus Encoding to Prevent Crosstalk Delay Bert Victor and Kurt Keutzer ICCAD 2001.
July 10, th VLSI Design and Test Symposium1 BIST / Test-Decompressor Design using Combinational Test Spectrum Nitin Yogi Vishwani D. Agrawal Auburn.
On Reliable Modular Testing with Vulnerable Test Access Mechanisms Lin Huang, Feng Yuan and Qiang Xu.
January 27, Controlling Inductive Cross-talk and Power in Off-chip Buses using CODECs ASP-DAC 2006 Session 8C-5: Inductive Issues in Power Grids.
Slide 1 SLIP 2004 Payman Zarkesh-Ha, Ken Doniger, William Loh, and Peter Bendix LSI Logic Corporation Interconnect Modeling Group February 14, 2004 Prediction.
Reducing Structural Bias in Technology Mapping
Worst Case Crosstalk Noise for Nonswitching Victims in High-Speed Buses Jun Chen and Lei He.
Contents Introduction Bus Power Model Related Works Motivation
Inductance Screening and Inductance Matrix Sparsification
Guihai Yan, Yinhe Han, Xiaowei Li, and Hui Liu
A Block Based MAP Segmentation for Image Compression
Authors: A. Rasmussen, A. Kragelund, M. Berger, H. Wessing, S. Ruepp
IV. Convolutional Codes
Presentation transcript:

Analysis and Avoidance of Cross-talk in on-chip buses Chunjie Duan Ericsson Wireless Communications Anup Tirumala Jasmine Networks Sunil P Khatri University of Colorado, Boulder

Outline  Introduction  Classification of Cross-talk types  Eliminating 3C and 4C sequences  Eliminating 4C sequences  Experimental Results  Conclusions

Introduction  Verified cross-talk trends  Accurate 3-D capacitance extraction  Delay variation 2.47:1 (200  m wires, 10X drivers, 0.1  m technology)  Deep sub-micron process s t w a v a CICI CLCL v a CLCL CLCL CICI a v a CLCL v a CLCL CICI CICI CLCL a a v a CLCL v CLCL CLCL CICI CICI a CICI a a v v CICI CLCL CLCL CLCL CICI CICI CLCL CLCL CLCL CICI CLCL CICI CLCL CLCL

Cross-talk vs Bus Data Pattern  When λ ~ 0.1μm, r = C I /C L > 10 (metal 4)  Effective total capacitance depends on bus data sequence :  Best case: 0 x C I x L  Worst case: 4 x C I x L 0·C I C total = 0 ·C I C total = 4 ·C I 0·C I 2·C I

Classification of Cross-talk  4·C sequence:  3·C sequence  2·C sequence  1·C sequence  0·C sequence  Forbidden patterns (“010” and “101”)

Eliminating 3C & 4C Sequences  Motivation  Maximum bus data rate depends on total capacitance seen by any bit  Removing 3C and 4C sequences will increase the maximum data rate  Simple approach: shielding  g s g s g s g... (ground line between signals)  No 3C or 4C sequences possible  However, bus-width is doubled  Coding gain = (throughput/area) with coding (throughput/area) without coding  Coding gain = 0 for this approach - 1

Eliminating 3C & 4C Sequences  Theorem: If no forbidden patterns are allowed on the bus,  Proof: see paper  Our approach:  Encode the data on the bus to get rid of the forbidden patterns  Questions to be answered:  What is the number of redundancy bits (and the coding gain)?  How to practically implement such a CODEC ?

Number of Redundancy Bits  Map the n bit bus to a k=n+r bit bus so that  the k bit data bus has no forbidden patterns  Definitions :  T(n): number of distinct n-bit vectors.  T(n)=2 n  T B (n): number of n-bit vectors which contain a forbidden pattern  T G (n): number of n-bit vectors which do not contain forbidden patterns  Let the sets of vectors be V(n), V B (n), and V G (n) respectively  Let v(n), v B (n) and v G (n) respectively represent an element of these sets  T GG (n): Number of n-bit vectors in V G (n) with last 2 bits ‘00’ or ‘11’  T GB (n): number of n-bit vectors in V G (n) with last two bits ‘01’ or ‘10’  Goal: to find the smallest k such that

Counting Forbidden Vectors  v(n) can be constructed by appending {0,1} to any v(n-1)  Two v(n) are constructed from any v(n-1)  Two v B (n) are constructed from any v B (n-1)  xxx010xx -> xxx010xx0, xxx010xx1  One v GG (n) and one v GB (n) are constructed from any v GG (n-1)  xxxxxx00 -> xxxxxx000, xxxxxx001  One v GG (n) and one v B (n) are constructed from any v GB (n-1)  xxxxxx01 -> xxxxxx010, xxxxxx011

Counting Forbidden Vectors  Algorithm  Initial conditions (n=3)  T(3) = 8, T G (3) = 6, T B (3)=2, T GG (3)=4, T GB (3)=2  Inductive step  T(n) = 2 x T(n-1);  T G (n) = 2 x T G (n-1) + T G (n-1)  T GG (n) = T GG (n-1) + T GB (n-1)  T B (n) = 2 x T B (n-1) + T GB (n-1)

Eliminating 3C & 4C sequences  44% overhead when n > 30 bits  Coding gain

3C & 4C CODEC Implementation  Implements a one-to-one map from V(n) to V G (k)  Look-Up Table, straightforward, can achieve minimum overhead (44%), but not practical  Our implementation  62.5% overhead (higher than minimum)  Modular and straightforward  Break bus into 4-bit groups  Encode each group independently (4bit -> 5 bit)  Additional logic to handle across-the-boundary forbidden patterns  Ripple effect (Eliminated by pipelining)

3C & 4C CODEC Implementation CODEC block diagram b0b1b2b3b0b1b2b3 b4b5b6b7b4b5b6b7 b 8 b 9 b 10 b 11 b 12 b 13 b 14 b 15

Eliminating 4C sequences  Less aggressive: eliminating 4C sequences only  Less overhead (33%) : simpler implementation  Simpler algorithm  Divide the bus into 3 bit groups  When 4C sequence occurs, complement group data  Insert group complement indicator  Special handling for across-the-boundary forbidden sequences (see paper for details)  Examples:  >  >

Experimental Results  Bus simulations  CODEC was not modeled  Spice3, 0.1μm model  Transmission line with inter-wire coupling  Quantify delay dependency on bus vector sequences  CODEC implementation  Currently implemented 3C & 4C CODEC  Matching delay on CODEC outputs  4C CODEC implementation planned in future

Bus Simulation Results  Bus length 5mm, 10mm or 20mm  Driver strength 30X, 60X and 120X of minimum

CODEC Results  Compare waveform with coding and w/o coding  Random input sequence Random sequence Recovered sequence encoderdecoder driver receiver Random sequence Recovered sequence encoderdecoder driver receiver  Encoder/decoder delay ~250ps  Max data rate more than 2X compared to scheme with no encoding

CODEC Results  random sequence directly into bus buffer  20mm trace  45x buffer  > 1ns delay variation  Random sequence into 3C & 4C encoder  20mm trace  45x buffer  < 500ps delay variation

Experimental Results Reshaped data after receivers  without coding,  edge jitter ~ 1000ps  with coding  edge jitter < 500ps

Conclusions  Inter-wire capacitance increasingly significant in DSM VLSI interconnect  Total capacitance is heavily dependent on bus data sequence  With 44% overhead, we can eliminate 3C & 4C cross- talk  Compared to shielding, which has 100% overhead  Implemented CODEC to eliminate 3C and 4C cross- talk sequences  Proposed CODEC to eliminate 4C cross-talk sequences with 33% overhead  Simulation results match our analysis.

Thank You!