“Performance Model for Inter-chip Busses”1 Performance Model for Inter-chip Busses Considering Bandwidth and Cost ISCAS 2005 Authors: Brock J. LaMeres.

Slides:



Advertisements
Similar presentations
Exploiting Crosstalk to Speed up On-chip Buses Chunjie Duan Ericsson Wireless, Boulder Sunil P Khatri University of Colorado, Boulder.
Advertisements

Accelerating DRAM Performance
Computer Science & Engineering Department University of California, San Diego SPICE Diego A Transistor Level Full System Simulator Chung-Kuan Cheng May.
Trends and Perspectives in deep-submicron IC design Bram Nauta MESA + Research Institute University of Twente, Enschede, The Netherlands University of.
EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Noise Model for Multiple Segmented Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu †, Niranjan A. Pol ‡ and Devendra Vidhani* UCSD CSE and ECE.
1 Closed-Loop Modeling of Power and Temperature Profiles of FPGAs Kanupriya Gulati Sunil P. Khatri Peng Li Department of ECE, Texas A&M University, College.
March 8, 2006“Bus Stuttering”1 Bus Stuttering : An Encoding Technique To Reduce Inductive Noise In Off-Chip Data Transmission DATE 2006 Session 5B: Timing.
Digital Integrated Circuits© Prentice Hall 1995 Interconnect COPING WITH INTERCONNECT.
EELE 461/561 – Digital System Design Module #5 Page 1 EELE 461/561 – Digital System Design Module #5 – Crosstalk Topics 1.Near-End and Far-End Crosstalk.
EE466: VLSI Design 2009 Term Project. Expectation To introduce you to basic research ideas Involves reading research paper Extraction of relevant parameters.
Hierarchical Error Correction Codes over Multi-Bit Differential Signaling Jason D. Bakos.
EELE 461/561 – Digital System Design Module #6 Page 1 EELE 461/561 – Digital System Design Module #6 – Differential Signaling Topics 1.Differential and.
1 COMP 206: Computer Architecture and Implementation Montek Singh Wed., Nov. 13, 2002 Topic: Main Memory (DRAM) Organization.
 2003 Micron Technology, Inc. All rights reserved. Information is subject to change without notice. High Performance Next­ Generation Memory Technology.
Energy Efficient and High Speed On-Chip Ternary Bus Chunjie Duan Mitsubishi Electric Research Labs, Cambridge, MA, USA Sunil P. Khatri Texas A&M University,
1 A Single-supply True Voltage Level Shifter Rajesh Garg Gagandeep Mallarapu Sunil P. Khatri Department of Electrical and Computer Engineering, Texas A&M.
Split, 12 December 2005 University of Zagreb Slide 1 Chip level EMC measurements and simulations “Impact of Communications Technology to EMC“, COST 286.
Lecture 24: Interconnect parasitics
Statistical Gate Delay Calculation with Crosstalk Alignment Consideration Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego
October 5, 2005“Broadband Impedance Matching”1 Broadband Impedance Matching for Inductive Interconnect in VLSI Packages ICCD 2005 Authors: Brock J. LaMeres,
Analysis and Avoidance of Cross-talk in on-chip buses Chunjie Duan Ericsson Wireless Communications Anup Tirumala Jasmine Networks Sunil P Khatri University.
© 2006 Pearson Education, Upper Saddle River, NJ All Rights Reserved.Brey: The Intel Microprocessors, 7e Chapter 15 PC Standard Bus Interfaces WK.
Noise and Delay Uncertainty Studies for Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu † and Devendra Vidhani ‡ UCLA Computer Science Department,
1 Encoding-based Minimization of Inductive Cross-talk for Off-Chip Data Transmission Brock J. LaMeres Agilent Technologies, Inc. Sunil P. Khatri Dept.
Yulei Zhang1, James F. Buckwalter1, and Chung-Kuan Cheng2
Practical Aspects of Logic Gates COE 202 Digital Logic Design Dr. Aiman El-Maleh College of Computer Sciences and Engineering King Fahd University of Petroleum.
Link A/D converters and Microcontrollers using Long Transmission Lines John WU Precision Analog - Data Converter Applications Engineer
DDR MEMORY  NEW TCEHNOLOGY  BANDWIDTH  SREVERS, WORKSTATION  NEXT GENERATION OF SDRAM.
A Methodology for Interconnect Dimension Determination By: Jeff Cobb Rajesh Garg Sunil P Khatri Department of Electrical and Computer Engineering, Texas.
1 Frequency response I As the frequency of the processed signals increases, the effects of parasitic capacitance in (BJT/MOS) transistors start to manifest.
Power Supply Aware Computing Pradeep S. Shenoy and Philip T. Krein Support provided by National Science Foundation under Grant ECS and by the.
Research on Analysis and Physical Synthesis Chung-Kuan Cheng CSE Department UC San Diego
Telefunken LVDS/M-LVDS as an alternative to RS-485/422.
DEVICES AND COMMUNICATION BUSES FOR DEVICES NETWORK
© 2009 Pearson Education, Upper Saddle River, NJ All Rights ReservedFloyd, Digital Fundamentals, 10 th ed Digital Fundamentals with PLD Programming.
Digital Integrated Circuits© Prentice Hall 1995 Interconnect COPING WITH INTERCONNECT.
Washington State University
K.C.RAVINDRAN,GRAPES-3 EXPERIMENT,OOTY 1 Development of fast electronics for the GRAPES-3 experiment at Ooty K.C. RAVINDRAN On Behalf of GRAPES-3 Collaboration.
Egle Cebelyte. Random Access Memory is simply the storage area where all software is loaded and works from; also called working memory storage.
A Robust Pulse-triggered Flip-Flop and Enhanced Scan Cell Design
Packet Dispersion in IEEE Wireless Networks Mingzhe Li, Mark Claypool and Bob Kinicki WPI Computer Science Department Worcester, MA 01609
An accurate and efficient SSO/SSN simulation methodology for 45 nm LPDDR I/O interface Dr. Souvik Mukherjee, Dr. Rajen Murugan (Texas Instruments Inc.)
Authors – Jeahyuk huh, Doug Burger, and Stephen W.Keckler Presenter – Sushma Myneni Exploring the Design Space of Future CMPs.
1 Interconnect/Via. 2 Delay of Devices and Interconnect.
Floyd, Digital Fundamentals, 10 th ed Digital Fundamentals Tenth Edition Floyd © 2008 Pearson Education Chapter 1.
Low Power, High-Throughput AD Converters
ECE 2799 Electrical and Computer Engineering Design ANALOG to DIGITAL CONVERSION Prof. Bitar Last Update:
Inductance Screening and Inductance Matrix Sparsification 1.
Sunpyo Hong, Hyesoon Kim
An O(bn 2 ) Time Algorithm for Optimal Buffer Insertion with b Buffer Types Authors: Zhuo Li and Weiping Shi Presenter: Sunil Khatri Department of Electrical.
Chapter 2. High-speed properties of logic gates.
1 Decentralized Adaptive Voltage Control with Distributed Energy Resources Presenter: Huijuan Li.
Low Power, High-Throughput AD Converters
A Mini project on “DRIVERLESS TRAIN ENGINE USING 8051 MICROCONTROLLER” GUIDED BY K.RAMBABU.
LengthWire Length Bond Wire Diameter 0.8 mil1.0 mil1.2 mil1.3 mil Resistance (Ohm) 5 mm mm Inductance (nH) 5.
January 27, Controlling Inductive Cross-talk and Power in Off-chip Buses using CODECs ASP-DAC 2006 Session 8C-5: Inductive Issues in Power Grids.
MICROPROCESSOR DESIGN1 IR/Inductive Drop Introduction One component of every chip is the network of wires used to distribute power from the input power.
Analysis of noise immunity at common circuits of the front end parts of high-speed transceivers S.V. Kondratenko NRNU MEPhI ICPPA-2016,
High Speed Properties of Digital Gates, Copyright F. Canavero, R. Fantino Licensed to HDT - High Design Technology
Chapter 11 Inductors.
RAM, CPUs, & BUSES Egle Cebelyte.
Chapter 3 Inductance and Capacitance
University of Colorado at Boulder
Inductance Screening and Inductance Matrix Sparsification
Digital Fundamentals Floyd Chapter 1 Tenth Edition
COPING WITH INTERCONNECT
Ground Recommendations Review of Recent Discussion
Frequency response I As the frequency of the processed signals increases, the effects of parasitic capacitance in (BJT/MOS) transistors start to manifest.
Demonstration of POM Flowline Optimizer
Presentation transcript:

“Performance Model for Inter-chip Busses”1 Performance Model for Inter-chip Busses Considering Bandwidth and Cost ISCAS 2005 Authors: Brock J. LaMeres University of Colorado, Boulder, CO Sunil P. Khatri Texas A&M University, College Station, TX.

“Performance Model for Inter-chip Busses”2 Problem : Packaging Limits Performance Transistor Technology is Faster than Package Technology IC “Moore’s Law” - # of transistors will double every 18 months Package “Rent’s Rule” - # of I/O will double in next 10 years

“Performance Model for Inter-chip Busses”3 Inductive noise in packaging Package performance model – Relationship with inductive parasitics – “Bandwidth per unit cost” metric Experimental results – Model versus SPICE Conclusions Outline

“Performance Model for Inter-chip Busses”4 1) Supply Bounce Switching current through inductive packaging induces voltage: Multiple Signals Switching Increase the Problem: Inductive Noise in Packaging n = # of drivers sharing the power/gnd pin (L 11 ). L 11 = Inductance of pwr/gnd pin that current is being switched through.

“Performance Model for Inter-chip Busses”5 2) Pin-to-Pin Coupling Switching Signals Couple Voltage onto Neighbors: Multiple Signals Switching Increase the Problem: Inductive Noise in Packaging M 1k = Mutual Inductance between pwr/gnd pin and k th signal pin.

“Performance Model for Inter-chip Busses”6 Analytical Model Bus Parameters W BUS : # of signals in the bus GSPSSSGSSPSSGSSPSS W BUS N G : # of Grounds in the bus GSPSSSGSSPSSGSSPSS

“Performance Model for Inter-chip Busses”7 Analytical Model Bus Parameters SPG : (# of Signals) : (# of PWR’s) : (# of GND’s) = 4:1:1 in above example GSPSSSGSSPSSGSSPSS Repetitive Pattern of Signal, Power, and Ground Pins

“Performance Model for Inter-chip Busses”8 Analytical Model Bus Performance Description dt dv v(t) t Slewrate

“Performance Model for Inter-chip Busses”9 Analytical Model Bus Performance Description v(t) t Risetime (0.8)  V DD 90% 10% V DD

“Performance Model for Inter-chip Busses”10 DATA Analytical Model Bus Performance Description DATA Minimum Unit Interval UI

“Performance Model for Inter-chip Busses”11 Analytical Model Bus Performance Description DATA Bus Throughput DATA Tx W BUS Rx

“Performance Model for Inter-chip Busses”12 NOISE Analytical Model Bus Performance Limits v(t) t Maximum Acceptable Ground Bounce p  V DD V DD (p typical = 5%)

“Performance Model for Inter-chip Busses”13 Analytical Model Model Development Maximum Ground Bounce Self Contribution Coupling Contribution

“Performance Model for Inter-chip Busses”14 Analytical Model Model Development Maximum Slewrate - pull out (di/dt) - convert to (dv/dt)

“Performance Model for Inter-chip Busses”15 Analytical Model Model Development Minimum Risetime : But since - convert slewrate to risetime

“Performance Model for Inter-chip Busses”16 Analytical Model Model Development Maximum Datarate : Also, since - convert Risetime to Datarate Maximum Throughput : Finally, we have

“Performance Model for Inter-chip Busses”17 Experimental Results Per-pin and Bus throughput values computed for 3 packages – Compared our model with SPICE simulations QFP – Wire Bond BGA – Wire Bond BGA – Flip-Chip

“Performance Model for Inter-chip Busses”18 Experimental Results QFP Wire-Bond Package Simulations Per-Pin Data-RateBus Throughput - Throughput reaches an asymptotic limit as channels are added Model Simulation

“Performance Model for Inter-chip Busses”19 Experimental Results BGA Wire-Bond Package Simulations Per-Pin Data-RateBus Throughput - Level 1 : BGA Increases Performance Over QFP

“Performance Model for Inter-chip Busses”20 Experimental Results BGA Flip-Chip Package Simulations Per-Pin Data-RateBus Throughput - Level 2: Flip-Chip Increases Performance Over Wire-Bond

“Performance Model for Inter-chip Busses”21 Experimental Results Cost Must Also Be Considered in Analysis Bandwidth Per Cost This Metric Represents “Cost Effectiveness of the Bus” Units = (Mb/$)

“Performance Model for Inter-chip Busses”22 Experimental Results Bandwidth Per Cost Results Faster Narrower Busses = More Cost Effective

“Performance Model for Inter-chip Busses”23 Presented a model for bus performance – Accounts for inductive parasitics in packaging – Developed a “bandwidth-per-unit-cost” metric to evaluate packages Experiments indicate – Strong agreement with SPICE simulation – Throughput for a bus reaches asymptotic limit as channels added Increase in channels compensated by decrease in per-pin throughput Optimal number of channels is small (4-8) “Bandwidth-per-unit-cost” experiments indicate – Flip-chip packages are actually most cost-effective – Narrower buses have better bandwidth-per-unit-cost Conclusions

“Performance Model for Inter-chip Busses”24 Thank you!

“Performance Model for Inter-chip Busses”25 Backup Slides

“Performance Model for Inter-chip Busses”26 Example PACKAGE - Rent’s Rule IC Core - Moore’s Law On-Chip - 8 bit Data Bus Mb/s Package - Need (8)(300M) = 2400 Mb/s

“Performance Model for Inter-chip Busses” Mb/s Example Need: QFP – Wire Bond BGA – Wire Bond BGA – Flip-Chip - 4 bits wide, SPG=2:1:1 - 1 bit wide, SPG=2:1:1 - 1 bit wide, SPG=2:1: bits wide, SPG=4:1:1 - 1 bit wide, SPG=4:1:1 - 1 bit wide, SPG=8:1:1 X XX X

“Performance Model for Inter-chip Busses”28 Example Cost of Each Bus Configuration Most Cost Effective: - BGA-WB - W bus = 1 - SPG = 2:1:1

“Performance Model for Inter-chip Busses”29 Experimental Results Cost per Bus Configuration Performance Increases with Cost (Package, SPG) $