Applications of GTX Y. Cao, X. Huang, A.B. Kahng, F. Koushanfar, H. Lu, S. Muddu, D. Stroobandt and D. Sylvester Abstract The GTX (GSRC Technology Extrapolation)

Slides:



Advertisements
Similar presentations
GTX: The MARCO GSRC Technology Extrapolation System A. Caldwell, Y. Cao, A. B. Kahng, F. Koushanfar, H. Lu, I. Markov, M. Oliver, D. Stroobandt and D.
Advertisements

June 6, Using Negative Edge Triggered FFs to Reduce Glitching Power in FPGA Circuits Tomasz S. Czajkowski and Stephen D. Brown Department of Electrical.
High-Level Constructors and Estimators Majid Sarrafzadeh and Jason Cong Computer Science Department
Noise Model for Multiple Segmented Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu †, Niranjan A. Pol ‡ and Devendra Vidhani* UCSD CSE and ECE.
Introduction to CMOS VLSI Design Clock Skew-tolerant circuits.
Chapter 11 Timing Issues in Digital Systems Boonchuay Supmonchai Integrated Design Application Research (IDAR) Laboratory August 20, 2004; Revised - July.
DARPA Assessing Parameter and Model Sensitivities of Cycle-Time Predictions Using GTX u Abstract The GTX (GSRC Technology Extrapolation) system serves.
Chapter 5 Interconnect RLC Model n Efficient capacitance model Efficient inductance model Efficient inductance model RC and RLC circuit model generation.
Statistical Crosstalk Aggressor Alignment Aware Interconnect Delay Calculation Supported by NSF & MARCO GSRC Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego.
Lecture 8: Clock Distribution, PLL & DLL
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 15: Interconnects & Wire Engineering Prof. Sherief Reda Division of Engineering,
© Digital Integrated Circuits 2nd Inverter CMOS Inverter: Digital Workhorse  Best Figures of Merit in CMOS Family  Noise Immunity  Performance  Power/Buffer.
From Compaq, ASP- DAC00. Power Consumption Power consumption is on the rise due to: - Higher integration levels (more devices & wires) - Rising clock.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 22: Material Review Prof. Sherief Reda Division of Engineering, Brown University.
Effects of Global Interconnect Optimizations on Performance Estimation of Deep Sub-Micron Design Yu (Kevin) Cao 1, Chenming Hu 1, Xuejue Huang 1, Andrew.
Parameters System attributes or variables Example of ASCII parameter grammar #parameter dl_chip #parameter dl_chip #type double #type double #units {m}
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
DARPA GTX: The GSRC Technology Extra- polation System, “A Living Roadmap” A. Caldwell, A. B. Kahng, F. Koushanfar, H. Lu, I. Markov, M. Oliver and D. Stroobandt.
EDA Roadmap Taskforce Report Draft 2 2/9/99 Figure 0.1 Process: Focus on Change Challenges & Directions Technology Paradigm Shifts Market Segment Semiconductor.
Effects of Global Interconnect Optimizations on Performance Estimation of Deep Sub-Micron Design Yu Cao, Chenming Hu, Xuejue Huang, Andrew B. Kahng, Sudhakar.
DARPA Calibrating Achievable Design Jason Cong, Wayne Dai, Andrew B. Kahng, Kurt Keutzer and Wojciech Maly.
Noise and Delay Uncertainty Studies for Coupled RC Interconnects Andrew B. Kahng, Sudhakar Muddu † and Devendra Vidhani ‡ UCLA Computer Science Department,
“ Near-Threshold Computing: Reclaiming Moore’s Law Through Energy Efficient Integrated Circuits ” By Ronald G. Dreslinski, Michael Wieckowski, David Blaauw,
Toshiba Standard Cell Architecture for High Frequency Operation Peter Hsu, Ph.D. Chief Architect Microprocessor Development Toshiba America Electronics.
Research on Analysis and Physical Synthesis Chung-Kuan Cheng CSE Department UC San Diego
UC San Diego / VLSI CAD Laboratory Toward Quantifying the IC Design Value of Interconnect Technology Improvement Tuck-Boon Chan, Andrew B. Kahng, Jiajia.
EE 5900 Advanced Algorithms for Robust VLSI CAD, Spring 2009 Static Timing Analysis and Gate Sizing.
Washington State University
Optimal digital circuit design Mohammad Sharifkhani.
DARPA GTX: The MARCO GSRC Technology Extrapolation System Abstract Technology extrapolation -- i.e., the calibration and prediction of achievable Technology.
Georgia Institute of Technology, Microelectronics Research Center Prediction of Interconnect Fan-out Distribution Using Rent’s Rule Payman Zarkesh-Ha,
INTERCONNECT MODELING M.Arvind 2nd M.E Microelectronics
1 Interconnect/Via. 2 Delay of Devices and Interconnect.
March 28, Glitch Reduction for Altera Stratix II devices Tomasz S. Czajkowski PhD Candidate University of Toronto Supervisor: Professor Stephen D.
-1- Delay Uncertainty and Signal Criticality Driven Routing Channel Optimization for Advanced DRAM Products Samyoung Bang #, Kwangsoo Han ‡, Andrew B.
EE141 Timing Issues 1 Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003 Rev /05/2003.
EE141 Timing Issues 1 Chapter 10 Timing Issues Rev /11/2003.
14/2/20041 Interconnect-Power Dissipation in a Microprocessor N. Magen, A. Kolodny, U. Weiser, N. Shamir Intel corporation® Technion - Israel Institute.
PROCEED: Pareto Optimization-based Circuit-level Evaluation Methodology for Emerging Devices Shaodi Wang, Andrew Pan, Chi-On Chui and Puneet Gupta Department.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
Prediction of Interconnect Net-Degree Distribution Based on Rent’s Rule Tao Wan and Malgorzata Chrzanowska- Jeske Department of Electrical and Computer.
Slide 1 SLIP 2004 Payman Zarkesh-Ha, Ken Doniger, William Loh, and Peter Bendix LSI Logic Corporation Interconnect Modeling Group February 14, 2004 Prediction.
Piero Belforte, HDT 1999: PRESTO POWER by Alessandro Arnulfo.
Oleg Petelin and Vaughn Betz FPL 2016
Worst Case Crosstalk Noise for Nonswitching Victims in High-Speed Buses Jun Chen and Lei He.
The Interconnect Delay Bottleneck.
Calibrating Achievable Design
Alireza Shafaei, Shuang Chen, Yanzhi Wang, and Massoud Pedram
Crosstalk If both a wire and its neighbor are switching at the same time, the direction of the switching affects the amount of charge to be delivered and.
Circuits and Interconnects In Aggressively Scaled CMOS
Revisiting and Bounding the Benefit From 3D Integration
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
CMOS VLSI Design Chapter 13 Clocks, DLLs, PLLs
Timing Analysis 11/21/2018.
Dual Mode Logic An approach for high speed and energy efficient design
Buffered tree construction for timing optimization, slew rate, and reliability control Abstract: With the rapid scaling of IC technology, buffer insertion.
Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003
Chapter 3b Static Noise Analysis
Summary Current density in a signal line was estimated, based on the simple circuit shown in Fig.1. This circuit is scaled down according to ITRS 2003.
Circuit Design Techniques for Low Power DSPs
CMOS VLSI Design Chapter 13 Clocks, DLLs, PLLs
University of Colorado at Boulder
Circuit Characterization and Performance Estimation
A High Performance SoC: PkunityTM
332:578 Deep Submicron VLSI Design Lecture 14 Design for Clock Skew
Puneet Gupta1 , Andrew B. Kahng1 , Youngmin Kim2, Dennis Sylvester2
Computer Evolution and Performance
Reading (Rabaey et al.): Sections 3.5, 5.6
EE216A – Fall 2010 Design of VLSI Circuits and Systems
Jason Cong, David Zhigang Pan & Prasanna V. Srinivas
Presentation transcript:

Applications of GTX Y. Cao, X. Huang, A.B. Kahng, F. Koushanfar, H. Lu, S. Muddu, D. Stroobandt and D. Sylvester Abstract The GTX (GSRC Technology Extrapolation) system serves as a flexible platform for integration and comparison of various studies, aimed at cali-brating and predicting achievable design in future technology generations. The flexibility of GTX makes it particularly useful for development of new studies that model certain aspects of design and technology, and emulation, comparison, and evaluation of various technology extrapolation methods. In this poster, we highlight the ability of GTX to evaluate the sensitivity of estimation methods to their input parameters and to their implicit modeling choices. Our sensitivity studies for existing models reveal a surprisingly high level of uncertainty inherent in predictions of future CPU timing. In particular, existing cycle time models are extremely sensitive to both modeling choices and to changes in device parameters. We also study tbe effect of new SOI device models and effects of repeater optimization, inductance and signal line shielding. Sensitivity Analyses Goal: investigate sensitivity of existing cycle-time prediction models SUSPENS, BACPAC, Fisher (ITRS) Types of sensitivity Parameter sensitivity (to changes in primary input parameters) Model (rule) sensitivity (to changes in the estimation model itself) Parameter sensitivity of cycle-time predictions Common primary input (PI) parameter base for all models (250nm technology, BACPAC’s default parameter values) Change of a single input parameter value by subtracting/adding 10% of its value, and computation of the resulting clock frequency Simultaneous changes of various parameters (up to 7 together) BACPAC is most robust model (may not be the best!) SUSPENS is very sensitive to parameter changes Everything else * For variations from 3 to 1 and 2 Fanout per gate (25%)* Supply voltage Fisher Logic depth (12%) Supply voltage (7%) BACPAC Dielectric constant (0.2%) Input capacitance of a minimum sized device Logic depth On-resistance of a minimum sized device Rent exponent (41%!) Track utilization factor (routing efficiency) Wiring pitch on layers SUSPENS Rather insensitive (<5%) to Very sensitive (>10%) to Model New Device Models New SOI module (assumes partially-depleted SOI and is based on BSIM3SOI models) Comparison between Bulk Si and SOI floating body effect (changes in Idsat) dynamic delay (due to coupling capacitances between same-layer wires) power comparison: Parameter sensitivity for bulk Si and SOI device models Delay uncertainty for staggered (S) and non-staggered (NS) repeater topologies using bulk and SOI (1.5cm global wire with 4 100X repeaters) (worst case – best case) / nominal delay Repeater intervals and line widths optimized subject to constraints on delay uncertainty and on peak coupling noise 100.00 66.03 56.74 Total power 0.54 0.36 0.12 0.07 Leakage 15.47 10.21 13.54 7.68 Short circuit 1.31 0.86 1.66 0.94 Memory 14.62 9.65 13.98 7.93 Clock distribution 20.22 13.35 20.65 11.71 I/O drivers + pads 3.93 2.60 3.88 2.20 Global interconnects 43.91 28.99 46.18 26.20 Logic + local wires % P (W) Bulk Si SOI GTX: The GSRC Technology Extrapolation System Evaluates the impact of both design and process technology on achievable design and associated design problems. Sets new requirements for CAD tools and methodologies. Allows easy integration, evaluation and comparison of several technology extrapolation efforts. Is based on the concepts of “parameters” (technology description) “rules” (derivation methods) “rule chains” (inference chain) a “derivation engine” (executes rule chain) a “GUI” (represents results, provides user interaction) Sensitivity to rules of other models (model sensitivity) Replacement of one rule of BACPAC / Fisher with a rule (or a set of rules) from another model BACPAC and Fisher are comparable except for a few rules Fisher model shows more variation than BACPAC Differences are larger for local than for global delay We also assessed the effects of clock skew (Takahashi model) and leading-edge interconnect optimizations such as wire sizing, driver and wire sizing, and buffer insertion and wire sizing (IPEM model) Inductance Analysis Comparison of five analytic RC and RLC delay models Non-staggered repeaters Shielding Topologies Optimize cost function = wire pitch x repeater size x number of repeaters Parameters: width of shield and signal wires spacing between signal wires and from signal wires to shield wires Constraints: delay (uncertainty) noise peak Topologies: No shielding (NS) 1 shield (1S) 2 shields (2S) 7.05 2S 7.40 6.00 5.10 1S RLC 6.75 4.60 2.85 NS 7.65 9.25 5.55 RC, 2 pole 9.00 6.25 3.45 RC, 1 pole 8.75 5.75 SF=3 SF=2 SF=1 Shield Model Staggered repeaters Repeater Optimization Repeater sizing Optimizing energy-delay product instead of delay only (Bakoglu) Bakoglu vastly oversizes! Staggering is beneficial Conclusion The example studies shown in this poster were all performed in the technology extrapolation system GTX. These and other examples confirm that the flexible framework afforded by GTX enables useful studies of alternative modeling choices with relatively little effort, as well as open collaboration in the open-source spirit.