PiCAP: A Parallel and Incremental Capacitance Extraction Considering Stochastic Process Variation Fang Gong 1, Hao Yu 2, and Lei He 1 1 Electrical Engineering.

Slides:



Advertisements
Similar presentations
Non-Gaussian Statistical Timing Analysis Using Second Order Polynomial Fitting Lerong Cheng 1, Jinjun Xiong 2, and Lei He 1 1 EE Department, UCLA *2 IBM.
Advertisements

Parallel Jacobi Algorithm Steven Dong Applied Mathematics.
Slide 1 Bayesian Model Fusion: Large-Scale Performance Modeling of Analog and Mixed- Signal Circuits by Reusing Early-Stage Data Fa Wang*, Wangyang Zhang*,
Point-wise Discretization Errors in Boundary Element Method for Elasticity Problem Bart F. Zalewski Case Western Reserve University Robert L. Mullen Case.
1 Optimization Algorithms on a Quantum Computer A New Paradigm for Technical Computing Richard H. Warren, PhD Optimization.
Exploiting Sparse Markov and Covariance Structure in Multiresolution Models Presenter: Zhe Chen ECE / CMR Tennessee Technological University October 22,
Stochastic Analog Circuit Behavior Modeling by Point Estimation Method
An Efficient Method for Chip-Level Statistical Capacitance Extraction Considering Process Variations with Spatial Correlation W. Zhang, W. Yu, Z. Wang,
FastPlace: Efficient Analytical Placement using Cell Shifting, Iterative Local Refinement and a Hybrid Net Model FastPlace: Efficient Analytical Placement.
Minimal Skew Clock Synthesis Considering Time-Variant Temperature Gradient Hao Yu, Yu Hu, Chun-Chen Liu and Lei He EE Department, UCLA Presented by Yu.
Modern iterative methods For basic iterative methods, converge linearly Modern iterative methods, converge faster –Krylov subspace method Steepest descent.
1 In-Network PCA and Anomaly Detection Ling Huang* XuanLong Nguyen* Minos Garofalakis § Michael Jordan* Anthony Joseph* Nina Taft § *UC Berkeley § Intel.
Non-Linear Statistical Static Timing Analysis for Non-Gaussian Variation Sources Lerong Cheng 1, Jinjun Xiong 2, and Prof. Lei He 1 1 EE Department, UCLA.
Weiping Shi Department of Computer Science University of North Texas HiCap: A Fast Hierarchical Algorithm for 3D Capacitance Extraction.
1 BSMOR: Block Structure-preserving Model Order Reduction http//:eda.ee.ucla.edu Hao Yu, Lei He Electrical Engineering Dept., UCLA Sheldon S.D. Tan Electrical.
Statistical Crosstalk Aggressor Alignment Aware Interconnect Delay Calculation Supported by NSF & MARCO GSRC Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego.
Primary Contributions  Derive inversion based VPEC (Vector Potential Equivalent Circuit) model from first principles.  Replace inductances with effective.
Tolerance Analysis of Flexible Assemblies Michael Tonks.
SAMSON: A Generalized Second-order Arnoldi Method for Reducing Multiple Source Linear Network with Susceptance Yiyu Shi, Hao Yu and Lei He EE Department,
An Algebraic Multigrid Solver for Analytical Placement With Layout Based Clustering Hongyu Chen, Chung-Kuan Cheng, Andrew B. Kahng, Bo Yao, Zhengyong Zhu.
A Global Minimum Clock Distribution Network Augmentation Algorithm for Guaranteed Clock Skew Yield A. B. Kahng, B. Liu, X. Xu, J. Hu* and G. Venkataraman*
Planning operation start times for the manufacture of capital products with uncertain processing times and resource constraints D.P. Song, Dr. C.Hicks.
Statistical Gate Delay Calculation with Crosstalk Alignment Consideration Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego
A Solenoidal Basis Method For Efficient Inductance Extraction H emant Mahawar Vivek Sarin Weiping Shi Texas A&M University College Station, TX.
PETE 603 Lecture Session #29 Thursday, 7/29/ Iterative Solution Methods Older methods, such as PSOR, and LSOR require user supplied iteration.
Fast 3-D Interconnect Capacitance Extraction and Related Numerical Techniques Wenjian Yu EDA Lab, Dept. Computer Science & Technology, Tsinghua University.
More Realistic Power Grid Verification Based on Hierarchical Current and Power constraints 2 Chung-Kuan Cheng, 2 Peng Du, 2 Andrew B. Kahng, 1 Grantham.
Worst-Case Timing Jitter and Amplitude Noise in Differential Signaling Wei Yao, Yiyu Shi, Lei He, Sudhakar Pamarti, and Yu Hu Electrical Engineering Dept.,

Parallel Performance of Hierarchical Multipole Algorithms for Inductance Extraction Ananth Grama, Purdue University Vivek Sarin, Texas A&M University Hemant.
AN ITERATIVE METHOD FOR MODEL PARAMETER IDENTIFICATION 4. DIFFERENTIAL EQUATION MODELS E.Dimitrova, Chr. Boyadjiev E.Dimitrova, Chr. Boyadjiev BULGARIAN.
Kalman filtering techniques for parameter estimation Jared Barber Department of Mathematics, University of Pittsburgh Work with Ivan Yotov and Mark Tronzo.
Presented by Fang Gong 1 University of California, Los Angeles, Los Angeles, USA 2 Nanyang Technological University, Singapore Fang Gong 1, Hao Yu 2 and.
Fast Low-Frequency Impedance Extraction using a Volumetric 3D Integral Formulation A.MAFFUCCI, A. TAMBURRINO, S. VENTRE, F. VILLONE EURATOM/ENEA/CREATE.
Fast Thermal Analysis on GPU for 3D-ICs with Integrated Microchannel Cooling Zhuo Fen and Peng Li Department of Electrical and Computer Engineering, {Michigan.
3. 3 Separation of Variables We seek a solution of the form Cartesian coordinatesCylindrical coordinates Spherical coordinates Not always possible! Usually.
Scheduling Many-Body Short Range MD Simulations on a Cluster of Workstations and Custom VLSI Hardware Sumanth J.V, David R. Swanson and Hong Jiang University.
CFD Lab - Department of Engineering - University of Liverpool Ken Badcock & Mark Woodgate Department of Engineering University of Liverpool Liverpool L69.
1 ELEC 3105 Basic EM and Power Engineering Start Solutions to Poisson’s and/or Laplace’s.
Statistical Sampling-Based Parametric Analysis of Power Grids Dr. Peng Li Presented by Xueqian Zhao EE5970 Seminar.
EE 201C Homework 1 Fang Gong 1.References Capacitance Calculation: n Formula based – T.Sakurai, K.Tamaru, "Simple Formulas for Two-
Scalable Symbolic Model Order Reduction Yiyu Shi*, Lei He* and C. J. Richard Shi + *Electrical Engineering Department, UCLA + Electrical Engineering Department,
Tarek A. El-Moselhy and Luca Daniel
Speeding up the Simulation of Coupling Losses in ITER size CICC joints by using a Multi-Level Fast Multipole Method and GPU technology Ezra van Lanen,
On the Use of Sparse Direct Solver in a Projection Method for Generalized Eigenvalue Problems Using Numerical Integration Takamitsu Watanabe and Yusaku.
Molecular Simulation of Reactive Systems. _______________________________ Sagar Pandit, Hasan Aktulga, Ananth Grama Coordinated Systems Lab Purdue University.
Xianwu Ling Russell Keanini Harish Cherukuri Department of Mechanical Engineering University of North Carolina at Charlotte Presented at the 2003 IPES.
Computational Aspects of Multi-scale Modeling Ahmed Sameh, Ananth Grama Computing Research Institute Purdue University.
QuickYield: An Efficient Global-Search Based Parametric Yield Estimation with Performance Constraints Fang Gong 1, Hao Yu 2, Yiyu Shi 1, Daesoo Kim 1,
Modeling Electromagnetic Fields in Strongly Inhomogeneous Media
Consider Preconditioning – Basic Principles Basic Idea: is to use Krylov subspace method (CG, GMRES, MINRES …) on a modified system such as The matrix.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
Simulation and Experimental Verification of Model Based Opto-Electronic Automation Drexel University Department of Electrical and Computer Engineering.
Circuit Simulation using Matrix Exponential Method Shih-Hung Weng, Quan Chen and Chung-Kuan Cheng CSE Department, UC San Diego, CA Contact:
Fast BEM Algorithms for 3D Interconnect Capacitance and Resistance Extraction Wenjian Yu EDA Lab, Dept. Computer Science & Technology, Tsinghua University.
A Parallel Hierarchical Solver for the Poisson Equation Seung Lee Deparment of Mechanical Engineering
Multipole-Based Preconditioners for Sparse Linear Systems. Ananth Grama Purdue University. Supported by the National Science Foundation.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
EEE 431 Computational Methods in Electrodynamics
ELEC 3105 Basic EM and Power Engineering
Nodal Methods for Core Neutron Diffusion Calculations
Finite Element Method To be added later 9/18/2018 ELEN 689.
Fang Gong1, Hao Yu2, and Lei He1 1Electrical Engineering Dept., UCLA
Model Order Reduction for Parametric Systems
Supported by the National Science Foundation.
Yiyu Shi*, Wei Yao*, Jinjun Xiong+ and Lei He*
EE201C Chapter 3 Interconnect RLC Modeling
EE 201C Homework 1 Fang Gong
EE 201C Homework 1 Fang Gong
EE 201C Homework 1 Fang Gong
Presentation transcript:

PiCAP: A Parallel and Incremental Capacitance Extraction Considering Stochastic Process Variation Fang Gong 1, Hao Yu 2, and Lei He 1 1 Electrical Engineering Dept., UCLA 2 Berkeley Design Automation Presented by Fang Gong

Outline Background and Motivation Algorithms Experimental Results Conclusion and Future Work

Outline Background and Motivation Algorithms Experimental Results Conclusion and Future Work

Process Variation and Cap Extraction Process variation leads to capacitance variation OPC lithography and CMP polishing Capacitance variation affects circuit performance Delay variation and analog mismatch From [Kang and Gupta]

Background of BEM Based Cap Extraction Capacitance extraction in FastCap  Procedures 1. Discretize metal surface into panels 2. Form linear system by collocation 3. Results in dense potential coeffs 4. Solve by iterative GMRES Fast Multipole method (FMM) to evaluate Matrix Vector Product (MVP) Preconditioned GMRES iteration with guided convergence Difficulties for stochastic capacitance extraction How to consider variations in FMM? How to consider different variations in precondition? Source Panel j Observe Panel i

Motivation of Our Work Existing works  Stochastic integral by low-rank approximation Zhu, Z. and White, J. “FastSies: a fast stochastic integral equation solver for modeling the rough surface effect”. In Proceedings of IEEE/ACM ICCAD Pros: Rigorous formulation Cons: Random integral is slow for full-chip extraction  Stochastic orthogonal polynomial (SOP) expansion Cui, J., and etc. “Variational capacitance modeling using orthogonal polynomial method”. In Proceedings of the 18th ACM Great Lakes Symposium on VLSI Pros: An efficient non-Monte-Carlo approach Cons: SOP expansion results in an augmented and dense linear system Objective of our work early parallel improvement.  Fast multi-pole method (FMM) with nearly O(n) performance with a further parallel improvement.  Pre-conditioner should be updated incrementally for different variation.

Outline Background and Motivation Algorithms Experimental Results Conclusion and Future Work

Flow of piCAP 1.Represent P ij with stochastic geometric moments 2.Use parallel FMM to evaluate MVP of Pxq 3.Obtain capacitance (mean and variance) with incrementally preconditioned GMRES Potential Coefficient Solve with GMRES Build Spectrum preconditioner Evaluate the MVP ( P x q ) with FMM in parallel Calculate C ij with the charge distribution. Geometric Moments Incrementally update preconditioner Geometry Info Process Variation

Stochastic Geometric Moment Consider two independent variation sources: panel distance (d) and panel width (w)‏ Multipole expansion along x-y-z coordinates: multipole moments and local moments  M i and L i show an explicit dependence on geometry parameters, and are called geometric moments. source-cube observer cube source-panel d0d0 r0r0

Stochastic Potential Coefficient Expansion Stochastic Potential Coefficient  Relate geometric parameters to random variables Let be random variable for panel width w, and be random variable for panel distance d.  Geometric moments M p and L p are:  Now, the potential coefficient is

Stochastic Potential Coefficient Expansion Stochastic Potential Coefficient  Relate geometric parameters to random variables Let be random variable for panel width w, and be random variable for panel distance d.  Geometric moments M p and L p are:  Now, the potential coefficient is n -order stochastic orthogonal polynomial expansion of P Accordingly, m -th order ( m = 2n + n(n − 1) ) expansion of charge is :

Potential Coefficient Solve with GMRES Build Spectrum preconditioner Evaluate the MVP ( P x q ) with FMM in parallel Calculate C ij with the charge distribution. Geometric Moments Incrementally update preconditioner Geometry Info Process Variation Augmented System Recap: SOP expansion leads to a large and dense system equation

Parallel Fast Multipole Method--upward Overview of Parallel fast-multipole method (FMM)‏  group panels in cubes, and build hierarchical tree for cubes  We use 8-degree trees in implementation, but use 2-degree trees for illustration here.  A parallel FMM distributes cubes to different processors Upward Pass Level 0 Level 1 Level 2 Level 3 starting from bottom level, it calculates stochastic geometric moments Update parent’s moments by summing the moments of its children— called M2M operation M2M M2M operations can be performed in parallel at different nodes

Parallel Fast Multipole Method--Downwards Downward Pass Level 0 Level 1 Level 2 Level 3 Sum L2L results with near-field potential for all panels at bottom level and return P x q At the top level, calculation of potential between two cubes—called M2L operation. potential is further distributed down to children from their parent in parallel—called L2L operation. M2L L2L Calculate near-field potential directly in parallel M2M M2M, L2L are local operations, while M2L is global operation. How to reduce communication traffic due to global operation?

Reduction of traffic between processors  Global data dependence exists in M2L operation at the top level  Pre-fetch moments: distributes its moments to all cubes on its dependency list before the calculations.  As such, it can hide communication time. Cube 1 Cube under calculation Cube 0 Dependency List Cube 0 Cube 1 … Cube k … Cube k

Flow of piCAP 1.Use spectrum pre-conditioner to accelerate convergence 2.Incrementally update the pre-conditioner for different variation. Potential Coefficient Solve with GMRES Build Spectrum preconditioner Evaluate the MVP ( P x q ) with FMM in parallel Calculate C ij with the charge distribution. Geometric Moments Incrementally update preconditioner Geometry Info Process Variation

Deflated Spectral Iteration Why need spectral preconditioner  GMRES needs too many iterations to achieve convergence.  Spectral preconditioner shifts the spectrum of system matrix to improve the iteration convergence Deflated spectral iteration  k (k=1 power iteration) partial eigen-pairs  Spectrum preconditioner Why need incremental precondition  Variation can significantly change spectral distribution  Building each pre-conditioner for different variations is expensive  Simultaneously considering all variations increases the complexity of our model.

Incremental Precondition For updated system, the update for the i- th eigen vector is:  is the subspace composed of  is the updated spectrum Updated pre-conditioner W’ is Inverse operation only involves diagonal matrix D K Consider different variations by updating the nominal preconditioner partially.

Outline Background and Motivation Algorithm Experimental Results Conclusion and Future Work

Accuracy Comparison Setup : two panels with random variation for distance d and width w Result : Stochastic Geometric Moments have high accuracy with average error of 1.8%, and can be up to ~1000X faster than MC 2 panels, d 0 = 7.07μm, w 0 = 1μm, d 1 = 20%d 0 MC (3000)‏piCAP Cij (fF)‏ Time(s)‏ panels, d 0 = 11.31μm, w 0 = 1μm, d 1 = 10%d 0 MC (3000)‏piCAP Cij (fF)‏ Time(s)‏ panels, d 0 = 4.24μm, w 0 = 1μm, d 1 = 20%d 0, w 1 = 20%h 0 MC (3000)‏piCAP Cij (fF)‏ Time(s)‏

Runtime for parallel FMM  Setup Two-layer example with 20 conductors. Other: 40, 80, 160 conductors Evaluate P x q (MVP) with 10% perturbation on panel distance  Result All examples can have about 3X speedup with 4 processors #wire #panels proc / / / /1.0 2 proc /1.7X /1.4X /1.7X /1.7X 3 proc /2.0X /2.0X /2.0X /2.0X 4 proc /2.7X /2.9X /3.0X /2.8X

Efficiency of spectral preconditioner  Setup : Three test structures: single plate, 2x2 bus, cubic  Result Compare diagonal precondition with spectrum precondition Spectrum precondition accelerates convergence of GMRES (3X). # panel# variablediagonal prec.spectral prec. # iterTime(s)‏# iterTime(s)‏ plate cubic bus

Speedup by Incremental Precondition Setup  Test on two-layer 20 conductor example  Incremental update of nominal pre-conditioner for different variation sources  Compare with non-incremental one discretization w-t-l #panel#variableTotal Runtime (s)‏ Non-incrementalincremental 3x3x x3x x3x x3x Result : Up to 15X speedup over non-incremental results, and only incremental one can finish all large examples.

Conclusion and Future Work Introduce stochastic geometric moments Develop a parallel FMM to evaluate the matrix- vector product with process variation Develop a spectral pre-conditioner incrementally to consider different variations Future Work: extend our parallel and incremental solver to solve other IC-variation related stochastic analysis

Thanks PiCAP: A Parallel and Incremental Capacitance Extraction Considering Stochastic Process Variation Fang Gong, Hao Yu and Lei He