Multipole-Based Preconditioners for Sparse Linear Systems. Ananth Grama Purdue University. Supported by the National Science Foundation.

Slides:



Advertisements
Similar presentations
05/11/2005 Carnegie Mellon School of Computer Science Aladdin Lamps 05 Combinatorial and algebraic tools for multigrid Yiannis Koutis Computer Science.
Advertisements

Numerical Solution of Linear Equations
Fast Algorithms For Hierarchical Range Histogram Constructions
Chapter 8 Elliptic Equation.
Point-wise Discretization Errors in Boundary Element Method for Elasticity Problem Bart F. Zalewski Case Western Reserve University Robert L. Mullen Case.
Exploiting Sparse Markov and Covariance Structure in Multiresolution Models Presenter: Zhe Chen ECE / CMR Tennessee Technological University October 22,
MEASURES OF POST-PROCESSING THE HUMAN BODY RESPONSE TO TRANSIENT FIELDS Dragan Poljak Department of Electronics, University of Split R.Boskovica bb,
The Finite Element Method Defined
Rayan Alsemmeri Amseena Mansoor. LINEAR SYSTEMS Jacobi method is used to solve linear systems of the form Ax=b, where A is the square and invertible.
Numerical Parallel Algorithms for Large-Scale Nanoelectronics Simulations using NESSIE Eric Polizzi, Ahmed Sameh Department of Computer Sciences, Purdue.
Computational Solutions of Helmholtz Equation Yau Shu Wong Department of Mathematical & Statistical Sciences University of Alberta Edmonton, Alberta, Canada.
Weiping Shi Department of Computer Science University of North Texas HiCap: A Fast Hierarchical Algorithm for 3D Capacitance Extraction.
Steady Aeroelastic Computations to Predict the Flying Shape of Sails Sriram Antony Jameson Dept. of Aeronautics and Astronautics Stanford University First.
CS240A: Conjugate Gradients and the Model Problem.
A Solenoidal Basis Method For Efficient Inductance Extraction H emant Mahawar Vivek Sarin Weiping Shi Texas A&M University College Station, TX.
Module on Computational Astrophysics Jim Stone Department of Astrophysical Sciences 125 Peyton Hall : ph :
PETE 603 Lecture Session #29 Thursday, 7/29/ Iterative Solution Methods Older methods, such as PSOR, and LSOR require user supplied iteration.
Numerical methods for PDEs PDEs are mathematical models for –Physical Phenomena Heat transfer Wave motion.
Parallel Adaptive Mesh Refinement Combined With Multigrid for a Poisson Equation CRTI RD Project Review Meeting Canadian Meteorological Centre August.
Tutorial 5: Numerical methods - buildings Q1. Identify three principal differences between a response function method and a numerical method when both.
Parallel Performance of Hierarchical Multipole Algorithms for Inductance Extraction Ananth Grama, Purdue University Vivek Sarin, Texas A&M University Hemant.
CIS V/EE894R/ME894V A Case Study in Computational Science & Engineering HW 5 Repeat the HW associated with the FD LBI except that you will now use.
Fast Low-Frequency Impedance Extraction using a Volumetric 3D Integral Formulation A.MAFFUCCI, A. TAMBURRINO, S. VENTRE, F. VILLONE EURATOM/ENEA/CREATE.
PiCAP: A Parallel and Incremental Capacitance Extraction Considering Stochastic Process Variation Fang Gong 1, Hao Yu 2, and Lei He 1 1 Electrical Engineering.
CFD Lab - Department of Engineering - University of Liverpool Ken Badcock & Mark Woodgate Department of Engineering University of Liverpool Liverpool L69.
1 ELEC 3105 Basic EM and Power Engineering Start Solutions to Poisson’s and/or Laplace’s.
Efficient Integration of Large Stiff Systems of ODEs Using Exponential Integrators M. Tokman, M. Tokman, University of California, Merced 2 hrs 1.5 hrs.
The Finite Element Method A Practical Course
A particle-gridless hybrid methods for incompressible flows
The swiss-carpet preconditioner: a simple parallel preconditioner of Dirichlet-Neumann type A. Quarteroni (Lausanne and Milan) M. Sala (Lausanne) A. Valli.
Stable, Circulation- Preserving, Simplicial Fluids Sharif Elcott, Yiying Tong, Eva Kanso, Peter Schröder, and Mathieu Desbrun.
1 Complex Images k’k’ k”k” k0k0 -k0-k0 branch cut   k 0 pole C1C1 C0C0 from the Sommerfeld identity, the complex exponentials must be a function.
Molecular Simulation of Reactive Systems. _______________________________ Sagar Pandit, Hasan Aktulga, Ananth Grama Coordinated Systems Lab Purdue University.
Computational Aspects of Multi-scale Modeling Ahmed Sameh, Ananth Grama Computing Research Institute Purdue University.
CS240A: Conjugate Gradients and the Model Problem.
Case Study in Computational Science & Engineering - Lecture 5 1 Iterative Solution of Linear Systems Jacobi Method while not converged do { }
Modeling of the Unsteady Separated Flow over Bilge Keels of FPSO Hulls under Heave or Roll Motions Yi-Hsiang Yu 09/23/04 Copies of movies/papers and today’s.
Domain Decomposition in High-Level Parallelizaton of PDE codes Xing Cai University of Oslo.
October 2008 Integrated Predictive Simulation System for Earthquake and Tsunami Disaster CREST/Japan Science and Technology Agency (JST)
Algorithms and Infrastructure for Molecular Dynamics Simulations Ananth Grama Purdue University Various.
23/5/20051 ICCS congres, Atlanta, USA May 23, 2005 The Deflation Accelerated Schwarz Method for CFD C. Vuik Delft University of Technology
Modeling Electromagnetic Fields in Strongly Inhomogeneous Media
Discretization Methods Chapter 2. Training Manual May 15, 2001 Inventory # Discretization Methods Topics Equations and The Goal Brief overview.
Discretization for PDEs Chunfang Chen,Danny Thorne Adam Zornes, Deng Li CS 521 Feb., 9,2006.
Partial Derivatives Example: Find If solution: Partial Derivatives Example: Find If solution: gradient grad(u) = gradient.
Data Structures and Algorithms in Parallel Computing Lecture 10.
Consider Preconditioning – Basic Principles Basic Idea: is to use Krylov subspace method (CG, GMRES, MINRES …) on a modified system such as The matrix.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
F. Fairag, H Tawfiq and M. Al-Shahrani Department of Math & Stat Department of Mathematics and Statistics, KFUPM. Nov 6, 2013 Preconditioning Technique.
Fast BEM Algorithms for 3D Interconnect Capacitance and Resistance Extraction Wenjian Yu EDA Lab, Dept. Computer Science & Technology, Tsinghua University.
A Parallel Hierarchical Solver for the Poisson Equation Seung Lee Deparment of Mechanical Engineering
Adaptive grid refinement. Adaptivity in Diffpack Error estimatorError estimator Adaptive refinementAdaptive refinement A hierarchy of unstructured gridsA.
Non-parametric Methods for Clustering Continuous and Categorical Data Steven X. Wang Dept. of Math. and Stat. York University May 13, 2010.
Quality of Service for Numerical Components Lori Freitag Diachin, Paul Hovland, Kate Keahey, Lois McInnes, Boyana Norris, Padma Raghavan.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
Auburn University
EEE 431 Computational Methods in Electrodynamics
Chapter 7. Classification and Prediction
Xing Cai University of Oslo
Parallel Tasks Decomposition
ELEC 3105 Basic EM and Power Engineering
Auburn University COMP7330/7336 Advanced Parallel and Distributed Computing Mapping Techniques Dr. Xiao Qin Auburn University.
Finite Element Method To be added later 9/18/2018 ELEN 689.
A Parallel Hierarchical Solver for the Poisson Equation
L Ge, L Lee, A. Candel, C Ng, K Ko, SLAC
Supported by the National Science Foundation.
Objective Numerical methods Finite volume.
topic4: Implicit method, Stability, ADI method
SKTN 2393 Numerical Methods for Nuclear Engineers
Ph.D. Thesis Numerical Solution of PDEs and Their Object-oriented Parallel Implementations Xing Cai October 26, 1998.
Presentation transcript:

Multipole-Based Preconditioners for Sparse Linear Systems. Ananth Grama Purdue University. Supported by the National Science Foundation.

Overview Summary of Contributions Generalized Stokes Problem Solenoidal Basis Methods and Preconditioning Multipole Methods as Preconditioners Performance of Multipole-Based Preconditioners Parallelization of Solver/Preconditioner Parallel Performance Concluding Remarks

Summary of Contributions Problem Formulation Excellent Convergence Properties of Multipole- Based Preconditioners Parallelism in Multipole-based Sparse Solvers Highly Scalable and Efficient Parallel Formulations

Generalized Stokes Problem Navier-Stokes equation Incompressibility Condition

Generalized Stokes Problem Linear system B T is the discrete divergence operator and A collects all the velocity terms. Navier-Stokes: Generalized Stokes Problem:

Solenoidal Basis Methods Class of projection based methods Basis for divergence-free velocity Solenoidal basis matrix P: Divergence free velocity: Modified system =>

Solenoidal Basis Method Reduced system: Reduced system solved by suitable iterative methods such as CG or GMRES. Velocity obtained by : Pressure obtained by: ;

Preconditioning Observations Vorticity-Velocity function formulation:

Preconditioning Reduced system approximation: Preconditioner: Low accuracy preconditioning is sufficient

Preconditioning Preconditioning can be affected by an approximate Laplace solve followed by a Helmholtz solve. The Laplace solve can be performed effectively using a Multipole method.

Preconditioning Laplace/Poisson Problems Given a domain with known internal charges and boundary potential (Dirichlet conditions), estimate potential across the domain. –Compute boundary potential due to internal charges (single multipole application) –Compute residual boundary potential due to charges on the boundary (vector operation). –Compute boundary charges corresponding to residual boundary potential (multipole-based GMRES solve). –Compute potential through entire domain due to boundary and internal charges (single multipole).

Overview of Multipole Methods Consider the problem of computing the trajectory of particles in space. Multipole methods use hierarchical approximations to reduce computational cost.

Overview of Multipole Methods Accurate formulation requires O(n 2 ) force computations (mat-vec with a coefficient matrix of Green’s functions) Hierarchical methods reduce this complexity to O(n) or O(n log n) Since particle distributions can be very irregular, the tree can be highly unbalanced Commonly used hierarchical methods include FMM and the Barnes-Hut methods.

Overview of Multipole Methods Interactions (direct force computations) are replaced by Gaussian quadratures. Far-field interactions are replaced by multipole series representing remote Gauss points. Each matrix-vector product now becomes a single tree traversal taking O(n) or O(n log n) time. This mat-vec can be encapsulated in iterative solvers for dense systems.

Overview of Multipole Methods A number of results relating to the complexity of hierarchical methods [Rokhlin, Greengard], error analysis and control [Grama, Sarin, Sameh], and their use in dense solvers [Grama, Kumar, Sameh] have been shown. Their applications in applications such as inductance extraction [White, Sarin, Grama] have also been demonstrated.

Numerical Experiments 3D driven cavity problem in a cubic domain. Marker-and-cell scheme in which domain is discretized uniformly. Pressure unknowns are defined at node points and velocities at mid points of edges. x, y, and z components of velocities are defined on edges along respective directions.

Sample Problem Sizes 92,25695,23232,76832x32x32 10,80011,5204,09616x16x16 1,1761, x8x8 Solenoidal Functions VelocityPressureMesh

Preconditioning Performance

Preconditioning Performance – Poisson Problem

Parallel Formulation - Outer Solve Partition domain across processors Computation and storage of solenoidal basis matrix P Matrix-vector products: – Local operations on the grid Vector operations Preconditioning Scalable parallel implementations have been developed for all these operations Matrix free approach

Parallel Formulation (Multipole Solve) Each node evaluation can potentially be executed as a thread. Since there may be a large number of nodes, this may lead to too many threads. In general, we build some granularity into the thread by gathering k nodes into a single thread. The Barnes-Hut tree is a read-only data structure. Therefore, good performance can be achieved if we can build spatial and temporal locality (and the working set size does not exceed local memory).

Parallel Formulation(Multipole Solve) Spatially proximate particles are likely to interact with nearly identical parts of the tree. Therefore particles must be traversed in a spatial-proximity preserving order.

Parallel Performance

Concluding Remarks Multipole methods provide highly effective preconditioning techniques that yield excellent parallel speedups. The accuracy parameters of the multipole solve (degree, multipole acceptance criteria) significantly impact time and convergence rate. Rely on closed form Green’s function, but can be adapted to other scenarios.