Supported by the National Science Foundation.

Slides:



Advertisements
Similar presentations
Stable Fluids A paper by Jos Stam.
Advertisements

Chapter 8 Elliptic Equation.
Point-wise Discretization Errors in Boundary Element Method for Elasticity Problem Bart F. Zalewski Case Western Reserve University Robert L. Mullen Case.
Exploiting Sparse Markov and Covariance Structure in Multiresolution Models Presenter: Zhe Chen ECE / CMR Tennessee Technological University October 22,
MEASURES OF POST-PROCESSING THE HUMAN BODY RESPONSE TO TRANSIENT FIELDS Dragan Poljak Department of Electronics, University of Split R.Boskovica bb,
The Finite Element Method Defined
Coupled Fluid-Structural Solver CFD incompressible flow solver has been coupled with a FEA code to analyze dynamic fluid-structure coupling phenomena CFD.
Weiping Shi Department of Computer Science University of North Texas HiCap: A Fast Hierarchical Algorithm for 3D Capacitance Extraction.
Steady Aeroelastic Computations to Predict the Flying Shape of Sails Sriram Antony Jameson Dept. of Aeronautics and Astronautics Stanford University First.
CS240A: Conjugate Gradients and the Model Problem.
A Solenoidal Basis Method For Efficient Inductance Extraction H emant Mahawar Vivek Sarin Weiping Shi Texas A&M University College Station, TX.
Module on Computational Astrophysics Jim Stone Department of Astrophysical Sciences 125 Peyton Hall : ph :
Numerical methods for PDEs PDEs are mathematical models for –Physical Phenomena Heat transfer Wave motion.
Parallel Adaptive Mesh Refinement Combined With Multigrid for a Poisson Equation CRTI RD Project Review Meeting Canadian Meteorological Centre August.
Lecture Objectives Review SIMPLE CFD Algorithm SIMPLE Semi-Implicit Method for Pressure-Linked Equations Define Residual and Relaxation.
Parallel Performance of Hierarchical Multipole Algorithms for Inductance Extraction Ananth Grama, Purdue University Vivek Sarin, Texas A&M University Hemant.
CIS V/EE894R/ME894V A Case Study in Computational Science & Engineering HW 5 Repeat the HW associated with the FD LBI except that you will now use.
Fast Low-Frequency Impedance Extraction using a Volumetric 3D Integral Formulation A.MAFFUCCI, A. TAMBURRINO, S. VENTRE, F. VILLONE EURATOM/ENEA/CREATE.
PiCAP: A Parallel and Incremental Capacitance Extraction Considering Stochastic Process Variation Fang Gong 1, Hao Yu 2, and Lei He 1 1 Electrical Engineering.
ParCFD Parallel computation of pollutant dispersion in industrial sites Julien Montagnier Marc Buffat David Guibert.
CFD Lab - Department of Engineering - University of Liverpool Ken Badcock & Mark Woodgate Department of Engineering University of Liverpool Liverpool L69.
1 ELEC 3105 Basic EM and Power Engineering Start Solutions to Poisson’s and/or Laplace’s.
The Finite Element Method A Practical Course
A particle-gridless hybrid methods for incompressible flows
Lecture Objectives: Explicit vs. Implicit Residual, Stability, Relaxation Simple algorithm.
The swiss-carpet preconditioner: a simple parallel preconditioner of Dirichlet-Neumann type A. Quarteroni (Lausanne and Milan) M. Sala (Lausanne) A. Valli.
Stable, Circulation- Preserving, Simplicial Fluids Sharif Elcott, Yiying Tong, Eva Kanso, Peter Schröder, and Mathieu Desbrun.
1 Complex Images k’k’ k”k” k0k0 -k0-k0 branch cut   k 0 pole C1C1 C0C0 from the Sommerfeld identity, the complex exponentials must be a function.
Computational Aspects of Multi-scale Modeling Ahmed Sameh, Ananth Grama Computing Research Institute Purdue University.
Lecture Objectives Review Define Residual and Relaxation SIMPLE CFD Algorithm SIMPLE Semi-Implicit Method for Pressure-Linked Equations.
Case Study in Computational Science & Engineering - Lecture 5 1 Iterative Solution of Linear Systems Jacobi Method while not converged do { }
October 2008 Integrated Predictive Simulation System for Earthquake and Tsunami Disaster CREST/Japan Science and Technology Agency (JST)
Discretization Methods Chapter 2. Training Manual May 15, 2001 Inventory # Discretization Methods Topics Equations and The Goal Brief overview.
Partial Derivatives Example: Find If solution: Partial Derivatives Example: Find If solution: gradient grad(u) = gradient.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
A Parallel Hierarchical Solver for the Poisson Equation Seung Lee Deparment of Mechanical Engineering
Adaptive grid refinement. Adaptivity in Diffpack Error estimatorError estimator Adaptive refinementAdaptive refinement A hierarchy of unstructured gridsA.
Quality of Service for Numerical Components Lori Freitag Diachin, Paul Hovland, Kate Keahey, Lois McInnes, Boyana Norris, Padma Raghavan.
Multipole-Based Preconditioners for Sparse Linear Systems. Ananth Grama Purdue University. Supported by the National Science Foundation.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
EEE 431 Computational Methods in Electrodynamics
Auburn University
EEE 431 Computational Methods in Electrodynamics
Chapter 7. Classification and Prediction
Xing Cai University of Oslo
Objective Numerical methods SIMPLE CFD Algorithm Define Relaxation
ChaNGa: Design Issues in High Performance Cosmology
Parallel Tasks Decomposition
ELEC 3105 Basic EM and Power Engineering
Boundary Element Analysis of Systems Using Interval Methods
Auburn University COMP7330/7336 Advanced Parallel and Distributed Computing Mapping Techniques Dr. Xiao Qin Auburn University.
Implementing Finite Volume Methods
© Fluent Inc. 1/10/2018L1 Fluids Review TRN Solution Methods.
Finite Element Method To be added later 9/18/2018 ELEN 689.
Fluent Overview Ahmadi/Nazridoust ME 437/537/637.
Convergence in Computational Science
A Parallel Hierarchical Solver for the Poisson Equation
Objective Unsteady state Numerical methods Discretization
L Ge, L Lee, A. Candel, C Ng, K Ko, SLAC
Efficient Parallel Simulation of Fluid Dynamics on Cartesian Grids
A robust preconditioner for the conjugate gradient method
AIAA OBSERVATIONS ON CFD SIMULATION UNCERTAINITIES
AIAA OBSERVATIONS ON CFD SIMULATION UNCERTAINTIES
Performance Evaluation of the Parallel Fast Multipole Algorithm Using the Optimal Effectiveness Metric Ioana Banicescu and Mark Bilderback Department of.
Convergence in Numerical Science
Objective Numerical methods Finite volume.
Investigators Tony Johnson, T. V. Hromadka II and Steve Horton
topic4: Implicit method, Stability, ADI method
SKTN 2393 Numerical Methods for Nuclear Engineers
Ph.D. Thesis Numerical Solution of PDEs and Their Object-oriented Parallel Implementations Xing Cai October 26, 1998.
Presentation transcript:

Supported by the National Science Foundation. Impact of Far-Field Interactions on Performance of Multipole-Based Preconditioners for Sparse Linear Systems. Ananth Grama Purdue University. Vivek Sarin Texas A&M University Supported by the National Science Foundation.

Overview Summary of Contributions Generalized Stokes Problem Solenoidal Basis Methods and Preconditioning Multipole Methods as Preconditioners Performance of Multipole-Based Preconditioners Parallelization of Solver/Preconditioner Parallel Performance Concluding Remarks

Summary of Contributions Problem Formulation Excellent Convergence Properties of Multipole- Based Preconditioners Parallelism in Multipole-based Sparse Solvers Highly Scalable and Efficient Parallel Formulations

Generalized Stokes Problem Navier-Stokes equation Incompressibility Condition

Generalized Stokes Problem Linear system BT is the discrete divergence operator and A collects all the velocity terms. Navier-Stokes: Generalized Stokes Problem:

Solenoidal Basis Methods Class of projection based methods Basis for divergence-free velocity Solenoidal basis matrix P: Divergence free velocity: Modified system =>

Solenoidal Basis Method Reduced system: Reduced system solved by suitable iterative methods such as CG or GMRES. Velocity obtained by : Pressure obtained by: ;

Preconditioning Observations Vorticity-Velocity function formulation:

Preconditioning Reduced system approximation: Preconditioner: Low accuracy preconditioning is sufficient

Preconditioning Preconditioning can be affected by an approximate Laplace solve followed by a Helmholtz solve. The Laplace solve can be performed effectively using a Multipole method.

Preconditioning Laplace/Poisson Problems Given a domain with known internal charges and boundary potential (Dirichlet conditions), estimate potential across the domain. Compute boundary potential due to internal charges (single multipole application) Compute residual boundary potential due to charges on the boundary (vector operation). Compute boundary charges corresponding to residual boundary potential (multipole-based GMRES solve). Compute potential through entire domain due to boundary and internal charges (single multipole).

Overview of Multipole Methods Consider the problem of computing the trajectory of particles in space. Multipole methods use hierarchical approximations to reduce computational cost.

Overview of Multipole Methods Accurate formulation requires O(n2) force computations (mat-vec with a coefficient matrix of Green’s functions) Hierarchical methods reduce this complexity to O(n) or O(n log n) Since particle distributions can be very irregular, the tree can be highly unbalanced Commonly used hierarchical methods include FMM and the Barnes-Hut methods.

Overview of Multipole Methods Interactions (direct force computations) are replaced by Gaussian quadratures. Far-field interactions are replaced by multipole series representing remote Gauss points. Each matrix-vector product now becomes a single tree traversal taking O(n) or O(n log n) time. This mat-vec can be encapsulated in iterative solvers for dense systems.

Overview of Multipole Methods A number of results relating to the complexity of hierarchical methods [Rokhlin, Greengard], error analysis and control [Grama, Sarin, Sameh], and their use in dense solvers [Grama, Kumar, Sameh] have been shown. Their applications in applications such as inductance extraction [White, Sarin] have also been demonstrated.

Numerical Experiments 3D driven cavity problem in a cubic domain. Marker-and-cell scheme in which domain is discretized uniformly. Pressure unknowns are defined at node points and velocities at mid points of edges. x, y, and z components of velocities are defined on edges along respective directions.

Sample Problem Sizes 92,256 95,232 32,768 32x32x32 10,800 11,520 4,096 1,176 1,344 512 8x8x8 Solenoidal Functions Velocity Pressure Mesh

Preconditioning Performance

Preconditioning Performance

Preconditioning Performance – Poisson Problem

Preconditioning Performance– Poisson Problem

Preconditioning Performance– Poisson Problem

Parallel Formulation - Outer Solve Partition domain across processors Computation and storage of solenoidal basis matrix P Matrix-vector products: Local operations on the grid Vector operations Preconditioning Scalable parallel implementations have been developed for all these operations Matrix free approach

Parallel Formulation (Multipole Solve) Each node evaluation can potentially be executed as a thread. Since there may be a large number of nodes, this may lead to too many threads. In general, we build some granularity into the thread by gathering k nodes into a single thread. The Barnes-Hut tree is a read-only data structure. Therefore, good performance can be achieved if we can build spatial and temporal locality (and the working set size does not exceed local memory).

Parallel Formulation(Multipole Solve) Spatially proximate particles are likely to interact with nearly identical parts of the tree. Therefore particles must be traversed in a spatial-proximity preserving order.

Parallel Performance

Parallel Performance

Concluding Remarks Multipole methods provide highly effective preconditioning techniques that yield excellent parallel speedups. The accuracy parameters of the multipole solve (degree, multipole acceptance criteria) significantly impact time and convergence rate. Rely on closed form Green’s function, but can be adapted to other scenarios.