The swiss-carpet preconditioner: a simple parallel preconditioner of Dirichlet-Neumann type A. Quarteroni (Lausanne and Milan) M. Sala (Lausanne) A. Valli.

Slides:



Advertisements
Similar presentations
Chapter 4 Partition I. Covering and Dominating.
Advertisements

Introduction The concept of transform appears often in the literature of image processing and data compression. Indeed a suitable discrete representation.
One-phase Solidification Problem: FEM approach via Parabolic Variational Inequalities Ali Etaati May 14 th 2008.
Isoparametric Elements Element Stiffness Matrices
Linear Algebra Applications in Matlab ME 303. Special Characters and Matlab Functions.
Chapter 8 Elliptic Equation.
P. Venkataraman Mechanical Engineering P. Venkataraman Rochester Institute of Technology DETC2013 – 12269: Continuous Solution for Boundary Value Problems.
03/29/2006, City Univ1 Iterative Methods with Inexact Preconditioners and Applications to Saddle-point Systems & Electromagnetic Maxwell Systems Jun Zou.
Uniform Treatment of Numerical Time-Integrations of the Maxwell Equations R. Horváth TU/e, Philips Research Eindhoven TU/e Eindhoven, 27th June, 2002 Scientific.
Efficient Storage and Processing of Adaptive Triangular Grids using Sierpinski Curves Csaba Attila Vigh Department of Informatics, TU München JASS 2006,
Some Ideas Behind Finite Element Analysis
By S Ziaei-Rad Mechanical Engineering Department, IUT.
Noga Alon Institute for Advanced Study and Tel Aviv University
Basis Expansion and Regularization Presenter: Hongliang Fei Brian Quanz Brian Quanz Date: July 03, 2008.
1 A component mode synthesis method for 3D cell by cell calculation using the mixed dual finite element solver MINOS P. Guérin, A.M. Baudron, J.J. Lautard.
The number of edge-disjoint transitive triples in a tournament.
CS 584. Review n Systems of equations and finite element methods are related.
Camera Calibration CS485/685 Computer Vision Prof. Bebis.
Tutorial 10 Iterative Methods and Matrix Norms. 2 In an iterative process, the k+1 step is defined via: Iterative processes Eigenvector decomposition.
12 INFINITE SEQUENCES AND SERIES The Comparison Tests In this section, we will learn: How to find the value of a series by comparing it with a known.
1 Parallel Simulations of Underground Flow in Porous and Fractured Media H. Mustapha 1,2, A. Beaudoin 1, J. Erhel 1 and J.R. De Dreuzy IRISA – INRIA.
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
Copyright © Cengage Learning. All rights reserved. CHAPTER 11 ANALYSIS OF ALGORITHM EFFICIENCY ANALYSIS OF ALGORITHM EFFICIENCY.
Parallel Adaptive Mesh Refinement Combined With Multigrid for a Poisson Equation CRTI RD Project Review Meeting Canadian Meteorological Centre August.
Interpolation. Interpolation is important concept in numerical analysis. Quite often functions may not be available explicitly but only the values of.
Basic Communication Operations Based on Chapter 4 of Introduction to Parallel Computing by Ananth Grama, Anshul Gupta, George Karypis and Vipin Kumar These.
Algorithms for a large sparse nonlinear eigenvalue problem Yusaku Yamamoto Dept. of Computational Science & Engineering Nagoya University.
Krakow, Summer 2011 Comparability Graphs William T. Trotter
Finite Element Method.
Pareto Linear Programming The Problem: P-opt Cx s.t Ax ≤ b x ≥ 0 where C is a kxn matrix so that Cx = (c (1) x, c (2) x,..., c (k) x) where c.
Yaomin Jin Design of Experiments Morris Method.
DISCRETE COMPUTATIONAL STRUCTURES CS Fall 2005.
C GasparAdvances in Numerical Algorithms, Graz, Fast interpolation techniques and meshless methods Csaba Gáspár Széchenyi István University, Department.
Application Paradigms: Unstructured Grids CS433 Spring 2001 Laxmikant Kale.
Tarek A. El-Moselhy and Luca Daniel
EDGE DETECTION IN COMPUTER VISION SYSTEMS PRESENTATION BY : ATUL CHOPRA JUNE EE-6358 COMPUTER VISION UNIVERSITY OF TEXAS AT ARLINGTON.
Serge Andrianov Theory of Symplectic Formalism for Spin-Orbit Tracking Institute for Nuclear Physics Forschungszentrum Juelich Saint-Petersburg State University,
LP Narrowing: A New Strategy for Finding All Solutions of Nonlinear Equations Kiyotaka Yamamura Naoya Tamura Koki Suda Chuo University, Tokyo, Japan.
Parallel Solution of the Poisson Problem Using MPI
The Dominating Set and its Parametric Dual  the Dominated Set  Lan Lin prepared for theory group meeting on June 11, 2003.
Domain Decomposition in High-Level Parallelizaton of PDE codes Xing Cai University of Oslo.
A Dirichlet-to-Neumann (DtN)Multigrid Algorithm for Locally Conservative Methods Sandia National Laboratories is a multi program laboratory managed and.
8.4.2 Quantum process tomography 8.5 Limitations of the quantum operations formalism 量子輪講 2003 年 10 月 16 日 担当:徳本 晋
Lecture 21 MA471 Fall 03. Recall Jacobi Smoothing We recall that the relaxed Jacobi scheme: Smooths out the highest frequency modes fastest.
Partitioning using Mesh Adjacencies  Graph-based dynamic balancing Parallel construction and balancing of standard partition graph with small cuts takes.
Outline Introduction Research Project Findings / Results
Determining 3D Structure and Motion of Man-made Objects from Corners.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
F. Fairag, H Tawfiq and M. Al-Shahrani Department of Math & Stat Department of Mathematics and Statistics, KFUPM. Nov 6, 2013 Preconditioning Technique.
A Parallel Hierarchical Solver for the Poisson Equation Seung Lee Deparment of Mechanical Engineering
Adaptive grid refinement. Adaptivity in Diffpack Error estimatorError estimator Adaptive refinementAdaptive refinement A hierarchy of unstructured gridsA.
Multipole-Based Preconditioners for Sparse Linear Systems. Ananth Grama Purdue University. Supported by the National Science Foundation.
Hybrid Parallel Implementation of The DG Method Advanced Computing Department/ CAAM 03/03/2016 N. Chaabane, B. Riviere, H. Calandra, M. Sekachev, S. Hamlaoui.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
High Performance Computing Seminar II Parallel mesh partitioning with ParMETIS Parallel iterative solvers with Hypre M.Sc. Caroline Mendonça Costa.
High Performance Computing Seminar
Theory of Computational Complexity Probability and Computing Chapter Hikaru Inada Iwama and Ito lab M1.
Xing Cai University of Oslo
Copyright © Cengage Learning. All rights reserved.
11.4 The Comparison Tests In this section, we will learn:
Distinct Distances in the Plane
Numerical Analysis Lecture 16.
Supported by the National Science Foundation.
Objective Numerical methods Finite volume.
Spin Models and Distance-Regular Graphs
Quantum Two.
Inequalities Some problems in algebra lead to inequalities instead of equations. An inequality looks just like an equation, except that in the place of.
Ph.D. Thesis Numerical Solution of PDEs and Their Object-oriented Parallel Implementations Xing Cai October 26, 1998.
Communication Driven Remapping of Processing Element (PE) in Fault-tolerant NoC-based MPSoCs Chia-Ling Chen, Yen-Hao Chen and TingTing Hwang Department.
Presentation transcript:

The swiss-carpet preconditioner: a simple parallel preconditioner of Dirichlet-Neumann type A. Quarteroni (Lausanne and Milan) M. Sala (Lausanne) A. Valli (Trento)

OUTLINE 1. Element-oriented and vertex-oriented decompositions 2. The swiss-carpet preconditioner 3. Analysis 4. Numerical results

1. Element-oriented and vertex-oriented decompositions Left: element-oriented Right: vertex-oriented

The model problem h is the grid size parameter

For the two-level Schwarz method, the construction of a coarse grid and of the corresponding restriction and extension operators can be computationally expensive on general unstructured grids. The vertex-oriented decomposition has been used in this framework for providing a coarse operator based on the so- called “aggregation”, avoiding the explicit construction of the coarse grid. A first result: the Schwarz algorithm on a vertex- oriented decomposition

For each subdomain one constructs a number of aggregates (sets of contiguous vertices). Each node of the fine grid is internal to only one aggregate (taking advantage of the vertex-oriented decomposition of the grid). The total number of the aggregates is the dimension of the coarse space. Only one coarse grid basis function is defined in each aggregate: the elements of the coarse space are given by linear combinations of elements of the fine space. The restriction operator is therefore a sort of piecewise constant interpolation. The numerical results have showed that this procedure is cheap and efficient (Sala and Formaggia, 2002; Sala, 2003).

5.35

Algebraic structure for the element-oriented decomposition

Algebraic structure for the vertex-oriented decomposition

Some remarks:

2. The swiss-carpet preconditioner First of all, let us joke for a minute…: the reason of the name!

Start from the Swiss flag:

Modify the cross a little bit: Do you see the vertex-oriented decomposition?

Then take the Sierpinski carpet: the first four iterations are:

Here is the following one:

A Sierpinski-like procedure gives:

And here is the swiss-carpet!

The idea is to use a Dirichlet-Neumann preconditioner based on the swiss-carpet decomposition: this means one of the following choices P =

The first choice requires the solution of a Neumann problem in the red region (indeed, M independent Neumann problems in the M connected components of the red region) The second choice requires the solution of one Neumann problem in the white region

A third choice (a Neumann-Neumann preconditioner) is also possible: P = It requires the solution of M independent Neumann problems in the M red regions, and of one Neumann problem in the white region

In our experience, one finds better theoretical and computational results with the second choice, P =

The vertex-oriented decomposition is a suitable one for parallel computations: the nodes in the white transition region can be replicated on the processors which hold the disconnected red subdomains The Dirichlet step in the red region is clearly performed in parallel The vertex-oriented technique is used in many parallel linear algebra packages

3. Analysis The condition number of the preconditioned Schur complement matrix satisfies where the constant C does not depend on H and h.

The result is a straightforward consequence of the following estimate: that is obtained by carefully estimating the energy of the harmonic extension in the red regions by the energy of the harmonic extension in the white region.

Sketch of the proof. The first estimate is clear. The second is obtained by means of a frequently used approach: first, note that one can consider the local problem over a single red region and its white “frame”, and then add each local contribute; then, the harmonic extension in the red region is estimated by means of the trace over the boundary; by mapping back and forth on a square of unit side, the explicit dependence on H is determined (in this step, the estimate is uniform with respect to h, by the well- known uniform extension theorem);

finally, the trace over the boundary of the square is estimated by the energy of the harmonic extension (indeed, the interpolant) into the white frame (the only tool is the trace inequality); here, the frame is nonlinearly mapped into a “fixed” frame of width 1, and the explicit dependence on h and H is determined.

The result above has been proved for a second order symmetric elliptic operator having no zero-order terms, in the two- dimensional case (piecewise linear finite elements). However, numerical results seem to confirm it also for more general symmetric elliptic operators, and in the three-dimensional case.

Some remarks:

Generalized vertex-oriented decomposition One can use a larger “connecting” set: use for instance the following algorithm

4. Numerical results

The comparison between the EO Balancing Neumann- Neumann preconditioner and the VO swiss-carpet preconditioner

The comparison between the EO Balancing Neumann- Neumann preconditioner and the VO swiss-carpet preconditioner (continued)

We want to present now some additional numerical results for a non-structured subdomain decomposition The used decomposition is obtained by the software METIS METIS is a family of programs for partitioning unstructured graphs and meshes, and computing fill-reducing orderings of sparse matrices Other numerical results (work in progress)

Conclusions A simple preconditioner of Dirichlet-Neumann type has been presented Condition number (for the Laplace operator…) behaves like H/h (proved in 2D; numerical evidence in 3D) The main feature is the type of decomposition (the swiss- carpet, a suitable vertex-oriented decomposition) In principle, the same decomposition could be used for other problems (convection-diffusion, Stokes, systems,…) To be done: analysis and numerical results for more general problems