Fill Reduction Algorithm Using Diagonal Markowitz Scheme with Local Symmetrization Patrick Amestoy ENSEEIHT-IRIT, France Xiaoye S. Li Esmond Ng Lawrence.

Slides:



Advertisements
Similar presentations
Section 2.5: Graphs and Trees
Advertisements

Graph Theory Arnold Mesa. Basic Concepts n A graph G = (V,E) is defined by a set of vertices and edges v3 v1 v2Vertex (v1) Edge (e1) A Graph with 3 vertices.
Lecture 3 Sparse Direct Method: Combinatorics Xiaoye Sherry Li Lawrence Berkeley National Laboratory, USA crd-legacy.lbl.gov/~xiaoye/G2S3/
1 Maximum flow sender receiver Capacity constraint Lecture 6: Jan 25.
ECE 552 Numerical Circuit Analysis Chapter Four SPARSE MATRIX SOLUTION TECHNIQUES Copyright © I. Hajj 2012 All rights reserved.
Siddharth Choudhary.  Refines a visual reconstruction to produce jointly optimal 3D structure and viewing parameters  ‘bundle’ refers to the bundle.
Symmetric Minimum Priority Ordering for Sparse Unsymmetric Factorization Patrick Amestoy ENSEEIHT-IRIT (Toulouse) Sherry Li LBNL/NERSC (Berkeley) Esmond.
1cs542g-term Notes  Assignment 1 is out (due October 5)  Matrix storage: usually column-major.
1cs542g-term Sparse matrix data structure  Typically either Compressed Sparse Row (CSR) or Compressed Sparse Column (CSC) Informally “ia-ja” format.
Symmetric Weighted Matching for Indefinite Systems Iain Duff, RAL and CERFACS John Gilbert, MIT and UC Santa Barbara June 21, 2002.
Chapter 9 Graph algorithms Lec 21 Dec 1, Sample Graph Problems Path problems. Connectedness problems. Spanning tree problems.
Sparse Matrix Methods Day 1: Overview Day 2: Direct methods Nonsymmetric systems Graph theoretic tools Sparse LU with partial pivoting Supernodal factorization.
Sparse Matrix Methods Day 1: Overview Matlab and examples Data structures Ax=b Sparse matrices and graphs Fill-reducing matrix permutations Matching and.
1 EE 616 Computer Aided Analysis of Electronic Networks Lecture 4 Instructor: Dr. J. A. Starzyk, Professor School of EECS Ohio University Athens, OH,
The Evolution of a Sparse Partial Pivoting Algorithm John R. Gilbert with: Tim Davis, Jim Demmel, Stan Eisenstat, Laura Grigori, Stefan Larimore, Sherry.
Sparse Direct Solvers on High Performance Computers X. Sherry Li CS267: Applications of Parallel Computers March.
Graph Algorithms. Overview Graphs are very general data structures – data structures such as dense and sparse matrices, sets, multi-sets, etc. can be.
CS 290H Lecture 12 Column intersection graphs, Ordering for sparsity in LU with partial pivoting Read “Computing the block triangular form of a sparse.
Graphs Algorithm Design and Analysis Week 4 Bibliography: [CLRS]- Subchap 22.1 – Representation of Graphs.
MUMPS A Multifrontal Massively Parallel Solver IMPLEMENTATION Distributed multifrontal.
Scalabilities Issues in Sparse Factorization and Triangular Solution Sherry Li Lawrence Berkeley National Laboratory Sparse Days, CERFACS, June 23-24,
TECHNIQUES ALGORITHMS AND SOFTWARE FOR THE DIRECT SOLUTION OF LARGE SPARSE LINEAR EQUATIONS.
Symbolic sparse Gaussian elimination: A = LU
Lecture 5 Parallel Sparse Factorization, Triangular Solution
Perfect Gaussian Elimination and Chordality By Shashank Rao.
GRAPHS CSE, POSTECH. Chapter 16 covers the following topics Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component,
The Landscape of Sparse Ax=b Solvers Direct A = LU Iterative y’ = Ay Non- symmetric Symmetric positive definite More RobustLess Storage More Robust More.
Amesos Sparse Direct Solver Package Ken Stanley, Rob Hoekstra, Marzio Sala, Tim Davis, Mike Heroux Trilinos Users Group Albuquerque 3 Nov 2004.
CS 290H Lecture 5 Elimination trees Read GLN section 6.6 (next time I’ll assign 6.5 and 6.7) Homework 1 due Thursday 14 Oct by 3pm turnin file1.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
Nyhoff, ADTs, Data Structures and Problem Solving with C++, Second Edition, © 2005 Pearson Education, Inc. All rights reserved Graphs.
Approximation Algorithms for NP-hard Combinatorial Problems Magnús M. Halldórsson Reykjavik University Local Search, Greedy and Partitioning
Data Structures & Algorithms Graphs
Solution of Sparse Linear Systems
Lecture 4 Sparse Factorization: Data-flow Organization
1/24 Introduction to Graphs. 2/24 Graph Definition Graph : consists of vertices and edges. Each edge must start and end at a vertex. Graph G = (V, E)
Direct Methods for Sparse Linear Systems Lecture 4 Alessandra Nardi Thanks to Prof. Jacob White, Suvranu De, Deepak Ramaswamy, Michal Rewienski, and Karen.
Department of Electronic Engineering, Tsinghua University Nano-scale Integrated Circuit and System Lab. Performance Analysis of Parallel Sparse LU Factorization.
Administrivia: October 5, 2009 Homework 1 due Wednesday Reading in Davis: Skim section 6.1 (the fill bounds will make more sense next week) Read section.
Graphs A graphs is an abstract representation of a set of objects, called vertices or nodes, where some pairs of the objects are connected by links, called.
CS 290H Administrivia: May 14, 2008 Course project progress reports due next Wed 21 May. Reading in Saad (second edition): Sections
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
CSE 421 Algorithms Richard Anderson Winter 2009 Lecture 5.
CS 290H Lecture 15 GESP concluded Final presentations for survey projects next Tue and Thu 20-minute talk with at least 5 min for questions and discussion.
CS 290H Lecture 9 Left-looking LU with partial pivoting Read “A supernodal approach to sparse partial pivoting” (course reader #4), sections 1 through.
Symmetric-pattern multifrontal factorization T(A) G(A)
Solution of Sparse Linear Systems Numerical Simulation CSE245 Thanks to: Kin Sou, Deepak Ramaswamy, Michal Rewienski, Jacob White, Shihhsien Kuo and Karen.
Conjugate gradient iteration One matrix-vector multiplication per iteration Two vector dot products per iteration Four n-vectors of working storage x 0.
CSE 373: Data Structures and Algorithms Lecture 21: Graphs V 1.
Lecture 3 Sparse Direct Method: Combinatorics
Graphs Representation, BFS, DFS
Direct Methods for Sparse Linear Systems
Lecture 11 Graph Algorithms
Graphs Algorithm Design and Analysis Bibliography:
Solving Linear Systems Ax=b
CS 290H Administrivia: April 16, 2008
CSE373: Data Structures & Algorithms Lecture 16: Introduction to Graphs Linda Shapiro Winter 2015.
The Landscape of Sparse Ax=b Solvers
CS200: Algorithm Analysis
Graphs Representation, BFS, DFS
Matrix Martingales in Randomized Numerical Linear Algebra
Graphs.
Kevin Mason Michael Suggs
CSCI2100 Data Structures Tutorial
A Graph Partitioning Algorithm by Node Separators
Graphs G = (V, E) V are the vertices; E are the edges.
CSE 373 Data Structures and Algorithms
Lecture 10 Graph Algorithms
Graphs Algorithm Design and Analysis Bibliography:
Nonsymmetric Gaussian elimination
Presentation transcript:

Fill Reduction Algorithm Using Diagonal Markowitz Scheme with Local Symmetrization Patrick Amestoy ENSEEIHT-IRIT, France Xiaoye S. Li Esmond Ng Lawrence Berkeley National Laboratory

SIAM CSE03, Feb 10-13, Contents  Motivation  Graph models for Gaussian elimination  Minimum priority metrics  Experimental results  Summary  Add: Runtime and space complexity

SIAM CSE03, Feb 10-13, Motivation -- New Sparse LU Factorization Algorithms  Inexpensive pre/post-processing  Equilibration (or scaling)  Pre-permute rows or columns of A to maximize its diagonal  Find a matching with maximum weight for bipartite graph of A  Example: MC64 [Duff/Koster ‘99]  Iterative refinement  GESP (static pivoting) [Li/Demmel ‘98, SuperLU_DIST]  Pivots are chosen from the diagonal  Allow half-precision perturbation of small diagonals  Unsymmetrized multifrontal [Amestoy/Puglisi ‘00, MA41_NEW]  Prefer diagonal pivoting, but threshold pivoting is possible  Allow unsymmetric fronts, but dependency graph is still a tree  Diagonal is (almost) good  Struct(L’)  Struct(U)

SIAM CSE03, Feb 10-13, Existing Ordering Strategies to Preserve Sparsity  Symmetric ordering algorithms on A’+A  Greedy algorithms  e.g., minimum degree, minimum deficiency, etc.  Graph partitioning  Hybrid  Problem: unsymmetric structure is not respected!

SIAM CSE03, Feb 10-13, Structural Gaussian Elimination -- Symmetric Case i jk i jk Eliminate 1 1 i j k 1 i j k 1 i j k i k j Undirected graph After a vertex is eliminated, all its neighbors become a clique The edges of the clique are the potential fills (upper bound !)

SIAM CSE03, Feb 10-13, Structural Gaussian Elimination -- Unsymmetric Case Eliminate 1 1 r1 r2 c1 c2 c3 1 r1 r2 c1 c2 c3 c1r1 r2 c2 c3 Eliminate 1 r1 r2 c1 c2 c3 1 1 Bipartite graph After a vertex is eliminated, all the row & column vertices adjacent to it become fully connected – “bi-clique” (assuming diagonal pivot) The edges of the bi-clique are the potential fills (upper bound !)

SIAM CSE03, Feb 10-13, Ordering Algorithms Revisit  Markowitz [1957] for unsymmetric matrices  At step k, pick pivot in the trailing submatrix so that:  It has minimum, and  It is bounded by a numerical threshold  Bound the size of the rank-1 update matrix  Expensive to implement because it is mixed with numerical consideration  Examples: MA48 (HSL), etc.  “Restricted” Markowitz -- only look ahead a few candidate columns (rows) with the lowest degrees [Zlatev ‘80]  Minimum degree [Tinney/Walker ‘67]  Special case of Markowitz for SPD systems  Efficient implementation, because:  Diagonal is stable as numerical pivot  Use quotient graph as a compact representation without regard of numerical values

SIAM CSE03, Feb 10-13, Simulation Result  Order(A) vs. Order(A’+A) (Markowitz vs. min degree)  Diagonal pivoting  88 unsymmetric matrices  Mean fill ratio 0.90  Mean flops ratio 0.79  54 very unsymmetric (symmetry <= 0.5)  Mean fill ratio 0.85  Mean flops ratio 0.56

SIAM CSE03, Feb 10-13, Quotient Graph – Symmetric Case  Elements -- representative nodes of the connected components in the eliminated subgraph  Variables -- uneliminated nodes Current pivot p: If variable v adjacent to e1, it will be adjacent to p  e1 can be absorbed by p  p is representative of conn. comp. {e1, e2, p} e1 e2 pxx x x. element list = {e1, e2}. variable list v

SIAM CSE03, Feb 10-13, Quotient Graph -- Unsymmetric Case Current pivot p: Difficulty: Path length may be greater than 2 ! e1 e2 p x x x v

SIAM CSE03, Feb 10-13, Quotient Graph -- “Local Symmetrization” e1 e2 p x x x v Current pivot p: Advantage: - Path length bounded by 2 ! Disadvantage: - Lose some asymmetry - More fill ss s

SIAM CSE03, Feb 10-13, Cost of Implementation  Elimination models can be implemented using standard graphs or quotient graphs, with different cost in time & space.

SIAM CSE03, Feb 10-13, Minimum Priority Metrics  Metrics are based on “approximate degree” in the sense of AMD, can be implemented efficiently  Almost the same cost using various metrics:  Based on row & column counts:  PRODUCT (a.k.a. Markowitz), SUM, MIN, MAX, etc.  Minimum fill : areas associated with the existing cliques are deducted  …...

SIAM CSE03, Feb 10-13, Preliminary Results with Local Symmetrization  Matrices: 98 unsymmetric in structure  Metrics : based on row/column counts or fill  Solvers:  MA41_NEW : unsymmetrized multifrontal  Local symmetrization ordering is ideal for this solver  SuperLU_DIST : GESP

SIAM CSE03, Feb 10-13, Compare Different Metrics  Solver: MA41_NEW  Average fill ratio using various metrics with respect to Markowitz (product of row & col counts)

SIAM CSE03, Feb 10-13, Compare with AMD(A’+A) using Min Fill -- All Unsymmetric  MA41_NEW  SuperLU_DIST

SIAM CSE03, Feb 10-13, Compare with AMD(A’+A) using Min Fill -- Very Unsymmetric  MA41_NEW  SuperLU_DIST

SIAM CSE03, Feb 10-13, Summary  First implementation based on BQG model  Features: supervariable, element absorption, mass elimination  Using approximate degree (degree upper bound)  Tried various metrics on large collection of matrices  PRODUCT, SUM, MIN-FILL, etc.  Not a single one is universally best, MIN-FILL is often better  Local symmetrization  Cheaper to implement, harder to understand behavior  Especially suitable for unsymmetrized multifrontal, also benefit GESP  Respectable gain for very unsymmetric matrices

SIAM CSE03, Feb 10-13, Summary (con’d)  Results for very unsymmetric matrices  Future work  Work underway for a fully unsymmetric version  Extend to graph partitioning strategy

SIAM CSE03, Feb 10-13, The End

SIAM CSE03, Feb 10-13, x 2 x x x 3 x 4 x 5 x x x 6 x x 7 Example A G(A) row column