MATH 685/ CSI 700/ OR 682 Lecture Notes Math 685/CSI 700 Spring 08 MATH 685/ CSI 700/ OR 682 Lecture Notes Lecture 3. Solving linear systems George Mason University, Department of Mathematical Sciences
Solving linear systems Math 685/CSI 700 Spring 08 Solving linear systems To solve linear system, transform it into one whose solution is same but easier to compute What type of transformation of linear system leaves solution unchanged? We can premultiply (from left) both sides of linear system Ax = b by any nonsingular matrix M without affecting solution Solution to MAx =Mb is given by George Mason University, Department of Mathematical Sciences
Permutation matrices Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Scaling Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Triangular form What type of linear system is easy to solve? Math 685/CSI 700 Spring 08 Triangular form What type of linear system is easy to solve? If one equation in system involves only one component of solution (i.e., only one entry in that row of matrix is nonzero), then that component can be computed by division If another equation in system involves only one additional solution component, then by substituting one known component into it, we can solve for other component If this pattern continues, with only one new solution component per equation, then all components of solution can be computed in succession. System with this property is called triangular George Mason University, Department of Mathematical Sciences
Triangular form Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Gaussian elimination Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Gaussian elimination Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Elimination Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Elimination matrices Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Elimination matrices Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Gaussian elimination Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Gaussian elimination Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
LU decomposition Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
LU decomposition Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Math 685/CSI 700 Spring 08 Row interchanges Gaussian elimination breaks down if leading diagonal entry of remaining unreduced matrix is zero at any stage Easy fix: if diagonal entry in column k is zero, then interchange row k with some subsequent row having nonzero entry in column k and then proceed as usual If there is no nonzero on or below diagonal in column k, then there is nothing to do at this stage, so skip to next column Zero on diagonal causes resulting upper triangular matrix U to be singular, but LU factorization can still be completed Subsequent back-substitution will fail, however, as it should for singular matrix George Mason University, Department of Mathematical Sciences
Math 685/CSI 700 Spring 08 Partial pivoting In principle, any nonzero value will do as pivot, but in practice pivot should be chosen to minimize error propagation To avoid amplifying previous rounding errors when multiplying remaining portion of matrix by elementary elimination matrix, multipliers should not exceed 1 in magnitude This can be accomplished by choosing entry of largest magnitude on or below diagonal as pivot at each stage Such partial pivoting is essential in practice for numerically stable implementation of Gaussian elimination for general linear systems George Mason University, Department of Mathematical Sciences
LU with pivoting Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Complete pivoting Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Small pivots Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Small pivots Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Pivoting not required sometimes Math 685/CSI 700 Spring 08 Pivoting not required sometimes George Mason University, Department of Mathematical Sciences
Residual Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Residual Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Implementing Gaussian elimination Math 685/CSI 700 Spring 08 Implementing Gaussian elimination George Mason University, Department of Mathematical Sciences
Complexity Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Inversion vs. Factorization Math 685/CSI 700 Spring 08 Inversion vs. Factorization George Mason University, Department of Mathematical Sciences
Math 685/CSI 700 Spring 08 Modified systems If right-hand side of linear system changes but matrix does not, then LU factorization need not be repeated to solve new system Only forward- and back-substitution need be repeated for new right-hand side This is substantial savings in work, since additional triangular solutions cost only O(n2) work, in contrast to O(n3) cost of factorization George Mason University, Department of Mathematical Sciences
Rank-one update Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Rank-one update Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Example Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Math 685/CSI 700 Spring 08 Scaling In principle, solution to linear system is unaffected by diagonal scaling of matrix and right-hand-side vector In practice, scaling affects both conditioning of matrix and selection of pivots in Gaussian elimination, which in turn affect numerical accuracy in finite-precision arithmetic It is usually best if all entries (or uncertainties in entries) of matrix have about same size Sometimes it may be obvious how to accomplish this by choice of measurement units for variables, but there is no foolproof method for doing so in general Scaling can introduce rounding errors if not done carefully George Mason University, Department of Mathematical Sciences
Scaling Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Iterative refinement Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Math 685/CSI 700 Spring 08 Iterative refinement Iterative refinement requires double storage, since both original matrix and its LU factorization are required Due to cancellation, residual usually must be computed with higher precision for iterative refinement to produce meaningful improvement For these reasons, iterative improvement is often impractical to use routinely, but it can still be useful in some circumstances For example, iterative refinement can sometimes stabilize otherwise unstable algorithm George Mason University, Department of Mathematical Sciences
SPD matrices Math 685/CSI 700 Spring 08 George Mason University, Department of Mathematical Sciences
Cholesky decomposition Math 685/CSI 700 Spring 08 Cholesky decomposition Features of Cholesky algorithm for symmetric positive definite matrices All n square roots are of positive numbers, so algorithm is well defined No pivoting is required to maintain numerical stability Only lower triangle of A is accessed, and hence upper triangular portion need not be stored Only n3/6 multiplications and similar number of additions are required Thus, Cholesky factorization requires only about half work and half storage compared with LU factorization of general matrix by Gaussian elimination, and also avoids need for pivoting George Mason University, Department of Mathematical Sciences
Math 685/CSI 700 Spring 08 Banded matrices Gaussian elimination for band matrices differs little from general case — only ranges of loops change Typically matrix is stored in array by diagonals to avoid storing zero entries If pivoting is required for numerical stability, bandwidth can grow (but no more than double) General purpose solver for arbitrary bandwidth is similar to code for Gaussian elimination for general matrices For fixed small bandwidth, band solver can be extremely simple, especially if pivoting is not required for stability George Mason University, Department of Mathematical Sciences
Iterative vs. direct methods Math 685/CSI 700 Spring 08 Iterative vs. direct methods Gaussian elimination is direct method for solving linear system, producing exact solution in finite number of steps (in exact arithmetic) Iterative methods begin with initial guess for solution and successively improve it until desired accuracy attained In theory, it might take infinite number of iterations to converge to exact solution, but in practice iterations are terminated when residual is as small as desired For some types of problems, iterative methods have significant advantages over direct methods George Mason University, Department of Mathematical Sciences