
Basic Algorithm

The Arnoldi method is an orthogonal projection method onto a Krylov subspace. It starts with the Arnoldi procedure as described in Algorithm 7.3. The procedure can be essentially viewed as a modified Gram-Schmidt process for building an orthogonal basis of the Krylov subspace $\KK^m(A,v)$.
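Recall that $\KK^m(A,v)$ denotes the Krylov subspace $ {\rm span} \{ v, A v, A^2 v, \ldots , A^{m-1} v \} $.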


\begin{algorithm}{Arnoldi Procedure}
{
\begin{tabbing}
(nr)ss\=ijkl\=bbb\=ccc\= \kill
{\rm (1)} \> {\bf start}: choose a starting vector $v$ and compute $ v_1 = v / \Vert v \Vert _2 $ \\
{\rm (2)} \> {\bf iterate}: {\bf for} $ j=1,2,\ldots ,m $ {\bf do:} \\
{\rm (3)} \> \> $ w := A v_j $ \\
{\rm (4)} \> \> {\bf for} $ i=1,2,\ldots ,j $ {\bf do:} \\
{\rm (5)} \> \> \> $ h_{ij} := v_i^{\ast} w $ \\
{\rm (6)} \> \> \> $ w := w - h_{ij} v_i $ \\
{\rm (7)} \> \> {\bf end for} \\
{\rm (8)} \> \> $ h_{j+1,j} := \Vert w \Vert _2 $ \\
{\rm (9)} \> \> {\bf if} $ h_{j+1,j} = 0 $ {\bf then stop} \\
{\rm (10)} \> \> $ v_{j+1} := w / h_{j+1,j} $ \\
{\rm (11)} \> \> {\bf end for}
\end{tabbing}}
\end{algorithm}

The above procedure will stop if the norm of the vector $w $ computed in line (8) vanishes. The vectors $v_1, v_2, \ldots , v_m $ form an orthonormal system by construction and are called Arnoldi vectors. An easy induction argument shows that this system is a basis of the Krylov subspace $\KK^m(A,v)$.
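As an illustration only, the following is a minimal sketch of the procedure above in Python with NumPy; the routine name arnoldi, its arguments, and the convention of returning the vectors $v_1,\ldots,v_{m+1}$ together with the $(m+1) \times m$ Hessenberg matrix are choices made here, not part of the original presentation.

\begin{verbatim}
import numpy as np

def arnoldi(A, v, m):
    """Sketch of the Arnoldi procedure (modified Gram-Schmidt form).

    Returns V with columns v_1, ..., v_{m+1} and the (m+1) x m
    Hessenberg matrix H containing the coefficients h_{ij}.
    """
    n = A.shape[0]
    V = np.zeros((n, m + 1), dtype=complex)
    H = np.zeros((m + 1, m), dtype=complex)
    V[:, 0] = v / np.linalg.norm(v)          # (1)  v_1 = v / ||v||_2
    for j in range(m):                       # (2)  for j = 1, ..., m
        w = A @ V[:, j]                      # (3)  w = A v_j
        for i in range(j + 1):               # (4)-(7)  orthogonalize w
            H[i, j] = np.vdot(V[:, i], w)    # (5)  h_{ij} = v_i^* w
            w = w - H[i, j] * V[:, i]        # (6)  w = w - h_{ij} v_i
        H[j + 1, j] = np.linalg.norm(w)      # (8)  h_{j+1,j} = ||w||_2
        if H[j + 1, j] == 0.0:               # (9)  breakdown
            return V[:, : j + 1], H[: j + 2, : j + 1]
        V[:, j + 1] = w / H[j + 1, j]        # (10) next Arnoldi vector
    return V, H
\end{verbatim}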

Next we consider a fundamental relation between quantities generated by the algorithm. The following equality is readily derived:

\begin{displaymath}
A v_j = \sum_{i=1}^{j+1} h_{ij} v_i , \quad j=1,2,\ldots ,m \ .
\end{displaymath} (122)

If we denote by $V_m$ the $n \times m $ matrix with column vectors $ v_1,
\ldots, v_m $, and by $H_m$ the $ m \times m $ Hessenberg matrix whose nonzero entries $h_{ij}$ are defined by the algorithm, then the following relations hold:
\begin{displaymath}
A V_m = V_m H_m + h_{m+1,m} v_{m+1} e_m^{\ast} ,
\end{displaymath} (123)

\begin{displaymath}
V_m^{\ast} A V_m = H_m \ .
\end{displaymath} (124)

Relation (124) follows from (123) by multiplying both sides of (123) on the left by $ V_m^{\ast}$ and making use of the orthonormality of $\{ v_1, \ldots,v_m \} $.
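As a quick numerical check (a sketch, assuming the arnoldi routine above; the small randomly generated test problem is a choice made here), relations (123) and (124) can be verified directly:

\begin{verbatim}
import numpy as np

np.random.seed(0)
n, m = 50, 10
A = np.random.rand(n, n)                 # small dense test matrix
v = np.random.rand(n)                    # starting vector

V, H = arnoldi(A, v, m)                  # sketch given earlier
Vm, Hm = V[:, :m], H[:m, :m]             # V_m and H_m
v_next, h_next = V[:, m], H[m, m - 1]    # v_{m+1} and h_{m+1,m}
e_m = np.zeros(m); e_m[-1] = 1.0

# (123):  A V_m = V_m H_m + h_{m+1,m} v_{m+1} e_m^*
print(np.allclose(A @ Vm, Vm @ Hm + h_next * np.outer(v_next, e_m)))
# (124):  V_m^* A V_m = H_m
print(np.allclose(Vm.conj().T @ A @ Vm, Hm))
\end{verbatim}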

As was noted earlier the algorithm breaks down when the norm of $w $ computed on line (8) vanishes at a certain step $j$. As it turns out, this happens if and only if the starting vector $v$ is a combination of $j$ eigenvectors (i.e., the minimal polynomial of $v_1$ is of degree $j$). In addition, the subspace $ \KK_j $ is then invariant and the approximate eigenvalues and eigenvectors are exact [387].
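The breakdown can be observed on a toy problem (a sketch, assuming the arnoldi routine above; the diagonal test matrix is a choice made here): starting from a combination of two eigenvectors, $h_{3,2}$ is zero up to rounding and the two Ritz values agree with eigenvalues of the matrix.

\begin{verbatim}
import numpy as np

Ad = np.diag([1.0, 2.0, 3.0, 4.0, 5.0])        # eigenvalues 1, ..., 5
vd = np.array([1.0, 1.0, 0.0, 0.0, 0.0])       # combination of two eigenvectors

Vd, Hd = arnoldi(Ad, vd, 2)                    # sketch given earlier
print(abs(Hd[2, 1]))                           # h_{3,2}: zero up to rounding
print(np.sort(np.linalg.eigvals(Hd[:2, :2]).real))   # Ritz values: [1., 2.]
\end{verbatim}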

The approximate eigenvalues $ \lambda_i \sup{m}$ provided by the projection process onto $\KK_m $ are the eigenvalues of the Hessenberg matrix $H_m$. These are known as Ritz values. A Ritz approximate eigenvector associated with a Ritz value $ \lambda_i \sup{m}$ is defined by $ u_i \sup{m}= V_m y_i \sup{m}$, where $
y_i \sup{m}$ is an eigenvector associated with the eigenvalue $ \lambda_i \sup{m}$. A number of the Ritz eigenvalues, typically a small fraction of $m$, will usually constitute good approximations for corresponding eigenvalues $\lambda_i$ of $A$, and the quality of the approximation will usually improve as $m$ increases.
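For concreteness, a sketch of the projection step (assuming the quantities A, Vm, and Hm from the verification sketch above): the Ritz values are obtained as the eigenvalues of $H_m$, and the Ritz vectors are recovered as $ u_i \sup{m}= V_m y_i \sup{m}$. For this small random example the approximations are rough; they improve as $m$ grows.

\begin{verbatim}
import numpy as np

ritz_vals, Y = np.linalg.eig(Hm)     # Ritz values and eigenvectors y_i of H_m
ritz_vecs = Vm @ Y                   # Ritz vectors u_i = V_m y_i (columns)

eigs_A = np.linalg.eigvals(A)        # reference spectrum (small test matrix only)
for lam in ritz_vals:
    # distance from each Ritz value to the nearest eigenvalue of A
    print(lam, np.min(np.abs(eigs_A - lam)))
\end{verbatim}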

The original algorithm consists of increasing $m$ until all desired eigenvalues of $A$ are found. For large matrices, this becomes costly both in terms of computation and storage. In terms of storage, we need to keep $m$ vectors of length $n$ plus an $ m \times m $ Hessenberg matrix, a total of approximately $ n m + m ^2 /2 $. For the arithmetic costs, at step $j$ we need to multiply $v_j$ by $A$, at the cost of $ 2 \times N_z$, where $N_z$ is the number of nonzero elements in $A$, and then orthogonalize the result against $j$ vectors at the cost of $4 (j+1) n, $ which increases with the step number $j$. Summed over the $m$ steps, an $m$-dimensional Arnoldi procedure therefore costs $\approx n m + m ^2 /2 $ in storage and $\approx 2 m N_z + 2 n m ^2 $ in arithmetic operations.
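For instance (an illustrative order-of-magnitude estimate with invented figures), for a sparse matrix with $ n = 10^6 $, $ N_z = 10^7 $ (an average of ten nonzeros per row), and $ m = 50 $, the storage is roughly $ n m + m^2/2 \approx 5 \times 10^{7} $ entries, while the arithmetic cost is roughly $ 2 m N_z + 2 n m^2 = 10^{9} + 5 \times 10^{9} = 6 \times 10^{9} $ floating point operations.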

Obtaining the residual norm for a Ritz pair as the algorithm progresses is fairly inexpensive. Let $ y_i \sup{m}$ be an eigenvector of $H_m$ associated with the eigenvalue $ \lambda_i \sup{m}$, and let $u_i \sup{m}$ be the Ritz approximate eigenvector $ u_i \sup{m}= V_m y_i \sup{m}$. Multiplying relation (123) on the right by $ y_i \sup{m}$ and using $ H_m y_i \sup{m}= \lambda_i \sup{m}y_i \sup{m}$, we obtain the relation

\begin{displaymath}
(A - \lambda_i \sup{m}I ) u_i \sup{m}= h_{m+1,m}(e_m^{\ast} y_i\sup{m})v_{m+1},
\end{displaymath}

and, therefore,

\begin{displaymath}
\Vert ( A - \lambda_i \sup{m}I ) u_i \sup{m}\Vert _2 = h_{m+1,m} \vert e_m^{\ast} y_i
\sup{m}\vert \ .
\end{displaymath}

Thus, the residual norm is equal to the absolute value of the last component of the eigenvector $
y_i \sup{m}$ multiplied by $ h_{m+1,m} $. The residual norms are not always indicative of actual errors in $ \lambda_i \sup{m}$, but can be quite helpful in deriving stopping procedures.
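The following sketch illustrates this estimate numerically (assuming the quantities A, H, Hm, Y, ritz_vals, and ritz_vecs from the sketches above, which are choices made here): the cheap estimate $ h_{m+1,m} \vert e_m^{\ast} y_i \sup{m}\vert $ matches the explicitly computed residual norms up to rounding.

\begin{verbatim}
import numpy as np

m = Hm.shape[0]
h_last = abs(H[m, m - 1])                 # h_{m+1,m}
estimate = h_last * np.abs(Y[-1, :])      # h_{m+1,m} |e_m^* y_i| for each i

# explicit residual norms ||A u_i - lambda_i u_i||_2 (note ||u_i||_2 = 1)
explicit = np.array([
    np.linalg.norm(A @ ritz_vecs[:, i] - ritz_vals[i] * ritz_vecs[:, i])
    for i in range(len(ritz_vals))
])
print(np.max(np.abs(estimate - explicit)))   # agreement up to rounding
\end{verbatim}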

