Basic Properties

Next: Algorithm Up: Band Lanczos Method Previous: Deflation Contents Index

Basic Properties

After iterations, the algorithm has generated the vectors (7.62). It will be convenient to introduce the notation

$\begin{displaymath} V_j = \left[ \begin{array}{cccc} v_1 & v_2 & \cdots & v_j ... ...in{array}{cccc} w_1 & w_2 & \cdots & w_j \end{array} \right] \end{displaymath}$

(169)

for the matrices whose columns are just the right and left Lanczos vectors (7.62), respectively. The vectors (7.62) are constructed to be biorthogonal. Using the notation (7.63), the biorthogonality can be stated compactly as follows:

$\begin{displaymath} W_j^T V_j = D_j = {\mathop{\rm diag}\nolimits}\left(\delta_1, \delta_2,\ldots,\delta_j\right). \end{displaymath}$

(170)

In order to enforce biorthogonality of the next Lanczos vectors, the algorithm involves division by the diagonal entries, $\delta_k$ , in (7.64). As a result, the algorithm has to be stopped as soon as

$\begin{displaymath} \delta_k = w_k^T v_k = 0,\quad \mbox{but}\quad v_k,\; w_k \not=0, \end{displaymath}$

(171)

occurs. The situation (7.65) is called a breakdown of the algorithm. While breakdowns can be remedied by incorporating so-called look-ahead into the algorithm, here, for simplicity, we discuss only the band Lanczos algorithm without look-ahead.

After iterations, in addition to (7.62), the algorithm has constructed the vectors

$\begin{displaymath} \hat{v}_{j+1},\hat{v}_{j+2},\ldots,\hat{v}_{j+m_c} \quad \mbox{and}\quad \hat{w}_{j+1},\hat{w}_{j+2},\ldots,\hat{w}_{j+p_c}. \end{displaymath}$

(172)

The vectors $\hat{v}_{j+1},\hat{v}_{j+2},\ldots,\hat{v}_{j+m_c}$ are the candidates for the next

right Lanczos vectors, $v_{j+1},v_{j+2},\ldots,v_{j+m_c}$ , and the vectors $\hat{w}_{j+1},\hat{w}_{j+2},\ldots,\hat{w}_{j+p_c}$ are the candidates for the next

left Lanczos vectors, $w_{j+1},w_{j+2},\ldots,w_{j+p_c}$ . Here,

is the number of right starting vectors,

, minus the number of deflations in the right block Krylov sequence that have occurred during the first

iterations, and

is the number of left starting vectors,

, minus the number of deflations in the left block Krylov sequence that have occurred during the first

iterations. The vectors (7.66) are constructed so that they satisfy the biorthogonality relations

$\begin{displaymath} \begin{array}{rl} W_j^T \hat{v}_{k} &\!\!\! = 0 \quad \mbox{... ... \quad \mbox{for all} \quad k=j+1,j+2,\ldots,j+p_c. \end{array}\end{displaymath}$

(173)

The algorithm has a very simple built-in deflation procedure based on the vectors (7.66). In fact, an exact deflation in the right block Krylov sequence at iteration

is equivalent to $\hat{v}_{j+1}=0$ . Therefore, in the algorithm, one checks if $\left\Vert\hat{v}_{j+1}\right\Vert _2$ is smaller than some suitable deflation tolerance. If yes, the vector $\hat{v}_{j+1}$ is deflated and

is reduced by 1. Otherwise, $\hat{v}_{j+1}$ is normalized to become the next right Lanczos vector $v_{j+1}$ . Similarly, an exact deflation in the left block Krylov sequence at iteration

is equivalent to $\hat{w}_{j+1}=0$ . In the algorithm, one checks if $\left\Vert\hat{w}_{j+1}\right\Vert _2$ is smaller than the deflation tolerance. If yes, the vector $\hat{w}_{j+1}$ is deflated and

is reduced by 1. Otherwise, $\hat{w}_{j+1}$ is normalized to become the next left Lanczos vector $w_{j+1}$ .

The recurrences that are used in the algorithm to generate the vectors (7.62) and (7.66) can be summarized compactly as follows:

$\begin{displaymath} \begin{array}{rl} A V_j &\!\!\! = V_j T_j + \left[ \begin{ar... ..._c} \end{array} \right] + \hat{W}_j^{\rm {(dl)}}. \end{array}\end{displaymath}$

(174)

Here,

and $\tilde{T}_j$ are $j\times j$ matrices whose entries are chosen so that the biorthogonality conditions (7.64) and (7.67) are satisfied. The matrix $\hat{V}_j^{\rm {(dl)}}$ in (7.68) contains mostly zero columns, together with the $\hat{v}_k$ vectors that have been deflated during the first

iterations. The matrix $\hat{W}_j^{\rm {(dl)}}$ in (7.68) contains mostly zero columns, together with the $\hat{w}_k$ vectors that have been deflated during the first

iterations. We remark that

is the number of deflated $\hat{v}_k$ vectors and that

is the number of deflated $\hat{w}_k$ vectors. It turns out that biorthogonality only has to be explicitly enforced among

consecutive Lanczos vectors and, once deflation has occurred, against

fixed earlier left Lanczos vectors, respectively,

fixed earlier right Lanczos vectors. As a result, the matrices

and $\tilde{T}_j$ are ``essentially'' banded. More precisely,

has lower bandwidth

and upper bandwidth

, where the lower bandwidth is reduced by 1 every time a $\hat{v}_{k}$ vector is deflated, and the upper bandwidth is reduced by 1 every time a $\hat{w}_{k}$ vector is deflated. In addition, each deflation of a $\hat{w}_{k}$ vector causes

to have nonzero elements in a fixed row outside and to the right of the banded part. More precisely, the row index of each such row caused by deflation of a $\hat{w}_{k}$ vector is given by

, where

is the number of the iteration at which the deflation has occurred and

is the corresponding value of

at that iteration. The matrix

can thus be written as

$\begin{displaymath} T_j = T_j^{\rm {(b)}} + T_j^{\rm {(d)}}, \end{displaymath}$

(175)

where $T_j^{\rm {(b)}}$ is a banded matrix and $T_j^{\rm {(d)}}$ contains horizontal ``spikes'' above the band of

due to deflation of $\hat{w}_{k}$ vectors. Similarly,

$\begin{displaymath} \tilde{T}_j = \tilde{T}_j^{\rm {(b)}} + \tilde{T}_j^{\rm {(d)}}, \end{displaymath}$

where the banded part $\tilde{T}_j^{\rm {(b)}}$ has lower bandwidth

and upper bandwidth

, and $\tilde{T}_j^{\rm {(d)}}$ contains horizontal ``spikes'' above the band of $\tilde{T}_j$ due to deflation of $\hat{v}_{k}$ vectors.

The entries of the matrices and $\tilde{T}_j$ are not independent of each other. More precisely, setting

$\begin{displaymath} T_j^{\rm (pr)} = T_j + D_j^{-1} \left(\tilde{T}_j^{\rm {(d)... ...= \tilde{T}_j + D_j^{-1} \left(T_j^{\rm {(d)}}\right)^T D_j, \end{displaymath}$

(176)

we have

$\begin{displaymath} T_j^{\rm (pr)} = D_j^{-1} \left(\tilde{T}_j^{\rm (pr)}\right)^T D_j, \end{displaymath}$

(177)

where

is the diagonal matrix given by (7.64). Inserting (7.69) into the definition of $T_j^{\rm (pr)}$ in (7.70), we obtain the relation

$\begin{displaymath} T_j^{\rm (pr)} = T_j^{\rm {(b)}} + T_j^{\rm {(d)}} + D_j^{-1} \left(\tilde{T}_j^{\rm {(d)}}\right)^T D_j, \end{displaymath}$

which shows that $T_j^{\rm (pr)}$ consists of the banded part $T_j^{\rm {(b)}}$ , horizontal spikes due to deflation of $\hat{w}_{k}$ vectors above the banded part, and vertical spikes due to deflation of $\hat{v}_{k}$ vectors below the banded part.

For example, consider the case of right and left starting vectors. Assume that during the first iterations, deflations of $\hat{w}_k$ vectors have occurred at iterations , , and , and deflations of $\hat{v}_k$ vectors have occurred at iterations and . In this case, the matrix $T^{(\rm pr)}_{15}$ has the following sparsity structure:

$\begin{displaymath} T_{15}^{\rm {(pr)}} = %%\footnotesize{ \left[ \begin{array}{... ...& & & & & &\tilde{{\tt d}}& & & &{*}&{*} \end{array} \right]. \end{displaymath}$

Here, the

's denote potentially nonzero entries within the banded part, $T_{15}^{\rm {(b)}}$ ; the ${\tt d}$ 's denote potentially nonzero entries due to the deflations of $\hat{w}_k$ vectors at iterations

, and

; and the $\tilde{{\tt d}}$ 's denote potentially nonzero entries due to the deflations of $\hat{v}_k$ vectors at iterations

and

. Note that the deflations have reduced the initial lower bandwidth

at iteration

and the initial upper bandwidth

at iteration

After iterations of the band Lanczos algorithm, approximate eigensolutions for the NHEP (7.58) are obtained via an oblique projection of the matrix onto the subspace spanned by the columns of and orthogonal to the subspace spanned by the columns of . More precisely, this means that we are seeking approximate eigenvectors of (7.58) of the form $x_i^{(j)} = V_j z_i^{(j)}$ and that, after inserting this ansatz for $x_i^{(j)}$ into (7.58), we multiply the resulting relation from the left by . This yields the generalized eigenvalue problem

$\begin{displaymath} W_j^T A V_j z_i^{(j)} = \theta_i^{(j)} W_j^T V_j z_i^{(j)},\quad i=1,2,\ldots,j. \end{displaymath}$

(178)

Using the biorthogonality relations (7.64) and (7.67), it is easy to verify that the matrix $T_j^{\rm (pr)}$ defined in (7.70) satisfies

$\begin{displaymath} T_j^{\rm {(pr)}} = \left(W_j^T V_j\right)^{-1} W_j^T A V_j. \end{displaymath}$

(179)

By (7.73), the generalized eigenvalue problem (7.72) is equivalent to the standard eigenvalue problem

$\begin{displaymath} T_j^{\rm {(pr)}} z_i^{(j)} = \theta_i^{(j)} z_i^{(j)},\quad i=1,2,\ldots,j. \end{displaymath}$

We stress that, in the algorithm, we use the formula in (7.70) to obtain the entries of $T_j^{\rm (pr)}$ , rather than (7.73).

The band Lanczos algorithm terminates as soon as or is reached. In the case , deflations of $\hat{v}_k$ vectors have occurred, and thus the right block Krylov sequence (7.60) is exhausted. In the case , deflations of $\hat{w}_k$ vectors have occurred, and thus the left block Krylov sequence (7.61) is exhausted.

First consider termination due to . Then, the relation for the right Lanczos vectors in (7.68) can be rewritten as follows:

$\begin{displaymath} A V_j - V_j T_j^{\rm {(pr)}} = P_j \hat{V}_j^{\rm {(dl)}}. \end{displaymath}$

(180)

Here, the matrix

$\begin{displaymath} P_j = I - V_j D_j^{-1} W_j^T,\quad \mbox{where}\quad D_j = W_j^T V_j, \end{displaymath}$

(181)

represents the oblique projection characterized by

and

for all

in the null space of

. Now let $\theta_i^{(j)}$ and $z_i^{(j)}$ be any of the eigenpairs of $T_j^{\rm {(pr)}}$ , and assume that $z_i^{(j)}$ is normalized so that $\left\Vert z_i^{(j)}\right\Vert _2 = 1$ . Recall that the pair $\theta_i^{(j)}$ and $x_i^{(j)} = V_j z_i^{(j)}$ is used as an approximate eigensolution of

. From (7.74), it follows that the residual of this approximate eigensolution can be bounded as follows:

$\begin{displaymath} \left\Vert A x_i^{(j)} - \theta_i^{(j)} x_i^{(j)} \right\Ver... ...rt _2 \cdot \left\Vert \hat{V}_j^{\rm {(dl)}} \right\Vert _2. \end{displaymath}$

(182)

In particular, if only exact deflation is performed, then $\hat{V}_j^{\rm {(dl)}} = 0$ and (7.76) shows that each eigenvalue $\theta_i^{(j)}$ of $T_j^{\rm {(pr)}}$ is indeed an eigenvalue of

Similarly, in the case of termination due to , the relation for the left Lanczos vectors in (7.68) can be rewritten as follows:

$\begin{displaymath} A^T W_j - W_j \tilde{T}_j^{\rm {(pr)}} = P_j^T \hat{W}_j^{\rm {(dl)}}. \end{displaymath}$

(183)

Here,

is again the matrix defined in (7.75). Now let $\theta_i^{(j)}$ and $\tilde{z}_i^{(j)}$ be any of the eigenpairs of $(T_j^{\rm {(pr)}})^T$ , and assume that $\tilde{z}_i^{(j)}$ is normalized such that $\Vert D_j^{-1} \tilde{z}_i^{(j)}\Vert _2 = 1$ . Note that the complex conjugate of $\tilde{z}_i^{(j)}$ is a left eigenvector of $T_j^{\rm {(pr)}}$ . The pair $\theta_i^{(j)}$ and $y_i^{(j)} = W_j D_j^{-1} \tilde{z}_i^{(j)}$ then represents an approximate eigensolution of

. From (7.77), it follows that the residual of this approximate eigensolution can be bounded as follows:

$\begin{displaymath} \left\Vert A^T y_i^{(j)} - \theta_i^{(j)} y_i^{(j)} \right\V... ...rt _2 \cdot \left\Vert \hat{W}_j^{\rm {(dl)}} \right\Vert _2. \end{displaymath}$

(184)

In particular, if only exact deflation is performed, then $\hat{W}_j^{\rm {(dl)}} = 0$ and (7.78) shows that each eigenvalue $\theta_i^{(j)}$ of $T_j^{\rm {(pr)}}$ is indeed an eigenvalue of

and thus of

Next: Algorithm Up: Band Lanczos Method Previous: Deflation Contents Index

Susan Blackford 2000-11-20