next up previous
Next: Local Memory Requirements Up: Precise Storage and I/O Previous: Precise Storage and I/O

Choosing the Ordering of the Block Loops

A consequence of the requirement that tex2html_wrap_inline2078 is that the block loops may appear in any order. Suppose, without loss of generality, that
displaymath2392
Then the flux of data per unit surface area across the faces of the tiles normal to tex2html_wrap_inline2396 is greater than that across the other faces. We would choose to make the tex2html_wrap_inline2398 block loop innermost. This is because we would avoid storing to memory the data that flow across the faces normal to tex2html_wrap_inline2396 when going from one tile to the next. This has the effect, for example, of causing us to choose a ``left-looking'' block Gaussian elimination or block Householder QR method in preference to a ``right-looking'' method, which helps to reduce the memory traffic further. See the examples of Section 7 for illustration of how this technique should be applied.



Jack Dongarra
Tue Feb 18 15:39:11 EST 1997