We break down the problem into two subcases depending on the values taken by
the function f, whose argument ranges from 1 to ;
This result is disappointing in that we end up with degenerate tiles in most
practical situations. For instance, if (which is very
likely to happen in practice), then
, and the optimal tile
size is
, hardly a coarse-grain tiling!
The flaw is that the model is not accurate enough to capture the impact of data
locality and data reuse, which are the main objectives of tiling.
A first solution is to model the computation cost of a tile by
an affine expression such as
, where u represents some
access overhead. It is then straightforward to plug this expression into the
formula given for the total execution time T and to derive the
optimal tile size.
Another solution is to assume a fixed tile size, imposed by
a priori considerations such as the cache size. Again,
we can let
in Equation (1), and minimize T for
, say.
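As a purely illustrative sketch of the cache-driven choice of a fixed tile size, one might pick the largest square tile whose working set fits in cache. The cache size, the element width, and the assumption of three resident tiles (e.g. the three operands of a blocked matrix product) are hypothetical parameters for the example, not values from the text:

```python
import math

def tile_size_for_cache(cache_bytes: int, elem_bytes: int = 8,
                        n_tiles: int = 3) -> int:
    # Assume the working set of a b x b tile computation holds n_tiles
    # square tiles of elem_bytes-wide elements; return the largest b
    # such that n_tiles * b**2 * elem_bytes <= cache_bytes.
    return math.isqrt(cache_bytes // (n_tiles * elem_bytes))

# For a hypothetical 32 KB first-level cache and double-precision data:
b = tile_size_for_cache(32 * 1024)
```

This fixed b would then be substituted into the execution-time formula, leaving T to be minimized over the remaining free parameters.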