PBLAS -- Introduction
Parallel Basic Linear Algebra Subprograms for distributed-memory MIMD computers
PBLAS routines perform both the communication and the computation.
Simplification: parallelizing a code is easier, especially when it is already BLAS-based,
Modularity: gives the programmer larger building blocks,
Portability: machine dependencies are confined to the BLAS and BLACS.
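
As a rough illustration of a single routine that handles both communication and computation, the sketch below simulates a row-distributed matrix-vector product in one Python process. It makes no PBLAS, BLACS, or MPI calls. The names `local_gemv`, `allgather`, and `toy_pmatvec` are hypothetical stand-ins invented for this example, and the row-block layout is an assumption chosen for brevity.

```python
# Toy single-process simulation: "processes" are just list entries.
# All function names here are hypothetical and are not part of PBLAS or BLACS.

def local_gemv(a_rows, x):
    """Stand-in for a local BLAS call: dense matrix-vector product."""
    return [sum(a * b for a, b in zip(row, x)) for row in a_rows]

def allgather(pieces):
    """Stand-in for the communication layer: every process receives all pieces."""
    full = [v for piece in pieces for v in piece]
    return [list(full) for _ in pieces]

def toy_pmatvec(a_blocks, x_pieces):
    """PBLAS-like building block: communication first, then local computation."""
    x_copies = allgather(x_pieces)  # communication step
    return [local_gemv(a, x) for a, x in zip(a_blocks, x_copies)]  # computation step

# Two simulated processes, each owning two rows of a 4x4 matrix and two entries of x.
a_blocks = [
    [[1, 0, 0, 0], [0, 2, 0, 0]],
    [[0, 0, 3, 0], [1, 1, 1, 1]],
]
x_pieces = [[1, 2], [3, 4]]

y_pieces = toy_pmatvec(a_blocks, x_pieces)
print(y_pieces)  # each inner list is the part of y owned by one simulated process
```

The caller sees one call, `toy_pmatvec`, and never touches the data exchange directly. Anything machine-specific would sit inside the two lower-level stand-ins, which is the role the slide assigns to the BLAS and BLACS.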