ScaLAPACK Implementation
BLAS (Performance and Portability)
- Blocked data access (Level 3 BLAS) yields good local performance,
- Portability: standardization of efficient kernels.
BLACS (Performance and Portability)
- Correct level of notation: communication of matrices,
- Efficiency: identify frequent linear algebra operations which can be optimized on various computers.