Goals - Port LAPACK to distributed-memory environments.

Efficiency
- Optimized compute and communication engines
- Block-partitioned algorithms (Level 3 BLAS) utilize memory hierarchy and yield good node performance

Flexibility
- Modularity: Build rich set of linear algebra tools: BLAS, BLACS, PBLAS

Efficiency