Operations Currently Available
Real matrix multiply
- Basic building block for Level 3 BLAS operations
- Competitive with vendor GEMM across most platforms
- Same code generator & approach may be used for all Level 3 BLAS routines
- Threaded code written & tested
GEMM-based BLAS are publicly available
- Bo Kågström & Per Ling, Umeå University
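
As a minimal sketch of what "GEMM-based" means here, the C fragment below computes a symmetric matrix multiply (the SYMM operation, C += A*B with A symmetric and only its lower triangle stored) almost entirely through calls to a general matrix multiply. It is an illustration only, not the Kågström & Ling code or any library's actual implementation: the names gemm_ref and symm_lower_via_gemm, the column-major layout, and the block size parameter nb are assumptions made for this example. In practice gemm_ref would be replaced by a tuned GEMM.

```c
#include <stddef.h>

/* Hypothetical reference GEMM: C (m x n) += op(A) * B, column-major.
 * op(A) is m x k; transA != 0 means op(A) = A^T. A tuned GEMM would
 * normally take the place of this routine. */
static void gemm_ref(int transA, int m, int n, int k,
                     const double *A, int lda,
                     const double *B, int ldb,
                     double *C, int ldc)
{
    for (int j = 0; j < n; j++)
        for (int p = 0; p < k; p++) {
            double b = B[p + (size_t)j * ldb];
            for (int i = 0; i < m; i++) {
                double a = transA ? A[p + (size_t)i * lda]
                                  : A[i + (size_t)p * lda];
                C[i + (size_t)j * ldc] += a * b;
            }
        }
}

/* Hypothetical GEMM-based SYMM: C (n x m) += A * B, where A is n x n
 * symmetric and only its lower triangle is referenced. A is split into
 * nb x nb blocks; every off-diagonal block product is one GEMM call. */
static void symm_lower_via_gemm(int n, int m, int nb,
                                const double *A, int lda,
                                const double *B, int ldb,
                                double *C, int ldc)
{
    for (int i0 = 0; i0 < n; i0 += nb) {
        int ib = (n - i0 < nb) ? n - i0 : nb;
        for (int j0 = 0; j0 < n; j0 += nb) {
            int jb = (n - j0 < nb) ? n - j0 : nb;
            if (j0 < i0) {
                /* Block lies below the diagonal: stored, use as is. */
                gemm_ref(0, ib, m, jb, &A[i0 + (size_t)j0 * lda], lda,
                         &B[j0], ldb, &C[i0], ldc);
            } else if (j0 > i0) {
                /* Block lies above the diagonal: use the transpose of
                 * the stored mirror block A(j0.., i0..). */
                gemm_ref(1, ib, m, jb, &A[j0 + (size_t)i0 * lda], lda,
                         &B[j0], ldb, &C[i0], ldc);
            } else {
                /* Small diagonal block: read the lower triangle only. */
                for (int c = 0; c < m; c++)
                    for (int q = 0; q < jb; q++)
                        for (int p = 0; p < ib; p++) {
                            int r = i0 + p, s = j0 + q;
                            double a = (r >= s) ? A[r + (size_t)s * lda]
                                                : A[s + (size_t)r * lda];
                            C[r + (size_t)c * ldc] += a * B[s + (size_t)c * ldb];
                        }
            }
        }
    }
}
```

With a block size nb much smaller than n, nearly all of the arithmetic falls in the two gemm_ref calls, so the speed of the whole routine follows the speed of GEMM. This is the sense in which matrix multiply serves as the building block for the other Level 3 operations.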