Recursive TRMM
• Recur down to L1 cache block size
• Need kernel at bottom of recursion
  → Use gemm-based kernel for portability
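
The recursion can be sketched in a few lines. This is a minimal illustration, not the implementation behind the slide: it assumes the left-sided, lower-triangular, non-transposed case (B := L·B), and the names `trmm_lower`, `gemm_kernel`, `gemm_acc` and the `block` parameter (a stand-in for the L1 cache block size) are hypothetical. The bottom-level kernel pads the triangle with explicit zeros and hands it to a plain GEMM, which is one way to read "gemm-based kernel".

```python
def gemm_acc(A, B, C):
    """Plain GEMM update C += A @ B on lists of lists (C is modified in place)."""
    for i, a_row in enumerate(A):
        for p, a in enumerate(a_row):
            for j, b in enumerate(B[p]):
                C[i][j] += a * b


def gemm_kernel(L, B, lo, hi):
    """Bottom of the recursion: B[lo:hi] := L[lo:hi, lo:hi] @ B[lo:hi] via GEMM.

    The triangular block is copied into a dense square with explicit zeros
    above the diagonal, so no triangular-specific code is needed.
    """
    T = [[L[i][j] if j <= i else 0.0 for j in range(lo, hi)]
         for i in range(lo, hi)]
    X = [row[:] for row in B[lo:hi]]      # copy of the input block
    out = B[lo:hi]                        # shares row objects with B
    for row in out:
        row[:] = [0.0] * len(row)         # clear, then accumulate into it
    gemm_acc(T, X, out)


def trmm_lower(L, B, lo=0, hi=None, block=2):
    """Overwrite rows lo..hi-1 of B with L[lo:hi, lo:hi] @ B[lo:hi].

    block is a hypothetical stand-in for the L1 cache block size (>= 1).
    """
    if hi is None:
        hi = len(L)
    n = hi - lo
    if n <= block:
        gemm_kernel(L, B, lo, hi)
        return
    mid = lo + n // 2
    # With L = [[L11, 0], [L21, L22]] and B = [B1; B2]:
    trmm_lower(L, B, mid, hi, block)          # B2 := L22 @ B2
    L21 = [row[lo:mid] for row in L[mid:hi]]
    gemm_acc(L21, B[lo:mid], B[mid:hi])       # B2 += L21 @ B1 (B1 still old)
    trmm_lower(L, B, lo, mid, block)          # B1 := L11 @ B1


if __name__ == "__main__":
    L = [[1, 0, 0], [2, 3, 0], [4, 5, 6]]
    B = [[1, 2], [3, 4], [5, 6]]
    trmm_lower(L, B, block=1)
    print(B)
```

The order of the three steps matters: B2 has to be finished while B1 still holds its original values, so the B1 update comes last. In this sketch all off-diagonal work lands in GEMM calls, and the triangular structure is only handled at the block-sized leaves.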