• Performance instability
  ◦ Small changes in the architecture may cause dramatic changes in delivered performance.
• Latency-tolerant and bandwidth-aware algorithms and software are critical
  ◦ Sometimes this means recomputing a value rather than storing and reloading it
• Need to help the compiler
• Today we have a hard time getting performance
  ◦ Only going to get harder
• BLAS as a starting point
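
As a rough illustration of the "recompute rather than store/load" point, the C sketch below computes the same weighted sum two ways: once by reading precomputed weights from an array, and once by recomputing each weight inside the loop. The function names and the weight formula `0.5 * i + 1.0` are hypothetical and chosen only for this example; they do not come from the slides. The `restrict` qualifiers are one small way of "helping the compiler", since they promise that the arrays do not overlap. Whether recomputing actually wins depends on the machine and on how expensive the recomputed expression is, so this is a sketch to measure, not a rule.

```c
#include <stddef.h>

/* Hypothetical weight formula, used only for illustration. */
static double weight(size_t i)
{
    return 0.5 * (double)i + 1.0;
}

/* Precompute the weights into memory (the "store" half of store/load). */
void fill_weights(size_t n, double *restrict w)
{
    for (size_t i = 0; i < n; ++i)
        w[i] = weight(i);
}

/* Store/load variant: streams two arrays (x and w) through memory. */
double dot_weighted_stored(size_t n, const double *restrict x,
                           const double *restrict w)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; ++i)
        sum += w[i] * x[i];
    return sum;
}

/* Recompute variant: streams only x and rebuilds each weight in registers,
 * trading a few extra arithmetic operations for less memory traffic. */
double dot_weighted_recomputed(size_t n, const double *restrict x)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; ++i)
        sum += weight(i) * x[i];
    return sum;
}
```

Both variants return the same value for the same input; the difference is only in how much data has to move through the memory system. On a bandwidth-limited loop, the second form reads half as many doubles per iteration.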