Why Never Obtained
The bandwidth from the memory to the cache is insufficient.
The bandwidth from the cache to registers/functional units is insufficient.
Startup time or latency is too long.
The Communication time is too large wrt the computation time.
The mix of floating point ops is not suitable.