Achieving High Performance on a Distributed-Memory Computer
Use the right number of processors
- Rule of thumb: P=MxN/1,000,000 for an MxN matrix. This provides a local matrix of size approximately 1000-by-1000.
- Do not try to solve a small problem on too many processors.
- Do not exceed physical memory.