
Athlon 1.2Ghz results == P4 1.5



Guys,

I include below the new stuff on a 1.2GHz Athlon with DDR memory.  This is
Julian's kernel, with Peter's mods, plus the new cacheedge, and we are talking
over 80% of peak in out-of-cache performance on a 1.2GHz machine.  If
you compare these numbers in double precision (real and complex), you will
find that the Athlon wins big for small-medium size probs, and ties for large
with a 1.5GHz P4.  It gets very close to the performance of an 800MHz IA64
as well.  I am impressed.  Of course, the P4 cleans up the place in single,
since it is using SSE for double the theoretical peak . . .
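
As a rough check of the "over 80% of peak" figure, here is a minimal Python sketch using the 3.3.11 dMM row from the table in this message. It assumes the table entries are MFLOPS and that the Athlon's theoretical peak is 2 flops per cycle (one add plus one multiply), i.e. 2400 MFLOPS at 1.2GHz; both are assumptions made for illustration, and the names below are hypothetical.

```python
# Assumption: peak is 2 floating-point ops per cycle (one add + one multiply).
CLOCK_MHZ = 1200.0
FLOPS_PER_CYCLE = 2.0
PEAK_MFLOPS = CLOCK_MHZ * FLOPS_PER_CYCLE  # 2400 MFLOPS under that assumption

# Problem sizes and the 3.3.11 dMM row, copied from the table in this message.
SIZES = [100, 200, 300, 400, 500, 600, 700, 800, 900, 1000]
DMM_3_3_11 = [1470.6, 1550.0, 1800.0, 1828.6, 1785.7,
              1878.3, 1960.0, 1861.8, 1944.0, 1904.8]


def percent_of_peak(mflops, peak=PEAK_MFLOPS):
    """Return the measured rate as a percentage of the assumed peak."""
    return 100.0 * mflops / peak


if __name__ == "__main__":
    for n, rate in zip(SIZES, DMM_3_3_11):
        print(f"N={n:4d}  {rate:7.1f} MFLOPS  {percent_of_peak(rate):5.1f}% of peak")
```

Under those assumptions the best dMM entry (N=700) works out to a little under 82% of peak.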

You'll notice that the double complex performance is much better than the
last time I posted numbers; it was a problem with the cacheedge setting
screwing it up before . . .

Cheers,
Clint

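Results on 1.2GHz Athlon (columns are problem size N; entries are MFLOPS)
3.3.2  : old ATLAS version, using generated kernel
3.3.11 : ATLAS + Julian/Peter + CE (cacheedge)
s/d/c/z : single real, double real, single complex, double complex
MM/LU  : matrix multiply, LU factorization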

              100    200    300    400    500    600    700    800    900   1000
           ====== ====== ====== ====== ====== ====== ====== ====== ====== ======
3.3.11 sMM 1612.9 1710.3 1800.0 1828.6 1851.9 1963.6 1905.6 1896.3 1970.3 1941.7
3.3.11 sLU  781.6 1147.2 1323.0 1381.2 1456.1 1554.8 1593.6 1586.1 1618.7 1624.8

3.3.11 cMM 1771.4 1792.0 1878.3 1896.3 1886.8 1898.9 1905.6 1887.6 1924.8 1900.2
3.3.11 cLU 1068.3 1381.2 1438.2 1504.5 1549.2 1599.0 1603.8 1624.6 1660.8 1687.1

3.3.2  dMM 1136.4 1271.8 1388.6 1280.0 1315.8 1393.5 1372.0 1383.8 1429.4 1418.4
3.3.11 dMM 1470.6 1550.0 1800.0 1828.6 1785.7 1878.3 1960.0 1861.8 1944.0 1904.8
3.3.2  dLU  676.0  841.3  914.1  982.7  970.8 1027.3 1054.3 1065.7 1116.3 1110.3
3.3.11 dLU  714.6  952.4 1092.9 1188.5 1323.8 1307.5 1370.5 1391.9 1566.4 1448.2

3.3.11 zMM 1503.0 1600.0 1728.0 1765.5 1554.4 1838.3 1841.6 1820.4 1869.2 1834.5
3.3.11 zLU  947.4 1087.3 1227.7 1311.6 1359.5 1439.1 1428.4 1483.3 1530.0 1540.8
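
To put a number on the 3.3.2 vs 3.3.11 difference, here is a small Python sketch that computes the per-size speedup for the two double precision rows where both versions are listed (dMM and dLU). The data is copied from the table above; the dictionary and function names are hypothetical, chosen only for this illustration.

```python
SIZES = [100, 200, 300, 400, 500, 600, 700, 800, 900, 1000]

# Rows copied from the table above (same units as the table).
OLD = {  # 3.3.2: old ATLAS version, generated kernel
    "dMM": [1136.4, 1271.8, 1388.6, 1280.0, 1315.8,
            1393.5, 1372.0, 1383.8, 1429.4, 1418.4],
    "dLU": [676.0, 841.3, 914.1, 982.7, 970.8,
            1027.3, 1054.3, 1065.7, 1116.3, 1110.3],
}
NEW = {  # 3.3.11: ATLAS + Julian/Peter + CE
    "dMM": [1470.6, 1550.0, 1800.0, 1828.6, 1785.7,
            1878.3, 1960.0, 1861.8, 1944.0, 1904.8],
    "dLU": [714.6, 952.4, 1092.9, 1188.5, 1323.8,
            1307.5, 1370.5, 1391.9, 1566.4, 1448.2],
}


def speedups(old, new):
    """Return new/old for each problem size."""
    return [n / o for o, n in zip(old, new)]


if __name__ == "__main__":
    for routine in ("dMM", "dLU"):
        ratios = speedups(OLD[routine], NEW[routine])
        line = "  ".join(f"{r:4.2f}" for r in ratios)
        print(f"{routine} speedup (N=100..1000): {line}")
```

For these rows every ratio comes out above 1, so 3.3.11 is faster than 3.3.2 at every listed size.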