speedy. cat res/dMMRES MULADD LAT NB MU NU KU FFTCH IFTCH NFTCH MFLOP 0 5 40 2 1 40 0 2 1 197.94 16
We generate and time this case by:
make mmcase muladd=0 lat=5 nb=40 mu=2 nu=1 ku=40 beta=1 dNB=40, ldc=40, mu=2, nu=1, ku=40, lat=5: time=0.490000, mflop=199.575510 dNB=40, ldc=40, mu=2, nu=1, ku=40, lat=5: time=0.500000, mflop=195.584000 dNB=40, ldc=40, mu=2, nu=1, ku=40, lat=5: time=0.490000, mflop=199.575510
We test that the generator isn't out of its mind by:
make mmtstcase muladd=0 lat=5 nb=40 mu=2 nu=1 ku=40 beta=1
Note that when timing/testing the generator, varying the parameters such as mu, nu, ku, beta, etc., generates different codes, and thus different performance numbers:
make mmcase muladd=0 lat=4 nb=40 mu=2 nu=1 ku=4 beta=1 dNB=40, ldc=40, mu=2, nu=1, ku=4, lat=4: time=0.760000, mflop=128.673684 dNB=40, ldc=40, mu=2, nu=1, ku=4, lat=4: time=0.770000, mflop=127.002597 dNB=40, ldc=40, mu=2, nu=1, ku=4, lat=4: time=0.770000, mflop=127.002597 make mmcase muladd=0 lat=4 nb=40 mu=2 nu=1 ku=8 beta=1 dNB=40, ldc=40, mu=2, nu=1, ku=8, lat=4: time=0.640000, mflop=152.800000 dNB=40, ldc=40, mu=2, nu=1, ku=8, lat=4: time=0.630000, mflop=155.225397 dNB=40, ldc=40, mu=2, nu=1, ku=8, lat=4: time=0.630000, mflop=155.225397 make mmcase muladd=0 lat=4 nb=40 mu=2 nu=2 ku=40 beta=1 dNB=40, ldc=40, mu=2, nu=2, ku=40, lat=4: time=2.550000, mflop=38.349804 dNB=40, ldc=40, mu=2, nu=2, ku=40, lat=4: time=2.560000, mflop=38.200000 dNB=40, ldc=40, mu=2, nu=2, ku=40, lat=4: time=2.550000, mflop=38.349804