Next: About this document
Up: ScaLAPACK: A Linear
Previous: Conclusions
References
- 1
-
M. ABOELAZE, N. CHRISOCHOIDES, AND E. HOUSTIS, The Parallelization
of Level 2 and 3 BLAS Operations on Distributed Memory Machines, Tech. Rep.
CSD-TR-91-007, Purdue University, West Lafayette, IN, 1991.
- 2
-
R. AGARWAL, F. GUSTAVSON, AND M. ZUBAIR, Improving Performance of
Linear Algebra Algorithms for Dense Matrices Using Algorithmic Prefetching,
IBM J. Res. Dev., 38 (1994), pp. 265-275.
- 3
-
E. ANDERSON, Z. BAI, C. BISCHOF, J. DEMMEL, J. DONGARRA, J. DUCROZ, A. GREENBAUM, S. HAMMARLING, A. MCKENNEY, S. OSTROUCHOV, AND D. SORENSEN,
``LAPACK Users' Guide, Second Edition'', SIAM, Philadelphia, PA,
1995.
- 4
-
C. ASHCRAFT, The Distributed Solution of Linear Systems Using the
Torus-wrap Data mapping, Tech. Rep. ECA-TR-147, Boeing Computer Services,
Seattle, WA, 1990.
- 5
-
L. S. BLACKFORD, J. CHOI, A. CLEARY, J. DEMMEL, I. DHILLON, J. DONGARRA, S. HAMMARLING, G. HENRY, A. PETITET, D. WALKER, AND R. C. WHALEY, ``
ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers
- Design Issues and Performance '', in Proceedings of the Supercomputer 96
Conference, IEEE Computer Society Press, November 1996.
- 6
-
R. BRENT, The LINPACK Benchmark on the AP 1000, in Frontiers,
1992, McLean, VA, 1992, pp. 128-135.
- 7
-
R. BRENT AND P. STRAZDINS, Implementation of BLAS Level 3 and
LINPACK Benchmark on the AP1000, Fujitsu Scientific and Technical Journal,
5 (1993), pp. 61-70.
- 8
-
J. CHOI, J. DEMMEL, I. DHILLON, J. DONGARRA, S. OSTROUCHOV, A. PETITET, K. STANLEY, D. WALKER, AND R. C. WHALEY, ScaLAPACK: A Portable Linear
Algebra Library for Distributed Memory Computers - Design Issues and
Performance, Computer Physics Communications, 97 (1996), pp. 1-15.
(also LAPACK Working Note #95).
- 9
-
J. CHOI, J. DONGARRA, S. OSTROUCHOV, A. PETITET, D. WALKER, AND R. C. WHALEY, A proposal for a set of parallel basic linear algebra
subprograms, LAPACK Working Note #100 Technical report UT CS-95-292,
University of Tennessee, 1995.
- 10
-
J. CHOI, J. DONGARRA, R. POZO, AND D. WALKER, ``ScaLAPACK: A
Scalable Linear Algebra Library for Distributed Memory Concurrent
Computers'', Tech. Rep. UT CS-92-181, LAPACK Working Note #55,
University of Tennessee, 1992.
- 11
-
J. CHOI, J. DONGARRA, AND D. WALKER, PB-BLAS: A Set of Parallel
Block Basic Linear Algebra Subroutines, Concurrency: Practice and
Experience, 8 (1996), pp. 517-535.
- 12
-
A. CHTCHELKANOVA, J. GUNNELS, G. MORROW, J. OVERFELT, AND R. VAN DE GEIJN, Parallel Implementation of BLAS: General Techniques for Level 3
BLAS, Tech. Rep. TR95-49, Department of Computer Sciences, UT-Austin, 1995.
Submitted to Concurrency: Practice and Experience.
- 13
-
E. CHU AND A. GEORGE, QR Factorization of a Dense Matrix on a
Hypercube Multiprocessor, SIAM Journal on Scientific and Statistical
Computing, 11 (1990), pp. 990-1028.
- 14
-
M. DAYDE, I. DUFF, AND A. PETITET, A Parallel Block Implementation
of Level 3 BLAS for MIMD Vector Processors, ACM Trans. Math. Softw., 20
(1994), pp. 178-193.
- 15
-
J. DONGARRA, J. DUCROZ, I. DUFF, AND S. HAMMARLING, ``A Set of
Level 3 Basic Linear Algebra Subprograms'', ACM Trans. Math. Softw., 16
(1990), pp. 1-28.
- 16
-
J. DONGARRA, J. DUCROZ, S. HAMMARLING, AND R. HANSON, ``An Extended
Set of Fortran Basic Linear Algebra Subprograms'', ACM Trans. Math. Softw.,
14 (1988), pp. 1-32.
- 17
-
J. DONGARRA, R. VAN DE GEIJN, AND D. WALKER, ``A Look at Scalable
Dense Linear Algebra Librairies'', Tech. Rep. UT CS-92-155, LAPACK
Working Note #43, University of Tennessee, 1992.
- 18
-
J. DONGARRA AND D. WALKER, Software Libraries for Linear Algebra
Computations on High Performance Computers, SIAM Review, 37 (1995),
pp. 151-180.
- 19
-
J. DONGARRA AND R. C. WHALEY, ``A User's Guide to the BLACS
v1.0'', Tech. Rep. UT CS-95-281, LAPACK Working Note #94, University
of Tennessee, 1995.
- 20
-
R. FALGOUT, A. SKJELLUM, S. SMITH, AND C. STILL, The Multicomputer
Toolbox Approach to Concurrent BLAS and LACS, in Proceedings of the
Scalable High Performance Computing Conference SHPCC-92, IEEE Computer
Society Press, 1992.
- 21
-
G. FOX, M. JOHNSON, G. LYZENGA, S. OTTO, J. SALMON, AND D. WALKER,
``Solving Problems on Concurrent Processors'', vol. 1, Prentice Hall,
Englewood Cliffs, N.J, 1988.
- 22
-
A. GEIST, A. BEGUELIN, J. DONGARRA, W. JIANG, R. MANCHEK, AND V. SUNDERAM, PVM : Parallel Virtual Machine. A Users' Guide and
Tutorial for Networked Parallel Computing, The MIT Press Cambridge,
Massachusetts, 1994.
- 23
-
G. GEIST AND C. ROMINE, LU Factorization Algorithms on Distributed
Memory Multiprocessor Architectures, SIAM Journal on Scientific and
Statistical Computing, 9 (1988), pp. 639-649.
- 24
-
B. HENDRICKSON AND D. WOMBLE, The Torus-wrap Mapping for Dense
Matrix Calculations on Massively Parallel Computers, SIAM Journal on
Scientific and Statistical Computing, 15 (1994), pp. 1201-1226.
- 25
-
G. HENRY AND R. VAN DE GEIJN, Parallelizing the QR Algorithm for
the Unsymmetric Algebraic Eigenvalue problem: Myths and Reality, Tech. Rep.
UT CS-94-244, LAPACK Working Note #79, University of Tennessee, 1994.
- 26
-
S. HUSS-LEDERMAN, E. JACOBSON, A. TSAO, AND G. ZHANG, Matrix
Multiplication on the Intel Touchstone DELTA, Concurrency: Practice and
Experience, 6 (1994), pp. 571-594.
- 27
-
B. KAGSTRfOM, P. LING, AND C. VAN LOAN, GEMM-Based Level 3 BLAS:
High-Performance Model Implementations and Performance Evaluation
Benchmark, Tech. Rep. UMINF 95-18, Department of Computing Science, Umea
University, 1995.
Submitted to ACM TOMS.
- 28
-
V. KUMAR, A. GRAMA, A. GUPTA, AND G. KARYPIS, Introduction to
Parallel Computing, The Benjamin/Cummings Publishing Company, Inc., Redwood
City, CA, 1994.
- 29
-
C. LAWSON, R. HANSON, D. KINCAID, AND F. KROGH, ``Basic Linear
Algebra Subprograms for Fortran Usage'', ACM Trans. Math. Softw., 5 (1979),
pp. 308-323.
- 30
-
W. LICHTENSTEIN AND S. L. JOHNSSON, Block-Cyclic Dense Linear
Algebra, SIAM Journal on Scientific and Statistical Computing, 14 (1993),
pp. 1259-1288.
- 31
-
R. SCHREIBER AND C. VAN LOAN, A storage efficient WY
representation for products of Householder transformations, SIAM J. Sci.
Stat. Comput., 10 (1989), pp. 53-57.
- 32
-
M. SNIR, S. W. OTTO, S. HUSS-LEDERMAN, D. W. WALKER, AND J. J. DONGARRA,
MPI: The Complete Reference, MIT Press, Cambridge, Massachusetts,
1996.
Jack Dongarra
Sat Feb 1 08:18:10 EST 1997