HPL - A Portable Implementation of the High-Performance Linpack Benchmark for Distributed-Memory Computers

Version 2.3

A. Petitet, R. C. Whaley, J. Dongarra, A. Cleary

December 2, 2018

HPL is a software package that solves a (random) dense linear system in double-precision (64-bit) arithmetic on distributed-memory computers. It can thus be regarded as a portable and freely available implementation of the High Performance Computing Linpack Benchmark.
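
As a rough, single-process illustration of the underlying problem, the sketch below solves a small random dense system by Gaussian elimination with row partial pivoting and then measures the residual. It is not HPL's code: the function names, the matrix size and the random value range are assumptions chosen for the example, and nothing here is distributed or blocked.

```python
import random


def solve_with_partial_pivoting(a, b):
    """Solve a x = b by Gaussian elimination with row partial pivoting."""
    n = len(a)
    # Work on copies so the caller's data is left untouched.
    a = [row[:] for row in a]
    b = b[:]
    for k in range(n):
        # Pivot search: row with the largest magnitude entry in column k.
        p = max(range(k, n), key=lambda i: abs(a[i][k]))
        if a[p][k] == 0.0:
            raise ValueError("matrix is singular")
        a[k], a[p] = a[p], a[k]
        b[k], b[p] = b[p], b[k]
        # Eliminate column k from the rows below the pivot.
        for i in range(k + 1, n):
            factor = a[i][k] / a[k][k]
            for j in range(k, n):
                a[i][j] -= factor * a[k][j]
            b[i] -= factor * b[k]
    # Backward substitution on the resulting upper triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(a[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (b[i] - s) / a[i][i]
    return x


def max_residual(a, b, x):
    """Largest absolute entry of a x - b."""
    return max(
        abs(sum(aij * xj for aij, xj in zip(row, x)) - bi)
        for row, bi in zip(a, b)
    )


# Hypothetical test problem: an 8 x 8 random system with a fixed seed.
rng = random.Random(0)
n = 8
a = [[rng.uniform(-0.5, 0.5) for _ in range(n)] for _ in range(n)]
b = [rng.uniform(-0.5, 0.5) for _ in range(n)]
x = solve_with_partial_pivoting(a, b)
print("max residual:", max_residual(a, b, x))
```

A small residual indicates that the computed solution satisfies the system to roughly working precision, which is the kind of accuracy check a benchmark run needs alongside its timing.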

The algorithm used by HPL can be summarized by the following keywords: two-dimensional block-cyclic data distribution; right-looking variant of the LU factorization with row partial pivoting featuring multiple look-ahead depths; recursive panel factorization with pivot search and column broadcast combined; various virtual panel broadcast topologies; bandwidth-reducing swap-broadcast algorithm; backward substitution with look-ahead of depth 1.
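
To make the first keyword concrete, here is a minimal sketch of a two-dimensional block-cyclic mapping. It assumes square blocks of size nb and a p x q process grid with blocks dealt out starting from process (0, 0); the function and parameter names are illustrative and are not taken from the HPL sources.

```python
def block_cyclic_owner(i, j, nb, p, q):
    """Return the (process row, process column) that owns global entry (i, j)."""
    return (i // nb) % p, (j // nb) % q


def local_index(g, nb, nprocs):
    """Map a global index along one dimension to the owning process's local index."""
    block = g // nb  # global block number
    # The owner already holds block // nprocs earlier blocks of size nb.
    return (block // nprocs) * nb + g % nb


# Print the owner of every entry of an 8 x 8 matrix, assuming 2 x 2 blocks
# distributed over a 2 x 2 process grid.
for i in range(8):
    print(" ".join("%d%d" % block_cyclic_owner(i, j, 2, 2, 2) for j in range(8)))
```

Because consecutive blocks go to different processes in both dimensions, every process keeps a share of the trailing submatrix as the factorization advances, which is the usual motivation for this layout.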

The HPL package provides a testing and timing program to quantify the accuracy of the obtained solution as well as the time it took to compute it. The best performance achievable by this software on your system depends on a large variety of factors. Nonetheless, with some restrictive assumptions on the interconnection network, the algorithm described here and its attached implementation are scalable in the sense that their parallel efficiency remains constant with respect to the per-processor memory usage.
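
The relationship between memory, problem size and achieved rate can be sketched with some back-of-the-envelope arithmetic. The helpers below are assumptions made for illustration, not part of HPL: they use only the leading-order LU operation count of 2/3 n^3 (lower-order terms are dropped), 8 bytes per matrix entry, and a memory fraction of 0.8 that is merely a rule-of-thumb default.

```python
import math


def approx_gflops(n, seconds):
    """Approximate rate in Gflop/s using the leading-order LU count 2/3 n^3."""
    return (2.0 / 3.0) * n ** 3 / seconds / 1e9


def problem_size_for_memory(bytes_per_process, nprocs, fraction=0.8):
    """Largest n such that an n x n double-precision matrix fits in the
    given fraction of the aggregate memory (8 bytes per entry)."""
    total_bytes = fraction * bytes_per_process * nprocs
    return int(math.sqrt(total_bytes / 8))


# With a fixed amount of memory per process, n grows like the square root
# of the process count. The 2 GiB figure is a hypothetical example.
for nprocs in (1, 4, 16):
    print(nprocs, problem_size_for_memory(2 * 1024 ** 3, nprocs))
```

Holding the per-process memory fixed while adding processes is the regime in which the scalability statement above is made: the matrix order grows with the square root of the process count.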

The HPL software package requires that an implementation of the Message Passing Interface (MPI), compliant with version 1.1, be available on your system. An implementation of either the Basic Linear Algebra Subprograms (BLAS) or the Vector Signal Image Processing Library (VSIPL) is also needed. Machine-specific as well as generic implementations of MPI, the BLAS and VSIPL are available for a large variety of systems.

Acknowledgements: This work was supported in part by a grant from the Department of Energy's Lawrence Livermore National Laboratory and Los Alamos National Laboratory as part of the ASCI Projects contract numbers B503962 and 12187-001-00 4R.


Innovative Computing Laboratory
last revised December 2, 2018

#########################################################################

file    hpl-2.3.tar.gz
for     HPL 2.3 - A Portable Implementation of the High-Performance Linpack
,       Benchmark for Distributed-Memory Computers 
by      Antoine Petitet, Clint Whaley, Jack Dongarra, Andy Cleary, Piotr Luszczek
Updated: December 2, 2018

#########################################################################

file    hpl-2.2.tar.gz
for     HPL 2.2 - A Portable Implementation of the High-Performance Linpack
,       Benchmark for Distributed-Memory Computers 
by      Antoine Petitet, Clint Whaley, Jack Dongarra, Andy Cleary, Piotr Luszczek
Updated: February 24, 2016

#########################################################################

file    hpl-2.1.tar.gz
for     HPL 2.1 - A Portable Implementation of the High-Performance Linpack
,       Benchmark for Distributed-Memory Computers 
by      Antoine Petitet, Clint Whaley, Jack Dongarra, Andy Cleary, Piotr Luszczek
Updated: October 26, 2012

#########################################################################

file    hpl-2.0.tar.gz
for     HPL 2.0 - A Portable Implementation of the High-Performance Linpack
,       Benchmark for Distributed-Memory Computers 
by      Antoine Petitet, Clint Whaley, Jack Dongarra, Andy Cleary
Updated: September 10, 2008

#########################################################################

file    hpl.tgz
for     HPL 1.0a - A Portable Implementation of the High-Performance Linpack
,       Benchmark for Distributed-Memory Computers 
by      Antoine Petitet, Clint Whaley, Jack Dongarra, Andy Cleary
Updated: January 20, 2004


#########################################################################

file    hpl_qs22-2008-11-30.patch
for     Implementation of the High-Performance Linpack benchmark for IBM
,       QS22 systems with PowerXCell 8i processors. The file is a patch
,       for HPL 1.0a.
by      IBM

file    IBM_LICENSE.TXT
for     IBM Copyright notice for QS22 HPL
by      IBM

file    IBM_README.txt
for     README for IBM QS22 HPL
by      IBM
Updated: November 30, 2008


#########################################################################