Name
HPL_equil Equilibrate U and forward the column panel L.
Synopsis
#include <hpl.h>
void
HPL_equil(
HPL_T_panel *
PBCST
,
int *
IFLAG
,
HPL_T_panel *
PANEL
,
const enum HPL_TRANS
TRANS
,
const int
N
,
double *
U
,
const int
LDU
,
int *
IPLEN
,
const int *
IPMAP
,
const int *
IPMAPM1
,
int *
IWORK
);
Description
HPL_equil
equilibrates the local pieces of U, so that on exit to
this function, pieces of U contained in every process row are of the
same size. This phase makes the rolling phase optimal. In addition,
this function probes for the column panel L and forwards it when
possible.
Arguments
PBCST (local input/output) HPL_T_panel *
On entry, PBCST points to the data structure containing the
panel (to be broadcast) information.
IFLAG (local intput/output) int *
On entry, IFLAG indicates whether or not the broadcast has
already been completed. If not, probing will occur, and the
outcome will be contained in IFLAG on exit.
PANEL (local input/output) HPL_T_panel *
On entry, PANEL points to the data structure containing the
panel (to be equilibrated) information.
TRANS (global input) const enum HPL_TRANS
On entry, TRANS specifies whether U is stored in transposed
or non-transposed form.
N (local input) const int
On entry, N specifies the number of rows or columns of U. N
must be at least 0.
U (local input/output) double *
On entry, U is an array of dimension (LDU,*) containing the
local pieces of U in each process row.
LDU (local input) const int
On entry, LDU specifies the local leading dimension of U. LDU
should be at least MAX(1,IPLEN[nprow]) when U is stored in
non-transposed form, and MAX(1,N) otherwise.
IPLEN (global input) int *
On entry, IPLEN is an array of dimension NPROW+1. This array
is such that IPLEN[i+1] - IPLEN[i] is the number of rows of U
in process IPMAP[i].
IPMAP (global input) const int *
On entry, IPMAP is an array of dimension NPROW. This array
contains the logarithmic mapping of the processes. In other
words, IPMAP[myrow] is the absolute coordinate of the sorted
process.
IPMAPM1 (global input) const int *
On entry, IPMAPM1 is an array of dimension NPROW. This array
contains the inverse of the logarithmic mapping contained in
IPMAP: For i in [0.. NPROCS) IPMAPM1[IPMAP[i]] = i.
IWORK (workspace) int *
On entry, IWORK is a workarray of dimension NPROW+1.
See Also
HPL_pdlaswp01N,
HPL_pdlaswp01T