(2)编译安装GotoBLAS
编辑vi Makefile.rule
#BINARY64 = 1 ==>> BINARY64 = 1
在GotoBLAS目录下,执行:make
进exports目录,生成so文件
cd exports
make so
拷贝xerbla.o到GotoBLAS目录
cp level1/others/xerbla.o ./
在HPL目录下,在setup文件中选出Make文件,我的选择是Make.Linux_PII_CBLAS文件,copy在HPL的目录下,
重命名为Make.Linux
ARCH= Linux_PII_CBLAS==>>ARCH= Linux
TOPdir= $(HOME)/hpl ==>> TOPdir= /opt/hpl
MPdir= /usr/local/mpi==>> MPdir= /opt/iba
MPlib= $(MPdir)/lib/libmpich.a==>> MPlib= $(MPdir)/lib64/ libmpich.a
LAdir= $(HOME)/netlib/ARCHIVES/Linux_PII ==>>LAdir= /opt/GotoBLAS
LAlib=$(LAdir)/libcblas.a $(LAdir)/libatlas.a ==>> LAlib=$(LAdir)/libgoto_prescott-r1.26.so $(LAdir)/xerbla.o
HPL_OPTS= -DHPL_CALL_CBLAS ==>> HPL_OPTS=
CC=/usr/bin/gcc ==>> CC=/opt/iba/bin/mpicc
LINKER=/usr/bin/g77 ==>> LINKER=/opt/iba/bin/mpif77
make arch=Linux
完成后在/opt/iba/bin/Linux目录下会出现HPL.dat和xhpl两个文件,表示成功
HPLinpack 2.0 -- High-Performance Linpack benchmark -- September 10, 2008
Written by A. Petitet and R. Clint Whaley, Innovative Computing Laboratory, UTK
Modified by Piotr Luszczek, Innovative Computing Laboratory, UTK
Modified by Julien Langou, University of Colorado Denver
================================================================================
T/V : Wall time / encoded variant.
N : The order of the coefficient matrix A.
NB : The partitioning blocking factor.
P : The number of process rows.
Q : The number of process columns.
Time : Time in seconds to solve the linear system.
Gflops : Rate of execution for solving the linear system.
NB : 1 2 3 4
PMAP : Row-major process mapping
P : 2 1 4
Q : 2 4 1
PFACT : Left Crout Right
NBMIN : 2 4
NDIV : 2
RFACT : Left Crout Right
BCAST : 1ring
DEPTH : 0
SWAP : Mix (threshold = 64)
L1 : transposed form
U : transposed form
EQUIL : yes
ALIGN : 8 double precision words
- The following scaled residual check will be computed:
||Ax-b||_oo / ( eps * ( || x ||_oo * || A ||_oo + || b ||_oo ) * N )
- The relative machine precision (eps) is taken to be 1.110223e-16
- Computational tests pass if scaled residuals are less than 16.0
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR00L2L2 29 1 2 2 0.05 3.340e-04
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0188218 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR00L2L4 29 1 2 2 0.00 1.244e-02
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0188218 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR00L2C2 29 1 2 2 0.00 1.262e-02
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0188218 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR00L2C4 29 1 2 2 0.00 1.276e-02
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0188218 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR00L2R2 29 1 2 2 0.00 1.301e-02
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0188218 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR00L2R4 29 1 2 2 0.00 1.301e-02
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0188218 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR00C2L2 29 1 2 2 0.00 1.297e-02
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0188218 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR00C2L4 29 1 2 2 0.00 1.297e-02
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0188218 ...... PASSED
================================================================================
T/V N NB P Q Time Gflops
--------------------------------------------------------------------------------
WR00C2C2 29 1 2 2 0.00 1.318e-02
--------------------------------------------------------------------------------
||Ax-b||_oo/(eps*(||A||_oo*||x||_oo+||b||_oo)*N)= 0.0188218 ...... PASSED
================================================================================