Shared- and distributed-memory parallelism via OpenMP and MPI has become more or less standard in scientific codes as a way to use the computational power of all available cores. In recent years, however, the theoretical peak performance of CPUs has also been rising thanks to vector processing units that perform simultaneous computations on vectors of data. This concept, known as SIMD (single instruction, multiple data), is becoming increasingly important, yet it is still largely neglected. This paper presents an intra-node optimized code that uses the SIMD registers to solve boundary integral equations, together with numerical experiments on two modern Intel processors, namely the Xeon Phi 7250 and the Xeon 8160.
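
As a rough illustration of the SIMD concept described above, the following minimal sketch shows a quadrature-style reduction loop annotated for vectorization with the OpenMP `simd` construct. The function name `weighted_inverse_sum` and its arguments are hypothetical and are not taken from the paper's code; the loop only mimics the general shape of summing quadrature weights against a singular kernel.

```c
#include <stddef.h>

/* Hypothetical kernel (illustration only): returns sum_i w[i] / d[i],
 * e.g. quadrature weights w divided by point distances d.
 * Assumes every d[i] is nonzero. */
double weighted_inverse_sum(const double *w, const double *d, size_t n)
{
    double sum = 0.0;

    /* Ask the compiler to process several iterations per vector
     * instruction and to combine the partial sums at the end.
     * Compilers without OpenMP support simply ignore the pragma. */
    #pragma omp simd reduction(+:sum)
    for (size_t i = 0; i < n; ++i) {
        sum += w[i] / d[i];
    }
    return sum;
}
```

With a compiler that supports the construct (for example via an OpenMP or OpenMP-SIMD flag), each vector instruction can handle as many `double` values as fit in one SIMD register. The width and the resulting speedup depend on the target processor and are not claimed here.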
