The anomalous (fractional) diffusion has a wide application in modeling many physical processes including superconductivity phenomena, protein diffusion within cells, diffusion through porous media, etc. The Fractional Laplacian is a non-local operator. Thus, after the discretization we get a dense system of linear equations. Solving it with traditional Gaussian elimination results in computational complexity of O(n3).

The hierarchical compression methods utilize the structure of the matrix A to compress it into a sparser approximation H. The Hierarchical Semi-Separable (HSS) compression is a derivative method that approximates the off-diagonal blocks as Ai j,ijUiBijVj. If the approximation of Aij,i≠j is low-rank, the generator matrices Ui, Bij and Vj are much smaller than the non-compressed block Aij,i≠j. The diagonal blocks Aii can be compressed in a similar way and so on recursively. The computational complexity is O(r2n), where r is the maximum off-diagonal rank and, ideally, r approaches O(log n). However, the performance heavily depends on the ordering of the unknowns. In this work we analyze and compare the use of Recursive Bisection and Nested Dissection on the initial mesh to produce a more suitable ordering. Recursive bisection uses a divide and conquer approach to recursively divide the mesh in two and then each halve in two again, and so on recursively. Nested Dissection also divides the mesh in two recursively, but it does so along a set of separator vortices, which form a third part in each level of dissection.

We assess and compare the use of Recursive Bisection and Nested Dissection reorderings for improving the performance and accuracy of a parallel HSS solver from the STRUctured Matrix PACKage (STRUMPACK) for a fractional diffusion problem modeled with the Ritz potential. We also compare with results from the original and three other reorderings, as well as using a direct Gaussian elimination solver from Intel®’s Math Kernel Library (MKL).

1.
S.
Chandrasekaran
,
P.
Dewilde
,
M.
Gu
, and
N.
Somasunderam
(
2010
)
On the numerical rank of the off-diagonal blocks of Schur complements of discretized elliptic PDEs
,
SIAM Journal on Matrix Analysis and Applications
31
,
2261
2290
, .
2.
B.
Engquist
and
L.
Ying
(
2011
)
Sweeping preconditioner for the helmholtz equation: Hierarchical matrix representation
,
Communications on Pure and Applied Mathematics
64
,
697
735
, https://onlinelibrary.wiley.com/doi/pdf/10.1002/cpa.20358.
3.
G.
Acosta
and
J. P.
Borthagaray
(
2017
)
A fractional laplace equation: Regularity of solutions and finite element approximations
,
SIAM Journal on Numerical Analysis 
55
,
472
495
, .
4.
G.
Acosta
,
F. M.
Bersetche
, and
J. P.
Borthagaray
(
2017
)
A short fe implementation for a 2d homogeneous dirichlet problem of a fractional laplacian
,
Computers & Mathematics with Applications
74
,
784
816
.
5.
S.
Harizanov
,
R.
Lazarov
,
S.
Margenov
,
P.
Marinov
, and
J.
Pasciak
(
2020
),
Analysis of numerical methods for spectral fractional elliptic equations based on the best uniform rational approximation
,
Journal of Computational Physics
408
,
109285
.
6.
W.
Hackbusch
(
1999
)
A sparse matrix arithmetic based on H-matrices. Part I: Introduction to -matrices
,
Computing
62
,
89
108
.
7.
P. G.
Martinsson
(
2011
)
A fast randomized algorithm for computing a hierarchically semiseparable representation of a matrix
,
SIAM Journal on Matrix Analysis and Applications 
32
,
1251
1274
, .
8.
F.-H.
Rouet
,
X. S.
Li
,
P.
Ghysels
, and
A.
Napov
(
2016
)
A distributed-memory package for dense hierarchically semi-separable matrix computations using randomization
,
ACM Trans. Math. Softw.
42
, .
9.
The STRUMPACK project’s homepage is
https://portal.nersc.gov/project/sparse/strumpack/.
10.
S.
Chandrasekaran
,
M.
Gu
, and
T.
Pals
(
2006
)
A fast ulv decomposition solver for hierarchically semiseparable representations
,
SIAM Journal on Matrix Analysis and Applications
28
,
603
622
, .
11.
R. J.
Lipton
,
D. J.
Rose
, and
R. E.
Tarjan
(
1979
)
Generalized nested dissection
,
SIAM Journal on Numerical Analysis
16
,
346
358
, .
12.
A.
Napov
and
X. S.
Li
(
2016
)
An algebraic multifrontal preconditioner that exploits the low-rank property
,
Numerical Linear Algebra with Applications
23
,
61
82
, https://onlinelibrary.wiley.com/doi/pdf/10.1002/nla.2006.
13.
S.
Wang
,
X. S.
Li
,
F.-H.
Rouet
,
J.
Xia
, and
M. V.
De Hoop
(
2016
)
A parallel geometric multifrontal solver using hierarchically semiseparable structure
,
ACM Trans. Math. Softw.
42
, .
14.
P.
Amestoy
,
C.
Ashcraft
,
O.
Boiteau
,
A.
Buttari
,
J.-Y.
L’Excellent
, and
C.
Weisbecker
(
2015
)
Improving multifrontal methods by means of block low-rank representations
,
SIAM Journal on Scientific Computing
37
,
A1451
A1474
, .
15.
H. D.
Simon
and
S.-H.
Teng
(
1997
)
How good is recursive bisection?
SIAM Journal on Scientific Computing
18
,
1436
1445
, .
16.
E.
Rebrova
,
G.
Chávez
,
Y.
Liu
,
P.
Ghysels
, and
X. S.
Li
, “
A study of clustering techniques and hierarchical matrix formats for kernel ridge regression
,” in
2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
(
2018
) pp.
883
892
.
17.
More information about AVITOHOL could be found on http://www.hpc.acad.bg/avitohol/.
18.
D.
Slavchev
, “On the impact of reordering in a hierarchical semi-separable compression solver for fractional diffusion problems,” in
Large-Scale Scientific Computing
, edited by
I.
Lirkov
and
S.
Margenov
(
Springer International Publishing
,
Cham
,
2020
) pp.
373
381
.
19.
D.
Slavchev
, “Performance analysis of hierarchical semi-separable compression solver for fractional diffusion problems,” in
Advanced Computing in Industrial Mathematics
(
Springer International Publishing
,
Cham
). [in press]
20.
J. R.
Gilbert
,
Y.
Li
, and
S.-H.
Teng
(
2020
)
Mesh partitioning toolbox – meshpart
, .
This content is only available via PDF.
You do not currently have access to this content.