| |
E. Agullo, A. Buttari, A. Guermouche and F. Lopez, Implementing multifrontal sparse solvers for multicore architectures with sequential task flow runtime systems, ACM Trans. Math. Softw., 43 (2016), 13: 1-13: 22,
doi: 10.1145/2898348.
|
| |
E. Agullo, J. Demmel, J. Dongarra, B. Hadri, J. Kurzak, J. Langou, H. Ltaief, P. Luszczek and S. Tomov, Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects, Journal of Physics: Conference Series, 180 (2009), 012037, http://stacks.iop.org/1742-6596/180/i=1/a=012037.
doi: 10.1088/1742-6596/180/1/012037.
|
| |
P. R. Amestoy, T. A. Davis and I. S. Duff, An approximate minimum degree ordering algorithm, SIAM J. Matrix Anal. Appl., 17 (1996), 886-905,
doi: 10.1137/S0895479894278952.
|
| |
P. R. Amestoy, T. A. Davis and I. S. Duff, Algorithm 837: Amd, an approximate minimum degree ordering algorithm, ACM Trans. Math. Softw., 30 (2004), 381-388,
doi: 10.1145/1024074.1024081.
|
| |
C. Augonnet, S. Thibault, R. Namyst and P. -A. Wacrenier, Starpu: a unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, 23 (2011), 187-198,
doi: 10.1007/978-3-642-03869-3_80.
|
| |
G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, A. Haidar, T. Hérault, J. Kurzak, J. Langou, P. Lemarinier, H. Ltaief, P. Luszczek, A. Yarkhan and J. J. Dongarra, Distibuted dense numerical linear algebra algorithms on massively parallel architectures: DPLASMA, in Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW'11), PDSEC 2011, Anchorage, United States, (2011), 1432-1441, https://hal.inria.fr/hal-00809680.
|
| |
G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, T. Hérault and J. J. Dongarra, Parsec: Exploiting heterogeneity to enhance scalability, Computing in Science and Engineering, 15 (2013), 36-45,
doi: 10.1109/MCSE.2013.98.
|
| |
A. Buttari, Fine-grained multithreading for the multifrontal QR factorization of sparse matrices, SIAM Journal on Scientific Computing, 35 (2013), C323-C345,
doi: 10.1137/110846427.
|
| |
M. Cosnard and M. Loi, Automatic task graph generation techniques, in System Sciences, 1995. Proceedings of the Twenty-Eighth Hawaii International Conference on, 2 (1995), 113-122.
doi: 10.1109/HICSS.1995.375471.
|
| |
T. A. Davis
and Y. Hu
, The university of Florida sparse matrix collection, ACM Trans. Math. Softw., 38 (2011)
, 1:1-1:25.
doi: 10.1145/2049662.2049663.
|
| |
G. A. Geist
and E. Ng
, Task scheduling for parallel sparse cholesky factorization, Int. J. Parallel Program., 18 (1990)
, 291-314.
doi: 10.1007/BF01407861.
|
| |
A. George
and J. W. H. Liu
, An automatic nested dissection algorithm for irregular finite element problems, SINUM, 15 (1978)
, 1053-1069.
doi: 10.1137/0715069.
|
| |
P. Hénon
, P. Ramet
and J. Roman
, PaStiX: A high-performance parallel direct solver For sparse symmetric definite systems, Parallel Computing, 28 (2002)
, 301-321.
doi: 10.1016/S0167-8191(01)00141-7.
|
| |
J. D. Hogg
, J. K. Reid
and J. A. Scott
, Design of a multicore sparse cholesky factorization using dags, SIAM Journal on Scientific Computing, 32 (2010)
, 3627-3649.
doi: 10.1137/090757216.
|
| |
J. D. Hogg and J. A. Scott, A modern analyse phase for sparse tree-based direct methods, Technical Report RAL-TR-2010-031, STFC Rutherford Appleton Lab., 2010, https://epubs.stfc.ac.uk/work/54246.
|
| |
F. D. Igual
, E. Chan
, E. S. Quintana-Ortí
, G. Quintana-Ortí
, R. A. van de Geijn
and F. G. V. Zee
, The flame approach: From dense linear algebra algorithms to high-performance multi-accelerator implementations, J. Parallel Distrib. Comput., 72 (2012)
, 1134-1143.
doi: 10.1016/j.jpdc.2011.10.014.
|
| |
G. Karypis and V. Kumar, A fast and high quality multilevel scheme for partitioning irregular graphs,
SIAM J. Sci. Comput., 20 (1998), 359-392,
doi: 10.1137/S1064827595287997.
|
| |
J. W. H. Liu, Modification of the minimum-degree algorithm by multiple elimination, ACM Trans. Math. Softw., 11 (1985), 141-153,
doi: 10.1145/214392.214398.
|
| |
F. Lopez, Task-based Multifrontal QR Solver for Heterogeneous Architectures, Thèse de doctorat, Université Paul Sabatier, Toulouse, France, 2015.
|
| |
W. F. Tinney
and J. W. Walker
, Direct solutions of sparse network equations by optimally ordered triangular factorization, Proceedings of the IEEE, 55 (1967)
, 1801-1809.
doi: 10.1109/PROC.1967.6011.
|