This link will publish NLAFET technical reports and white papers that reflect progress and results of the project.
Working Notes
- WN1 – Distributed One-Stage Hessenberg-Triangular Reduction with Wavefront Scheduling, Björn Adlerborn, Lars Karlsson, and Bo Kågström
- WN2 – PDHGEQZ User Guide, Björn Adlerborn, Bo Kågström, and Daniel Kressner
- WN3 – Low rank approximation of a sparse matrix based on LU factorization with column and row tournament pivoting, Laura Grigori, Sebastien Cayrols, and James W. Demmel
- WN4 – Workshop on Batched, Reproducible, and Reduced Precision BLAS, Sven Hammarling
- WN5 – A Comparison of Potential Interfaces for Batched BLAS Computations, Samuel D. Relton, Pedro Valero-Lara, and Mawussi Zounon
- WN6 – A new sparse LDLT solver using a posteriori threshold pivoting, Jonathan Hogg
- WN7 – Experiments with sparse Cholesky using a sequential task-flow implementation, Iain Duff, Jonathan Hogg, and Florent Lopez
- WN8 – Evaluation of the Tunability of a New NUMA-Aware Hessenberg Reduction Algorithm, Mahmoud Eljammaly, Lars Karlsson, and Bo Kågström
- WN9 – Robust solution of triangular linear systems, Carl Christian Kjelgaard Mikkelsen and Lars Karlsson
- WN10 – Towards Highly Parallel and Compute-Bound Computation of Eigenvectors for Matrices in Schur Form, Björn Adlerborn, Carl Christian Kjelgaard Mikkelsen, Lars Karlsson, and Bo Kågström
- WN11 – Task-Based Parallel Algorithms for Eigenvalue Reordering of Matrices in Real Schur Form, Mirko Myllykoski, Carl Christian Kjelgaard Mikkelsen, Lars Karlsson, and Bo Kågström
- WN12 – Second Workshop on Batched, Reproducible, and Reduced Precision BLAS, Sven Hammarling
- WN13 – Reducing the communication and computational costs of Enlarged Krylov subspaces Conjugate Gradient, Laura Grigori and Olivier Tissot
- WN14 – Experiments with sparse Cholesky using a parametrized task graph implementation, Iain Duff and Florent Lopez
- WN15 – PLASMA 17 Functionality Report Parallel BLAS and Norms, Linear Systems and Least Squares, Mixed Precision and Matrix Inversion, Maksims Abalenkovs, Jack Dongarra, Mark Gates, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Mawussi Zounon, Samuel Relton, Jakub Sistek, David Stevens, Ichitaro Yamazaki, Asim Yar Khan
- WN16 – PLASMA 17 Performance Report Linear Systems and Least Squares Haswell, Knights Landing, POWER8, Maksims Abalenkovs, Negin Bagherpour, Jack Dongarra, Mark Gates, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Samuel Relton, Jakub Sistek, David Stevens, Panruo Wu, Ichitaro Yamazaki, Asim Yar Khan, Mawussi Zounon
- WN17 – Sparse direct solution on parallel computers, Iain Duff, Florent Lopez, and Stojce Nakov
- WN18 – An Auto-Tuning Framework for a NUMA-Aware Hessenberg Reduction Algorithm, Mahmoud Eljammaly, Lars Karlsson, and Bo Kågström
- WN19 – Solving linear equations with messenger-field and conjugate gradients techniques – an application to CMB data analysis, Jan Papez, Laura Grigori, Radoslav Stompor
- WN20 – Parallelization of the solve phase in a task-based Cholesky solver using a sequential task flow model, Sébastien Cayrols, Iain Duff and Florent Lopez
- WN21 – A new sparse symmetric indefinite solver using A Posteriori Threshold Pivoting, Iain Duff, Jonathan Hogg and Florent Lopez
- WN22 – Design and implementation of a parallel Markowitz threshold algorithm, Timothy Davis, Iain S. Duff, and Stojce Nakov