Submitted by webmaster on
Title | Batched Generation of Incomplete Sparse Approximate Inverses on GPUs |
Publication Type | Conference Proceedings |
Year of Publication | 2016 |
Authors | Anzt, H., E. Chow, T. Huckle, and J. Dongarra |
Conference Name | Proceedings of the 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems |
Series Title | ScalA '16 |
Pagination | 49–56 |
Date Published | 2016-11 |
ISBN Number | 978-1-5090-5222-6 |
Abstract | Incomplete Sparse Approximate Inverses (ISAI) have recently been shown to be an attractive alternative to exact sparse triangular solves in the context of incomplete factorization preconditioning. In this paper we propose a batched GPU-kernel for the efficient generation of ISAI matrices. Utilizing only thread-local memory allows for computing the ISAI matrix with very small memory footprint. We demonstrate that this strategy is faster than the existing strategy for generating ISAI matrices, and use a large number of test matrices to assess the algorithm's efficiency in an iterative solver setting. |
DOI | 10.1109/ScalA.2016.11 |