Mixed-Precision Solution of Linear Systems Using Accelerator-Based Computing

Submitted by scrawford on Mon, 05/11/2020 - 11:51

Title	Mixed-Precision Solution of Linear Systems Using Accelerator-Based Computing
Publication Type	Tech Report
Year of Publication	2020
Authors	Haidar, A., H. Bayraktar, S. Tomov, J. Dongarra, and N. J. Higham
Technical Report Series Title	Innovative Computing Laboratory Technical Report
Number	ICL-UT-20-05
Date Published	2020-05
Institution	University of Tennessee
Abstract	Double-precision floating-point arithmetic (FP64) has been the de facto standard for engineering and scientific simulations for several decades. Problem complexity and the sheer volume of data coming from various instruments and sensors motivate researchers to mix and match various approaches to optimize compute resources, including different levels of floating-point precision. In recent years, machine learning has motivated hardware support for half-precision floating-point arithmetic. A primary challenge in high-performance computing is to leverage reduced- and mixed-precision hardware. We show how the FP16/FP32 Tensor Cores on NVIDIA GPUs can be exploited to accelerate the solution of linear systems of equations Ax = b without sacrificing numerical stability. We achieve a 4×–5× performance increase and 5× better energy efficiency versus the standard FP64 implementation while maintaining an FP64 level of numerical stability.

Project Tags:

File:

External Publication Flag: