Submitted by scrawford on
Title | Harnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers and Achieve 74 Gflops/Watt on Nvidia V100 |
Publication Type | Poster |
Year of Publication | 2018 |
Authors | Haidar, A., A. Abdelfattah, S. Tomov, and J. Dongarra |
Date Published | 2018-03 |
Event | GPU Technology Conference (GTC), Poster |
Event Location | San Jose, CA |
File:
External Publication Flag: