Title | The Case for Directive Programming for Accelerator Autotuner Optimization |
Publication Type | Tech Report |
Year of Publication | 2017 |
Authors | Fayad, D., J. Kurzak, P. Luszczek, P. Wu, and J. Dongarra |
Technical Report Series Title | Innovative Computing Laboratory Technical Report |
Number | ICL-UT-17-07 |
Date Published | 2017-10 |
Institution | University of Tennessee |
Abstract | In this work, we present the use of compiler pragma directives for parallelizing the autotuning of specialized compute kernels for hardware accelerators. We use a set of directive constructs to parallelize source code that prunes a generated search space, subject to a large number of constraints, for an autotuning infrastructure. To improve performance, we studied optimizations aimed at minimizing the run time. We also studied the parallel load balance and the speedup on four different machines: x86, Xeon Phi, ARMv8, and POWER8. |