|
|
Publications |
| Showing records 1 - 10 of 14 | |
|
A. Haidar, P. Luszczek, J. Kurzak and J. Dongarra. "An Improved Parallel Singular Value Algorithm and Its Implementation for Multicore Hardware,"
International Conference for High Performance Computing, Networking, Storage and Analysis, IEEE-SC 2013,
Denver, CO, November, 2013.
|
|
Azzam Haidar, Mark Gates, Stan Tomov, and Jack Dongarra "Toward a scalable multi-GPU eigensolver via compute-intensive kernels and efficient communication,"
International Conference on Supercomputing,
Eugene, OR, June, 2013.
|
|
|
Weaver, V., Terpstra, D., McCraw, H., Johnson, M., Kasichayanula, K., Ralph, J., Nelson, J., Mucci, P., Mohan, T., Moore, S. "PAPI 5: Measuring Power, Energy, and the Cloud,"
Poster Abstract, 2013 IEEE International Symposium on Performance Analysis of Systems and Software,
Austin, TX, April 21-23, 2013.
|
|
|
Kurzak, J., Luszczek, P., YarKhan, A., Faverge, M., Langou, J., Bouwmeester, H., Dongarra, J. "Multithreading in the PLASMA Library,"
Multi and Many-Core Processing: Architecture, Programming, Algorithms, & Applications,
Ahmed, M., Ammar, R., Rajasekaran, S. eds.
Taylor & Francis,
2013.
|
|
|
McCraw, H., Terpstra, D., Dongarra, J., Davis, K., Musselman R. "Beyond the CPU: Hardware Performance Counter Monitoring on Blue Gene/Q,"
International Supercomputing Conference 2013 (ISC'13), Leipzig, Germany,
J.M. Kunkel, T. Ludwig, and H. Meuer (Eds.): ISC 2013, LNCS 7905, pp. 213--225. Springer, Heidelberg eds.
2013.
|
|
|
YarKhan, A. "Dynamic Task Execution on Shared and Distributed Memory Architectures,"
PhD disseration, Major Advisor: Jack Dongarra,
University of Tennessee,
December, 2012.
|
|
|
Dongarra, J., Ltaief, H., Luszczek, P., Weaver, V. "Energy Footprint of Advanced Dense Numerical Linear Algebra using Tile Algorithms on Multicore Architecture,"
The 2nd International Conference on Cloud and Green Computing (submitted),
Xiangtan, Hunan, China, November, 2012.
|
|
|
Baboulin, M., Becker, D., Bosilca, G., Danalis, A., Dongarra, J. "An efficient distributed randomized solver with application to large dense linear systems,"
ICL Technical Report,
ICL-UT-12-02,
July 11, 2012.
|
|
|
Haidar, A., Ltaief, H., Luszczek, P., Dongarra, J. "A Comprehensive Study of Task Coalescing for Selecting Parallelism Granularity in a Two-Stage Bidiagonal Reduction,"
IPDPS 2012,
Shanghai, China, May, 2012.
|
|
|
Kurzak, J., Luszczek, P., Faverge, M., Dongarra, J. "Programming the LU Factorization for a Multicore System with Accelerators,"
Proceedings of VECPAR'12,
Kobe, Japan, April, 2012.
|
|
| Showing records 1 - 10 of 14 | |
Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.
|
|