GridPac

Publications

Showing records 1 - 10 of 14

A. Haidar, P. Luszczek, J. Kurzak and J. Dongarra. "An Improved Parallel Singular Value Algorithm and Its Implementation for Multicore Hardware," International Conference for High Performance Computing, Networking, Storage and Analysis, IEEE-SC 2013, Denver, CO, November, 2013.
Azzam Haidar, Mark Gates, Stan Tomov, and Jack Dongarra. "Toward a scalable multi-GPU eigensolver via compute-intensive kernels and efficient communication," International Conference on Supercomputing, Eugene, OR, June, 2013.
Weaver, V., Terpstra, D., McCraw, H., Johnson, M., Kasichayanula, K., Ralph, J., Nelson, J., Mucci, P., Mohan, T., Moore, S. "PAPI 5: Measuring Power, Energy, and the Cloud," Poster Abstract, 2013 IEEE International Symposium on Performance Analysis of Systems and Software, Austin, TX, April 21-23, 2013.
Kurzak, J., Luszczek, P., YarKhan, A., Faverge, M., Langou, J., Bouwmeester, H., Dongarra, J. "Multithreading in the PLASMA Library," Multi and Many-Core Processing: Architecture, Programming, Algorithms, & Applications, Ahmed, M., Ammar, R., Rajasekaran, S. eds. Taylor & Francis, 2013.
McCraw, H., Terpstra, D., Dongarra, J., Davis, K., Musselman, R. "Beyond the CPU: Hardware Performance Counter Monitoring on Blue Gene/Q," International Supercomputing Conference 2013 (ISC'13), Leipzig, Germany, J.M. Kunkel, T. Ludwig, and H. Meuer (Eds.): ISC 2013, LNCS 7905, pp. 213-225, Springer, Heidelberg, 2013.
YarKhan, A. "Dynamic Task Execution on Shared and Distributed Memory Architectures," PhD dissertation, Major Advisor: Jack Dongarra, University of Tennessee, December, 2012.
Dongarra, J., Ltaief, H., Luszczek, P., Weaver, V. "Energy Footprint of Advanced Dense Numerical Linear Algebra using Tile Algorithms on Multicore Architecture," The 2nd International Conference on Cloud and Green Computing (submitted), Xiangtan, Hunan, China, November, 2012.
Baboulin, M., Becker, D., Bosilca, G., Danalis, A., Dongarra, J. "An efficient distributed randomized solver with application to large dense linear systems," ICL Technical Report, ICL-UT-12-02, July 11, 2012.
Haidar, A., Ltaief, H., Luszczek, P., Dongarra, J. "A Comprehensive Study of Task Coalescing for Selecting Parallelism Granularity in a Two-Stage Bidiagonal Reduction," IPDPS 2012, Shanghai, China, May, 2012.
Kurzak, J., Luszczek, P., Faverge, M., Dongarra, J. "Programming the LU Factorization for a Multicore System with Accelerators," Proceedings of VECPAR'12, Kobe, Japan, April, 2012.


Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

