Publications

2022

Lindquist, N., M. Gates, P. Luszczek, and J. Dongarra, “Threshold Pivoting for Dense LU Factorization,” ScalAH22: 13th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems , Dallas, Texas, IEEE, November 2022.

(721.77 KB)

2021

Dongarra, J., M. Gates, P. Luszczek, and S. Tomov, “Translational process: Mathematical software perspective,” Journal of Computational Science, vol. 52, pp. 101216, 2021.

2020

Dongarra, J., M. Gates, P. Luszczek, and S. Tomov, “Translational Process: Mathematical Software Perspective,” Journal of Computational Science, September 2020.

(752.59 KB)

Dongarra, J., M. Gates, P. Luszczek, and S. Tomov, “Translational Process: Mathematical Software Perspective,” Innovative Computing Laboratory Technical Report, no. ICL-UT-20-11, August 2020.

(752.59 KB)

Krzhizhanovskaya, V., G. Závodszky, M. Lees, J. Dongarra, P. Sloot, S. Brissos, and J. Teixeira, “Twenty Years of Computational Science,” International Conference on Computational Science (ICCS 2020), Amsterdam, Netherlands, June 2020.

(149.66 KB)

2019

Anzt, H., Y. Chen Chen, T. Cojean, J. Dongarra, G. Flegar, P. Nayak, E. S. Quintana-Orti, Y. M. Tsai, and W. Wang, “Towards Continuous Benchmarking,” Platform for Advanced Scientific Computing Conference (PASC 2019), Zurich, Switzerland, ACM Press, June 2019.

(1.51 MB)

Abdelfattah, A., S. Tomov, and J. Dongarra, “Towards Half-Precision Computation for Complex Matrices: A Case Study for Mixed Precision Solvers on GPUs,” ScalA19: 10th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, Denver, CO, IEEE, November 2019.

(523.87 KB)

(3.42 MB)

2018

Dorris, J., A. YarKhan, J. Kurzak, P. Luszczek, and J. Dongarra, “Task Based Cholesky Decomposition on Xeon Phi Architectures using OpenMP,” International Journal of Computational Science and Engineering (IJCSE), vol. 17, no. 3, October 2018.

Abdelfattah, A., A. Haidar, S. Tomov, and J. Dongarra, Tensor Contractions using Optimized Batch GEMM Routines , San Jose, CA, GPU Technology Conference (GTC), Poster, March 2018.

(1.64 MB)

2017

Luszczek, P., J. Kurzak, I. Yamazaki, and J. Dongarra, “Towards Numerical Benchmark for Half-Precision Floating Point Arithmetic,” 2017 IEEE High Performance Extreme Computing Conference (HPEC), Waltham, MA, IEEE, September 2017.

(1.67 MB)

2016

Lopez, M. G., V. Larrea, W. Joubert, O. Hernandez, A. Haidar, S. Tomov, and J. Dongarra, “Towards Achieving Performance Portability Using Directives for Accelerators,” The International Conference for High Performance Computing, Networking, Storage and Analysis (SC'16), Third Workshop on Accelerator Programming Using Directives (WACCPD), Salt Lake City, Utah, Innovative Computing Laboratory, University of Tennessee, November 2016.

(567.02 KB)

2015

Strohmaier, E., H. Meuer, J. Dongarra, and H. D. Simon, “The TOP500 List and Progress in High-Performance Computing,” IEEE Computer, vol. 48, issue 11, pp. 42-49, November 2015.

Baboulin, M., V. Dobrev, J. Dongarra, C. Earl, J. Falcou, A. Haidar, I. Karlin, T. Kolev, I. Masliah, and S. Tomov, Towards a High-Performance Tensor Algebra Package for Accelerators , Gatlinburg, TN, moky Mountains Computational Sciences and Engineering Conference (SMC15), September 2015.

(1.76 MB)

Haidar, A., P. Luszczek, S. Tomov, and J. Dongarra, “Towards Batched Linear Solvers on Accelerated Hardware Platforms,” 8th Workshop on General Purpose Processing Using GPUs (GPGPU 8) co-located with PPOPP 2015, San Francisco, CA, ACM, February 2015.

(403.74 KB)

Anzt, H., J. Dongarra, and E. S. Quintana-Orti, “Tuning Stationary Iterative Solvers for Fault Resilience,” 6th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA15), Austin, TX, ACM, November 2015.

(1.28 MB)

2013

Heroux, M. A., and J. Dongarra, “Toward a New Metric for Ranking High Performance Computing Systems,” SAND2013 - 4744, June 2013.

(225.32 KB)

Haidar, A., M. Gates, S. Tomov, and J. Dongarra, “Toward a scalable multi-GPU eigensolver via compute-intensive kernels and efficient communication,” Proceedings of the 27th ACM International Conference on Supercomputing (ICS '13), Eugene, Oregon, USA, ACM Press, June 2013.

(1.27 MB)

Jia, Y., P. Luszczek, and J. Dongarra, “Transient Error Resilient Hessenberg Reduction on GPU-based Hybrid Architectures,” UT-CS-13-712: University of Tennessee Computer Science Technical Report, June 2013.

(206.42 KB)

Yamazaki, I., T. Dong, R. Solcà, S. Tomov, J. Dongarra, and T. C. Schulthess, “Tridiagonalization of a dense symmetric matrix on multiple GPUs and its application to symmetric eigenvalue problems,” Concurrency and Computation: Practice and Experience, October 2013.

(1.71 MB)

Yamazaki, I., T. Dong, S. Tomov, and J. Dongarra, “Tridiagonalization of a Symmetric Dense Matrix on a GPU Cluster,” The Third International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), May 2013.

2012

Haidar, A., H. Ltaeif, and J. Dongarra, “Toward High Performance Divide and Conquer Eigensolver for Dense Symmetric Matrices,” SIAM Journal on Scientific Computing (Accepted), July 2012.

2011

Haidar, A., H. Ltaeif, and J. Dongarra, “Toward High Performance Divide and Conquer Eigensolver for Dense Symmetric Matrices.,” Submitted to SIAM Journal on Scientific Computing (SISC), 00 2011.

Becker, D., M. Faverge, and J. Dongarra, “Towards a Parallel Tile LDL Factorization for Multicore Architectures,” ICL Technical Report, no. ICL-UT-11-03, Seattle, WA, April 2011.

(425.45 KB)

Luszczek, P., H. Ltaeif, and J. Dongarra, “Two-stage Tridiagonal Reduction for Dense Symmetric Matrices using Tile Algorithms on Multicore Architectures,” IEEE International Parallel and Distributed Processing Symposium (submitted), Anchorage, AK, May 2011.

2010

Hadri, B., E. Agullo, and J. Dongarra, “Tile QR Factorization with Parallel Panel Processing for Multicore Architectures,” 24th IEEE International Parallel and Distributed Processing Symposium (submitted), 00 2010.

(313.98 KB)

Tomov, S., J. Dongarra, and M. Baboulin, “Towards Dense Linear Algebra for Hybrid GPU Accelerated Manycore Systems,” Parallel Computing, vol. 36, no. 5-6, pp. 232-240, 00 2010.

(606.41 KB)

Jagode, H., A. Knuepfer, J. Dongarra, M. Jurenz, M. S. Mueller, and W. E. Nagel, “Trace-based Performance Analysis for the Petascale Simulation Code FLASH,” International Journal of High Performance Computing Applications (to appear), 00 2010.

(887.54 KB)

Du, P., M. Parsons, E. Fuentes, S-L. Shaw, and J. Dongarra, “Tuning Principal Component Analysis for GRASS GIS on Multi-core and GPU Architectures,” FOSS4G 2010, Barcelona, Spain, September 2010.

(1.57 MB)

2009

Hadri, B., H. Ltaeif, E. Agullo, and J. Dongarra, “Tall and Skinny QR Matrix Factorization Using Tile Algorithms on Multicore Architectures,” Innovative Computing Laboratory Technical Report (also LAPACK Working Note 222 and CS Tech Report UT-CS-09-645), no. ICL-UT-09-03, September 2009.

(464.23 KB)

Hadri, B., H. Ltaeif, E. Agullo, and J. Dongarra, “Tile QR Factorization with Parallel Panel Processing for Multicore Architectures,” accepted in 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2010), Atlanta, GA, December 2009.

Hoefler, T., Y-S. Dai, and J. Dongarra, “Towards Efficient MapReduce Using MPI,” Lecture Notes in Computer Science, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 16th European PVM/MPI Users' Group Meeting, vol. 5759, Espoo, Finland, Springer Berlin / Heidelberg, pp. 240-249, 00 2009.

Jagode, H., A. Knuepfer, J. Dongarra, M. Jurenz, M. S. Mueller, and W. E. Nagel, “Trace-based Performance Analysis for the Petascale Simulation Code FLASH,” Innovative Computing Laboratory Technical Report, no. ICL-UT-09-01, April 2009.

(887.54 KB)

Seymour, K., A. YarKhan, and J. Dongarra, “Transparent Cross-Platform Access to Software Services using GridSolve and GridRPC,” in Cloud Computing and Software Services: Theory and Techniques (to appear): CRC Press, 00 2009.

2008

Tomov, S., J. Dongarra, and M. Baboulin, “Towards Dense Linear Algebra for Hybrid GPU Accelerated Manycore Systems,” University of Tennessee Computer Science Technical Report, UT-CS-08-632 (also LAPACK Working Note 210), January 2008.

(606.41 KB)

Dongarra, J., “A Tribute to Gene Golub,” Computing in Science and Engineering: IEEE, pp. 5, January 2008.

2006

Canning, A., J. Dongarra, J. Langou, O. Marques, S. Tomov, C. Voemel, and L-W. Wang, “Towards bulk based preconditioning for quantum dot computations,” IEEE/ACM Proceedings of HPCNano SC06 (to appear), January 2006.

(172.46 KB)

Dongarra, J., G. H. Golub, E. Grosse, C. Moler, and K. Moore, “Twenty-Plus Years of Netlib and NA-Net,” University of Tennessee Computer Science Department Technical Report, UT-CS-04-526, 00 2006.

(62.79 KB)

2005

Vadhiyar, S., G. Fagg, and J. Dongarra, “Towards an Accurate Model for Collective Communications,” ICL Technical Report, no. ICL-UT-05-03, January 2005.

(250.73 KB)

2004

Vadhiyar, S., G. Fagg, and J. Dongarra, “Towards an Accurate Model for Collective Communications,” International Journal of High Performance Applications, Special Issue: Automatic Performance Tuning, vol. 18, no. 1, pp. 159-167, January 2004.

(250.73 KB)

Dongarra, J., “Trends in High Performance Computing,” The Computer Journal, vol. 47, no. 4: The British Computer Society, pp. 399-403, 00 2004.

(455.96 KB)

2002

Kennedy, K., J. Mellor-Crummey, K. Cooper, L. Torczon, F. Berman, A. Chien, D. Angulo, I. Foster, D. Gannon, L. Johnsson, et al., “Toward a Framework for Preparing and Executing Adaptive Grid Programs,” International Parallel and Distributed Processing Symposium: IPDPS 2002 Workshops, Fort Lauderdale, FL, pp. 0171, April 2002.

(64.5 KB)

Hiroyasu, T., M. Miki, H. Shimosaka, M. Sano, Y. Tanimura, Y. Mimura, S. Yoshimura, and J. Dongarra, “Truss Structural Optimization Using NetSolve System,” Meeting of the Japan Society of Mechanical Engineers, Kyoto University, Kyoto, Japan, October 2002.

(450.65 KB)

2001

Kennedy, K., B. Broom, K. Cooper, J. Dongarra, R. Fowler, D. Gannon, L. Johnsson, J. Mellor-Crummey, and L. Torczon, “Telescoping Languages: A Strategy for Automatic Generation of Scientific Problem-Solving Systems from Annotated Libraries,” Journal of Parallel and Distributed Computing, vol. 61, no. 12, pp. 1803-1826, December 2001.

(386.37 KB)

2000

Dongarra, J., H. Meuer, and E. Strohmaier, “Top500 Supercomputer Sites (15th edition),” University of Tennessee Computer Science Department Technical Report, no. UT-CS-00-442, June 2000.

(278.88 KB)

1999

Calland, P-Y., J. Dongarra, and Y. Robert, “Tiling on Systems with Communication/Computation Overlap,” Concurrency: Practice and Experience, vol. 11, no. 3, pp. 139-153, January 1999.

(286.14 KB)

Dongarra, J., H. Meuer, and E. Strohmaier, “Top500 Supercomputer Sites (13th edition),” University of Tennessee Computer Science Department Technical Report, no. UT-CS-99-425, June 1999.

(278.51 KB)

Dongarra, J., H. Meuer, and E. Strohmaier, “Top500 Supercomputer Sites (14th edition),” University of Tennessee Computer Science Department Technical Report, no. UT-CS-99-434, November 1999.

(281.81 KB)