Publications

Abdelfattah, A., H. Anzt, J. Dongarra, M. Gates, A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, , and A. YarKhan, “Linear Algebra Software for Large-Scale Accelerated Multicore Computing,” Acta Numerica, vol. 25, pp. 1-160, May 2016.

Gustavson, F. G., J. Wasniewski, and J. Dongarra, “Level-3 Cholesky Kernel Subroutine of a Fully Portable High Performance Minimal Storage Hybrid Format Cholesky Algorithm,” ACM TOMS (submitted), also LAPACK Working Note (LAWN) 211, 00 2010.

(190.2 KB)

Gustavson, F. G., J. Wasniewski, J. Dongarra, J. Herrero, and J. Langou, “Level-3 Cholesky Factorization Routines Improve Performance of Many Cholesky Algorithms,” ACM Transactions on Mathematical Software (TOMS), vol. 39, issue 2, February 2013.

(439.46 KB)

Anderson, E., Z. Bai, C. Bischof, S. Blackford, J. Demmel, J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, et al., “LAPACK Users' Guide, 3rd ed.,” Philadelphia: Society for Industrial and Applied Mathematics, January 1999.

Demmel, J., and J. Dongarra, LAPACK 2005 Prospectus: Reliable and Scalable Software for Linear Algebra Computations on High End Computers : LAPACK Working Note 164, January 2005.

(172.59 KB)

Ma, T., G. Bosilca, A. Bouteiller, and J. Dongarra, “Kernel-assisted and topology-aware MPI collective communications on multi-core/many-core platforms,” Journal of Parallel and Distributed Computing, vol. 73, issue 7, pp. 1000-1010, July 2013.

(1.4 MB)

Vetter, J., R. Glassbrook, J. Dongarra, K. Schwan, B. Loftis, S. McNally, J. Meredith, J. Rogers, P. Roth, K. Spafford, et al., “Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community,” IEEE Computing in Science & Engineering, vol. 13, issue 5, pp. 90-95, August 2011.

(932.57 KB)

Doolin, D., J. Dongarra, and K. Seymour, “JLAPACK - Compiling LAPACK Fortran to Java,” Scientific Programming, vol. 7, no. 2, pp. 111-138, October 2002.

(307.46 KB)

Dongarra, J., V. Eijkhout, and H. van der Vorst, “Iterative Solver Benchmark (LAPACK Working Note 152),” Scientific Programming, vol. 9, no. 4, pp. 223-231, 00 2001.

(168.05 KB)

Dongarra, J., V. Eijkhout, and H. van der Vorst, “An Iterative Solver Benchmark,” Scientific Programming (to appear), 00 2002.

(142.67 KB)

Jagode, H., S. Moore, D. Terpstra, J. Dongarra, A. Knuepfer, M. Jurenz, M. S. Mueller, and W. E. Nagel, “I/O Performance Analysis for the Petascale Simulation Code FLASH,” ISC'09, Hamburg, Germany, June 2009.

(88.88 KB)

Haidar, A., H. Jagode, P. Vaccaro, A. YarKhan, S. Tomov, and J. Dongarra, “Investigating Power Capping toward Energy-Efficient Scientific Applications,” Concurrency Computation: Practice and Experience, vol. 2018, issue e4485, pp. 1-14, April 2018.

(1.2 MB)

Luszczek, P., J. Dongarra, D. Koester, R. Rabenseifner, B. Lucas, J. Kepner, J. McCalpin, D. Bailey, and D. Takahashi, Introduction to the HPC Challenge Benchmark Suite , March 2005.

(124.86 KB)

Dongarra, J., P. Beckman, T. Moore, P. Aerts, G. Aloisio, J-C. Andre, D. Barkai, J-Y. Berthou, T. Boku, B. Braunschweig, et al., “The International Exascale Software Project Roadmap,” International Journal of High Performance Computing, vol. 25, no. 1, pp. 3-60, January 2011.

(719.74 KB)

Dongarra, J., P. Beckman, P. Aerts, F. Cappello, T. Lippert, S. Matsuoka, P. Messina, T. Moore, R. Stevens, A. Trefethen, et al., “The International Exascale Software Project: A Call to Cooperative Action by the Global High Performance Community,” International Journal of High Performance Computing Applications (to appear), July 2009.

(203.04 KB)

Hardt, M., K. Seymour, J. Dongarra, M. Zapf, and N. Ruiter, “Interactive Grid-Access Using Gridsolve and Giggle,” Computing and Informatics, vol. 27, no. 2, pp. 233-248,ISSN1335-9150, 00 2008.

(533.4 KB)

Arnold, D., H. Casanova, and J. Dongarra, “Innovations of the NetSolve Grid Computing System,” Concurrency: Practice and Experience, vol. 14, no. 13-15, pp. 1457-1479, January 2002.

(311.31 KB)

Anzt, H., T. Huckle, J. Bräckle, and J. Dongarra, “Incomplete Sparse Approximate Inverses for Parallel Preconditioning,” Parallel Computing, vol. 71, pp. 1–22, January 2018.

(1.24 MB)

Jeannot, E., K. Seymour, A. YarKhan, and J. Dongarra, “Improved Runtime and Transfer Time Prediction Mechanisms in a Network Enabled Servers Middleware,” Parallel Processing Letters, vol. 17, no. 1, pp. 47-59, March 2007.

(718.4 KB)

Jeannot, E., K. Seymour, A. YarKhan, and J. Dongarra, “Improved Runtime and Transfer Time Prediction Mechanisms in a Network Enabled Server,” Parallel Processing Letters, vol. 17, no. 1, pp. 47-59, March 2006.

(718.4 KB)

Nath, R., S. Tomov, and J. Dongarra, “An Improved MAGMA GEMM for Fermi GPUs,” International Journal of High Performance Computing, vol. 24, no. 4, pp. 511-515, 00 2010.

Kurzak, J., and J. Dongarra, “Implementing Linear Algebra Routines on Multi-Core Processors with Pipelining and a Look Ahead,” University of Tennessee Computer Science Tech Report, UT-CS-06-581, LAPACK Working Note #178, January 2006.

(304.4 KB)

Yamazaki, I., D. Becker, J. Dongarra, A. Druinsky, I.. Peled, S. Toledo, G. Ballard, J. Demmel, and O. Schwartz, “Implementing a Blocked Aasen’s Algorithm with a Dynamic Scheduler on Multicore Architectures,” IPDPS 2013 (submitted), Boston, MA, 00 2013.

(1.22 MB)

Kurzak, J., R. Nath, P. Du, and J. Dongarra, “An Implementation of the Tile QR Factorization for a GPU and Multiple CPUs,” Applied Parallel and Scientific Computing, vol. 7133, pp. 248-257, 00 2012.

(623.5 KB)

Kurzak, J., and J. Dongarra, “Implementation of the Mixed-Precision High Performance LINPACK Benchmark on the CELL Processor,” University of Tennessee Computer Science Tech Report, no. UT-CS-06-580, LAPACK Working Note #177, September 2006.

(506.18 KB)

Kurzak, J., and J. Dongarra, “Implementation of Mixed Precision in Solving Systems of Linear Equations on the Cell Processor,” Concurrency and Computation: Practice and Experience, vol. 19, no. 10, pp. 1371-1385, July 2007.

(453.78 KB)

Keller, R., G. Bosilca, G. Fagg, M. Resch, and J. Dongarra, “Implementation and Usage of the PERUSE-Interface in Open MPI,” Euro PVM/MPI 2006, Bonn, Germany, September 2006.

(310.76 KB)

Kurzak, J., H. Anzt, M. Gates, and J. Dongarra, “Implementation and Tuning of Batched Cholesky Factorization and Solve for NVIDIA GPUs,” IEEE Transactions on Parallel and Distributed Systems, no. 1045-9219, November 2015.

Buttari, A., J. Dongarra, J. Kurzak, J. Langou, P. Luszczek, and S. Tomov, “The Impact of Multicore on Math Software,” PARA 2006, Umea, Sweden, June 2006.

(223.53 KB)

Dongarra, J., D. Gannon, G. Fox, and K. Kennedy, “The Impact of Multicore on Computational Science Software,” CTWatch Quarterly, vol. 3, issue 1, February 2007.

Ma, T., A. Bouteiller, G. Bosilca, and J. Dongarra, “Impact of Kernel-Assisted MPI Communication over Scientific Applications: CPMD and FFTW,” 18th EuroMPI, Santorini, Greece, Springer, pp. 247-254, September 2011.

Agullo, E., C. Augonnet, J. Dongarra, H. Ltaeif, R. Namyst, S. Thibault, and S. Tomov, “A Hybridization Methodology for High-Performance Linear Algebra Software for GPUs,” in GPU Computing Gems, Jade Edition, vol. 2: Elsevier, pp. 473-484, 00 2011.

Ltaeif, H., S. Tomov, R. Nath, and J. Dongarra, “Hybrid Multicore Cholesky Factorization with Multiple GPU Accelerators,” IEEE Transaction on Parallel and Distributed Systems (submitted), March 2010.

(3.75 MB)

Haidar, A., J. Dongarra, K. Kabir, M. Gates, P. Luszczek, S. Tomov, and Y. Jia, “HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi,” Scientific Programming, vol. 23, issue 1, January 2015.

(553.94 KB)

Dongarra, J., and P. Luszczek, “HPC Challenge: Design, History, and Implementation Highlights,” On the Road to Exascale Computing: Contemporary Architectures in High Performance Computing (to appear): Chapman & Hall/CRC Press, 00 2012.

(469.92 KB)

Dongarra, J., and P. Luszczek, “How Elegant Code Evolves With Hardware: The Case Of Gaussian Elimination,” in Beautiful Code Leading Programmers Explain How They Think: O'Reilly Media, Inc., June 2007.

(257 KB)

Dongarra, J., and P. Luszczek, “How Elegant Code Evolves With Hardware: The Case Of Gaussian Elimination,” in Beautiful Code Leading Programmers Explain How They Think (Chapter 14), pp. 243-282, January 2008.

(257 KB)

White, J. B., and J. Dongarra, “High-Performance High-Resolution Semi-Lagrangian Tracer Transport on a Sphere,” Journal of Computational Physics, vol. 230, issue 17, pp. 6778-6799, July 2011.

(1.68 MB)

Dongarra, J., M. A. Heroux, and P. Luszczek, “High-Performance Conjugate-Gradient Benchmark: A New Metric for Ranking High-Performance Computing Systems,” The International Journal of High Performance Computing Applications, 2015.

(336.19 KB)

Chen, Z., and J. Dongarra, “Highly Scalable Self-Healing Algorithms for High Performance Scientific Computing,” IEEE Transactions on Computers, vol. 58, issue 11, pp. 1512-1524, November 2009.

(1.81 MB)

Dongarra, J., and P. Luszczek, “High Performance Development for High End Computing with Python Language Wrapper (PLW),” International Journal for High Performance Computer Applications, vol. 21, no. 3, pp. 360-369, 00 2007.

(179.32 KB)

Du, P., P. Luszczek, and J. Dongarra, “High Performance Dense Linear System Solver with Soft Error Resilience,” IEEE Cluster 2011, Austin, TX, September 2011.

(1.27 MB)

Du, P., P. Luszczek, and J. Dongarra, “High Performance Dense Linear System Solver with Resilience to Multiple Soft Errors,” ICCS 2012, Omaha, NE, June 2012.

(1.27 MB)

Dongarra, J., M. A. Heroux, and P. Luszczek, “High Performance Conjugate Gradient Benchmark: A new Metric for Ranking High Performance Computing Systems,” International Journal of High Performance Computing Applications, vol. 30, issue 1, pp. 3 - 10, February 2016.

(277.51 KB)

Dongarra, J., H. Meuer, H. D. Simon, and E. Strohmaier, “High Performance Computing Trends,” HERMIS, vol. 2, pp. 155-163, November 2001.

Dongarra, J., and A. J. van der Steen, “High Performance Computing Systems: Status and Outlook,” Acta Numerica, vol. 21, Cambridge, UK, Cambridge University Press, pp. 379-474, May 2012.

(1.48 MB)

Palma, J., J. Dongarra, and V. Hernández, “High Performance Computing for Computational Science,” Lecture Notes in Computer Science, vol. 2565, VECPAR 2002, 5th International Conference June 26-28, 2002, Springer-Verlag, Berlin, January 2003.

Ltaeif, H., P. Luszczek, and J. Dongarra, “High Performance Bidiagonal Reduction using Tile Algorithms on Homogeneous Multicore Architectures,” ACM Transactions on Mathematical Software (TOMS), vol. 39, issue 3, no. 16, 2013.

(665.7 KB)

Ma, T., G. Bosilca, A. Bouteiller, and J. Dongarra, “HierKNEM: An Adaptive Framework for Kernel-Assisted and Topology-Aware Collective Communications on Many-core Clusters,” IPDPS 2012 (Best Paper), Shanghai, China, May 2012.

(165.9 KB)

Main menu

Pages