Publications

Export 1287 results:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
M
Marin, G., J. Dongarra, and D. Terpstra, MIAMI: A Framework for Application Performance Diagnosis ,” IPASS-2014, Monterey, CA, IEEE, March 2014. DOI: 10.1109/ISPASS.2014.6844480  (1010.75 KB)
Beck, M., D. Arnold, A. Bassi, F. Berman, H. Casanova, J. Dongarra, T. Moore, G. Obertelli, J. Plank, M. Swany, et al., Middleware for the Use of Storage in Communication,” Parallel Computing, vol. 28, no. 12, pp. 1773-1788, August 2002.  (87.97 KB)
Tsai, Y-H. Mike, N. Beams, and H. Anzt, Mixed Precision Algebraic Multigrid on GPUs,” Parallel Processing and Applied Mathematics (PPAM 2022), vol. 13826, Cham, Springer International Publishing, April 2023. DOI: 10.1007/978-3-031-30442-2_9
Cayrols, S., J. Li, G. Bosilca, S. Tomov, A. Ayala, and J. Dongarra, Mixed precision and approximate 3D FFTs: Speed for accuracy trade-off with GPU-aware MPI and run-time data compression,” ICL Technical Report, no. ICL-UT-22-04, May 2022.  (706.14 KB)
Buttari, A., J. Dongarra, J. Langou, J. Langou, P. Luszczek, and J. Kurzak, Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems,” International Journal of High Performance Computer Applications (to appear), August 2007.  (157.4 KB)
Lopez, F., and T. Mary, Mixed Precision LU Factorization on GPU Tensor Cores: Reducing Data Movement and Memory Footprint,” Innovative Computing Laboratory Technical Report, no. ICL-UT-20-13: University of Tennessee, September 2020.  (409 KB)
Tsai, Y. M., P. Luszczek, and J. Dongarra, Mixed-Precision Algorithm for Finding Selected Eigenvalues and Eigenvectors of Symmetric and Hermitian Matrices,” ICL Technical Report, no. ICL-UT-21-05, August 2021.  (3.93 MB)
Yamazaki, I., S. Tomov, J. Kurzak, J. Dongarra, and J. Barlow, Mixed-precision Block Gram Schmidt Orthogonalization,” 6th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, Austin, TX, ACM, November 2015.  (235.69 KB)
Yamazaki, I., S. Tomov, and J. Dongarra, Mixed-Precision Cholesky QR Factorization and its Case Studies on Multicore CPU with Multiple GPUs,” SIAM Journal on Scientific Computing, vol. 37, no. 3, pp. C203-C330, May 2015. DOI: DOI:10.1137/14M0973773  (374.8 KB)
Haidar, A., H. Bayraktar, S. Tomov, J. Dongarra, and N. J. Higham, Mixed-Precision Iterative Refinement using Tensor Cores on GPUs to Accelerate Solution of Linear Systems,” Proceedings of the Royal Society A, vol. 476, issue 2243, November 2020. DOI: 10.1098/rspa.2020.0110  (2.24 MB)
Yamazaki, I., J. Barlow, S. Tomov, J. Kurzak, and J. Dongarra, Mixed-precision orthogonalization process Performance on multicore CPUs with GPUs,” 2015 SIAM Conference on Applied Linear Algebra, Atlanta, GA, SIAM, October 2015.  (301.01 KB)
Yamazaki, I., S. Tomov, T. Dong, and J. Dongarra, Mixed-precision orthogonalization scheme and adaptive step size for CA-GMRES on GPUs,” VECPAR 2014 (Best Paper), Eugene, OR, June 2014.  (438.54 KB)
Haidar, A., H. Bayraktar, S. Tomov, J. Dongarra, and N. J. Higham, Mixed-Precision Solution of Linear Systems Using Accelerator-Based Computing,” Innovative Computing Laboratory Technical Report, no. ICL-UT-20-05: University of Tennessee, May 2020.  (1.03 MB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Mixed-Tool Performance Analysis on Hybrid Multicore Architectures,” First International Workshop on Parallel Software Tools and Tool Infrastructures (PSTI 2010), San Diego, CA, September 2010.  (1.24 MB)
Faverge, M., J. Herrmann, J. Langou, B. Lowery, Y. Robert, and J. Dongarra, Mixing LU-QR Factorization Algorithms to Design High-Performance Dense Linear Algebra Solvers,” Journal of Parallel and Distributed Computing, vol. 85, pp. 32-46, November 2015. DOI: doi:10.1016/j.jpdc.2015.06.007  (5.06 MB)
Dongarra, J., A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, and A. YarKhan, Model-Driven One-Sided Factorizations on Multicore, Accelerated Systems,” Supercomputing Frontiers and Innovations, vol. 1, issue 1, 2014. DOI: http://dx.doi.org/10.14529/jsfi1401  (1.86 MB)
Song, F., S. Moore, and J. Dongarra, Modeling of L2 Cache Behavior for Thread-Parallel Scientific Programs on Chip Multi-Processors,” University of Tennessee Computer Science Technical Report, no. UT-CS-06-583, January 2006.  (652.93 KB)
Supinski, B. R. de, S. Alam, D. Bailey, L. Carrington, C. Daley, A. Dubey, T. Gamblin, D. Gunter, P. D. Hovland, H. Jagode, et al., Modeling the Office of Science Ten Year Facilities Plan: The PERI Architecture Tiger Team,” SciDAC 2009, Journal of Physics: Conference Series, vol. 180(2009)012039, San Diego, California, IOP Publishing, July 2009.  (906.39 KB)
Sharp, D., M. Stoyanov, S. Tomov, and J. Dongarra, A More Portable HeFFTe: Implementing a Fallback Algorithm for Scalable Fourier Transforms,” ICL Technical Report, no. ICL-UT-21-04: University of Tennessee, August 2021.  (493.17 KB)
Pjesivac–Grbovic, J., G. Fagg, T. Angskun, G. Bosilca, and J. Dongarra, MPI Collective Algorithm Selection and Quadtree Encoding,” ICL Technical Report, no. ICL-UT-06-11, 00 2006.  (308.39 KB)
Pjesivac–Grbovic, J., G. Bosilca, G. Fagg, T. Angskun, and J. Dongarra, MPI Collective Algorithm Selection and Quadtree Encoding,” Parallel Computing (Special Edition: EuroPVM/MPI 2006): Elsevier, 00 2007.  (308.39 KB)
Pjesivac–Grbovic, J., G. Fagg, T. Angskun, G. Bosilca, and J. Dongarra, MPI Collective Algorithm Selection and Quadtree Encoding,” Lecture Notes in Computer Science, vol. 4192, no. ICL-UT-06-13: Springer Berlin / Heidelberg, pp. 40-48, September 2006.  (308.39 KB)
Schuchart, J., and G. Bosilca, MPI Continuations And How To Invoke Them,” Sustained Simulation Performance 2021, Cham, Springer International Publishing, pp. 67 - 83, February 2023. DOI: 10.1007/978-3-031-18046-010.1007/978-3-031-18046-0_5
Snir, M., S. Otto, S. Huss-Lederman, D. Walker, and J. Dongarra, MPI - The Complete Reference, Volume 1: The MPI Core , Second, Cambridge, MA, USA, MIT Press, pp. 426, August 1998.
Danalis, A., L. Pollock, M. Swany, and J. Cavazos, MPI-aware Compiler Optimizations for Improving Communication-Computation Overlap,” Proceedings of the 23rd annual International Conference on Supercomputing (ICS '09), Yorktown Heights, NY, USA, ACM, pp. 316-325, June 2009.  (308.92 KB)
Bouteiller, A., F. Cappello, J. Dongarra, A. Guermouche, T. Herault, and Y. Robert, Multi-criteria checkpointing strategies: optimizing response-time versus resource utilization,” University of Tennessee Computer Science Technical Report, no. ICL-UT-13-01, February 2013.  (497.64 KB)
Bouteiller, A., F. Cappello, J. Dongarra, A. Guermouche, T. Herault, and Y. Robert, Multi-criteria Checkpointing Strategies: Response-Time versus Resource Utilization,” Euro-Par 2013, Aachen, Germany, Springer, August 2013.  (431.84 KB)
John, J., J. Milthorpe, T. Herault, and G. Bosilca, Multi-GPU work sharing in a task-based dataflow programming model,” Future Generation Computer Systems, vol. 156, pp. 313 - 324, July 2024. DOI: 10.1016/j.future.2024.03.017
Benoit, A., A. Cavelan, Y. Robert, and H. Sun, Multi-Level Checkpointing and Silent Error Detection for Linear Workflows,” Journal of Computational Science, vol. 28, pp. 398–415, September 2018.
Goebel, F., H. Anzt, T. Cojean, G. Flegar, and E. S. Quintana-Orti, Multiprecision Block-Jacobi for Iterative Triangular Solves,” European Conference on Parallel Processing (Euro-Par 2020): Springer, August 2020. DOI: 10.1007/978-3-030-57675-2_34
Bouteiller, A., T. Herault, and G. Bosilca, A Multithreaded Communication Substrate for OpenSHMEM,” 8th International Conference on Partitioned Global Address Space Programming Models (PGAS), Eugene, OR, October 2014.  (261.66 KB)
Buttari, A., J. Dongarra, P. Husbands, J. Kurzak, and K. Yelick, Multithreading for synchronization tolerance in matrix factorization,” Journal of Physics: Conference Series, SciDAC 2007, vol. 78, no. 2007, January 2007.  (577.73 KB)
Kurzak, J., P. Luszczek, A. YarKhan, M. Faverge, J. Langou, H. Bouwmeester, and J. Dongarra, Multithreading in the PLASMA Library,” Multi and Many-Core Processing: Architecture, Programming, Algorithms, & Applications: Taylor & Francis, 00 2013.  (536.28 KB)
N
Jones, W. B., G. Bester, A. Canning, A. Franceschetti, P. A. Graf, K. Kim, J. Langou, L-W. Wang, J. Dongarra, and A. Zunger, NanoPSE: A Nanoscience Problem Solving Environment for Atomistic Electronic Structure of Semiconductor Nanostructures,” Journal of Physics: Conference Series, issue 16, pp. 277-282, June 2005. DOI: 10.1088/1742-6596/16/1/038  (476.64 KB)
Browne, S., J. Dongarra, J. Horner, P. McMahan, and S. Wells, National HPCC Software Exchange (NHSE): Uniting the High Performance Computing and Communications Community,” D-Lib Magazine, January 1998.  (56.15 KB)
Moore, K., and J. Dongarra, NetBuild,” University of Tennessee Computer Science Technical Report, no. UT-CS-O1-461, January 2001.  (17.71 KB)
Moore, K., J. Dongarra, S. Moore, and E. Grosse, NetBuild: Automated Installation and Use of Network-Accessible Software Libraries,” ICL Technical Report, no. ICL-UT-04-02, January 2004.  (80.52 KB)
Moore, K., and J. Dongarra, NetBuild: Transparent Cross-Platform Access to Computational Software Libraries,” Concurrency and Computation: Practice and Experience, Special Issue: Grid Computing Environments, vol. 14, no. 13-15, pp. 1445-1456, November 2002.  (74.84 KB)
Dongarra, J., G. H. Golub, C. Moler, and K. Moore, Netlib and NA-Net: building a scientific computing community,” In IEEE Annals of the History of Computing (to appear), August 2007.  (352.71 KB)
Dongarra, J., G. H. Golub, E. Grosse, C. Moler, and K. Moore, Netlib and NA-Net: Building a Scientific Computing Community,” IEEE Annals of the History of Computing, vol. 30, no. 2, pp. 30-41, January 2008.  (352.71 KB)
Arnold, D., and J. Dongarra, The NetSolve Environment: Progressing Towards the Seamless Grid,” 2000 International Conference on Parallel Processing (ICPP-2000), Toronto, Canada, August 2000.  (148.85 KB)
Seymour, K., A. YarKhan, S. Agrawal, and J. Dongarra, NetSolve: Grid Enabling Scientific Computing Environments,” Grid Computing and New Frontiers of High Performance Processing, no. 14: Elsevier, 00 2005.  (425 KB)
Agrawal, S., J. Dongarra, K. Seymour, and S. Vadhiyar, NetSolve: Past, Present, and Future - A Look at a Grid Enabled Server,” Making the Global Infrastructure a Reality: Wiley Publishing, 00 2003.  (158.19 KB)
Casanova, H., S. Matsuoka, and J. Dongarra, Network-Enabled Server Systems: Deploying Scientific Simulations on the Grid,” 2001 High Performance Computing Symposium (HPC'01), part of the Advance Simulation Technologies Conference, Seattle, Washington, April 2001.  (175.23 KB)
Dongarra, J., Network-Enabled Solvers: A Step Toward Grid-Based Computing,” SIAM News, vol. 34, no. 10, December 2001.
Haidar, A., P. Luszczek, and J. Dongarra, New Algorithm for Computing Eigenvectors of the Symmetric Eigenvalue Problem,” Workshop on Parallel and Distributed Scientific and Engineering Computing, IPDPS 2014 (Best Paper), Phoenix, AZ, IEEE, May 2014. DOI: 10.1109/IPDPSW.2014.130  (2.33 MB)
Hoefler, T., J. M. Squyres, G. Fagg, G. Bosilca, W. Rehm, and A. Lumsdaine, A New Approach to MPI Collective Communication Implementations,” Distributed and Parallel Systems: Springer US, pp. 45-54, 2007. DOI: 10.1007/978-0-387-69858-8_5  (140.2 KB)
Berman, F., H. Casanova, A. Chien, K. Cooper, H. Dail, A. Dasgupta, W. Deng, J. Dongarra, L. Johnsson, K. Kennedy, et al., New Grid Scheduling and Rescheduling Methods in the GrADS Project,” International Journal of Parallel Programming, vol. 33, no. 2: Springer, pp. 209-229, June 2005.  (306.41 KB)
Dongarra, J., M. A. Heroux, and P. Luszczek, A New Metric for Ranking High-Performance Computing Systems,” National Science Review, vol. 3, issue 1, pp. 30-35, January 2016. DOI: 10.1093/nsr/nwv084  (393.55 KB)
Dongarra, J., and P. Raghavan, A New Recursive Implementation of Sparse Cholesky Factorization,” Proceedings of 16th IMACS World Congress 2000 on Scientific Computing, Applications Mathematics and Simulation, Lausanne, Switzerland, August 2000.

Pages