Publications

Export 1227 results:
2022
Abdulah, S., Q. Cao, Y. Pei, G. Bosilca, J. Dongarra, M. G. Genton, D. E. Keyes, H. Ltaief, and Y. Sun, Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC,” IEEE Transactions on Parallel and Distributed Systems, vol. 33, issue 4, pp. 964 - 976, April 2022. DOI: 10.1109/TPDS.2021.3084071
Abdelfattah, A.., P.. Ghysels, W.. Boukaram, S.. Tomov, X.. Li, and J.. Dongarra, Addressing Irregular Patterns of Matrix Computations on GPUs and Their Impact on Applications Powered by Sparse Direct Solvers,” 2022 SC22: International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (SC), Los Alamitos, CA, USA, IEEE Computer Society, pp. 354-367, November 2022.  (1.57 MB)
Ayala, A., S. Tomov, P. Luszczek, S. Cayrols, G. Ragghianti, and J. Dongarra, Analysis of the Communication and Computation Cost of FFT Libraries towards Exascale,” ICL Technical Report, no. ICL-UT-22-07: Innovative Computing Laboratory, July 2022.  (5.91 MB)
Abdelfattah, A., S. Tomov, and J. Dongarra, Batch QR Factorization on GPUs: Design, Optimization, and Tuning,” Lecture Notes in Computer Science, vol. 13350, Cham, Springer International Publishing, June 2022. DOI: 10.1007/978-3-031-08751-6_5
Benoit, A., Y. Du, T. Herault, L. Marchal, G. Pallez, L. Perotin, Y. Robert, H. Sun, and F. Vivien, Checkpointing à la Young/Daly: an overview,” IC3, the 14th Int. Conf. on Contemporary Computing: ACM Press, August 2022.  (639.77 KB)
Alomairy, R., M. Gates, S. Cayrols, D. Sukkari, K. Akbudak, A. YarKhan, P. Bagwell, and J. Dongarra, Communication Avoiding LU with Tournament Pivoting in SLATE,” SLATE Working Notes, no. 18, ICL-UT-22-01, January 2022.  (3.74 MB)
Bosilca, G., A. Bouteiller, T. Herault, V. Le Fèvre, Y. Robert, and J. Dongarra, Comparing Distributed Termination Detection Algorithms for Modern HPC Platforms,” International Journal of Networking and Computing, vol. 12, issue 1, pp. 26 - 46, January 2022. DOI: 10.15803/ijnc.12.1_26
Kovalchuk, S. V., V. V. Krzhizhanovskaya, M. Paszyński, D. Kranzlmüller, J. Dongarra, and P. M. A. Sloot, Computational science for a better future,” Journal of Computational Science, vol. 62, pp. 101745, July 2022. DOI: 10.1016/j.jocs.2022.101745
Sid-Lakhdar, W. M., M. Aznaveh, P. Luszczek, and J. Dongarra, Deep Gaussian process with multitask and transfer learning for performance optimization,” 2022 IEEE High Performance Extreme Computing Conference (HPEC), pp. 1-7, September 2022. DOI: 10.1109/HPEC55821.2022.9926396
Cao, Q., G. Bosilca, N. Losada, W. Wu, D. Zhong, and J. Dongarra, Evaluating Data Redistribution in PaRSEC,” IEEE Transactions on Parallel and Distributed Systems, vol. 33, no. 8, pp. 1856-1872, 2022. DOI: 10.1109/TPDS.2021.3131657  (3.19 MB)
Fortenberry, A., S. Tomov, and K. Wong, Extending MAGMA Portability with OneAPI , Dallas, TX, The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC22), ACM Student Research Competition, November 2022.  (1.33 MB)
Ayala, A., S. Tomov, P. Luszczek, S. Cayrols, G. Ragghianti, and J. Dongarra, FFT Benchmark Performance Experiments on Systems Targeting Exascale,” ICL Technical Report, no. ICL-UT-22-02, March 2022.  (5.87 MB)
Cao, Q., R. Alomairy, Y. Pei, G. Bosilca, H. Ltaief, D. Keyes, and J. Dongarra, A Framework to Exploit Data Sparsity in Tile Low-Rank Cholesky Factorization,” IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2022.  (1.03 MB)
Schuchart, J., P. Nookala, M. Mahdi Javanmard, T. Herault, E. F. Valeev, G. Bosilca, and R. J. Harrison, Generalized Flow-Graph Programming Using Template Task-Graphs: Initial Implementation and Assessment,” 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Lyon, France, IEEE, July 2022. DOI: 10.1109/IPDPS53621.2022.00086
Whitlock, M., N. Morales, G. Bosilca, A. Bouteiller, B. Nicolae, K. Teranishi, E. Giem, and V. Sarkar, Integrating process, control-flow, and data resiliency layers using a hybrid Fenix/Kokkos approach,” 2022 IEEE International Conference on Cluster Computing (CLUSTER 2022), Heidelberg, Germany, September 2022.
Cayrols, S., J. Li, G. Bosilca, S. Tomov, A. Ayala, and J. Dongarra, Lossy all-to-all exchange for accelerating parallel 3-D FFTs on hybrid architectures with GPUs,” 2022 IEEE International Conference on Cluster Computing (CLUSTER), pp. 152-160, September 2022. DOI: 10.1109/CLUSTER51413.2022.00029
Cayrols, S., J. Li, G. Bosilca, S. Tomov, A. Ayala, and J. Dongarra, Mixed precision and approximate 3D FFTs: Speed for accuracy trade-off with GPU-aware MPI and run-time data compression,” ICL Technical Report, no. ICL-UT-22-04, May 2022.  (706.14 KB)
Benoit, A., L. Perotin, Y. Robert, and H. Sun, Online scheduling of moldable task graphs under common speedup models,” ICPP'2022, the 50th Int. Conf. on Parallel Processing: ACM Press, 2022.  (622.81 KB)
Bak, S., C. Bertoni, S. Boehm, R. Budiardja, B. M. Chapman, J. Doerfert, M. Eisenbach, H. Finkel, O. Hernandez, J. Huber, et al., OpenMP application experiences: Porting to accelerated nodes,” Parallel Computing, vol. 109, March 2022. DOI: 10.1016/j.parco.2021.102856
Du, Y., G. Pallez, L. Marchal, and Y. Robert, Optimal checkpointing strategies for iterative applications,” IEEE Trans. Parallel Distributed Systems, vol. 33, no. 3, pp. 507-522, 2022.  (1.47 MB)
Sid-Lakhdar, W. M., S. Cayrols, D. Bielich, A. Abdelfattah, P. Luszczek, M. Gates, S. Tomov, H. Johansen, D. Williams-Young, T. A. Davis, et al., PAQR: Pivoting Avoiding QR factorization,” ICL Technical Report, no. ICL-UT-22-06, June 2022.  (364.85 KB)
Ayala, A., S. Tomov, M. Stoyanov, A. Haidar, and J. Dongarra, Performance Analysis of Parallel FFT on Large Multi-GPU Systems,” 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Lyon, France, IEEE, August 2022. DOI: 10.1109/IPDPSW55747.2022.00072
Tsai, Y-H. M., T. Cojean, and H. Anzt, Providing performance portable numerics for Intel GPUs,” Concurrency and Computation: Practice and Experience, vol. n/a, no. n/a, pp. e7400, October 2022. DOI: 10.1002/cpe.7400
Schuchart, J., P. Nookala, T. Herault, E. F. Valeev, and G. Bosilca, Pushing the Boundaries of Small Tasks: Scalable Low-Overhead Data-Flow Programming in TTG,” 2022 IEEE International Conference on Cluster Computing (CLUSTER)2022 IEEE International Conference on Cluster Computing (CLUSTER), Heidelberg, Germany, IEEE, September 2022. DOI: 10.1109/CLUSTER51413.2022.00026
Murray, R., J. Demmel, M. W. Mahoney, B. N. Erichson, M. Melnichenko, O. Asif Malik, L. Grigori, M. Dereziński, M. E. Lopes, T. Liang, et al., Randomized Numerical Linear Algebra: A Perspective on the Field with an Eye to Software , 2022.  (1.05 MB)
Reed, D., D. Gannon, and J. Dongarra, Reinventing High Performance Computing: Challenges and Opportunities,” ICL Technical Report, no. ICL-UT-22-03, March 2022.  (1.36 MB)
Dongarra, J., and A. Geist, Report on the Oak Ridge National Laboratory's Frontier System,” ICL Technical Report, no. ICL-UT-22-05, May 2022.  (16.87 MB)
Cao, Q.., S.. Abdulah, R.. Alomairy, Y.. Pei, P.. Nag, G.. Bosilca, J.. Dongarra, M.. G. Genton, D.. E. Keyes, H.. Ltaief, et al., Reshaping Geostatistical Modeling and Prediction for Extreme-Scale Environmental Applications,” 2022 SC22: International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (SC), Los Alamitos, CA, USA, IEEE Computer Society, pp. 13-24, November 2022.
Benoit, A., V. Le Fèvre, L. Perotin, P. Raghavan, Y. Robert, and H. Sun, Resilient scheduling of moldable parallel jobs to cope with silent errors,” IEEE Trans. Computers, vol. 71, no. 7, 2022.  (1.52 MB)
Luszczek, P., and C. Brown, Surrogate ML/AI Model Benchmarking for FAIR Principles' Conformance,” 2022 IEEE High Performance Extreme Computing Conference (HPEC)2022 IEEE High Performance Extreme Computing Conference (HPEC), Waltham, MA, USA, IEEE, September 2022. DOI: 10.1109/HPEC55821.2022.9926401
Lindquist, N., M. Gates, P. Luszczek, and J. Dongarra, Threshold Pivoting for Dense LU Factorization,” ScalAH22: 13th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems , Dallas, Texas, IEEE, 2022.  (721.77 KB)
Zhong, D., Q. Cao, G. Bosilca, and J. Dongarra, Using long vector extensions for MPI reductions,” Parallel Computing, vol. 109, pp. 102871, March 2022. DOI: 10.1016/j.parco.2021.102871
2021
Kovalchuk, S. V., V. V. Krzhizhanovskaya, PMA. Sloot, G. Závodszky, M. H. Lees, M. Paszyński, and J. Dongarra, 20 years of computational science: Selected papers from 2020 International Conference on Computational Science,” Journal of Computational Science, vol. 53, pp. 101395–101395, 2021. DOI: 10.1016/j.jocs.2021.101395
Ayala, A., S. Tomov, A. Haidar, M.. Stoyanov, S. Cayrols, J. Li, G. Bosilca, and J. Dongarra, Accelerating FFT towards Exascale Computing : NVIDIA GPU Technology Conference (GTC2021), 2021.  (27.23 MB)
Lindquist, N., P. Luszczek, and J. Dongarra, Accelerating Restarted GMRES with Mixed Precision Arithmetic,” IEEE Transactions on Parallel and Distributed Systems, June 2021. DOI: 10.1109/TPDS.2021.3090757  (572.4 KB)
Caron, E., Y. Caniou, A K W. Chang, and Y. Robert, Budget-aware scheduling algorithms for scientific workflows with stochastic task weights on IaaS Cloud platforms,” Concurrency and Computation: Practice and Experience, vol. 33, no. 17, pp. e6065, 2021. DOI: 10.1002/cpe.6065  (1.99 MB)
Schuchart, J., P. Samfass, C. Niethammer, J. Gracia, and G. Bosilca, Callback-based completion notification using MPI Continuations,” Parallel Computing, vol. 21238566, issue 0225, pp. 102793, May Jan. DOI: 10.1016/j.parco.2021.102793
Herault, T., Y. Robert, G. Bosilca, R. Harrison, C. Lewis, E. Valeev, and J. Dongarra, Distributed-Memory Multi-GPU Block-Sparse Tensor Contraction for Electronic Structure,” 35th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2021), Portland, OR, IEEE, May 2021.
Bosilca, G., T. Herault, and J. Dongarra, DTE: PaRSEC Enabled Libraries and Applications : 2021 Exascale Computing Project Annual Meeting, April 2021.  (3.24 MB)
Bathie, G., L. Marchal, Y. Robert, and S. Thibault, Dynamic DAG scheduling under memory constraints for shared-memory platforms,” Int. J. of Networking and Computing, vol. 11, no. 1, pp. 27-49, 2021.  (574.64 KB)
Kolev, T., P. Fischer, M. Min, J. Dongarra, J. Brown, V. Dobrev, T. Warburton, S. Tomov, M. S. Shephard, A. Abdelfattah, et al., Efficient exascale discretizations: High-order finite element methods,” The International Journal of High Performance Computing Applications, pp. 10943420211020803, 2021. DOI: 10.1177/10943420211020803
Barry, D., A. Danalis, and H. Jagode, Effortless Monitoring of Arithmetic Intensity with PAPI’s Counter Analysis Toolkit,” Tools for High Performance Computing 2018/2019: Springer, pp. 195–218, 2021. DOI: 10.1007/978-3-030-66057-4_11
Gao, Y., G. Pallez, Y. Robert, and F. Vivien, Evaluating Task Dropping Strategies for Overloaded Real-Time Systems (Work-In-Progress),” 42nd Real Time Systems Symposium (RTSS): IEEE Computer Society Press, 2021.  (217.13 KB)
Iqbal, Z., S. Nooshabadi, I. Yamazaki, S. Tomov, and J. Dongarra, Exploiting Block Structures of KKT Matrices for Efficient Solution of Convex Optimization Problems,” IEEE Access, 2021. DOI: 10.1109/ACCESS.2021.3106054  (1.35 MB)
Anzt, H., N. Beams, T. Cojean, F. Göbel, T. Grützmacher, A. Kashi, P. Nayak, T. Ribizel, and Y. M. Tsai, Gingko: A Sparse Linear Algebrea Library for HPC : 2021 ECP Annual Meeting, April 2021.  (893.04 KB)
Abdelfattah, A., V. Barra, N. Beams, R. Bleile, J. Brown, J-S. Camier, R. Carson, N. Chalmers, V. Dobrev, Y. Dudouit, et al., GPU algorithms for Efficient Exascale Discretizations,” Parallel Computing, vol. 108, pp. 102841, 2021. DOI: 10.1016/j.parco.2021.102841
Ayala, A., S. Tomov, P. Luszczek, S. Cayrols, G. Ragghianti, and J. Dongarra, Interim Report on Benchmarking FFT Libraries on High Performance Systems,” Innovative Computing Laboratory Technical Report, no. ICL-UT-21-03: University of Tennessee, July 2021.  (2.68 MB)
Hori, A., E. Jeannot, G. Bosilca, T. Ogura, B. Gerofi, J. Yin, and Y. Ishikawa, An international survey on MPI users,” Parallel Computing, vol. 108, December 2021. DOI: 10.1016/j.parco.2021.102853  (1.49 MB)
Penchoff, D. A., E. Valeev, H. Jagode, P. Luszczek, A. Danalis, G. Bosilca, R. J. Harrison, J. Dongarra, and T. L. Windus, An Introduction to High Performance Computing and Its Intersection with Advances in Modeling Rare Earth Elements and Actinides,” Rare Earth Elements and Actinides: Progress in Computational Science Applications, vol. 1388, Washington, DC, American Chemical Society, pp. 3-53, October 2021. DOI: 10.1021/bk-2021-1388.ch001
Jagode, H., H. Anzt, H. Ltaief, and P. Luszczek, Lecture Notes in Computer Science: High Performance Computing , vol. 12761: Springer International Publishing, 2021. DOI: 10.1007/978-3-030-90539-2

Pages