Publications

Show only items where

Author

Type

Term

Year

Keyword

Export 1275 results:

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

Jagode, H., A. YarKhan, A. Danalis, and J. Dongarra, “Power Management and Event Verification in PAPI,” Tools for High Performance Computing 2015: Proceedings of the 9th International Workshop on Parallel Tools for High Performance Computing, September 2015, Dresden, Germany, Dresden, Germany, Springer International Publishing, pp. pp. 41-51, 2016.

(565.14 KB)

Jagode, H., A. Danalis, and J. Dongarra, What it Takes to keep PAPI Instrumental for the HPC Community , Collegeville, MN, The 2019 Collegeville Workshop on Sustainable Scientific Software (CW3S19), July 2019.

(3.29 MB)

Jagode, H., and A. Danalis, PULSE: PAPI Unifying Layer for Software-Defined Events (Poster) , Seattle, WA, 2020 NSF Cyberinfrastructure for Sustained Scientific Innovation (CSSI) Principal Investigator Meeting, February 2020.

(1.86 MB)

Jagode, H., A. Danalis, G. Bosilca, and J. Dongarra, “Accelerating NWChem Coupled Cluster through dataflow-based Execution,” 11th International Conference on Parallel Processing and Applied Mathematics (PPAM 2015), Krakow, Poland, Springer International Publishing, September 2015.

(452.82 KB)

Jagode, H., J. Dongarra, S. Alam, J. Vetter, W.. Spear, and A. D. Malony, “A Holistic Approach for Performance Measurement and Analysis for Petascale Applications,” ICCS 2009 Joint Workshop: Tools for Program Development and Analysis in Computational Science and Software Engineering for Large-Scale Computing, vol. 2009, Baton Rouge, Louisiana, Springer-Verlag Berlin Heidelberg 2009, pp. 686-695, May 2009.

(3.96 MB)

Jagode, H., A. Knuepfer, J. Dongarra, M. Jurenz, M. S. Mueller, and W. E. Nagel, “Trace-based Performance Analysis for the Petascale Simulation Code FLASH,” International Journal of High Performance Computing Applications (to appear), 00 2010.

(887.54 KB)

Jagode, H., A. Danalis, H. Anzt, I. Yamazaki, M. Hoemmen, E. Boman, S. Tomov, and J. Dongarra, “Software-Defined Events (SDEs) in MAGMA-Sparse,” Innovative Computing Laboratory Technical Report, no. ICL-UT-18-12: University of Tennessee, December 2018.

(481.69 KB)

Jagode, H., and J. Hein, “Custom assignment of MPI ranks for parallel multi-dimensional FFTs: Evaluation of BG/P versus BG/L,” Proceedings of the 2008 IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA-08), Sydney, Australia, IEEE Computer Society, pp. 271-283, January 2008.

(2.6 MB)

Jagode, H., “Dataflow Programming Paradigms for Computational Chemistry Methods,” Innovative Computing Laboratory Technical Report, no. ICL-UT-17-01, Knoxville, TN, University of Tennessee, May 2017.

Jagode, H., A. Danalis, and J. Dongarra, PAPI's New Software-Defined Events for In-Depth Performance Analysis , Lyon, France, CCDSC 2018: Workshop on Clusters, Clouds, and Data for Scientific Computing, September 2018.

Jagode, H., A. Danalis, H. Anzt, and J. Dongarra, “PAPI Software-Defined Events for in-Depth Performance Analysis,” The International Journal of High Performance Computing Applications, vol. 33, issue 6, pp. 1113-1127, November 2019.

(442.39 KB)

Jagode, H., A. Danalis, and J. Dongarra, “Accelerating NWChem Coupled Cluster through dataflow-based Execution,” The International Journal of High Performance Computing Applications, vol. 32, issue 4, pp. 540--551, July 2018.

(1.68 MB)

Jagode, H., A. Danalis, and J. Dongarra, “Formulation of Requirements for New PAPI++ Software Package: Part I: Survey Results,” PAPI++ Working Notes, no. 1, ICL-UT-20-02: Innovative Computing Laboratory, University of Tennessee Knoxville, January 2020.

(1.49 MB)

Jagode, H., H. Anzt, H. Ltaief, and P. Luszczek, Lecture Notes in Computer Science: High Performance Computing , vol. 12761: Springer International Publishing, 2021.

Jagode, H., A. Danalis, R. Hoque, M. Faverge, and J. Dongarra, “Evaluation of Dataflow Programming Models for Electronic Structure Theory,” Concurrency and Computation: Practice and Experience: Special Issue on Parallel and Distributed Algorithms, vol. 2018, issue e4490, pp. 1–20, May 2018.

(1.69 MB)

Jagode, H., A. Danalis, and J. Dongarra, “What it Takes to keep PAPI Instrumental for the HPC Community,” 1st Workshop on Sustainable Scientific Software (CW3S19), Collegeville, Minnesota, July 2019.

(50.57 KB)

Jagode, H., S. Moore, D. Terpstra, J. Dongarra, A. Knuepfer, M. Jurenz, M. S. Mueller, and W. E. Nagel, “I/O Performance Analysis for the Petascale Simulation Code FLASH,” ISC'09, Hamburg, Germany, June 2009.

(88.88 KB)

Jagode, H., A. Danalis, and J. Dongarra, “Accelerating NWChem Coupled Cluster through Dataflow-Based Execution,” The International Journal of High Performance Computing Applications, pp. 1–13, January 2017.

(4.07 MB)

Jagode, H., A. Knuepfer, J. Dongarra, M. Jurenz, M. S. Mueller, and W. E. Nagel, “Trace-based Performance Analysis for the Petascale Simulation Code FLASH,” Innovative Computing Laboratory Technical Report, no. ICL-UT-09-01, April 2009.

(887.54 KB)

Jagode, H., A. Danalis, and D. Genet, “Roadmap for Refactoring Classic PAPI to PAPI++: Part II: Formulation of Roadmap Based on Survey Results,” PAPI++ Working Notes, no. 2, ICL-UT-20-09: Innovative Computing Laboratory, University of Tennessee, July 2020.

(763.75 KB)

Iqbal, Z., S. Nooshabadi, I. Yamazaki, S. Tomov, and J. Dongarra, “Exploiting Block Structures of KKT Matrices for Efficient Solution of Convex Optimization Problems,” IEEE Access, 2021.

(1.35 MB)

Hurault, A., and A. YarKhan, “Intelligent Service Trading and Brokering for Distributed Network Services in GridSolve,” VECPAR 2010, 9th International Meeting on High Performance Computing for Computational Science, Berkeley, CA, June 2010.

(256.04 KB)

Hunold, S., A. Bhatele, G. Bosilca, and P. Knees, “Predicting MPI Collective Communication Performance Using Machine Learning,” 2020 IEEE International Conference on Cluster Computing (CLUSTER), Kobe, Japan, IEEE, September 2020.

(619.68 KB)

Horton, M., S. Tomov, and J. Dongarra, “A Class of Hybrid LAPACK Algorithms for Multicore and GPU Architectures,” Symposium for Application Accelerators in High Performance Computing (SAAHPC'11), Knoxville, TN, July 2011.

(329.68 KB)

Hori, A., E. Jeannot, G. Bosilca, T. Ogura, B. Gerofi, J. Yin, and Y. Ishikawa, “An international survey on MPI users,” Parallel Computing, vol. 108, December 2021.

(1.49 MB)

Hori, A., K. Yoshinaga, T. Herault, A. Bouteiller, G. Bosilca, and Y. Ishikawa, “Overhead of Using Spare Nodes,” The International Journal of High Performance Computing Applications, February 2020.

(2.15 MB)

Hori, A., Y. Tsujita, A. Shimada, K. Yoshinaga, N. Mitaro, G. Fukazawa, M. Sato, G. Bosilca, A. Bouteiller, and T. Herault, “System Software for Many-Core and Multi-Core Architectures,” Advanced Software Technologies for Post-Peta Scale Computing: The Japanese Post-Peta CREST Research Project, Singapore, Springer Singapore, pp. 59–75, 2019.

Hori, A., T. Ogura, B. Gerofi, J. Yin, Y. Ishikawa, E. Jeannot, and G. Bosilca, A Report of the MPI International Survey (Poster) , Austin, TX, EuroMPI/USA '20: 27th European MPI Users' Group Meeting, September 2020.

Hoque, R., T. Herault, G. Bosilca, and J. Dongarra, “Dynamic Task Discovery in PaRSEC- A data-flow task-based Runtime,” ScalA17, Denver, ACM, September 2017.

(1.15 MB)

Hoemmen, M., D. Hollman, C. Trott, D. Sunderland, N. Liber, L-T. Lo, D. Lebrun-Grandie, G. Lopez, P. Caday, S. Knepper, et al., “P1673R3: A Free Function Linear algebra Interface Based on the BLAS,” ISO JTC1 SC22 WG22, no. P1673R3: ISO, April 2021.

(858.89 KB)

Hoemmen, M., and I. Yamazaki, Production Implementations of Pipelined & Communication-Avoiding Iterative Linear Solvers , Tokyo, Japan, SIAM Conference on Parallel Processing for Scientific Computing, March 2018.

(2.34 MB)

Hoefler, T., Y-S. Dai, and J. Dongarra, “Towards Efficient MapReduce Using MPI,” Lecture Notes in Computer Science, Recent Advances in Parallel Virtual Machine and Message Passing Interface - 16th European PVM/MPI Users' Group Meeting, vol. 5759, Espoo, Finland, Springer Berlin / Heidelberg, pp. 240-249, 00 2009.

Hoefler, T., B. Stevens, A. F. Prein, J. Baehr, T. Schulthess, T. F. Stocker, J. Taylor, D. Klocke, P. Manninen, P. M. Forster, et al., Earth Virtualization Engines - A Technical Perspective , September 2023.

Hoefler, T., J. M. Squyres, G. Fagg, G. Bosilca, W. Rehm, and A. Lumsdaine, “A New Approach to MPI Collective Communication Implementations,” Distributed and Parallel Systems: Springer US, pp. 45-54, 2007.

(140.2 KB)

Hoefler, T., M. Copik, P. Beckman, A. Jones, I. Foster, M. Parashar, D. Reed, M. Troyer, T. Schulthess, D. Ernst, et al., XaaS: Acceleration as a Service to Enable Productive High-Performance Cloud Computing : arXiv, January 2024.

Hiroyasu, T., M. Miki, M. Sano, H. Shimosaka, S. Tsutsui, and J. Dongarra, “Distributed Probablistic Model-Building Genetic Algorithm,” Lecture Notes in Computer Science, vol. 2723: Springer-Verlag, Heidelberg, pp. 1015-1028, January 2003.

(288.91 KB)

Hiroyasu, T., M. Miki, H. Shimosaka, M. Sano, Y. Tanimura, Y. Mimura, S. Yoshimura, and J. Dongarra, “Truss Structural Optimization Using NetSolve System,” Meeting of the Japan Society of Mechanical Engineers, Kyoto University, Kyoto, Japan, October 2002.

(450.65 KB)

Hiroyasu, T., M. Miki, S. Ogura, K. Aoi, T. Yoshida, Y. Okamoto, and J. Dongarra, “Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover,” Special Issue on Biological Applications of Genetic and Evolutionary Computation (submitted), March 2003.

(438.68 KB)

Hiroyasu, T., M. Miki, H. Shimosaka, and J. Dongarra, “Optimization Problem Solving System using Grid RPC,” 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, Tokyo, Japan, March 2003.

(71.6 KB)

Hiroyasu, T., M. Miki, J. Sawada, and J. Dongarra, “Optimization of Injection Schedule of Diesel Engine Using GridRPC,” Information Processing Society of Japan Symposium Series, vol. 2003, no. 14, pp. 189-197, January 2003.

(520.96 KB)

Hiroyasu, T., M. Miki, K. Kodama, J. Uekawa, and J. Dongarra, “A Simple Installation and Administration Tool for Large-scaled PC Cluster System,” ClusterWorld Conference and Expo, San Jose, CA, March 2003.

(275.97 KB)

Hiroyasu, T., M. Miki, H. Saito, Y. Tanimura, and J. Dongarra, “Static Scheduling for ScaLAPACK on the Grid Using Genetic Algorithm,” Information Processing Society of Japan Symposium Series, vol. 2003, no. 14, pp. 3-10, January 2003.

(506.42 KB)

Hiroyasu, T., M. Miki, H. Shimosaka, Y. Tanimura, and J. Dongarra, “Optimization System Using Grid RPC,” Meeting of the Japan Society of Mechanical Engineers, Kyoto University, Kyoto, Japan, October 2002.

Herrmann, J., G. Bosilca, T. Herault, L. Marchal, Y. Robert, and J. Dongarra, “Assessing the Cost of Redistribution followed by a Computational Kernel: Complexity and Performance Results,” Parallel Computing, vol. 52, pp. 22-41, February 2016.

(2.06 MB)

Heroux, M. A., and J. Dongarra, “Toward a New Metric for Ranking High Performance Computing Systems,” SAND2013 - 4744, June 2013.

(225.32 KB)

Hernandez, O., F. Song, B. Chapman, J. Dongarra, B. Mohr, S. Moore, and F. Wolf, “Performance Instrumentation and Compiler Optimizations for MPI/OpenMP Applications,” Lecture Notes in Computer Science, OpenMP Shared Memory Parallel Programming, vol. 4315: Springer Berlin / Heidelberg, 00 2008.

(350.9 KB)

Hernandez, O., F. Song, B. Chapman, J. Dongarra, B. Mohr, S. Moore, and F. Wolf, “Performance Instrumentation and Compiler Optimizations for MPI/OpenMP Applications,” Second International Workshop on OpenMP, Reims, France, January 2006.

(350.9 KB)

Hermanns, M-A., B. Mohr, and F. Wolf, “Event-based Measurement and Analysis of One-sided Communication,” In Proceedings of the European Conference on Parallel Computing (Euro-Par), Lisbon, Portugal, Springer, August 2005.

(403.44 KB)

Herault, T., Y. Robert, A. Bouteiller, D. Arnold, K. Ferreira, G. Bosilca, and J. Dongarra, “Checkpointing Strategies for Shared High-Performance Computing Platforms,” International Journal of Networking and Computing, vol. 9, no. 1, pp. 28–52, 2019.

(490.5 KB)

Herault, T., Y. Robert, G. Bosilca, R. Harrison, C. Lewis, E. Valeev, and J. Dongarra, “Distributed-Memory Multi-GPU Block-Sparse Tensor Contraction for Electronic Structure,” 35th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2021), Portland, OR, IEEE, May 2021.

Main menu

Publications

Pages