Publications

Show only items where

Author

Type

Term

Year

Keyword

Export 1276 results:

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

Eijkhout, V., “Numerical Metadata API Reference,” Innovative Computing Laboratory Technical Report, February 2007.

(454.79 KB)

Eijkhout, V., “Automatic Determination of Matrix-Blocks,” Lapack Working Note 151, University of Tennessee Computer Science Technical Report, no. UT-CS-01-458, January 2001.

(1.15 MB)

Eijkhout, V., and E. Fuentes, “A Proposed Standard for Matrix Metadata,” Innovative Computing Laboratory Technical Report, no. ICL-UT-03-02, Submitted to ACM TOMS, November 2003.

(13.39 KB)

Eijkhout, V., E. Fuentes, T. Eidson, and J. Dongarra, “The Component Structure of a Self-Adapting Numerical Software System,” International Journal of Parallel Programming, vol. 33, no. 2, June 2005.

(64.88 KB)

Kurzak, J., M. Gates, A. Charara, A. YarKhan, and J. Dongarra, “Least Squares Solvers for Distributed-Memory Machines with GPU Accelerators,” ACM International Conference on Supercomputing (ICS '19), Phoenix, Arizona, ACM, pp. 117–126, June 2019.

(1.63 MB)

Eidson, T., V. Eijkhout, and J. Dongarra, “Improvements in the Efficient Composition of Applications,” IPDPS 2004, NGS Workshop (to appear), Sante Fe, 00 2004.

(42.85 KB)

Eidson, T., J. Dongarra, and V. Eijkhout, “Applying Aspect-Oriented Programming Concepts to a Component-based Programming Model,” IPDPS 2003, Workshop on NSF-Next Generation Software, Nice, France, March 2003.

(66.99 KB)

Eberius, D., T. Patinyasakdikul, and G. Bosilca, “Using Software-Based Performance Counters to Expose Low-Level Open MPI Performance Information,” EuroMPI, Chicago, IL, ACM, September 2017.

(745.58 KB)

Du, P., R. Weber, P. Luszczek, S. Tomov, G. D. Peterson, and J. Dongarra, “From CUDA to OpenCL: Towards a Performance-portable Solution for Multi-platform GPU Programming,” Parallel Computing, vol. 38, no. 8, pp. 391-407, August 2012.

(1.64 MB)

Du, P., P. Luszczek, S. Tomov, and J. Dongarra, “Soft Error Resilient QR Factorization for Hybrid System with GPGPU,” Journal of Computational Science, vol. 4, issue 6, pp. 457–464, November 2013.

(995.45 KB)

Du, P., P. Luszczek, and J. Dongarra, “High Performance Dense Linear System Solver with Soft Error Resilience,” IEEE Cluster 2011, Austin, TX, September 2011.

(1.27 MB)

Du, P., A. Bouteiller, G. Bosilca, T. Herault, and J. Dongarra, “Algorithm-Based Fault Tolerance for Dense Matrix Factorization,” Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012, New Orleans, LA, USA, ACM, pp. 225-234, February 2012.

(865.79 KB)

Du, P., A. Bouteiller, G. Bosilca, T. Herault, and J. Dongarra, “Algorithm-based Fault Tolerance for Dense Matrix Factorizations,” University of Tennessee Computer Science Technical Report, no. UT-CS-11-676, Knoxville, TN, August 2011.

(865.79 KB)

Du, P., P. Luszczek, S. Tomov, and J. Dongarra, “Mixed-Tool Performance Analysis on Hybrid Multicore Architectures,” First International Workshop on Parallel Software Tools and Tool Infrastructures (PSTI 2010), San Diego, CA, September 2010.

(1.24 MB)

Du, Y., G. Pallez, L. Marchal, and Y. Robert, “Optimal Checkpointing Strategies for Iterative Applications,” IEEE Transactions on Parallel Distributed Systems, vol. 33, issue 3, pp. 507-522, March 2022.

(1.47 MB)

Du, P., P. Luszczek, and J. Dongarra, “High Performance Dense Linear System Solver with Resilience to Multiple Soft Errors,” ICCS 2012, Omaha, NE, June 2012.

(1.27 MB)

Du, P., P. Luszczek, S. Tomov, and J. Dongarra, “Soft Error Resilient QR Factorization for Hybrid System with GPGPU,” Journal of Computational Science, Seattle, WA, Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems at SC11, November 2011.

(965.88 KB)

Du, P., P. Luszczek, S. Tomov, and J. Dongarra, “Soft Error Resilient QR Factorization for Hybrid System,” University of Tennessee Computer Science Technical Report, no. UT-CS-11-675, Knoxville, TN, July 2011.

(1.39 MB)

Du, P., P. Luszczek, S. Tomov, and J. Dongarra, “Soft Error Resilient QR Factorization for Hybrid System,” UT-CS-11-675 (also LAPACK Working Note #252), no. ICL-CS-11-675, July 2011.

(1.39 MB)

Du, P., M. Parsons, E. Fuentes, S-L. Shaw, and J. Dongarra, “Tuning Principal Component Analysis for GRASS GIS on Multi-core and GPU Architectures,” FOSS4G 2010, Barcelona, Spain, September 2010.

(1.57 MB)

Du, P., P. Luszczek, and J. Dongarra, “OpenCL Evaluation for Numerical Linear Algebra Library Development,” Symposium on Application Accelerators in High-Performance Computing (SAAHPC '10), Knoxville, TN, July 2010.

(2.69 MB)

Du, Y., L. Marchal, G. Pallez, and Y. Robert, “Robustness of the Young/Daly Formula for Stochastic Iterative Applications,” 49th International Conference on Parallel Processing (ICPP 2020), Edmonton, AB, Canada, ACM Press, August 2020.

(1.11 MB)

Du, P., S. Tomov, and J. Dongarra, “Providing GPU Capability to LU and QR within the ScaLAPACK Framework,” University of Tennessee Computer Science Technical Report (also LAWN 272), no. UT-CS-12-699, September 2012.

(7.48 MB)

Aggarwal, I., P. Nayak, A. Kashi, and H. Anzt, “Preconditioners for Batched Iterative Linear Solvers on GPUs,” Smoky Mountains Computational Sciences and Engineering Conference, vol. 169075: Springer Nature Switzerland, pp. 38 - 53, January 2023.

Dorris, J., A. YarKhan, J. Kurzak, P. Luszczek, and J. Dongarra, “Task Based Cholesky Decomposition on Xeon Phi Architectures using OpenMP,” International Journal of Computational Science and Engineering (IJCSE), vol. 17, no. 3, October 2018.

Doolin, D., J. Dongarra, and K. Seymour, “JLAPACK - Compiling LAPACK Fortran to Java,” Scientific Programming, vol. 7, no. 2, pp. 111-138, October 2002.

(307.46 KB)

Dongarra, J., D. Laforenza, and S.. Orlando, “Recent Advances in Parallel Virtual Machine and Message Passing Interface,” Lecture Notes in Computer Science, vol. 2840: Springer-Verlag, Berlin, January 2003.

Dongarra, J., H. Meuer, and E. Strohmaier, “Top500 Supercomputer Sites (13th edition),” University of Tennessee Computer Science Department Technical Report, no. UT-CS-99-425, June 1999.

(278.51 KB)

Dongarra, J., and D. W. Walker, “The Quest for Petascale Computing,” Computing in Science and Engineering, vol. 3, no. 3, pp. 32-39, May 2001.

(178.3 KB)

Dongarra, J., and P. Luszczek, “How Elegant Code Evolves With Hardware: The Case Of Gaussian Elimination,” in Beautiful Code Leading Programmers Explain How They Think: O'Reilly Media, Inc., June 2007.

(257 KB)

Dongarra, J., and V. Eijkhout, “Self Adapting Numerical Algorithm for Next Generation Applications,” International Journal of High Performance Computing Applications, vol. 17, no. 2, pp. 125-132, January 2003.

(479.18 KB)

Dongarra, J., and V. Eijkhout, “Self-Adapting Numerical Software and Automatic Tuning of Heuristics,” Lecture Notes in Computer Science, vol. 2660, Melbourne, Australia, Springer Verlag, pp. 759-770, June 2003.

(45.95 KB)

Dongarra, J., “Performance of Various Computers Using Standard Linear Equations Software (Linpack Benchmark Report),” University of Tennessee Computer Science Technical Report, UT-CS-89-85, 00 2010.

(6.42 MB)

Dongarra, J., “Trends in High Performance Computing,” The Computer Journal, vol. 47, no. 4: The British Computer Society, pp. 399-403, 00 2004.

(455.96 KB)

Dongarra, J., and S. Moore, “Empirical Performance Tuning of Dense Linear Algebra Software,” in Performance Tuning of Scientific Applications (to appear), 00 2010.

Dongarra, J., T. Dong, M. Gates, A. Haidar, S. Tomov, and I. Yamazaki, MAGMA: A New Generation of Linear Algebra Library for GPU and Multicore Architectures , Salt Lake City, UT, The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC12), Presentation, November 2012.

(4.69 MB)

Dongarra, J., “Performance of Various Computers Using Standard Linear Equations Software,” University of Tennessee Computer Science Technical Report, no. cs-89-85, February 2013.

(539.24 KB)

Dongarra, J., M. A. Heroux, and P. Luszczek, “A New Metric for Ranking High-Performance Computing Systems,” National Science Review, vol. 3, issue 1, pp. 30-35, January 2016.

(393.55 KB)

Dongarra, J., M. Faverge, T. Herault, J. Langou, and Y. Robert, “Hierarchical QR Factorization Algorithms for Multi-Core Cluster Systems,” IPDPS 2012, the 26th IEEE International Parallel and Distributed Processing Symposium, Shanghai, China, IEEE Computer Society Press, May 2012.

(405.71 KB)

Dongarra, J., H. Meuer, and E. Strohmaier, “Top500 Supercomputer Sites (15th edition),” University of Tennessee Computer Science Department Technical Report, no. UT-CS-00-442, June 2000.

(278.88 KB)

Dongarra, J., S. Hammarling, N. J. Higham, S. Relton, and M. Zounon, “Optimized Batched Linear Algebra for Modern Architectures,” Euro-Par 2017, Santiago de Compostela, Spain, Springer, August 2017.

(618.33 KB)

(6.42 MB)

Dongarra, J., M. Gates, A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, and I. Yamazaki, “Accelerating Numerical Dense Linear Algebra Calculations with GPUs,” Numerical Computations with GPUs: Springer International Publishing, pp. 3-28, 2014.

(1.06 MB)

Dongarra, J., “Report on the Fujitsu Fugaku System,” Innovative Computing Laboratory Technical Report, no. ICL-UT-20-06: University of Tennessee, June 2020.

(3.3 MB)

Dongarra, J., T. Herault, and Y. Robert, “Fault Tolerance Techniques for High-performance Computing,” University of Tennessee Computer Science Technical Report (also LAWN 289), no. UT-EECS-15-734: University of Tennessee, May 2015.

Dongarra, J., T. Herault, and Y. Robert, “Revisiting the Double Checkpointing Algorithm,” 15th Workshop on Advances in Parallel and Distributed Computational Models, at the IEEE International Parallel & Distributed Processing Symposium, Boston, MA, May 2013.

(591.1 KB)