|
2022/01/14 |
Mark Gates |
ICL |
Parallel divide & conquer eigenvector computation in SLATE |
mark-gates-parallel-divid-2022-01-14.pdf |
|
2022/01/07 |
Qinglei Cao |
ICL |
Dense, Mixed-Precision and Tile Low-Rank GEMM and Cholesky on Fugaku Using PaRSEC |
qinglei-cao-dense-mixed-pr-2022-01-07.pdf |
|
2021/12/10 |
Daniel Mishler |
|
HHL: a Quantum Algorithm for Exponential Speedup on Systems of Linear Equations |
daniel-mishler-2021-12-10.pdf |
|
2021/12/03 |
Anthony Danalis |
ICL |
SDE library internals, a.k.a. watching the making of sausage |
|
|
2021/11/12 |
Mohsen Mahmoudi-Aznaveh |
Texas A&M |
Paru: Parallel Unsymmetric Multifrontal sparse LU factorization |
mohsen-mahmoudi-aznaveh-paru-parallel-2021-11-12.pdf |
|
2021/11/05 |
Grzegorz Kwasniewski |
ETH Zurich |
From graph pebbling to I/O optimal and high-performance code |
grzegorz-kwasniewski-from-graph-pebb-2021-11-05.pdf |
|
2021/10/29 |
Rabab Al-Omairy |
ICL |
Communication Avoiding LU with Tournament Pivoting in SLATE |
|
|
2021/10/22 |
Tony Castaldo |
ICL |
Parallel Spectrum Slicing for Selected Eigenpairs with Tall-Skinny QR Orthogonalization of Eigenvectors |
|
|
2021/10/15 |
Hartwig Anzt |
ICL |
Batched Iterative Solvers for Sparse Linear Problems |
|
|
2021/10/08 |
Aurelien Bouteiller |
ICL |
|
|
|
2021/10/01 |
Daniel Bielich |
ICL |
Delayed Classical Gram-Schmidt with Reorthogonalization (DCGS2), in the context of QR and Arnoldi |
|
|
2021/09/24 |
Ed Valeev |
Virginia Tech |
Tensor-Centric View of Electrons in Molecules and Materials |
|
|
2021/09/17 |
Anouar Benali |
Argonne National Laboratory |
Towards predictive simulations of molecules and solids using quantum Monte Carlo methods |
|
|
2021/09/10 |
Stephen Herbein |
Lawrence Livermore National Laboratory |
HPC + Cloud Convergence at LLNL |
stephen-herbein-hpc-cloud-con-2021-09-10.pdf |
|
2021/09/03 |
Tu Mai Anh Do |
USC Information Sciences Institute |
Assessing Resource Provisioning and Allocation of Ensembles of In Situ Workflows |
tu-mai-anh-do-assessing-resou-2021-09-03.pdf |
|
2021/08/27 |
Scott Klasky |
Oak Ridge National Laboratory |
Data Reduction via the MultiGrid Adaptive Reduction of Data (MGARD) |
scott-klasky-data-reduction-2021-08-27.pdf |
|
2021/08/20 |
Thomas Herault |
ICL |
Templated Task Graph: a new task programming interface in C++ |
|
|
2021/08/06 |
Michele Benzi |
Scuola Normale Superiore in Pisa |
Walk-based measures of centrality, communicability, and robustness in networks |
|
|
2021/07/30 |
Ahmad Abdelfattah |
ICL |
The underlying complexity of optimizing batch routines |
|
|
2021/07/23 |
Jamie Coble |
UTK's Department of Nuclear Engineering |
Data-Driven Decision Making for Improved Economics of Nuclear Power |
|
|
2021/07/16 |
Kate Keahey |
Argonne National Laboratory |
Chameleon: An Innovation Platform for Repeatable Computer Science Research |
|
|
2021/07/09 |
Piotr Luszczek |
ICL |
Towards linear algebra in the standard C++ library |
|
|
2021/07/02 |
Stanimire Tomov |
ICL |
MAGMA: Evolution and Revolution |
|
|
2021/06/25 |
Rabab Al-Omairy |
ICL |
|
|
|
2021/06/18 |
George Bosilca |
ICL |
50 Shades of Cholesky |
Bosilca-50-shades-of-Cholesky.pdf |
|
2021/06/11 |
Hartwig Anzt |
KIT |
Porting Ginkgo to Intel GPUs using DPC++ |
|
|
2021/06/04 |
Jack Dongarra |
ICL |
|
|
|
2021/05/28 |
Fan Zhang |
UTK's Nuclear Engineering Department |
Enhancing the Cybersecurity of Nuclear Facilities through Data Aggregation |
|
|
2021/05/21 |
Dr. Katie Cahill |
Howard H. Baker Jr. Center for Public Policy |
Returning with Care: COVID-19 and Campus Reopening |
|
|
2021/05/14 |
Pedro Valero Lara |
Oak Ridge National Laboratory |
Tasking for Linear Algebra Kernels |
|
|
2021/05/07 |
Wissam Sid Lakhdar |
ICL |
Multitask and Transfer Learning for Autotuning Exascale Applications |
|
|
2021/04/30 |
Natalie Beams |
ICL |
ROCm Road: Milestones, Mishaps, and Mysteries from a Year of Preparing libCEED for an AMD GPU Future |
|
|
2021/04/23 |
Joseph Schuchart |
ICL |
Callback-Based Completion Notification Using MPI Continuations |
|
|
2021/04/09 |
Yu Pei |
ICL |
|
|
|
2021/03/26 |
Dong Zhong |
ICL |
Toward Reliable and Efficient Message Passing Software for HPC Systems: Fault Tolerance and Vector Extension |
|
|
2021/03/19 |
Cade Brown |
ICL |
The Wild World of Derivatives, Gamma Squeezes, and (of course) GameStop |
|
|
2021/03/12 |
Sajal Dash |
Oak Ridge National Laboratory |
Scaling Out a Combinatorial Algorithm for Discovering Carcinogenic Gene Combinations to Thousands of GPUs |
|
|
2021/03/05 |
Azzam Haidar |
NVIDIA |
Tensor Core Accelerated Iterative Refinement Solvers and Their Impact on Scientific Computing |
Haidar-Tensor-Core-Accelerated-Iterative-Refinement-Solvers-and-Its-Impact-on-Scientific-Computing-03-05-2021.pdf |
|
2021/02/26 |
Terry Moore |
ICL |
Some Reflections About the Life of ICL in the 21st Century |
Moore-Some-Reflections-on-the-Life-of-ICL-in-the-21st-Century-02-26-2021.pdf |
|
2021/02/19 |
Sebastien Cayrols |
ICL |
Design and Optimization of MPI_Alltoall for Mixed-Precision Algorithms |
|
|
2021/02/12 |
Qinglei Cao |
ICL |
Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach with PaRSEC |
|
|
2021/02/05 |
Daniel Barry |
ICL |
The Linear Algebra of Native Hardware Event Identification |
Barry-The-Linear-Algebra-of-Native-Hardware-Event-Identification-02-05-2021.pdf |
|
2021/01/29 |
Alan Ayala |
ICL |
Tuning FFTs for Exascale |
|
|
2021/01/22 |
Seetharami Seelam |
IBM |
Future of HPC on Cloud: A Researcher Perspective |
|
|
2021/01/15 |
Yves Robert |
ENS-Lyon |
Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms |
|
|
2021/01/08 |
Maksim Melnichenko |
ICL |
Randomized Algorithms for the Low Rank Matrix Approximation |
|
|
2020/12/18 |
Ian Lumsden |
Global Computing Laboratory |
|
|
|
2020/12/11 |
Nigel Tan |
Global Computing Laboratory |
Optimizing Vector Particle-In-Cell (VPIC) for Memory Constrained Systems Using Half-Precision |
Nigel-Tan-Optimizing-Vector-Particle-in-Cell-for-Memory-Constrained-Systems-Using-Half-Precision-12-11-2020.pdf |
|
2020/12/04 |
Anthony Danalis |
ICL |
|
|
|
2020/11/13 |
Asim YarKhan |
ICL |
Profiling and Performance Improvements in SLATE: Experience with the Cholesky Factorization |
|