|
2020/11/06 |
Heike Jagode |
ICL |
Dataflow Execution for Computational Chemistry Methods |
|
|
2020/10/30 |
Piotr Luszczek |
ICL |
Scalable Data Generation for Evaluating Mixed-Precision Solvers |
|
|
2020/10/23 |
Sunita Chandrasekaran |
University of Delaware |
Developing Software for Today's and Tomorrow's platform - Fun or a Nightmare! |
|
|
2020/10/16 |
Florent Lopez |
ICL |
Running SLATE using the PaRSEC Runtime System |
Lopez-Running-SLATE-using-the-PaRSEC-Runtime-System-10-16-2020.pdf |
|
2020/10/09 |
Thomas Herault |
ICL |
|
|
|
2020/10/02 |
Micah Beck |
EECS |
One Device To Rule Them All |
|
|
2020/09/25 |
Anthony Skjellum |
UT Chattanooga SimCenter |
|
|
|
2020/09/11 |
Richard Barrett |
Sandia National Laboratories |
Chapel Performance for Some Graph Analytics |
Barrett-Chapel-Performance-for-Some-Graph-Analytics-09-11-2020.pdf |
|
2020/09/04 |
Sudip Seal |
Oak Ridge National Laboratory |
Revisiting Parallel Algorithms for Block Tridiagonal Systems |
Seal-Revisiting-Parallel-Algorithms-for-Block-Tridiagonal-Systems-09-04-2020.pdf |
|
2020/08/28 |
Clark Hathaway and Sebastian Mobo |
Global Computing Laboratory |
A Framework for Linking Urban Traffic and Vehicle Emissions in Smart Cities |
Hathaway-Mobo-A-Framework-for-Linking-Urban-Traffic-and-Vehicle-Emissions-in-Smart-Cities-08-28-2020.pdf |
|
2020/08/21 |
Phil Mucci |
Minimal Metrics |
Automating Workflow Support with EPMT |
Mucci-Automating-Workflow-Support-with-EPMT-08-21-2020.pdf |
|
2020/08/07 |
Chad Steed |
Oak Ridge National Laboratory |
Interactive Visual Analysis of the TOP500 List |
Steed-Interactive-Visual-Analysis-of-the-TOP500-List-08-07-2020.pdf |
|
2020/07/31 |
Dalal Sukkari |
ICL |
Leveraging Task-Based Polar Decomposition Using SLATE on Massively Parallel Systems with Hardware Accelerators |
|
|
2020/07/24 |
Richard Archibald |
Oak Ridge National Laboratory |
Inverse Modeling for Experimental Science and Data Analytics |
Archibald-Inverse-Modeling-for-Experimental-Science-and-Data-Analytic-07-24-2020.pdf |
|
2020/07/17 |
Joan Snoderly |
ICL |
Getting Started with Concur: UT's Online Self-Service Travel Booking Tool |
Snoderly-CONCUR-07-17-2020.pdf |
|
2020/07/10 |
Marcus Ritter and Felix Wolf |
Technical University of Darmstadt |
Learning Cost-Effective Sampling Strategies for Empirical Performance Modeling |
Ritter_Wolf-Learning-Cost-Effective-Sampling-Strategies-for-Empirical-Performance-Modeling-07-10-2020.pdf |
|
2020/06/26 |
Tony Castaldo |
ICL |
Controlling Power on NVIDIA and AMD GPUs |
|
|
2020/06/19 |
Michael Wyatt |
Global Computing Laboratory |
AI4IO: A Suite of AI-Based Tools for IO-Aware HPC Resource Management |
|
|
2020/06/12 |
Hartwig Anzt |
ICL |
Porting Linear Algebra Libraries to the AMD Ecosystem |
Anzt-Porting-Linear-Algebra-Libraries-to-the-AMD-Ecosystem-06-12-2020.pdf |
|
2020/06/05 |
Frank Winkler |
ICL |
PIKA: Center-Wide and Job-Aware Cluster Monitoring |
Winkler-PIKA-Job-Monitoring-06-05-2020.pdf |
|
2020/05/29 |
Azzam Haidar |
NVIDIA |
How CUDA Math Libraries Can Help You Unleash the Power of the New NVIDIA A100 GPU |
|
|
2020/05/22 |
Aurelien Bouteiller |
ICL |
Here's What's Fresh with MPI-4 |
Bouteiller-Whats-Fresh-with-MPI-4-05-22-2020.pdf |
|
2020/05/15 |
Jiali Li |
ICL |
Optimizing all-to-all Operation with Awareness of Network Topology |
Li-Optimizing-all-to-all-Operation-with-Awareness-of-Network-Topology-05-15-2020.pdf |
|
2020/05/08 |
Paula Olaya |
Global Computing Laboratory |
Building Containerized Environments for Reproducibility and Traceability of Scientific Workflows |
Olaya-Building-Containerized-Environments-for-Reproducibility-and-Traceability-of-Scientific-Workflows-05-08-2020.pdf |
|
2020/05/01 |
Stanimire Tomov |
ICL |
heFFTe: FFT-ECP API and High-Performance Library Prototype for Multidimensional FFTs on Large-Scale Heterogeneous Systems |
Tomov-FFT-ECP-05-01-2020.pdf |
|
2020/04/24 |
Dong Zhong |
ICL |
Using Arm Scalable Vector Extension to Optimize Open MPI |
Zhong-Using-ARM-Scalable-Vector-Extension-to-Optimize-Open-MPI-04-24-2020.pdf |
|
2020/04/17 |
Mark Gates |
ICL |
Test Matrix Generation |
Gates-Test-Matrix-Generation-04-17-2020.pdf |
|
2020/04/03 |
Yu Pei |
ICL |
Communication Avoiding 2D Stencil Implementations over PaRSEC Task-Based Runtime |
Pei-CA-Stencil-PaRSEC-04-03-2020.pdf |
|
2020/03/13 |
Ahmad Abdelfattah |
ICL |
Recent Developments for Mixed Precision Solvers in MAGMA |
|
|
2020/03/06 |
Chris Gropp |
Global Computing Laboratory |
Understanding Machine Learning Systems using Adversarial Approaches |
|
|
2020/02/28 |
Mohammed Al Farhan |
ICL |
Optimizing the Cholesky factorization in SLATE |
AlFarhan-SLATE-02-28-2020.pdf |
|
2020/02/21 |
Daniel Schultz |
ICL |
The Effects of MPI Oversubscription & Multi-Process Service on Distributed-memory FFT Libraries |
Schultz-The-Effects-of-MPI-Oversubscription-&-Multi-Process-Service-on-Distributed-memory-FFT-Libraries-02-21-2020.pdf |
|
2020/01/31 |
Neil Lindquist |
ICL |
Improving the Performance of GMRES with Mixed Precision |
Lindquist-Improving-the-Performance-of-GMRES-with-Mixed-Precision-01-31-2020.pdf |
|
2020/01/29 |
Joan Snoderly |
ICL |
ICL Performance Evaluation Process |
Snoderly-ICL-Performance-Evaluation-Process-01-29-2020.pdf |
|
2020/01/24 |
Jeff Larkin |
NVIDIA |
To OpenACC 3.0 and Beyond! |
Larkin-Open-ACC3-01-24-2020.pdf |
|
2020/01/17 |
George Bosilca |
ICL |
Towards Portable Online Prediction of Network Utilization using MPI-level Monitoring |
Bosilca-Towards-Portable-Online-Prediction-of-Network-Utilization-using-MPI-level-Monitoring-01-17-2020.pdf |
|
2020/01/10 |
Hidehiko Hasegawa |
University of Tsukuba |
Predicting the Convergence of BiCG Method from Grayscale Matrix Images |
Hasegawa-Predicting-the-Convergence-of-BiCG-Method-from-Grayscale-Matrix-Images-01-10-2020.pdf |
|
2019/12/13 |
Piotr Luszczek |
ICL |
Building Community through xSDK Software Policies |
Luszczek-xSDK-Community-Policies-12-13-2019.pdf |
|
2019/12/06 |
Damien Genet |
ICL |
PAPI Updates on Power9 |
Genet-PAPI-Updates-on-Power9-12-06-2019.pdf |
|
2019/11/15 |
Anthony Danalis |
ICL |
Questions about hardware events? CAT has the answers. |
|
|
2019/11/08 |
Sticks Mabakane |
ICL |
Effective Callgraph Visualisations for Optimisation of Parallel-Programs |
Mabakane-Effective-Callgraph-Visualisations-for-Optimisation-of-Parallel-Programs_a-Design-Study.pdf |
|
2019/11/01 |
David Eberius |
ICL |
A Flexible MPI Benchmark For Fast Assessment of Multithreaded Communication Performance |
Eberius-Multirate-A-Flexible-MPI-Benchmark-For-Fast-Assessment-of-Multithreaded-Communication-Performance-11-01-2019.pdf |
|
2019/10/25 |
Yaohung Tsai |
ICL |
Autotuning in Deep Learning Kernels |
Tsai-Autotunning-Deep-Learning-10-25-2019.pdf |
|
2019/10/18 |
Alan Ayala |
ICL |
heFFTe: Highly Efficient FFT for Exascale |
Ayala-heFFTe-10-18-2019.pdf |
|
2019/10/11 |
Axel Huebl |
Lawrence Berkeley National Laboratory |
Scalable, Performance-Portable Particle-in-Cell Simulations and PByte-Scale Data-Challenges |
|
|
2019/10/04 |
Yves Robert |
ENS-Lyon |
Scheduling Independent Stochastic Tasks on Heterogeneous Cloud Platforms |
Robert-Scheduling-Independent-Stochastic-Tasks-on-Heterogeneous-Cloud-Platforms-10-04-2019.pdf |
|
2019/09/27 |
Srinivas Aluru |
Georgia Tech |
Parallel Machine Learning Approaches for Reverse Engineering Genome-Scale Networks |
Aluru-Parallel-Machine-Learning-Approaches-for-Reverse-Engineering-Genome-Scale-Networks-09-27-2019.pdf |
|
2019/09/20 |
Oscar Hernandez |
Oak Ridge National Laboratory |
Filling in the Gaps between Applications and the OpenMP Specification for Exascale |
|
|
2019/09/13 |
Nuria Losada |
ICL |
Asynchronous Receiver-Driven Replay for Local Rollback of MPI Applications |
Losada-Asynchronous-Receiver-Driven-Replay-for-Local-Rollback-of-MPI-Applications-09-13-2019.pdf |
|
2019/09/06 |
Asim YarKhan |
ICL |
Linear Systems Solvers for Distributed-Memory Machines with GPU Accelerators |
YarKhan-SLATE-Linear-Systems-09-06-2019.pdf |