|
2015/08/07 |
Ian Masliah |
University of Paris-Sud |
Towards C++ and Beyond |
Masliah-Towards-C++-and-Beyond-08-07-2015.pdf |
|
2015/07/31 |
Joseph Schuchart |
TU Dresden |
HPC energy-efficiency research at ZIH, Or: What the HAEC is HDEEM? |
Schuchart-Energy-Efficiency-Research-at-ZIH-07-31-2015.pdf |
|
2015/07/17 |
Sangamesh Ragate |
ICL |
PC Sampling in GPU |
Ragate-PC-Sampling-in-GPU-2015-17-07.pdf |
|
2015/07/01 |
Ed Valeev |
Virginia Tech |
Tensor Computation for Chemistry Sparsity and More |
Valeev-Tensor-Computation-for-Chemistry-Sparsity-and-More-2015-07-01.pdf |
|
2015/07/01 |
Torsten Hoefler |
ETH Zürich |
Towards Fully Automated Interpretable Performance Models |
Hoefler-Towards-Fully-Automated-Interpretable-Performance-Models-2015-07-01.pdf |
|
2015/06/26 |
Reazul Hoque |
ICL |
Dynamic Task Discovery in PaRSEC |
Hoque-Dynamic-Task-Discovery-in-PaRSEC-2015-06-26.pdf |
|
2015/06/12 |
Damien Genet |
ICL |
Design of Generic Modular Solutions for PDE Solvers for Modern Architectures |
Ganet-Design-of-Generic-Modular-Solutions-for-PDE-Solvers-for-Modern-Architectures-2015-06-12.pdf |
|
2015/06/05 |
Nageswara Rao |
ORNL |
Fault Diagnosis of Hybrid CPU-GPU Computing Systems Using Chaotic Maps |
Rao-Chaotic-Map-Method-for-Detection-and-Diagnosis-of-CPU-GPU-Hybrid-Computing-Systems-2015-06-05.pdf |
|
2015/05/29 |
Chad Steed |
ORNL |
Extreme Scale Visual Data Science |
Steed-Visual-Data-Science-2015-05-29.pdf |
|
2015/05/15 |
Eduardo Ponce |
EECS |
IDR(s)-Biortho: A Case Study of MAGMA Sparse Iterative Solvers |
Ponce-IDR-Solver-for-MAGMA Sparse-Iter-Package-2015-05-15.pdf |
|
2015/05/08 |
Chunyan Tang |
ICL |
From MPI to OpenSHMEM: Porting LAMMPS |
Tang-From-MPI-to-openSHMEM-porting-LAMMPS-2015-05-08.pdf |
|
2015/05/01 |
Wei Wu |
ICL |
Hierarchical DAG Scheduling for Hybrid Distributed Systems |
Wu-Hierarchical-DAG-scheduling-for-Hybrid-Distributed-Systems-2015-05-01.pdf |
|
2015/04/24 |
Manish Parashar |
Rutgers |
Big Data Challenges in Simulation-based Science |
Parashar-Big-Data-Challenges-in-Simulation-based-Science-2015-04-15.pdf |
|
2015/04/17 |
Ahmad Ahmad |
ICL |
GPU Accelerated Memory-bound Linear Algebra Kernels |
Amhad-GPU-Accelerated-Memory-bound-Linear-Algebra-Kernels-2015-04-17.pdf |
|
2015/04/10 |
Tingxing Dong |
ICL |
Batched One-sided Factorizations on Hardware Accelerators Based on GPUs |
Dong-Batched-One-sided-Factorizations-on-Hardware-Accelerators-Based-on-GPUs.pdf |
|
2015/03/27 |
Yves Robert |
INRIA |
Voltage Overscaling Algorithms for Energy-Efficient Workflow Computations With Timing Errors |
Robert-Voltage-Overscaling-Algorithms-for-Energy-Efficient-Workflow-Computations-With-Timing-Errors-2015-03-27.pdf |
|
2015/03/20 |
Anthony Danalis |
ICL |
Using PaRSEC to Develop Non-static Applications |
|
|
2015/03/13 |
Audris Mockus |
EECS |
Evidence Engineering |
Mockus-Evidence-Engineering-2015-03-13.pdf |
|
2015/03/06 |
Azzam Haidar |
ICL |
Performance Bounds in Symmetric Eigenvector Calculations |
Haidar-PLASMA-MAGMA-PARSEC-Performance-Bounds-in-Symmetric-Eigensolver-2015-03-06.pdf |
|
2015/02/27 |
Piotr Luszczek |
ICL |
Deep Neural Networks for Image Classification – A Primer |
Luszczek-Deep-Neural-Net-Primer-2015-02-25.pdf |
|
2015/02/13 |
Yves Robert |
ICL |
Scheduling Computational Workflows on Failure-prone Platforms |
Robert-Scheduling-Computational-Workflows-on-Failure-prone-Platforms-2015-02-13.pdf |
|
2015/02/06 |
Amina Guermouche |
ICL |
FoREST-mn: Runtime DVFS Beyond Communication Slack |
Guermouche-FoREST-mn-Runtime-DVFS-Beyond-Communication-Slack-2015-02-06.pdf |
|
2015/01/23 |
George Bosilca |
ICL |
Building Blocks for Resilient Applications |
Bosilca-Building-Blocks-for-Resilient-Applications-2015-01-23.pdf |
|
2015/01/16 |
Emmanuel Jeannot |
INRIA |
Topology Aware Data Management |
Jeannot-Topology-Aware-Data-Management-2015-01-16.pdf |
|
2015/01/08 |
Tony Hey |
|
The Fourth Paradigm: Data-Intensive Scientific Discovery, Open Science and the Cloud |
Hey-The-Fourth-Paradigm-Data-Intensive-Scientific-Discovery-Open-Science-and-the-Cloud-2015-01-08.pdf |
|
2014/12/12 |
Ichitaro Yamazaki |
ICL |
Mixed-precision orthogonalization scheme and its case-studies with GPUs |
|
|
2014/12/05 |
Asim YarKhan |
ICL |
Latest Developments in the PAPI Performance Monitoring Library |
YarKhan-PAPI-Performance-Application-Programming-Interface-2014-12-05.pdf |
|
2014/11/14 |
Chongxiao Cao |
ICL |
Design for a Soft Error Resilient
Dynamic Task-based Runtime |
Cao-Design-for-a-Soft-Error-Resilient-Dynamic-Task-based-Runtime-2014-11-14.pdf |
|
2014/11/07 |
Adrien Remy |
LRI |
Using Random Butterfly Transformation to Solve Dense Linear Systems Using Accelerators |
Remy-Using-Random-Butterfly-Transformation-to-Solve-Dense-Linear-Systems-Using-Accelerators-2014-11-07.pdf |
|
2014/10/31 |
Aurelien Bouteiller |
ICL |
UCCS: A Communication Substrate for Open SHMEM (and more) |
Bouteiller-UCCS-A-Communication-Substrate-for-Open-SHMEM-2014-10-31.pdf |
|
2014/10/24 |
Yves Robert |
ICL |
Assessing general-purpose algorithms to cope with fail-stop and silent errors |
Robert-Algorithms-for-coping-with-silent-errors-2014-10-24.pdf |
|
2014/10/17 |
Florent Lopez |
ENSEEIHT |
Sparse direct solvers on top of runtime systems |
Lopez-Sparse-direct-solvers-on-top-of-runtime-systems-2014-10-17.pdf |
|
2014/10/10 |
Alfredo Buttari |
ENSEEIHT |
Improving multifrontal solvers by means of Block Low-Rank approximations |
Buttari-Improving-multifrontal-solvers-by-means-of-Block-Low-Rank-approximations-2014-10-10.pdf |
|
2014/10/03 |
Hartwig Anzt |
ICL |
Asynchronous Iterative Algorithm for Computing Incomplete Factorizations on GPUs |
Anzt-Asynchronous-Iterative-Algorithm-for-Computing-Incomplete-Factorizations-on-GPUs-2014-10-03.pdf |
|
2014/09/26 |
Azzam Haidar |
ICL |
Towards Batched Linear Solvers on Accelerated Hardware Platforms |
Haidar-Towards-Batched-Linear-Solvers-on-Accelerated-Hardware-Platforms-2014-09-26.pdf |
|
2014/09/19 |
Simplice Donfack |
ICL |
Improve the applicability of highly efficient stencil compilers to a wider class of problems |
Donfack-Improve-the-applicability-of-highly-efficient-stencil-compilers-2014-09-19.pdf |
|
2014/09/12 |
George Ostrouchov |
ORNL |
Taking R to Big Platforms and Supercomputers with pbdR |
|
|
2014/09/05 |
Theo Mary |
INP-ENSEEIHT |
Performance Study of a Randomized Low-rank Approximation using multi-GPU |
Mary-Randomized-Low-rank-Approximation-using-multi-GPU-2014-09-05.pdf |
|
2014/08/29 |
Gregoire Pichon |
INRIA |
Divide and Conquer: a symmetric tridiagonal eigensolver in PLASMA |
Pichon-Divide-and-Conquer-a-symmetric-tridiagonal-eigensolver-in-PLASMA-2014-08-29.pdf |
|
2014/08/22 |
Tracy Rafferty |
ICL |
Conference travel |
Rafferty-Conference-Travel-2014-08-22.pdf |
|
2014/07/11 |
George Bosilca |
ICL |
Combining Recent HPC Techniques for 3D Geophysics Acceleration |
|
|
2014/06/27 |
Ryan Glasby |
JICS |
Comparison of SU/PG and DG Finite-Element Techniques for the Compressible Navier-Stokes Equations on Anisotropic Unstructured Meshes |
Glasby-Comparison-of-SUPG-and-DG-Finite-Element-Techniques-2014-06-27.pdf |
|
2014/06/20 |
Yves Robert |
ICL |
Algorithms for coping with silent errors |
Robert-Algorithms-for-coping-with-silent-errors-2014-06-20.pdf |
|
2014/06/13 |
Tingxing Dong |
ICL |
A Step towards Energy Efficient Computing: Redesigning A Hydrodynamic Application on CPU-GPU |
Dong-A-Step-towards-Energy-Efficient-Computing-2014-06-14.pdf |
|
2014/06/06 |
Kris Garrett |
ORNL |
A Nonlinear QR Algorithm for Banded Nonlinear Eigenvalue Problems |
Garrett-Nonlinear-QR-Algorithm-for-Banded-Nonlinear-Eigenvalue-Problems-2014-06-06.pdf |
|
2014/05/30 |
Grigori Fursin |
INRIA |
Collective Mind: community-driven systematization
and automation of program optimization |
Fursin-Collective-Mind-program-optimization-2014-05-30.pdf |
|
2014/05/16 |
Azzam Haidar |
ICL |
MAGMA: LU Factorization for Small Matrices |
haidar_may17_2014.pdf |
|
2014/05/14 |
Thomas Herault |
ICL |
DPLASMA/PaRSEC |
|
|
2014/05/09 |
Hartwig Anzt |
ICL |
Hybrid Multi-Elimination ILU Preconditioners on GPUs |
Anzt-Hybrid-Multi-Elimination-ILU-Preconditioners-on-GPUs-2014-05-09.pdf |
|
2014/05/02 |
Ichitaro Yamazaki |
ICL |
Performance of s-step GMRES to avoid communication on/between GPUs |
Yamazaki-Performance-of-s-step-GMRES-to-avoid-communication-on-GPUs-2014-05-02.pdf |