2015/04/24 |
Manish Parashar |
Rutgers |
Big Data Challenges in Simulation-based Science |
Parashar-Big-Data-Challenges-in-Simulation-based-Science-2015-04-15.pdf |
2015/04/17 |
Ahmad Ahmad |
ICL |
GPU Accelerated Memory-bound Linear Algebra Kernels |
Amhad-GPU-Accelerated-Memory-bound-Linear-Algebra-Kernels-2015-04-17.pdf |
2015/04/10 |
Tingxing Dong |
ICL |
Batched One-sided Factorizations on Hardware Accelerators Based on GPUs |
Dong-Batched-One-sided-Factorizations-on-Hardware-Accelerators-Based-on-GPUs.pdf |
2015/03/27 |
Yves Robert |
INRIA |
Voltage Overscaling Algorithms for Energy-Efficient Workflow Computations With Timing Errors |
Robert-Voltage-Overscaling-Algorithms-for-Energy-Efficient-Workflow-Computations-With-Timing-Errors-2015-03-27.pdf |
2015/03/20 |
Anthony Danalis |
ICL |
Using PaRSEC to Develop Non-static Applications |
|
2015/03/13 |
Audris Mockus |
EECS |
Evidence Engineering |
Mockus-Evidence-Engineering-2015-03-13.pdf |
2015/03/06 |
Azzam Haidar |
ICL |
Performance Bounds in Symmetric Eigenvector Calculations |
Haidar-PLASMA-MAGMA-PARSEC-Performance-Bounds-in-Symmetric-Eigensolver-2015-03-06.pdf |
2015/02/27 |
Piotr Luszczek |
ICL |
Deep Neural Networks for Image Classification – A Primer |
Luszczek-Deep-Neural-Net-Primer-2015-02-25.pdf |
2015/02/13 |
Yves Robert |
ICL |
Scheduling Computational Workflows on Failure-prone Platforms |
Robert-Scheduling-Computational-Workflows-on-Failure-prone-Platforms-2015-02-13.pdf |
2015/02/06 |
Amina Guermouche |
ICL |
FoREST-mn: Runtime DVFS Beyond Communication Slack |
Guermouche-FoREST-mn-Runtime-DVFS-Beyond-Communication-Slack-2015-02-06.pdf |
2015/01/23 |
George Bosilca |
ICL |
Building Blocks for Resilient Applications |
Bosilca-Building-Blocks-for-Resilient-Applications-2015-01-23.pdf |
2015/01/16 |
Emmanuel Jeannot |
INRIA |
Topology Aware Data Management |
Jeannot-Topology-Aware-Data-Management-2015-01-16.pdf |
2015/01/08 |
Tony Hey |
|
The Fourth Paradigm: Data-Intensive Scientific Discovery, Open Science and the Cloud |
Hey-The-Fourth-Paradigm-Data-Intensive-Scientific-Discovery-Open-Science-and-the-Cloud-2015-01-08.pdf |
2014/12/12 |
Ichitaro Yamazaki |
ICL |
Mixed-precision orthogonalization scheme and its case-studies with GPUs |
|
2014/12/05 |
Asim YarKhan |
ICL |
Latest Developments in the PAPI Performance Monitoring Library |
YarKhan-PAPI-Performance-Application-Programming-Interface-2014-12-05.pdf |
2014/11/14 |
Chongxiao Cao |
ICL |
Design for a Soft Error Resilient
Dynamic Task-based Runtime |
Cao-Design-for-a-Soft-Error-Resilient-Dynamic-Task-based-Runtime-2014-11-14.pdf |
2014/11/07 |
Adrien Remy |
LRI |
Using Random Butterfly Transformation to Solve Dense Linear Systems Using Accelerators |
Remy-Using-Random-Butterfly-Transformation-to-Solve-Dense-Linear-Systems-Using-Accelerators-2014-11-07.pdf |
2014/10/31 |
Aurelien Bouteiller |
ICL |
UCCS: A Communication Substrate for Open SHMEM (and more) |
Bouteiller-UCCS-A-Communication-Substrate-for-Open-SHMEM-2014-10-31.pdf |
2014/10/24 |
Yves Robert |
ICL |
Assessing general-purpose algorithms to cope with fail-stop and silent errors |
Robert-Algorithms-for-coping-with-silent-errors-2014-10-24.pdf |
2014/10/17 |
Florent Lopez |
ENSEEIHT |
Sparse direct solvers on top of runtime systems |
Lopez-Sparse-direct-solvers-on-top-of-runtime-systems-2014-10-17.pdf |
2014/10/10 |
Alfredo Buttari |
ENSEEIHT |
Improving multifrontal solvers by means of Block Low-Rank approximations |
Buttari-Improving-multifrontal-solvers-by-means-of-Block-Low-Rank-approximations-2014-10-10.pdf |
2014/10/03 |
Hartwig Anzt |
ICL |
Asynchronous Iterative Algorithm for Computing Incomplete Factorizations on GPUs |
Anzt-Asynchronous-Iterative-Algorithm-for-Computing-Incomplete-Factorizations-on-GPUs-2014-10-03.pdf |
2014/09/26 |
Azzam Haidar |
ICL |
Towards Batched Linear Solvers on Accelerated Hardware Platforms |
Haidar-Towards-Batched-Linear-Solvers-on-Accelerated-Hardware-Platforms-2014-09-26.pdf |
2014/09/19 |
Simplice Donfack |
ICL |
Improve the applicability of highly efficient stencil compilers to a wider class of problems |
Donfack-Improve-the-applicability-of-highly-efficient-stencil-compilers-2014-09-19.pdf |
2014/09/12 |
George Ostrouchov |
ORNL |
Taking R to Big Platforms and Supercomputers with pbdR |
|
2014/09/05 |
Theo Mary |
INP-ENSEEIHT |
Performance Study of a Randomized Low-rank Approximation using multi-GPU |
Mary-Randomized-Low-rank-Approximation-using-multi-GPU-2014-09-05.pdf |
2014/08/29 |
Gregoire Pichon |
INRIA |
Divide and Conquer: a symmetric tridiagonal eigensolver in PLASMA |
Pichon-Divide-and-Conquer-a-symmetric-tridiagonal-eigensolver-in-PLASMA-2014-08-29.pdf |
2014/08/22 |
Tracy Rafferty |
ICL |
Conference travel |
Rafferty-Conference-Travel-2014-08-22.pdf |
2014/07/11 |
George Bosilca |
ICL |
Combining Recent HPC Techniques for 3D Geophysics Acceleration |
|
2014/06/27 |
Ryan Glasby |
JICS |
Comparison of SU/PG and DG Finite-Element Techniques for the Compressible Navier-Stokes Equations on Anisotropic Unstructured Meshes |
Glasby-Comparison-of-SUPG-and-DG-Finite-Element-Techniques-2014-06-27.pdf |
2014/06/20 |
Yves Robert |
ICL |
Algorithms for coping with silent errors |
Robert-Algorithms-for-coping-with-silent-errors-2014-06-20.pdf |
2014/06/13 |
Tingxing Dong |
ICL |
A Step towards Energy Efficient Computing: Redesigning A Hydrodynamic Application on CPU-GPU |
Dong-A-Step-towards-Energy-Efficient-Computing-2014-06-14.pdf |
2014/06/06 |
Kris Garrett |
ORNL |
A Nonlinear QR Algorithm for Banded Nonlinear Eigenvalue Problems |
Garrett-Nonlinear-QR-Algorithm-for-Banded-Nonlinear-Eigenvalue-Problems-2014-06-06.pdf |
2014/05/30 |
Grigori Fursin |
INRIA |
Collective Mind: community-driven systematization
and automation of program optimization |
Fursin-Collective-Mind-program-optimization-2014-05-30.pdf |
2014/05/16 |
Azzam Haidar |
ICL |
MAGMA: LU Factorization for Small Matrices |
haidar_may17_2014.pdf |
2014/05/14 |
Thomas Herault |
ICL |
DPLASMA/PaRSEC |
|
2014/05/09 |
Hartwig Anzt |
ICL |
Hybrid Multi-Elimination ILU Preconditioners on GPUs |
Anzt-Hybrid-Multi-Elimination-ILU-Preconditioners-on-GPUs-2014-05-09.pdf |
2014/05/02 |
Ichitaro Yamazaki |
ICL |
Performance of s-step GMRES to avoid communication on/between GPUs |
Yamazaki-Performance-of-s-step-GMRES-to-avoid-communication-on-GPUs-2014-05-02.pdf |
2014/04/25 |
George Bosilca |
ICL |
Toward composite fault management strategies: a quantitative evaluation |
Bosilca-Assessing-the-Impact-of-ABFT-Checkpoint-Composite-Strategies-2014-04-25.pdf |
2014/04/10 |
Dorian Arnold |
UNM |
A Simulation-based Framework for Evaluating Resilience Strategies at Scale |
|
2014/04/04 |
Jakub Kurzak |
ICL |
Some Techniques for Optimizing CUDA More |
Kurzak-Some-Techniques-for-Optimizing-CUDA-More-2014-04-04.pdf |
2014/03/28 |
Hartwig Anzt |
ICL |
Optimizing Krylov Subspace Solvers on Graphics Processing Units |
Anzt-Optimizing-Krylov-Subspace-Solvers-on-GPUs-2014-03-28.pdf |
2014/03/21 |
Mark Gates |
ICL |
Accelerating eigenvector computation |
Gates-Accelerating-Computation-of-Eigenvectors-2014-03-21.pdf |
2014/03/13 |
Atsushi Hori |
RIKEN |
A New Process/Thread Model for Many-core Era |
|
2014/03/07 |
Mathieu Faverge |
INRIA |
Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes |
Faverge-Sparse-Linear-Algebra-over-DAG-Runtimes-2014-03-07.pdf |
2014/02/28 |
Samuel Thibault |
INRIA |
StarPU: Task Graphs from Heterogeneous Platforms to Clusters Thereof |
Thibault-StarPU-Task-Graphs-from-Heterogeneous-Platforms-to-Clusters-Thereof-2014-02-28.pdf |
2014/02/21 |
Simplice Donfack |
ICL |
Improving multicore capabilities in hybrid CPUs/GPUs applications (Case of MAGMA) |
Donfack-Improving-Multicore-Capabilities-in-Hybrid-CPUs-GPUs-Applications-2014-02-21.pdf |
2014/02/14 |
Yves Robert |
ICL |
Scheduling Data Sensor Retrieval for Boolean Tree Query Processing |
|
2014/02/07 |
Aurelien Bouteiller |
ICL |
Fault Tolerant MPI |
Bouteiller-Fault-Tolerant-MPI-2014-02-07.pdf |
2014/01/31 |
Thomas Herault |
ICL |
Assessing the Impact of ABFT and Checkpointing Composite Strategies |
Herault-ABFT-Periodic-Checkpointing-2014-01-31.pdf |