News and Announcements

ActaPACK Funded

ICL’s joint venture with Lawrence Livermore National Laboratory for Small Tensor Operations on Advanced Architectures for High-order Applications (ActaPACK) was recently funded by the US Department of Energy. ActaPACK, which is funded for 18 months beginning in September of 2015, will involve UTK/ICL’s MAGMA group studying the performance optimization of the tensor kernels used in BLAST on various modern architectures. Congratulations to the Linear Algebra Group!

Conference Reports

9th Parallel Tools Workshop

Heike Jagode made it back to her hometown of Dresden, Germany for the 9th Parallel Tools Workshop on September 2-3. The Parallel Tools Workshop, held annually, is a joint venture between HLRS Stuttgart and ZIH Dresden and brings together researchers and practitioners who develop, maintain, or use tools that aid in the development of parallel applications. For her part, Heike gave a presentation on “Recent Advances in PAPI” on the first day of the workshop.

PPAM 2015

This year’s International Conference on Parallel Processing and Applied Mathematics (PPAM 2015) was held in Krakow, Poland on September 6-9; it was the 11th conference in the biennial series. PPAM 2015 covered topics in parallel and distributed computing, including theory and applications, as well as applied mathematics with a focus on models, algorithms, and software tools that facilitate efficient and convenient utilization of modern parallel and distributed computing architectures.

Heike Jagode also attended PPAM to present ICL’s work on “Accelerating NWChem Coupled Cluster through Dataflow-based Execution.” ICL alumnus and frequent collaborator Marc Baboulin presented his paper “Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures,” of which Stan Tomov, Ichi Yamazaki, and Jack Dongarra are co-authors.

HPEC ’15

On the road again, ICL’s Piotr Luszczek went back to Boston to present a paper at the 2015 IEEE High Performance Extreme Computing Conference (HPEC ’15) on September 15-17. HPEC is the largest computing conference in New England and is the premier conference in the world on the convergence of High Performance and Embedded Computing.

The paper, “MAGMA Embedded: Towards a Dense Linear Algebra Library for Energy Efficient Extreme Computing,” earned Piotr and his co-authors (Azzam Haidar, Stan Tomov, and Jack Dongarra) a Best Paper Award. Congratulations!

Intel Big Data Retreat

ICL’s Thomas Herault and Piotr Luszczek were invited to the Intel Big Data Retreat in Hillsboro, OR on September 24-25. As the name suggests, this annual retreat was hosted by Intel and provided a venue for experts in Big Data and analytics to discuss the latest challenges, innovations, and solutions in big data applications.

Thomas presented a poster on PaRSEC – Runtime Task Engine for Numerical Computation. Not to be outdone, Piotr presented two posters; the first described Updating SVD Using Randomized Projection and the second outlined Random Sampling for Low Rank Approximation.

Recent Releases

SC15 Videos

ICL’s Linear Algebra Group and DisCo Group both put together videos (with help from ICL Graphics Guru David Rogers) to advertise their upcoming tutorials at SC15. Watch them below, and be sure to attend the actual tutorials at SC15 in Austin, TX.

“Linear Algebra Libraries for High-Performance Computing” – Narrated by Dawn Danalis

“Fault-Tolerance for HPC – Theory and Practice” – Narrated by George Bosilca

ULFM 1.0 Released

The major 1.0 milestone has been reached for the User Level Failure Mitigation (ULFM) compliant fault tolerant MPI. ULFM is a set of new interfaces for MPI that enables Message Passing programs to restore MPI functionality affected by process failures. The MPI implementation is spared the expense of internally taking protective and corrective actions against failures. Instead, it reports operations whose completions were rendered impossible by failures.

For the 1.0 release, the ULFM team focused on improving performance, both before and after the occurrence of failures. The list of new features includes:

  • Support for the non-blocking version of the agreement, MPI_COMM_IAGREE.
  • Compliance with the latest ULFM specification draft. In particular, the MPI_COMM_(I)AGREE semantic has changed.
  • New algorithm to perform agreements, with a truly logarithmic complexity in the number of ranks, which translates into huge performance boosts in MPI_COMM_(I)AGREE and MPI_COMM_SHRINK. Meet us at SC15 to learn more about the novel algorithm we designed!
  • New algorithm to perform communicator revocation. MPI_COMM_REVOKE performs a reliable broadcast with a fixed maximum output degree, which scales logarithmically with the number of ranks. Meet us at EuroMPI’15 to learn more about the Revoke algorithm we designed!
  • Improved support for our traditional network layer:
    • TCP: fully tested.
    • SM: fully tested (with the exception of XPMEM, which remains unsupported).
  • Added support for High Performance networks:
    • Open IB: reasonably tested.
    • uGNI: reasonably tested.
  • The tuned collective module is now enabled by default (reasonably tested); expect a huge performance boost compared to the former basic default setting.
  • Back-ported PBS/ALPS fixes from Open MPI.
  • Back-ported OpenIB bug/performance fixes from Open MPI.
  • Improved Context ID allocation algorithm to reduce the overhead of Shrink.
  • Miscellaneous bug fixes (look at the commit log for the full list).
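
As a rough illustration of why a broadcast with a fixed maximum output degree keeps the number of forwarding steps logarithmic in the number of ranks, here is a small Python sketch. It is not the ULFM Revoke algorithm and does not model failures or reliability; it only counts hops in an idealized tree in which every rank forwards to at most `max_degree` others. The function name `broadcast_rounds` and its parameters are hypothetical and exist only for this example.

```python
def broadcast_rounds(num_ranks: int, max_degree: int) -> int:
    """Count forwarding rounds needed to reach every rank from a single root.

    Idealized model (an assumption for illustration only): in each round,
    every rank reached in the previous round forwards the message to
    max_degree ranks that have not been reached yet.
    """
    if num_ranks < 1 or max_degree < 1:
        raise ValueError("num_ranks and max_degree must be positive")

    reached = 1    # the root already holds the message
    frontier = 1   # ranks that received the message in the latest round
    rounds = 0
    while reached < num_ranks:
        frontier *= max_degree   # each frontier rank forwards to max_degree others
        reached += frontier
        rounds += 1
    return rounds


# Doubling the fan-out roughly halves the depth; growing the job size
# by a factor of ~1000 adds only a handful of rounds.
print(broadcast_rounds(1024, 2))       # 10
print(broadcast_rounds(1024, 4))       # 5
print(broadcast_rounds(1_000_000, 2))  # 19
```

With a fan-out of 1 the same function degenerates to a chain (one round per additional rank), which is the linear behavior a fixed-degree tree avoids.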

Fault tolerance support for RMA and IO is still under development.

Visit the ULFM software page to download the source. Happy hacking!

Interview

Phil Vaccaro

Where are you from, originally?

I am from Nashville, TN.

Can you summarize your educational background?

I earned my BS in Computer Science from UTK and I started my MS in Computer Science this fall.

Tell us how you first learned about ICL.

I first heard about ICL during my undergrad when various professors spoke about the work going on at ICL, and it captured my interest. When I found out that ICL was looking for research assistants, I jumped at the chance to join the group.

What made you want to work for ICL?

HPC plays a critical role in scientific advancement, so there is obviously a lot of intrigue in that respect. I also really wanted to be able to participate in leading-edge research while getting my MS, and I developed an interest in HPC over the course of my undergrad degree. ICL has such a large footprint in the HPC community that I had no doubt that the work going on here would be at the forefront of the field. I really saw this as an opportunity to push myself intellectually and expand my knowledge into a very exciting realm of computing.

What are you working on while at ICL?

I am a member of the Performance Evaluation group, and I will be contributing to the PAPI-EX project during my time here.

If you weren’t working at ICL, where would you like to be working and why?

If I hadn’t decided to go back to school, I suppose it would be pretty interesting to work somewhere like SpaceX or NASA. I’ve always been fascinated by space, so it would be pretty fun to develop software that went towards the advancement of space travel.

What are your interests/hobbies outside of work?

I really love movies, especially older films with classic actors/actresses from the 50’s and 60’s. I try to watch as many as I can when I have time. I’ve probably watched 2001: A Space Odyssey ten times.

Tell us something about yourself that might surprise people.

I once saw lightning strike a car directly behind me on the interstate. It was strange.

Recent Papers

  1. Song, F., and J. Dongarra, “A Scalable Approach to Solving Dense Linear Algebra Problems on Hybrid CPU-GPU Systems,” Concurrency and Computation: Practice and Experience, vol. 27, issue 14, pp. 3702-3723, September 2015. DOI: 10.1002/cpe.3403
  2. Jagode, H., A. Danalis, G. Bosilca, and J. Dongarra, “Accelerating NWChem Coupled Cluster through dataflow-based Execution,” 11th International Conference on Parallel Processing and Applied Mathematics (PPAM 2015), Krakow, Poland, Springer International Publishing, September 2015.
  3. Haidar, A., P. Luszczek, S. Tomov, and J. Dongarra, “Batched Matrix Computations on Hardware Accelerators,” EuroMPI/Asia 2015 Workshop, Bordeaux, France, September 2015.
  4. Baboulin, M., J. Dongarra, A. Remy, S. Tomov, and I. Yamazaki, “Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures,” Lecture Notes in Computer Science, vol. 9573: Springer International Publishing, pp. 86-95, September 2015, 2016. DOI: 10.1007/978-3-319-32149-3_9
  5. Haidar, A., S. Tomov, P. Luszczek, and J. Dongarra, “MAGMA Embedded: Towards a Dense Linear Algebra Library for Energy Efficient Extreme Computing,” 2015 IEEE High Performance Extreme Computing Conference (HPEC ’15), (Best Paper Award), Waltham, MA, IEEE, September 2015.
  6. Guidry, M., and A. Haidar, “On the Design, Autotuning, and Optimization of GPU Kernels for Kinetic Network Simulations Using Fast Explicit Integration and GPU Batched Computation,” Oak Ridge, TN, Joint Institute for Computational Sciences Seminar Series, Presentation, September 2015.
  7. Danalis, A., H. Jagode, G. Bosilca, and J. Dongarra, “PaRSEC in Practice: Optimizing a Legacy Chemistry Application through Distributed Task-Based Execution,” 2015 IEEE International Conference on Cluster Computing, Chicago, IL, IEEE, September 2015.
  8. Bouteiller, A., G. Bosilca, and J. Dongarra, “Plan B: Interruption of Ongoing MPI Operations to Support Failure Recovery,” 22nd European MPI Users' Group Meeting, Bordeaux, France, ACM, September 2015. DOI: 10.1145/2802658.2802668
  9. Baboulin, M., V. Dobrev, J. Dongarra, C. Earl, J. Falcou, A. Haidar, I. Karlin, T. Kolev, I. Masliah, and S. Tomov, “Towards a High-Performance Tensor Algebra Package for Accelerators,” Gatlinburg, TN, Smoky Mountains Computational Sciences and Engineering Conference (SMC15), September 2015.
  10. Haidar, A., A. Abdelfattah, S. Tomov, and J. Dongarra, “Batched Matrix Computations on Hardware Accelerators Based on GPUs,” 2015 SIAM Conference on Applied Linear Algebra (SIAM LA), Atlanta, GA, SIAM, October 2015.
  11. Gates, M., S. Tomov, and A. Haidar, “Comparing Hybrid CPU-GPU and Native GPU-only Acceleration for Linear Algebra,” 2015 SIAM Conference on Applied Linear Algebra, Atlanta, GA, SIAM, October 2015.
  12. Haidar, A., P. Luszczek, S. Tomov, and J. Dongarra, “Efficient Eigensolver Algorithms on Accelerator Based Architectures,” 2015 SIAM Conference on Applied Linear Algebra (SIAM LA), Atlanta, GA, SIAM, October 2015.
  13. Yamazaki, I., J. Barlow, S. Tomov, J. Kurzak, and J. Dongarra, “Mixed-precision orthogonalization process Performance on multicore CPUs with GPUs,” 2015 SIAM Conference on Applied Linear Algebra, Atlanta, GA, SIAM, October 2015.
  14. Abalenkovs, M., A. Abdelfattah, J. Dongarra, M. Gates, A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, I. Yamazaki, and A. YarKhan, “Parallel Programming Models for Dense Linear Algebra on Heterogeneous Systems,” Supercomputing Frontiers and Innovations, vol. 2, no. 4, October 2015. DOI: 10.14529/jsfi1504
  15. Anzt, H., E. Chow, D. Szyld, and J. Dongarra, “Random-Order Alternating Schwarz for Sparse Triangular Solves,” 2015 SIAM Conference on Applied Linear Algebra (SIAM LA), Atlanta, GA, SIAM, October 2015.

Recent Conferences

  1. SEP
    9th Parallel Tools Workshop, Dresden, Germany
    Heike McCraw
  2. SEP
    PPAM15, Krakow, Poland
    Heike McCraw
  3. SEP
    IEEE Cluster 2015, Chicago, Illinois
    Anthony Danalis
  4. SEP
    ORNL scientific seminar series, Oak Ridge, Tennessee
    Azzam Haidar
  5. SEP
    Piotr Luszczek
  6. SEP
    Euro MPI 2015, Bordeaux, France
    George Bosilca
  7. SEP
    Intel Big Data Retreat, Hillsboro, Oregon
    Piotr Luszczek, Thomas Herault
  8. SEP
    Jack Dongarra
  9. OCT
    Argo Members Meeting, Argonne, Illinois
    Damien Genet, George Bosilca, Thomas Herault
  10. OCT
    IWACOM-III, Tokyo, Japan
    Ichitaro Yamazaki
  11. OCT
    Jack Dongarra
  12. OCT
    White House Workshop on the National Strategic Computing Initiative (NSCI), Washington, District of Columbia
    Jack Dongarra
  13. OCT
    SIAM LA, Atlanta, Georgia
    Azzam Haidar, Hartwig Anzt, Ichitaro Yamazaki, Mark Gates, Piotr Luszczek, Stanimire Tomov, Yaohung "Mike" Tsai

Upcoming Conferences

  1. NOV
    SC15, Austin, Texas
    Aurelien Bouteiller, George Bosilca, Hartwig Anzt, Ichitaro Yamazaki, Jack Dongarra, Jakub Kurzak, Phil Mucci, Piotr Luszczek, Terry Moore, Thomas Herault, Tracy Rafferty
  2. NOV
    Blake Haugen

Recent Lunch Talks

  1. SEP 4
    Mathieu Faverge, Inria
    Blocking Strategy Optimizations for Sparse Direct Linear Solver on Heterogeneous Architectures
  2. SEP 11
    Asim YarKhan
    OpenMP Tasks and PLASMA
  3. SEP 18
    Mark Gates
    Accelerating Collaborative Filtering Using Concepts from High Performance Computing
  4. SEP 25
    Ichitaro Yamazaki
    Random Sampling to Update Truncated SVD
  5. OCT 2
    Mike Guidry, ORNL
    Fast New Methods for Solving Large Sets of Coupled Differential Equations at Scale in Scientific Applications
  6. OCT 9
    Mike Jantz, EECS
    Cross-Layer Memory Management to Achieve Power and Performance Goals
  7. OCT 12
    Edmond Chow, Georgia Tech
    Very Fine-grained Parallelization of Sparse Linear Algebra Computations
  8. OCT 16
    Pierre Sens, LIP6
    Probabilistic Byzantine Tolerance for Cloud Computing
  9. OCT 23
    Bob Muenchen, UTK
    Monitoring Trends in Tools for Data Science
  10. OCT 30
    Michael Barton, United States Army Research Laboratory (ARL)
    Data Intensive Science and Computing

Upcoming Lunch Talks

  1. NOV 3
    Takeshi Fukaya, Hokkaido University
    CholeskyQR2: Cholesky QR factorization with reorthogonalization
  2. NOV 3
    Toshiyuki Imamura, RIKEN AICS
    ASPEN.K2+MUBLAS:level2 CUDA BLAS kernels
  3. NOV 6
    Moritz Kreutzer, Friedrich-Alexander University Erlangen-Nürnberg
    Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems
  4. NOV 13
    Sticks Mabakane, University of Cape Town
    Novel Visualizations for Optimization of Parallel Programs

Visitors

  1. Pierre Sens
    Pierre Sens from LIP6 will be visiting on Friday, October 16. Pierre will be working with the DisCo group and giving a lunch talk.
  2. Moritz Kreutzer
    Moritz Kreutzer from Regionales RechenZentrum Erlangen (RRZE) will be visiting from October 17 through December 18. Moritz, a PhD student, will be working with Hartwig during his visit to ICL.
  3. Marc Baboulin
    Marc Baboulin from LRI will be visiting from October 27 through October 30.
  4. Takeshi Fukaya
    Takeshi Fukaya from Hokkaido University will be visiting from October 30 through November 5.
  5. Toshiyuki Imamura
    Toshiyuki Imamura from RIKEN will be visiting from October 30 through November 5.

Congratulations

Julie Langou & Bertrand Combre

On September 29th, 2015, ICL’s Julie Langou married Bertrand Combre in Independence, CA. Congratulations to the bride and groom!

Dates to Remember

Early Registration for SC15

Early registration for SC15 closes on October 15th. Everyone from ICL who plans to attend SC15 should register by this deadline!

SC15 ICL Dinner

Save the date! The SC15 ICL dinner will be on Wednesday, November 18th, 7pm, at Z’ Tejas (1110 West 6th Street) in Austin, TX.