ICL Newsletter

News and Announcements

ActaPACK Funded

ICL’s joint venture with Lawrence Livermore National Laboratory for Small Tensor Operations on Advanced Architectures for High-order Applications (ActaPACK) was recently funded by the US Department of Energy. ActaPACK, which is funded for 18 months beginning in September of 2015, will involve UTK/ICL’s MAGMA group studying the performance optimization of the tensor kernels used in BLAST on various modern architectures. Congratulations to the Linear Algebra Group!

Conference Reports

9th Parallel Tools Workshop

Heike Jagode made it back to her hometown of Dresden, Germany for the 9th Parallel Tools Workshop on September 2-3. The Parallel Tools Workshop, held annually, is a joint venue between HLRS Stuttgart and ZIH Dresden and brings together researchers and practitioners that develop, maintain, or use tools that aid in the development of parallel applications. For her part, Heike gave a presentation on “Recent Advances in PAPI” on the first day of the workshop.

PPAM 2015

This year’s International Conference on Parallel Processing and Applied Mathematics (PPAM 2015) was held in Krakow, Poland on September 6-9, and is the 11th such conference so far in the biennial series. PPAM 2015 covered topics in parallel and distributed computing, including theory and applications, as well as applied mathematics with a focus on models, algorithms, and software tools that facilitate efficient and convenient utilization of modern parallel and distributed computing architectures.

Heike Jagode also attended PPAM to present ICL’s work on “Accelerating NWChem Coupled Cluster through Dataflow-based Execution.” ICL alumni and frequent collaborator Marc Baboulin presented his paper “Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures,” of which Stan Tomov, Ichi Yamazaki, and Jack Dongarra are co-authors.

HPEC ’15

On the road again, ICL’s Piotr Luszczek went back to Boston to present a paper at the 2015 IEEE High Performance Extreme Computing Conference (HPEC ’15) on September 15-17. HPEC is the largest computing conference in New England and is the premier conference in the world on the convergence of High Performance and Embedded Computing.

The paper, “MAGMA Embedded: Towards a Dense Linear Algebra Library for Energy Efficient Extreme Computing,” earned Piotr and his co-authors (Azzam Haidar, Stan Tomov, and Jack Dongarra) a Best Paper Award. Congratulations!

Intel Big Data Retreat

ICL’s Thomas Herault and Piotr Luszczek were invited to the Intel Big Data Retreat in Hillsboro, OR on September 24-25. As the name suggests, this annual retreat was hosted by Intel and provided a venue for experts in Big Data and analytics to discuss the latest challenges, innovations, and solutions in big data applications.

Thomas presented a poster on PaRSEC – Runtime Task Engine for Numerical Computation. Not to be outdone, Piotr presented two posters; the first described Updating SVD Using Randomized Projection and the second outlined Random Sampling for Low Rank Approximation.

Recent Releases

SC15 Videos

ICL’s Linear Algebra Group and DisCo Group both put together videos (with help from ICL Graphics Guru David Rogers) to advertise their upcoming tutorials at SC15. Check them out below and be sure to check out the actual tutorials at SC15 in Austin, TX.

“Linear Algebra Libraries for High-Performance Computing” – Narrated by Dawn Danalis

“Fault-Tolerance for HPC – Theory and Practice” – Narrated by George Bosilca

ULFM 1.0 Released

The major 1.0 milestone has been reached for the User Level Failure Mitigation (ULFM) compliant fault tolerant MPI. ULFM is a set of new interfaces for MPI that enables Message Passing programs to restore MPI functionality affected by process failures. The MPI implementation is spared the expense of internally taking protective and corrective actions against failures. Instead, it reports operations whose completions were rendered impossible by failures.

For the 1.0 release, the UFLM team focused on improving performance, both before and after the occurrence of failures. The list of new features includes:

Support for the non-blocking version of the agreement, MPI_COMM_IAGREE.
Compliance with the latest ULFM specification draft. In particular, the MPI_COMM_(I)AGREE semantic has changed.
New algorithm to perform agreements, with a truly logarithmic complexity in number of ranks, which translates into huge performance boosts in MPI_COMM_(I)AGREE and MPI_COMM_SHRINK. Meet us at SC15 to learn more about the novel algorithm we designed!
New algorithm to perform communicator revocation. MPI_COMM_REVOKE performs a reliable broadcast with a fixed maximum output degree, which scales logarithmically with the number of ranks. Meet us at EuroMPI’15 to learn more about the Revoke algorithm we designed!
Improved support for our traditional network layer:
- TCP: fully tested.
- SM: fully tested (with the exception of XPMEM, which remains unsupported).
Added support for High Performance networks.
- Open IB: reasonably tested.
- uGNI: reasonably tested.
The tuned collective module is now enabled by default (reasonably tested), expect a huge performance boost compared to the former basic default setting.
Back-ported PBS/ALPS fixes from Open MPI.
Back-ported OpenIB bug/performance fixes from Open MPI.
Improve Context ID allocation algorithm to reduce overheads of Shrink.
Miscellaneous bug fixes (look at the commit log for the full list).

Fault tolerance support for RMA and IO is still under development.

Visit the ULFM software page to download the source and happy hacking.

Interview

Where are you from, originally?

I am from Nashville, TN.

Can you summarize your educational background?

I earned my BS in Computer Science from UTK and I started my MS in Computer Science this fall.

Tell us how you first learned about ICL.

I first heard about ICL during my undergrad when various professors spoke about the work going on at ICL, and it captured my interest. When I found out that ICL was looking for research assistants I jumped at the chance join the group.

What made you want to work for ICL?

HPC plays a critical role in scientific advancement so there is obviously a lot of intrigue in that respect. I also really wanted to be able to participate in leading edge research while getting my MS, and I developed an interest in HPC over the course of my undergrad degree. ICL has such a large footprint in the HPC community that I had no doubt that the work going on here would be at the forefront of the field. I really saw this as an opportunity to push myself intellectually and expand my knowledge into a very exciting realm of computing.

What are you working on while at ICL?

I am a member of the Performance Evaluation group, and I will be contributing to the PAPI-EX project during my time here.

If you weren’t working at ICL, where would you like to be working and why?

If I hadn’t decided to go back to school, I suppose it would be pretty interesting to work somewhere like SpaceX or NASA. I’ve always been fascinated by space, so it would be pretty fun to develop software that went towards the advancement of space travel.

What are your interests/hobbies outside of work?

I really love movies, especially older films with classic actors/actresses from the 50’s and 60’s. I try to watch as many as I can when I have time. I’ve probably watched 2001: A Space Odyssey ten times.

Tell us something about yourself that might surprise people.

I once saw lighting strike a car directly behind me on the interstate. It was strange.

Recent Papers

Song, F., and J. Dongarra, “A Scalable Approach to Solving Dense Linear Algebra Problems on Hybrid CPU-GPU Systems,” Concurrency and Computation: Practice and Experience, vol. 27, issue 14, pp. 3702-3723, September 2015. DOI: 10.1002/cpe.3403 (8.16 MB)
Jagode, H., A. Danalis, G. Bosilca, and J. Dongarra, “Accelerating NWChem Coupled Cluster through dataflow-based Execution,” 11th International Conference on Parallel Processing and Applied Mathematics (PPAM 2015), Krakow, Poland, Springer International Publishing, September 2015. (452.82 KB)
Haidar, A., P. Luszczek, S. Tomov, and J. Dongarra, “Batched Matrix Computations on Hardware Accelerators,” EuroMPI/Asia 2015 Workshop, Bordeaux, France, September 2015. (589.05 KB)
Baboulin, M., J. Dongarra, A. Remy, S. Tomov, and I. Yamazaki, “Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures,” Lecture Notes in Computer Science, vol. 9573: Springer International Publishing, pp. 86-95, September 2015, 2016. DOI: 10.1007/978-3-319-32149-3_9 (327.14 KB)
Haidar, A., S. Tomov, P. Luszczek, and J. Dongarra, “MAGMA Embedded: Towards a Dense Linear Algebra Library for Energy Efficient Extreme Computing,” 2015 IEEE High Performance Extreme Computing Conference (HPEC â15), (Best Paper Award), Waltham, MA, IEEE, September 2015. (678.86 KB)
Guidry, M., and A. Haidar, On the Design, Autotuning, and Optimization of GPU Kernels for Kinetic Network Simulations Using Fast Explicit Integration and GPU Batched Computation , Oak Ridge, TN, Joint Institute for Computational Sciences Seminar Series, Presentation, September 2015. (17.25 MB)
Danalis, A., H. Jagode, G. Bosilca, and J. Dongarra, “PaRSEC in Practice: Optimizing a Legacy Chemistry Application through Distributed Task-Based Execution,” 2015 IEEE International Conference on Cluster Computing, Chicago, IL, IEEE, September 2015. (1.77 MB)
Bouteiller, A., G. Bosilca, and J. Dongarra, “Plan B: Interruption of Ongoing MPI Operations to Support Failure Recovery,” 22nd European MPI Users' Group Meeting, Bordeaux, France, ACM, September 2015. DOI: 10.1145/2802658.2802668 (543.32 KB)
Baboulin, M., V. Dobrev, J. Dongarra, C. Earl, J. Falcou, A. Haidar, I. Karlin, T. Kolev, I. Masliah, and S. Tomov, Towards a High-Performance Tensor Algebra Package for Accelerators , Gatlinburg, TN, moky Mountains Computational Sciences and Engineering Conference (SMC15), September 2015. (1.76 MB)
Haidar, A., A. Abdelfattah, S. Tomov, and J. Dongarra, “Batched Matrix Computations on Hardware Accelerators Based on GPUs,” 2015 SIAM Conference on Applied Linear Algebra (SIAM LA), Atlanta, GA, SIAM, October 2015. (9.36 MB)
Gates, M., S. Tomov, and A. Haidar, “Comparing Hybrid CPU-GPU and Native GPU-only Acceleration for Linear Algebra,” 2015 SIAM Conference on Applied Linear Algebra, Atlanta, GA, SIAM, October 2015. (4.7 MB)
Haidar, A., P. Luszczek, S. Tomov, and J. Dongarra, “Efficient Eigensolver Algorithms on Accelerator Based Architectures,” 2015 SIAM Conference on Applied Linear Algebra (SIAM LA), Atlanta, GA, SIAM, October 2015. (6.98 MB)
Yamazaki, I., J. Barlow, S. Tomov, J. Kurzak, and J. Dongarra, “Mixed-precision orthogonalization process Performance on multicore CPUs with GPUs,” 2015 SIAM Conference on Applied Linear Algebra, Atlanta, GA, SIAM, October 2015. (301.01 KB)
Abalenkovs, M., A. Abdelfattah, J. Dongarra, M. Gates, A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, I. Yamazaki, and A. YarKhan, “Parallel Programming Models for Dense Linear Algebra on Heterogeneous Systems,” Supercomputing Frontiers and Innovations, vol. 2, no. 4, October 2015. DOI: 10.14529/jsfi1504 (3.68 MB)
Anzt, H., E. Chow, D. Szyld, and J. Dongarra, “Random-Order Alternating Schwarz for Sparse Triangular Solves,” 2015 SIAM Conference on Applied Linear Algebra (SIAM LA), Atlanta, GA, SIAM, October 2015. (1.53 MB)

Recent Conferences

SEP
2

9th Parallel Tools Workshop Dresden, Germany
Heike

Heike McCraw
SEP
6

PPAM15 Poland, Krakow
Heike

Heike McCraw
SEP
8

IEEE Cluster 2015 Chicago, Illinois
Anthony

Anthony Danalis
SEP
9

ORNL scientific seminar series Oak Ridge, Tennessee
Azzam

Azzam Haidar
SEP
15

2015 IEEE High Performance Extreme Computing Conference (HPEC'15) Waltham, Massachusetts
Piotr

Piotr Luszczek
SEP
21

Euro MPI 2015 Bordeaux, France
George

George Bosilca
SEP
24

Intel Big Data Retreat Hillsboro, Oregon
Piotr
Thomas

Piotr Luszczek, Thomas Herault
SEP
28

Russian Supercomputing Days Moscow, Russia
Jack

Jack Dongarra
OCT
1

Argo Members Meeting Argonne, Illinois
Damien
George
Thomas

Damien Genet, George Bosilca, Thomas Herault
OCT
12

IWACOM-III Tokyo, Japan
Ichitaro

Ichitaro Yamazaki
OCT
15-16

Workshop on Computational Science & Engineering Software Sustainability and Productivity Challenges (CSESSP) Rockford, Maryland
Jack

Jack Dongarra
OCT
20-21

White House Workshop on the National Strategic Computing Initiative (NSCI) Washington, District of Columbia
Jack

Jack Dongarra
OCT
25-26

SIAM LA Atlanta, Georgia
Azzam
Hartwig
Ichitaro
Mark
Piotr
Stan
Mike

Azzam Haidar, Hartwig Anzt, Ichitaro Yamazaki, Mark Gates, Piotr Luszczek, Stanimire Tomov, Yaohung "Mike" Tsai

Upcoming Conferences

NOV
15-20

SC15 Austin, Texas
Aurelien
George
Hartwig
Ichitaro
Jack
Jakub
Phil
Piotr
Terry
Thomas
Tracy

Aurelien Bouteiller, George Bosilca, Hartwig Anzt, Ichitaro Yamazaki, Jack Dongarra, Jakub Kurzak, Phil Mucci, Piotr Luszczek, Terry Moore, Thomas Herault, Tracy Rafferty
NOV
15

2nd Workshop on Visual Performance Analysis (VPA) Austin, Texas
Blake

Blake Haugen

Recent Lunch Talks

SEP
4
Mathieu Faverge
Inria
Blocking Strategy Optimizations for Sparse Direct Linear Solver on Heterogeneous Architectures PDF
SEP
11
Asim YarKhan
OpenMP Tasks and PLASMA PDF
SEP
18
Mark Gates
Accelerating Collaborative Filtering Using Concepts from High Performance Computing PDF
SEP
25
Ichitaro Yamazaki
Random Sampling to Update Truncated SVD PDF
OCT
2
Mike Guidry
ORNL
Fast New Methods for Solving Large Sets of Coupled Differential Equations at Scale in Scientific Applications PDF
OCT
9
Mike Jantz
EECS
Cross-Layer Memory Management to Achieve Power and Performance Goals PDF
OCT
12
Edmond Chow
Georgia Tech
Very Fine-grained Parallelization of Sparse Linear Algebra Computations PDF
OCT
16
Pierre Sens
LIP6
Probabilistic Byzantine Tolerance for Cloud Computing PDF
OCT
23
Bob Muenchen
UTK
Monitoring Trends in Tools for Data Science PDF
OCT
30
Michael Barton
United States Army Research Laboratory (ARL)
Data Intensive Science and Computing PDF

Upcoming Lunch Talks

NOV
3
Takeshi Fukaya
Hokkaido University
CholeskyQR2: Cholesky QR factorization with reorthogonalization PDF
NOV
3
Toshiyuki Imamura
RIKEN AICS
ASPEN.K2+MUBLAS:level2 CUDA BLAS kernels PDF
NOV
6
Moritz Kreutzer
Friedrich-Alexander University Erlangen-Nürnberg
Performance Engineering of the Kernel Polynomal Method on Large-Scale CPU-GPU Systems PDF
NOV
13
Sticks Mabakane
University of Cape Town
Novel Visualizations for Optimization of Parallel Programs PDF

Visitors

Pierre Sens from LIP6 will be visiting on Friday, October 16. Pierre will be working with the DisCo group and giving a lunch talk.
Moritz Kreutzer from Regionales RechenZentrum Erlangen (RRZE) will be visiting from October 17 through December 18. Moritz, a PhD student, will be working with Hartwig during his visit to ICL.
Marc Baboulin from LRI will be visiting from October 27 through October 30.
Takeshi Fukaya from Hokkaido University will be visiting from October 30 through November 5.
Toshiyuki Imamura from RIKEN will be visiting from October 30 through November 5.

Visitors

Pierre Sens from LIP6 will be visiting on Friday, October 16. Pierre will be working with the DisCo group and giving a lunch talk.
Moritz Kreutzer from Regionales RechenZentrum Erlangen (RRZE) will be visiting from October 17 through December 18. Moritz, a PhD student, will be working with Hartwig during his visit to ICL.
Marc Baboulin from LRI will be visiting from October 27 through October 30.
Takeshi Fukaya from Hokkaido University will be visiting from October 30 through November 5.
Toshiyuki Imamura from RIKEN will be visiting from October 30 through November 5.

congratulations

Julie Langou & Bertrand Combre

On September 29th, 2015, ICL’s Julie Langou married Bertrand Combre in Independence, CA. Congratulations to the bride and groom!

Dates to Remember

Early Registration for SC15

Early registration for SC15 closes on October 15th. Everyone from ICL who plans to attend SC15 should register by this deadline!

SC15 ICL Dinner

Save the date! The SC15 ICL dinner will be on Wednesday, November 18th, 7pm, at Z’ Tejas (1110 West 6th Street) in Austin, TX.

October 2015

News and Announcements

ActaPACK Funded

Conference Reports

9th Parallel Tools Workshop

PPAM 2015

HPEC ’15

Intel Big Data Retreat

Recent Releases

SC15 Videos

ULFM 1.0 Released

Interview

Phil Vaccaro

Recent Papers

Recent Conferences

Upcoming Conferences

Recent Lunch Talks

Upcoming Lunch Talks

Visitors

Visitors

congratulations

Julie Langou & Bertrand Combre

Dates to Remember

Early Registration for SC15

SC15 ICL Dinner

Archives

PDF Editions