ICL Newsletter

News and Announcements

Thirty Years of Innovative Computing

On August 7–9, 2019, the Innovative Computing Laboratory celebrated its 30th anniversary with a “Thirty Years of Innovative Computing” workshop. Held at the University of Tennessee’s Student Union, the celebration and workshop included 50 talks by ICL alumni across 2 days. The participants, around 100 ICLers in total, came from all over the globe—including attendees from as far away as Japan and Saudi Arabia—to share their current interests (research or otherwise) and to celebrate their time at ICL.

Catching up with old friends and colleagues was easy to do with plenty of time to reconnect and remember. A welcome reception on August 7th kicked off the event at the Hilton, and a banquet at the Knoxville Museum of Art followed on August 8th. A farewell dinner at Calhoun’s capped off the event, with friends and family joining in on the festivities.

Throughout the whirlwind workshop—Jack sets a quick tempo—one familiar and eminently relatable sound was heard: laughter. Finding a large group of affable and good-natured people is difficult in any organization, let alone one that has spanned 30 years. And yet, here we are—part of an organization that has always striven for excellence, always been a hub of leadership and talent, and always has something to smile and laugh about. See you all next time.

The Editor would like to thank all of the photo contributors. The entire photo collection can be found here.

EPEXA Funded

The Production-quality Ecosystem for Programming and Executing eXtreme-scale Applications (EPEXA) project has been funded by the National Science Foundation. EPEXA is a collaboration between UTK/ICL (George Bosilca), Stony Brook University (Robert Harrison), and Virginia Tech (Ed Valeev).

Funded for five years, EPEXA is a follow on to the TESSE effort and aims to create a production-quality, open-source software ecosystem that enhances programmer productivity and portable performance for advanced scientific applications on massively-parallel, hybrid, many-core systems.

Specifically, through science-driven codesign, the team plans to harden the previous research prototype into a production-quality, data-flow programming model and associated parallel runtime with the aim of growing the community of scientists employing these tools in their research.

The EPEXA team will leverage the success of their prior work in data-flow and dependency-style programming models and associated runtimes and in fielding sustainable production-quality runtimes, domain-specific languages, and linear algebra libraries. EPEXA’s kickoff meeting will be held on November 1, 2019. Congratulations to everyone involved.

The Editor would like to thank George Bosilca for his contributions to this article.

Conference Reports

Collegeville Workshop on Sustainable Scientific Software

On July 22–24, 2019, St. John’s University in Collegeville, Minnesota hosted the first Workshop on Sustainable Scientific Software (CW3S19), and ICL’s Heike Jagode was among the 40 participants. The goal of this workshop was to bring together community leaders and practitioners in scientific software to identify the challenges and priority research direction(s) needed to improve the sustainability of the scientific software stack.

The meeting was a mixture of talks, panels, and group discussions on topics like sustainability challenges, technical approaches for improved sustainability, cultural approaches for improved sustainability, and the like.

Heike presented sustainability challenges in the Performance API (PAPI) project, drawing on the lessons learned from the past 20 years of the application’s development. To this end, she presented her white paper, “What it Takes to Keep PAPI Instrumental for the HPC Community” and also laid out PAPI’s path forward and the development of a new PAPI++ software package with Modern C++.

The Editor would like to thank Heike Jagode for her contributions to this article.

Euro-Par 2019

On August 26–30, 2019, ICL’s Asim YarKhan was in Göttingen, Germany for the 2019 European Conference on Parallel Processing (Euro-Par 2019). Euro-Par, which is also celebrating its 25th anniversary this year, covers all aspects of parallel and distributed processing.

For his part, Asim presented a paper on the SLATE implementation of “Linear Systems Solvers for Distributed Memory Machines with GPU Accelerators,” coauthored by Jakub Kurzak, Mark Gates, Ali Charara, Ichitaro Yamazaki, and Jack Dongarra.

Göttingen is where Carl Friedrich Gauss did most of his work, so—during his presentation—Asim apparently made a quip about how we are still solving Ax = b using the matrix form of Gaussian elimination. This insight was met with knowing nods and palpable interest. Asim also took a tour that revolved around Gauss’s life and work, culminating in a trip to an observatory that was built for Gauss’s use (pictured above).

Asim wasn’t the only familiar face in Göttingen, as UTK’s Michela Taufer presented the first keynote of the conference on the intersection of data analytics and HPC. Also present were ICL alumni Emmanuel Jeannot, Felix Wolf, and Karl Fürlinger in various roles as workshop chairs, committee members, and authors/presenters.

Euro-Par 2020 will be held in Warsaw, Poland.

The Editor would like to thank Asim YarKhan for his contributions to this article.

2019 ICL Retreat

For 2019, the ICL retreat changed things up a bit as we moved to the Park Vista Hotel in Gatlinburg, Tennessee and were joined by members of the Global Computing Laboratory.

The Park Vista’s larger meeting space provided a good platform for two days of talks that covered student projects and summer internships, the lab’s progress in the areas of linear algebra, distributed computing, benchmarking, and performance analysis, along with recaps of administrative procedures.

Serving as a kickoff to the fall semester, the 2019 retreat was more important than ever, as it provided a platform for the dozen-or-so new ICLers to get their bearings and hit the ground running for another great year at ICL!

Recent Releases

ULFM 4.0.1ulfm2.1rc1 Released

ULFM 4.0.1ulfm2.1rc1 is now available. User Level Failure Mitigation (ULFM) is a set of new interfaces for MPI that enables message passing applications to restore MPI functionality affected by process failures. The MPI implementation is spared the expense of internally taking protective and corrective automatic actions against failures. Instead, it can prevent any fault-related deadlock situation by reporting operations whose completions were rendered impossible by failures.

The ULFM v4.0.1ulfm2.1rc1 release is now in sync with the current Open MPI (v4.0.1). Updates to the ULFM component (since 2.0) include:

New Features:

Added the MPI_Comm_is_revoked function;
Renamed ftbasic collective component to ftagree; and
Restored the pcollreq extension.

Bug Fixes:

Failures of node-local siblings were not always detected, fixed;
Failure propagation and detection was slowed down by trying to notify known dead processes, fixed;
Fixed deadlocks in multi-threaded programs;
Fixed issues with PMPI when compiling Fortran Interfaces; and
Fixed deadlocks on OS-X.

There are also many changes to Open MPI 4.0.1; please see the Open MPI change log for details.

For more details on the ULFM release, see fault-tolerance.org.

MAGMA 2.5.1 Released

MAGMA 2.5.1 is now available. Matrix Algebra on GPU and Multicore Architectures (MAGMA) is a collection of next-generation linear algebra (LA) libraries for heterogeneous architectures. The MAGMA package supports interfaces for current LA packages and standards (e.g., LAPACK and BLAS) to allow computational scientists to easily port any LA-reliant software components to heterogeneous architectures.

Changes for MAGMA 2.5.1 include:

Updates and improvements in CMakeLists.txt for improved/friendlier CMake and spack installations;
Fixes related to MAGMA installation on GPUs and CUDA versions that do not support FP16 arithmetic;
Added support for Turing GPUs;
Removed some C++ features from MAGMA Sparse for friendlier compilation (using nvcc and various CPU compilers);
New routine: magmablas_Xherk_small_reduce (X = ‘s’, ‘d’, ‘c’, or ‘z’) is a special HERK routine that assumes that the output matrix is very small (up to 32 × 32) and that the input matrix is very tall and skinny.

Click here to download the tarball.

Interview

Where are you from, originally?
I was born in Illinois and grew up in Wisconsin and Texas.

Can you summarize your educational background?
I have a BS in mechanical engineering from the University of Oklahoma, and MS/PhD in Theoretical & Applied Mechanics from the University of Illinois.

Where did you work before joining ICL?
Before joining ICL, I was a post doc in the Computational and Applied Mathematics department at Rice University in Houston, Texas.

How did you first hear about the lab, and what made you want to work here?
I first heard about ICL through the use of LAPACK software in graduate school. I heard about potential job openings at ICL at the 2019 SIAM Computational Science and Engineering conference. I thought it would be interesting to see the “other side” of things, as it were—where these libraries and tools I knew from a user standpoint were being created and improved.

What is your focus here at ICL? What are you working on?
To start, I will be working on integration of ICL libraries with application codes for the PEEKS and CEED projects. This is due to my background in numerical methods and previous work on application codes in graduate school and during my postdoc.

What are your interests/hobbies outside of work?
I really enjoy music and love playing the flute. (I like playing the piano, too, but sadly, I don’t own one.) I also like reading and doing crossword puzzles.

Tell us something about yourself that might surprise people.
My last semester of undergrad, I only had nine hours left of required courses but needed twelve hours to be classified as a full-time student. I decided to take “Intro to Martial Arts” and “Ballroom Dancing I” as my extra three hours. Both were pretty outside my comfort zone (and still mostly are) but overall, I had a blast!

If you weren’t working at ICL, where would you like to be working and why?
Maybe NASA? Like many people, I briefly wanted to be an astronaut when I was around eight years old. While I was in Houston, I got to visit NASA, and it was very inspiring.

Recent Papers

Yamazaki, I., A. Ida, R. Yokota, and J. Dongarra, “Distributed-Memory Lattice H-Matrix Factorization,” The International Journal of High Performance Computing Applications, vol. 33, issue 5, pp. 1046â1063, August 2019. DOI: 10.1177/1094342019861139 (1.14 MB)
Kurzak, J., M. Gates, A. Charara, A. YarKhan, I. Yamazaki, and J. Dongarra, “Linear Systems Solvers for Distributed-Memory Machines with GPU Accelerators,” Euro-Par 2019: Parallel Processing, vol. 11725: Springer, pp. 495â506, August 2019. DOI: 10.1007/978-3-030-29400-7_35
Kurzak, J., Y. Tsai, M. Gates, A. Abdelfattah, and J. Dongarra, “Massively Parallel Automated Software Tuning,” 48th International Conference on Parallel Processing (ICPP 2019), Kyoto, Japan, ACM Press, August 2019. DOI: 10.1145/3337821.3337908 (911.88 KB)
Yamazaki, I., E. Chow, A. Bouteiller, and J. Dongarra, “Performance of Asynchronous Optimized Schwarz with One-sided Communication,” Parallel Computing, vol. 86, pp. 66-81, August 2019. DOI: 10.1016/j.parco.2019.05.004 (3.09 MB)
Tseng, S-M., B. Nicolae, G. Bosilca, E. Jeannot, A. Chandramowlishwaran, and F. Cappello, “Towards Portable Online Prediction of Network Utilization Using MPI-Level Monitoring,” 2019 European Conference on Parallel Processing (Euro-Par 2019), Göttingen, Germany, Springer, August 2019. DOI: 10.1007/978-3-030-29400-7_4 (1.07 MB)
YarKhan, A., J. Kurzak, A. Abdelfattah, and J. Dongarra, “An Empirical View of SLATE Algorithms on Scalable Hybrid System,” Innovative Computing Laboratory Technical Report, no. ICL-UT-19-08: University of Tennessee, Knoxville, September 2019. (441.16 KB)
Gamblin, T., P. Beckman, K. Keahey, K. Sato, M. Kondo, and G. Balazs, “BDEC2 Platform White Paper,” Innovative Computing Laboratory Technical Report, no. ICL-UT-19-11: University of Tennessee, September 2019. (30.16 KB)
Davis, J., T. Gao, S. Chandrasekaran, H. Jagode, A. Danalis, P. Balaji, J. Dongarra, and M. Taufer, “Characterization of Power Usage and Performance in Data-Intensive Applications using MapReduce over MPI,” 2019 International Conference on Parallel Computing (ParCo2019), Prague, Czech Republic, September 2019.
Beck, M., T. Moore, N. French, E. Kissel, and M. Swany, “Data Logistics: Toolkit and Applications,” 5th EAI International Conference on Smart Objects and Technologies for Social Good, Valencia, Spain, September 2019. (6.71 MB)
Patinyasakdikul, T., D. Eberius, G. Bosilca, and N. Hjelm, “Give MPI Threading a Fair Chance: A Study of Multithreaded MPI Designs,” IEEE Cluster, Albuquerque, NM, IEEE, September 2019. (220.84 KB)
Shaiek, H., S. Tomov, A. Ayala, A. Haidar, and J. Dongarra, “GPUDirect MPI Communications and Optimizations to Accelerate FFTs on Exascale Systems,” EuroMPI'19 Posters, Zurich, Switzerland, no. icl-ut-19-06: ICL, September 2019. (2.25 MB)
Luszczek, P., I. Yamazaki, and J. Dongarra, “Increasing Accuracy of Iterative Refinement in Limited Floating-Point Arithmetic on Half-Precision Accelerators,” IEEE High Performance Extreme Computing Conference (HPEC 2019), Best Paper Finalist, Waltham, MA, IEEE, September 2019. (470.21 KB)
Danalis, A., H. Jagode, and J. Dongarra, PAPI's new Software-Defined Events for in-depth Performance Analysis , Dresden, Germany, 13th Parallel Tools Workshop, September 2019. (3.14 MB)
Abdelfattah, A., S. Tomov, and J. Dongarra, “Progressive Optimization of Batched LU Factorization on GPUs,” IEEE High Performance Extreme Computing Conference (HPECâ19), Waltham, MA, IEEE, September 2019. (299.38 KB)
Zhong, D., A. Bouteiller, X. Luo, and G. Bosilca, “Runtime Level Failure Detection and Propagation in HPC Systems,” European MPI Users' Group Meeting (EuroMPI '19), Zürich, Switzerland, ACM, September 2019. DOI: 10.1145/3343211.3343225 (1.11 MB)
Gao, Y., L-C. Canon, Y. Robert, and F. Vivien, “Scheduling Independent Stochastic Tasks on Heterogeneous Cloud Platforms,” IEEE Cluster 2019, Albuquerque, New Mexico, IEEE Computer Society Press, September 2019. (651 KB)
Gates, M., M. Al Farhan, A. Charara, J. Kurzak, D. Sukkari, A. YarKhan, and J. Dongarra, “SLATE Working Note 13: Implementing Singular Value and Symmetric/Hermitian Eigenvalue Solvers,” SLATE Working Notes, no. 13, ICL-UT-19-07: Innovative Computing Laboratory, University of Tennessee, September 2019. (3.47 MB)

Recent Conferences

AUG
5-8

Monterey Data Conference Monterey, California
Piotr

Piotr Luszczek
AUG
5-8

ICPP 2019 Kyoto, Japan
Jakub

Jakub Kurzak
AUG
6-8

CEED Third Annual Meeting Blacksburg, Virginia
Stan

Stanimire Tomov
AUG
26-30

Euro-Par 2019 Göttingen, Germany
Asim

Asim YarKhan
SEP
2-6

Challenges in High Performance Computing Workshop Camberra, Australia
George

George Bosilca
SEP
2-3

13th Parallel Tools Workshop Dresden, Germany
Anthony

Anthony Danalis
SEP
4-6

2019 ECP CoPA All-Hands Meeting Santa Fe, New Mexico
Jakub
Stan

Jakub Kurzak, Stanimire Tomov
SEP
5-13

EuroMPI and MPI Forum zurich, Switzerland
Dong

Dong Zhong
SEP
12-13

SPEC HPC2020 Alexandria, Virginia
Piotr

Piotr Luszczek
SEP
17-19

Aurora Programming Workshop Lemont, Illinois
Damien
Piotr
Tony

Damien Genet, Piotr Luszczek, Tony Castaldo
SEP
20

ORNL meeting on MagmaDNN Oak Ridge, Tennessee
Stan

Stanimire Tomov
SEP
23-27

IEEE HPEC 2019 Boston, Massachusetts
Ahmad
Piotr

Ahmad Abdelfattah, Piotr Luszczek
SEP
23-27

GPU Hackathon Brookhaven Upton, NY
Piotr

Piotr Luszczek
SEP
30-4

ENUMATH Egmond aan Zee, Netherlands
George

George Bosilca

Upcoming Conferences

OCT
8-10

Frontier Application Readiness Kick-Off Workshop Knoxville, Tennessee
Heike

Heike Jagode
OCT
15-17

BDEC San Diego San Diego, California
Jack
Joan
Terry

Jack Dongarra, Joan Snoderly, Terry Moore

Recent Lunch Talks

AUG
23
Victoria Stodden
University of Illinois
Advancing Reproducibility and Transparency via Cyberinfrastructure and Community Standards PDF
AUG
30
Daniel Nichols
Joint Institute for Computational Sciences
MagmaDNN: Towards High-Performance Data Analytics and Machine Learning for Data-Driven Scientific Computing PDF
SEP
6
Asim YarKhan
Linear Systems Solvers for Distributed-Memory Machines with GPU Accelerators PDF
SEP
13
Nuria Losada
Asynchronous Receiver-Driven Replay for Local Rollback of MPI Applications PDF
SEP
20
Oscar Hernandez
ORNL
Filling in the Gaps between Applications and the OpenMP Specification for Exascale
SEP
27
Srinivas Aluru
Georgia Tech
Parallel Machine Learning Approaches for Reverse Engineering Genome-Scale Networks PDF

Upcoming Lunch Talks

OCT
4
Yves Robert
ENS-Lyon
Scheduling Independent Stochastic Tasks on Heterogeneous Cloud Platforms PDF
OCT
11
Axel Huebl
Lawrence Berkeley National Laboratory
Scalable, Performance-Portable Particle-in-Cell Simulations and PByte-Scale Data-Challenges
OCT
18
Alan Ayala
heFFTe: Highly Efficient FFT for Exascale PDF
OCT
25
Yaohung Tsai
Autotuning in Deep Learning Kernels PDF

People

Florent Lopez, once an ICL visitor/collaborator, has now joined the group as a post doc. Florent will be working with Stan and the MAGMA team. Welcome back, Florent!
Dalal Sukkari joined ICL at the beginning of August to work on the SLATE and PEEKS projects. Welcome to ICL, Dalal!
Natalie Beams joined ICL at the beginning of August to work with Stan and the MAGMA team. Welcome, Natalie!
Cade Brown joined ICL as an Undergraduate Assistant working with Stan and the MAGMA team. Welcome, Cade!
Mohammed Al Farhan joined ICL in June as part of the SLATE effort. Welcome to ICL, Mohammed!

Dates to Remember

9/12: Vinton Cerf and Michael Witmore

Vinton Cerf (computer scientist and one of the fathers of the Internet) and Michael Witmore (rhetoric scholar and Director of the Folger Shakespeare Library) are visiting UTK and will deliver a lecture on “Machine Reading in the Digital Age” on Thursday, September 12, at 7:00 p.m. in the UT Student Union Auditorium (Room 180).

Vinton and Michael will also be at ICL earlier in the day for an informal meet and greet should you wish to join them in Claxton 233.

Meeting Schedule:
3:00–3:30 – Jack
3:30–4:15 – Students (233)
4:15–5:00 – Research Staff (233)

10/16: Last Day for Early Registration for SC

Here’s a friendly reminder that October 16, 2019 is the last day for SC19’s early registration.

September 2019