News and Announcements

Russian Academy of Sciences

Jack Dongarra and Vladimir Voevodin

Jack Dongarra recently received his certificate as a Foreign Member of the Russian Academy of Sciences (RAS). Established in 1724, the Academy is home to over 1,000 smaller institutions throughout Russia and houses a network of researchers in fields ranging from nuclear physics and applied mathematics to philosophy and language.

When asked about the news of his induction into the RAS in 2016, Jack noted that “being elected to the academy is not only an honor but also another effective avenue for sharing what we learn from our experimental computer science work.” Congratulations, Jack!

Five UT Centers at SC19

Five computational science research centers from the University of Tennessee—the Bredesen Center, the Global Computing Laboratory, the Joint Institute for Computational Sciences, the Innovative Computing Laboratory, and Chattanooga’s SimCenter—will represent the University with their own booth (#575) at this year’s International Conference for High Performance Computing, Networking, Storage, and Analysis (SC19) on November 17–22 in Denver, Colorado. Stop by and say “Hello!”

ICL @ SC19 Dinner

The ICL @ SC19 dinner is set for Wednesday, November 20th at 6:30 p.m.

Henry’s Tavern
500 16th Street, Unit 184B
Denver, CO 80202
720-399-8466

Respond to Joan Snoderly (jsnoderly@icl.utk.edu) by November 8th if you wish to attend.

Conference Reports

Brookhaven GPU Hackathon

Always on the move, Piotr Luszczek traveled to Upton, NY on September 23–27, where he served as a hacking mentor for the Brookhaven National Laboratory’s GPU Hackathon. The Upton event garnered around 40 attendees who were divided into teams working to port development codes to GPUs or further optimize their applications for the latest and greatest in GPU hardware—all with the help of a team mentor.

Piotr’s team worked on integrating NWChemEx with libraries from ICL and the Extreme-scale Scientific Software Development Kit (xSDK) with the aim of accelerating the application’s performance using NVIDIA GPUs.

This exercise is part of Piotr’s continued push to improve application integration with respect to xSDK and ICL libraries. The next GPU Hackathon is hosted by ORNL on October 21–25 at the Crowne Plaza hotel in Knoxville, TN. Of note: 2019 has seen a record 11 GPU Hackathons, and it seems like Piotr has participated in all of them.

The Editor would like to thank Piotr Luszczek for his contributions to this article.

SPEC19: HPC2020

The Standard Performance Evaluation Corporation’s 2019 workshop (SPEC19) was held on September 12–13, 2019 in Alexandria, VA and focused on benchmarking the next generation of leadership-class computers (hence “HPC2020”). Piotr Luszczek attended the workshop to outline ICL’s latest contributions to the field.

In presenting “Benchmarking: Doomed to Succeed or Simply Succeeding,” Piotr described his experience in the design, implementation, deployment, and evaluation of four widely known benchmark efforts in which he was involved. By outlining the extensive adoption and longevity of these efforts, along with the large volume of results and data, he provided relevant background on the community process involved in benchmarking and on how this process (and experience) can be leveraged for the benefit of the SPEC HPC2020 effort.

According to Piotr, SPEC19 had plenty of informal opportunities to exchange ideas, and—even though there might have been differences of opinion as to which benchmark is the best—everybody was cordial. That’s always a plus. Familiar faces included David Bailey (HPCC), Bill Kramer (NCSA), Jack Wells (OLCF), and Rich Vuduc (Georgia Tech).

The Editor would like to thank Piotr Luszczek for his contribution to this article.

Recent Releases

heFFTe 0.1 Released

The Highly Efficient FFTs for Exascale (heFFTe) library was released on October 1st, 2019. heFFTe 0.1, the first release in ICL’s ECP-FFT effort, aspires to be a new and sustainable high-performance FFT library for exascale platforms that leverages the large investments in FFT software by the broader HPC community.

The heFFTe library has achieved significant acceleration of 3-D FFTs using GPUs, showing very good weak and strong scalability, owing to highly optimized GPU kernels and MPI communications.

heFFTe 0.1 is now available on FFT-ECP’s Bitbucket repo: https://bitbucket.org/icl/heffte/.

Interview

Sebastien Cayrols

Where are you from, originally?
I am a 33-year-old French guy born in Paris. I grew up in a small town close to Paris, and then I moved back to Paris for middle and high school.

Can you summarize your educational background?
I did not know what I wanted to do, but I was mainly interested in sciences. Following a teacher’s advice in high school, I started to really study Physics at the University of Paris-Sud XI.

During my Bachelor’s in Fundamental Physics, I took some programming courses, and I found it so interesting that I decided to quit physics and start again as a first-year student; it took me five more years to graduate as a “computer scientist” instead. And, during my last year, I had Laura Grigori as an instructor, and she then became my thesis advisor.

Where did you work before joining ICL?
I started my PhD working for INRIA, and I was primarily located at “Maison de la Simulation,” which is part of CEA Saclay. I was also a member of the ALPINE Team, which is a joint team between LJLL, Sorbonne University, and INRIA Paris. My main focus was redesigning algorithms to reduce the amount of communication.

Before defending my PhD, I received a postdoc position with Iain S. Duff at RAL, STFC in the United Kingdom. I then became part of a European project called NLAFET, and I was working on sparse-direct solvers.

How did you first hear about the lab, and what made you want to work here?
I first heard about ICL during my last year as a (an under?)graduate. Marc Baboulin, who was one of my professors, had invited a famous American guy to give a talk at LRI, Orsay, and Marc informed us that we MUST attend. This “American guy” turned out to be Jack. To be honest, I was new to HPC at the time, and I discovered so many things through his talk.

So, a few years later, when I joined NLAFET, I met Jack again, and he invited me to Knoxville. Finally, I discussed the invitation with Marc, Mawussi, and Florent, and they told me how great it is to be at ICL, and I said, “Why not go to Tennessee if my wife agrees?”

What is your focus here at ICL? What are you working on?
I would say that I am a sparse guy, in the mathematical sense, because of what I have done during my PhD, and I have the feeling that my skills are like a gruyere, which is Swiss cheese with so many holes. Being here should allow me to partly remedy this.

I want to do GPU programming, machine learning, and efficient programming, just to name a few. To this end, I am involved in some very interesting projects: PEEKS, FFT, Convex Optimization, and AsyncIS.

What are your interests/hobbies outside of work?
I have a 3-year-old son, and my wife and I are waiting for a second child due in October! 🙂
Otherwise, nothing as exciting as kids, I am afraid, though I do enjoy traveling, series, manga, games, and cooking.

Tell us something about yourself that might surprise people.
I never did want to learn English. I spent years at school avoiding it. I was even targeting a job where English is not necessary. So now the question is: what happened? I still do not know. I guess doing a PhD pushed me to change my mind (actually, I had no other choice).

If you weren’t working at ICL, where would you like to be working and why?
I have no idea.

Recent Papers

  1. YarKhan, A., J. Kurzak, A. Abdelfattah, and J. Dongarra, “An Empirical View of SLATE Algorithms on Scalable Hybrid System,” Innovative Computing Laboratory Technical Report, no. ICL-UT-19-08: University of Tennessee, Knoxville, September 2019.
  2. Gamblin, T., P. Beckman, K. Keahey, K. Sato, M. Kondo, and G. Balazs, “BDEC2 Platform White Paper,” Innovative Computing Laboratory Technical Report, no. ICL-UT-19-11: University of Tennessee, September 2019.
  3. Davis, J., T. Gao, S. Chandrasekaran, H. Jagode, A. Danalis, P. Balaji, J. Dongarra, and M. Taufer, “Characterization of Power Usage and Performance in Data-Intensive Applications using MapReduce over MPI,” 2019 International Conference on Parallel Computing (ParCo2019), Prague, Czech Republic, September 2019.
  4. Beck, M., T. Moore, N. French, E. Kissel, and M. Swany, “Data Logistics: Toolkit and Applications,” 5th EAI International Conference on Smart Objects and Technologies for Social Good, Valencia, Spain, September 2019.
  5. Patinyasakdikul, T., D. Eberius, G. Bosilca, and N. Hjelm, “Give MPI Threading a Fair Chance: A Study of Multithreaded MPI Designs,” IEEE Cluster, Albuquerque, NM, IEEE, September 2019.
  6. Shaiek, H., S. Tomov, A. Ayala, A. Haidar, and J. Dongarra, “GPUDirect MPI Communications and Optimizations to Accelerate FFTs on Exascale Systems,” EuroMPI'19 Posters, Zurich, Switzerland, no. ICL-UT-19-06: ICL, September 2019.
  7. Luszczek, P., I. Yamazaki, and J. Dongarra, “Increasing Accuracy of Iterative Refinement in Limited Floating-Point Arithmetic on Half-Precision Accelerators,” IEEE High Performance Extreme Computing Conference (HPEC 2019), Best Paper Finalist, Waltham, MA, IEEE, September 2019.
  8. Danalis, A., H. Jagode, and J. Dongarra, “PAPI's new Software-Defined Events for in-depth Performance Analysis,” Dresden, Germany, 13th Parallel Tools Workshop, September 2019.
  9. Abdelfattah, A., S. Tomov, and J. Dongarra, “Progressive Optimization of Batched LU Factorization on GPUs,” IEEE High Performance Extreme Computing Conference (HPEC’19), Waltham, MA, IEEE, September 2019.
  10. Zhong, D., A. Bouteiller, X. Luo, and G. Bosilca, “Runtime Level Failure Detection and Propagation in HPC Systems,” European MPI Users' Group Meeting (EuroMPI '19), Zürich, Switzerland, ACM, September 2019. DOI: 10.1145/3343211.3343225
  11. Gao, Y., L-C. Canon, Y. Robert, and F. Vivien, “Scheduling Independent Stochastic Tasks on Heterogeneous Cloud Platforms,” IEEE Cluster 2019, Albuquerque, New Mexico, IEEE Computer Society Press, September 2019.
  12. Gates, M., M. Al Farhan, A. Charara, J. Kurzak, D. Sukkari, A. YarKhan, and J. Dongarra, “SLATE Working Note 13: Implementing Singular Value and Symmetric/Hermitian Eigenvalue Solvers,” SLATE Working Notes, no. 13, ICL-UT-19-07: Innovative Computing Laboratory, University of Tennessee, September 2019.
  13. Altintas, I., K. Marcus, V. Vural, S. Purawat, D. Crawl, G. Antoniu, A. Costan, O. Marcu, P. Balaprakash, R. Cao, et al., “A Collection of White Papers from the BDEC2 Workshop in San Diego, CA,” Innovative Computing Laboratory Technical Report, no. ICL-UT-19-13: University of Tennessee, October 2019.
  14. Tomov, S., A. Abdelfattah, V. Barra, N. Beams, J. Brown, J-S. Camier, V. Dobrev, J. Dongarra, Y. Dudouit, P. Fischer, et al., “CEED ECP Milestone Report: Performance Tuning of CEED Software and 1st and 2nd Wave Apps,” Zenodo, October 2019. DOI: 10.5281/zenodo.3477618
  15. Tomov, S., A. Haidar, A. Ayala, H. Shaiek, and J. Dongarra, “FFT-ECP Implementation Optimizations and Features Phase,” Innovative Computing Laboratory Technical Report, no. ICL-UT-19-12: University of Tennessee, October 2019.
  16. Bujanovic, Z., and Z. Drmac, “New Robust ScaLAPACK Routine for Computing the QR Factorization with Column Pivoting,” LAPACK Working Note, no. LAWN 296, ICL-UT-19-14: University of Tennessee, October 2019.

Recent Conferences

  1. SEP
    George Bosilca
  2. SEP
    Anthony Danalis
  3. SEP
    2019 ECP CoPA All-Hands Meeting, Santa Fe, New Mexico
    Jakub Kurzak, Stanimire Tomov
  4. SEP
    EuroMPI and MPI Forum, Zurich, Switzerland
    Dong Zhong
  5. SEP
    SPEC HPC2020, Alexandria, Virginia
    Piotr Luszczek
  6. SEP
    Aurora Programming Workshop, Lemont, Illinois
    Damien Genet, Piotr Luszczek, Tony Castaldo
  7. SEP
    ORNL meeting on MagmaDNN, Oak Ridge, Tennessee
    Stanimire Tomov
  8. SEP
    IEEE HPEC 2019, Boston, Massachusetts
    Ahmad Abdelfattah, Piotr Luszczek
  9. SEP
    Piotr Luszczek
  10. SEP
    ENUMATH, Egmond aan Zee, Netherlands
    George Bosilca
  11. OCT
    Heike Jagode
  12. OCT
    BDEC San Diego, San Diego, California
    Jack Dongarra, Joan Snoderly, Terry Moore

Upcoming Conferences

  1. NOV
    SC19, Denver, Colorado
    Ahmad Abdelfattah, Alan Ayala, Aurelien Bouteiller, Daniel Barry, David Eberius, George Bosilca, Gerald Ragghianti, Hartwig Anzt, Jack Dongarra, Jiali Li, Joan Snoderly, Mark Gates, Piotr Luszczek, Qinglei Cao, Terry Moore, Thomas Herault, Yaohung Tsai, Yu Pei

Recent Lunch Talks

  1. SEP 6
    Asim YarKhan
    Linear Systems Solvers for Distributed-Memory Machines with GPU Accelerators
  2. SEP 13
    Nuria Losada
    Asynchronous Receiver-Driven Replay for Local Rollback of MPI Applications
  3. SEP 20
    Oscar Hernandez, ORNL
    Filling in the Gaps between Applications and the OpenMP Specification for Exascale
  4. SEP 27
    Srinivas Aluru, Georgia Tech
    Parallel Machine Learning Approaches for Reverse Engineering Genome-Scale Networks
  5. OCT 4
    Yves Robert, ENS-Lyon
    Scheduling Independent Stochastic Tasks on Heterogeneous Cloud Platforms
  6. OCT 11
    Axel Huebl, Lawrence Berkeley National Laboratory
    Scalable, Performance-Portable Particle-in-Cell Simulations and PByte-Scale Data-Challenges
  7. OCT 18
    Alan Ayala
    heFFTe: Highly Efficient FFT for Exascale
  8. OCT 25
    Yaohung Tsai
    Autotuning in Deep Learning Kernels

Upcoming Lunch Talks

  1. NOV 1
    David Eberius
    A Flexible MPI Benchmark For Fast Assessment of Multithreaded Communication Performance
  2. NOV 8
    Sticks Mabakane
    Effective Callgraph Visualisations for Optimisation of Parallel-Programs
  3. NOV 15
    Anthony Danalis
    Questions about hardware events? CAT has the answers.

Dates to Remember

10/16: Last Day for Early Registration for SC

Here’s a friendly reminder that October 16, 2019 is the last day for SC19’s early registration.