News and Announcements

Jack Dongarra a Foreign Member of the Royal Society

On April 16, 2019, Jack Dongarra was elected as a Foreign Member of the Royal Society (ForMemRS) for his contributions to mathematics and computational science and engineering. The Royal Society is the oldest scientific academy in continuous existence, dating back to 1663, and members include Isaac Newton, Charles Darwin, and Alan Turing—among many other distinguished scientists. Congratulations, Jack!

Exascale with Aurora?

Will the first name of the US exascale (machine) effort be “Aurora?” If the Department of Energy (DOE) has anything to say about it, then—yes—Aurora will be the first supercomputer in the United States to reach 1018 floating point operations per second. For a cool $500 million, this new machine will be installed at Argonne National Laboratory sometime in 2021.

It is very likely that other major players in HPC will also have their own exascale machines by 2021—especially considering China has accelerated (pun intended) its commitment to HPC and currently has 227 systems on the TOP500, comprising nearly half of the list. As of November 2018, the United States has only 109 systems on the TOP500.

In fact, and speaking of the TOP500, ICL’s Jack Dongarra is aware of three possible exascale systems that could be installed in China sometime in 2020.

With that in mind, the US DOE has allocated $1.8 billion for additional exascale machines that could be installed at Oak Ridge National Laboratory and Lawrence Livermore National Laboratory. What has at times seemed like a slow-motion race to exascale is now at our doorstep.

Click here to read more about Aurora and DOE plans for exascale (New York Times).

ICL Winter Reception

Cruising into 2019 with spring right around the corner, the ICL Winter Reception was once again held at Calhoun’s on the River. The reception offered a welcome opportunity for about 50 members of ICL and their loved ones to relax, eat, drink, and enjoy the camaraderie.

Research Spotlight Award from the Office of Research and Engagement

Research Spotlight Award Winners (Left to Right): Interim Chancellor Wayne Davis, Bill Fox, Jack Dongarra, Michael Mason, Kimberly Powell, Alex Bentley (accepting for Dawnie Steadman), Katherine Ambroziak (accepting for Phillip Enquist), Lou Gross, Daniel Feller, Suzie Allard, Sarah Colby

Jack Dongarra was presented with a Research Spotlight Award from the Office of Research and Engagement (ORE) on March 25, 2019. The ORE awards are a recognition of comprehensive research enterprises—including activities related to funding, mentorship, creative achievement, community engagement, and responsible conduct of research. Congratulations, Jack!

Conference Reports

VI-HPS Knoxville

On April 9–12, ICL hosted the Virtual Institute for High Productivity Supercomputing’s (VI-HPS’s) 31st tuning workshop. ICL is one of the original four founders of VI-HPS, and today the virtual institute combines the expertise of twelve partner institutions—each with experience in the development and application of HPC programming tools and projects that contribute leading-edge technology to the institute and the community at large.

A total of 22 participants took advantage of the tuning workshop, and many of them prepared their own MPI, OpenMP, or hybrid MPI + OpenMP parallel application codes for analysis. This workshop approach to analysis resulted in successful and high-quality engagement between participants and tools instructors during the four days of training.

The editor would like to thank Heike Jagode for her contribution to this article.

Santa Fe GPU Hackathon

Another season, another GPU Hackathon featuring Piotr Luszczek. Although it was nearly spring (merely a week away on March 11–15), it certainly looks like winter is coming, and the southern Rocky Mountains were hit with a major snow storm and 60 mph winds.

Around 50 people managed to make it to the Hackathon despite the weather, and the Sante Fe round featured a mix of applications leveraging OpenMP 4.5, OpenACC, and Kokkos. All applications were run on the Fluid Numerics Cloud cluster, which is an elastic HPC cluster powered by the Google Cloud Platform that features a variety of GPUs.

Specifically, Piotr’s goal for this round was to improve application integration with respect to the Extreme-scale Scientific Software Development Kit (xSDK).

The editor would like to thank Piotr Luszczek for his contributions to this article.

Recent Releases

HPCG 3.1 Released

The High Performance Conjugate Gradients (HPCG) 3.1 reference code release is now available. The HPCG benchmark is designed to measure performance that is representative of modern scientific applications. It does so by exercising the computational and communication patterns commonly found in real science and engineering codes, which are often based on sparse iterative solvers. HPCG exhibits the same irregular accesses to memory and fine-grain recursive computations that dominate large-scale scientific workloads used to simulate complex physical phenomena.

Improvements to the HPCG 3.1 code include:

  • Switched the output format for reporting the results from YAML to a basic line-oriented, key-value format with nested naming scheme for the keys.
  • Added faster search for optimal 3-D grid partitioning of a given integer that does not require combinatorial search through the all 3-set partitioning of the prime factors.
  • Closed the outstanding bugs reported as issues on HPCG’s GitHub project page and incorporated the fixes in the source code.

Click here to download the tarball. Follow HPCG’s development on GitHub.

Interview

Dong Zhong Then

Dong Zhong

Where are you from, originally?
I was born in Shaanxi, Chinfa—one of the cradles of Chinese civilization. Thirteen feudal dynasties established their capitals in our province during a span of more than 1,100 years.

Can you summarize your educational background?
I earned my Bachelor’s degree from Toing University at ShangHai, where I majored in computer science. I earned my Master’s degree from ZheJiang University at HangZhou, where my research interest focused on micro-satellite operating system design and implementation, including the design of the operating system, the data structures, and the subsystem interfaces (API) used to control the satellite and communicate with subsystems. The subsystems included the orbit control system, attitude determination and control system, power supply, telemetry and control system, and GPS.

How did you first hear about ICL, and what made you want to work here?
I know some former students from ICL, and I really like the research topics and enjoy the working environment.

What is your focus here? What are you working on?
I am working in the DisCo group as part of the Open MPI project, and my research focuses on fault-tolerant distributed systems, including node/process failure detection and reliable message propagation. I am also involved in the Cross-layer Application-Aware Resilience at Extreme Scale (CAARES) project doing related work.

What would you consider your most valuable “lesson” you have learned so far at ICL?
It is very important to work in a community (such as the Open MPI community) and have good connections with people from industry and research labs.

What are your interests/hobbies outside of work?
I have a pair of lovebirds and several kinds of fish. Keeping them doesn’t require too much effort, but it brings me a lot of joy.

Tell us something about yourself that might surprise people.
I am good at breeding birds, and my bird has hatched three clutches so far. The breeding process is interesting, and the nursing part is challenging.

If you weren’t working at ICL, where would you like to be working and why?
I’d like to be a research assistant or software engineer working with satellite technology or related topics.

Recent Papers

  1. Anzt, H., J. Dongarra, G. Flegar, N. J. Higham, and E. S. Quintana-Orti, Adaptive Precision in Block-Jacobi Preconditioning for Iterative Sparse Linear System Solvers,” Concurrency and Computation: Practice and Experience, vol. 31, no. 6, pp. e4460, March 2019. DOI: 10.1002/cpe.4460  (341.54 KB)
  2. Abdelfattah, A., S. Tomov, and J. Dongarra, Optimizing Batch HGEMM on Small Sizes Using Tensor Cores , San Jose, CA, GPU Technology Conference (GTC), March 2019.  (2.47 MB)
  3. Dongarra, J., S. Gottlieb, and W. T. Kramer, Race to Exascale,” Computing in Science and Engineering, vol. 21, issue 1, pp. 4-5, March 2019. DOI: 10.1109/MCSE.2018.2882574  (106.97 KB)
  4. Brown, J., A. Abdelfattah, V. Barra, V. Dobrev, Y. Dudouit, P. Fischer, T. Kolev, D. Medina, M. Min, T. Ratnayaka, et al., CEED ECP Milestone Report: Public release of CEED 2.0 : Zenodo, April 2019. DOI: 10.5281/zenodo.2641316  (4.98 MB)
  5. Tomov, S., A. Haidar, A. Ayala, D. Schultz, and J. Dongarra, Design and Implementation for FFT-ECP on Distributed Accelerated Systems,” Innovative Computing Laboratory Technical Report, no. ICL-UT-19-05: University of Tennessee, April 2019.  (3.19 MB)
  6. Charara, A., J. Dongarra, M. Gates, J. Kurzak, and A. YarKhan, SLATE Mixed Precision Performance Report,” Innovative Computing Laboratory Technical Report, no. ICL-UT-19-03: University of Tennessee, April 2019.  (1.04 MB)
  7. Danalis, A., H. Jagode, D. Barry, and J. Dongarra, Understanding Native Event Semantics , Knoxville, TN, 9th JLESC Workshop, April 2019.  (2.33 MB)

Recent Conferences

  1. MAR
    -
    MPI Forum Chattanooga, Tennessee
    Aurelien Bouteiller
    Aurelien
    George Bosilca
    George
    Aurelien Bouteiller, George Bosilca
  2. MAR
    -
    GPU Hackathon Santa Fe Santa Fe, New Mexico
    Piotr Luszczek
    Piotr
    Piotr Luszczek
  3. MAR
    -
    Terry Moore
    Terry
    Terry Moore
  4. MAR
    -
    Ahmad Abdelfattah
    Ahmad
    Stanimire Tomov
    Stan
    Ahmad Abdelfattah, Stanimire Tomov
  5. MAR
    -
    7th ADAC Workshop Oak Ridge, Tennessee
    Stanimire Tomov
    Stan
    Stanimire Tomov
  6. APR
    -
    VI-HPS Tuning Workshop Knoxville, Tennessee
    Alan Ayala
    Alan
    Anthony Danalis
    Anthony
    Damien Genet
    Damien
    Daniel Barry
    Daniel
    Frank Winkler
    Frank
    Gerald Ragghianti
    Gerald
    Heike Jagode
    Heike
    Jamie Finney
    Jamie
    Piotr Luszczek
    Piotr
    Alan Ayala, Anthony Danalis, Damien Genet, Daniel Barry, Frank Winkler, Gerald Ragghianti, Heike Jagode, Jamie Finney, Piotr Luszczek
  7. APR
    -
    IEEE SouthEastCon 2019 Huntsville, Alabama
    Alan Ayala
    Alan
    Alan Ayala
  8. APR
    -
    JLESC Knoxville, Tennessee
    Ali Charara
    Ali
    Anthony Danalis
    Anthony
    Asim YarKhan
    Asim
    Damien Genet
    Damien
    George Bosilca
    George
    Gerald Ragghianti
    Gerald
    Hartwig Anzt
    Hartwig
    Ichitaro Yamazaki
    Ichitaro
    Jack Dongarra
    Jack
    Jakub Kurzak
    Jakub
    Nuria Losada
    Nuria
    Piotr Luszczek
    Piotr
    Qinglei Cao
    Qinglei
    Reazul Hoque
    Reazul
    Stanimire Tomov
    Stan
    Terry Moore
    Terry
    Thananon Patinyasakdikul
    Arm
    Xi Luo
    Xi
    Yaohung Tsai
    Mike
    Yu Pei
    Yu
    Ali Charara, Anthony Danalis, Asim YarKhan, Damien Genet, George Bosilca, Gerald Ragghianti, Hartwig Anzt, Ichitaro Yamazaki, Jack Dongarra, Jakub Kurzak, Nuria Losada, Piotr Luszczek, Qinglei Cao, Reazul Hoque, Stanimire Tomov, Terry Moore, Thananon Patinyasakdikul, Xi Luo, Yaohung Tsai, Yu Pei

Upcoming Conferences

  1. MAY
    -
    PETTT Annual PPE Review Vicksburg, Mississippi
    Stanimire Tomov
    Stan
    Stanimire Tomov
  2. MAY
    -
    BDEC2 Poznan, Poland
    Joan Snoderly
    Joan
    Sam Crawford
    Sam
    Terry Moore
    Terry
    Tracy Rafferty
    Tracy
    Joan Snoderly, Sam Crawford, Terry Moore, Tracy Rafferty
  3. MAY
    -
    IPDPS Rio di Janeiro, Brazil
    Anthony Danalis
    Anthony
    Hartwig Anzt
    Hartwig
    Ichitaro Yamazaki
    Ichitaro
    Jack Dongarra
    Jack
    Anthony Danalis, Hartwig Anzt, Ichitaro Yamazaki, Jack Dongarra
  4. MAY
    -
    2019 OLCF User Meeting Oak Ridge, Tennessee
    Stanimire Tomov
    Stan
    Stanimire Tomov
  5. MAY
    -
    MPI Forum Chicago, Illinois
    Aurelien Bouteiller
    Aurelien
    Aurelien Bouteiller
  6. MAY
    -
    DoE/MEXT Chicago, Illinois
    Mark Gates
    Mark
    Mark Gates

Recent Lunch Talks

  1. MAR
    8
    Stephen Herbein
    Stephen Herbein
    Lawrence Livermore National Laboratory
    Flux: Using Next-Generation Resource Management and Scheduling Infrastructure for Exascale Workflows PDF
  2. MAR
    15
    Danny Rorabaugh
    Danny Rorabaugh
    Global Computing Laboratory
    A Workflow for Soil Moisture Analytics PDF
  3. MAR
    22
    Jamie Finney
    Jamie Finney
    DevOps: Tools and Practices PDF
  4. MAR
    26
    Torsten Hoefler
    Torsten Hoefler
    ETH Zürich
    High-Performance Communication in Machine Learning PDF
  5. MAR
    29
    Qinglei Cao
    Qinglei Cao
    Understanding the Performance of PaRSEC in the "Task Bench" Benchmark
  6. APR
    5
    Yves Robert
    Yves Robert
    ENS-Lyon
    Reservation Strategies for Stochastic Jobs PDF
  7. APR
    26
    Daniel Barry
    Daniel Barry
    Everything You Wanted to Know about the Counter Inspection Toolkit PDF

Upcoming Lunch Talks

  1. MAY
    3
    Stephen Thomas
    Stephen Thomas
    Global Computing Laboratory
    Analytics4MD: In-Situ Data Analytics for Next Generation Molecular Dynamics (MD) Workflows
  2. MAY
    10
    Qinglei Cao
    Qinglei Cao
    Task Bench: A Parameterized Benchmark for Evaluating Parallel Runtime Performance
  3. MAY
    17
    Ali Charara
    Ali Charara
    SLATE Data Coherence: An Adapted Cache Coherence Model
  4. MAY
    24
    Thomas Herault
    Thomas Herault
    Comparing the Performance of Rigid, Moldable and Grid-Shaped Applications on Failure-Prone HPC Platforms PDF
  5. MAY
    31
    Reazul Hoque
    Reazul Hoque
    Dynamic Task Discovery in a Data Flow Task-Based Runtime PDF

People

  1. Ichitaro Yamazaki
    After 8 years at ICL, Ichitaro Yamazaki took a research position at Sandia National Laboratories in Albuquerque, NM. Congratulations and good luck, Ichi!
  2. Rizwan Ashraf
    Rizwan Ashraf will join ICL in May as a Research Scientist in the Performance ICL (PICL) group. Welcome aboard, Rizwan!
  3. Neil Lindquist
    Neil Lindquist will be joining ICL as a Graduate Research Assistant this summer. Welcome, Neil!