News and Announcements

Employment Opportunities at ICL

ICL is seeking full-time Research Scientists (MS or PhD) to participate in the design, development, and maintenance of numerical software libraries for solving linear algebra problems on large, distributed-memory machines with multi-core processors, hardware accelerators, and performance monitoring capabilities for new and advanced hardware and software technologies.

The prospective researcher will coauthor papers to document research findings, present the team’s work at conferences and workshops, and help lead students and other team members in their research endeavors in ongoing and future projects. Given the nature of the work, there will be opportunities for publication, travel, and high-profile professional networking and collaboration across academia, labs, and industry.

An MS or PhD in computer science, computational sciences, or math is preferred. Background in at least one of the following areas is also preferred: numerical linear algebra, HPC, performance monitoring, machine learning, or data analytics.

For more information check out ICL’s jobs page: http://www.icl.utk.edu/jobs.

Conference Reports

Workshop on Variable Precision in Mathematical and Scientific Computing

The Workshop on Variable Precision in Mathematical and Scientific Computing was held virtually on May 7–8, 2020. Hosted by the Institute for Computational and Experimental Research in Mathematics (ICERM), the digital workshop drew over 60 attendees with 10 talks spanning the two-day meeting.

ICL’s Jack Dongarra presented the most recent efforts in “Using Mixed Precision in Numerical Computations to Speedup Linear Algebra Solvers,” where he describes how mixed-precision (FP16 and FP64) iterative refinement methods using half-precision Tensor Cores (FP16-TC) for the arithmetic can provide up to 4× speedup while preserving accuracy where possible.

Hartwig Anzt also “dialed in” to present the latest work in the “Multiprecision Effort in the US Exascale Computing Project,” which aims to deploy algorithms that combine different precision formats to increase the performance on modern hardware architectures without impacting the high quality of the final result.

Other attendees from ICL included Neil Lindquist, Piotr Luszczek, and Mike Tsai.

In what might become standard practice these days, ICERM recorded the workshop presentations and posted them on their website for your viewing pleasure. Enjoy.

PDSEC 2020

The 21st IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2020) was held online on May 22, 2020.

ICL’s very own Yu Pei presented the DisCo team’s work on “Communication Avoiding 2D Stencil Implementations over the PaRSEC Task-Based Runtime,” where they were able to minimize communication bottlenecks in distributed stencil computations by combing the computation and communication overlap inherent in PaRSEC with a communication-avoiding scheme.

The Editor would like to thank Yu Pei for his contribution to this article.

Interview

Qinglei Cao Then

Qinglei Cao

Where are you from, originally?
I am originally from the Shandong province in China.

Can you summarize your educational background?
I earned my BS in Information and Computational Science from Hunan University in China and then worked at National University of Defense Technology. I earned my Master’s degree in Computer Application Technology from Ocean University of China in 2016. In the same year, I started my PhD study in Computer Science at the University of Tennessee and Joined ICL in August of 2017.

Where did you work before joining ICL?
I worked as a PhD student at the University of Tennessee.

How did you first hear about the lab, and what made you want to work here?
One of my friends told me about ICL when I joined the University of Tennessee, and this was the first time I heard about ICL; but the first time I heard about Jack was when I was at the National University of Defense Technology and worked around the TH-1A when it was targeting the TOP500.

What is your focus here at ICL? What are you working on?
I am working in the DisCo group, and my research focuses on the PaRSEC task-based runtime system, and I mainly work on the adaptive mesh refinement and Cholesky factorization in PaRSEC.

What are your interests/hobbies outside of work?
Basketball and delicious foods.

Tell us something about yourself that might surprise people.
During my first winter break when I was an undergraduate student, I stood more than 10 hours on the train back home because of the inexperience towards the Spring Festival travel rush.

If you weren’t working at ICL, where would you like to be working and why?
I’d like to be a research assistant working on image processing, machine learning, algorithms, or HPC in other research groups.

Recent Papers

  1. Lopez, F., E. Chow, S. Tomov, and J. Dongarra, Asynchronous SGD for DNN Training on Shared-Memory Parallel Architectures,” Workshop on Scalable Deep Learning over Parallel And Distributed Infrastructures (ScaDL 2020), May 2020.  (188.51 KB)
  2. Kolev, T., P. Fischer, A. Abdelfattah, S. Ananthan, V. Barra, N. Beams, R. Bleile, J. Brown, R. Carson, J-S. Camier, et al., CEED ECP Milestone Report: Improve Performance and Capabilities of CEED-Enabled ECP Applications on Summit/Sierra,” ECP Milestone Reports: Zenodo, May 2020. DOI: 10.5281/zenodo.3860804  (28.12 MB)
  3. Pei, Y., Q. Cao, G. Bosilca, P. Luszczek, V. Eijkhout, and J. Dongarra, Communication Avoiding 2D Stencil Implementations over PaRSEC Task-Based Runtime,” 2020 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), New Orleans, LA, IEEE, May 2020. DOI: 10.1109/IPDPSW50202.2020.00127  (1.33 MB)
  4. Nicolae, B., J. Li, J. M. Wozniak, G. Bosilca, M. Dorier, and F. Cappello, DeepFreeze: Towards Scalable Asynchronous Checkpointing of Deep Learning Models,” 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID), Melbourne, VIC, Australia, IEEE, May 2020. DOI: 10.1109/CCGrid49817.2020.00-76  (424.19 KB)
  5. Benoit, A., V. Le Fèvre, P. Raghavan, Y. Robert, and H. Sun, Design and Comparison of Resilient Scheduling Heuristics for Parallel Jobs,” 22nd Workshop on Advances in Parallel and Distributed Computational Models (APDCM 2020), New Orleans, LA, IEEE Computer Society Press, May 2020.  (696.21 KB)
  6. Losada, N., P. González, M. J. Martín, G. Bosilca, A. Bouteiller, and K. Teranishi, Fault Tolerance of MPI Applications in Exascale Systems: The ULFM Solution,” Future Generation Computer Systems, vol. 106, pp. 467-481, May 2020. DOI: 10.1016/j.future.2020.01.026  (2.06 MB)
  7. Haidar, A., H. Bayraktar, S. Tomov, J. Dongarra, and N. J. Higham, Mixed-Precision Solution of Linear Systems Using Accelerator-Based Computing,” Innovative Computing Laboratory Technical Report, no. ICL-UT-20-05: University of Tennessee, May 2020.  (1.03 MB)
  8. Gainaru, A., B. Goglin, V. Honoré, P. Raghavan, G. Pallez, P. Raghavan, Y. Robert, and H. Sun, Reservation and Checkpointing Strategies for Stochastic Jobs,” 34th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2020), New Orleans, LA, IEEE Computer Society Press, May 2020.  (692.4 KB)
  9. Bathie, G., L. Marchal, Y. Robert, and S. Thibault, Revisiting Dynamic DAG Scheduling under Memory Constraints for Shared-Memory Platforms,” 22nd Workshop on Advances in Parallel and Distributed Computational Models (APDCM 2020), New Orleans, LA, IEEE Computer Society Press, May 2020.  (317.93 KB)
  10. Zhong, D., P. Shamis, Q. Cao, G. Bosilca, and J. Dongarra, Using Arm Scalable Vector Extension to Optimize Open MPI,” 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID 2020), Melbourne, Australia, IEEE/ACM, May 2020. DOI: 10.1109/CCGrid49817.2020.00-71  (359.95 KB)
  11. Hendrickson, B., P. Messina, B. Bland, J. Chen, P. Colella, E. Dart, J. Dongarra, T. Dunning, I. Foster, R. Gerber, et al., ASCR@40: Highlights and Impacts of ASCR’s Programs : US Department of Energy’s Office of Advanced Scientific Computing Research, June 2020. DOI: 10.2172/1631812
  12. Krzhizhanovskaya, V., G. Závodszky, M. Lees, J. Dongarra, P. Sloot, S. Brissos, and J. Teixeira, Computational Science – ICCS 2020: 20th International Conference, Amsterdam, The Netherlands, June 3–5, 2020, Proceedings, Part I,” Lecture Notes in Computer Science, 1, no. 12137: Springer International Publishing, pp. 707, June 2020. DOI: 10.1007/978-3-030-50371-0
  13. Krzhizhanovskaya, V., G. Závodszky, M. Lees, J. Dongarra, P. Sloot, S. Brissos, and J. Teixeira, Computational Science – ICCS 2020: 20th International Conference, Amsterdam, The Netherlands, June 3–5, 2020, Proceedings, Part II,” Lecture Notes in Computer Science, 1, no. 12138: Springer International Publishing, pp. 697, June 2020. DOI: 10.1007/978-3-030-50417-5
  14. Krzhizhanovskaya, V., G. Závodszky, M. Lees, J. Dongarra, P. Sloot, S. Brissos, and J. Teixeira, Computational Science – ICCS 2020: 20th International Conference, Amsterdam, The Netherlands, June 3–5, 2020, Proceedings, Part III,” Lecture Notes in Computer Science, 1, no. 12139: Springer International Publishing, pp. 648, June 2020. DOI: 10.1007/978-3-030-50420-5
  15. Krzhizhanovskaya, V., G. Závodszky, M. Lees, J. Dongarra, P. Sloot, S. Brissos, and J. Teixeira, Computational Science – ICCS 2020: 20th International Conference, Amsterdam, The Netherlands, June 3–5, 2020, Proceedings, Part IV,” Lecture Notes in Computer Science, 1, no. 12140: Springer International Publishing, pp. 668, June 2020. DOI: 10.1007/978-3-030-50423-6
  16. Krzhizhanovskaya, V., G. Závodszky, M. Lees, J. Dongarra, P. Sloot, S. Brissos, and J. Teixeira, Computational Science – ICCS 2020: 20th International Conference, Amsterdam, The Netherlands, June 3–5, 2020, Proceedings, Part V,” Lecture Notes in Computer Science, 1, no. 12141: Springer International Publishing, pp. 618, June 2020. DOI: 10.1007/978-3-030-50426-7
  17. Krzhizhanovskaya, V., G. Závodszky, M. Lees, J. Dongarra, P. Sloot, S. Brissos, and J. Teixeira, Computational Science – ICCS 2020: 20th International Conference, Amsterdam, The Netherlands, June 3–5, 2020, Proceedings, Part VI,” Lecture Notes in Computer Science, 1, no. 12142: Springer International Publishing, pp. 667, June 2020. DOI: 10.1007/978-3-030-50433-5
  18. Krzhizhanovskaya, V., G. Závodszky, M. Lees, J. Dongarra, P. Sloot, S. Brissos, and J. Teixeira, Computational Science – ICCS 2020: 20th International Conference, Amsterdam, The Netherlands, June 3–5, 2020, Proceedings, Part VII,” Lecture Notes in Computer Science, 1, no. 12143: Springer International Publishing, pp. 775, June 2020. DOI: 10.1007/978-3-030-50436-6
  19. Cao, Q., Y. Pei, K. Akbudak, A. Mikhalev, G. Bosilca, H. Ltaief, D. Keyes, and J. Dongarra, Extreme-Scale Task-Based Cholesky Factorization Toward Climate and Weather Prediction Applications,” Platform for Advanced Scientific Computing Conference (PASC20), Geneva, Switzerland, ACM, June 2020. DOI: 10.1145/3394277.3401846  (2.71 MB)
  20. Wang, L., W. Wu, J. Zhang, H. Liu, G. Bosilca, M. Herlihy, and R. Fonseca, FFT-Based Gradient Sparsification for the Distributed Training of Deep Neural Networks,” 9th International Symposium on High-Performance Parallel and Distributed Computing (HPDC 20), Stockholm, Sweden, ACM, June 2020. DOI: 10.1145/3369583.3392681  (4.72 MB)
  21. Ayala, A., S. Tomov, A. Haidar, and J. Dongarra, heFFTe: Highly Efficient FFT for Exascale,” International Conference on Computational Science (ICCS 2020), Amsterdam, Netherlands, June 2020. DOI: 10.1007/978-3-030-50371-0_19  (2.62 MB)
  22. Abdelfattah, A., S. Tomov, and J. Dongarra, Investigating the Benefit of FP16-Enabled Mixed-Precision Solvers for Symmetric Positive Definite Matrices using GPUs,” International Conference on Computational Science (ICCS 2020), Amsterdam, Netherlands, Springer, Cham, June 2020. DOI: 10.1007/978-3-030-50417-5_18  (702.38 KB)
  23. Dongarra, J., Report on the Fujitsu Fugaku System,” Innovative Computing Laboratory Technical Report, no. ICL-UT-20-06: University of Tennessee, June 2020.  (3.3 MB)
  24. Tsai, Y. M., T. Cojean, and H. Anzt, Sparse Linear Algebra on AMD and NVIDIA GPUs—The Race is On,” ISC High Performance: Springer, June 2020. DOI: 10.1007/978-3-030-50743-5_16  (5.63 MB)
  25. Krzhizhanovskaya, V., G. Závodszky, M. Lees, J. Dongarra, P. Sloot, S. Brissos, and J. Teixeira, Twenty Years of Computational Science,” International Conference on Computational Science (ICCS 2020), Amsterdam, Netherlands, June 2020.  (149.66 KB)

Recent Conferences

  1. MAY
    -
    Hartwig Anzt
    Hartwig
    Jack Dongarra
    Jack
    Neil Lindquist
    Neil
    Piotr Luszczek
    Piotr
    Yaohung Tsai
    Mike
    Hartwig Anzt, Jack Dongarra, Neil Lindquist, Piotr Luszczek, Yaohung Tsai
  2. MAY
    -
    Yu Pei
    Yu
    Yu Pei
  3. JUN
    -
    ISC High Performance 2020 Zoom
    Hartwig Anzt
    Hartwig
    Heike Jagode
    Heike
    Jack Dongarra
    Jack
    Hartwig Anzt, Heike Jagode, Jack Dongarra

Upcoming Conferences

  1. JUL
    -
    Piotr Luszczek
    Piotr
    Piotr Luszczek
  2. JUL
    -
    PEARC20 Virtual
    Florent Lopez
    Florent
    Stanimire Tomov
    Stan
    Florent Lopez, Stanimire Tomov

Recent Lunch Talks

  1. MAY
    1
    Stanimire Tomov
    Stanimire Tomov
    HeFFTe: FFT-ECP API and High-Performance Library Prototype for Multidimensional FFTs on Large-Scale Heterogeneous Systems PDF
  2. MAY
    8
    Paula Olaya
    Paula Olaya
    Global Computing Laboratory
    Building Containerized Environments for Reproducibility and Traceability of Scientific Workflows PDF
  3. MAY
    15
    Jiali Li
    Jiali Li
    Optimizing all-to-all Operation with Awareness of Network Topology PDF
  4. MAY
    22
    Aurelien Bouteiller
    Aurelien Bouteiller
    Here's What's Fresh with MPI-4 PDF
  5. MAY
    29
    Azzam Haidar
    Azzam Haidar
    NVIDIA
    How CUDA Math Libraries Can Help You Unleash the Power of the New NVIDIA A100 GPU
  6. JUN
    5
    Frank Winkler
    Frank Winkler
    PIKA: Center-Wide and Job-Aware Cluster Monitoring PDF
  7. JUN
    12
    Hartwig Anzt
    Hartwig Anzt
    Porting Linear Algebra Libraries to the AMD Ecosystem PDF
  8. JUN
    19
    Michael Wyatt
    Michael Wyatt
    Global Computing Laboratory
    AI4IO: A Suite of AI-Based Tools for IO-Aware HPC Resource Management
  9. JUN
    26
    Tony Castaldo
    Tony Castaldo
    Controlling Power on NVIDIA and AMD GPUs

Upcoming Lunch Talks

  1. JUL
    10
    Marcus RitterFelix Wolf
    Marcus Ritter and Felix Wolf
    Technical University of Darmstadt
    Learning Cost-Effective Sampling Strategies for Empirical Performance Modeling PDF
  2. JUL
    17
    Joan Snoderly
    Joan Snoderly
    Getting Started with Concur: UT's Online Self-Service Travel Booking Tool PDF
  3. JUL
    24
    Richard Archibald
    Richard Archibald
    Oak Ridge National Laboratory
    Inverse Modeling for Experimental Science and Data Analytics PDF
  4. JUL
    31
    Dalal Sukkari
    Dalal Sukkari
    Leveraging Task-Based Polar Decomposition Using SLATE on Massively Parallel Systems with Hardware Accelerators

People

  1. John Batson
    John Batson graduated from UTK/ICL in May, where he earned his MA in English. John has now joined Piper Communications where he works as a Technical Editor. Congratulations on all counts, John!

congratulations

Silicon Valley Stories

ICL alum and entrepreneur Adam Beguelin has published his first non-technical book.
Silicon Valley Stories includes true accounts from inside Inktomi, AOL, Truveo, and a handful of other Silicon Valley startups. Congratulations, Adam!

Dates to Remember

Coffee Chats @ 2:00 p.m.

Don’t forget to attend ICL’s daily 2:00 p.m. coffee chats on Zoom!

ICL Friday Talks

Don’t forget: ICL Friday Talks are up and running on Zoom!