ICL Newsletter

News and Announcements

Jack Earns HPDC Achievement Award

Jack Dongarra has been honored for his groundbreaking contributions to HPC with the 2016 High Performance Parallel and Distributed Computing (HPDC) Achievement Award. Jack recently accepted the award at the 25th annual Association for Computing Machinery’s HPDC conference in Kyoto, Japan.

The HPDC conference established the annual achievement award to recognize leading computer scientists who have made long-lasting and influential contributions to the foundation or practice of high-performance parallel and distributed computing and for seminal contributions and a sustained record of high-impact in the field.

The award reflects achievements in one or more of the following areas:

Conceptual advances that have influenced the design or operation of HPDC systems or applications.
Innovative techniques or tools for the design or analysis of HPDC systems or applications.
The design, implementation, and deployment of HPDC systems, applications, and innovative components.
The analysis of innovative HPDC systems or applications including components.

“It’s an honor to have our pioneering work in the area of high performance computing recognized by the HPDC community,” Dongarra said. “At the same time, the award calls attention to the impact a long-term commitment to experimental computer science work can have on scientific discovery in domains ranging from economics to high energy physics and from human health to geology.”

Congratulations, Jack!

Dongarra named ISC ’17 Program Chair

jack-2013-web-tiny
On May 23rd, the ISC Group named ICL’s Jack Dongarra as the Program Chairman for ISC 2017. The International Supercomputing Conference (ISC High Performance) is held annually in Germany as the European counterpart to SC. As the program chair, Jack will take a leading roll with the ISC program team in defining the 2017 program. In addition, Jack will be working with the steering committee in a longer multi-year effort to further increase the exposure of the ISC High Performance endeavor and of the HPC community in general. Congratulations, Jack!

Conference Reports

Batched BLAS Workshop

On May 18-19, 2016, ICL hosted the Workshop on Batched, Reproducible, and Reduced Precision BLAS (BBLAS) in Knoxville, TN. This workshop focused on extending the Basic Linear Algebra Software Library (BLAS) to provide greater parallelism for small size operations, reproducibility, and reduced precision support, and serves as the beginning of a forum to discuss and formalize details related to batched, reproducible, and reduced precision BLAS. As one might imagine, many ICLers—mostly from the Numerical Linear Algebra group—were on hand to present their latest work with BLAS. A full agenda with slides is available on the workshop website.

BBLAS also served as a reunion of sorts since several of the participants have spent time at ICL throughout the years, including Sven Hammarling, Greg Henry, Hatem Ltaief, and Clint Whaley.

TESSE Group Meeting

On May 4, 2016, ICL’s George Bosilca, Amina Guermouche, and Thomas Herault went to New York, New York as part of the annual meeting for the collaborative TESSE project.

The goal of TESSE (Task-based Environment for Scientific Simulation at Extreme Scale) is to use an application-driven design to create a general-purpose production-quality software framework that attacks the twin challenges of programmer productivity and portable performance for advanced scientific applications on massively-parallel, hybrid, many-core systems of today and tomorrow.

The TESSE team is composed of researchers from Stonybrook, Virginia Tech, and ICL/UTK. Pictured above from left-to-right: Scott Thornton (Stonybrook), George Bosilca (ICL), Ed Valeev (Virginia Tech), Robert Harrison (Stonybrook), Amina Guermouche (ICL), Thomas Herault (ICL), and Mahdi Javanamard (Stonybrook).

IPDPS 2016

On May 23-27, Piotr Luszczek, George Bosilca, and Ahmad Ahmad made their way to Chicago, Illinois for the 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS). IPDPS is an international forum for engineers and scientists from around the world to present their latest research findings in all aspects of parallel computation. In addition to technical sessions of submitted paper presentations, the meeting offers workshops, tutorials, and commercial presentations and exhibits.

Piotr participated in the Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES) on Monday, where he contributed to a tutorial on “Heterogeneous Streaming,” and then presented a paper, “Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures.” Piotr also presented Hartwig’s paper on “Efficiency of General Krylov Methods on GPUs – An Experimental Study,” and served as chair for the Workload Scheduling session. On Friday, Piotr took part in the 11th International Workshop on Automatic Performance Tuning (iWAPT) where he presented the paper, “Search Space Generation and Pruning System for Autotuners.”

George gave a keynote presentation, “Undisrupted Applications,” at the 17th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC). Ahmad was also at PDSEC where he presented his paper, “On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures.”

Interview

Where are you from, originally?

I am from Bel Air, Maryland which is just north of Baltimore.

Can you summarize your educational background?

I completed my undergraduate studies at a small school in Maryland called Salisbury University with a major in Computer Science and a minor in Mathematics.

Tell us how you first learned about ICL.

I was looking into schools with strong HPC backgrounds and I found UTK. When I was looking into professors to study under, I saw Dr. Dongarra and ICL and I thought it would be a great fit.

What made you want to work for ICL?

I wanted to improve my knowledge of HPC and eventually become an expert. I have found that ICL is definitely an ideal place for me to learn.

What are you working on while at ICL?

Right now I am primarily working on code profiling. More specifically, I am working on the profiling system within PaRSEC to allow for PAPI performance counters to be used during execution. Recently I started working on a PAPI component that will allow users to track software-based events from outside libraries such as OpenMPI.

If you weren’t working at ICL, where would you like to be working and why?

Since I am still a graduate student, if I weren’t working at ICL, I would like to work as a TA because I want to teach at the university level someday.

What are your interests/hobbies outside work?

I enjoy watching movies and TV shows, particularly anime. I also like playing video games, doing video editing, and reading. One of my favorite books right now is The Name of the Wind by Patrick Rothfuss. I also love whistling, and as it turns out, I will be competing in the World Whistlers Convention this summer in Japan.

Tell us something about yourself that might surprise people.

Before choosing Computer Science for my major, I wanted to pursue English and eventually write short stories and novels professionally.

Recent Papers

Anzt, H., J. Dongarra, M. Kreutzer, G. Wellein, and M. Kohler, “Efficiency of General Krylov Methods on GPUs â An Experimental Study,” 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), pp. 683-691, May 2016. DOI: 10.1109/IPDPSW.2016.45
Anzt, H., J. Dongarra, M. Kreutzer, G. Wellein, and M. Kohler, “Efficiency of General Krylov Methods on GPUs â An Experimental Study,” The Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), Chicago, IL, IEEE, May 2016. DOI: 10.1109/IPDPSW.2016.45 (285.28 KB)
Jia, Y., P. Luszczek, and J. Dongarra, “Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures,” 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS), Chicago, IL, IEEE, May 2016. (535.72 KB)
Newburn, C. J., G. Bansal, M. Wood, L. Crivelli, J. Planas, A. Duran, P. Souza, L. Borges, P. Luszczek, S. Tomov, et al., “Heterogeneous Streaming,” The Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), IPDPS 2016, Chicago, IL, IEEE, May 2016. (2.73 MB)
Abdelfattah, A., H. Anzt, J. Dongarra, M. Gates, A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, , and A. YarKhan, “Linear Algebra Software for Large-Scale Accelerated Multicore Computing,” Acta Numerica, vol. 25, pp. 1-160, May 2016. DOI: 10.1017/S0962492916000015
Abdelfattah, A., A. Haidar, S. Tomov, and J. Dongarra, “On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures,” The 17th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2016), IPDPS 2016, Chicago, IL, IEEE, May 2016. (708.62 KB)
Benoit, A., A. Cavelan, Y. Robert, and H. Sun, “Optimal Resilience Patterns to Cope with Fail-stop and Silent Errors,” 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Chicago, IL, IEEE, May 2016. DOI: 10.1109/IPDPS.2016.39 (603.58 KB)
Haugen, B., “Performance Analysis and Modeling of Task-Based Runtimes,” Department of Electrical Engineering and Computer Science, vol. PhD, Knoxville, University of Tennessee, May 2016. (5.14 MB)
Abdelfattah, A., H. Ltaeif, D. Keyes, and J. Dongarra, “Performance optimization of Sparse Matrix-Vector Multiplication for multi-component PDE-based applications using GPUs,” Concurrency and Computation: Practice and Experience, vol. 28, issue 12, pp. 3447 - 3465, May 2016. DOI: 10.1002/cpe.v28.1210.1002/cpe.3874 (3.21 MB)
Luszczek, P., M. Gates, J. Kurzak, A. Danalis, and J. Dongarra, “Search Space Generation and Pruning System for Autotuners,” 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS), Chicago, IL, IEEE, May 2016. (555.44 KB)
Wu, W., G. Bosilca, R. vandeVaart, S. Jeaugey, and J. Dongarra, “GPU-Aware Non-contiguous Data Movement In Open MPI,” 25th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'16), Kyoto, Japan, ACM, June 2016. DOI: http://dx.doi.org/10.1145/2907294.2907317 (482.32 KB)
Abdelfattah, A., M. Baboulin, V. Dobrev, J. Dongarra, C. Earl, J. Falcou, A. Haidar, I. Karlin, T. Kolev, I. Masliah, et al., “High-Performance Tensor Contractions for GPUs,” International Conference on Computational Science (ICCS'16), San Diego, CA, June 2016. (2.36 MB)
Abdelfattah, A., A. Haidar, S. Tomov, and J. Dongarra, “Performance Tuning and Optimization Techniques of Fixed and Variable Size Batched Cholesky Factorization on GPUs,” International Conference on Computational Science (ICCS'16), San Diego, CA, June 2016. (626.21 KB)
Abdelfattah, A., A. Haidar, S. Tomov, and J. Dongarra, “Performance, Design, and Autotuning of Batched GEMM for GPUs,” The International Supercomputing Conference (ISC High Performance 2016), Frankfurt, Germany, June 2016. (1.27 MB)
YarKhan, A., J. Kurzak, P. Luszczek, and J. Dongarra, “Porting the PLASMA Numerical Library to the OpenMP Standard,” International Journal of Parallel Programming, June 2016. DOI: 10.1007/s10766-016-0441-6 (1.66 MB)
Dongarra, J., “Report on the Sunway TaihuLight System,” University of Tennessee Computer Science Technical Report, no. UT-EECS-16-742: University of Tennessee, June 2016.

Recent Conferences

MAY
2

2016 GPU Hackathon Univ. of Delaware Newark, Delaware
Piotr

Piotr Luszczek
MAY
4

TESSE Meeting New York, New York
Amina
George
Thomas

Amina Guermouche, George Bosilca, Thomas Herault
MAY
18

11th Scheduling for Large Scale Systems Workshop Nashville, TN, Tennessee
Amina
Aurelien
George
Ichitaro
Thomas

Amina Guermouche, Aurelien Bouteiller, George Bosilca, Ichitaro Yamazaki, Thomas Herault
MAY
18-19

Workshop on Batched, Reproducible, and Reduced Precision BLAS Knoxville, TN
Ahmad
Azzam
Hartwig
Jack
Jakub
Mark
Piotr
Stan
Terry

Ahmad Ahmad, Azzam Haidar, Hartwig Anzt, Jack Dongarra, Jakub Kurzak, Mark Gates, Piotr Luszczek, Stanimire Tomov, Terry Moore
MAY
23

International Parallel & Distributed Processing Symposium Chicago, Illinois
George
Piotr

George Bosilca, Piotr Luszczek
MAY
27

Parallel and Distributed Scientific and Engineering Computing (PDSEC 2016) Chicago, Illinois
Ahmad

Ahmad Abdelfattah Ahmad
JUN
6

SC16 Program Committee Meeting Houston, Texas
Heike
Thomas

Heike Jagode, Thomas Herault
JUN
6

International Conference on Computational Science (ICCS) San Diego, California
Stan

Stanimire Tomov
JUN
6

MPI Forum WA Bellevue, Washington
Aurelien

Aurelien Bouteiller
JUN
12

ADAC Workshop Luguano, Switzerland
Azzam
Jack
Jakub

Azzam Haidar, Jack Dongarra, Jakub Kurzak
JUN
15-18

Workshop on Big Data and Extreme Scale Computing (BDEC) Frankfurt, Germany
Jack
Terry
Tracy

Jack Dongarra, Terry Moore, Tracy Rafferty
JUN
18-23

International Supercomputing Conference Frankfurt, Germany
Azzam
Jack
Jakub
Terry

Azzam Haidar, Jack Dongarra, Jakub Kurzak, Terry Moore

Upcoming Conferences

JUL
6

PMAA Int. Workshop on Parallel Matrix Algorithms and Applications Bordeaux, France
Mark

Mark Gates

Recent Lunch Talks

MAY
4
Yaohung Tsai
AlphaGo: The Go AI from Google DeepMind
MAY
6
Oleg Shylo
Department of Industrial & Systems Engineering at UTK
Scalable Communication for Parallel Optimization
MAY
13
George Bosilca
PaRSEC - Yet another runtime? PDF
MAY
20
Iain Duff
the Numerical Analysis Group at the Scientific Computing Department of the Science and Technology Facilities Council (UK)
Scalability of Sparse Direct Codes PDF
MAY
27
Azzam Haidar
Heterogeneous Computation: The Current Challenge
JUN
10
Emmanuel Agullo
INRIA
Overview of Task-based Sparse and Data-sparse Solvers on Top of Runtime Systems
JUN
17
Julien Langou
University of Colorado
A Makespan Lower Bound for the Scheduling of the Tiled Cholesky Factorization based on ALAP scheduling PDF

Upcoming Lunch Talks

JUL
1
Emmanuel Jeannot
INRIA
Topology-Aware Data Management PDF
JUL
22
Myungho Lee
Soongsil University
Memory-Efficient Parallelization of 3D Lattice Boltzmann Flow Solver on a GPU

People

ICL Alumnus and recent PhD recipient Dr. Blake Haugen accepted a position as Machine Learning Engineer at PYA Analytics in Knoxville, TN. Congratulations, Blake!

Dates to Remember

2016 ICL Retreat

The 2016 ICL retreat has been set for August 15th – 16th at the RT Lodge in Maryville, TN. Mark your calendars.

June 2016