News and Announcements

Jack Earns HPDC Achievement Award

jack_hpdc

Jack Dongarra has been honored for his groundbreaking contributions to HPC with the 2016 High Performance Parallel and Distributed Computing (HPDC) Achievement Award. Jack recently accepted the award at the 25th annual Association for Computing Machinery’s HPDC conference in Kyoto, Japan.

The HPDC conference established the annual achievement award to recognize leading computer scientists who have made long-lasting and influential contributions to the foundation or practice of high-performance parallel and distributed computing and for seminal contributions and a sustained record of high-impact in the field.

The award reflects achievements in one or more of the following areas:

  • Conceptual advances that have influenced the design or operation of HPDC systems or applications.
  • Innovative techniques or tools for the design or analysis of HPDC systems or applications.
  • The design, implementation, and deployment of HPDC systems, applications, and innovative components.
  • The analysis of innovative HPDC systems or applications including components.

“It’s an honor to have our pioneering work in the area of high performance computing recognized by the HPDC community,” Dongarra said. “At the same time, the award calls attention to the impact a long-term commitment to experimental computer science work can have on scientific discovery in domains ranging from economics to high energy physics and from human health to geology.”

Congratulations, Jack!

Dongarra named ISC ’17 Program Chair

jack-2013-web-tiny
On May 23rd, the ISC Group named ICL’s Jack Dongarra as the Program Chairman for ISC 2017. The International Supercomputing Conference (ISC High Performance) is held annually in Germany as the European counterpart to SC. As the program chair, Jack will take a leading roll with the ISC program team in defining the 2017 program. In addition, Jack will be working with the steering committee in a longer multi-year effort to further increase the exposure of the ISC High Performance endeavor and of the HPC community in general. Congratulations, Jack!

Conference Reports

Batched BLAS Workshop

BBLAS_group

On May 18-19, 2016, ICL hosted the Workshop on Batched, Reproducible, and Reduced Precision BLAS (BBLAS) in Knoxville, TN. This workshop focused on extending the Basic Linear Algebra Software Library (BLAS) to provide greater parallelism for small size operations, reproducibility, and reduced precision support, and serves as the beginning of a forum to discuss and formalize details related to batched, reproducible, and reduced precision BLAS. As one might imagine, many ICLers—mostly from the Numerical Linear Algebra group—were on hand to present their latest work with BLAS. A full agenda with slides is available on the workshop website.

BBLAS also served as a reunion of sorts since several of the participants have spent time at ICL throughout the years, including Sven Hammarling, Greg Henry, Hatem Ltaief, and Clint Whaley.

TESSE Group Meeting

TESSE_dinner

On May 4, 2016, ICL’s George Bosilca, Amina Guermouche, and Thomas Herault went to New York, New York as part of the annual meeting for the collaborative TESSE project.

The goal of TESSE (Task-based Environment for Scientific Simulation at Extreme Scale) is to use an application-driven design to create a general-purpose production-quality software framework that attacks the twin challenges of programmer productivity and portable performance for advanced scientific applications on massively-parallel, hybrid, many-core systems of today and tomorrow.

The TESSE team is composed of researchers from Stonybrook, Virginia Tech, and ICL/UTK. Pictured above from left-to-right: Scott Thornton (Stonybrook), George Bosilca (ICL), Ed Valeev (Virginia Tech), Robert Harrison (Stonybrook), Amina Guermouche (ICL), Thomas Herault (ICL), and Mahdi Javanamard (Stonybrook).

IPDPS 2016

  • dsc01338
  • dsc01337
  • dsc01335
  • dsc01334
  • dsc01333
  • dsc01331
  • dsc01329
  • dsc01258
  • dsc01257
  • dsc01255
  • dsc01254
  • dsc01253
  • dsc01251
  • dsc01250

 

On May 23-27, Piotr Luszczek, George Bosilca, and Ahmad Ahmad made their way to Chicago, Illinois for the 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS). IPDPS is an international forum for engineers and scientists from around the world to present their latest research findings in all aspects of parallel computation. In addition to technical sessions of submitted paper presentations, the meeting offers workshops, tutorials, and commercial presentations and exhibits.

Piotr participated in the Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES) on Monday, where he contributed to a tutorial on “Heterogeneous Streaming,” and then presented a paper, “Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures.” Piotr also presented Hartwig’s paper on “Efficiency of General Krylov Methods on GPUs – An Experimental Study,” and served as chair for the Workload Scheduling session. On Friday, Piotr took part in the 11th International Workshop on Automatic Performance Tuning (iWAPT) where he presented the paper, “Search Space Generation and Pruning System for Autotuners.”

George gave a keynote presentation, “Undisrupted Applications,” at the 17th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC). Ahmad was also at PDSEC where he presented his paper, “On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures.”

Interview

David Eberius Then

David Eberius

Where are you from, originally?

I am from Bel Air, Maryland which is just north of Baltimore.

Can you summarize your educational background?

I completed my undergraduate studies at a small school in Maryland called Salisbury University with a major in Computer Science and a minor in Mathematics.

Tell us how you first learned about ICL.

I was looking into schools with strong HPC backgrounds and I found UTK. When I was looking into professors to study under, I saw Dr. Dongarra and ICL and I thought it would be a great fit.

What made you want to work for ICL?

I wanted to improve my knowledge of HPC and eventually become an expert. I have found that ICL is definitely an ideal place for me to learn.

What are you working on while at ICL?

Right now I am primarily working on code profiling. More specifically, I am working on the profiling system within PaRSEC to allow for PAPI performance counters to be used during execution. Recently I started working on a PAPI component that will allow users to track software-based events from outside libraries such as OpenMPI.

If you weren’t working at ICL, where would you like to be working and why?

Since I am still a graduate student, if I weren’t working at ICL, I would like to work as a TA because I want to teach at the university level someday.

What are your interests/hobbies outside work?

I enjoy watching movies and TV shows, particularly anime. I also like playing video games, doing video editing, and reading. One of my favorite books right now is The Name of the Wind by Patrick Rothfuss. I also love whistling, and as it turns out, I will be competing in the World Whistlers Convention this summer in Japan.

Tell us something about yourself that might surprise people.

Before choosing Computer Science for my major, I wanted to pursue English and eventually write short stories and novels professionally.

Recent Papers

  1. Anzt, H., J. Dongarra, M. Kreutzer, G. Wellein, and M. Kohler, Efficiency of General Krylov Methods on GPUs – An Experimental Study,” The Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), Chicago, IL, IEEE, May 2016. DOI: 10.1109/IPDPSW.2016.45  (285.28 KB)
  2. Anzt, H., J. Dongarra, M. Kreutzer, G. Wellein, and M. Kohler, Efficiency of General Krylov Methods on GPUs – An Experimental Study,” 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), pp. 683-691, May 2016. DOI: 10.1109/IPDPSW.2016.45
  3. Jia, Y., P. Luszczek, and J. Dongarra, Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures,” 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS), Chicago, IL, IEEE, May 2016.  (535.72 KB)
  4. Newburn, C. J., G. Bansal, M. Wood, L. Crivelli, J. Planas, A. Duran, P. Souza, L. Borges, P. Luszczek, S. Tomov, et al., Heterogeneous Streaming,” The Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), IPDPS 2016, Chicago, IL, IEEE, May 2016.  (2.73 MB)
  5. Abdelfattah, A., H. Anzt, J. Dongarra, M. Gates, A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, , and A. YarKhan, Linear Algebra Software for Large-Scale Accelerated Multicore Computing,” Acta Numerica, vol. 25, pp. 1-160, May 2016. DOI: 10.1017/S0962492916000015
  6. Abdelfattah, A., A. Haidar, S. Tomov, and J. Dongarra, On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures,” The 17th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2016), IPDPS 2016, Chicago, IL, IEEE, May 2016.  (708.62 KB)
  7. Benoit, A., A. Cavelan, Y. Robert, and H. Sun, Optimal Resilience Patterns to Cope with Fail-stop and Silent Errors,” 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Chicago, IL, IEEE, May 2016. DOI: 10.1109/IPDPS.2016.39  (603.58 KB)
  8. Haugen, B., Performance Analysis and Modeling of Task-Based Runtimes,” Department of Electrical Engineering and Computer Science, vol. PhD, Knoxville, University of Tennessee, May 2016.  (5.14 MB)
  9. Abdelfattah, A., H. Ltaeif, D. Keyes, and J. Dongarra, Performance optimization of Sparse Matrix-Vector Multiplication for multi-component PDE-based applications using GPUs,” Concurrency and Computation: Practice and Experience, vol. 28, issue 12, pp. 3447 - 3465, May 2016. DOI: 10.1002/cpe.v28.1210.1002/cpe.3874  (3.21 MB)
  10. Luszczek, P., M. Gates, J. Kurzak, A. Danalis, and J. Dongarra, Search Space Generation and Pruning System for Autotuners,” 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS), Chicago, IL, IEEE, May 2016.  (555.44 KB)
  11. Wu, W., G. Bosilca, R. vandeVaart, S. Jeaugey, and J. Dongarra, GPU-Aware Non-contiguous Data Movement In Open MPI,” 25th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'16), Kyoto, Japan, ACM, June 2016. DOI: http://dx.doi.org/10.1145/2907294.2907317  (482.32 KB)
  12. Abdelfattah, A., M. Baboulin, V. Dobrev, J. Dongarra, C. Earl, J. Falcou, A. Haidar, I. Karlin, T. Kolev, I. Masliah, et al., High-Performance Tensor Contractions for GPUs,” International Conference on Computational Science (ICCS'16), San Diego, CA, June 2016.  (2.36 MB)
  13. Abdelfattah, A., A. Haidar, S. Tomov, and J. Dongarra, Performance Tuning and Optimization Techniques of Fixed and Variable Size Batched Cholesky Factorization on GPUs,” International Conference on Computational Science (ICCS'16), San Diego, CA, June 2016.  (626.21 KB)
  14. Abdelfattah, A., A. Haidar, S. Tomov, and J. Dongarra, Performance, Design, and Autotuning of Batched GEMM for GPUs,” The International Supercomputing Conference (ISC High Performance 2016), Frankfurt, Germany, June 2016.  (1.27 MB)
  15. YarKhan, A., J. Kurzak, P. Luszczek, and J. Dongarra, Porting the PLASMA Numerical Library to the OpenMP Standard,” International Journal of Parallel Programming, June 2016. DOI: 10.1007/s10766-016-0441-6  (1.66 MB)
  16. Dongarra, J., Report on the Sunway TaihuLight System,” University of Tennessee Computer Science Technical Report, no. UT-EECS-16-742: University of Tennessee, June 2016.

Recent Conferences

  1. MAY
    Piotr Luszczek
    Piotr
    Piotr Luszczek
  2. MAY
    TESSE Meeting New York, New York
    Amina Guermouche
    Amina
    George Bosilca
    George
    Thomas Herault
    Thomas
    Amina Guermouche, George Bosilca, Thomas Herault
  3. MAY
    Amina Guermouche
    Amina
    Aurelien Bouteiller
    Aurelien
    George Bosilca
    George
    Ichitaro Yamazaki
    Ichitaro
    Thomas Herault
    Thomas
    Amina Guermouche, Aurelien Bouteiller, George Bosilca, Ichitaro Yamazaki, Thomas Herault
  4. MAY
    -
    Ahmad Ahmad
    Ahmad
    Azzam Haidar
    Azzam
    Hartwig Anzt
    Hartwig
    Jack Dongarra
    Jack
    Jakub Kurzak
    Jakub
    Mark Gates
    Mark
    Piotr Luszczek
    Piotr
    Stanimire Tomov
    Stan
    Terry Moore
    Terry
    Ahmad Ahmad, Azzam Haidar, Hartwig Anzt, Jack Dongarra, Jakub Kurzak, Mark Gates, Piotr Luszczek, Stanimire Tomov, Terry Moore
  5. MAY
    George Bosilca
    George
    Piotr Luszczek
    Piotr
    George Bosilca, Piotr Luszczek
  6. MAY
    Ahmad Abdelfattah Ahmad
    Ahmad
    Ahmad Abdelfattah Ahmad
  7. JUN
    Heike Jagode
    Heike
    Thomas Herault
    Thomas
    Heike Jagode, Thomas Herault
  8. JUN
    Stanimire Tomov
    Stan
    Stanimire Tomov
  9. JUN
    MPI Forum WA Bellevue, Washington
    Aurelien Bouteiller
    Aurelien
    Aurelien Bouteiller
  10. JUN
    ADAC Workshop Luguano, Switzerland
    Azzam Haidar
    Azzam
    Jack Dongarra
    Jack
    Jakub Kurzak
    Jakub
    Azzam Haidar, Jack Dongarra, Jakub Kurzak
  11. JUN
    -
    Jack Dongarra
    Jack
    Terry Moore
    Terry
    Tracy Rafferty
    Tracy
    Jack Dongarra, Terry Moore, Tracy Rafferty
  12. JUN
    -
    Azzam Haidar
    Azzam
    Jack Dongarra
    Jack
    Jakub Kurzak
    Jakub
    Terry Moore
    Terry
    Azzam Haidar, Jack Dongarra, Jakub Kurzak, Terry Moore

Upcoming Conferences

  1. JUL
    Mark Gates
    Mark
    Mark Gates

Recent Lunch Talks

  1. MAY
    4
    Yaohung Tsai
    Yaohung Tsai
    AlphaGo: The Go AI from Google DeepMind
  2. MAY
    6
    Oleg Shylo
    Oleg Shylo
    Department of Industrial & Systems Engineering at UTK
    Scalable Communication for Parallel Optimization
  3. MAY
    13
    George Bosilca
    George Bosilca
    PaRSEC - Yet another runtime? PDF
  4. MAY
    20
    Iain Duff
    Iain Duff
    the Numerical Analysis Group at the Scientific Computing Department of the Science and Technology Facilities Council (UK)
    Scalability of Sparse Direct Codes PDF
  5. MAY
    27
    Azzam Haidar
    Azzam Haidar
    Heterogeneous Computation: The Current Challenge
  6. JUN
    10
    Emmanuel Agullo
    Emmanuel Agullo
    INRIA
    Overview of Task-based Sparse and Data-sparse Solvers on Top of Runtime Systems
  7. JUN
    17
    Julien Langou
    Julien Langou
    University of Colorado
    A Makespan Lower Bound for the Scheduling of the Tiled Cholesky Factorization based on ALAP scheduling PDF

Upcoming Lunch Talks

  1. JUL
    1
    Emmanuel Jeannot
    Emmanuel Jeannot
    INRIA
    Topology-Aware Data Management PDF
  2. JUL
    22
    Myungho Lee
    Myungho Lee
    Soongsil University
    Memory-Efficient Parallelization of 3D Lattice Boltzmann Flow Solver on a GPU

People

  1. Blake Haugen
    ICL Alumnus and recent PhD recipient Dr. Blake Haugen accepted a position as Machine Learning Engineer at PYA Analytics in Knoxville, TN. Congratulations, Blake!

Dates to Remember

2016 ICL Retreat

The 2016 ICL retreat has been set for August 15th – 16th at the RT Lodge in Maryville, TN. Mark your calendars.