News and Announcements

Jack Dongarra in Manchester

As a Turing Fellow at the University of Manchester for the past several years, Jack has had the opportunity to spend part of each summer interacting with the people in the University’s schools of Mathematics and Computer Science.  Below are two pictures from this summer’s visit. The first shows Sven Hammarling, his wife Pam (center), and Magaret Wright from NYU, all enjoying a salubrious libation in celebration of Sven’s seventieth birthday.

The second photo shows Jack and Iain Duff finishing a “10 + 2” year old bottle of Scotch.  Both Iain and Sven are two of Jack’s long-time friends, and have often collaborated with Jack and ICL.

SAAHPC ’11

Mitch Horton and Stanimire Tomov recently attended the Symposium on Application Accelerators in High Performance Computing (SAAHPC), on July 19-20, in Knoxville.  As the name indicates, the majority of presentations at the conference focused on the use of accelerators, namely Nvidia GPUs, to increase performance in scientific computing.  Greg Peterson, from UT’s Electrical Engineering and Computer Science department, was the symposium chair, and Pradeep Dubey, from Intel, gave the keynote address.

Mitch’s presentation and paper, “A Class of Hybrid LAPACK Algorithms for Multicore and GPU Architectures,” describes what is essentially an extension to the current release of MAGMA for QR, LU, and Cholesky factorizations, in which an attempt to fully utilize all available CPU cores is made while using a single GPU on a single node.

The Exaflops Target

In the August issue of Communications of the ACM, Jack Dongarra weighs in on the goal of exascale supercomputing.  The article, which also discusses Asia’s rise to prominence in high performance computing, gets down to brass tacks when talking about the potential energy consumption of an exascale system.  An analogy is also established between the jump from teraflops to petaflops, which took roughly a decade, and the challenging jump to exascale.  Can a 1,000-fold leap in performance be repeated, despite limitations in software parallelism and energy efficiency?  The race is on to find out.

Piotr Makes the News

Our very own Piotr Luszczek made the front page of the July 26th issue of The Daily Beacon for his work on the LINPACK Benchmark app for iOS. Specifically, Piotr is trying to break one gigaflops on the iPad 2 by optimizing the LINPACK Benchmark to run more efficiently on the Apple device.

This just in: On August 1st, “just before lunch,” Piotr successfully broke the one gigaflops barrier on the iPad 2, using the LINPACK Benchmark app!

The End of TeraGrid and the Dawn of XSEDE

Shirley and Piotr recently attended TeraGrid’11 which was held in downtown Salt Lake City the week of July 18. Piotr helped with a FutureGrid tutorial on the 18th and also graciously carried Shirley’s PAPI poster to SLC, given that Shirley was staying 4+ miles away due to not being able to get into the conference hotel block. Shirley co-authored a paper with ICL alumnus Haihang You, who then presented the paper in the Tuesday afternoon Science and Technology session. Shirley also presented a poster on new features of PAPI in the Tuesday evening poster session.

Much of the talk during the conference was about TeraGrid ending and being replaced by Extreme Science and Engineering Digital Environment (XSEDE). In fact, you’re not supposed to go to the TeraGrid website anymore. Instead, the latest and greatest info is to be found on XSEDE’s website. A big emphasis with XSEDE is including a wider range of resources in the cyberinfrastructure, not just supercomputers, but also scientific instruments, data collections, etc., and making it all easy to access and use together, as well as broadening the user base. Consequently, projects such as Eclipse Parallel Tools Platform and Science Gateways are getting a lot of NSF funding and attention. But that’s not to say that good old-fashioned performance measurement and optimization aren’t still important. There were plenty of visitors at the PAPI poster, including several NSF program managers. Most were already familiar with PAPI, and were just interested in hearing about recent developments, but a few newbies stopped by who didn’t know anything about it yet.

All in all, it was an information- and fun-filled week, including an excursion to the Clark Planetarium with educational IMAX movies on black holes and such things. Since it was summertime, many attendees brought spouses and/or kids as well. I guess we’ll look forward next year to XSEDE’12.

ICL Retreat 2011

This year’s ICL retreat will be held on August 11-12 at the Buckberry Lodge in Gatlinburg.   Please talk to Leighanne about transportation if you would like to carpool.

The Lodge at Buckberry Creek
961 Campbell Lead Road
Gatlinburg, TN 37738

Click here for directions or use the map below.


View Larger Map

Recent Releases

PLASMA 2.4.1 Released

The Parallel Linear Algebra for Scalable Multi-core Architectures (PLASMA) 2.4.1 release is now available for download.

This package contains two major bug fixes:

  • Fixed bug with Fujitsu compiler reported on the forum.
  • Unbind threads in PLASMA_Finalize to avoid problem of binding in OpenMP section following PLASMA calls (still possible on Mac and AIX without hwloc). A better fix is to create the OpenMP thread in the user code before any call to PLASMA, thanks to a fake parallel section.

See the software page to download the tarball.

QUARK Users’ Guide Released

The QUeueing And Runtime for Kernels (QUARK) Users’ Guide has been released.  QUARK is used for scheduling in both the PLASMA and MAGMA projects.  The guide is available as a PDF here.

Recent Papers

  1. Horton, M., S. Tomov, and J. Dongarra, A Class of Hybrid LAPACK Algorithms for Multicore and GPU Architectures,” Symposium for Application Accelerators in High Performance Computing (SAAHPC'11), Knoxville, TN, July 2011.  (329.68 KB)
  2. Baboulin, M., J. Dongarra, J. Herrmann, and S. Tomov, Accelerating Linear System Solutions Using Randomization Techniques,” INRIA RR-7616 / LAWN #246 (presented at International AMMCS’11), Waterloo, Ontario, Canada, July 2011.  (358.79 KB)
  3. You, H., B. Rekapalli, Q. Liu, and S. Moore, Autotuned Parallel I/O for Highly Scalable Biosequence Analysis,” TeraGrid'11, Salt Lake City, Utah, July 2011.  (275.34 KB)
  4. White, J. B., and J. Dongarra, High-Performance High-Resolution Semi-Lagrangian Tracer Transport on a Sphere,” Journal of Computational Physics, vol. 230, issue 17, pp. 6778-6799, July 2011. DOI: 10.1016/j.jcp.2011.05.008  (1.68 MB)
  5. Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Soft Error Resilient QR Factorization for Hybrid System,” UT-CS-11-675 (also LAPACK Working Note #252), no. ICL-CS-11-675, July 2011.  (1.39 MB)
  6. Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Soft Error Resilient QR Factorization for Hybrid System,” University of Tennessee Computer Science Technical Report, no. UT-CS-11-675, Knoxville, TN, July 2011.  (1.39 MB)
  7. Du, P., A. Bouteiller, G. Bosilca, T. Herault, and J. Dongarra, Algorithm-based Fault Tolerance for Dense Matrix Factorizations,” University of Tennessee Computer Science Technical Report, no. UT-CS-11-676, Knoxville, TN, August 2011.  (865.79 KB)
  8. Bouteiller, A., T. Herault, G. Bosilca, and J. Dongarra, Correlated Set Coordination in Fault Tolerant Message Logging Protocols,” Proceedings of 17th International Conference, Euro-Par 2011, Part II, vol. 6853, Bordeaux, France, Springer, pp. 51-64, August 2011.  (486.68 KB)
  9. Luszczek, P., E. Meek, S. Moore, D. Terpstra, V. M. Weaver, and J. Dongarra, Evaluation of the HPC Challenge Benchmarks in Virtualized Environments,” 6th Workshop on Virtualization in High-Performance Cloud Computing, Bordeaux, France, August 2011.  (114.73 KB)
  10. Vetter, J., R. Glassbrook, J. Dongarra, K. Schwan, B. Loftis, S. McNally, J. Meredith, J. Rogers, P. Roth, K. Spafford, et al., Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community,” IEEE Computing in Science & Engineering, vol. 13, issue 5, pp. 90-95, August 2011. DOI: 10.1109/MCSE.2011.83  (932.57 KB)
  11. Tomov, S., and J. Dongarra, MAGMA - LAPACK for HPC on Heterogeneous Architectures , Oak Ridge, TN, Titan Summit at Oak Ridge National Laboratory, Presentation, August 2011.  (20.43 MB)
  12. Haidar, A., H. Ltaeif, and J. Dongarra, Parallel Reduction to Condensed Forms for Symmetric Eigenvalue Problems using Aggregated Fine-Grained and Memory-Aware Kernels,” University of Tennessee Computer Science Technical Report, UT-CS-11-677, (also Lawn254), August 2011.  (636.01 KB)

Recent Lunch Talks

  1. JUL
    8
    Vince Weaver
    Vince Weaver
    Evaluation of the HPC Challenge Benchmarks in Virtualized Environments PDF
  2. JUL
    15
    Teng Ma
    Teng Ma
    Kernel Assisted MPI Communication on Multi-core Clusters PDF
  3. JUL
    22
    Yiannis Cotronis
    Yiannis Cotronis
    University of Athens
    Non-Intrusive Collection and Management of Data Provenance in e-Science Workflows PDF
  4. JUL
    29
    Shirley MooreAdriana Garties
    Shirley Moore and Adriana Garties
    ICL/REU
    Autotuned Parallel I/O for Highly Scalable Biosequence Analysis PDF
  5. AUG
    5
    Thomas Herault
    Thomas Herault
    On Scalability for MPI Runtime Systems
  6. AUG
    19
    Phil Mucci
    Phil Mucci
    Measuring I/O in Virtualized Environments PDF
  7. AUG
    26
    Tracy Rafferty
    Tracy Rafferty
    Travel

Upcoming Lunch Talks

  1. SEP
    2
    Vladimir VoevodinVictor Gergel
    Vladimir Voevodin and Victor Gergel
    Russia
    Perspectives for HPC Infrastructure in Russia PDF
  2. SEP
    9
    Tingxing
    Tingxing "Tim" Dong
    Acceleration of the BLAST hydro code on GPU PDF
  3. SEP
    16
    Hartwig Anzt
    Hartwig Anzt
    Energy-Efficient High-Performance-Computing PDF
  4. SEP
    23
    Piotr Luszczek
    Piotr Luszczek
    Energy footprint for the LINPACK benchmark from supercomputers to tablet devices PDF
  5. SEP
    30
    Alexander Gaenko
    Alexander Gaenko
    Assitant scientist at Ames Laboratory of US DOE, Ames, IA
    Let's Work Together: New Computational Science via Re-implementation, Componentization, Parallelization

People

  1. Mark Gates
    Mark Gates, a Post Doctoral Research Associate, joins ICL on August 8th. He will be working on the PLASMA and MAGMA projects.
  2. Hartwig Anzt
    Hartwig Anzt, from the Karlsruhe Institute of Technology in Germany, will be visiting ICL from September 1st through the end of December.
  3. Raffaele Solca
    Raffaele Solca, a repeat ICL visitor from Switzerland, is working with Stan and the rest of the MAGMA team through the end of August.
  4. Tim Poore
    Tim Poore joins ICL on August 1st, and will be working as a consultant on the PAPI-V project.
  5. Ichitaro Yamazaki
    Ichitaro Yamazaki joins ICL on August 1st as a Senior Research Associate.  He will be working on the PLASMA and MAGMA projects.
  6. New Students
    New Students
    ICL would like to welcome eight new students arriving for the fall semester!
    • Anthony Canino, pursuing PhD
    • Chongxiao Cao, pursuing PhD
    • Peter Gaultney, pursuing PhD
    • Matt Johnson, pursuing PhD
    • Vijay Joshi, pursuing MS
    • Khairul Kabir, pursuing PhD
    • John Nelson, pursuing MS
    • Wei Wu, pursuing PhD