ICL Newsletter

News and Announcements

Titan Summit at ORNL

The Oak Ridge Leadership Computing Facility (OLCF) hosted a summit on August 15-17th, at Oak Ridge National Laboratory (ORNL), which provided a forum for current and prospective users to meet with OLCF staff and industry vendors to discuss Titan, the next-generation leadership-class computational resource. Titan, built by Cray, will utilize the upcoming “Kepler” GPU co-processor from NVIDIA.

Users were encouraged to present their application plans for exascale computing, and our own Stan Tomov presented ICL’s efforts with the MAGMA project. His talk, “MAGMA: LAPACK for HPC on heterogeneous architectures,” covered what ICL provides, through MAGMA, for the exascale software stack on heterogeneous architectures. In particular, the talk focused on two of MAGMA’s strengths: highly optimized kernels as building blocks of HPC applications, and runtime systems that will take “sequential-like” algorithms, described in terms of computational tasks, and efficiently schedule their execution on large scale parallel systems.

CScADS Workshop ’11

This year’s Center for Scalable Application Development Software (CScADS) workshops were held at Lake Tahoe, in the Granlibakken lodge. Jack Dongarra, Jakub Kurzak, and Dan Terpstra all attended workshops. Jack gave the introduction talk for the Autotunning workshop and Jakub gave a talk about autotuning GEMMs for Fermi (ASTRA). Dan gave a talk about porting PAPI to the cloud, Performance Tools Workshop.

There were many others in attendance, of course, including Clint Whaley from the University of Texas, Jim Demmel from Berkeley, and Rich Vuduc from Georgia Tech. The HPC industry was also well represented with Michael Garland from NVIDIA, John Levesque and Keita Teranishi from Cray, and Greg Henry from Intel. Clint Whaley, Greg Henry and Keita Teranishi are ICL alumni.

ICL Annual Retreat

This year’s ICL retreat was held on August 11th and 12th in Gatlinburg, Tennessee. Over forty ICL folks were present and more than 30 talks were given about ICL research and operations. There were many new faces, alongside longtime ICL veterans, who enjoyed the new surroundings of the Lodge at Buckberry Creek in the Great Smoky Mountains.

Computational Science Kickoff

On Friday, August 19th, leaders of our campus computational science community invited students, faculty, and staff to the University Center’s Shiloh Room to discover some of the exciting developments in computational science at UTK during the Computational Science Kickoff. The kickoff was a success, and those in attendance began the academic year by learning about the exciting intellectual, educational, and professional opportunities in computational science and engineering.

China: A Quick Ascent

Jack Dongarra was interviewed on the August 2nd episode of NPR’s All Things Considered. During the episode, Jack weighed in on China’s Tianhe-1A supercomputer—the fastest supercomputer in the world from November 2010 to June 2011—and China’s new-found position as a formidable contender in high performance computing.

The striking thing is, back in 2001, China had zero computers on the [Top 500] list. So China very quickly grew its high-performance computing capabilities, and are now No. 2 on the list in terms of the number of high-performance computers deployed.

But hardware isn’t everything. As Jack explains, the machine is nothing without the proper software:

This is a critical thing. They have a race car; now you have to build something around the race car to effectively use it. You can’t just invest in hardware. You need to make an investment across the board. Sometimes these ecosystems are out of balance, and as a result of that, the computer would be very hard to use.

Recent Releases

MAGMA 1.0

MAGMA 1.0 is now available. This release includes the MAGMA sources. MAGMA 1.0 is intended for a single CUDA enabled NVIDIA GPU. It extends version 0.2 by adding support for the Fermi GPUs. For more details see the MAGMA 1.0 release notes and the MAGMA 1.0 presentation.

Included are routines for the following algorithms:

LU, QR, and Cholesky factorizations in both real and complex arithmetic (single and double);
Hessenberg, bidiagonal, and tridiagonal reductions in both real and complex arithmetic (single and double);
Linear solvers based on LU, QR, and Cholesky in both real and complex arithmetic (single and double);
Eigen and singular value problem solvers in both real and complex arithmetic (single and double);
Generalized Hermitian-definite eigenproblem solvers;
Mixed-precision iterative refinement solvers based on LU, QR, and Cholesky in both real and complex arithmetic;
MAGMA BLAS in real arithmetic (single and double), including gemm, gemv, symv, and trsm.

See the software section for a download link.

Interview

Ichitaro Yamazaki Then — Ichitaro Yamazaki

Where are you from, originally?

I am originally from from Chiba, a small city in Japan. I moved to Northern California with my family during my senior year of high school. Last fall, we went back to Japan so that my grandma could meet her first great-grandson. We also enjoyed visiting a few temples, hiking up beautiful mountains, and of course eating good Japanese food.

Can you summarize your educational background?

After graduating from high school, I started out at Foothill College, and then I transferred to UCLA with a major in Business Administration. I graduated from UCLA in 2000 with a B.S. in Mathematics of Computation. For two years after graduation, I worked for a couple of different start-ups in Silicon Valley. Then, I entered graduate school at the University of California, Davis, and I earned my Ph.D. in Computer Science in 2008.

Where did you work before joining ICL?

I was a postdoc in the Scientific Computing Group at Lawrence Berkeley National Laboratory, working on two SciDAC projects, TOPS and COMPASS. Our main objective was to develop a parallel solver for large-scale sparse linear systems of equations, and the application of our main interest was to design accelerator cavities and fusion devices.

Tell us how you first learned about ICL. What made you want to work for ICL?

I had met several people from ICL at conferences. Everyone seemed very nice and their projects at ICL interested me. I was especially impressed by how passionate you are about your jobs. ICL seemed like a great fit for me, so I decided to make the big jump from California.

What are your interests/hobbies outside work?

Right now, I spend most of my free time having adventures with my wife and my one-year-old son. We love outdoor activities like hiking, so we are looking forward to visiting the Smoky Mountains. I am also hoping to get back into playing sports. My favorite sport is tennis, so I am thinking of signing up for IM tennis, or perhaps racquetball, this quarter.

Tell us something about yourself that might surprise people.

My wife and I love to travel. We went to Guatemala for our honeymoon, and the airlines ended up losing our luggage. For most of the trip we literally had nothing but the clothes on our backs. We were hiking around in the middle of the jungle in sandals and going to five-star restaurants in sweatpants, but we still had a great time!

What are you working on while at ICL?

I have not decided on an exact project yet, but I will be working mainly with the Linear Algebra Group.

If you weren’t working at ICL, where would you like to be working and why?

I like the outdoors, so it might be nice to be a forest ranger in a state or national park. My brother-in-law is in this line of work in Oregon, and it seems like an exciting job.

Recent Papers

Du, P., A. Bouteiller, G. Bosilca, T. Herault, and J. Dongarra, “Algorithm-based Fault Tolerance for Dense Matrix Factorizations,” University of Tennessee Computer Science Technical Report, no. UT-CS-11-676, Knoxville, TN, August 2011. (865.79 KB)
Bouteiller, A., T. Herault, G. Bosilca, and J. Dongarra, “Correlated Set Coordination in Fault Tolerant Message Logging Protocols,” Proceedings of 17th International Conference, Euro-Par 2011, Part II, vol. 6853, Bordeaux, France, Springer, pp. 51-64, August 2011. (486.68 KB)
Luszczek, P., E. Meek, S. Moore, D. Terpstra, V. M. Weaver, and J. Dongarra, “Evaluation of the HPC Challenge Benchmarks in Virtualized Environments,” 6th Workshop on Virtualization in High-Performance Cloud Computing, Bordeaux, France, August 2011. (114.73 KB)
Vetter, J., R. Glassbrook, J. Dongarra, K. Schwan, B. Loftis, S. McNally, J. Meredith, J. Rogers, P. Roth, K. Spafford, et al., “Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community,” IEEE Computing in Science & Engineering, vol. 13, issue 5, pp. 90-95, August 2011. DOI: 10.1109/MCSE.2011.83 (932.57 KB)
Tomov, S., and J. Dongarra, MAGMA - LAPACK for HPC on Heterogeneous Architectures , Oak Ridge, TN, Titan Summit at Oak Ridge National Laboratory, Presentation, August 2011. (20.43 MB)
Haidar, A., H. Ltaeif, and J. Dongarra, “Parallel Reduction to Condensed Forms for Symmetric Eigenvalue Problems using Aggregated Fine-Grained and Memory-Aware Kernels,” University of Tennessee Computer Science Technical Report, UT-CS-11-677, (also Lawn254), August 2011. (636.01 KB)
Dongarra, J., M. Faverge, H. Ltaeif, and P. Luszczek, “Achieving Numerical Accuracy and High Performance using Recursive Tile LU Factorization,” University of Tennessee Computer Science Technical Report (also as a LAWN), no. ICL-UT-11-08, September 2011. (618.53 KB)
Coulomb, K., A. Degomme, M. Faverge, and F. Trahay, “An open-source tool-chain for performance analysis,” Parallel Tools Workshop, Dresden, Germany, September 2011. (622.1 KB)
Du, P., P. Luszczek, and J. Dongarra, “High Performance Dense Linear System Solver with Soft Error Resilience,” IEEE Cluster 2011, Austin, TX, September 2011. (1.27 MB)
Ma, T., A. Bouteiller, G. Bosilca, and J. Dongarra, “Impact of Kernel-Assisted MPI Communication over Scientific Applications: CPMD and FFTW,” 18th EuroMPI, Santorini, Greece, Springer, pp. 247-254, September 2011.
Ma, T., G. Bosilca, A. Bouteiller, B. Goglin, J. Squyres, and J. Dongarra, “Kernel Assisted Collective Intra-node MPI Communication Among Multi-core and Many-core CPUs,” Int'l Conference on Parallel Processing (ICPP '11), Taipei, Taiwan, September 2011.
Chaarawi, M., E. Gabriel, R. Keller, R. L. Graham, G. Bosilca, and J. Dongarra, “OMPIO: A Modular Software Architecture for MPI I/O,” 18th EuroMPI, Santorini, Greece, Springer, pp. 81-89, September 2011.
Bosilca, G., T. Herault, A. Rezmerita, and J. Dongarra, “On Scalability for MPI Runtime Systems,” International Conference on Cluster Computing (CLUSTER), Austin, TX, USA, IEEEE, pp. 187-195, September 2011. (898.76 KB)
Malony, A. D., S. Biersdorff, S. Shende, H. Jagode, S. Tomov, G. Juckeland, R. Dietrich, D. Poole, and C. Lamb, “Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs,” International Conference on Parallel Processing (ICPP'11), Taipei, Taiwan, ACM, September 2011. DOI: 10.1109/ICPP.2011.71 (1.41 MB)
Kasichayanula, K., H. You, S. Moore, S. Tomov, H. Jagode, and M. Johnson, Power-aware Computing on GPGPUs , Gatlinburg, TN, Fall Creek Falls Conference, Poster, September 2011. (2.89 MB)
Lively, C., X. Wu, V. Taylor, S. Moore, H-C. Chang, C-Y. Su, and K. Cameron, “Power-Aware Prediction Models of Hybrid (MPI/OpenMP) Scientific Applications,” International Conference on Energy-Aware High Performance Computing (EnA-HPC 2011), Hamburg, Germany, September 2011. (479.49 KB)
Ltaeif, H., P. Luszczek, and J. Dongarra, “Profiling High Performance Dense Linear Algebra Algorithms on Multicore Architectures for Power and Energy Efficiency,” International Conference on Energy-Aware High Performance Computing (EnA-HPC 2011), Hamburg, Germany, September 2011. (1.27 MB)
Bosilca, G., T. Herault, P. Lemariner, J. Dongarra, and A. Rezmerita, “Scalable Runtime for MPI: Efficiently Building the Communication Infrastructure,” Proceedings of Recent Advances in the Message Passing Interface - 18th European MPI Users' Group Meeting, EuroMPI 2011, vol. 6960, Santorini, Greece, Springer, pp. 342-344, September 2011. (115.75 KB)

Recent Lunch Talks

AUG
5
Thomas Herault
On Scalability for MPI Runtime Systems
AUG
19
Phil Mucci
Measuring I/O in Virtualized Environments PDF
AUG
26
Tracy Rafferty
Travel
SEP
2
Vladimir Voevodin and Victor Gergel
Russia
Perspectives for HPC Infrastructure in Russia PDF
SEP
9
Tingxing "Tim" Dong
Acceleration of the BLAST hydro code on GPU PDF
SEP
16
Hartwig Anzt
Energy-Efficient High-Performance-Computing PDF
SEP
23
Piotr Luszczek
Energy footprint for the LINPACK benchmark from supercomputers to tablet devices PDF
SEP
30
Alexander Gaenko
Assitant scientist at Ames Laboratory of US DOE, Ames, IA
Let's Work Together: New Computational Science via Re-implementation, Componentization, Parallelization

Upcoming Lunch Talks

OCT
7
Yulu Jia and Khairul Kabir
OCT
14
Jim Browne
TACC
PerfExpert PDF
OCT
21
Wes Kendall
EECS
DStep: An Infrastructure for Large-Scale Flow Analysis PDF
OCT
28
Blake Haugen
Onion Peeling: A New Approach to Predicting Tiled QR Factorization Performance PDF