ICL Newsletter

News and Announcements

RaPyDLI Underway

From left to right: Judy Qui, Jakub Kurzak, Piotr Luszczek, Gregor von Laszewski, Geoffrey Fox, and Terry Moore.

ICL recently teamed up with Geoffrey Fox from Indiana University and Andrew Ng from Stanford to take on a deep learning project called RaPyDLI (Rapid Python Deep Learning Infrastructure). The NSF-funded project will allow users to program deep learning models in Python, port them to supercomputers, and scale-out to cloud systems, thereby bringing the performance of HPC to deep learning. RaPyDLI will support GPU accelerators and Intel Xeon Phi coprocessors and a broad range of storage approaches including files, NoSQL, HDFS, and databases.

Terry Moore, Jakub Kurzak, and Piotr Luszczek recently met with the rest of the RaPyDLI team to establish an open line of communication and harden the strategy for the RaPyDLI software infrastructure.

ICL now an Intel Parallel Computing Center

The Innovative Computing Laboratory recently became the newest Intel Parallel Computing Center (IPCC). The objective of ICL’s IPCC is the development and optimization of numerical linear algebra libraries and technologies for applications, while tackling current challenges in heterogeneous Intel Xeon Phi coprocessor-based High Performance Computing. In collaboration with Intel’s MKL team, the IPCC at ICL will modernize the popular LAPACK and ScaLAPACK libraries to run efficiently on current and future manycore architectures, and will disseminate the developments through the open source MAGMA MIC library.

New ICL Website Released

As we announced in last month’s newsletter, David Rogers has been hard at work developing ICL’s new web presence. The new ICL website utilizes a Drupal back end and features a completely new look and advanced publications database. Well, as of October 1st, the website is up and running. There are still some new features that will be rolled out in the coming months, but the core site is in place and functional. ICLers are encouraged to check it out!

URL: http://www.icl.utk.edu/

Conference Reports

EuroMPI/ASIA 2014

On September 9 – 12, ICL’s Aurelien Bouteiller and George Bosilca made their way to Kyoto, Japan for the 21st EuroMPI/Asia meeting. While previous meetings were typically held in Europe, this year’s meeting arrives at a new venue in Japan. Location notwithstanding, EuroMPI is the preeminent meeting for users, developers, and researchers to interact and discuss new developments and applications of message-passing parallel computing, and the Message Passing Interface (MPI) in particular.

While at the meeting, Aurelien and George hosted a tutorial for resilient applications using MPI-level constructs before heading to the MPI Forum in Kobe.

CCDSC 2014

On September 2 – 5, the Châteauform’ La Maison des Contes hosted this year’s workshop on Clusters, Clouds, and Data for Scientific Computing (CCDSC). CCDSC 2014 is a continuation of a series of workshops started in 1992, which are held every two years and alternate between the U.S. and France. The purpose of this meeting, which is by invitation only, is to evaluate the state-of-the-art and future trends for cluster computing and the use of computational clouds for scientific computing.

ICL’s Jack Dongarra and former ICLer Bernard Tourancheau (now a Professor at the Université Grenoble Alpes) co-charied, while George Bosilca and Anthony Danalis each gave invited talks. George gave his talk, “Mixed Resilience Solutions,” on the second day, while Anthony gave his talk, “Why PaRSEC is the right runtime for Exascale computing,” on the fourth and final day of the workshop. Overall, the meeting had over 50 attendees and nearly 40 individual talks.

MCSoC-14

IEEE’s 8th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC-14) was held at the University of Aizu, Aizu-Wakamatsu, Japan, on September 23 – 25. MCSoC provides a forum for leading researchers in academia and industry working in areas of embedded Multicore/Many-core SoCs software, tools, and application design.

Jakub Kurzak and Anthony Danalis both gave invited talks at the symposium. Jakub gave his talk, “BEAST: An Automatic Tuner for Numerical Kernels on Accelerators” on the first day, and later gave another talk, “PULSAR: A Lightweight Runtime for Scientifc Computing,” at the University of Tokyo. On the second day, Anthony gave his talk about, “Dataflow based Task Execution through PaRSEC for High Performance Computing Systems.”

Recent Releases

MAGMA MIC 1.2 Released

MAGMA MIC 1.2 is now available. This release provides implementations for MAGMA’s one-sided (LU, QR, and Cholesky) and two-sided (Hessenberg, bi- and tridiagonal reductions) dense matrix factorizations, as well as linear and eigenproblem solvers for Intel Xeon Phi coprocessors. More information on the approach is given in this presentation.

The MAGMA MIC 1.2 release adds usage and performance improvements.

Visit the MAGMA software page to download the tarball.

Interview

Where are you from, originally?

I was born in Suining, Sichuan Province. Sichuan is in the southwest of China and is famous for its namesake feature, which is like a basin surrounded by mountains. My hometown is in the center of the Sichuan Basin.

Can you summarize your educational background?

I earned my BS in Physics from Beijing Normal University and then obtained my PhD in Condensed Matter Physics in 2006 at the Institute of Physics, Chinese Academy of Sciences. I worked at Brookhaven National Lab as a postdoc for about three years. After that I worked as an assistant project scientist at the University of California, San Diego. I moved to Knoxville with my family in 2011. After my son was born I decided to pursue a different career path. I enrolled in the EECS Master’s program at UTK in the fall of 2013.

Tell us how you first learned about ICL.

My husband, Dong Li, works in HPC at Oak Ridge National Lab. He received help from Jack when he applied for a joint faculty position at UTK. When I looked for a graduate supervisor, Jack was my first choice. Fortunately, ICL also provided me with a graduate research assistant position starting in January 2014. I am so lucky to have this opportunity to join ICL.

What made you want to work for ICL?

Jack Dongarra is a big name in the HPC arena. ICL’s research projects are attractive, and the work experience I gain at ICL will be very helpful in my future job hunt. ICL is also a sizeable lab, so there are ample opportunities to network and collaborate with people.

What are you working on while at ICL?

The first project I worked on involved the Common Communication Interface (CCI). Now I am working on a collaborative project with a team at ORNL. We are porting OpenSHMEM to exascale applications (LAMMPS and S3D), and comparing the performance with an MPI implementation. Our next step is to implement MPI on universal common communication substrate (UCCS). We will apply the combination of MPI and OpenSHMEM to exascale applications to get higher performance.

If you weren’t working at ICL, where would you like to be working and why?

If I was not working at ICL, I might find another lab at UTK to proceed with my project in lieu of a thesis. If I wasn’t enrolled in the EECS’s MS program, I might work as an electron microscopist in a company or a lab. Another possibility is to be a stay at home mom.

What are your interests/hobbies outside work?

I like swimming, hiking, and singing. I also play PingPong and badminton.

Tell us something about yourself that might surprise people.

My right arm was dislocated twice. The first dislocation happened when I was about 7 years old. I was jumpy on the way home after school and then suddenly fell down and hurt my arm. The second dislocation happened when I was 12 and learning to ride a bicycle and suddenly fell down…

Recent Papers

Dong, T., A. Haidar, S. Tomov, and J. Dongarra, “A Fast Batched Cholesky Factorization on a GPU,” International Conference on Parallel Processing (ICPP-2014), Minneapolis, MN, September 2014. (1.37 MB)
McCraw, H., J. Ralph, A. Danalis, and J. Dongarra, “Power Monitoring with PAPI for Extreme Scale Architectures and Dataflow-based Programming Models,” 2014 IEEE International Conference on Cluster Computing, no. ICL-UT-14-04, Madrid, Spain, IEEE, September 2014. DOI: 10.1109/CLUSTER.2014.6968672 (3.45 MB)
Haugen, B., and J. Kurzak, “Search Space Pruning Constraints Visualization,” VISSOFT'14: 2nd IEEE Working Conference on Software Visualization, Victoria, BC, Canada, IEEE, September 2014. (1.32 MB)
Aliaga, J. I., H. Anzt, M. Castillo, J. C. FernÃ¡ndez, G. LeÃ³n, J. PÃ©rez, and E. S. Quintana-Orti, “Unveiling the Performance-energy Trade-off in Iterative Linear System Solvers for Multithreaded Processors,” Concurrency and Computation: Practice and Experience, vol. 27, issue 4, pp. 885-904, September 2014. DOI: 10.1002/cpe.3341 (1.83 MB)
McCraw, H., A. Danalis, G. Bosilca, J. Dongarra, K. Kowalski, and T. Windus, “Utilizing Dataflow-based Execution for Coupled Cluster Methods,” 2014 IEEE International Conference on Cluster Computing, no. ICL-UT-14-02, Madrid, Spain, IEEE, September 2014. (260.23 KB)
Bouteiller, A., T. Herault, and G. Bosilca, “A Multithreaded Communication Substrate for OpenSHMEM,” 8th International Conference on Partitioned Global Address Space Programming Models (PGAS), Eugene, OR, October 2014. (261.66 KB)
Anzt, H., S. Tomov, and J. Dongarra, “Accelerating the LOBPCG method on GPUs using a blocked Sparse Matrix Vector Product,” University of Tennessee Computer Science Technical Report, no. UT-EECS-14-731: University of Tennessee, October 2014. (1.83 MB)
Yamazaki, I., T. Mary, J. Kurzak, S. Tomov, and J. Dongarra, “Access-averse Framework for Computing Low-rank Matrix Approximations,” First International Workshop on High Performance Big Graph Data Management, Analysis, and Mining, Washington, DC, October 2014.

Recent Conferences

SEP
17

RaPyDLI Working Meeting Palo Alto, CA, California
Jakub
Piotr
Terry

Jakub Kurzak, Piotr Luszczek, Terry Moore
SEP
22

Resilience Building Blocks Proposal Meeting Argonne, Illinois
Aurelien
George
Thomas

Aurelien Bouteiller, George Bosilca, Thomas Herault
OCT
7

OUG: Open SHMEM users Group Eugene, Oregon
Aurelien

Aurelien Bouteiller
OCT
15

Super Fall Meeting College Park, Maryland
George

George Bosilca
OCT
27

IEEE International Conference on Big Data Washington, District of Columbia
Ichitaro

Ichitaro Yamazaki

Upcoming Conferences

NOV
6

HPC China Guangzhou, China
Jack

Jack Dongarra
NOV
16-26

SC14 New Orleans, Louisiana
Anthony
Asim
Aurelien
George
Heike
Ichitaro
Jack
Jakub
Piotr
Terry
Thomas
Tracy
Wei
Yulu
Yves

Anthony Danalis, Asim YarKhan, Aurelien Bouteiller, George Bosilca, Heike McCraw, Ichitaro Yamazaki, Jack Dongarra, Jakub Kurzak, Piotr Luszczek, Terry Moore, Thomas Herault, Tracy Rafferty, Wei Wu, Yulu Jia, Yves Robert

Recent Lunch Talks

SEP
5
Theo Mary
INP-ENSEEIHT
Performance Study of a Randomized Low-rank Approximation using multi-GPU PDF
SEP
12
George Ostrouchov
ORNL
Taking R to Big Platforms and Supercomputers with pbdR
SEP
19
Simplice Donfack
Improve the applicability of highly efficient stencil compilers to a wider class of problems PDF
SEP
26
Azzam Haidar
Towards Batched Linear Solvers on Accelerated Hardware Platforms PDF
OCT
3
Hartwig Anzt
Asynchronous Iterative Algorithm for Computing Incomplete Factorizations on GPUs PDF
OCT
10
Alfredo Buttari
ENSEEIHT
Improving multifrontal solvers by means of Block Low-Rank approximations PDF
OCT
17
Florent Lopez
ENSEEIHT
Sparse direct solvers on top of runtime systems PDF
OCT
24
Yves Robert
Assessing general-purpose algorithms to cope with fail-stop and silent errors PDF
OCT
31
Aurelien Bouteiller
UCCS: A Communication Substrate for Open SHMEM (and more) PDF

Upcoming Lunch Talks

NOV
7
Adrien Remy
LRI
Using Random Butterfly Transformation to Solve Dense Linear Systems Using Accelerators PDF
NOV
14
Chongxiao Cao
Design for a Soft Error Resilient Dynamic Task-based Runtime PDF

Visitors

Adrien Remy from LRI will be visiting from October 21 through November 7. Adrien will be working with Ichi and the Linear Algebra group.

Visitors

Adrien Remy from LRI will be visiting from October 21 through November 7. Adrien will be working with Ichi and the Linear Algebra group.

congratulations

Mr. and Mrs. Haugen

On August 30th, 2014, ICL’s Blake Haugen married Rebekkah Haugen née Radcliff in Knoxville, TN. Congratulations to Blake and Rebekkah!

Dates to Remember

SC14 Early Registration

The SC14 early registration deadline is October 15th. Registering before this date can save ICL up to $275/person, so please take advantage of early registration!

If you are a SIGHPC member, you maybe be eligible for a further discount when registering for SC14. See the SC14 registration form for details.

ICL’s 25th Anniversary

We are pleased to announce that ICL will be hosting the “25 Years of Innovative Computing Conference” on March 31 – April 2, 2015 in honor of the lab’s 25th year. Mark your calendars!

October 2014

News and Announcements

RaPyDLI Underway

ICL now an Intel Parallel Computing Center

New ICL Website Released

Conference Reports

EuroMPI/ASIA 2014

CCDSC 2014

MCSoC-14

Recent Releases

MAGMA MIC 1.2 Released

Interview

Chunyan Tang

Recent Papers

Recent Conferences

Upcoming Conferences

Recent Lunch Talks

Upcoming Lunch Talks

Visitors

Visitors

congratulations

Mr. and Mrs. Haugen

Dates to Remember

SC14 Early Registration

ICL’s 25th Anniversary

Archives

PDF Editions