News and Announcements

RaPyDLI Underway

From left to right: Judy Qiu, Jakub Kurzak, Piotr Luszczek, Gregor von Laszewski, Geoffrey Fox, and Terry Moore.

ICL recently teamed up with Geoffrey Fox from Indiana University and Andrew Ng from Stanford to take on a deep learning project called RaPyDLI (Rapid Python Deep Learning Infrastructure). The NSF-funded project will allow users to program deep learning models in Python, port them to supercomputers, and scale out to cloud systems, thereby bringing the performance of HPC to deep learning. RaPyDLI will support GPU accelerators and Intel Xeon Phi coprocessors, as well as a broad range of storage approaches including files, NoSQL, HDFS, and databases.

Terry Moore, Jakub Kurzak, and Piotr Luszczek recently met with the rest of the RaPyDLI team to establish an open line of communication and harden the strategy for the RaPyDLI software infrastructure.

ICL now an Intel Parallel Computing Center

The Innovative Computing Laboratory recently became the newest Intel Parallel Computing Center (IPCC). The objective of ICL’s IPCC is the development and optimization of numerical linear algebra libraries and technologies for applications, while tackling current challenges in heterogeneous Intel Xeon Phi coprocessor-based High Performance Computing. In collaboration with Intel’s MKL team, the IPCC at ICL will modernize the popular LAPACK and ScaLAPACK libraries to run efficiently on current and future manycore architectures, and will disseminate the developments through the open source MAGMA MIC library.

New ICL Website Released

As we announced in last month’s newsletter, David Rogers has been hard at work developing ICL’s new web presence. The new ICL website utilizes a Drupal back end and features a completely new look and advanced publications database. Well, as of October 1st, the website is up and running. There are still some new features that will be rolled out in the coming months, but the core site is in place and functional. ICLers are encouraged to check it out!

URL: http://www.icl.utk.edu/

Conference Reports

EuroMPI/ASIA 2014

On September 9 – 12, ICL’s Aurelien Bouteiller and George Bosilca made their way to Kyoto, Japan, for the 21st EuroMPI/Asia meeting. While previous meetings were typically held in Europe, this year’s meeting moved to a new venue in Japan. Location notwithstanding, EuroMPI is the preeminent meeting for users, developers, and researchers to interact and discuss new developments and applications of message-passing parallel computing, and the Message Passing Interface (MPI) in particular.

While at the meeting, Aurelien and George hosted a tutorial on building resilient applications using MPI-level constructs before heading to the MPI Forum in Kobe.

CCDSC 2014

On September 2 – 5, the Châteauform’ La Maison des Contes hosted this year’s workshop on Clusters, Clouds, and Data for Scientific Computing (CCDSC). CCDSC 2014 is a continuation of a series of workshops started in 1992, which are held every two years and alternate between the U.S. and France. The purpose of this meeting, which is by invitation only, is to evaluate the state-of-the-art and future trends for cluster computing and the use of computational clouds for scientific computing.

ICL’s Jack Dongarra and former ICLer Bernard Tourancheau (now a Professor at the Université Grenoble Alpes) co-chaired, while George Bosilca and Anthony Danalis each gave invited talks. George gave his talk, “Mixed Resilience Solutions,” on the second day, while Anthony gave his talk, “Why PaRSEC is the right runtime for Exascale computing,” on the fourth and final day of the workshop. Overall, the meeting had over 50 attendees and nearly 40 individual talks.

MCSoC-14

IEEE’s 8th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC-14) was held at the University of Aizu, Aizu-Wakamatsu, Japan, on September 23 – 25. MCSoC provides a forum for leading researchers in academia and industry working in areas of embedded Multicore/Many-core SoCs software, tools, and application design.

Jakub Kurzak and Anthony Danalis both gave invited talks at the symposium. Jakub gave his talk, “BEAST: An Automatic Tuner for Numerical Kernels on Accelerators,” on the first day, and later gave another talk, “PULSAR: A Lightweight Runtime for Scientific Computing,” at the University of Tokyo. On the second day, Anthony gave his talk, “Dataflow based Task Execution through PaRSEC for High Performance Computing Systems.”

Recent Releases

MAGMA MIC 1.2 Released

MAGMA MIC 1.2 is now available. This release provides implementations for MAGMA’s one-sided (LU, QR, and Cholesky) and two-sided (Hessenberg, bi- and tridiagonal reductions) dense matrix factorizations, as well as linear and eigenproblem solvers for Intel Xeon Phi coprocessors. More information on the approach is given in this presentation.

The MAGMA MIC 1.2 release adds usage and performance improvements.

Visit the MAGMA software page to download the tarball.

Interview

Chunyan Tang

Where are you from, originally?

I was born in Suining, Sichuan Province. Sichuan is in the southwest of China and is famous for its namesake feature, which is like a basin surrounded by mountains. My hometown is in the center of the Sichuan Basin.

Can you summarize your educational background?

I earned my BS in Physics from Beijing Normal University and then obtained my PhD in Condensed Matter Physics in 2006 at the Institute of Physics, Chinese Academy of Sciences. I worked at Brookhaven National Lab as a postdoc for about three years. After that I worked as an assistant project scientist at the University of California, San Diego. I moved to Knoxville with my family in 2011. After my son was born I decided to pursue a different career path. I enrolled in the EECS Master’s program at UTK in the fall of 2013.

Tell us how you first learned about ICL.

My husband, Dong Li, works in HPC at Oak Ridge National Lab. He received help from Jack when he applied for a joint faculty position at UTK. When I looked for a graduate supervisor, Jack was my first choice. Fortunately, ICL also provided me with a graduate research assistant position starting in January 2014. I am so lucky to have this opportunity to join ICL.

What made you want to work for ICL?

Jack Dongarra is a big name in the HPC arena. ICL’s research projects are attractive, and the work experience I gain at ICL will be very helpful in my future job hunt. ICL is also a sizeable lab, so there are ample opportunities to network and collaborate with people.

What are you working on while at ICL?

The first project I worked on involved the Common Communication Interface (CCI). Now I am working on a collaborative project with a team at ORNL. We are porting OpenSHMEM to exascale applications (LAMMPS and S3D) and comparing the performance with an MPI implementation. Our next step is to implement MPI on the Universal Common Communication Substrate (UCCS). We will apply the combination of MPI and OpenSHMEM to exascale applications to get higher performance.

If you weren’t working at ICL, where would you like to be working and why?

If I were not working at ICL, I might find another lab at UTK to proceed with my project in lieu of a thesis. If I weren’t enrolled in the EECS MS program, I might work as an electron microscopist in a company or a lab. Another possibility is to be a stay-at-home mom.

What are your interests/hobbies outside work?

I like swimming, hiking, and singing. I also play ping-pong and badminton.

Tell us something about yourself that might surprise people.

My right arm was dislocated twice. The first dislocation happened when I was about 7 years old. I was jumping around on the way home after school when I suddenly fell down and hurt my arm. The second dislocation happened when I was 12 and learning to ride a bicycle, and I suddenly fell down…

Recent Papers

  1. Dong, T., A. Haidar, S. Tomov, and J. Dongarra, “A Fast Batched Cholesky Factorization on a GPU,” International Conference on Parallel Processing (ICPP-2014), Minneapolis, MN, September 2014.
  2. McCraw, H., J. Ralph, A. Danalis, and J. Dongarra, “Power Monitoring with PAPI for Extreme Scale Architectures and Dataflow-based Programming Models,” 2014 IEEE International Conference on Cluster Computing, no. ICL-UT-14-04, Madrid, Spain, IEEE, September 2014. DOI: 10.1109/CLUSTER.2014.6968672
  3. Haugen, B., and J. Kurzak, “Search Space Pruning Constraints Visualization,” VISSOFT'14: 2nd IEEE Working Conference on Software Visualization, Victoria, BC, Canada, IEEE, September 2014.
  4. Aliaga, J. I., H. Anzt, M. Castillo, J. C. Fernández, G. León, J. Pérez, and E. S. Quintana-Orti, “Unveiling the Performance-energy Trade-off in Iterative Linear System Solvers for Multithreaded Processors,” Concurrency and Computation: Practice and Experience, vol. 27, issue 4, pp. 885-904, September 2014. DOI: 10.1002/cpe.3341
  5. McCraw, H., A. Danalis, G. Bosilca, J. Dongarra, K. Kowalski, and T. Windus, “Utilizing Dataflow-based Execution for Coupled Cluster Methods,” 2014 IEEE International Conference on Cluster Computing, no. ICL-UT-14-02, Madrid, Spain, IEEE, September 2014.
  6. Bouteiller, A., T. Herault, and G. Bosilca, “A Multithreaded Communication Substrate for OpenSHMEM,” 8th International Conference on Partitioned Global Address Space Programming Models (PGAS), Eugene, OR, October 2014.
  7. Anzt, H., S. Tomov, and J. Dongarra, “Accelerating the LOBPCG method on GPUs using a blocked Sparse Matrix Vector Product,” University of Tennessee Computer Science Technical Report, no. UT-EECS-14-731: University of Tennessee, October 2014.
  8. Yamazaki, I., T. Mary, J. Kurzak, S. Tomov, and J. Dongarra, “Access-averse Framework for Computing Low-rank Matrix Approximations,” First International Workshop on High Performance Big Graph Data Management, Analysis, and Mining, Washington, DC, October 2014.

Recent Conferences

  1. SEP
    RaPyDLI Working Meeting, Palo Alto, California
    Jakub Kurzak, Piotr Luszczek, Terry Moore
  2. SEP
    Resilience Building Blocks Proposal Meeting, Argonne, Illinois
    Aurelien Bouteiller, George Bosilca, Thomas Herault
  3. OCT
    Aurelien Bouteiller
  4. OCT
    Super Fall Meeting, College Park, Maryland
    George Bosilca
  5. OCT
    IEEE International Conference on Big Data, Washington, District of Columbia
    Ichitaro Yamazaki

Upcoming Conferences

  1. NOV
    HPC China, Guangzhou, China
    Jack Dongarra
  2. NOV
    SC14, New Orleans, Louisiana
    Anthony Danalis, Asim YarKhan, Aurelien Bouteiller, George Bosilca, Heike McCraw, Ichitaro Yamazaki, Jack Dongarra, Jakub Kurzak, Piotr Luszczek, Terry Moore, Thomas Herault, Tracy Rafferty, Wei Wu, Yulu Jia, Yves Robert

Recent Lunch Talks

  1. SEP 5
    Theo Mary, INP-ENSEEIHT
    Performance Study of a Randomized Low-rank Approximation using multi-GPU
  2. SEP 12
    George Ostrouchov, ORNL
    Taking R to Big Platforms and Supercomputers with pbdR
  3. SEP 19
    Simplice Donfack
    Improve the applicability of highly efficient stencil compilers to a wider class of problems
  4. SEP 26
    Azzam Haidar
    Towards Batched Linear Solvers on Accelerated Hardware Platforms
  5. OCT 3
    Hartwig Anzt
    Asynchronous Iterative Algorithm for Computing Incomplete Factorizations on GPUs
  6. OCT 10
    Alfredo Buttari, ENSEEIHT
    Improving multifrontal solvers by means of Block Low-Rank approximations
  7. OCT 17
    Florent Lopez, ENSEEIHT
    Sparse direct solvers on top of runtime systems
  8. OCT 24
    Yves Robert
    Assessing general-purpose algorithms to cope with fail-stop and silent errors
  9. OCT 31
    Aurelien Bouteiller
    UCCS: A Communication Substrate for OpenSHMEM (and more)

Upcoming Lunch Talks

  1. NOV 7
    Adrien Remy, LRI
    Using Random Butterfly Transformation to Solve Dense Linear Systems Using Accelerators
  2. NOV 14
    Chongxiao Cao
    Design for a Soft Error Resilient Dynamic Task-based Runtime

Visitors

  1. Adrien Remy
    Adrien Remy from LRI will be visiting from October 21 through November 7. Adrien will be working with Ichi and the Linear Algebra group.

Congratulations

Mr. and Mrs. Haugen

On August 30th, 2014, ICL’s Blake Haugen married Rebekkah Haugen née Radcliff in Knoxville, TN. Congratulations to Blake and Rebekkah!

Dates to Remember

SC14 Early Registration

The SC14 early registration deadline is October 15th. Registering before this date can save ICL up to $275/person, so please take advantage of early registration!

If you are a SIGHPC member, you may be eligible for a further discount when registering for SC14. See the SC14 registration form for details.

ICL’s 25th Anniversary

We are pleased to announce that ICL will be hosting the “25 Years of Innovative Computing Conference” on March 31 – April 2, 2015 in honor of the lab’s 25th year. Mark your calendars!