Submitted by scrawford on
Title | Out of Memory SVD Solver for Big Data |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Authors | Haidar, A., K. Kabir, D. Fayad, S. Tomov, and J. Dongarra |
Conference Name | 2017 IEEE High Performance Extreme Computing Conference (HPEC'17) |
Date Published | 2017-09 |
Publisher | IEEE |
Conference Location | Waltham, MA |
Abstract | Many applications – from data compression to numerical weather prediction and information retrieval – need to compute large dense singular value decompositions (SVD). When the problems are too large to fit into the computer’s main memory, specialized out-of-core algorithms that use disk storage are required. A typical example is when trying to analyze a large data set through tools like MATLAB or Octave, but the data is just too large to be loaded. To overcome this, we designed a class of out-of-memory (OOM) algorithms to reduce, as well as overlap communication with computation. Of particular interest is OOM algorithms for matrices of size m×n, where m >> n or m << n, e.g., corresponding to cases of too many variables, or too many observations. To design OOM SVDs, we first study the communications cost for the SVD techniques as well as for the QR/LQ factorization followed by SVD. We present the theoretical analysis about the data movement cost and strategies to design OOM SVD algorithms. We show performance results for multicore architecture that illustrate our theoretical findings and match our performance models. Moreover, our experimental results show the feasibility and superiority of the OOM SVD. |