Do all the functions of MAGMA on fermi GPU outperform those in MKL on CPU xeon 5600 series with flags "openmp" and "parallel" turned on?
In the presentation slides, there is only one slide showing MAGMA is faster than MKL for LU decomposition. I don't know what about the other operations, such as dheevd?
Thanks.