Code:
device 0: Quadro 4000, 950.0 MHz clock, 2047.6 MB memory
Testing TRANSA = N TRANSB = N
    N    MAGMA GFLop/s    CUBLAS GFlop/s    error
========================================================
 1024    250.00           240.08            0.000000e+00
 2048    285.11           284.37            0.000000e+00
 3072    306.28           303.81            0.000000e+00
 4096    301.63           291.22            0.000000e+00
 5120    300.09           298.82            0.000000e+00
 6144    307.52           304.75            0.000000e+00
 7168    305.28           298.07            0.000000e+00
 8192    303.59           301.83            0.000000e+00
 9216    308.28           305.31            0.000000e+00
10240    306.52           300.44            0.000000e+00
Code:
device 0: Quadro 4000, 950.0 MHz clock, 2047.6 MB memory
Testing TRANSA = N TRANSB = N
    N    MAGMA GFLop/s    CUBLAS GFlop/s    error
========================================================
 1024    135.67           138.92            0.000000e+00
 2048    141.77           141.54            0.000000e+00
 3072    142.48           142.40            0.000000e+00
 4096    142.68           142.41            0.000000e+00
 5120    142.81           142.66            0.000000e+00
 6144    142.87           142.85            0.000000e+00
 7168    143.06           143.03            0.000000e+00
 8192    147.19           183251937.96      0.000000e+00
can not bind to texture
 9216    43486543.87      260919263.23      0.000000e+00
can not bind to texture
10240    79536431.41      357913941.33      0.000000e+00
Code:
device 0: Quadro 4000, 950.0 MHz clock, 2047.6 MB memory
Usage:
testing_dgetrf_gpu -M 1024 -N 1024
!!!! cudaMallocHost failed for: h_R
Thanks.