CUDA 3.0 released has CUBLAS functions you need!
Posted: Sun Nov 08, 2009 8:28 am
Perhaps you don't know but seems Nvidia shipped a better CUBLAS to CUlatools as they have near all this extensions from 30 september!
You can grab the basic and get cublas.dll,cublas.so with all the functions without a .lib in Windows for linking to it.
Anyway with 3.0 you have this
functions that seem to be so useful:
* cublasCtrsm()
* cublasCtrmm()
* cublasCsyrk()
* cublasZtrsm()
* cublasZtrmm()
* cublasZsyrk()
It would be good if you can use them to release (at least in later build than 14) with complex data types and single and double precision
- Added the BLAS1 functions:
* cublasZaxpy()
* cublasZcopy()
* cublasZswap()
- Added the BLAS2 functions:
* cublasDtrmv()
* cublasCtrmv()
* cublasCgemv()
* cublasCgeru()
* cublasCgerc()
* cublasZtrmv()
* cublasZgemv()
* cublasZgeru()
* cublasZgerc()
- Added the BLAS3 functions:
* cublasCtrsm()
* cublasCtrmm()
* cublasCsyrk()
* cublasCsymm()
* cublasCherk()
* cublasZtrsm()
* cublasZtrmm()
* cublasZsyrk()
* cublasZsymm()
* cublasZherk()
You can grab the basic and get cublas.dll,cublas.so with all the functions without a .lib in Windows for linking to it.
Anyway with 3.0 you have this
functions that seem to be so useful:
* cublasCtrsm()
* cublasCtrmm()
* cublasCsyrk()
* cublasZtrsm()
* cublasZtrmm()
* cublasZsyrk()
It would be good if you can use them to release (at least in later build than 14) with complex data types and single and double precision
- Added the BLAS1 functions:
* cublasZaxpy()
* cublasZcopy()
* cublasZswap()
- Added the BLAS2 functions:
* cublasDtrmv()
* cublasCtrmv()
* cublasCgemv()
* cublasCgeru()
* cublasCgerc()
* cublasZtrmv()
* cublasZgemv()
* cublasZgeru()
* cublasZgerc()
- Added the BLAS3 functions:
* cublasCtrsm()
* cublasCtrmm()
* cublasCsyrk()
* cublasCsymm()
* cublasCherk()
* cublasZtrsm()
* cublasZtrmm()
* cublasZsyrk()
* cublasZsymm()
* cublasZherk()