An implementation of BLAS (Basic Linear Algebra Subprograms) on top of the NVIDIA CUDA runtime.
copied from cf-post-staging / libcublas-staticThe cuBLAS Library provides a GPU-accelerated implementation of the basic linear algebra subroutines (BLAS).