An implementation of BLAS (Basic Linear Algebra Subprograms) on top of the NVIDIA CUDA runtime.
conda install main::libcublas-dev
The cuBLAS library provides a GPU-accelerated implementation of the Basic Linear Algebra Subprograms (BLAS).