The bitsandbytes library is a lightweight Python wrapper around custom CUDA functions, in particular 8-bit optimizers, matrix multiplication (LLM.int8()), and 8-bit and 4-bit quantization functions.
To install from the rocketce channel:
conda install rocketce::bitsandbytes
To install from the rocketce-1.11.5 label of that channel:
conda install rocketce/label/rocketce-1.11.5::bitsandbytes
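
For readers unfamiliar with the 8-bit quantization mentioned in the description, the following is a minimal pure-Python sketch of the general idea (absmax scaling to signed 8-bit integers). It is an illustration only: it does not call bitsandbytes, it is not the library's implementation, and the function names `quantize_absmax` and `dequantize` are hypothetical helpers introduced here.

```python
# Illustrative sketch of absmax 8-bit quantization in plain Python.
# This does NOT use bitsandbytes; the helper names are hypothetical.

def quantize_absmax(values):
    """Map floats to integers in [-127, 127] using one shared scale."""
    absmax = max((abs(v) for v in values), default=0.0)
    if absmax == 0.0:
        # All zeros (or empty input): any scale works, so use 1.0.
        return [0] * len(values), 1.0
    scale = absmax / 127.0
    codes = [round(v / scale) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate floats from the integer codes."""
    return [c * scale for c in codes]


weights = [0.5, -1.0, 0.25, 0.0]
codes, scale = quantize_absmax(weights)
restored = dequantize(codes, scale)

print(codes)
print(restored)
```

Each restored value differs from the original by at most half of `scale`, which is the precision given up in exchange for storing one byte per value instead of a full-width float.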