Python bindings for NVIDIA NCCL
copied from cf-post-staging / nccl4pyNCCL4Py provides Pythonic access to NCCL for efficient multi-GPU and
multi-node collective communication and point-to-point transfers. It
ships low-level Cython bindings plus a high-level nccl.core API.