NVIDIA Collective Communications Library. Implements multi-GPU and multi-node collective communication primitives for NVIDIA GPUs.
NCCL (pronounced "Nickel") is a stand-alone library of standard collective communication routines for GPUs, implementing all-reduce, all-gather, reduce, broadcast, and reduce-scatter.