nccl
Optimized primitives for collective multi-GPU communication
To install this package, run one of the following:
The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multi-node collective communication primitives that are performance-optimized for NVIDIA GPUs. NCCL provides routines such as all-gather, all-reduce, broadcast, reduce, and reduce-scatter, which are optimized to achieve high bandwidth over PCIe and NVLink high-speed interconnects.
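To make the collectives named above concrete, here is a minimal pure-Python sketch of what each one computes, assuming a sum reduction. It is not NCCL's API and does no GPU communication: each "rank" is simulated as a plain list, and the function names (`all_reduce`, `all_gather`, and so on) are hypothetical helpers chosen for illustration only.

```python
from typing import List

# One buffer per simulated rank; these helpers only model the *semantics*
# of the collectives, not NCCL's actual API or its transport over PCIe/NVLink.
Buffers = List[List[float]]


def _elementwise_sum(bufs: Buffers) -> List[float]:
    """Sum the buffers of all ranks element by element."""
    return [sum(column) for column in zip(*bufs)]


def all_reduce(bufs: Buffers) -> Buffers:
    """Every rank ends up with the element-wise sum of all buffers."""
    total = _elementwise_sum(bufs)
    return [list(total) for _ in bufs]


def broadcast(bufs: Buffers, root: int = 0) -> Buffers:
    """Every rank ends up with a copy of the root rank's buffer."""
    return [list(bufs[root]) for _ in bufs]


def reduce(bufs: Buffers, root: int = 0) -> Buffers:
    """Only the root rank receives the sum; other ranks are left unchanged."""
    total = _elementwise_sum(bufs)
    return [list(total) if rank == root else list(buf)
            for rank, buf in enumerate(bufs)]


def all_gather(bufs: Buffers) -> Buffers:
    """Every rank ends up with all buffers concatenated in rank order."""
    gathered = [value for buf in bufs for value in buf]
    return [list(gathered) for _ in bufs]


def reduce_scatter(bufs: Buffers) -> Buffers:
    """The summed buffer is split into equal chunks, one chunk per rank.

    Assumes the buffer length is divisible by the number of ranks.
    """
    n_ranks = len(bufs)
    total = _elementwise_sum(bufs)
    chunk = len(total) // n_ranks
    return [total[rank * chunk:(rank + 1) * chunk] for rank in range(n_ranks)]


if __name__ == "__main__":
    ranks = [[1.0, 2.0], [3.0, 4.0]]  # two simulated ranks
    print("all_reduce    :", all_reduce(ranks))
    print("all_gather    :", all_gather(ranks))
    print("reduce_scatter:", reduce_scatter(ranks))
```

In this model, an all-reduce gives the same result as a reduce-scatter followed by an all-gather, which is a useful way to reason about how the primitives relate to each other.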
Summary
Optimized primitives for collective multi-GPU communication
Information Last Updated
Mar 25, 2025 at 16:24
License
BSD-3-Clause
Total Downloads
2.6K
Platforms
GitHub Repository
https://github.com/NVIDIA/nccl