nccl
Optimized primitives for collective multi-GPU communication
Optimized primitives for collective multi-GPU communication
To install this package, run one of the following:
NCCL (pronounced "Nickel") is a stand-alone library of standard collective communication routines, such as all-gather, reduce, broadcast, etc., that have been optimized to achieve high bandwidth over PCIe. NCCL supports an arbitrary number of GPUs installed in a single node and can be used in either single- or multi-process (e.g., MPI) applications.
Summary
Optimized primitives for collective multi-GPU communication
Last Updated
Jun 21, 2018 at 18:47
License
BSD 3-Clause
Total Downloads
572
Supported Platforms