A collective communications library
conda install mark.harfouche::gloo
It comes with a number of collective algorithms useful for machine learning applications. These include a barrier, broadcast, and allreduce.