GitHub - NVIDIA/nccl: Optimized primitives for collective multi-GPU communication
Optimized primitives for collective multi-GPU communication - NVIDIA/nccl