GitHub - NVIDIA/nccl: Optimized primitives for collective multi-GPU communication

Optimized primitives for collective multi-GPU communication - NVIDIA/nccl