File name: Multi-GPU Training with NCCL
File size: 453 KB
File format: PDF
Updated: 2023-06-27 04:30:43
GPU, AI, deep learning, NVIDIA, parallel computing
Multi-GPU deep learning training with NCCL, covering both multi-node multi-GPU and single-node multi-GPU setups. NCCL provides optimized inter-GPU communication for DL and HPC. It is optimized for all NVIDIA platforms, most OEMs, and cloud. It scales to 100s of GPUs, targeting 10,000s in the near future. It aims to cover all communication needs for multi-GPU computing. It relies only on CUDA, with no dependency on MPI or any parallel environment.
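To make "inter-GPU communication" concrete, here is a minimal pure-Python sketch of a sum all-reduce, the collective that data-parallel training typically uses to combine gradients across GPUs. It does not call NCCL or CUDA. The ring schedule, the function name `ring_all_reduce`, and the list-of-lists stand-in for per-GPU buffers are all assumptions made for illustration; the description above does not say which algorithm NCCL uses internally.

```python
def ring_all_reduce(buffers):
    """Simulate a sum all-reduce over a ring of ranks (illustrative only).

    `buffers` is a list with one equal-length list of numbers per simulated
    GPU ("rank"). Returns new per-rank lists, each holding the element-wise sum.
    """
    n = len(buffers)
    length = len(buffers[0])
    if any(len(b) != length for b in buffers):
        raise ValueError("all ranks must hold buffers of equal length")
    if length % n != 0:
        # Simplifying assumption of this sketch, not a property of NCCL.
        raise ValueError("buffer length must be divisible by the number of ranks")

    chunk = length // n
    data = [list(b) for b in buffers]  # work on copies

    def indices(c):
        # Element indices that belong to chunk c.
        return range(c * chunk, (c + 1) * chunk)

    def exchange(chunk_of, combine):
        # Every rank sends one chunk to its right-hand neighbour "at once":
        # snapshot all payloads first, then apply them.
        sends = []
        for r in range(n):
            c = chunk_of(r)
            sends.append((r, c, [data[r][i] for i in indices(c)]))
        for r, c, payload in sends:
            dst = (r + 1) % n
            for off, i in enumerate(indices(c)):
                data[dst][i] = combine(data[dst][i], payload[off])

    # Phase 1, reduce-scatter: after n-1 steps, rank r holds the fully
    # summed chunk (r + 1) % n.
    for s in range(n - 1):
        exchange(lambda r, s=s: (r - s) % n, lambda old, new: old + new)

    # Phase 2, all-gather: circulate the finished chunks so that every
    # rank ends up with every fully summed chunk.
    for s in range(n - 1):
        exchange(lambda r, s=s: (r + 1 - s) % n, lambda old, new: new)

    return data


if __name__ == "__main__":
    # Two simulated GPUs, each holding a 4-element "gradient".
    grads = [[1, 2, 3, 4], [10, 20, 30, 40]]
    print(ring_all_reduce(grads))  # [[11, 22, 33, 44], [11, 22, 33, 44]]
```

In this sketch each rank sends only one chunk per step, so per-rank traffic stays roughly constant as ranks are added. That is one common motivation for ring-style schedules, though a real library's choice of algorithm may differ.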