Hello! Thanks for the nice work. I've tested nccl-tests with vccl, and got good results.
So I used vccl in my v0.13 megatron following the doc.
But I didnt see any improvements in my training. Could you tell me what I did wrong and how to fix it?
I use slurm to train my model.
16nodes with TP=4 and PP = 4.


Hello! Thanks for the nice work. I've tested nccl-tests with vccl, and got good results.
So I used vccl in my v0.13 megatron following the doc.
But I didnt see any improvements in my training. Could you tell me what I did wrong and how to fix it?
I use slurm to train my model.

16nodes with TP=4 and PP = 4.