You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
I have six H100 nodes,and each with 8*400Gb cx7 nics. And for RDMA, I use RoCE. I want to see the overall throughout.
about allreduce, it seems that the params effect little,and the busbw is the overall throughout?
abou all2all,the params effect large,as follows:
and for all2all,the busbw is for single node or something else?How can I calculate the overall throughout?I can not understand deeply about the busbw for all2all,and what params are the best to test alltoall?the performence will down with the same config when add more node
thanks!
The text was updated successfully, but these errors were encountered:
Hi,
I have six H100 nodes,and each with 8*400Gb cx7 nics. And for RDMA, I use RoCE. I want to see the overall throughout.
about allreduce, it seems that the params effect little,and the busbw is the overall throughout?
abou all2all,the params effect large,as follows:
and for all2all,the busbw is for single node or something else?How can I calculate the overall throughout?I can not understand deeply about the busbw for all2all,and what params are the best to test alltoall?the performence will down with the same config when add more node
thanks!
The text was updated successfully, but these errors were encountered: