You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Paper states that "We train our model with 32 NVIDIA Tesla V100 GPUs in a batch size of 1024", but it doesn't tell how long the pretraining takes in this setting. Could you tell me the pretraining cost?
The text was updated successfully, but these errors were encountered:
Paper states that "We train our model with 32 NVIDIA Tesla V100 GPUs in a batch size of 1024", but it doesn't tell how long the pretraining takes in this setting. Could you tell me the pretraining cost?
The text was updated successfully, but these errors were encountered: