-
Notifications
You must be signed in to change notification settings - Fork 154
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a3u-slurm-ubuntu-gcs blueprint #3435
Add a3u-slurm-ubuntu-gcs blueprint #3435
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A couple first comments.
examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/a3u-slurm-ubuntu-gcs.yaml
Outdated
Show resolved
Hide resolved
examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/a3u-slurm-ubuntu-gcs.yaml
Show resolved
Hide resolved
45fd00b
to
903d735
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This PR was successfully tested against v1.44.0 of the Toolkit including comparison of NCCL tests run by Ramble.
--- SUMMARY for 8GB Message Sizes --
looked accurate by eye.
Removing explicit listing of TF modules Co-authored-by: Tom Downes <[email protected]>
903d735
to
91b63b5
Compare
c49a1f2
into
GoogleCloudPlatform:a3ultra-preview
Submission Checklist
NOTE: Community submissions can take up to 2 weeks to be reviewed.
Please take the following actions before submitting this pull request.