Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Per rank mlflow tracking location and updating fcn_afno config for easier benchmarking #121

Merged
merged 1 commit into from
Oct 6, 2023

Conversation

akshaysubr
Copy link
Collaborator

Modulus Pull Request

Description

Small PR that adds:

  • per rank mlflow tracking location to fix multi-gpu race conditions
  • small updates to fcn_afno config for easier benchmarking

Closes #66

Checklist

Dependencies

…dating fcn_afno config for easier benchmarking

Signed-off-by: Akshay Subramaniam <[email protected]>
@akshaysubr akshaysubr requested a review from NickGeneva October 6, 2023 04:57
@NickGeneva
Copy link
Collaborator

/blossom-ci

1 similar comment
@NickGeneva
Copy link
Collaborator

/blossom-ci

@NickGeneva NickGeneva merged commit ef94792 into NVIDIA:main Oct 6, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

🐛[BUG]: MLFlow Logging Typically fails for Multi-process
2 participants