Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pynta restart for NERSC and Polaris #43

Draft
wants to merge 10 commits into
base: master
Choose a base branch
from
Draft

Conversation

sakim8048
Copy link
Contributor

Previous comment from Ray (October 2023):
I have updated the way we map tasks on each node for ALCF machines. Each task runs on a different FWorker, and each FWorker is associated with a node. This is available for multilauncher. The optimal approach is to set num_jobs to the number of nodes.

Additionally, I have added functionality to use PWDFT, including the calculator and related functions.


Updated comments from Shinae (Feb 2024)
I have updated the way we restart Pynta based on how Ray implemented the restart from Polaris.
From Pynta object, machine=<Machine type> should be specified to restart Pynta. Also for NERSC and any other machines, workflow id should be added to pyn.reset() to rerun the previous workflow.
pyn.reset() is tested in Perlmutter and was able to restart from queue=True mode.

This updates include Trevor's pull request (#33), which not yet merged.


Updated comments from Shinae (March 2024)
I rebased to current master, with recent changes. Added @rayhe88 as the author. I still need to add him as an author for previous commits.

@sakim8048 sakim8048 marked this pull request as draft April 15, 2024 04:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants