Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PBS job is detected, but nodes do not power on #105

Open
daantreurniet opened this issue Jun 5, 2021 · 2 comments
Open

PBS job is detected, but nodes do not power on #105

daantreurniet opened this issue Jun 5, 2021 · 2 comments

Comments

@daantreurniet
Copy link

Dear team,

I am trying to set up CLUES for a HPC using Torque with PBS as queue manager. After some configurating, I managed to start the CLUES server succesfully with the PowerOn_Requests scheduler enabled. I can also see that the queue is detected, because when I submit a new job, it is shown in the CLUES server output. However, after the job is detected, no nodes are powered on to start working. All nodes stay powered off and the job remains in the queue, waiting for resources. I tested the poweron and poweroff commands via the clues CLI (I'm using IPMI to command the nodes) and that is working properly.

Any idea what the problem could be?
I hope this project is still being monitored, as it would be very useful!

Best regards

@micafer
Copy link
Member

micafer commented Jun 14, 2021

Dear @daantreurniet,

Could you attach the clues logs to check what can be the problem?

Best regards.

@daantreurniet
Copy link
Author

When starting cluesserver manually without specifying a Log file, I get the following output. It keeps going with messages similar to the last 6 in the log file.
log.txt

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants