Ubuntu Slurm Cgroups #5
Comments
Thanks for your feedback. I have just realised that those cgroup services are not provided in Ubuntu. I hadn't noticed because I don't install them in the CI tests and I do my own testing on CentOS. I have pushed a new commit to the master branch which should fix this problem by skipping this step on Debian/Ubuntu systems. Can you test the latest code in master? To be honest, I don't know how the Slurm cgroup limits will work on Ubuntu. I have never verified that the CPU/memory limits defined in Slurm are applied correctly on Ubuntu systems.
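For readers following along, here is a minimal sketch of what skipping this step on Debian/Ubuntu could look like. It is not the actual commit. The task name and the two service names are taken from the error log in this issue. The `when` condition on `ansible_os_family` is an assumption about how the skip might be written, and the role's own worker-node condition is left out.

```yaml
# Hypothetical sketch, not the real task from the role.
- name: Enable and start cgroup services if this is a worker_node
  ansible.builtin.service:
    name: "{{ item }}"
    state: started
    enabled: true
  loop:
    - cgconfig
    - cgred
  # Assumption: only run on RedHat-family hosts, because the cgconfig and
  # cgred services were not found on the Ubuntu nodes in this report.
  when: ansible_os_family == "RedHat"
```

With a condition like this, Ansible would report the task as skipped on the Ubuntu workers instead of failing with "Could not find the requested service".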
Hello, thanks for getting back to me. Below is the recent update. Maybe these would help: https://github.com/mknoxnv/ubuntu-slurm and https://blog.llandsmeer.com/tech/2020/03/02/slurm-single-instance.html. Does this work with AlmaLinux and/or Rocky Linux? I was able to get it installed on Amazon Linux, but I need to add packages to the actual OS. I can share shortly. Thanks
FYI, running on Amazon Linux:
[ec2-user@head ~]$ uname -a
If I understand correctly, the role works OK on Amazon Linux, right? Amazon Linux is based on RHEL (like CentOS), so it should work, but I haven't tested it myself.
I don't get why you get an error in task
The only explanation I can think of is that your apt cache is not updated, so the packages are not installed and thus the folder
Can you check if using the latest version in the master branch you still get an error in task
I have added a task to make sure that the apt cache is always updated.
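As an illustration only, a task that refreshes the apt cache before any packages are installed could look roughly like the sketch below. The task name and the `when` condition are assumptions. The task actually added to the role may differ.

```yaml
# Hypothetical sketch, not the real task from the role.
- name: Update the apt cache on Debian based systems
  ansible.builtin.apt:
    update_cache: true
  # Assumption: restrict the task to Debian/Ubuntu hosts, where apt exists.
  when: ansible_os_family == "Debian"
```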
Hello, much appreciated. -JG-
Hello, I had pulled down the wrong repo. That worked! ubuntu@head:
I have published a new version: https://github.com/scicore-unibas-ch/ansible-role-slurm/releases/tag/0.0.8
Hello,
I'm running your playbook on Ubuntu 20.04 LTS on AWS, and I'm getting the following errors. Would you be able to provide some guidance on how to resolve them?
TASK [slurm : Enable and start cgroup services if this is a worker_node] ****************************************************************************************************************
task path: /Users/jimmy.griffin/Desktop/AnsibleDevProject/slurm5/slurm/tasks/slurm-worker.yml:67
failed: [c1] (item=cgred) => {"ansible_loop_var": "item", "changed": false, "item": "cgred", "msg": "Could not find the requested service cgred: host"}
failed: [c2] (item=cgred) => {"ansible_loop_var": "item", "changed": false, "item": "cgred", "msg": "Could not find the requested service cgred: host"}
failed: [c1] (item=cgconfig) => {"ansible_loop_var": "item", "changed": false, "item": "cgconfig", "msg": "Could not find the requested service cgconfig: host"}
failed: [c2] (item=cgconfig) => {"ansible_loop_var": "item", "changed": false, "item": "cgconfig", "msg": "Could not find the requested service cgconfig: host"}
META: noop
META: noop
META: noop
META: noop
META: noop
TASK [slurm : Configure slurm submit hosts] *********************************************************************************************************************************************
task path: /Users/jimmy.griffin/Desktop/AnsibleDevProject/slurm5/slurm/tasks/main.yml:187
included: /Users/jimmy.griffin/Desktop/AnsibleDevProject/slurm5/slurm/tasks/slurm-submit.yml for head
TASK [slurm : Create /etc/slurm in RedHat based systems] ********************************************************************************************************************************
task path: /Users/jimmy.griffin/Desktop/AnsibleDevProject/slurm5/slurm/tasks/slurm-submit.yml:3
skipping: [head] => {"changed": false, "skip_reason": "Conditional result was False"}
TASK [slurm : Make a symlink /etc/slurm >> /etc/slurm-llnl on Debian based systems] *****************************************************************************************************
task path: /Users/jimmy.griffin/Desktop/AnsibleDevProject/slurm5/slurm/tasks/slurm-submit.yml:12
ok: [head] => {"changed": false, "dest": "/etc/slurm", "gid": 0, "group": "root", "mode": "0777", "owner": "root", "size": 15, "src": "/etc/slurm-llnl", "state": "link", "uid": 0}
TASK [slurm : Make a symlink /var/log/slurm >> /var/log/slurm-llnl on Debian based systems] *********************************************************************************************
task path: /Users/jimmy.griffin/Desktop/AnsibleDevProject/slurm5/slurm/tasks/slurm-submit.yml:21
ok: [head] => {"changed": false, "dest": "/var/log/slurm", "gid": 0, "group": "root", "mode": "0777", "owner": "root", "size": 19, "src": "/var/log/slurm-llnl", "state": "link", "uid": 0}
TASK [slurm : Deploy /etc/slurm/slurm.conf] *********************************************************************************************************************************************
task path: /Users/jimmy.griffin/Desktop/AnsibleDevProject/slurm5/slurm/tasks/slurm-submit.yml:30
ok: [head] => {"changed": false, "checksum": "d514bce8f38cb45baf7d0cf61222c56e6965bee5", "dest": "/etc/slurm/slurm.conf", "gid": 64030, "group": "slurm", "mode": "0644", "owner": "slurm", "path": "/etc/slurm/slurm.conf", "size": 4661, "state": "file", "uid": 64030}
META: role_complete for head
META: ran handlers
META: ran handlers
PLAY RECAP ******************************************************************************************************************************************************************************
c1 : ok=17 changed=0 unreachable=0 failed=1 skipped=13 rescued=0 ignored=0
c2 : ok=17 changed=0 unreachable=0 failed=1 skipped=13 rescued=0 ignored=0