Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: Use hostnetwork daemonset for slurmd #16

Closed
wants to merge 4 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion docker-entrypoint.sh
Original file line number Diff line number Diff line change
Expand Up @@ -78,7 +78,7 @@ then
echo "-- slurmctld is now active ..."

echo "---> Starting the Slurm Node Daemon (slurmd) ..."
exec /usr/sbin/slurmd -F -Dvvv
exec /usr/sbin/slurmd -Z -Dvvv
fi

if [ "$1" = "login" ]
Expand Down
4 changes: 0 additions & 4 deletions slurm-cluster-chart/files/slurm.conf
Original file line number Diff line number Diff line change
Expand Up @@ -47,12 +47,8 @@ AccountingStorageType=accounting_storage/slurmdbd
AccountingStorageHost=slurmdbd
AccountingStoragePort=6819
#
SlurmctldParameters=cloud_dns,cloud_reg_addrs
CommunicationParameters=NoAddrCache

# NODES
MaxNodeCount=10
NodeName=slurmd-[0-9] State=FUTURE

# PARTITIONS
PartitionName=all Default=yes Nodes=ALL
Expand Down
14 changes: 3 additions & 11 deletions slurm-cluster-chart/templates/slurmd-deployment.yaml
Original file line number Diff line number Diff line change
@@ -1,33 +1,23 @@
apiVersion: apps/v1
kind: StatefulSet
kind: DaemonSet
metadata:
creationTimestamp: null
labels:
app.kubernetes.io/name: slurm
app.kubernetes.io/component: slurmd
name: slurmd
spec:
replicas: {{ .Values.replicas.slurmd }}
selector:
matchLabels:
app.kubernetes.io/name: slurm
app.kubernetes.io/component: slurmd
serviceName: slurmd
template:
metadata:
creationTimestamp: null
labels:
app.kubernetes.io/name: slurm
app.kubernetes.io/component: slurmd
spec:
topologySpreadConstraints:
- maxSkew: 1
whenUnsatisfiable: ScheduleAnyway
topologyKey: kubernetes.io/hostname
labelSelector:
matchLabels:
app.kubernetes.io/name: slurm
app.kubernetes.io/component: slurmd
containers:
- args:
- slurmd
Expand All @@ -47,6 +37,8 @@ spec:
subPath: munge.key
securityContext:
privileged: true
hostNetwork: true
dnsPolicy: ClusterFirstWithHostNet
dnsConfig:
searches:
- slurmd.default.svc.cluster.local
Expand Down
2 changes: 1 addition & 1 deletion slurm-cluster-chart/values.yaml
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
sdcImage: ghcr.io/stackhpc/slurm-docker-cluster:c12d04e
sdcImage: ghcr.io/stackhpc/slurm-docker-cluster:104546b

replicas:
slurmd: 2
Expand Down