The nvidia-device-plugin label GPU node automatically? #70

pytimer · 2018-08-30T11:35:18Z

Hi,

I deploy the nvidia-device-plugin on my kubernetes cluster. But on my cluster, different nodes have different GPU type, i hope my pod can deploy the specify GPU type node.

Now i must label GPU on the node manually, i hope nvidia-device-plugin can label GPU type automatically.

Is it reasonable?

flx42 · 2018-08-30T15:57:11Z

There is a plan in the Kubernetes community: kubernetes/community#2265
The problem with a per-node label is that it wouldn't work for heterogeneous nodes (different GPU types on the same node). In addition, injecting those labels from the device plugin today would require RBAC and the deployment would be more complex.

pytimer · 2018-09-02T03:12:37Z

Thanks for your reply.

I know. But now if i want to implement my case, Should i must be label GPU type on the node?

cliffburdick · 2018-09-06T14:28:23Z

@flx42 we're interested in this too. Can the nvidia agent on each node report to the kubelet or API server how many GPUs and of what type there are? The node label seems restrictive since it's static and the amount of each isn't tracked.

Bharathkumarraju · 2018-09-13T07:12:13Z

I did this with some terraform varible with this simple command, I did node-labeling

sed -i '/\/usr\/bin\/kubelet/ a \ \ --node-labels nodetype=${var.instance-label} \\' /etc/systemd/system/kubelet.service

pytimer · 2018-09-13T13:41:04Z

@Bharathkumarraju which program add --node-labels value to kubelet? Your own program or when you deploy this node?

Now i write a daemonset to do it, when i deploy a GPU pod, i have to choose a node through my program, this program add nodeSelector to yaml.

I think it is not good, but i don't know other methods to do it now.

RenaudWasTaken · 2019-05-12T05:38:25Z

This is now possible with the GPU feature discovery: https://github.com/NVIDIA/gpu-feature-discovery

RenaudWasTaken closed this as completed May 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The nvidia-device-plugin label GPU node automatically? #70

The nvidia-device-plugin label GPU node automatically? #70

pytimer commented Aug 30, 2018

flx42 commented Aug 30, 2018

pytimer commented Sep 2, 2018

cliffburdick commented Sep 6, 2018

Bharathkumarraju commented Sep 13, 2018

pytimer commented Sep 13, 2018

RenaudWasTaken commented May 12, 2019

The nvidia-device-plugin label GPU node automatically? #70

The nvidia-device-plugin label GPU node automatically? #70

Comments

pytimer commented Aug 30, 2018

flx42 commented Aug 30, 2018

pytimer commented Sep 2, 2018

cliffburdick commented Sep 6, 2018

Bharathkumarraju commented Sep 13, 2018

pytimer commented Sep 13, 2018

RenaudWasTaken commented May 12, 2019