Add environment name in ControllerConfig #450
Comments
@wbuchwalter did you mean #451?
@jimexist Yes :) Although, not sure if the
Would it be better to use the ksonnet template, as opposed to the controller config, to handle GPU config? In the same way that we have cloud-specific parts for TFServing, we could have cloud-specific bits for TFJob, and we could use these to specify resources like the GPU resource name conditioned on the cloud. Using the operator to specify GPU config predates our use of ksonnet. Now that we are using ksonnet, it might make more sense to use ksonnet to define suitable templates for different clouds.
@wbuchwalter Any update on this? Should the different resource names just be handled in the UI? E.g. in the UI we could let users specify
What priority should we assign this issue? If it must get done and will get done for our 0.2 release (end of June), then it should be P1; otherwise P2.
Haven't worked on this yet, but let's mark it as P1.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
It would be useful to add the environment name (gke, azure, etc.) in the ControllerConfig.
For example, in the dashboard we want to be able to specify the resources.limits for master/worker. However, for GPUs, GKE is using the device plugin, so nvidia.com/gpu, while Azure is using alpha.kubernetes.io/nvidia-gpu, for example.
The dashboard could first read the environment name in the ControllerConfig and choose the appropriate resource based on this.
@jlewi thoughts?
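To make the proposal concrete, here is a minimal sketch of the lookup the dashboard could do. It is an illustration only, not the operator's or dashboard's actual code. The field name `environment`, the dict-shaped config, and the helper names are all hypothetical. Only the two resource names come from the description above.

```python
# Hypothetical mapping from environment name to GPU resource name.
# The two resource names are the ones quoted in this issue; the keys
# ("gke", "azure") are assumed spellings of the environment name.
GPU_RESOURCE_BY_ENV = {
    "gke": "nvidia.com/gpu",
    "azure": "alpha.kubernetes.io/nvidia-gpu",
}


def gpu_resource_name(controller_config: dict) -> str:
    """Return the GPU resource name for the configured environment.

    `controller_config` stands in for a parsed ControllerConfig; the
    "environment" key is an assumption, not an existing field.
    """
    env = controller_config.get("environment")
    if env not in GPU_RESOURCE_BY_ENV:
        raise ValueError(f"unknown or missing environment: {env!r}")
    return GPU_RESOURCE_BY_ENV[env]


def gpu_limits(controller_config: dict, gpu_count: int) -> dict:
    """Build the resources.limits entry for a master/worker replica."""
    return {gpu_resource_name(controller_config): gpu_count}
```

With this, a dashboard request for one GPU on GKE would produce `{"nvidia.com/gpu": 1}`, and the same request on Azure would use `alpha.kubernetes.io/nvidia-gpu` as the key. Raising on an unknown environment is one possible choice; falling back to a default resource name would be another.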