[Bug]: Accelerator enablement in kserve is not working #2244
Labels
feature/model-serving
Model Serving Feature
kind/bug
Something isn't working
priority/blocker
Critical issue that needs to be fixed asap; blocks up coming releases
rhods-2.5
Is there an existing issue for this?
Deploy type
OpenDataHub core version (eg.
v1.6.0
)Version
2.5.0
Current Behavior
We are currently assigning gpu resources to all the containers in a ServingRuntime spec for KServe, this has been ok for
Modelmesh
but it's creating an issue where we add more resources than the cluster might have.Expected Behavior
The outcome will be the following:
For modelmesh
Keep the same flow and get the creation of
InferenceServices
andServingRuntimes
as it is right nowFor kserve
spec.predictor.tolerations
such as this examplespec.predictor-model-resources
sectionSteps To Reproduce
Workaround (if any)
No response
What browsers are you seeing the problem on?
No response
Anything else
No response
The text was updated successfully, but these errors were encountered: