Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Caikit Serving Runtime with REST #2113

Merged
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
47 changes: 25 additions & 22 deletions manifests/modelserving/caikit-ootb.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -17,30 +17,33 @@ objects:
- apiVersion: serving.kserve.io/v1alpha1
kind: ServingRuntime
metadata:
name: caikit-runtime
annotations:
openshift.io/display-name: Caikit TGIS ServingRuntime for KServe
labels:
opendatahub.io/dashboard: 'true'
name: caikit-tgis-runtime
annotations:
openshift.io/display-name: Caikit TGIS ServingRuntime for KServe
labels:
opendatahub.io/dashboard: 'true'
spec:
multiModel: false
supportedModelFormats:
- autoSelect: true
name: caikit
containers:
- env:
- name: kserve-container
image: text-generation-inference
command:
- text-generation-launcher
args:
- --model-name=/mnt/models/artifacts/
env:
- name: TRANSFORMERS_CACHE
value: /tmp/transformers_cache
- name: transformer-container
image: caikit-tgis-serving
env:
- name: RUNTIME_LOCAL_MODELS_DIR
value: /mnt/models
image: caikit-1
name: kserve-container
ports:
- containerPort: 8085
name: h2c
protocol: TCP
resources:
limits:
cpu: '0'
memory: 0Gi
requests:
cpu: '0'
memory: 0Gi
multiModel: false
supportedModelFormats:
- autoSelect: true
name: caikit
- containerPort: 8080
name: h2c
protocol: TCP
parameters: []
9 changes: 6 additions & 3 deletions manifests/modelserving/kustomization.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -10,6 +10,9 @@ images:
- name: ovms-1
newName: quay.io/opendatahub/openvino_model_server
digest: sha256:2cbe8a48ab0bc6fe7fb76919bf33253e83a6218a9c4b486b744c3dcf30679616
- name: caikit-1
newName: quay.io/opendatahub/caikit-tgis-serving:stable
digest: sha256:794adc22d52cb3ac4b5aadfb286e8431cca829acdc4909719329cf8c4fabb4ec
- name: caikit-tgis-serving
newName: quay.io/opendatahub/caikit-tgis-serving
digest: sha256:9d6e8bc10f3b61774f9d321790f8593f6ded5674b64a366248fcde6a07d66833
- name: text-generation-inference
newName: quay.io/opendatahub/text-generation-inference
digest: sha256:54959e3dc0e3bb139e9db5b912143a7dedcef725c55db58cfef0e71dd057e4a5
Loading