-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update POC Readme #219
Comments
Interesting, I just ran through this myself last week. And I came up with some of the things you mentioned:
But otherwise I had everything come up nicely. Do you know why your GW was unable to configure correctly? |
Note that we have an
The gateway never gets an external IP: $ kubectl apply -f ./pkg/manifests/gateway.yaml
gateway.gateway.networking.k8s.io/inference-gateway created
gatewayclass.gateway.networking.k8s.io/inference-gateway created
backend.gateway.envoyproxy.io/backend-dummy created
httproute.gateway.networking.k8s.io/llm-route created
$ kubectl get gtw
NAME CLASS ADDRESS PROGRAMMED AGE
inference-gateway inference-gateway False 5s
$ get svc -n envoy-gateway-system
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
envoy-default-inference-gateway-6454a873 LoadBalancer 10.96.197.219 <pending> 8080:32455/TCP,8081:31502/TCP 27s
envoy-gateway ClusterIP 10.96.96.73 <none> 18000/TCP,18001/TCP,18002/TCP,19001/TCP 5h34m To resolve this issue, either MetalLB or cloud-provider-kind becomes a dependency. |
The POC readme causes the following error when trying to install the example InferencePool and InferenceModel custom resources (
../examples/poc/manifests/inferencepool-with-model.yaml
):The CRDs must be installed before creating instances of each custom resource. Additionally, the doc should state that commands should be run from the repo root and remove the
../
.The example gateway manifest fails to install:
$ kubectl apply -f manifests/gateway.yaml error: the path "manifests/gateway.yaml" does not exist
The path needs to be updated:
The same goes for the
ext_proc.yaml
andpatch_policy.yaml
:The path needs to be updated:
The gateway never reaches a "Ready" state:
Note that I am using a kind cluster to run the POC.
The text was updated successfully, but these errors were encountered: