connect timed out with cluster-manager and api server #523
Comments
This seems to suggest the cluster-manager pod can't connect to the k8s API. Is there anything special about the cluster you are running on? Does it allow k8s API access from your namespace or does the RBAC of your cluster disallow this?
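One way to narrow this down is to check plain TCP reachability of the API server from inside the pod. A connect timeout usually points at the network path, whereas an RBAC denial normally shows up as an HTTP 403 after the connection has succeeded. The following is a minimal sketch in Python, not Seldon's or the Java SDK's own code. The helper names `api_server_address` and `can_connect` are hypothetical; the `KUBERNETES_SERVICE_HOST` and `KUBERNETES_SERVICE_PORT` variables are the ones Kubernetes injects into pods.

```python
import os
import socket


def api_server_address(env=os.environ):
    """Return (host, port) of the API server as advertised to the pod, or None."""
    # Kubernetes injects these two variables into every pod's environment.
    host = env.get("KUBERNETES_SERVICE_HOST")
    port = env.get("KUBERNETES_SERVICE_PORT")
    if not host or not port:
        return None
    return host, int(port)


def can_connect(host, port, timeout=3.0):
    """Attempt a plain TCP connect; True on success, False on timeout or refusal."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers timeouts, refused connections and unreachable networks.
        return False


if __name__ == "__main__":
    addr = api_server_address()
    if addr is None:
        print("KUBERNETES_SERVICE_* not set; probably not running inside a pod")
    elif can_connect(*addr):
        print("TCP connect to the API server succeeded")
    else:
        print("TCP connect to the API server failed or timed out")
```

If the probe fails from the cluster-manager pod's namespace but succeeds elsewhere, a network policy or firewall rule is a more likely cause than the role or rolebinding.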
Thanks for your reply! The cluster is fine: we already have a custom scheduler, written with the client-go SDK, running on it. The corresponding role is:
I guess the k8s Java SDK uses the token mounted at /var/run/secrets/kubernetes.io/serviceaccount
Yes, it will be using the seldon-local RBAC and the default token that k8s adds to the pod. Is your cluster set up with any restrictions?
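For reference, in-cluster clients generally combine the mounted service account files with the API server address from the pod environment. The sketch below illustrates that convention in Python. It is an illustration only, not the Java SDK's implementation: the function name `load_in_cluster_credentials` and the shape of the returned dictionary are hypothetical, and the URL formatting assumes an IPv4 host. The `token`, `namespace` and `ca.crt` file names under the mount directory are the standard ones.

```python
import os
from pathlib import Path

# Default mount point of the service account volume inside a pod.
SA_DIR = Path("/var/run/secrets/kubernetes.io/serviceaccount")


def load_in_cluster_credentials(sa_dir=SA_DIR, env=os.environ):
    """Collect what a client needs to talk to the API server from inside a pod.

    Raises FileNotFoundError or KeyError when not running in a pod.
    """
    sa_dir = Path(sa_dir)
    token = (sa_dir / "token").read_text().strip()
    namespace = (sa_dir / "namespace").read_text().strip()
    host = env["KUBERNETES_SERVICE_HOST"]
    port = env["KUBERNETES_SERVICE_PORT"]
    return {
        # Assumes an IPv4 host; an IPv6 address would need brackets.
        "server": f"https://{host}:{port}",
        "namespace": namespace,
        # CA bundle used to verify the API server's TLS certificate.
        "ca_file": str(sa_dir / "ca.crt"),
        # The token is sent as a bearer token on every request.
        "headers": {"Authorization": f"Bearer {token}"},
    }
```

Calling `load_in_cluster_credentials()` in the affected pod and checking that the `server` value is the address that times out can confirm whether the client is targeting the expected endpoint.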
Please reopen if this is still an issue on the 0.4.0 release.
* refactor method name
* add conn close
* add signal handler
* change const name
* add msg for agent
* lint
* remove const
* agent protos for new message
* send drain event to scheduler
* http drainer server
* add test
* wire up drainer service in agent
* fix test
* fix flaky test
* add dummy handler on scheduler
* add env variable
* tidy up envs for compose
* refactor variable name
* fix lint
* add scheduler logic for drain
* add model waiter helper
* Add test for model waiter
* signal model
* adjust test
* add Draining state
* add isDraining state to Server Replica
* update memory to mark server replica as draining
* fix state updates with draining
* add Draining to function return
* add filter for server is draining
* add scheduler test
* lint
* add extra wait
* skip draining
* prefer available replicas over draining
* wait on other replica is available
* add note
* update stats with draining
* reviews
* update to model state if schedulefailed previously
* restore state-> string test
* PR reviews
I followed the Helm instructions to install Seldon Core, but got the following error:
I've checked the role/rolebinding/sa, and they are as expected from the Helm charts.
Any suggestions, folks?
BTW, the command I use is: