Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request. Searching for pre-existing feature requests helps us consolidate datapoints for identical requirements into a single place, thank you!
Please do not leave "+1" or other comments that do not add relevant new information or questions, they generate extra noise for issue followers and do not help prioritize the request.
If you are interested in working on this issue or have submitted a pull request, please leave a comment.
Overview of the Issue
When using a dedicated nodepool called consulpool1 and making use of the nodeSelector for clients and servers, the initial deployment fails as the Consul Federation Secret pod does not successfully complete. Upon further investigation, it appears that this pod is scheduled on the default system nodepool and not the dedicated nodepool: consulpool1.
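For readers trying to picture the setup, a values.yaml of roughly the following shape would pin servers and clients to the dedicated pool. This is only a sketch and not the file used in this report: the `agentpool` label key is an assumption (the actual label depends on the cloud provider and how the nodepool was created), and it relies on the chart accepting `nodeSelector` as a multi-line YAML string.

```yaml
# Hypothetical values.yaml fragment, for illustration only.
# "agentpool" is an assumed node label key; substitute the label
# that your nodes in consulpool1 actually carry.
server:
  nodeSelector: |
    agentpool: consulpool1
client:
  nodeSelector: |
    agentpool: consulpool1
```

With values like these, only the server and client pods receive a node selector; any pod whose template does not read one of these values is free to land on another pool.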
Reproduction Steps
Error: INSTALLATION FAILED: failed post-install: timed out waiting for the condition
Logs
output from kubectl logs consul-create-federation-secret-mffk4:
2022-03-21T21:12:29.179Z [ERROR] Error retrieving current datacenter, retrying: err="Get "https://192.168.32.4:8501/v1/agent/self": dial tcp 192.168.32.4:8501: connect: connection refused"
@hashildy, thank you for bringing this to our attention and finding a potential cause. With a cursory look, I don't see any reason why we wouldn't be able to modify the job you mentioned so that it uses the nodePool value. I will confirm this with my team and if we are able to make this change, we will.
Expected behavior
The expected behavior is that deploying clients and servers to a dedicated nodepool should not result in a failed deployment. It looks like the create federation secret job does not check the value of the nodeSelector for the client: https://github.com/hashicorp/consul-k8s/blob/main/charts/consul/templates/create-federation-secret-job.yaml. Is it possible to implement code similar to the following so that this value is checked before this pod is scheduled? https://github.com/hashicorp/consul-k8s/blob/main/charts/consul/templates/client-snapshot-agent-deployment.yaml#L211-L213.
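To make the request concrete, the change might look something like the fragment below, placed in the pod spec of the job template. This is a sketch modeled on the nodeSelector pattern used elsewhere in the chart, not a copy of the referenced lines and not a tested patch; whether the job should follow `client.nodeSelector` or some other value is for the maintainers to decide.

```yaml
# Sketch only: fragment for the Job's pod spec (spec.template.spec)
# in create-federation-secret-job.yaml.
# Assumes client.nodeSelector is a multi-line YAML string in values.yaml,
# so it is rendered with tpl and re-indented to sit under nodeSelector.
      {{- if .Values.client.nodeSelector }}
      nodeSelector:
        {{ tpl .Values.client.nodeSelector . | indent 8 | trim }}
      {{- end }}
```

When `client.nodeSelector` is unset, the block renders nothing, so existing installs that do not use a dedicated nodepool would be unaffected.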
Environment details
consul-k8s version: chart version 0.41.1, app version 1.11.3
values.yaml used to deploy the helm chart: see above
Additionally, please provide details regarding the Kubernetes Infrastructure, as shown below: