-
Notifications
You must be signed in to change notification settings - Fork 414
Kube UI not working #1367
Comments
I am also having a problem with the UI on AWS. But, instead of the 500 error, I am getting a |
@ryane I also get that, I think it's something to do with the kubeworker. Sometimes it comes up other times not really. |
This appears to be the source of the problem (at least on AWS):
On AWS,
Open questions:
|
The Kubernetes integration brings in some older code that the core team didn't write. It's entirely possible that I missed something when upgrading it. #1374 seems like the best solution, it does need to be resolvable. |
I'm thinking we should also change the service file to use Any idea why DNS is not working for the internal AWS hostname? |
Yes, sorry, that was my intention: we should mirror the behavior of the
other PR.
As for the DNS failure, that’s bizarre. Have you tried putting DNSMasq
into debug mode and watching as you try to resolve? It’s got pretty
good logging for dealing with these kinds of issues.
|
Is it that queries are not getting forwarded to the internal dns servers (only consul) for some reason? On GCE, I see something like this in the logs:
|
Is that when you try and resolve |
RE: #1346 (comment) ah! My cluster is down now but I'll try that in a bit. |
I guess the internal aws nameservers just don't resolve the short |
The nameservers don't resolve it because I do see that dnsmasq forwarded the IP to the upstream nameserver in the log:
But it doesn't have any information about how it applied the search paths, which are clearly not being applied if the query is failing. This is turning into two different issues at this point. We need to change the Kubelet hostname to the inventory name instead of the IP, but we also need to fix this search config in dnsmasq. |
@ryane I am happy if you close this issue I think if the search configuration is updated it will also work in Vagrant |
After applying 1374, I can bring up that endpoint. |
I get this when I spin up a new cluster.
Internal Server Error (500)
Get https://10.254.0.1:443/api/v1/replicationcontrollers: dial tcp 10.254.0.1:443: getsockopt: connection refused
- Ansible version (1.9.4): - Python version (2.79): - Git commit hash or branch: a86bf60 - Cloud Environment: Vagrant - Terraform version (0.6.4.11):
The text was updated successfully, but these errors were encountered: