Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is ETCD discovery down? #9978

Closed
ankitforcode opened this issue Aug 2, 2018 · 21 comments
Closed

Is ETCD discovery down? #9978

ankitforcode opened this issue Aug 2, 2018 · 21 comments

Comments

@ankitforcode
Copy link

ankitforcode commented Aug 2, 2018

This page isn’t working
discovery.etcd.io is currently unable to handle this request.
HTTP ERROR 503
@hexfusion
Copy link
Contributor

/cc @gyuho, @philips is this service still supported?

*   Trying 52.9.92.97...
* TCP_NODELAY set
* Connected to discovery.etcd.io (52.9.92.97) port 443 (#0)
* ALPN, offering h2
* ALPN, offering http/1.1
* successfully set certificate verify locations:
*   CAfile: /etc/ssl/certs/ca-certificates.crt
  CApath: none
* TLSv1.2 (OUT), TLS handshake, Client hello (1):
* TLSv1.2 (IN), TLS handshake, Server hello (2):
* TLSv1.2 (IN), TLS handshake, Certificate (11):
* TLSv1.2 (IN), TLS handshake, Server key exchange (12):
* TLSv1.2 (IN), TLS handshake, Server finished (14):
* TLSv1.2 (OUT), TLS handshake, Client key exchange (16):
* TLSv1.2 (OUT), TLS change cipher, Client hello (1):
* TLSv1.2 (OUT), TLS handshake, Finished (20):
* TLSv1.2 (IN), TLS handshake, Finished (20):
* SSL connection using TLSv1.2 / ECDHE-RSA-AES128-GCM-SHA256
* ALPN, server did not agree to a protocol
* Server certificate:
*  subject: CN=discovery.etcd.io
*  start date: Dec  8 00:00:00 2017 GMT
*  expire date: Jan  8 12:00:00 2019 GMT
*  subjectAltName: host "discovery.etcd.io" matched cert's "discovery.etcd.io"
*  issuer: C=US; O=Amazon; OU=Server CA 1B; CN=Amazon
*  SSL certificate verify ok.
> GET /new?size=3 HTTP/1.1
> Host: discovery.etcd.io
> User-Agent: curl/7.61.0
> Accept: */*
> 
< HTTP/1.1 503 Service Unavailable: Back-end server is at capacity
< Content-Length: 0
< Connection: keep-alive
< 
* Connection #0 to host discovery.etcd.io left intact

I think this is the LB responding.

@ankitforcode
Copy link
Author

@hexfusion I have an old production cluster that is still using the discovery endpoint and was able to use that some time back today morning. This could have been down since last few hours.

I can switch the existing cluster to use the private ip's however that could require more effort.

@hexfusion
Copy link
Contributor

hexfusion commented Aug 2, 2018

@ankitforcode sure I am just curious if this service is going to be supported moving forward. I know the package is not maintained #9836

We should remove from docs in that case IMO.

@ankitforcode
Copy link
Author

@hexfusion The url is working again. 👍

@ankitforcode
Copy link
Author

replied too early, it went down again 👎

@espala
Copy link

espala commented Aug 2, 2018

It's still down. It shouldn't take that long. There is no status page to follow.

could the service be over?

@hexfusion
Copy link
Contributor

@espala I have pinged the folks who would know. The domain etcd.io will move to the new org soon #9965 so perhaps the community will be able to support the service directly in the future. For now I am afraid I have no transparency on the matter.

@hexfusion
Copy link
Contributor

This service appears to be working as expected again. I expect further conversations will take place in the upcoming weeks regarding its future.

@ledsole
Copy link

ledsole commented Aug 6, 2018

Looks like it's down again

@strigazi
Copy link

strigazi commented Aug 8, 2018

Down again

Wed Aug 8 10:09:19 UTC 2018

@joelegasse
Copy link
Contributor

discovery.etcd.io is currently in a degraded state. We are working on getting it back online, attempting to preserve what we can. As mentioned here the site has not been maintained, and will soon be deprecated.

What kind of use cases or scenarios are people running into that can't be solved without this service? Is it just being used for convenience, or are there technical obstacles that are being hit when attempting to initialize a cluster?

@joelegasse joelegasse reopened this Aug 9, 2018
@philips
Copy link
Contributor

philips commented Aug 9, 2018

The service is operational again. Please let us know if you hit more issues.

Also, we would like feedback on why you are using discovery.etcd.io vs. static configuration as we decide how to maintain or deprecate the service overtime.

@waipeng
Copy link

waipeng commented Aug 10, 2018

Hi @philips, thanks for bringing it back!

OpenStack Magnum kubernetes cluster uses discovery.etcd.io by default when spinning up new clusters.
https://docs.openstack.org/magnum/latest/user/

@canori01
Copy link

@philips we use this at Openstack Magnum to bootstrap etcd clusters. It allows us to easily stand up the clusters without knowing the ip addresses of the nodes in advance.

@hexfusion
Copy link
Contributor

Discovery endpoint is still stuck in /v2 which is not supported. Perhaps efforts should be made to bring this into /v3? This way users can stand up there own "discovery cluster" and have it supported. In general, token generation/management is a workflow that could be handled in many different ways. But providing the capabilities seems to be a general asset to users and having that native to etcd API makes sense to me. Implemention of a "discovery cluster" via etcd-operator would probably make this all quite trivial.

If documented and supported I think there would be little reason to need the public service. But currently we give the user no supported alternative. Just my 2c.

ref: custom-etcd-discovery-service

@philips
Copy link
Contributor

philips commented Aug 15, 2018 via email

@gyuho
Copy link
Contributor

gyuho commented Aug 15, 2018

@hexfusion
Copy link
Contributor

@philips, @gyuho I will make some effort to grease the wheels on this over the weekend as the issue seems timely, and I ran my mouth about needing it fixed :). If anyone else is interested in working on this please ping me so we are can distribute efforts.

@joelegasse
Copy link
Contributor

@hexfusion I can help out, how would you like to coordinate?

@hexfusion
Copy link
Contributor

Hi @joelegasse, sounds great I guess for starters we need to review the above. I have seen the folks at grpc-go use https://reviewable.io for larger stuff. I wonder if it would be useful to collab here or just make things overly complicated.

There is a lot going on with what he already did so I guess we both need to get a general grasp of the approach and review code as we go. I expect the first part to take a fair amount of time. I plan to hit this hard over the weekend. I will email you my details and we can chat over slack if that works for you.

Looking forward to it.

@xiang90
Copy link
Contributor

xiang90 commented Aug 31, 2018

this issue is tracked here: coreos/discovery.etcd.io#61. closing

@xiang90 xiang90 closed this as completed Aug 31, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

No branches or pull requests