Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

stability: make debug tools useful for wedged/underreplicated clusters #17904

Closed
tbg opened this issue Aug 24, 2017 · 2 comments
Closed

stability: make debug tools useful for wedged/underreplicated clusters #17904

tbg opened this issue Aug 24, 2017 · 2 comments
Labels
S-1-stability Severe stability issues that can be fixed by upgrading, but usually don’t resolve by restarting
Milestone

Comments

@tbg
Copy link
Member

tbg commented Aug 24, 2017

For a cluster whose kv store is not available, we can still return useful information. For example,

./cockroach node ls --insecure

could still return a list. Gossip has a lot of information.

It's easy to iterate on these things: stand up a 3x cluster, kill two nodes, see which debug tools break and think about which information they still should be able to return.

More seriously, having locked down the vmodule endpoint with a cluster setting means we're now unable to change it on a wedged cluster, something that @nvanbenschoten and I are presently unhappy about.

@tbg tbg added the S-1-stability Severe stability issues that can be fixed by upgrading, but usually don’t resolve by restarting label Aug 24, 2017
@tbg tbg added this to the 1.2 milestone Aug 24, 2017
@a-robinson
Copy link
Contributor

+1 to this.

Related issues:
#16489
#15342

Gossip has a lot of information.

We do have ./cockroach debug gossip-values, in case you weren't aware.

@tbg
Copy link
Member Author

tbg commented Aug 24, 2017

I think these other issues are good enough, thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
S-1-stability Severe stability issues that can be fixed by upgrading, but usually don’t resolve by restarting
Projects
None yet
Development

No branches or pull requests

2 participants