stability: make debug tools useful for wedged/underreplicated clusters #17904
Labels
S-1-stability
Severe stability issues that can be fixed by upgrading, but usually don’t resolve by restarting
Milestone
For a cluster whose kv store is not available, we can still return useful information. For example,
./cockroach node ls --insecure
could still return a list. Gossip has a lot of information.
It's easy to iterate on these things: stand up a 3x cluster, kill two nodes, see which debug tools break and think about which information they still should be able to return.
More seriously, having locked down the vmodule endpoint with a cluster setting means we're now unable to change it on a wedged cluster, something that @nvanbenschoten and I are presently unhappy about.
The text was updated successfully, but these errors were encountered: