cli,server: disable latency jump detection with start-single-node
and/or when using Docker on macOS
#98066
Labels
A-logging
In and around the logging infrastructure.
A-observability-inf
B-os-macos
Issues specific to macOS.
C-bug
Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.
S-3-ux-surprise
Issue leaves users wondering whether CRDB is behaving properly. Likely to hurt reputation/adoption.
Describe the problem
When running inside a Docker container on macOS, the TCP stack has very irregular latencies. This causes spurious log messages for folk who are exploring / testing using
cockroach start-single-node
.Like this:
To Reproduce
run
cockroach start-single-node
on macOS via the Docker image.Expected behavior
We should avoid the spurious warnings in that case.
Note that this logging mechanism is otherwise desirable in production clusters. A latency jump from, say, 70ms to 242ms between nodes, or on a node connecting to itself via the loopback interface, is a MAJOR operational event and should be reported.
The anomalous situation here is caused by macOS. So if we make the situation better in that case, that should not be at the expense of proper network observability for everyone else.
Jira issue: CRDB-25060
The text was updated successfully, but these errors were encountered: