Export time-series metrics re: LSM state broken down by level #65276
Labels
A-storage
Relating to our storage engine (Pebble) on-disk storage.
C-enhancement
Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)
O-sre
For issues SRE opened or otherwise cares about tracking.
T-storage
Storage Team
Stats like count & size make fantastic time-series metrics:
My understanding is that the number of levels doesn't grow forever, so we can break the metrics down by level without worrying about cardinality limits.
It'll be much quicker to make basic sense of what is going on with such metrics. Logs are good too, but metrics can be read and compared much faster. It is also easier to correlate such metrics with impact-based metrics (SQL latencies or kvprober error rate) than it is to do so with logs.
@petermattis points out that Pebble exports such metrics even though CRDB doesn't export them: https://github.com/cockroachdb/pebble/blob/master/metrics.go#L125.
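To make the request concrete, here is a rough sketch of what a per-level export could look like: one sample per level per stat, with the level as a label. Everything named here is an assumption for illustration only. `levelStats` is a hypothetical stand-in, not Pebble's actual type, and the metric names `storage_lsm_level_files` and `storage_lsm_level_bytes` are made up rather than existing CRDB metrics. The output uses the Prometheus text exposition style.

```go
package main

import (
	"fmt"
	"strings"
)

// levelStats is a hypothetical stand-in for the per-level stats a storage
// engine reports (file count and total bytes). It is not Pebble's real type.
type levelStats struct {
	NumFiles int64
	Size     int64
}

// renderLevelMetrics writes one sample per level for each stat, using the
// slice index as the "level" label. Metric names are illustrative only.
func renderLevelMetrics(levels []levelStats) string {
	var b strings.Builder
	for i, l := range levels {
		fmt.Fprintf(&b, "storage_lsm_level_files{level=\"%d\"} %d\n", i, l.NumFiles)
		fmt.Fprintf(&b, "storage_lsm_level_bytes{level=\"%d\"} %d\n", i, l.Size)
	}
	return b.String()
}

func main() {
	// Made-up numbers for three levels, just to show the output shape.
	levels := []levelStats{
		{NumFiles: 3, Size: 12 << 20},
		{NumFiles: 0, Size: 0},
		{NumFiles: 41, Size: 900 << 20},
	}
	fmt.Print(renderLevelMetrics(levels))
}
```

Since the number of series is the number of levels times the number of stats, the label adds a small, fixed amount of cardinality per store.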
Here is an example of a metric that can be broken down by level:
This will complement #65277.
Jira issue: CRDB-7558