c2c: gather perf metrics from prometheus #97465
From the logs of a
if err != nil {
	t.L().Errorf("Could not query prom %s", err.Error())
}
metricSnap[name] = sumOverLabel(point, stat.LabelName)
Is it possible to put this summation into the Query like you can in the grafana graph?
Sadly, not with this `collector.CollectPoint()` API. I'll leave a TODO and file an issue.
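For readers following this thread, here is a minimal sketch of what a client-side helper like `sumOverLabel` might look like. The `statPoint` type and the shape of the collector result (label name, then label value, then sample) are assumptions made for illustration; the PR's actual definitions are not shown in this conversation.

```go
package main

// statPoint is a hypothetical stand-in for one sample returned by the
// collector; the real type used in the PR may differ.
type statPoint struct {
	Time  int64
	Value float64
}

// sumOverLabel adds up the values of every series grouped under labelName.
// points is assumed to map label name -> label value -> sample, for example
// "store" -> "1" -> {Value: 10}. An unknown label name yields 0.
func sumOverLabel(points map[string]map[string]statPoint, labelName string) float64 {
	var total float64
	for _, p := range points[labelName] {
		total += p.Value
	}
	return total
}
```

Summing on the client keeps the query string a plain metric name, at the cost of repeating an aggregation that a Grafana panel would express in the query itself.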
c2c roachtest performance metrics are now gathered by a prom/grafana instance running locally on the roachprod cluster. This change allows us to gather and process any metrics exposed to the crdb prom endpoint. Specifically, we now gather `capacity_used`, `replication_logical_bytes`, and `replication_sst_bytes` at various points during the c2c roachtest, allowing us to measure:
- Initial Scan Throughput: initial scan size / initial scan duration
- Workload Throughput: data ingested during workload / workload duration
- Cutover Throughput: (data ingested between cutover time and cutover cmd) / (cutover process duration)

where the size of these operations can be measured as either physical replicated bytes, logical ingested bytes, or physical ingested bytes on the source cluster.

This patch also fixes a recent bug which mislabeled src cluster throughput as initial scan throughput.

Epic: None
This patch streamlines how we remove ru limiting for roachtests that use tenants. For the c2c tests specifically, we now remove the limits on the dst cluster tenant as soon as the replication stream begins.

Release note: None
Epic: none Release note: none
TFTR!

bors r=stevendanna

Build failed (retrying...)

Build succeeded
c2c roachtest performance metrics are now gathered by a prom/grafana instance running locally on the roachprod cluster. This change allows us to gather and process any metrics exposed to the crdb prom endpoint. Specifically, we now gather `capacity_used`, `replication_logical_bytes`, and `replication_sst_bytes` at various points during the c2c roachtest, allowing us to measure:
- Initial Scan Throughput: initial scan size / initial scan duration
- Workload Throughput: data ingested during workload / workload duration
- Cutover Throughput: (data ingested between cutover time and cutover cmd) / (cutover process duration)

where the size of these operations can be measured as either physical replicated bytes, logical ingested bytes, or physical ingested bytes on the source cluster.
This patch also fixes a recent bug which mislabeled src cluster throughput as initial scan throughput.
Informs #89176
Release note: None