Sum values in unwrapped rate aggregation instead of treating them as counter #6361

chaudum · 2022-06-10T08:01:14Z

What this PR does / why we need it:

This PR implements the first part of the RFC described in #6351

It reverts rate() to its previous implementation prior to #5013 That means it calculates the per-second rate from the sum of all extracted values.

Which issue(s) this PR fixes:

#6351

Checklist

Documentation added
Tests updated
Is this an important fix or new feature? Add an entry in the CHANGELOG.md.
Changes that require user attention or interaction to upgrade are documented in docs/sources/upgrading/_index.md

grafanabot · 2022-06-10T08:06:37Z

./tools/diff_coverage.sh ../loki-main/test_results.txt test_results.txt ingester,distributor,querier,querier/queryrange,iter,storage,chunkenc,logql,loki

Change in test coverage per package. Green indicates 0 or positive change, red indicates that test coverage for a package fell.

+           ingester	0%
-        distributor	-0.3%
+            querier	0%
+ querier/queryrange	0%
+               iter	0%
+            storage	0%
+           chunkenc	0%
-              logql	-0.2%
+               loki	0%

grafanabot · 2022-06-10T08:11:02Z

./tools/diff_coverage.sh ../loki-main/test_results.txt test_results.txt ingester,distributor,querier,querier/queryrange,iter,storage,chunkenc,logql,loki

Change in test coverage per package. Green indicates 0 or positive change, red indicates that test coverage for a package fell.

-           ingester	-0.1%
-        distributor	-0.3%
+            querier	0%
+ querier/queryrange	0.1%
+               iter	0%
+            storage	0%
+           chunkenc	0%
-              logql	-0.2%
+               loki	0%

chaudum · 2022-06-10T08:27:02Z

@liguozhong Wanted to give you a heads up because it probably affects you.

liguozhong · 2022-06-10T08:48:16Z

@liguozhong Wanted to give you a heads up because it probably affects you.

Thanks , I got it

DylanGuedes

Few nits.

Btw, do you mind renaming the PR title to something like what you have in the CHANGELOG? In the changelog it is described as "Sum values in unwrapped rate aggregation instead of treating them as counter".

CHANGELOG.md

docs/sources/upgrading/_index.md

chaudum · 2022-06-13T06:08:11Z

Few nits.

Btw, do you mind renaming the PR title to something like what you have in the CHANGELOG? In the changelog it is described as "Sum values in unwrapped rate aggregation instead of treating them as counter".

Let me make two separate pull requests. Then the PR title also matches the changes.

grafanabot · 2022-06-13T08:35:49Z

./tools/diff_coverage.sh ../loki-main/test_results.txt test_results.txt ingester,distributor,querier,querier/queryrange,iter,storage,chunkenc,logql,loki

Change in test coverage per package. Green indicates 0 or positive change, red indicates that test coverage for a package fell.

+           ingester	0%
+        distributor	0%
+            querier	0%
+ querier/queryrange	0%
+               iter	0%
+            storage	0%
+           chunkenc	0%
-              logql	-0.2%
+               loki	0%

DylanGuedes

lgtm! (the jsonnet failure isn't related to your change)

This PR reverts the implementation done in #5013 to the original implementation that sums the extracted values from the log lines instead of treating them like a Prometheus counter metric. Signed-off-by: Christian Haudum <[email protected]>

Signed-off-by: Christian Haudum <[email protected]>

grafanabot · 2022-06-14T15:14:56Z

./tools/diff_coverage.sh ../loki-main/test_results.txt test_results.txt ingester,distributor,querier,querier/queryrange,iter,storage,chunkenc,logql,loki

Change in test coverage per package. Green indicates 0 or positive change, red indicates that test coverage for a package fell.

+           ingester	0%
-        distributor	-0.3%
+            querier	0%
+ querier/queryrange	0%
+               iter	0%
+            storage	0%
+           chunkenc	0%
-              logql	-0.2%
+               loki	0%

trevorwhitney

so, I don't fully understand this PR. for one, why are all the values we assert against in the tests changing? does this actually change computation behavior? was the previous implementation incorrect?

CHANGELOG.md

Signed-off-by: Christian Haudum <[email protected]>

grafanabot · 2022-06-14T15:48:08Z

./tools/diff_coverage.sh ../loki-main/test_results.txt test_results.txt ingester,distributor,querier,querier/queryrange,iter,storage,chunkenc,logql,loki

Change in test coverage per package. Green indicates 0 or positive change, red indicates that test coverage for a package fell.

+           ingester	0%
+        distributor	0%
+            querier	0%
+ querier/queryrange	0%
+               iter	0%
+            storage	0%
+           chunkenc	0%
-              logql	-0.2%
+               loki	0%

ssncferreira · 2022-06-14T16:10:56Z

pkg/logql/engine_test.go

+			// SUM(n=47, 61, 1) = 15
+			// 15 / 30 = 0.5
+			promql.Vector{promql.Sample{Point: promql.Point{T: 60 * 1000, V: 0.5}, Metric: labels.Labels{labels.Label{Name: "app", Value: "foo"}}}},


I'm having some difficulty understanding these tests 😕
What does the SUM(n=47, 61, 1) mean? I assume this is the result of the newSeries method on the data property, but don't understand how these values were calculated 🤔

Regarding the SUM(n=47, 61, 1): it just means the sum of the values from item 47 to item 61 where the value is a constant of 1, like https://www.wolframalpha.com/input?i2d=true&i=Sum%5B1%2C%7Bn%2C47%2C61%7D%5D

Regarding, why 47, 61, and 1, we have to look at the output of the newSeries() function: It creates a stream {app="foo"} with 300 samples starting at timestamp 46e9 (46s) and ending at timestamp 345e9 (345s). The sample value is a constant 1.

So the query now looks at the time range from ts=30s to ts=60s, where the lower bound is not included. There are 15 items (item 47 to item 61) from the generated series that matches.

Hope this helps.

Thank you! 🙏 This helps a lot.
Would it be possible to add this information in the comments to make the tests easier to understand? e.g.

// create a stream {app="foo"} with 300 samples starting at 46s and ending at 345s with a constant value of 1 [][]logproto.Series{ {newSeries(testSize, offset(46, constantValue(1)), `{app="foo"}`)}, }, // query between the time range from ts=30s and ts=60s where the lower bound is not included []SelectSampleParams{ {&logproto.SampleQueryRequest{Start: time.Unix(30, 0), End: time.Unix(60, 0), Selector: `rate({app="foo"} | unwrap foo[30s])`}}, }, // SUM(n=47, 61, 1) = 15 - there are 15 samples (from 47 to 61) matched from the generated series // 15 / 30 = 0.5 promql.Vector{promql.Sample{Point: promql.Point{T: 60 * 1000, V: 0.5}, Metric: labels.Labels{labels.Label{Name: "app", Value: "foo"}}}},

@ssncferreira Addressed your feedback in https://github.com/grafana/loki/pull/6412/files#diff-6f4083532ac476e6ab63b44775eb6356fba146db52e0a19759eff5045b92de2a

KMiller-Grafana

Docs in this PR look good to me.

…counter (#6361) * Revert unwrapped rate aggregation to previous implementation This PR reverts the implementation done in #5013 to the original implementation that sums the extracted values from the log lines instead of treating them like a Prometheus counter metric. Signed-off-by: Christian Haudum <[email protected]> * Move changelog entry Signed-off-by: Christian Haudum <[email protected]> * Remove unused/dead code Signed-off-by: Christian Haudum <[email protected]> * Clean changelog Signed-off-by: Christian Haudum <[email protected]> (cherry picked from commit b315ed0)

…counter (#6361) (#6555) * Revert unwrapped rate aggregation to previous implementation This PR reverts the implementation done in #5013 to the original implementation that sums the extracted values from the log lines instead of treating them like a Prometheus counter metric. Signed-off-by: Christian Haudum <[email protected]> * Move changelog entry Signed-off-by: Christian Haudum <[email protected]> * Remove unused/dead code Signed-off-by: Christian Haudum <[email protected]> * Clean changelog Signed-off-by: Christian Haudum <[email protected]> (cherry picked from commit b315ed0) Co-authored-by: Christian Haudum <[email protected]>

pull-request-size bot added the size/XL label Jun 10, 2022

github-actions bot added the area/docs label Jun 10, 2022

chaudum force-pushed the chaudum/rfc-6351 branch from 2241854 to 64edec9 Compare June 10, 2022 08:04

chaudum mentioned this pull request Jun 10, 2022

Fix panic in instant query splitting when using unwrapped rate #6348

Merged

4 tasks

chaudum marked this pull request as ready for review June 10, 2022 08:17

chaudum requested review from KMiller-Grafana and a team as code owners June 10, 2022 08:17

DylanGuedes reviewed Jun 10, 2022

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

docs/sources/upgrading/_index.md Outdated Show resolved Hide resolved

pull-request-size bot added size/M and removed size/XL labels Jun 13, 2022

chaudum changed the title ~~Revert implementation of unwrapped rate aggregation (RFC #6351)~~ Sum values in unwrapped rate aggregation instead of treating them as counter Jun 13, 2022

chaudum requested a review from DylanGuedes June 13, 2022 06:23

pull-request-size bot added size/L and removed size/M labels Jun 13, 2022

DylanGuedes approved these changes Jun 13, 2022

View reviewed changes

chaudum force-pushed the chaudum/rfc-6351 branch from 5220c7a to 275a1ab Compare June 14, 2022 13:25

chaudum requested a review from slim-bean June 14, 2022 13:25

chaudum added 3 commits June 14, 2022 17:09

Move changelog entry

0a64ae4

Signed-off-by: Christian Haudum <[email protected]>

Remove unused/dead code

a7ab57a

Signed-off-by: Christian Haudum <[email protected]>

chaudum force-pushed the chaudum/rfc-6351 branch from 275a1ab to a7ab57a Compare June 14, 2022 15:10

trevorwhitney reviewed Jun 14, 2022

View reviewed changes

chaudum commented Jun 14, 2022

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

Clean changelog

1b4f2bf

Signed-off-by: Christian Haudum <[email protected]>

ssncferreira reviewed Jun 14, 2022

View reviewed changes

KMiller-Grafana approved these changes Jun 14, 2022

View reviewed changes

owen-d approved these changes Jun 16, 2022

View reviewed changes

owen-d merged commit b315ed0 into main Jun 16, 2022

owen-d deleted the chaudum/rfc-6351 branch June 16, 2022 13:44

ssncferreira added the backport release-2.6.x Tag a PR with this label to create a PR which cherry pics it into the release-2.6.x branch label Jun 30, 2022

grafanabot mentioned this pull request Jun 30, 2022

[release-2.6.x] Sum values in unwrapped rate aggregation instead of treating them as counter #6555

Merged

osg-grafana added type/docs Issues related to technical documentation; the Docs Squad uses this label across many repositories and removed area/docs labels Oct 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sum values in unwrapped rate aggregation instead of treating them as counter #6361

Sum values in unwrapped rate aggregation instead of treating them as counter #6361

chaudum commented Jun 10, 2022 •

edited

Loading

grafanabot commented Jun 10, 2022

grafanabot commented Jun 10, 2022

chaudum commented Jun 10, 2022

liguozhong commented Jun 10, 2022 •

edited

Loading

DylanGuedes left a comment

chaudum commented Jun 13, 2022

grafanabot commented Jun 13, 2022

DylanGuedes left a comment

grafanabot commented Jun 14, 2022

trevorwhitney left a comment

grafanabot commented Jun 14, 2022

ssncferreira Jun 14, 2022

chaudum Jun 14, 2022

ssncferreira Jun 15, 2022

chaudum Jun 16, 2022

KMiller-Grafana left a comment

Sum values in unwrapped rate aggregation instead of treating them as counter #6361

Sum values in unwrapped rate aggregation instead of treating them as counter #6361

Conversation

chaudum commented Jun 10, 2022 • edited Loading

What this PR does / why we need it:

Which issue(s) this PR fixes:

Checklist

grafanabot commented Jun 10, 2022

grafanabot commented Jun 10, 2022

chaudum commented Jun 10, 2022

liguozhong commented Jun 10, 2022 • edited Loading

DylanGuedes left a comment

Choose a reason for hiding this comment

chaudum commented Jun 13, 2022

grafanabot commented Jun 13, 2022

DylanGuedes left a comment

Choose a reason for hiding this comment

grafanabot commented Jun 14, 2022

trevorwhitney left a comment

Choose a reason for hiding this comment

grafanabot commented Jun 14, 2022

ssncferreira Jun 14, 2022

Choose a reason for hiding this comment

chaudum Jun 14, 2022

Choose a reason for hiding this comment

ssncferreira Jun 15, 2022

Choose a reason for hiding this comment

chaudum Jun 16, 2022

Choose a reason for hiding this comment

KMiller-Grafana left a comment

Choose a reason for hiding this comment

chaudum commented Jun 10, 2022 •

edited

Loading

liguozhong commented Jun 10, 2022 •

edited

Loading