Limit statsz updates #5470

wjordan · 2024-05-23T23:53:23Z

This PR adds a filter to `sendStatsz` that limits statsz updates to the current heartbeat interval (at most once per second), adding a `time.Time` field to track when the last statsz update was sent. This limit should reduce overall `STATSZ` system event load in large clusters while still allowing the initial statsz update to reach newly discovered nodes quickly.

Fixes #5469.

Signed-off-by: Will Jordan [email protected]

derekcollison

Looks pretty good, few small comments.

server/events.go

derekcollison

LGTM

derekcollison · 2024-06-04T21:54:07Z

This change seems to be causing some tests to flap since the meta layer for jetstream does not have up to date information in order to complete the test.

e.g. TestJetStreamSuperClusterOverflowPlacement

derekcollison · 2024-06-04T21:59:19Z

Might need to fix these flappers before merging.

We could set the cstatsz to a lower value possibly for those tests.

derekcollison · 2024-06-04T22:02:29Z

Also seems we now see a race.

derekcollison · 2024-06-04T22:16:15Z

TestRoutePings has the datarace now.

wjordan · 2024-06-04T23:08:51Z

The race seems unrelated: neither TestRoutePings nor the adjustPingInterval behavior is touched by this PR.

This test is timing-sensitive and a lower statsz rate limit seems to reduce test flakiness.

derekcollison

LGTM

derekcollison · 2024-06-05T01:56:54Z

The race was because your new test TestClusterSetupMsgs was not shutting down the cluster when done, so the cluster kept running and kept accessing routeMaxPingInterval while a later test modified it. Will clean up.

Also need to make the tweak to statszRateLimit global since other tests are flapping too.

Will take care of it.

wjordan requested a review from a team as a code owner May 23, 2024 23:53

wjordan changed the title ~~Limit statsz updates per client~~ Limit statsz updates May 24, 2024

Limit statsz updates to max once per second

cedc286

wjordan force-pushed the statz_ratelimit branch from 725bbb0 to cedc286 May 24, 2024 05:09

wjordan added 2 commits May 23, 2024 23:06

write lock limitStatsz to fix race

07b9ff0

send requested statsz responses immediately

1a7e60c

derekcollison reviewed Jun 4, 2024

wjordan added 2 commits June 4, 2024 14:27

statszRateLimit constant

63d10a4

add resetLastStatsz helper function

a359aeb

derekcollison approved these changes Jun 4, 2024

reduce statszRateLimit in TestJetStreamSuperClusterOverflowPlacement

1658502

This test is timing-sensitive and a lower statsz rate limit seems to reduce test flakiness.

derekcollison approved these changes Jun 5, 2024

derekcollison merged commit 958b61f into nats-io:main Jun 5, 2024
2 checks passed

wallyqs mentioned this pull request Jun 5, 2024

Cherry picks for v2.10.17 RC.1 #5491

Merged

wallyqs added a commit that referenced this pull request Jun 5, 2024

Cherry picks for v2.10.17 RC.1 (#5491)

57040d7

Includes:
- #5464
- #5472
- #5475
- #5476
- #5481
- #5482
- #5470
- #5485
- #5487
- #5489
