replica_rac2: optimize logTracker.admitted #132140
Conversation
It looks like your PR touches production code but doesn't add or edit any test code. Did you consider adding tests to your PR? 🦉 Hoot! I am Blathers, a bot for CockroachDB. My owner is dev-inf.
@sumeerbhola Tangentially to #132137, in case more optimizations are needed.
This specific change doesn't seem relevant based on looking at profiles. I will look into more allocation reduction, and reduce mutex acquisitions and measure again.
Reviewable status: complete! 0 of 0 LGTMs obtained
It still separates the concerns a bit; should we merge? The slight micro-cost reduction is an optional bonus here.
Reviewed 1 of 2 files at r1.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @pav-kv)
pkg/kv/kvserver/kvflowcontrol/replica_rac2/log_tracker.go
line 42 at r1 (raw file):
```go
// admitted returns the current admitted vector, and resets the dirty bit.
func (l *logTracker) admitted() rac2.AdmittedVector {
```
nit: prefer `admittedAndResetDirty` or something akin.
pkg/kv/kvserver/kvflowcontrol/replica_rac2/log_tracker.go
line 56 at r1 (raw file):
```go
// allows the next logAdmitted call to return true and allow scheduling a Ready
// iteration again. This flow avoids unnecessary Ready scheduling events.
func (l *logTracker) admittedDirty() (av rac2.AdmittedVector, dirty bool) {
```
nit: prefer `admittedIfDirtyAndResetDirty` or something akin.
Even if it is not fully self-explanatory, it causes readers to read the method comment.
pkg/kv/kvserver/kvflowcontrol/replica_rac2/log_tracker.go
line 59 at r1 (raw file):
```go
l.Lock()
defer l.Unlock()
l.scheduled = false
```
The thing I worry about with optimizations that don't show up in profiles is that I can't judge the value of a branch reduction, given out-of-order speculative execution in processors.
I don't generally have a good mental model here. https://johnnysswlab.com/how-branches-influence-the-performance-of-your-code-and-what-can-you-do-about-it/ is a decent article and concludes that one shouldn't try to reduce branches, and it is more important to be cache friendly. Unconditional writes violate the latter.
So if one were trying to micro-optimize this, I would suggest changing this to

```go
if l.scheduled {
	l.scheduled = false
}
```

Same for the `l.dirty = false` in the `admitted()` method.
pkg/kv/kvserver/kvflowcontrol/replica_rac2/log_tracker.go
line 61 at r1 (raw file):
```go
l.scheduled = false
if !l.dirty {
	return
```
nit: I believe the preference in CRDB code is to always do an explicit return with the values being returned.
Force-pushed from ad4cef4 to df4b52e.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @sumeerbhola)
pkg/kv/kvserver/kvflowcontrol/replica_rac2/log_tracker.go
line 42 at r1 (raw file):
Previously, sumeerbhola wrote…
nit: prefer `admittedAndResetDirty` or something akin.
Not relevant in the new version of this PR.
pkg/kv/kvserver/kvflowcontrol/replica_rac2/log_tracker.go
line 56 at r1 (raw file):
Previously, sumeerbhola wrote…
nit: prefer `admittedIfDirtyAndResetDirty` or something akin. Even if it is not fully self-explanatory, it causes readers to read the method comment.
Not relevant in the new version of this PR.
pkg/kv/kvserver/kvflowcontrol/replica_rac2/log_tracker.go
line 59 at r1 (raw file):
Previously, sumeerbhola wrote…
The thing I worry about with optimizations that don't show up in profiles is that I can't judge the value of a branch reduction, given out-of-order speculative execution in processors.
I don't generally have a good mental model here. https://johnnysswlab.com/how-branches-influence-the-performance-of-your-code-and-what-can-you-do-about-it/ is a decent article and concludes that one shouldn't try to reduce branches, and it is more important to be cache friendly. Unconditional writes violate the latter.
So if one were trying to micro-optimize this, I would suggest changing this to
```go
if l.scheduled {
	l.scheduled = false
}
```
Same for the `l.dirty = false` in the `admitted()` method.
Done. Since both methods would be doing these conditionals anyway, I merged them back into one method. Instead, I am now caching the latest admitted vector (see the new `av` field). The idea is to avoid computing it on every call, and instead do it only if `dirty == true` (since we're checking this bit anyway).
Improve cache friendliness by checking the dirty and scheduled bits before resetting them. Cache the latest admitted vector to avoid computing it on every call; it is recomputed only if the dirty bit is true.

Epic: none
Release note: none
Force-pushed from df4b52e to 4483d3a.
Reviewed 2 of 2 files at r2, all commit messages.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @pav-kv)
TFTR! bors r=sumeerbhola
Improve cache friendliness by checking the dirty and scheduled bits before resetting them. Cache the latest admitted vector to avoid computing it on every call; it is recomputed only if the dirty bit is true.

Informs #128033