-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ttljob: fix range decoding bug #78163
Conversation
Previously, when a range was split by a prefix of a PRIMARY KEY, the TTL job would fail. This PR rectifies that. Release note: None
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed 4 of 4 files at r1, all commit messages.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @otan)
thanks bors r=rafiss |
Build failed (retrying...): |
Build failed (retrying...): |
Build failed: |
bors r=rafiss |
77588: kv/bulk: split/scatter after range says it is full r=dt a=dt Previously we would split-and-scatter after adding "filling" a range so that as we continued to fill, we would do so to a new range that was first scattered to a new node. We did this once we had written 48MB to one range, as determined by not hitting a split while sending that many bytes during a single flush of the above buffer. 48MB was an OK default when the default range size was 64MB, but has long since been far below what we now consider full. This was a minor annoyance (causing merges) but calling split-and-scatter so much more than is needed has recently become a more severe issue now that ~every scatter call actually moves a range and blocks on that snapshot. Even when 48MB was a reasonable fraction of the 64MB default, this was just a rough approximation for full that failed to account for the actual range's configured size and only counted data flushed since the last explicit Flush(). Instead, this change replaces this "split when i've sent enough" logic with a new approach that lets the range itself reply to each SST with its remaining capacity and end key. The caller can then use this to determine if there is room to send another file to that range or if it should consider it full, and split if/when another key that would be added to that ranges span is added to the batch. This significantly reduces how often ingestion stops to wait on scatter of a split span with the recently updated scatter behavior, more than doubling observed speeds of bulk ingestion. Release justification: high impact fix to new/modified functionality. Release note (performance improvement): Ranges are split and rebalanced during bulk ingestion only when they become full, reducing excessive splits and subsequent merging and associated rebalancing-related performance impact. 78022: sql/schemachanger: set non-cancelable property on jobs correctly r=ajwerner a=ajwerner This PR sets the non-cancelability of jobs at the correct moment so that there is not a phase where the job is cancelable but the state implies that the job should not be reverted. It also propagates this state correctly to descriptors so that it can be correctly synthesized during a restore. Backport will address #76441. Release note: None 78163: ttljob: fix range decoding bug r=rafiss a=otan Previously, when a range was split by a prefix of a PRIMARY KEY, the TTL job would fail. This PR rectifies that. Resolves #78162 Release note: None Co-authored-by: David Taylor <[email protected]> Co-authored-by: Andrew Werner <[email protected]> Co-authored-by: Oliver Tan <[email protected]>
Build failed (retrying...): |
Build failed (retrying...): |
Build succeeded: |
Previously, when a range was split by a prefix of a PRIMARY KEY, the TTL
job would fail. This PR rectifies that.
Resolves #78162
Release note: None