-
Notifications
You must be signed in to change notification settings - Fork 5.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
lightning: split and scatter regions in batches (#33625) #34258
lightning: split and scatter regions in batches (#33625) #34258
Conversation
Signed-off-by: ti-srebot <[email protected]>
[REVIEW NOTIFICATION] This pull request has been approved by:
To complete the pull request process, please ask the reviewers in the list to review by filling The full list of commands accepted by this bot can be found here. Reviewer can indicate their review by submitting an approval review. |
/run-all-tests |
@gozssky you're already a collaborator in bot's repo. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
/merge |
This pull request has been accepted and is ready to merge. Commit hash: b2449a1
|
/merge |
This pull request has been accepted and is ready to merge. Commit hash: dafee54
|
Code Coverage Details: https://codecov.io/github/pingcap/tidb/commit/dafee545eb4c1a50c384efb5c0a6e13b6e4926a5 |
cherry-pick #33625 to release-5.4
You can switch your code base to this Pull Request by using git-extras:
# In tidb repo: git pr https://github.com/pingcap/tidb/pull/34258
After apply modifications, you can push your change to this PR via:
What problem does this PR solve?
Issue Number: close #33618
Problem Summary:
Region scatter may timeout when split and scatter many regions at a time.
What is changed and how it works?
Add a batch limit for split and scatter regions. In a batch, lightning processes up to 4096 ranges. After all ranges in batch have been processed completely, lightning can process the next batch.
Check List
Tests
I prepared about 1.8T data for a table which contains only one clustered primary key. TiDB cluster have ten TiKV nodes, each node have 200% cpu limit and 16GiB memory limit. Then I used lightning to import these data to TiDB cluster and observed the scatter operator status in grafana dashboard. To reproduce huge amount of regions, I set
region-split-size
to 8MiB. Finally, about 260k regions were created.Master
Many scatter operator is timeout.
This PR
There is no operator timeout after this PR.
Release note