Optimization schema memory utilization of ColumnFile in instance level #6589

hongyunyan
Contributor

@hongyunyan hongyunyan commented Jan 5, 2023

What problem does this PR solve?

Issue Number: ref #6401

Problem Summary:

What is changed and how it works?

  1. Add a new class ColumnFileSchema to store schema information together for ColumnFileInMemory and ColumnFileTiny.
  2. Share a single ColumnFileSchema among all ColumnFileInMemory and ColumnFileTiny instances that have the same schema.

In performance tests, this optimization reduces static memory usage by 10%-30%, depending on the number of segments in a table.
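
The sharing described in item 2 can be sketched as follows. This is a minimal illustration under stated assumptions, not the PR's implementation: `SchemaSketch`, `SchemaRegistrySketch`, `getOrCreate`, and `makeKey` are hypothetical names, and the string key is a plain concatenation standing in for whatever digest the PR actually computes.

```cpp
#include <map>
#include <memory>
#include <mutex>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for the schema payload; the real ColumnFileSchema
// may hold more than column names and types.
struct SchemaSketch
{
    std::vector<std::pair<std::string, std::string>> columns; // (name, type)
};
using SchemaSketchPtr = std::shared_ptr<const SchemaSketch>;
using ColumnList = std::vector<std::pair<std::string, std::string>>;

// Hypothetical registry that hands out one live object per distinct schema.
class SchemaRegistrySketch
{
public:
    SchemaSketchPtr getOrCreate(const ColumnList & columns)
    {
        const std::string key = makeKey(columns);
        std::lock_guard<std::mutex> lock(mutex);
        auto it = schemas.find(key);
        if (it != schemas.end())
        {
            // Reuse the existing object if some column file still owns it.
            if (auto existing = it->second.lock())
                return existing;
        }
        // Not found, or the previous object has expired: create a new one.
        SchemaSketchPtr created = std::make_shared<SchemaSketch>(SchemaSketch{columns});
        schemas[key] = created;
        return created;
    }

private:
    // Assumption: a concatenated string replaces the digest used in the PR.
    static std::string makeKey(const ColumnList & columns)
    {
        std::string key;
        for (const auto & column : columns)
            key += column.first + ":" + column.second + ";";
        return key;
    }

    std::mutex mutex;
    // Weak references only, so the registry never keeps an unused schema alive.
    std::map<std::string, std::weak_ptr<const SchemaSketch>> schemas;
};
```

With this shape, two column files created from identical column lists receive the same pointer, so the schema is stored once regardless of how many column files refer to it.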

Check List

Tests

  • Unit test
  • Integration test
  • Manual test (add detailed scripts or steps below)
  • No code

Side effects

  • Performance regression: Consumes more CPU
  • Performance regression: Consumes more Memory
  • Breaking backward compatibility

Documentation

  • Affects user behaviors
  • Contains syntax changes
  • Contains variable changes
  • Contains experimental features
  • Changes MySQL compatibility

Release note

Reduce memory usage by up to 30% when there is no query load

@ti-chi-bot
Member

ti-chi-bot commented Jan 5, 2023

[REVIEW NOTIFICATION]

This pull request has been approved by:

  • JaySon-Huang
  • breezewish

To complete the pull request process, please ask the reviewers in the list to review by filling /cc @reviewer in the comment.
After your PR has acquired the required number of LGTMs, you can assign this pull request to the committer in the list by filling /assign @committer in the comment to help you merge this pull request.

The full list of commands accepted by this bot can be found here.

Reviewer can indicate their review by submitting an approval review.
Reviewer can cancel approval by submitting a request changes review.

@ti-chi-bot ti-chi-bot added release-note-none Denotes a PR that doesn't merit a release note. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Jan 5, 2023
@hongyunyan
Contributor Author

/run-all-tests

}
return true;
},
false);
Contributor

Suggested change:
- false);
+ /*multi*/false, /*interval_ms*/ 60000);

We could make it run less frequently, e.g. every 60 seconds instead of the default 10 seconds.

Contributor Author

Yes, I agree it doesn't need to run every 10 seconds, but I'm not sure what an appropriate interval would be.
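
One way to reason about the interval is sketched below. The excerpt above does not show what the periodic task does, so this assumes it sweeps a cache of weak references; `CachedSchemaSketch`, `ExpiringCacheSketch`, `put`, and `removeExpired` are hypothetical names, not identifiers from the PR.

```cpp
#include <cstddef>
#include <map>
#include <memory>
#include <mutex>
#include <string>

// Hypothetical payload type; its contents do not matter for the sweep.
struct CachedSchemaSketch
{
};

// Hypothetical cache holding weak references keyed by a digest-like string.
class ExpiringCacheSketch
{
public:
    void put(const std::string & digest, const std::shared_ptr<CachedSchemaSketch> & schema)
    {
        std::lock_guard<std::mutex> lock(mutex);
        entries[digest] = schema;
    }

    // Assumed body of the periodic task: drop entries whose object is gone.
    // Returns the number of entries removed in this pass.
    std::size_t removeExpired()
    {
        std::lock_guard<std::mutex> lock(mutex);
        std::size_t removed = 0;
        for (auto it = entries.begin(); it != entries.end();)
        {
            if (it->second.expired())
            {
                it = entries.erase(it);
                ++removed;
            }
            else
            {
                ++it;
            }
        }
        return removed;
    }

    std::size_t size() const
    {
        std::lock_guard<std::mutex> lock(mutex);
        return entries.size();
    }

private:
    mutable std::mutex mutex;
    std::map<std::string, std::weak_ptr<CachedSchemaSketch>> entries;
};
```

Under that assumption, an expired entry costs only a map node until the next sweep, while each sweep walks the whole map under the lock. A longer interval therefore trades a little retained bookkeeping for less lock contention.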

Contributor

@JaySon-Huang JaySon-Huang left a comment

Rest LGTM

dbms/src/Storages/DeltaMerge/ColumnFile/ColumnFileSchema.h (outdated, resolved)
@ti-chi-bot ti-chi-bot added the status/LGT1 Indicates that a PR has LGTM 1. label Feb 6, 2023
Member

@breezewish breezewish left a comment

The rest LGTM

dbms/src/Storages/DeltaMerge/ColumnFile/ColumnFileTiny.cpp (outdated, resolved)
dbms/src/Storages/DeltaMerge/ColumnFile/ColumnFile_V2.cpp (outdated, resolved)
dbms/src/Storages/DeltaMerge/Delta/MemTableSet.cpp (outdated, resolved)
@hongyunyan
Contributor Author

/run-all-tests

Member

@breezewish breezewish left a comment

LGTM

Some minor suggestions

dbms/src/Storages/DeltaMerge/ColumnFile/ColumnFileSchema.h (outdated, resolved)
dbms/src/Storages/DeltaMerge/ColumnFile/ColumnFileSchema.h (outdated, resolved)
Comment on lines 127 to 134
    ColumnFileSchemaPtr find(const Digest & digest)
    {
        std::lock_guard<std::mutex> lock(mutex);
        auto it = column_file_schemas.find(digest);
        if (it == column_file_schemas.end())
            return nullptr;
        return it->second.lock();
    }
Member

This interface no longer seems to be needed.

Contributor Author

It is still used in StorageDeltaMerge.cpp:L966.

@ti-chi-bot ti-chi-bot added status/LGT2 Indicates that a PR has LGTM 2. and removed status/LGT1 Indicates that a PR has LGTM 1. labels Feb 6, 2023
@hongyunyan
Contributor Author

/run-all-tests

@hongyunyan
Contributor Author

/merge

@ti-chi-bot
Member

@hongyunyan: It seems you want to merge this PR, I will help you trigger all the tests:

/run-all-tests

You only need to trigger /merge once, and if the CI test fails, you just re-trigger the test that failed and the bot will merge the PR for you after the CI passes.

If you have any questions about the PR merge process, please refer to pr process.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@ti-chi-bot
Member

This pull request has been accepted and is ready to merge.

Commit hash: 766e29a

@ti-chi-bot ti-chi-bot added the status/can-merge Indicates a PR has been approved by a committer. label Feb 7, 2023
@ti-chi-bot ti-chi-bot merged commit f992ced into pingcap:master Feb 7, 2023
@ti-chi-bot ti-chi-bot added release-note Denotes a PR that will be considered when it comes time to generate release notes. and removed release-note-none Denotes a PR that doesn't merit a release note. labels Feb 7, 2023
ywqzzy pushed a commit to ywqzzy/tiflash_1 that referenced this pull request Feb 13, 2023
@JaySon-Huang JaySon-Huang mentioned this pull request Feb 13, 2023