Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Exchange receiver decode optimization to do squashing work at the same time #6202

Merged
merged 14 commits into from
Nov 2, 2022

Conversation

yibin87
Copy link
Contributor

@yibin87 yibin87 commented Oct 28, 2022

What problem does this PR solve?

Issue Number: close #6157

Problem Summary:

What is changed and how it works?

Previously, TiFlash adds a Squash transform after ExchangeReceiver, because the block output by ExchangeReceiver might be too small and not efficient to handle. However, the two-stage solution would introduce too many memory operations, like column allocations and de-allocations. So in this PR, the one-stage solution is provided.

For a simple local test, with 10 integer columns, 1024 rows per block, performance improves from: 140ms => 60ms.

Check List

Tests

  • Unit test
  • Integration test
  • Manual test (add detailed scripts or steps below)
  • No code

Side effects

  • Performance regression: Consumes more CPU
  • Performance regression: Consumes more Memory
  • Breaking backward compatibility

Documentation

  • Affects user behaviors
  • Contains syntax changes
  • Contains variable changes
  • Contains experimental features
  • Changes MySQL compatibility

Release note

None

@ti-chi-bot
Copy link
Member

ti-chi-bot commented Oct 28, 2022

[REVIEW NOTIFICATION]

This pull request has been approved by:

  • SeaRise
  • windtalker

To complete the pull request process, please ask the reviewers in the list to review by filling /cc @reviewer in the comment.
After your PR has acquired the required number of LGTMs, you can assign this pull request to the committer in the list by filling /assign @committer in the comment to help you merge this pull request.

The full list of commands accepted by this bot can be found here.

Reviewer can indicate their review by submitting an approval review.
Reviewer can cancel approval by submitting a request changes review.

@ti-chi-bot ti-chi-bot added release-note-none Denotes a PR that doesn't merit a release note. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels Oct 28, 2022
@fuzhe1989
Copy link
Contributor

Fine-grained partitioning will exacerbate the chunk fragmentation in exchange and would benefit from this pr much more from normal cases.

@yibin87
Copy link
Contributor Author

yibin87 commented Oct 28, 2022

Fine-grained partitioning will exacerbate the chunk fragmentation in exchange and would benefit from this pr much more from normal cases.

Yeah, and for large-scale clusters, it would benefit also.

dbms/src/Flash/Mpp/ExchangeReceiver.cpp Outdated Show resolved Hide resolved
dbms/src/DataStreams/TiRemoteBlockInputStream.h Outdated Show resolved Hide resolved
dbms/src/DataStreams/TiRemoteBlockInputStream.h Outdated Show resolved Hide resolved
dbms/src/Flash/Mpp/ExchangeReceiver.cpp Outdated Show resolved Hide resolved
codec.readColumnMeta(i, istr, column);

/// Data
MutableColumnPtr read_column = column.type->createColumn();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like read_column is not used?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah, removed it now.


#include <Flash/Coprocessor/StreamingDAGResponseWriter.cpp>
#include <Flash/Mpp/BroadcastOrPassThroughWriter.cpp>
#include <Flash/Mpp/ExchangeReceiver.cpp>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

format the includes

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

dbms/src/Flash/Mpp/ExchangeReceiver.cpp Outdated Show resolved Hide resolved
@yibin87 yibin87 requested review from windtalker and SeaRise November 1, 2022 08:29
@yibin87
Copy link
Contributor Author

yibin87 commented Nov 1, 2022

/run-unit-tests

Signed-off-by: yibin <[email protected]>
Signed-off-by: yibin <[email protected]>
@yibin87 yibin87 requested a review from SeaRise November 2, 2022 02:23
@yibin87
Copy link
Contributor Author

yibin87 commented Nov 2, 2022

/run-unit-tests

@yibin87
Copy link
Contributor Author

yibin87 commented Nov 2, 2022

/run-integration-test

/// Data
MutableColumnPtr read_column = column.type->createColumn();
if (reserve_size > 0)
read_column->reserve(reserve_size);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

reserve rows if reserve_size <= 0 and if reserve_size >0, reserve std::max(rows, reserve_size) ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Make sense, Done.

@yibin87 yibin87 requested a review from windtalker November 2, 2022 04:40
Signed-off-by: yibin <[email protected]>
Signed-off-by: yibin <[email protected]>
@yibin87 yibin87 requested a review from SeaRise November 2, 2022 07:00
Copy link
Contributor

@SeaRise SeaRise left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

others LGTM

dbms/src/Flash/Coprocessor/CodecUtils.cpp Outdated Show resolved Hide resolved
dbms/src/Flash/Coprocessor/CodecUtils.cpp Outdated Show resolved Hide resolved
Signed-off-by: yibin <[email protected]>
Copy link
Contributor

@SeaRise SeaRise left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@ti-chi-bot ti-chi-bot added the status/LGT1 Indicates that a PR has LGTM 1. label Nov 2, 2022
Copy link
Contributor

@windtalker windtalker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@ti-chi-bot ti-chi-bot added status/LGT2 Indicates that a PR has LGTM 2. and removed status/LGT1 Indicates that a PR has LGTM 1. labels Nov 2, 2022
@yibin87
Copy link
Contributor Author

yibin87 commented Nov 2, 2022

/merge

@ti-chi-bot
Copy link
Member

@yibin87: It seems you want to merge this PR, I will help you trigger all the tests:

/run-all-tests

You only need to trigger /merge once, and if the CI test fails, you just re-trigger the test that failed and the bot will merge the PR for you after the CI passes.

If you have any questions about the PR merge process, please refer to pr process.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@ti-chi-bot
Copy link
Member

This pull request has been accepted and is ready to merge.

Commit hash: e8594cf

@ti-chi-bot ti-chi-bot added the status/can-merge Indicates a PR has been approved by a committer. label Nov 2, 2022
@ti-chi-bot ti-chi-bot merged commit dc28b51 into pingcap:master Nov 2, 2022
@JasonWu0506
Copy link

/type ehencement

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented May 30, 2023

@JasonWu0506: The label(s) type/ehencement cannot be applied, because the repository doesn't have them.

In response to this:

/type ehencement

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@JasonWu0506
Copy link

/type enhencement

@ti-chi-bot
Copy link
Contributor

ti-chi-bot bot commented May 30, 2023

@JasonWu0506: The label(s) type/enhencement cannot be applied, because the repository doesn't have them.

In response to this:

/type enhencement

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
release-note-none Denotes a PR that doesn't merit a release note. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. status/can-merge Indicates a PR has been approved by a committer. status/LGT2 Indicates that a PR has LGTM 2.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Join & Aggregation Fine Grained Partition Optimization
6 participants