
Improve speed of nonconsensus data removal #2717

Merged
2 commits merged on Sep 23, 2019

Conversation

pasqu4le
Contributor

Motivation

The recently introduced removal of non-consensus block data is not efficient enough to keep pace with block creation.

Changelog

Enhancements

This PR changes the logic to remove only the data of blocks that lose consensus because they are being overridden (as opposed to also removing data of blocks that are invalid neighbors) and reworks the queries that are performed.

Checklist for your PR

Problem: removal of nonconsensus data is too inefficient and, as a result, blocks are imported too slowly.

Solution: reformulation of the deletion logic for better performance.
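
For illustration, a minimal Ecto sketch of the reworked idea, not the actual Explorer.Chain.Import.Runner.Blocks code; the DeletionSketch module, the repo/forked_transaction_hashes arguments and the remove_nonconsensus_token_transfers/2 helper are assumptions for this example:

```elixir
# Hedged sketch only, not the code merged in this PR.
defmodule DeletionSketch do
  import Ecto.Query, only: [from: 2]

  # `forked_transaction_hashes` is assumed to be the list of transaction hashes
  # produced by the derive_transaction_forks step, i.e. the transactions of the
  # blocks that lost consensus because new blocks override them.
  def remove_nonconsensus_token_transfers(repo, forked_transaction_hashes) do
    query =
      from(tt in Explorer.Chain.TokenTransfer,
        where: tt.transaction_hash in ^forked_transaction_hashes
      )

    # Delete only the rows tied to the forked transactions instead of scanning
    # whole block ranges that would also pull in invalid neighbours.
    {count, _} = repo.delete_all(query)
    {:ok, count}
  end
end
```

Each of the other remove_nonconsensus_xxx steps would follow the same pattern against its own table.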
@pasqu4le pasqu4le self-assigned this Sep 20, 2019
@pasqu4le pasqu4le added the ready for review This PR is ready for reviews. label Sep 20, 2019
@coveralls

Pull Request Test Coverage Report for Build 16282840-fb5b-4d95-9aba-e9af732f0d66

  • 18 of 22 (81.82%) changed or added relevant lines in 1 file are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage decreased (-0.03%) to 78.506%

Changes Missing Coverage:
apps/explorer/lib/explorer/chain/import/runner/blocks.ex: 18 of 22 changed/added lines covered (81.82%)

Totals Coverage Status
Change from base Build d94c369a-8b26-40e5-96e1-4626be9542eb: -0.03%
Covered Lines: 5245
Relevant Lines: 6681

💛 - Coveralls

{_, result} =
acquire_query =
  from(
    block in where_invalid_neighbour(changes_list),
ayrat555
Contributor

The PR's description says:

This PR changes the logic to remove only the data of blocks that lose consensus because they are being overridden (as opposed to also removing data of blocks that are invalid **neighbors**)

It looks like it also removes data from invalid neighbours, because they are selected with where_invalid_neighbour(changes_list).

I see an optimisation in using block_hash values for selecting transactions instead of block_number values, because we have a DB index on block_hash in the transactions table.
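
For example, the difference in query shape is roughly the following (a rough sketch; the *_query names and the nonconsensus_block_* bindings are assumptions for illustration):

```elixir
import Ecto.Query, only: [from: 2]

# Filtering by block_number: without a usable index on transactions.block_number
# this can degrade into scanning the transactions table.
by_number_query =
  from(t in Explorer.Chain.Transaction,
    where: t.block_number in ^nonconsensus_block_numbers
  )

# Filtering by block_hash: can use the existing DB index on
# transactions.block_hash, so the lookup stays cheap.
by_hash_query =
  from(t in Explorer.Chain.Transaction,
    where: t.block_hash in ^nonconsensus_block_hashes
  )
```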

pasqu4le
Contributor Author

@ayrat555 yes, the invalid neighbor blocks are still removed; however, if you look at each remove_nonconsensus_xxx step you'll find that they now take the transaction hashes from the result of derive_transaction_forks, in other words all the hashes of the transactions that have been forked.

What I figured is that, in fact, we do not need to fork transactions for invalid neighbors, because those will be handled later anyway (the neighbor loses consensus > gets refetched > the new block is inserted and the old transactions get forked), and we can do the same for the rest of the nonconsensus data.
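
A hedged sketch of that wiring, assuming an Ecto.Multi pipeline; the derive_transaction_forks/2 signature, the step names and the changes_list binding are illustrative rather than the exact merged code:

```elixir
import Ecto.Query, only: [from: 2]
alias Ecto.Multi
alias Explorer.Chain.TokenTransfer

multi =
  Multi.new()
  # Assumed to collect the hashes of the transactions that were forked, i.e.
  # the transactions of the blocks being overridden by blocks in changes_list.
  |> Multi.run(:derive_transaction_forks, fn repo, _changes ->
    {:ok, derive_transaction_forks(repo, changes_list)}
  end)
  # Each remove_nonconsensus_xxx step reuses those hashes instead of
  # re-selecting nonconsensus data by block number.
  |> Multi.run(:remove_nonconsensus_token_transfers, fn repo, %{derive_transaction_forks: hashes} ->
    {:ok, repo.delete_all(from(tt in TokenTransfer, where: tt.transaction_hash in ^hashes))}
  end)
```

Invalid-neighbor blocks are simply left to the refetch cycle described above, so no extra deletion queries run for them.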

Labels
ready for review This PR is ready for reviews.
4 participants