
check: set lastBlockTime in PrepareRequest handler, fix #55 (#56)

Merged 1 commit into master from move-lastblock-time on Jan 10, 2025

Conversation

roman-khimov
Member

It's the earliest point; if a view change happens the timer is reset (because
the new view comes with a new set of transactions, potentially picking up ones
received between views).

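A minimal sketch of the idea (illustrative Go only; the type, field and method names below are assumptions, not the actual dbft API): remember the time of the PrepareRequest as the start of the current block interval, so the timer for the next block counts from the earliest possible point, and drop that reference on a view change because the new view brings its own PrepareRequest.

```go
package consensus

import "time"

// context is a hypothetical consensus context used only to illustrate the
// change described above.
type context struct {
	lastBlockTime time.Time     // start of the current block interval
	blockInterval time.Duration // target time between blocks
}

// onPrepareRequest is invoked once a valid PrepareRequest is accepted.
// This is the earliest point at which the block being built is known,
// so the next block's timer can be measured from here.
func (c *context) onPrepareRequest(now time.Time) {
	c.lastBlockTime = now
}

// onChangeView clears the reference point: the new view comes with a new
// PrepareRequest (potentially picking up transactions received between
// views), and that request will set lastBlockTime again.
func (c *context) onChangeView() {
	c.lastBlockTime = time.Time{}
}

// nextBlockDelay returns how long to wait before starting the next block.
func (c *context) nextBlockDelay(now time.Time) time.Duration {
	if c.lastBlockTime.IsZero() {
		return c.blockInterval
	}
	if elapsed := now.Sub(c.lastBlockTime); elapsed < c.blockInterval {
		return c.blockInterval - elapsed
	}
	return 0
}
```
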
@roman-khimov
Member Author

Four nodes, 1000 TPS:
[chart: ms_per_block_single]
Same with 200ms network delay:
[chart: ms_per_block_single]

@roman-khimov
Member Author

Max performance for four nodes with 5s block time:
[chart: tps_single]
[chart: ms_per_block_single]

With 200ms delay:
[chart: tps_single]
[chart: ms_per_block_single]

I'm not entirely sure why, but it does affect TPS, and not in a good way (hence this is a draft).

@roman-khimov
Member Author

Two-second block time with a 128K memory pool:
[chart: tps_single]
[chart: ms_per_block_single]

@roman-khimov
Member Author

Rebased.


codecov bot commented Jun 13, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 58.31%. Comparing base (b2ba0cd) to head (b3c1c3b).
Report is 2 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master      #56      +/-   ##
==========================================
+ Coverage   58.21%   58.31%   +0.09%     
==========================================
  Files          32       32              
  Lines        2257     2262       +5     
==========================================
+ Hits         1314     1319       +5     
  Misses        859      859              
  Partials       84       84              


@roman-khimov
Member Author

As we remember from #56 (comment), this patch works perfectly in the case of constant low/moderate load; the difference is rather obvious and for the better. Let's concentrate on the less positive side of it that was left as "unknown" three years ago.

All tests below use a 50K memory pool (and are limited by it in many cases), BoltDB as the DB, and 30 workers pushing as many transactions as they can.

Starting with the simple case, a single node:
[chart: ms_per_block_single_1s]

It's obvious that we have zero communication overhead in this case, but one can notice that the patched version still shaves some milliseconds off the block time, leading to slightly better average TPS results (even if some blocks drop below the line, which can happen from test to test irrespective of this patch):
[chart: tps_single_1s]

Moving on to four nodes, 5s block time and no delays:
[chart: ms_per_block_4_nodes_5s_no_delay]
[chart: tps_4_nodes_5s_no_delay]
This network is obviously limited by the memory pool TPS-wise, and the patch effect is hardly noticeable: 5s is plenty of time, the network is perfectly synchronized wrt block contents and always pushes 50K transactions in each block:
[chart: tpb_4_nodes_5s_no_delay]

To make things more interesting, we can drop the block time to 1s:
[chart: ms_per_block_4_nodes_1s_no_delay]

In this case nodes aren't perfectly synchronized and can't consume 50K transactions in one second, which leads to a lower number of transactions in each block:
[chart: tpb_4_nodes_1s_no_delay]

But notice that the block time is more stable here and TPS is actually better:
[chart: tps_4_nodes_1s_no_delay]

This is connected to the more stable block time when the memory pool has some spare slots. Now to more complex things: we add a 200ms delay and run with 5s block time:
[chart: ms_per_block_4_nodes_5s_delay_200ms]

It's rather obvious that block times are much better here (as expected, even if we forget about CV), but the TPS picture is similar to the one in #56 (comment):
[chart: tps_4_nodes_5s_delay_200ms]

There is a degradation, and we can clearly see the tick-tock pattern in TPS, which directly follows from the tick-tock in TPB:
[chart: tpb_4_nodes_5s_delay_200ms]

And now I can finally explain this. This network has problems with transaction propagation and consensus. In general it can consume ~50K transactions in 5s, but accepting a block takes time, so once a 50K block is flushed the nodes have empty memory pools and are adjusting their timers for the consensus delay, leaving very little time for the next block. When the time comes to make that block, the pools don't hold many transactions, so we end up with a nearly empty block; the next one then has enough overall time to fill the memory pool up again, it's flushed with 50K transactions and the cycle repeats. It's not the first time we see this pattern in various cases, but here the longer block time of the non-patched version simply allows collecting more transactions. So we have a clear trade-off between average TPS and latency (since blocks are delayed for longer without this patch).

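To illustrate the cycle, here's a toy model (all numbers are assumptions picked for illustration, not measurements): the pool refills at a constant rate, a full 50K block eats most of the following interval in consensus/acceptance overhead, and a small block doesn't.

```go
package main

import "fmt"

func main() {
	const (
		poolCap  = 50_000   // mempool capacity
		inflow   = 10_000.0 // assumed transactions arriving per second
		interval = 5.0      // target block time, seconds
		overhead = 4.5      // assumed time to accept/propagate a full 50K block, seconds
	)

	timeToFill := interval // time left to refill the pool before the next block
	for i := 1; i <= 6; i++ {
		pool := inflow * timeToFill
		if pool > poolCap {
			pool = poolCap
		}
		fmt.Printf("block %d: %6.0f transactions\n", i, pool)
		if pool >= poolCap {
			// A full block takes long to accept; the adjusted timer leaves
			// little time to refill the pool before the next proposal.
			timeToFill = interval - overhead
		} else {
			timeToFill = interval
		}
	}
}
```

Running it prints alternating full (50000) and nearly empty (5000) blocks, which is exactly the tick-tock shape of the TPB chart above.
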
But this is a memory pool limit effect again; to avoid it we can drop the block time to 1s (with the same 200ms delay):
[chart: ms_per_block_4_nodes_1s_delay_200ms]
Consensus takes a lot of time here because nodes are not well-synchronized: they're actively exchanging transactions while a new block is being proposed, and some nodes have to ask others about some transactions. Both master and the patched version are far from the 1s target time. But here is what happens to average TPS:
[chart: tps_4_nodes_1s_delay_200ms]

It's a bit better and more stable. And that happens because this time it's master that is doing the tick-tock:
[chart: tpb_4_nodes_1s_delay_200ms]

So overall I can say that:

  • this patch is safe
  • it is beneficial for block time in all cases, especially when we're dealing with well-synchronized networks (constant low/medium transaction load)
  • it's good TPS-wise for networks where the mempool is not constantly saturated
  • it can degrade TPS in some specific cases of mempool saturation, but the same degradation happens to the unpatched version in different scenarios, and it's not catastrophic either (on the order of 10%)
  • given that most of our real usage scenarios don't involve constantly saturated memory pools, it's a good change

@AnnaShaleva (Member) left a comment:


Otherwise this change looks legit to me.

check.go: review comment resolved
@AnnaShaleva modified the milestone: v0.3.2 (Jan 10, 2025)
It's the earliest point; if a view change happens the timer is reset (because
the new view comes with a new set of transactions, potentially picking up ones
received between views).

Signed-off-by: Roman Khimov <[email protected]>
@AnnaShaleva merged commit 27db04c into master on Jan 10, 2025
12 checks passed
@AnnaShaleva deleted the move-lastblock-time branch on January 10, 2025 09:09