Recently rejected cache for transaction queue #9005

tomusdrw · 2018-06-28T14:34:40Z

Tests show that we receive the same transactions multiple times (especially when running with a large number of peers).
This causes excessive verification overhead, because the same transaction is verified over and over again and then rejected for exactly the same reason.

This PR introduces a cache for rejections, which we clear after every block.

Also, because of the large number of transactions, the sync -> IoWorker queue was easily saturated and a lot of transactions were simply dropped. This PR addresses that as well by making the queue size depend on the transaction pool size.
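To illustrate the saturation problem, here is a minimal sketch, not the code from this PR. It uses a bounded `std::sync::mpsc::sync_channel` as a stand-in for the sync -> IoWorker queue. The function names, the `MIN_QUEUE_SIZE` floor, and the sizing rule are assumptions made for the example.

```rust
use std::sync::mpsc::{sync_channel, TrySendError};

/// Hypothetical sizing rule: the queue grows with the pool,
/// but never drops below an assumed floor.
fn verification_queue_size(pool_size: usize) -> usize {
    const MIN_QUEUE_SIZE: usize = 1024; // assumed value, not taken from the PR
    std::cmp::max(MIN_QUEUE_SIZE, pool_size)
}

/// Pushes `count` dummy transactions into a bounded queue without blocking
/// and returns how many were dropped because the queue was full.
fn dropped_when_saturated(capacity: usize, count: usize) -> usize {
    // `_rx` is kept alive so that sends fail with `Full`, not `Disconnected`.
    let (tx, _rx) = sync_channel::<usize>(capacity);
    let mut dropped = 0;
    for i in 0..count {
        if let Err(TrySendError::Full(_)) = tx.try_send(i) {
            dropped += 1;
        }
    }
    dropped
}
```

With nothing draining the queue, a capacity of 4 and 10 incoming items means 6 items are dropped. A fixed capacity that is small relative to the pool loses transactions in the same way.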

In a subsequent PR I plan to optimize cache eviction, because initial tests indicate that the most common rejection reasons don't need to be cleared at every block:

$ cat recently.log | awk "{print \$12}" | sort | uniq -c | sort -n
      3 LimitReached
   6403 GasLimitExceeded
  39812 Old
 157679 InsufficientBalance
 291630 InsufficientGasPrice
 634291 TooCheapToReplace
 717894 AlreadyImported

Related #8696

dvdplm · 2018-06-29T07:13:34Z

miner/src/pool/queue.rs

+}
+
+/// Minimal size of rejection cache, by default it's equal to queue size.
+const MIN_REJECTED_CACHE_SIZE: usize = 2048;


Maybe use this to instantiate the hashmap with_capacity()? Not sure how hot this cache is expected to be.

Good idea. The cache is used quite extensively, especially when running with a small transaction pool.
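A minimal sketch of the suggestion, assuming the cache limit is the larger of `MIN_REJECTED_CACHE_SIZE` and the queue size (the exact rule is not shown in the quoted diff). The helper name `new_rejection_map`, the `[u8; 32]` key, and the `String` value are placeholders for illustration.

```rust
use std::collections::HashMap;

/// Value quoted in the diff above.
const MIN_REJECTED_CACHE_SIZE: usize = 2048;

/// Hypothetical helper: pre-allocates the map so that filling it up to the
/// limit does not trigger repeated reallocation and rehashing.
fn new_rejection_map(queue_size: usize) -> HashMap<[u8; 32], String> {
    // Assumed rule: never go below the minimal cache size.
    let limit = std::cmp::max(MIN_REJECTED_CACHE_SIZE, queue_size);
    HashMap::with_capacity(limit)
}
```

`HashMap::with_capacity` guarantees room for at least the requested number of entries, which matters for a map that is refilled after every block.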

dvdplm · 2018-06-29T07:16:19Z

miner/src/pool/queue.rs

@@ -138,6 +138,50 @@ impl CachedPending {
 	}
 }

+#[derive(Debug)]
+struct RecentlyRejected {
+	inner: RwLock<HashMap<H256, transaction::Error>>,


Curious about why you're storing the error here as opposed to using a HashSet. Do you plan to use the error count stats to refine the logic further down?

The error is stored here so that we can return it later (we could return a generic RecentlyRejected, but it's more descriptive to have the actual values). Remember that Error can look like this: InsufficientGasPrice { got, minimal }, so the values might be completely different for every transaction.

I'm planning to refine the logic a bit as well, especially with regard to cache invalidation (like: "Clear all InsufficientBalance errors efficiently"), and was considering approaches like Vec<(Error, HashSet)> or Vec<ErrorCode, HashMap<Hash, Error>>, but haven't decided yet which path to follow. I will run some performance tests, because maybe HashMap<Hash, Error> and iterating over all entries (or using retain) will be efficient enough.

Thank you for the explanation! :)
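The simplest of the options discussed in this thread (a plain HashMap<Hash, Error> plus `retain`) could look roughly like the sketch below. The `Hash` alias, the reduced `Error` enum with its fields, and the function name are stand-ins for illustration, not the types from the codebase.

```rust
use std::collections::HashMap;

/// Stand-in for a 32-byte transaction hash.
type Hash = [u8; 32];

/// Reduced stand-in for the transaction error type; fields are illustrative.
#[derive(Debug, Clone, PartialEq)]
enum Error {
    Old,
    InsufficientBalance { balance: u64, cost: u64 },
    InsufficientGasPrice { minimal: u64, got: u64 },
}

/// Drops every cached `InsufficientBalance` rejection and keeps the rest.
/// This walks all entries once, so the cost is linear in the cache size.
fn clear_insufficient_balance(cache: &mut HashMap<Hash, Error>) {
    cache.retain(|_, err| !matches!(err, Error::InsufficientBalance { .. }));
}
```

Whether a single linear pass is fast enough is exactly what the planned performance tests would have to show. The grouped layouts trade extra bookkeeping for cheaper per-reason clearing.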

dvdplm · 2018-06-29T07:20:53Z

miner/src/pool/queue.rs

+		inner.insert(hash, err.clone());
+
+		// clean up
+		if inner.len() > self.limit {


If we never go above the limit, the cache entries never expire. Is that OK?

The cache is completely cleared on Pool::cull, which in turn should be run on every block.
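The two mechanisms in this thread, a size limit on insert and a full clear per block, can be sketched as follows. This is a simplified model under stated assumptions: it uses `std::sync::RwLock`, a `[u8; 32]` hash, a reduced `Error` enum, and an assumed eviction policy (drop an arbitrary `limit / 2` entries once the limit is exceeded). The quoted diff only shows the `inner.len() > self.limit` check, not what happens next.

```rust
use std::collections::HashMap;
use std::sync::RwLock;

/// Stand-in for a 32-byte transaction hash (`H256` in the PR).
type Hash = [u8; 32];

/// Reduced stand-in for `transaction::Error`.
#[derive(Debug, Clone, PartialEq)]
enum Error {
    Old,
    InsufficientGasPrice { minimal: u64, got: u64 },
}

#[derive(Debug)]
struct RecentlyRejected {
    inner: RwLock<HashMap<Hash, Error>>,
    limit: usize,
}

impl RecentlyRejected {
    fn new(limit: usize) -> Self {
        RecentlyRejected {
            inner: RwLock::new(HashMap::with_capacity(limit)),
            limit,
        }
    }

    /// Returns the cached rejection reason, if the hash was rejected recently.
    fn get(&self, hash: &Hash) -> Option<Error> {
        self.inner.read().unwrap().get(hash).cloned()
    }

    /// Remembers a rejection and trims the cache if it outgrew the limit.
    fn insert(&self, hash: Hash, err: &Error) {
        let mut inner = self.inner.write().unwrap();
        inner.insert(hash, err.clone());

        // Clean up. Assumed policy: remove an arbitrary `limit / 2` entries.
        if inner.len() > self.limit {
            let to_remove: Vec<Hash> =
                inner.keys().take(self.limit / 2).cloned().collect();
            for key in to_remove {
                inner.remove(&key);
            }
        }
    }

    /// Forgets everything; meant to be called once per block.
    fn clear(&self) {
        self.inner.write().unwrap().clear();
    }

    fn len(&self) -> usize {
        self.inner.read().unwrap().len()
    }
}
```

With a limit of 4, inserting five distinct hashes leaves three entries, so the limit only bounds memory between blocks. Expiry itself comes from calling `clear` on every cull.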

5chdn · 2018-07-02T13:03:35Z

@dvdplm or @andresilva for a final review?

* Store recently rejected transactions.
* Don't cache AlreadyImported rejections.
* Make the size of transaction verification queue dependent on pool size.
* Add a test for recently rejected.
* Fix logging for recently rejected.
* Make rejection cache smaller.
* obsolete test removed
* obsolete test removed
* Construct cache with_capacity.

tomusdrw added 7 commits June 20, 2018 14:38

Store recently rejected transactions.

f07f6ba

Merge branch 'master' into td-recently-rejected

4e3a0d9

Don't cache AlreadyImported rejections.

43eb599

Make the size of transaction verification queue dependent on pool size.

1d31e4e

Add a test for recently rejected.

080bd76

Fix logging for recently rejected.

e6d480a

Merge branch 'master' into td-recently-rejected

842bf95

tomusdrw added A0-pleasereview 🤓 Pull request needs code review. B0-patchthis M4-core ⛓ Core client code / Rust. labels Jun 28, 2018

tomusdrw requested a review from andresilva June 28, 2018 14:34

Make rejection cache smaller.

d3fe51e

tomusdrw force-pushed the td-recently-rejected branch from 385dddf to d3fe51e Compare June 28, 2018 14:35

5chdn added this to the 1.12 milestone Jun 28, 2018

svyatonik approved these changes Jun 29, 2018

svyatonik added 2 commits June 29, 2018 09:26

obsolete test removed

5bf7033

obsolete test removed

ee199f4

dvdplm reviewed Jun 29, 2018

Construct cache with_capacity.

44d2476

5chdn mentioned this pull request Jun 30, 2018

Beta: 1.11.6 backports #9015

Merged

12 tasks

Merge branch 'master' into td-recently-rejected

222e5d2

5chdn added A8-looksgood 🦄 Pull request is reviewed well. and removed A0-pleasereview 🤓 Pull request needs code review. labels Jul 2, 2018

andresilva approved these changes Jul 2, 2018

andresilva merged commit 78e0012 into master Jul 2, 2018

5chdn deleted the td-recently-rejected branch July 3, 2018 09:40
