
test: mempool: Add unit and integration tests #8017

Merged: 6 commits merged into master from bloxico/mempool_tests on Feb 14, 2022

Conversation

@brdji (Contributor) commented on Feb 2, 2022

Related Issues

There were no existing messagepool integration tests; the pool was only being tested as a side effect of other features and their integration tests. Adding tests that target only the messagepool makes debugging easier, improves code safety, and improves coverage.

Proposed Changes

Add more unit and integration tests covering untested behaviors:

  • Uncovered lines (e.g. check_test.go, and the Add-message tests in messagepool_test.go)
  • Uncovered integration behaviors: batch push, remove, etc.

Checklist

Before you mark the PR ready for review, please make sure that:

  • All commits have a clear commit message.
  • The PR title is in the form of <PR type>: <area>: <change being made>
    • example: fix: mempool: Introduce a cache for valid signatures
    • PR type: fix, feat, INTERFACE BREAKING CHANGE, CONSENSUS BREAKING, build, chore, ci, docs, perf, refactor, revert, style, test
    • area: api, chain, state, vm, data transfer, market, mempool, message, block production, multisig, networking, paychan, proving, sealing, wallet, deps
  • This PR has tests for new functionality or change in behaviour
  • If new user-facing features are introduced, clear usage guidelines and / or documentation updates should be included in https://lotus.filecoin.io or Discussion Tutorials.
  • CI is green

@brdji requested a review from a team as a code owner on February 2, 2022, 17:09
codecov bot commented on Feb 8, 2022

Codecov Report

Merging #8017 (3438732) into master (2e22781) will increase coverage by 0.08%.
The diff coverage is 81.81%.


@@            Coverage Diff             @@
##           master    #8017      +/-   ##
==========================================
+ Coverage   39.13%   39.22%   +0.08%     
==========================================
  Files         662      663       +1     
  Lines       72160    72171      +11     
==========================================
+ Hits        28242    28310      +68     
+ Misses      39008    38953      -55     
+ Partials     4910     4908       -2     
Impacted Files Coverage Δ
itests/kit/circuit.go 81.81% <81.81%> (ø)
markets/loggers/loggers.go 88.88% <0.00%> (-11.12%) ⬇️
chain/events/observer.go 71.64% <0.00%> (-6.72%) ⬇️
storage/wdpost_sched.go 75.49% <0.00%> (-5.89%) ⬇️
node/hello/hello.go 63.63% <0.00%> (-3.41%) ⬇️
extern/sector-storage/stores/local.go 57.22% <0.00%> (-2.78%) ⬇️
chain/store/store.go 63.00% <0.00%> (-2.50%) ⬇️
extern/storage-sealing/fsm.go 56.44% <0.00%> (-2.44%) ⬇️
miner/miner.go 54.75% <0.00%> (-1.97%) ⬇️
chain/gen/gen.go 68.19% <0.00%> (-1.23%) ⬇️
... and 18 more

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2e22781...3438732.

@ZenGround0 (Contributor) left a comment

I'm newer to this area of the code, but LGTM. If I'm missing a logic problem or a flaky-test introduction, we can always remove tests as they become problems.

Per @vyzo's feedback, we shouldn't parse error messages but should figure out a smarter way to do this. I updated the code to just check for error existence; @brdji should figure out what to do next.
@ZenGround0 (Contributor) commented

@TheMenko tests are failing now

@ZenGround0 self-requested a review on February 10, 2022, 19:40
@ZenGround0 (Contributor) left a comment

Since your last commit had no logic change, I think this means one of your tests is flaky.

@TheDivic (Contributor) commented

@ZenGround0 Seems we have a flake, I may have just died a little bit inside. 🥲
Do you have any advice on how I can manually rerun the tests on CI to check whether I've fixed it, or do I have to push changes to trigger them?

@ZenGround0 (Contributor) commented

I have the ability to rerun failures, but since the test is flaky that won't do much good: a failure always means you need another push, and it's rerunning successes that you're interested in.

In general I would run it locally, under the right conditions and enough times to reproduce a failure, make the change that removes the flake, and try to reproduce again if I were unsure. Usually understanding the root cause of the flake and convincing myself that it's gone is enough for me.

In the rare case where local reproduction just isn't happening, then after making the change I was convinced removed the flake, I would make a bunch of no-op changes and push enough times to convince myself, with high probability, that the flake was gone.

@TheDivic (Contributor) commented

@ZenGround0 Thanks for the detailed explanation! We will find a way to reproduce and fix it locally, and we'll get back to you afterwards with an explanation! 🙂

The flake was caused by improper waiting for certain chain operations
to finish:

- We didn't wait for messages to be registered as pushed.
- We waited for a fixed time (10 seconds) for messages to be mined, which
in the best case waits longer than necessary and in the worst case causes
the test to break.

What I did:
- Fixed the flake by waiting in a loop for "just enough time". This also made
the test run faster on average, because there is no unnecessary waiting.
- Added a "circuit breaker" so the wait loop times out after 10 seconds.

Using the same pattern described in my previous commit,
I also added the CircuitBreaker helper to the itests kit, as it may be useful
for other integration tests when debugging flakiness caused by timeouts.
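
To make the pattern concrete, here is a hypothetical before/after sketch (not the actual test code from this PR). The test name and the messagesMined placeholder are invented for illustration; only the kit.CircuitBreaker signature comes from the diff discussed below.

package kit_test

import (
	"testing"
	"time"

	"github.com/filecoin-project/lotus/itests/kit"
)

func TestWaitForMinedMessages(t *testing.T) {
	// Placeholder condition; the real test would query the chain/mempool here.
	messagesMined := func() bool { return true }

	// Before: a fixed sleep always pays the full 10 seconds and still breaks
	// whenever mining happens to take longer than that:
	//   time.Sleep(10 * time.Second)

	// After: re-check every 100ms, return as soon as the condition holds,
	// and fail the test with a labelled error if 10 seconds elapse first.
	kit.CircuitBreaker(t, "wait for messages to be mined",
		100*time.Millisecond, 10*time.Second, messagesMined)
}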
@TheDivic (Contributor) commented

@ZenGround0 I think this PR is ready to be merged now.
The flake was caused by improper waiting for async operations to complete, using time.Sleep.
I reproduced it locally by running it many times with go test -count=N. I used the same approach to validate that it's now fixed.
The solution is explained in detail in my commit messages.
I also extended itests/kit with a helper function for debugging and fixing flakes with the same root cause, since it may be useful in other tests.

The reviewed excerpt from itests/kit/circuit.go (the end of the helper's doc comment and its signature):

You can use it if t.Deadline() is not "granular" enough and you want to know which specific piece of code timed out,
or you need to set different deadlines in the same test.
*/
func CircuitBreaker(t *testing.T, label string, throttle, timeout time.Duration, cb func() bool) {
A contributor commented:

Nice!
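
For readers without the diff open, here is a minimal sketch of what a helper with the signature above could look like, assuming it simply polls cb every throttle interval and fails the test with the given label once timeout elapses. The actual implementation in itests/kit/circuit.go may differ in detail.

package kit

import (
	"testing"
	"time"
)

// CircuitBreaker polls cb every throttle until it returns true, failing the
// test with the given label if timeout elapses first. (Sketch only.)
func CircuitBreaker(t *testing.T, label string, throttle, timeout time.Duration, cb func() bool) {
	t.Helper()

	deadline := time.Now().Add(timeout)
	for !cb() {
		if time.Now().After(deadline) {
			t.Fatalf("%s: timed out after %s", label, timeout)
		}
		time.Sleep(throttle) // back off before re-checking the condition
	}
}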

@ZenGround0 merged commit 3b5b55d into master on Feb 14, 2022
@ZenGround0 deleted the bloxico/mempool_tests branch on February 14, 2022, 13:46

5 participants