Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(ci): Avoid inbound service overloads and fix failing tests #6537

Merged
merged 5 commits into from
Apr 23, 2023

Conversation

teor2345
Copy link
Contributor

@teor2345 teor2345 commented Apr 18, 2023

Motivation

It looks like the syncer or state is hanging, then the inbound service is getting overloaded, which causes CI failures in some tests.

This might fix #6506.

Complex Code or Requirements

It's probably a concurrency bug, but this PR just tweaks some timings and test logs.

Solution

Fixes:

Diagnostics:

Review

This is an urgent fix for failures that are stopping other PRs merging.

Reviewer Checklist

  • Will the PR name make sense to users?
    • Does it need extra CHANGELOG info? (new features, breaking changes, large changes)
  • Are the PR labels correct?
  • Does the code do what the ticket and PR says?
    • Does it change concurrent code, unsafe code, or consensus rules?
  • How do you know it works? Does it have tests?

Follow Up Work

We might need to fix a state or syncer hang, but we need more debug info to know which one to fix.

@teor2345 teor2345 added P-High 🔥 I-hang A Zebra component stops responding to requests I-integration-fail Continuous integration fails, including build and test failures A-network Area: Network protocol updates or fixes C-testing Category: These are tests A-state Area: State / database changes C-trivial Category: A trivial change that is not worth mentioning in the CHANGELOG A-concurrency Area: Async code, needs extra work to make it work properly. labels Apr 18, 2023
@teor2345 teor2345 requested a review from a team as a code owner April 18, 2023 04:07
@teor2345 teor2345 self-assigned this Apr 18, 2023
@teor2345 teor2345 requested a review from a team as a code owner April 18, 2023 04:07
@teor2345 teor2345 requested review from oxarbitrage and removed request for a team April 18, 2023 04:07
@github-actions github-actions bot added the C-bug Category: This is a bug label Apr 18, 2023
@codecov
Copy link

codecov bot commented Apr 18, 2023

Codecov Report

Merging #6537 (8ed898b) into main (f2133c9) will decrease coverage by 0.19%.
The diff coverage is 10.00%.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #6537      +/-   ##
==========================================
- Coverage   77.92%   77.74%   -0.19%     
==========================================
  Files         306      306              
  Lines       40236    40267      +31     
==========================================
- Hits        31355    31305      -50     
- Misses       8881     8962      +81     

Base automatically changed from private-nonce-mem-fix to main April 18, 2023 08:13
This disables around 10,000 logs like:
2023-04-18T02:46:28.441662Z  WARN init{config=Config { checkpoint_sync: true, debug_skip_parameter_preload: false } network=Mainnet debug_skip_parameter_preload=true}: unexpected error: Closed in state request while verifying previous state checkpoints
Copy link
Contributor

@oxarbitrage oxarbitrage left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me, thanks.

mergify bot added a commit that referenced this pull request Apr 23, 2023
@mergify mergify bot merged commit 9c15b14 into main Apr 23, 2023
@mergify mergify bot deleted the inbound-overload-fix branch April 23, 2023 13:41
@teor2345 teor2345 changed the title fix(ci): Avoid inbound service overloads in tests fix(ci): Avoid inbound service overloads. mainly in tests Apr 26, 2023
@teor2345 teor2345 changed the title fix(ci): Avoid inbound service overloads. mainly in tests fix(ci): Avoid inbound service overloads and fix failing tests Apr 26, 2023
@oxarbitrage oxarbitrage mentioned this pull request May 9, 2023
38 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-concurrency Area: Async code, needs extra work to make it work properly. A-network Area: Network protocol updates or fixes A-state Area: State / database changes C-bug Category: This is a bug C-testing Category: These are tests C-trivial Category: A trivial change that is not worth mentioning in the CHANGELOG I-cost Zebra infrastructure costs I-hang A Zebra component stops responding to requests I-integration-fail Continuous integration fails, including build and test failures
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Multiple tests fail in CI with "inbound service overloaded" warnings
2 participants