services/horizon/ledger: Fix memory leak in HistoryDBSource #2548

Merged
merged 5 commits into from
May 5, 2020


bartekn
Contributor

@bartekn bartekn commented May 4, 2020

PR Checklist

PR Structure

  • This PR has reasonably narrow scope (if not, break it down into smaller PRs).
  • This PR avoids mixing refactoring changes with feature changes (split into two PRs
    otherwise).
  • This PR's title starts with name of package that is most changed in the PR, ex.
    services/friendbot, or all or doc if the changes are broad or impact many
    packages.

Thoroughness

  • This PR adds tests for the most critical parts of the new functionality or fixes.
  • I've updated any docs (developer docs, .md
    files, etc... affected by this change). Take a look in the docs folder for a given service,
    like this one.

Release planning

  • I've updated the relevant CHANGELOG (here for Horizon) if
    needed with deprecations, added features, breaking changes, and DB schema changes.
  • I've decided if this PR requires a new major/minor version according to
    semver, or if it's mainly a patch change. The PR is targeted at the next
    release branch if it's not a patch change.

What

This commit fixes a memory leak in the HistoryDBSource.NextLedger method. The method is typically used as in stream_handler.go:

    select {
    case currentLedgerSequence = <-handler.LedgerSource.NextLedger(currentLedgerSequence):
        continue
    case <-ctx.Done():
        stream.Done()
        return
    }

When a streaming connection is closed but there are no new ledgers (e.g. due to database or network issues), the goroutines started in HistoryDBSource.NextLedger never exit.

To fix this, the ledger.Source interface was extended with a Close() method that should be called when a ledger source is no longer used; it makes the internal goroutine exit. The code was refactored to create a new HistoryDBSource for each streaming request (so it can be closed when handler execution ends).
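The pattern described above can be sketched roughly as follows. This is a minimal illustration and not the code from this PR: pollingSource, newPollingSource, and the latest callback are hypothetical names standing in for HistoryDBSource and its database query, and the done channel is just one possible way to signal shutdown.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// pollingSource is a hypothetical stand-in for HistoryDBSource. It polls a
// caller-supplied function instead of the history database.
type pollingSource struct {
	updateFrequency time.Duration
	latest          func() uint32 // assumed lookup of the newest ledger sequence
	done            chan struct{} // closed by Close to stop polling goroutines
	closeOnce       sync.Once
	wg              sync.WaitGroup
}

func newPollingSource(freq time.Duration, latest func() uint32) *pollingSource {
	return &pollingSource{updateFrequency: freq, latest: latest, done: make(chan struct{})}
}

// NextLedger returns a channel that receives the first sequence greater than
// currentSequence. The polling goroutine exits once Close has been called.
func (s *pollingSource) NextLedger(currentSequence uint32) chan uint32 {
	response := make(chan uint32, 1) // buffered so the send never blocks
	s.wg.Add(1)
	go func() {
		defer s.wg.Done()
		for {
			if seq := s.latest(); seq > currentSequence {
				response <- seq
				return
			}
			select {
			case <-s.done:
				return // source closed: stop instead of polling forever
			case <-time.After(s.updateFrequency):
			}
		}
	}()
	return response
}

// Close stops every goroutine started by NextLedger and waits for them to
// exit. It is meant to be called from the goroutine that called NextLedger.
func (s *pollingSource) Close() {
	s.closeOnce.Do(func() { close(s.done) })
	s.wg.Wait()
}

func main() {
	// The latest ledger never advances past 5, simulating "no new ledgers".
	src := newPollingSource(10*time.Millisecond, func() uint32 { return 5 })
	ch := src.NextLedger(5)
	src.Close() // returns only after the polling goroutine has exited
	select {
	case seq := <-ch:
		fmt.Println("unexpected ledger", seq)
	default:
		fmt.Println("source closed, no goroutine left behind")
	}
}
```

In a streaming handler, a source like this would be created per request and closed with defer, so the polling goroutine ends together with the handler even if no new ledger ever arrives.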

Why

When there are no new ledgers, the goroutines started in HistoryDBSource.NextLedger will leak and will eventually consume all available memory, like in #2535.

Known limitations

The new code creates a new HistoryDBSource for each streaming request (instead of reusing a single HistoryDBSource across all streaming requests). This should have a minimal impact on memory usage.

@bartekn bartekn requested a review from a team May 4, 2020 18:58
@@ -46,6 +49,10 @@ func (source HistoryDBSource) NextLedger(currentSequence uint32) chan uint32 {
time.Sleep(source.updateFrequency)
}

if source.closed {
Contributor

Another goroutine may write to source.closed, so I think the behavior is technically undefined (it is a data race). I think it would be a good idea to use a lock or a context to signal cancellation.

Contributor Author

You're right. I was under the impression that single read/single write is totally fine but https://golang.org/ref/mem is clear that it's not. Fixed by using explicit synchronization.
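For illustration, one form of explicit synchronization is a mutex-guarded flag. The sketch below is an assumption about what such a fix could look like, not the change that was actually pushed; closeFlag and its methods are hypothetical names.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// closeFlag is a hypothetical helper: a boolean guarded by a mutex so that
// one goroutine can set it while another reads it without a data race.
type closeFlag struct {
	mu     sync.Mutex
	closed bool
}

// set marks the flag as closed.
func (f *closeFlag) set() {
	f.mu.Lock()
	defer f.mu.Unlock()
	f.closed = true
}

// isSet reports whether set has been called.
func (f *closeFlag) isSet() bool {
	f.mu.Lock()
	defer f.mu.Unlock()
	return f.closed
}

func main() {
	var flag closeFlag
	stopped := make(chan struct{})

	// Polling goroutine: checks the flag between sleeps, much like a loop
	// that would otherwise read a plain bool field.
	go func() {
		defer close(stopped)
		for !flag.isSet() {
			time.Sleep(time.Millisecond)
		}
	}()

	flag.set() // written from a different goroutine than the reader
	<-stopped  // the poller observed the write and exited
	fmt.Println("poller stopped, closed =", flag.isSet())
}
```

Running a program like this with `go run -race` is a quick way to check that the access is synchronized; the same loop over an unguarded bool would be reported as a race.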

Development

Successfully merging this pull request may close these issues.

Check for possible memleak when --exp-ingest-in-memory-only and ingestion errors
2 participants