Refactor ingestion data-flow #4909

tamirms · 2023-06-15T08:42:50Z

In https://docs.google.com/document/d/1YETNALx5EzqZDNSVWzTfaK5Ogw84PsBlrt64nOr-Njg/edit?usp=sharing we were able to speed up ingestion by refactoring the ingestion data-flow. Currently the Horizon processors for the history tables are coupled to a single ledger. We need to break this coupling and allow a Horizon processor instance to be used on batches of ledgers.

Also, we need to refactor how the DB is used in a Horizon processor. Currently, we periodically insert batches of rows to the DB and flush all remaining rows to the DB at the end of the ingestion round. We should instead accumulate all the rows in memory and only use the DB at the very end of the ingestion round to flush all the rows using COPY statements.

Note that these changes only need to be applied on the Horizon processors for the history tables. We do not need to modify the Horizon processors for the state tables.

In the spike branch the interface for the Horizon history processors was changed to:

type horizonTransactionProcessor interface {
	ProcessTransaction(xdr.LedgerCloseMeta, ingest.LedgerTransaction) error
	Commit(ctx context.Context, session db.SessionInterface) error
}

Note how ProcessTransaction() now takes xdr.LedgerCloseMeta parameter which allows the processor to be used on transactions spanning multiple ledgers. Also, ProcessTransaction() no longer has a context parameter because we don't expect to have any DB operations in ProcessTransaction(). Instead, ProcessTransaction() will only accumulate rows for the history tables in-memory. The Commit() function is the only part of the processor which should have access to the db and it will be used to flush the in-memory rows to the DB.

Some of the history tables rely on lookup tables to obtain integer ids for accounts, assets, claimable balances, and liquidity pools. In this case the data-flow will be slightly more complex. In the spike branch, the code for keeping track of all the id lookups was encapsulated into a "loader" component. Whenever the processor encountered an account string in ProcessTransaction(), the processor would register the account string in the loader component. The resulting data-flow looked like:

	accountLoader := history.NewAccountLoader()
	cbLoader := history.NewClaimableBalanceLoader()
	lpLoader := history.NewLiquidityPoolLoader()
	assetLoader := history.NewAssetLoader()
	processors := buildTransactionProcessors(
		s.historyQ,
		accountLoader,
		cbLoader,
		lpLoader,
		assetLoader,
	)

       // apply all the ledgers in the batch on the processors
       for _, ledger := range ledgers {
		if err = s.runner.ApplyProcessorsOnLedger(processors, ledgerCloseMeta); err != nil {
			return err
		}
       }

       // use the loaders to lookup all the accounts, assets, claimable balances, and liquidity pools registered
       // by the processors
       err = func() error {
		if err := s.historyQ.Begin(); err != nil {
			return errors.Wrap(err, "Error starting a transaction")
		}
		defer s.historyQ.Rollback()

		if err := accountLoader.Exec(s.ctx, s.historyQ); err != nil {
			return err
		}
		if err := cbLoader.Exec(s.ctx, s.historyQ); err != nil {
			return err
		}
		if err := lpLoader.Exec(s.ctx, s.historyQ); err != nil {
			return err
		}
		if err := assetLoader.Exec(s.ctx, s.historyQ); err != nil {
			return err
		}
		if err := s.historyQ.Commit(); err != nil {
			return errors.Wrap(err, commitErrMsg)
		}
		return nil
	}()

        // flush the rows to the db, the processors will be able to obtain the integer ids from the loaders
	if err := s.historyQ.Begin(); err != nil {
		return errors.Wrap(err, "Error starting a transaction")
	}
	defer s.historyQ.Rollback()
	if err := processors.Commit(s.ctx, s.historyQ); err != nil {
		return err
	}
	if err := s.historyQ.Commit(); err != nil {
		return errors.Wrap(err, commitErrMsg)
	}

This refactoring will need to be implemented on the following Horizon ingestion processors:

The text was updated successfully, but these errors were encountered:

sreuland · 2023-10-12T17:55:25Z

have begun working on sub-task to finish integrating the new processor interface into the tx/commit flow of processor runner

…ocessor runners

…lder copy statements

… 0 or more transactions

…er supported

…count per ledger

…asset texts, trade processor depends on ordered id's for buyer/seller delineation

…etKey

sreuland · 2023-10-24T19:09:38Z

I think before closing this ticket, the criteria also includes porting the latest mainline of horizon back to the ingestion-next feature, assert all tests pass, and then deploy build to staging infrastructure and test full history ingestion on pubnet.

tamirms · 2023-11-01T09:27:05Z

@sreuland once #5096 is merged, I think we should merge the ingestion-next branch into master.

We can deploy to staging and run the verify-range jobs on full history during the testing phase of the release process. Running verify-range will take more than 1 week and I'd prefer to avoid having the master branch diverge from ingestion-next (thus creating more merge conflicts) during that time.

…batch builder rows

sreuland · 2023-11-01T19:39:30Z

@sreuland once #5096 is merged, I think we should merge the ingestion-next branch into master.

We can deploy to staging and run the verify-range jobs on full history during the testing phase of the release process. Running verify-range will take more than 1 week and I'd prefer to avoid having the master branch diverge from ingestion-next (thus creating more merge conflicts) during that time.

@tamirms got it, no staging performance tests after merging ingestion-next to master, rather @urvisavla and I would proceed with those new remaining feature dev tickets for ingestion perf - #5098/#5099, get those into master, then release prep for horizon 2.28.0, and run the staging verify-range on a larger than usual range for performance insight as part of release, sounds good, as that reduces verify-range effort to one occurrence.

tamirms · 2023-11-01T19:46:40Z

@sreuland yes, that's right

tamirms added horizon snapshots labels Jun 15, 2023

tamirms added this to Platform Scrum Jun 15, 2023

github-project-automation bot moved this to Backlog in Platform Scrum Jun 15, 2023

tamirms moved this from Backlog to Next Sprint Proposal in Platform Scrum Jun 15, 2023

mollykarcher added performance issues aimed at improving performance and removed snapshots labels Jun 15, 2023

mollykarcher moved this from Next Sprint Proposal to Current Sprint in Platform Scrum Jun 20, 2023

Shaptic moved this from Current Sprint to Next Sprint Proposal in Platform Scrum Jun 29, 2023

tamirms mentioned this issue Jul 18, 2023

services/horizon/internal/db2/history: Use FastBatchInsertBuilder to insert transactions into the history_transactions #4950

Merged

7 tasks

mollykarcher moved this from Next Sprint Proposal to Current Sprint in Platform Scrum Jul 19, 2023

tamirms moved this from Current Sprint to In Progress in Platform Scrum Jul 25, 2023

tamirms self-assigned this Jul 25, 2023

tamirms mentioned this issue Aug 1, 2023

services/horizon/internal/ingest/processors: Refactor ledgers, transactions, and operations processors to support new ingestion data flow #5004

Merged

7 tasks

tamirms mentioned this issue Aug 11, 2023

services/horizon/internal/db2/history: Implement account loader and future account ids #5015

Merged

7 tasks

sreuland self-assigned this Oct 12, 2023

sreuland added a commit to sreuland/go that referenced this issue Oct 16, 2023

stellar#4909: initial wip on integrating loaders and builders into pr…

da5f7de

…ocessor runners

sreuland added a commit to sreuland/go that referenced this issue Oct 17, 2023

stellar#4909: working test results with byte arrays in fast batch bui…

c1d5ce7

…lder copy statements

sreuland added a commit to sreuland/go that referenced this issue Oct 17, 2023

stellar#4909: fixed json column tests on fast batch builder

30ed273

sreuland added a commit to sreuland/go that referenced this issue Oct 19, 2023

stellar#4909: run ledger processor regardless of whether a ledger has…

1efb7a3

… 0 or more transactions

sreuland added a commit to sreuland/go that referenced this issue Oct 20, 2023

stellar#4909: removed panic from Value() on loaders

7c3842b

sreuland added a commit to sreuland/go that referenced this issue Oct 20, 2023

stellar#4909: fix asset loader on null terminated codes

bf0f64c

sreuland added a commit to sreuland/go that referenced this issue Oct 20, 2023

stellar#4909: removed out of sequence order integration test, no long…

df07d8b

…er supported

sreuland added a commit to sreuland/go that referenced this issue Oct 23, 2023

stellar#4909: fixed integration test to not submit multiple tx per ac…

9a1010b

…count per ledger

sreuland added a commit to sreuland/go that referenced this issue Oct 24, 2023

stellar#4909: fixed bulk asset insertion order id's after sorting by …

80cfc01

…asset texts, trade processor depends on ordered id's for buyer/seller delineation

sreuland added a commit to sreuland/go that referenced this issue Oct 24, 2023

stellar#4909: follow xdr Asset for String serialization of loader Ass…

208f48c

…etKey

sreuland added a commit to sreuland/go that referenced this issue Oct 24, 2023

stellar#4909: resolved verify-range job not running

9a4d0eb

sreuland added a commit to sreuland/go that referenced this issue Oct 25, 2023

stellar#4909: review feedback, cleanup

dd30b6a

sreuland added a commit to sreuland/go that referenced this issue Oct 26, 2023

stellar#4909: removed stub loader Sealed() method for test purposes

625ce1f

sreuland added a commit to sreuland/go that referenced this issue Oct 26, 2023

stellar#4909: fixed removed unused method parameter

d7cfc1c

This was referenced Oct 26, 2023

integrating new loaders and builders into processors #5083

Merged

Merge latest master to ingestion-next #5096

Merged

sreuland added a commit to sreuland/go that referenced this issue Oct 31, 2023

stellar#4909: review feedback on err handling in processor

2be340b

tamirms mentioned this issue Nov 1, 2023

Ingest batches of ledgers in-memory before flushing to DB #5099

Closed

4 tasks

sreuland added a commit to sreuland/go that referenced this issue Nov 1, 2023

stellar#4909: convert byte[] values to string before sending to fast …

769a287

…batch builder rows

sreuland mentioned this issue Nov 1, 2023

horizon/services/ingest: merge ingestion-next to master #5101

Merged

7 tasks

tamirms closed this as completed in #5101 Nov 2, 2023

github-project-automation bot moved this from In Progress to Done in Platform Scrum Nov 2, 2023

sreuland mentioned this issue Nov 15, 2023

services/horizon/ingest: historyRange and reingestHistoryRange states send batches of ledgers to tx processors #5117

Merged

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor ingestion data-flow #4909

Refactor ingestion data-flow #4909

tamirms commented Jun 15, 2023 •

edited by sreuland

Loading

sreuland commented Oct 12, 2023

sreuland commented Oct 24, 2023

tamirms commented Nov 1, 2023

sreuland commented Nov 1, 2023

tamirms commented Nov 1, 2023

Refactor ingestion data-flow #4909

Refactor ingestion data-flow #4909

Comments

tamirms commented Jun 15, 2023 • edited by sreuland Loading

sreuland commented Oct 12, 2023

sreuland commented Oct 24, 2023

tamirms commented Nov 1, 2023

sreuland commented Nov 1, 2023

tamirms commented Nov 1, 2023

tamirms commented Jun 15, 2023 •

edited by sreuland

Loading