Horizon doesn't enforce retention policy (`HISTORY_RETENTION_COUNT` environment variable) #3711

crypto-billy · 2021-06-22T07:21:07Z

What version are you using?

Horizon: 2.3.0-5029e28d1ec6272a44f5c03ad732059b2fead31d
Core: stellar-core 17.1.0 (fbc0325759ff75dd250cb5e175978669cdb4e90a)
go: go1.16.3

What did you do?

Horizon was started with the following environment variables, including HISTORY_RETENTION_COUNT=200000:

APPLY_MIGRATIONS=true
CAPTIVE_CORE_CONFIG_APPEND_PATH=/opt/stellar/stellar-captive-core-stub.toml
CAPTIVE_CORE_STORAGE_PATH=/var/stellar/core-data
DATABASE_URL=postgres://postgres:{{ postgres_password }}@horizon-postgres:5432/horizon?sslmode=disable
ENABLE_CAPTIVE_CORE_INGESTION=true
HISTORY_ARCHIVE_URLS=https://history.stellar.org/prd/core-live/core_live_001
HISTORY_RETENTION_COUNT=200000
INGEST=true
NETWORK_PASSPHRASE=Public Global Stellar Network ; September 2015
PARALLEL_JOB_SIZE=100000
PER_HOUR_RATE_LIMIT=0
RETRIES=10
RETRY_BACKOFF_SECONDS=20
STELLAR_CORE_BINARY_PATH=/usr/bin/stellar-core
STELLAR_CORE_URL=http://localhost:11626

What did you expect to see?

Only latest 200,000 blocks are available on node, and disk consumption to stop growing.

What did you see instead?

The disk space consumption of the node has reached over 500GB, checked via API to find that the eldest_block was around 600,000 blocks away from block tip. Seems like db reaping did not occur at all.

Performed horizon db reap and was given the following output, the new_elder block was respective to the HISTORY_RETENTION_COUNT, which was 200,000 blocks away from the block tip:

INFO[2021-06-21T08:40:48.056Z] reaper: clearing                              new_elder=35800188 pid=315
INFO[2021-06-21T10:22:05.403Z] reaper succeeded                              new_elder=35800188 pid=315

I suppose this suggests that the retention configuration was set in place, but for some reason the reaper did not act on it for whatever reason?

Current workaround

Manually perform horizon db reap, and stop horizon+core to perform a Postgres vacuum to reclaim disk space.

The text was updated successfully, but these errors were encountered:

bartekn · 2021-06-22T11:04:07Z

Reaping is performed every one hour. Have you waited one hour?

crypto-billy · 2021-06-23T03:14:51Z

@bartekn Yes the node has been running for several weeks, it was started by manually ingesting the most recent (at the time) 200,000 blocks.

bartekn · 2021-06-23T11:11:24Z

Thanks! It's possible this was broken by #3567. We're going to check this. EDIT: after checking your logs I'm pretty sure it got broken by #3567. It took around 1.5h to clear ledgers. We probably need to move reap.Tick out of the main app Tick method.

leevlad · 2021-06-30T14:39:33Z

related: #3728

I believe this was broken by the 10-second timeout on the shared context in the app ticker.

crypto-billy added the bug label Jun 22, 2021

bartekn mentioned this issue Jul 9, 2021

services/horizon: Reap data in a DB transaction #3754

Merged

7 tasks

bartekn mentioned this issue Jul 23, 2021

services/horizon: Move reap service outside global tick #3777

Merged

7 tasks

bartekn closed this as completed in #3777 Jul 23, 2021

bartekn mentioned this issue Jul 23, 2021

horizon table history_transactions is not reaped with HISTORY_RETENTION_COUNT stellar/quickstart#266

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Horizon doesn't enforce retention policy (`HISTORY_RETENTION_COUNT` environment variable) #3711

Horizon doesn't enforce retention policy (`HISTORY_RETENTION_COUNT` environment variable) #3711

crypto-billy commented Jun 22, 2021

bartekn commented Jun 22, 2021

crypto-billy commented Jun 23, 2021

bartekn commented Jun 23, 2021 •

edited

Loading

leevlad commented Jun 30, 2021

Horizon doesn't enforce retention policy (HISTORY_RETENTION_COUNT environment variable) #3711

Horizon doesn't enforce retention policy (HISTORY_RETENTION_COUNT environment variable) #3711

Comments

crypto-billy commented Jun 22, 2021

What version are you using?

What did you do?

What did you expect to see?

What did you see instead?

Current workaround

bartekn commented Jun 22, 2021

crypto-billy commented Jun 23, 2021

bartekn commented Jun 23, 2021 • edited Loading

leevlad commented Jun 30, 2021

Horizon doesn't enforce retention policy (`HISTORY_RETENTION_COUNT` environment variable) #3711

Horizon doesn't enforce retention policy (`HISTORY_RETENTION_COUNT` environment variable) #3711

bartekn commented Jun 23, 2021 •

edited

Loading