Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Captive Core's temporary db directory disappears breaking execution #3145

Closed
2opremio opened this issue Oct 21, 2020 · 3 comments · Fixed by #3162
Closed

Captive Core's temporary db directory disappears breaking execution #3145

2opremio opened this issue Oct 21, 2020 · 3 comments · Fixed by #3162
Assignees

Comments

@2opremio
Copy link
Contributor

2opremio commented Oct 21, 2020

While working on #3144 I encountered the following error in captive core:

time="2020-10-21T17:22:50.833Z" level=error msg="Error in ingestion state machine" current_state="resume(latestSuccessfullyProcessedLedger=1)" error="error preparing range: opening subprocess: error running stellar-core: error waiting for `stellar-core new-db` subprocess: could not start `stellar-core [new-db]` cmd: chdir /tmp/captive-stellar-core-9f8cf1da5cf913b7: no such file or directory" next_state=start pid=195 service=ingest

which happens in a loop

@2opremio
Copy link
Contributor Author

Could it be that the directory is removed by the system due to being in /tmp ?

@2opremio
Copy link
Contributor Author

The full context is:

time="2020-10-22T14:17:34.762Z" level=error msg="Error in ingestion state machine" current_state="resume(latestSuccessfullyProcessedLedger=1)" error="Error running processors on ledger: Protocol version not supported: Error getting ledger: unexpected ledger (expected=2 actual=3)" next_state="resume(latestSuccessfullyProcessedLedger=1)" pid=199 service=ingest
time="2020-10-22T14:17:35.762Z" level=info msg="Ingestion system state machine transition" current_state="resume(latestSuccessfullyProcessedLedger=1)" next_state="resume(latestSuccessfullyProcessedLedger=1)" pid=199 service=ingest
time="2020-10-22T14:17:35.769Z" level=info msg="Released ingestion lock to prepare range" pid=199 service=ingest
time="2020-10-22T14:17:35.770Z" level=info msg="Preparing range" ledger=2 pid=199 service=ingest
time="2020-10-22T14:17:35.773Z" level=error msg="Error in ingestion state machine" current_state="resume(latestSuccessfullyProcessedLedger=1)" error="error preparing range: opening subprocess: error running stellar-core: error waiting for `stellar-core new-db` subprocess: could not start `stellar-core [new-db]` cmd: chdir /tmp/captive-stellar-core-c40ecca427b35c92: no such file or directory" next_state=start pid=199 service=ingest

@2opremio 2opremio self-assigned this Oct 22, 2020
@2opremio
Copy link
Contributor Author

What is happening is that the captive core backend doesn't properly reset it's tmp directory on error. Fix coming up.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
2 participants