Check if storage is working when returning the response to a faucet healthcheck request. #1189

philippecamacho · 2022-06-29T23:41:29Z

Closes #1186.

I am not sure if the storage requests are neutral.
Also, I get an error when I run the tests locally, similar to the one described in #1179.

CLAassistant · 2022-06-29T23:41:35Z

All committers have signed the CLA.

sveitser · 2022-06-30T10:55:11Z

I think this should do the job.

If only the remove step fails (unlikely), subsequent inserts would no longer write to disk: if the key is already in the index, it is not inserted again until it has received its grant and been removed. This could be avoided by using a random key instead.

    /// Add an element to the persistent index.
    ///
    /// Returns `true` if the element was inserted or `false` if it was already in the index.
    fn insert(&mut self, key: UserPubKey) -> Result<bool, FaucetError> {
        if self.index.contains_key(&key) {
            // If the key is already in the index, we don't have to persist anything.
            return Ok(false);
        }
...
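The effect of the early return can be illustrated with a toy model. This is only a sketch: `ToyIndex`, its `disk_writes` counter, and the `fail` flag are hypothetical stand-ins, not the faucet's real types, and the "disk" is simulated by a counter.

```rust
use std::collections::HashSet;

/// Toy stand-in for the faucet's persistent index (hypothetical, not the real type).
struct ToyIndex {
    index: HashSet<String>,
    /// Counts simulated disk writes.
    disk_writes: usize,
}

impl ToyIndex {
    fn new() -> Self {
        Self { index: HashSet::new(), disk_writes: 0 }
    }

    /// Mirrors the quoted `insert`: returns `false` without persisting if the key is present.
    fn insert(&mut self, key: &str) -> bool {
        if self.index.contains(key) {
            return false;
        }
        self.disk_writes += 1; // simulated write to disk
        self.index.insert(key.to_string());
        true
    }

    /// Removes the key; `fail` simulates a storage error that leaves the key in place.
    fn remove(&mut self, key: &str, fail: bool) -> Result<(), String> {
        if fail {
            return Err("simulated remove failure".to_string());
        }
        self.disk_writes += 1; // simulated write to disk
        self.index.remove(key);
        Ok(())
    }
}

fn main() {
    let mut idx = ToyIndex::new();
    // First healthcheck: insert succeeds, remove fails, so the key stays in the index.
    assert!(idx.insert("healthcheck-key"));
    assert!(idx.remove("healthcheck-key", true).is_err());

    // Second healthcheck with the same fixed key: nothing reaches the simulated disk.
    let writes_before = idx.disk_writes;
    assert!(!idx.insert("healthcheck-key"));
    assert_eq!(idx.disk_writes, writes_before);

    // A fresh key per healthcheck would still exercise the write path.
    assert!(idx.insert("healthcheck-key-2"));
    assert_eq!(idx.disk_writes, writes_before + 1);
}
```

In this model, a fixed key that is left behind by a failed remove turns later healthchecks into no-ops, while a fresh key on each call keeps touching storage.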

Another concern is that the faucet could become unresponsive if the /healthcheck endpoint is hit often, because of the lock. This could be avoided by checking whether we can write to disk in some other way (for example, by creating a random file). The advantage of the current implementation, however, is that it exercises much more of the machinery we use in the faucet.

Overall, I think it may reduce problems or alert us earlier, so I'd be happy to try it like this.

I'd be curious to get @jbearer's opinion though.
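For comparison, the lock-free alternative mentioned above (probing the disk by creating a throwaway file) might look roughly like the following. This is a minimal sketch using only the standard library: the function name `check_disk_writable`, the file-name scheme, and the use of the system temp directory in `main` are assumptions for illustration, not part of this PR.

```rust
use std::fs::{self, OpenOptions};
use std::io::{self, Write};
use std::path::Path;
use std::process;
use std::time::{SystemTime, UNIX_EPOCH};

/// Try to create, write, sync and delete a uniquely named probe file in `dir`.
fn check_disk_writable(dir: &Path) -> io::Result<()> {
    // Unique-ish name built from the pid and the current time (assumption:
    // a real implementation might prefer a random suffix).
    let nanos = SystemTime::now()
        .duration_since(UNIX_EPOCH)
        .map(|d| d.as_nanos())
        .unwrap_or(0);
    let path = dir.join(format!(".healthcheck-{}-{}", process::id(), nanos));

    // `create_new` fails if the file already exists, so existing data is never clobbered.
    let mut file = OpenOptions::new().write(true).create_new(true).open(&path)?;
    let written = file.write_all(b"ok").and_then(|_| file.sync_all());
    drop(file);

    // Always attempt clean-up, then report the first error encountered.
    let removed = fs::remove_file(&path);
    written.and(removed)
}

fn main() {
    match check_disk_writable(&std::env::temp_dir()) {
        Ok(()) => println!("storage is writable"),
        Err(e) => eprintln!("storage check failed: {}", e),
    }
}
```

Such a probe avoids taking the faucet's lock, but it only shows that the directory is writable; it does not exercise the index and queue code paths the way the approach in this PR does.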

jbearer · 2022-06-30T15:38:44Z

I don't think lock contention is a big problem. The time this healthcheck endpoint spends with the lock is nothing compared to how long it takes to build a transaction. Plus the healthcheck is only called every few seconds, I believe.

If only the remove step fails, then we will actually end up sending a grant to the default address, which wastes a small amount of funds and time, but it's not too bad given that this should be extremely rare. And when we send the grant to the default address, we will remove it from the queue, so subsequent healthchecks will start doing the correct thing again.

Only suggestion is I think we should log something at ERROR level if either of these writes fails, so that if the healthcheck fails we will know why, and especially if only one of the writes fails, we will know it happened.
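A rough sketch of that suggestion follows. The `ProbeStore` trait, `FlakyStore`, and the `log_error` callback are hypothetical placeholders; the real change would presumably call the project's own storage methods and an ERROR-level logging macro instead of a closure.

```rust
/// Hypothetical storage interface; the real faucet types are not shown in this thread.
trait ProbeStore {
    fn insert_probe(&mut self) -> Result<(), String>;
    fn remove_probe(&mut self) -> Result<(), String>;
}

/// Run both writes and report each failure separately through `log_error`,
/// so the logs show which of the two steps broke.
fn storage_healthcheck<S: ProbeStore>(
    store: &mut S,
    mut log_error: impl FnMut(&str),
) -> Result<(), String> {
    if let Err(e) = store.insert_probe() {
        log_error(&format!("healthcheck: insert failed: {}", e));
        return Err(e);
    }
    if let Err(e) = store.remove_probe() {
        log_error(&format!("healthcheck: insert succeeded but remove failed: {}", e));
        return Err(e);
    }
    Ok(())
}

/// Test double whose writes can be made to fail on demand.
struct FlakyStore {
    fail_insert: bool,
    fail_remove: bool,
}

impl ProbeStore for FlakyStore {
    fn insert_probe(&mut self) -> Result<(), String> {
        if self.fail_insert { Err("disk full".to_string()) } else { Ok(()) }
    }
    fn remove_probe(&mut self) -> Result<(), String> {
        if self.fail_remove { Err("disk full".to_string()) } else { Ok(()) }
    }
}

fn main() {
    // Simulate the rare case where only the remove step fails.
    let mut store = FlakyStore { fail_insert: false, fail_remove: true };
    let result = storage_healthcheck(&mut store, |msg| eprintln!("ERROR {}", msg));
    assert!(result.is_err());
}
```

Distinct messages for the two steps make the "only one write failed" case visible in the logs, which is the situation called out above.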

philippecamacho · 2022-06-30T16:39:58Z

> Only suggestion is I think we should log something at ERROR level if either of these writes fails, so that if the healthcheck fails we will know why, and especially if only one of the writes fails, we will know it happened.

Right, done in 5b62743.

sveitser

LGTM

philippecamacho requested review from sveitser and jbearer June 29, 2022 23:41

philippecamacho added 2 commits June 30, 2022 11:47

Check if storage is working when returning the response to a faucet healthcheck request.

261f6f8

Log errors in Faucet healthcheck.

5b62743

philippecamacho force-pushed the feat/t1186-try-storage-healthcheck-faucet branch from 83f952f to 5b62743 Compare June 30, 2022 16:37

sveitser approved these changes Jun 30, 2022

jbearer approved these changes Jun 30, 2022

philippecamacho merged commit 8971a97 into main Jun 30, 2022

philippecamacho deleted the feat/t1186-try-storage-healthcheck-faucet branch June 30, 2022 20:21

sveitser mentioned this pull request Jul 14, 2022

EQS: access disk during healthcheck #1197

Closed
