Make sync, inbound, and block verifier check if a block hash is in any chain or any queue #862

teor2345 · 2020-08-10T04:43:12Z

Motivation

In the sync service (#853), inbound downloader, and block verifier, we are using the GetDepth request to check if a block is present in the state:
https://github.com/ZcashFoundation/zebra/pull/853/files#r467684617

But the sync, inbound, and block verifier actually need to know if a block hash is present in any chain. (This is a bug.)

They also don't need a depth, or any other extra data.

Scheduling

This risk is acceptable for the stable release, but we need to fix it before we support lightwalletd.

We should also fix this bug if Zebra continues to hang, after we fix known hang bugs.

Solution

Each section should be implemented in a separate PR.

A. Add a new "contains" block hash request:

add an additional state service request, which checks if a block is in:
- any chain
- the pending non-finalized state queue
- the pending finalized state queue
use it in the sync service to ignore block downloads (or check for genesis)
use it in the inbound downloader to ignore block downloads
make it a HashSet of block hashes, to reduce the number of state requests

B. We want to do these checks:

before downloading the block, and
after the download, but before verifying it (Move already in state check #2327)
- See PR Move already in state check #2327 for a draft of this change

For example, see this code:

zebra/zebra-state/src/service.rs

Lines 99 to 104 in ebe1c9f

    
           if self.mem.any_chain_contains(&prepared.hash) || self.disk.hash(prepared.height).is_some() 
        
           { 
        
               let (rsp_tx, rsp_rx) = oneshot::channel(); 
        
               let _ = rsp_tx.send(Err("block is already committed to the state".into())); 
        
               return rsp_rx; 
        
           }

D. Check if the blocks are already waiting in a checkpoint verifier queue:

change the request type for the checkpoint verifier into an enum
check if blocks are already waiting in the checkpoint verifier queue

E. De-duplicate the sync and inbound block downloaders:

Use an enum to distinguish sync and inbound requests
Don't apply the inbound limits to sync blocks
Use a buffer or channels to get responses back to the right service?

Alternatives

We could implement the new request as a wrapper function around a more specific query, which discards any extra data. But there aren't any requests that find blocks in any chain.

We might also want to return success on duplicate blocks, rather than an error. But sync restarts should be even rarer once we fix this bug.

To be more precise, there's not exactly a race condition, just the potential for TOCTOU issues where another part of the software requests information about what's in the state, acts on it, and the state is updated in the meantime. But this is the case for any state query, and I think that the solution is the direction we're already going (where all state updates are permissioned by the state itself, which can do synchronous checks). So I don't see the difference between returning the depth or not, in either case the information returned by the state can become stale.

It seems like the only problem is that you want to check whether a block hash is in any chain, while the current API checks whether a block hash is in the best chain.

teor2345 · 2020-08-13T20:02:50Z

Functionally, that's the only change - but I think a specific function for checking a hash is also useful for readability and modularity. (And potentially future optimisations.)

It's also worth noting the two different kinds of TOCTOU issues:

a block is not in the state, but is later downloaded or verified
a block is in the state, but is later pruned from the state as a side-chain (if we implement pruning)

yaahc · 2020-08-13T20:54:12Z

It seems like the only problem is that you want to check whether a block hash is in any chain

This seems like a reasonable default for block lookups.

while the current API checks whether a block hash is in the best chain.

😅 s/the best chain/the only chain/

mpguerra · 2021-07-07T08:21:06Z

Is this something that needs to be done as part of #2224 ?

teor2345 · 2022-02-25T00:16:57Z

This might be needed to fix syncing bugs.

teor2345 · 2022-03-01T04:25:16Z

This is a real bug, but it doesn't seem to cause that many problems in practice.

dconnolly · 2023-03-08T22:50:22Z

@arya2 I think you mentioned seeing this issue in the block verifier as well? Can you populate some of that context here? Gracias

mpguerra · 2023-03-09T11:10:34Z

As part of resolving this we should remember to update the related TODO in

zebra/zebrad/src/components/sync.rs

Line 961 in 5a88fe7

/// TODO BUG: check if the hash is in any chain (#862)

mpguerra · 2023-03-14T09:46:03Z

The issue highlighted by the audit was that this issue was closed while there was still a TODO comment referring to it.

So, by re-opening this issue, we have already "fixed" the issue highlighted by the audit (#6281).

The next question is: how important is the actual issue to fix right now?

teor2345 · 2023-03-24T05:08:53Z

We handled this issue by checking for blocks that are already in side-chains in PR #6335.

We closed PR #6397 because it didn't work, and we'd run out of time to fix it. The fix is optional because duplicate queued blocks are already handled in the syncer and inbound downloader, we just don't handle duplicates across both of them. Which is ok for now.

teor2345 added Poll::Ready A-rust Area: Updates to Rust code C-cleanup Category: This is a cleanup labels Aug 10, 2020

teor2345 added this to the Sync and validate zcash mainnet milestone Aug 10, 2020

teor2345 mentioned this issue Aug 10, 2020

Make sync ignore known hashes #853

Merged

teor2345 changed the title ~~Add a state service request that checks if a block hash is in any chain~~ Add a state service request to check if a block hash is in the state Aug 10, 2020

teor2345 changed the title ~~Add a state service request to check if a block hash is in the state~~ Add a request that checks if a block hash is in the state Aug 10, 2020

teor2345 mentioned this issue Aug 10, 2020

Stop checking for duplicate blocks in sync and verifiers #865

Closed

8 tasks

teor2345 changed the title ~~Add a request that checks if a block hash is in the state~~ Add a function that checks if a block hash is in the state Aug 12, 2020

hdevalence removed the Poll::Ready label Aug 17, 2020

teor2345 modified the milestones: Block Validation, Sync and Network Sep 29, 2020

mpguerra mentioned this issue Jan 5, 2021

Tracking: sync correctness #884

Closed

44 tasks

mpguerra removed this from the Sync and Network milestone Jan 5, 2021

teor2345 added C-bug Category: This is a bug P-Medium and removed C-cleanup Category: This is a cleanup E-easy labels Feb 9, 2021

teor2345 changed the title ~~Add a function that checks if a block hash is in the state~~ Make sync and inbound check if a block hash is in any chain Feb 9, 2021

This was referenced Feb 9, 2021

Handle duplicate block errors #1372

Closed

Document a state_contains bug #1715

Merged

teor2345 added the I-slow Problems with performance or responsiveness label Mar 4, 2021

teor2345 changed the title ~~Make sync and inbound check if a block hash is in any chain~~ Make sync and inbound check if a block hash is in any chain or any queue Mar 4, 2021

teor2345 mentioned this issue Jun 18, 2021

Move already in state check #2327

Closed

3 tasks

teor2345 mentioned this issue Aug 26, 2021

Add transaction downloader and verifier #2679

Merged

3 tasks

teor2345 mentioned this issue Dec 7, 2021

Security: Drop blocks that are a long way ahead of the tip #3167

Merged

3 tasks

teor2345 added P-Low and removed P-Medium labels Jan 6, 2022

teor2345 mentioned this issue Jan 6, 2022

Fix frequent Zebra hangs during syncing #3322

Closed

13 tasks

teor2345 mentioned this issue Feb 25, 2022

Fix slowness when syncing near the tip #3375

Closed

teor2345 added the S-incomplete label Mar 1, 2022

teor2345 closed this as completed Mar 1, 2022

teor2345 closed this as not planned Won't fix, can't repro, duplicate, stale Sep 6, 2022

dconnolly reopened this Mar 8, 2023

dconnolly assigned arya2 Mar 8, 2023

This was referenced Mar 9, 2023

Epic: Improvements from Zebra Audit #6277

Closed

Tracking: TODOs with closed tasks #6281

Closed

mpguerra added the C-audit Category: Issues arising from audit findings label Mar 9, 2023

arya2 changed the title ~~Make sync and inbound check if a block hash is in any chain or any queue~~ Make sync, inbound, and block verifier check if a block hash is in any chain or any queue Mar 11, 2023

mpguerra added P-Medium ⚡ and removed P-Low ❄️ labels Mar 13, 2023

arya2 mentioned this issue Mar 15, 2023

change(state): Stop re-downloading blocks that are in non-finalized side chains #6335

Merged

6 tasks

mpguerra removed the C-audit Category: Issues arising from audit findings label Mar 16, 2023

mpguerra mentioned this issue Mar 23, 2023

can_fork_chain_at() should ignore blocks below the finalized tip #6388

Closed

mergify bot closed this as completed in #6335 Mar 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make sync, inbound, and block verifier check if a block hash is in any chain or any queue #862

Make sync, inbound, and block verifier check if a block hash is in any chain or any queue #862

teor2345 commented Aug 10, 2020 •

edited by arya2

Loading

teor2345 commented Aug 10, 2020

hdevalence commented Aug 10, 2020

teor2345 commented Aug 10, 2020

teor2345 commented Aug 12, 2020 •

edited

Loading

hdevalence commented Aug 13, 2020

teor2345 commented Aug 13, 2020

yaahc commented Aug 13, 2020

mpguerra commented Jul 7, 2021

teor2345 commented Feb 25, 2022

teor2345 commented Mar 1, 2022

dconnolly commented Mar 8, 2023 •

edited

Loading

mpguerra commented Mar 9, 2023

mpguerra commented Mar 14, 2023 •

edited

Loading

teor2345 commented Mar 24, 2023 •

edited

Loading

Make sync, inbound, and block verifier check if a block hash is in any chain or any queue #862

Make sync, inbound, and block verifier check if a block hash is in any chain or any queue #862

Comments

teor2345 commented Aug 10, 2020 • edited by arya2 Loading

Motivation

Scheduling

Solution

Alternatives

Related

teor2345 commented Aug 10, 2020

hdevalence commented Aug 10, 2020

teor2345 commented Aug 10, 2020

teor2345 commented Aug 12, 2020 • edited Loading

hdevalence commented Aug 13, 2020

teor2345 commented Aug 13, 2020

yaahc commented Aug 13, 2020

mpguerra commented Jul 7, 2021

teor2345 commented Feb 25, 2022

teor2345 commented Mar 1, 2022

dconnolly commented Mar 8, 2023 • edited Loading

mpguerra commented Mar 9, 2023

mpguerra commented Mar 14, 2023 • edited Loading

teor2345 commented Mar 24, 2023 • edited Loading

teor2345 commented Aug 10, 2020 •

edited by arya2

Loading

teor2345 commented Aug 12, 2020 •

edited

Loading

dconnolly commented Mar 8, 2023 •

edited

Loading

mpguerra commented Mar 14, 2023 •

edited

Loading

teor2345 commented Mar 24, 2023 •

edited

Loading