Safe Preallocation for (most) `Vec<T>` #1880

preston-evans98 · 2021-03-10T23:52:57Z

Overview - Safe Preallocation

Currently, Zebra doesn't preallocate Vectors when deserializing, since blind preallocations present a DOS vector. Instead, it relies on maximum block and transaction size to limit allocations. This is inefficient.

We can mitigate the DOS potential and allow for preallocation as follows:

Define a new trait SafeAllocate, defining the maximum length a Vec<T: SafeAllocate> can sensibly reach for each implementing type
Create a specialized ZcashDeserialize implementation for SafeAllocaters.

pub trait SafeAllocate {
    const fn max_allocation() -> usize;
}

impl<T: ZcashDeserialize + SafeAllocate> ZcashDeserialize for Vec<T> {
    fn zcash_deserialize<R: io::Read>(mut reader: R) -> Result<Self, SerializationError> {
        let len = reader.read_compactsize()?;
        if len > T::max_allocation() {
            return Err(SerializationError::Parse(("Vector longer than max_allocation")));
        }
        let mut vec = Vec::with_capacity(len);
        for _ in 0..len {
            vec.push(T::zcash_deserialize(&mut reader)?);
        }
        Ok(vec)
    }
}

This allows us to pre-allocate for certain types without presenting a DOS vector, while retaining the flexibilty to use the current (lazy) allocation strategy for types that defy safe blind allocation.

Note that deserialization using this method is guaranteed to fail during the deserialization of the first malicious vector. However, Block messages contain nested Vec<T> types, and it is possible for a Byzantine message to cause improper allocations at each level of nesting before being discovered. This is why the potential spike for a Block message is much higher than for other message types.

Potential SafeAllocation Implementors

Arc<Transaction> - Suggested allocation limit: 49_000. A Zcash block is capped at 2MB. Since transparent Txs are the smallest allowable transactions, and each transparent Tx requires at least one Input with a minimum serialized size of 41 bytes, an upper bound the length of a Vec<Arc<Transaction>> is 2MB / 41 Bytes < 49_000. Note that std::mem::size_of::<Arc<T>>() is 8, so the maximum memory wasted by a Byzantine Vec<Arc<Transaction>> is 49_000 * 8 Bytes < 400KB.
transparent::Input - Suggested allocation limit: 49_000. A Zcash block is capped at 2MB. Each Input has a minimum serialized size of 41 bytes, so an upper bound the length of a Vec<transparent::Input> is 2MB / 41 Bytes < 49_000. With Inputs taking less than 80 bytes on the stack (36 for the Outpoint, 26 for the Script, and 4 for the sequence), the maximum memory wasted by a Byzantine Vec<transparent::Input> is 49_000 * 80 Bytes < 4MB.
transparent::Output - Suggested allocation limit: 225_000. A Zcash block is capped at 2MB. Outputs have a minimum serialized size of 9 bytes, so an upper bound the length of a Vec<Arc<transparent::Output>> is 2MB / 9 Bytes < 225_000. With Outputs taking less than 40 bytes on the stack (call it 10 for the value, 26 for the Script, for a total of 36 ), the maximum memory wasted by a Byzantine Vec<Arc<transparent::Output>> is 225_000 * 40 Bytes = 9MB.
MetaAddr - Suggested allocation limit: 1000. No fancy math required, since this limit is in the Addr message specification. Estimate a MetaAddr at a generous 100 bytes of stack space and you get max memory waste = 1000 * 100B = 100KB.
block::Hash - Suggested allocation limit: 65536. We derive this limit as MAX_MESSAGE_SIZE / 32 = 2 * 1024 _ 1024 / 32. Since a block hash takes 32 bytes on the stack, the max waste here is MAX_MESSAGE_SIZE.
InventoryHash - Suggested allocation limit: 50,000. This limit is the listed in the Inv message spec. Maximum waste: 50,000 * 32 B = 1.6 MB.
block::CountedHeader - Suggested allocation limit: 2000. This limit is in the Headers message spec. Per the most recent spec, each Zcash header is less than 2kb. Maximum waste: 2000 * 2000 Bytes < 4MB
u8 - Suggested allocation limit: MAX_MESSAGE_SIZE. Since a u8 takes 1 byte on the stack, the maximum waste here is MAX_MESSAGE_SIZE = 2MB.

Example attacks

Using all of these numbers, we calculate the total improper allocation caused by the worst-case instance of each Zcash Message as follows:

Version: N.A.
Verack: N.A.
Ping: N.A.
Pong: N.A.
Reject: N.A.
GetAddr: N.A.
Addr: contains 1 Vec<MetaAddr> => 100KB (see above)
Get locks: contains 1 Vec<block::Hash> => MAX_MESSAGE_SIZE (2MB)
Inv: contains 1 Vec<InventoryHash> => 1.6 MB (see above)
GetHeaders: contains 1 Vec<block::Hash> => MAX_MESSAGE_SIZE (2MB)
Headers: contains 1 Vec<block::CountedHeader> => 4MB
GetData: contains 1 Vec<InventoryHash> => 1.6 MB (see above)
Block: The worst case block contains 1 Vec<Arc<Transaction>, 1 Vec<transparent::Output>, and 1 malicious Script(Vec<u8>) => 400KB + 9MB + 2MB = 11.4 MB.
- Note that a dishonest vector is discovered during its own deserialization, so a malicious Vec<transparent::Input> would be discovered before the malicious Vec<transparent::Output> was allocated for. Since Outputs can waste more memory than Inputs, a smart attacker will choose to make only his Vec<transparent::Output> malicious.
Tx: N.A.
NotFound: contains 1 Vec<InventoryHash> => 1.6 MB (see above)
Mempool: N.A.
FilterLoad: contains 1 Filter(Vec<u8>) => 2MB
FilterAdd: contains 1 Vec<u8> => 2MB
FilterClear: N.A.

Alternatives

Do nothing. This is a fine option, since Zebra's networking stack is already fast!

Summary and Recommendations

With the SafeAllocate trait, we can allow preallocation for many Vector types without risking DOS attacks.

In the worst case, a malicious message can cause a short-lived spike in memory usage. The size of this spike depends on the max_allocation defined in SafeAllocate and the depth of nested Vec<T: SafeAllocate> types. Calculations of the maximum spike caused by each message type are included above. Based on these calculations, I recommend implementing SafeAllocate for all types listed in the "Potential SafeAllocate Implementors" section, with the possible exception of transparent::Input and transparent::Output.

If this recommendation is adopted , the worst case memory spike that a malicious peer can cause will be roughly 4MB, or roughly double that peer connection's usual memory consumption.

If transparent::Inputs and transparent::Outputs are included, the worst case spike rises to 11.5MB, or about six times a peer connection's usual memory consumption.

The text was updated successfully, but these errors were encountered:

teor2345 · 2021-03-11T02:47:48Z

Hi @preston-evans98, thanks for this suggestion, and working through these calculations.

In general, we've been working on the assumption that parsed data structures are slightly larger than their serialized equivalents. But I don't think we realised that malicious blocks could take up 6 times as much memory when deserialized.

Here's some further analysis of this issue:

Rust `Vec` over-allocation

Currently, Vec allocations can be up to 2x larger than their current length. That yields an overall 12x memory usage, or 24MB per 2MB malicious block.

https://github.com/rust-lang/rust/blob/ff6ee2a70218543f410e557f390e246131847572/library/alloc/src/raw_vec.rs#L415

Although:

Vec does not guarantee any particular growth strategy when reallocating when full

https://doc.rust-lang.org/std/vec/struct.Vec.html#guarantees

Preallocation would limit the maximum memory usage to 12MB per block.

Validation Pipeline Memory Usage

One of the reasons that Zebra is fast is that it processes many blocks in parallel. So memory usage issues can be quite serious. Zebra achieves this parallelism using a block validation pipeline: it deserialises blocks, then passes them through a number of validation stages. The underlying Arc<Block> data isn't dropped until after the block fails validation.

Inbound Block Gossip Service

The limit on this service is 600 blocks, which is far too high. We'll reduce that to 10 in #1881.

Sync Service

In general, a malicious node can place 500/50 = 10 malicious blocks in the sync pipeline due to request fanout. See #1881 for details.

Checkpoint Validation

The checkpoint verifier stores a queue of blocks, while awaiting the blocks at previous heights. To increase the cost of filling this queue, we check block difficulty in #1882.

Full Block Validation

It's hard to exploit full block validation, because it checks block difficulty before doing any other checks.

teor2345 · 2021-03-11T02:50:20Z

Overall, I think this is a good change, but we should think about our test strategy.

If any of these new limits are too low, they might end up rejecting large blocks that other nodes accept. (Rejecting other large messages isn't as much of an issue, because most large messages aren't consensus-critical.)

So let's think about a test strategy that:

constructs the smallest possible value of each type
runs a proptest to confirm it's the smallest
make sure the limit is correct for the smallest possible value

We can't really run proptests to find the largest possible blocks, because large proptests are very slow.

* Implement SafePreallocate. Resolves #1880 * Add proptests for SafePreallocate * Apply suggestions from code review Comments which did not include replacement code will be addressed in a follow-up commit. Co-authored-by: teor <[email protected]> * Rename [Safe-> Trusted]Allocate. Add doc and tests Add tests to show that the largest allowed vec under TrustedPreallocate is small enough to fit in a Zcash block/message (depending on type). Add doc comments to all TrustedPreallocate test cases. Tighten bounds on max_trusted_alloc for some types. Note - this commit does NOT include TrustedPreallocate impls for JoinSplitData, String, and Script. These impls will be added in a follow up commit * Implement SafePreallocate. Resolves #1880 * Add proptests for SafePreallocate * Apply suggestions from code review Comments which did not include replacement code will be addressed in a follow-up commit. Co-authored-by: teor <[email protected]> * Rename [Safe-> Trusted]Allocate. Add doc and tests Add tests to show that the largest allowed vec under TrustedPreallocate is small enough to fit in a Zcash block/message (depending on type). Add doc comments to all TrustedPreallocate test cases. Tighten bounds on max_trusted_alloc for some types. Note - this commit does NOT include TrustedPreallocate impls for JoinSplitData, String, and Script. These impls will be added in a follow up commit * Impl TrustedPreallocate for Joinsplit * Impl ZcashDeserialize for Vec<u8> * Arbitrary, TrustedPreallocate, Serialize, and tests for Spend<SharedAnchor> Co-authored-by: teor <[email protected]>

preston-evans98 added C-enhancement Category: This is an improvement S-needs-triage Status: A bug report needs triage labels Mar 10, 2021

teor2345 added A-consensus Area: Consensus rule updates A-rust Area: Updates to Rust code I-heavy Problems with excessive memory, disk, or CPU usage I-slow Problems with performance or responsiveness P-Low C-security Category: Security issues and removed P-Low labels Mar 11, 2021

This was referenced Mar 11, 2021

Reduce inbound concurrency to limit memory usage #1881

Merged

Verify proof-of-work in the checkpoint verifier #1882

Merged

mpguerra removed the S-needs-triage Status: A bug report needs triage label Mar 15, 2021

teor2345 mentioned this issue Mar 17, 2021

Check Transaction cardinality at parse time, to limit memory usage #1917

Closed

2 tasks

preston-evans98 added a commit to preston-evans98/zebra that referenced this issue Mar 18, 2021

Implement SafePreallocate. Resolves ZcashFoundation#1880

fac8c2e

preston-evans98 mentioned this issue Mar 18, 2021

Implement Trusted Vector Preallocation #1920

Merged

2 tasks

preston-evans98 added a commit to preston-evans98/zebra that referenced this issue Mar 25, 2021

Implement SafePreallocate. Resolves ZcashFoundation#1880

b8fd760

teor2345 closed this as completed in #1920 Apr 5, 2021

arya2 mentioned this issue Mar 31, 2023

feat(script): Add binary for finding references to closed issues #6347

Merged

6 tasks

mpguerra mentioned this issue Apr 11, 2023

Tracking: TODOs with closed tasks #6281

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Safe Preallocation for (most) `Vec<T>` #1880

Safe Preallocation for (most) `Vec<T>` #1880

preston-evans98 commented Mar 10, 2021 •

edited by teor2345

Loading

teor2345 commented Mar 11, 2021

teor2345 commented Mar 11, 2021

Safe Preallocation for (most) Vec<T> #1880

Safe Preallocation for (most) Vec<T> #1880

Comments

preston-evans98 commented Mar 10, 2021 • edited by teor2345 Loading

Overview - Safe Preallocation

Potential SafeAllocation Implementors

Example attacks

Alternatives

Summary and Recommendations

teor2345 commented Mar 11, 2021

Rust Vec over-allocation

Validation Pipeline Memory Usage

Inbound Block Gossip Service

Sync Service

Checkpoint Validation

Full Block Validation

teor2345 commented Mar 11, 2021

Safe Preallocation for (most) `Vec<T>` #1880

Safe Preallocation for (most) `Vec<T>` #1880

preston-evans98 commented Mar 10, 2021 •

edited by teor2345

Loading

Rust `Vec` over-allocation