RFC|BB|L2-03A: Use of compression and adjustable block size

Abstract

This RFC proposes the exploration of using compression in block transmission. The techniques range from:

  • Block-by-block standard compression (e.g. gzip).
  • Whole-transfer compression (e.g. when responding to a Graphsync query, send all the blocks compressed).
  • Custom coding tables for byte sequences that appear often (e.g. generate a Huffman table for all the protobuf headings so that these are compressed by default, like HPACK does for HTTP).

Additionally, to optimize the use of these schemes, a system of adjustable block sizes and coding strategies in transmission could be devised (e.g. dynamic Huffman tables).

Shortcomings

Blocks in IPFS are exchanged without the use of compression. This is a huge missed opportunity to minimize the bandwidth footprint and latency of transferring a file. For context, even minimal web assets are transmitted compressed over HTTP to improve website loading performance, and most of them are below 256KiB, which is IPFS's default block size. We expect to see significant gains in transmission times.

Description

Current implementations of file-sharing protocols may benefit from the use of on-the-fly compression to optimize the use of bandwidth and the transmission of content. Moreover, when using the Graphsync approach to content discovery, where we ask peers how fully they can satisfy an IPLD selector, we can request that all the blocks matching the selector be compressed into a single package and forwarded to the requestor.

Some of the compression approaches to be explored in this RFC are:

  • Block-by-block standard compression (e.g. gzip): every block (and optionally every single Bitswap message) is compressed, drawing inspiration from web compression.
  • Whole-transfer compression: all the blocks requested by a peer in a wantlist or a Graphsync IPLD selector are compressed into the same package.
  • Custom coding tables for byte sequences that appear often (e.g. generate a Huffman table for all the protobuf headings so that these are compressed by default, like HPACK does for HTTP).
  • Use of "compressed caches", so that when a specific piece of content has been identified as "regularly exchanged", it can be retrieved from the cache instead of having to be compressed again. This scheme may not be trivial.
  • Use of different compression algorithms.
  • Use of different block sizes before compression.

Implementation plan

  • Perform a simple test to evaluate the benefits of "on-the-fly" compression on blocks (to determine whether IPFS could benefit from directly exchanging compressed messages and blocks). Evaluate different compression algorithms used on the web.

    • Evaluate the compression of full Bitswap messages (bs.compressionStrategy = "full"): to achieve this, we add a compression flag to Bitswap messages so that compressed messages can be identified. If compression is enabled, the message is compressed before it is sent. Compressed messages are identified in newMessageFromProto by receiving peers, where they are decompressed and then processed seamlessly by Bitswap. To open the door to the use of different compression algorithms and different full-message compression strategies, a compressionType field has been added to message.proto.

    • Evaluate the compression of blocks only (bs.compressionStrategy = "blocks"): we compress each block before adding it to the protobuf message in the ToProtoV1 function of message.go, and decompress it in newMessageFromProto. When compressing blocks, only the RawData is changed for transmission; the CID is kept unchanged so the block remains conveniently identified.

    • Use GZip as the default compression algorithm (engine.compressor = "Gzip").

    • Instead of compressing fields of the protobuf message, evaluate the compression of the full stream in the bitswap network.

      • We may choose to use a multicodec to signal that a stream is compressed. Furthermore, instead of prefixing messages with their size, we could leverage streams by combining the multicodec with KeepReading and EndOfStream signals in protocol streams, so there is no need to know the size of the compressed message beforehand.
    • Evaluate other compression algorithms (Brotli and gzip are the strongest candidates, but other algorithms may also be worth testing).

    • Compare the computational footprint of the above implementations.

  • Design and evaluate a simple scheme to "gather" WANT requests and create fully compiled/network-coded responses for the requestor. This involves several tasks:

    • A way of matching several blocks with a request (graphsync, wantlist).

    • Perform the compression operation. This may be computationally expensive and take some time; depending on the data, by the time the prepared response is ready to send, the requestor may already have received all the blocks that make up the desired content.

  • Evaluate how the use of different block sizes and compression may benefit performance.

  • Include a "compression parameter" in exchange requests to signal peers that you want blocks to be transmitted using a specific compression technique.

  • From the preliminary tests performed after the above implementations, evaluate if the protocol could benefit from the use of Huffman tables and compressed caches.

Implementation details

  • Block compression: files within Bitswap are exchanged in the form of blocks. Files are composed of several blocks organized in a DAG structure (with each block having a size limit of 256KiB). In this compression approach, we compress blocks before including them in a message and transmitting them to the network.

  • Full message compression: in this compression strategy, instead of only compressing blocks, we compress every single message before sending it. It is the equivalent of compressing header+body in HTTP.

  • Stream compression: It uses compression at a stream level, so every byte that enters a stream from the node to other peers is compressed (i.e. using a compressed writer).

  • To drive the compression idea even further, we prototyped a compression transport in libp2p (between the muxer and the security layer) so that every stream running over a libp2p node can potentially benefit from the use of compression. This is a non-breaking change, as the transport-upgrader has also been updated to enable compression negotiation (so eventually anyone can bring their own compression algorithm and embed it into libp2p seamlessly). Some repositories are available to get started with compression in libp2p.

See a discussion on the results here.

Impact

A reduction in latency and bandwidth thanks to compressed transmissions, at the cost of a potential increase in computational overhead.

Evaluation Plan

  • The IPFS File Transfer benchmarks.

  • See the computational footprint of different compression strategies and algorithms.

  • Compare the data sent and received using compression and with baseline Bitswap.

Prior Work

This RFC takes inspiration from:

Results

The results for the implementation of this RFC were reported here: https://research.protocol.ai/blog/2020/honey-i-shrunk-our-libp2p-streams/

Future Work

  • If the use of exchange requests and a negotiation phase for content transmission (RFC | BB | L1/2-01) is implemented, then once a specific peer (or a group of peers) has been identified as storing a large share of the desired blocks, it makes sense to request more advanced compression and network coding techniques for the transmission.

  • Detect the type of data being exchanged in blocks and apply the most suitable compression for the data type, such as image-specific compression if images are being exchanged (for this approach, a node will need to have all the blocks for the data).
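Such data-type detection could be sketched with Go's standard content sniffing, choosing whether compression is worthwhile. The strategy mapping below is a hypothetical example; already-compressed media formats gain little from a second general-purpose pass.

```go
package main

import (
	"fmt"
	"net/http"
)

// chooseCompression sniffs the content type of assembled block data and
// picks a strategy: pre-compressed media is sent as-is, while everything
// else is gzipped. The mapping is illustrative, not an exhaustive policy.
func chooseCompression(data []byte) string {
	ctype := http.DetectContentType(data)
	switch ctype {
	case "image/jpeg", "image/png", "video/mp4":
		return "none" // these formats are typically already compressed
	default:
		return "gzip"
	}
}

func main() {
	// Magic bytes of a JPEG file header.
	jpegHeader := []byte{0xFF, 0xD8, 0xFF, 0xE0, 0x00, 0x10, 'J', 'F', 'I', 'F'}
	fmt.Println(chooseCompression(jpegHeader))           // → none
	fmt.Println(chooseCompression([]byte("plain text"))) // → gzip
}
```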