Inspect io wrappers #5033

farnz · 2022-09-20T10:15:41Z

Motivation

When writing or reading data, it can be useful to inspect the bytes passing through, so that you can (e.g.) hash as you write, or verify hashes of read data concurrently with deserialization.

Wrapping AsyncRead and AsyncWrite successfully so that you can inspect the bytes being handled has a few edge cases to get right, so let's bundle this all up into one place.

Solution

A pair of wrappers that allow you to inspect either the bytes being read, or the bytes that have been written, plus test cases to show that they work correctly even when the underlying implementation does partial writes, or is only able to read in small chunks.

There are use cases like checking hashes of files that benefit from being able to inspect bytes read as they come in, while still letting the main code process the bytes as normal (e.g. deserializing into objects, knowing that if there's a hash failure, you'll discard the result). As this is non-trivial to get right (e.g. handling a `buf` that's not empty when passed to `poll_read`, add a wrapper `InspectReader` that gets this right, passing all newly read bytes to a supplied `FnMut` closure. Fixes: tokio-rs#4584

When writing things out, it's useful to be able to inspect the bytes that are being written and do things like hash them as they go past. This isn't trivial to get right, due to partial writes and efficiently handling vectored writes (if used). Provide an `InspectWriter` wrapper that gets this right, giving a supplied `FnMut` closure a chance to inspect the buffers that have been successfully written out. Fixes: tokio-rs#4584

Darksonn

These comments on where bounds also apply to the writer.

tokio-util/src/io/inspect.rs

Darksonn · 2022-09-27T21:59:22Z

Sorry about the delay in reviewing this. We had some trouble with our CI setup that took a while to fix.

tokio-util/tests/io_inspect.rs

Co-authored-by: Alice Ryhl <[email protected]>

Darksonn · 2022-09-28T13:09:57Z

tokio-util/src/io/inspect.rs

+        if let Poll::Ready(Ok(count)) = res {
+            (me.f)(&buf[..count]);
+        }


What if count is zero?

In current code, f gets passed an empty slice, and is expected to handle this - which is fine for hashing and teeing uses,

I would prefer to document this, but can put a guard in place to stop you getting an empty slice if you'd prefer that.

It seems to me that we should avoid empty slices.

Will do so for both writes and reads - the user should know about EOF on read, or ENOSPC on write from the "main" write or read process, and thus be able to handle that without needing the inspection callback to be called.

Passing an empty slice on EOF read seems ok, but otherwise I would not expect the closure to ever get passed an empty slice.

Darksonn · 2022-09-28T13:10:06Z

tokio-util/src/io/inspect.rs

+            for buf in bufs {
+                let size = count.min(buf.len());
+                (me.f)(&buf[..size]);
+                count -= size;


What if buf is empty?

Then f gets called with an empty slice, since count.min(buf.len()) should be zero, and &buf[..0] is an empty slice.

I only see two cases in which this can happen, and in both cases I think f should be able to handle it (reasoning as above):

The wrapped AsyncWrite genuinely returned a zero byte write. In this case, it is claiming to successfully write zero bytes, which is allowed in the contract (implying that the underlying object can't accept more data ever again).

The caller supplied one or more empty buffers to poll_write_vectored. In this case, you're being told that the supplied empty buffer was indeed written, which is accurate (if not particularly helpful).

I will add a documentation note to warn users that f must cope with empty buffers to both wrappers.

Avoiding empty slices instead.

Darksonn

LGTM.

Darksonn

Some doc suggestions.

Darksonn · 2022-09-28T14:17:46Z

tokio-util/src/io/inspect.rs

+    /// If no new data is supplied by a successful `poll_read`, then `f` will
+    /// be called with an empty slice.


Suggested change

/// If no new data is supplied by a successful `poll_read`, then `f` will

/// be called with an empty slice.

/// The closure is called with an empty slice only if the inner reader has reached EOF, or if `poll_read` is called with an empty buffer.

Darksonn · 2022-09-28T14:19:04Z

tokio-util/src/io/inspect.rs

+    /// `f` will never be called with an empty slice; a vectored write will
+    /// result in multiple calls to `f`, one for each buffer that was used by
+    /// the write.


I generally avoid sentences that start with code since we can't make f a capital letter.

Suggested change

/// `f` will never be called with an empty slice; a vectored write will

/// result in multiple calls to `f`, one for each buffer that was used by

/// the write.

/// The closure `f` will never be called with an empty slice. A vectored write can result in multiple calls to `f`.

farnz added 2 commits September 20, 2022 10:46

Darksonn added A-tokio-util Area: The tokio-util crate M-io Module: tokio/io labels Sep 20, 2022

farnz mentioned this pull request Sep 20, 2022

tee adaptor for AsnycRead #4584

Closed

Darksonn reviewed Sep 27, 2022

View reviewed changes

tokio-util/src/io/inspect.rs Outdated Show resolved Hide resolved

tokio-util/src/io/inspect.rs Outdated Show resolved Hide resolved

tokio-util/src/io/inspect.rs Outdated Show resolved Hide resolved

Darksonn and others added 4 commits September 27, 2022 23:59

Merge branch 'master' into inspect_io_wrappers

ed31c0d

Merge remote-tracking branch 'origin/master' into inspect_io_wrappers

5b1ae95

update to master, fix @Darksonn's review comments

cd68c48

make clippy happy

9d15d7a

Darksonn reviewed Sep 28, 2022

View reviewed changes

tokio-util/tests/io_inspect.rs Outdated Show resolved Hide resolved

farnz and others added 2 commits September 28, 2022 14:02

use drain instead of a byte-by-byte loop

ed89580

Co-authored-by: Alice Ryhl <[email protected]>

Merge branch 'master' into inspect_io_wrappers

6e389c7

Darksonn reviewed Sep 28, 2022

View reviewed changes

no empty slices on write

0739402

Darksonn approved these changes Sep 28, 2022

View reviewed changes

Darksonn reviewed Sep 28, 2022

View reviewed changes

Take @Darksonn's advice on writing clear docs

542ee97

Darksonn approved these changes Sep 28, 2022

View reviewed changes

Darksonn enabled auto-merge (squash) September 28, 2022 17:59

Darksonn merged commit 96fab05 into tokio-rs:master Sep 28, 2022

farnz deleted the inspect_io_wrappers branch September 28, 2022 19:20

dbischof90 pushed a commit to dbischof90/tokio that referenced this pull request Oct 1, 2022

io: wrappers for inspecting data on IO resources (tokio-rs#5033)

ef37cdb

huonw mentioned this pull request Jan 22, 2023

Release a new version of tokio-util #5388

Closed

Darksonn mentioned this pull request Feb 9, 2023

Prepare tokio-util v0.7.5 #5442

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inspect io wrappers #5033

Inspect io wrappers #5033

farnz commented Sep 20, 2022

Darksonn left a comment

Darksonn commented Sep 27, 2022

Darksonn Sep 28, 2022

farnz Sep 28, 2022

Darksonn Sep 28, 2022

farnz Sep 28, 2022

Darksonn Sep 28, 2022

Darksonn Sep 28, 2022

farnz Sep 28, 2022

farnz Sep 28, 2022

Darksonn left a comment

Darksonn left a comment

Darksonn Sep 28, 2022

Darksonn Sep 28, 2022

		/// If no new data is supplied by a successful `poll_read`, then `f` will
		/// be called with an empty slice.

	/// If no new data is supplied by a successful `poll_read`, then `f` will
	/// be called with an empty slice.
	/// The closure is called with an empty slice only if the inner reader has reached EOF, or if `poll_read` is called with an empty buffer.

Inspect io wrappers #5033

Inspect io wrappers #5033

Conversation

farnz commented Sep 20, 2022

Motivation

Solution

Darksonn left a comment

Choose a reason for hiding this comment

Darksonn commented Sep 27, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Darksonn left a comment

Choose a reason for hiding this comment

Darksonn left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment