Add a method to trio socket objects that checks whether the OS thinks the socket is readable #760

njsmith · 2018-10-31T09:46:46Z

Say you're writing an HTTP/1.1 client library, that supports connection pooling. When you take an idle connection out of your pool, it might be fine to use... OR, the server might have gotten tired of waiting for that connection to do something while it was sitting idle in the pool, and have already closed it. Because of infelicities in HTTP/1.1's design, there is no 100% reliable way to detect this, because there's a race condition: you might send a new request down the connection just as the server is closing it, and then you have no idea what happened and whether the connection dropped before or after the server started processing your request.

So, we would like to minimize how often this happens. But, because there's actually no way to detect it at the protocol level, there really is no good way to do this using regular high-level APIs. The best approach is: when you take an idle connection out of the pool, before you start using it again, do a quick check whether the underlying socket is readable. You don't want to actually read from it; there's nothing to learn by doing so. If an HTTP/1.1 server sent anything on an idle connection, it's either a 408 Request Timeout or just a preemptory close. You especially don't want to block waiting to receive data, since that would defeat the whole purpose – we just want to check whether you could receive data. And you want to do this directly on the raw socket, ignoring things like any TLS encryption layer, because TLS does not really provide any way to do a pure check for whether unencrypted bytes are available – to figure that out you first have to read the encrypted bytes and then check if they make a complete frame, and we don't want to read any bytes.

So... for this specific, weird, and incredibly important use case, you don't want to use Trio's high-level Stream layer, you want to peek through all the abstraction layers, ignore Trio's socket I/O API entirely, and call select or poll directly on the underlying socket just to ask the OS whether it is readable.

So far, so good; Trio lets you do all that using supported, public APIs.

But! Suppose that for testing, you want to use a fake virtual networking layer. That's great: we have first-class support for that in Trio's socket I/O API (#170). Except... asks or urllib3 or whatever, for this one operation, currently has to do an end-run around Trio's socket I/O API – it has to directly call the OS's select or poll operation, which means that it needs a real OS socket, not a fake virtual socket. And that means that currently, our network faking API cannot handle HTTP clients, which is kind of an important use case.

Solution: we should add a method on Trio socket objects that does this readability check. It's pretty straightforward, though there are a few complications. (Mostly, you need to use select on windows and poll everywhere else.) And the whole point of this is that libraries like asks or urllib3 will call this method instead of doing their own call to select/poll. And it will do exactly the same thing they would have done by hand. Except..... since it's a method on the socket object, our fake virtual sockets will be able to override it to do a fake virtual select/poll. And that will solve the problem.

The text was updated successfully, but these errors were encountered:

njsmith · 2018-11-19T11:26:29Z

@wgwz You mentioned in gitter that you had some questions about this?

njsmith · 2019-06-12T02:58:36Z

This is slightly larger than we usually use for "good first issues", but if you're ambitious then it could work.

Basically what's needed is:

a new method is_readable on trio._socket._SocketType
- on Windows, it does rready, _, _ = select.select([self._sock], [], [], 0); return bool(rready)
- everywhere else, it does p = select.poll(); p.register(self._sock, select.POLLIN); return bool(p.poll(0))
- don't actually write the code as a one-liner, use normal formatting :-)
a new test in trio/tests/test_socket.py
add it to the docs
add a newfragment

ziirish · 2019-06-12T07:38:46Z

I'll have a look at this

add a new is_readable method to SocketType (fix #760)

ghost · 2021-12-16T01:40:10Z

It sounds a little strange to hear "could" receive data. Does that mean it is a readable object or does it mean there is data on the socket that may be read by the next receive_some() command? I am looking to find a way to put all my trio.nursery()'s in to one nursery group but when I do a receive_some() on a socket it locks up and freezes the app. I know the socket is readable I just need to know if it is a good time to read from it or not. I other words, if the socket is going to lock up then I can skip and check the next socket and see if there is data there to be read. And just have one loop if the socket had an attribute "read_waiting" == True when there is data that will be read or the socket will be closed as usual if the read is emtpy.

Fuyukai added the missing piece label Nov 4, 2018

njsmith added the good first issue label Jun 12, 2019

ziirish added a commit that referenced this issue Jul 3, 2019

add a new is_readable method to SocketType (fix #760)

dab8ec8

ziirish mentioned this issue Jul 3, 2019

add a new is_readable method to SocketType (fix #760) #1137

Merged

njsmith closed this as completed in 317b09a Jul 6, 2019

njsmith added a commit that referenced this issue Jul 6, 2019

Merge pull request #1137 from python-trio/gh-760

254ded3

add a new is_readable method to SocketType (fix #760)

njsmith mentioned this issue Jul 25, 2019

Detect EOF signaling remote server closed connection encode/httpx#143

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a method to trio socket objects that checks whether the OS thinks the socket is readable #760

Add a method to trio socket objects that checks whether the OS thinks the socket is readable #760

njsmith commented Oct 31, 2018

njsmith commented Nov 19, 2018

njsmith commented Jun 12, 2019

ziirish commented Jun 12, 2019

ghost commented Dec 16, 2021

Add a method to trio socket objects that checks whether the OS thinks the socket is readable #760

Add a method to trio socket objects that checks whether the OS thinks the socket is readable #760

Comments

njsmith commented Oct 31, 2018

njsmith commented Nov 19, 2018

njsmith commented Jun 12, 2019

ziirish commented Jun 12, 2019

ghost commented Dec 16, 2021