WebSockets and HTTP sharing the same port using request headers #116

pshaughn · 2016-05-25T14:28:01Z

RFC 6455 doesn't mandate that a WebSocket server implementation be able to share the same port with HTTP, but the protocol was designed to allow the possibility.

Currently, WebSocketServerProtocol.handshake doesn't test for the "Upgrade: websocket" header. This doesn't cause any problems because only a WebSocket client would be able to finish the handshake process, but it does appear to rule out port sharing.

It would be convenient in some cases to be able to pass something like an HTTPServer into the WebSocketServer constructor and have requests without the "Upgrade: websocket" header be forwarded to it.

I realize that just putting WebSocket traffic on its own port solves the same problem more easily in most cases, so this idea probably isn't worth implementing if it would take a lot of effort.

aaugustin · 2016-05-25T15:35:33Z

Indeed, I didn't want to implement a HTTP server, only a websocket server (unlike e.g. aiohttp).

You can /probably/ inherit or wrap WebSocketServerProtocol into something that handles HTTP. I made an effort in the design of the API not to rule out this possibility immediately, but I never tried it, so it's entirely possible that it doesn't work...

If you try it, I'm interested into hearing about the results :-) However I suspect it will always be easier to run on a different IP or port.

ghostplant · 2016-06-04T01:25:39Z

I also need this feature, because it will make deployment more easy when been packaged by docker and other engines. Sharing the same port will make it easy to design the port forwarding : )

ghostplant · 2016-06-04T01:27:03Z

I just implement web ssh (https://github.com/ghostplant/websocket-myssh-tunnel.git) using this websockets library, but listening on two ports (8022/8023) seems to be ugly. I am finding a solution.

I think supporting static html files based on GET method (like a SimpleHTTPServer do) will be enough because javascript in html will be able to communicate with the websocket server!

aaugustin · 2016-06-04T07:31:45Z

I believe the Docker best practice is to run different services in different containers, so you shouldn't serve static contents and websockets from the same container?

Supporting static file isn't enough, someone will ask for dynamic stuff shortly thereafter...

pshaughn · 2016-06-04T12:16:13Z

In your case, ghostplant, it sounds like you can use a regular web-server with server-side proxying, and have it proxy requests on a specific path into the websocket port. Since you're already using Docker, you can easily set up nginx to do that.

ghostplant · 2016-06-04T15:15:24Z

Firstly, Python Flask / Java Tomcat based websocket are designed to share http port and websocket port.

Secondly, if I seperate them two into different docker, when it comes to deploy multiple group of these services, it will be a little more complex, because javascript (from html client side) has to write a complex proceduce to calculate the corrent port of proxy requests to the websocket server.

ghostplant · 2016-06-04T15:17:34Z

To some extent, HTTP Request and WebSocket Request have same format so they can be regarded as one service.

pshaughn · 2016-06-04T15:57:48Z

I like that this is just a pure-Python websockets program with no framework dependencies, and I wouldn't want that to change. For your port-combining case, ghostplant, you can put nginx and this in the same Docker container, expose just the nginx port of that container, and set up nginx as a reverse proxy forwarding one path to the non-exposed websocket port and serving the other path itself, thus getting a container with one visible port serving both websocket and http requests.

aaugustin · 2016-06-04T21:26:17Z

I have to say that I don't aim at competing with Tomcat ;-) More seriously, websockets isn't designed to support HTTP(S). The scope of this issue is to make it easier to integrate with a third-party server.

ghostplant · 2016-06-05T05:51:42Z

So can you add a hook to allow users to implement handling HTTP event by themselves? Current "handshake" method in server.py doesn't satisfy idempotency so if users inherit that Protocol, they have to re-implement most of the codes.

I suggest two ways to support this hook in server.py, approximately adding 2-8 lines. Can I suggest the PR?

aaugustin · 2016-06-05T07:05:37Z

Of course you can submit a PR, that's the best way to describe the approach you're proposing.

tgwizard · 2016-09-03T10:37:13Z

I hacked together a simple subclass of WebSocketServerProtocol that can also serve regular HTTP requests. Something like this:

class HttpWSSProtocol(websockets.WebSocketServerProtocol):
    async def handler(self):
        try:
            request_line, headers = await websockets.http.read_message(self.reader)
            method, path, version = request_line[:-2].decode().split(None, 2)
        except:
            self.writer.close()
            self.ws_server.unregister(self)
            raise

        # TODO: Check headers etc. to see if we are to upgrade to WS.
        if path == '/ws':
            # HACK: Put the read data back, to continue with normal WS handling.
            self.reader.feed_data(bytes(request_line))
            self.reader.feed_data(headers.as_bytes().replace(b'\n', b'\r\n'))
            return await super(HttpWSSProtocol, self).handler()
        else:
            try:
                return self.http_handler(method, path, version)
            finally:
                self.writer.close()
                self.ws_server.unregister(self)

    def http_handler(self, method, path, version):
        response = '\r\n'.join([
            'HTTP/1.1 404 Not Found',
            'Content-Type: text/plain',
            '',
            '404 Not Found\n%s %s.' % (method, path),
        ])
        self.writer.write(response.encode())


async def ws_handler(websocket, path):
    pass


start_server = websockets.serve(ws_handler, '0.0.0.0', port, klass=HttpWSSProtocol)

It's unfortunate that we'd need to read the request headers twice for the WS case, and I guess there's where a PR could simplify a bit. Full working (but very hacky) code in https://github.com/tgwizard/touch-down

Badg · 2016-10-15T00:45:42Z

It's probably worth mentioning that load balancer health checks (on, for example, AWS, which now supports Websockets natively) are often http-only. Right now an easy workaround is to just use a second port and http.server for a trivial health check (respond to everything with 200), but it would be nice to be able to actually check the health of the websockets server itself, since that's ultimately what matters.

AWS limits successful response codes for health checks to be in the 200-399 (inclusive) range, so unfortunately you can't use the 101 response to pass the health check.

aaugustin · 2016-10-15T11:35:16Z

On that topic, other people have had issues with load balancer health checks that just opened a TCP connection. This worked but filled the logs with errors.

aaugustin · 2017-05-05T10:56:49Z

One solution would be a hook just before key = check_request(get_header) in WebSocketServerProtocol, that could be implemented in subclasses that don't want to proceed with the WebSocket handshake. I'm not very enthusiastic about that.

cjerdonek · 2017-07-09T02:39:40Z

In my own work, I'm finding there are other use cases where a hook for normal HTTP requests seems to make sense.

One is if you want to expose an API on the server for callers to query something about the state of the websockets process. Another is if you want the server to be able to receive data out-of-band via HTTP. In both cases, it seems like an unnecessary step for the caller to have to "wrap" such one-off API calls in a websocket connection (and similarly, on the server-side, for the server to have to process it as a websocket connection).

In terms of where such a hook could go, it seems like it might make more sense to go in WebSocketServerProtocol.handler() before the call to handshake(). This is because, semantically, it seems to make more sense to call handshake() only if you actually want a websocket connection.

There is a second benefit to putting a hook in handler() before handshake(). It relates to a comment I made here about handling 404's.

Currently, to return a 404, it seems like the only way is to override get_response_status(). But get_response_status() happens relatively late in the process (e.g. after calling check_request()). In the usual case where a 404 depends only on the path, it seems like there should be a way to return a 404 before the handshake even begins. This could be done in the same place as the (non-ws) HTTP hook I'm suggesting above (i.e. before handshake() in handler()). This would let 404-handling be on the "fast path" (where it probably should be).

Putting a hook where I'm suggesting would require some minor refactoring like calling read_http_request() in handler() before handshake() instead of inside handshake().

aaugustin · 2017-07-10T21:39:04Z

Yes, I think a hook on the server-side just before handshake would be appropriate.

Perhaps that hook should replace get_response_status, perhaps not.

I may have strong opinions about the API we choose :-)

cjerdonek · 2017-07-10T21:48:51Z

The way I did it was to add a pre_handshake() method containing the boilerplate, but the hook is still get_response_status() (called by pre_handshake()).

cjerdonek · 2017-07-14T20:22:56Z

Just to clarify, the patch I posted in PR #202 keeps the API exactly the same as before. All it changes is when get_response_status() is called, namely before handshake() instead of during.

The patch also moves the bits in handshake() that aren't specific to websockets to be called before handshake(). This way get_response_status() can access the same general attributes as before like path, origin, etc. Moving these bits has the additional benefit of simplifying handshake() and making it responsible only for the websocket-specific logic.

I will finish adding a couple tests to PR #202. Documentation changes should I think wait until after PR #199 is committed, because the changes would be on top of that (and appear to be minimal, from what I can tell, since the patch doesn't change the API). Thanks!

[Edit: correction, there is a slight change to the API in the PR, namely with what get_response_status() should return.]

This replaces the get_response_status() API which never made it into a release (so there's no backwards incompatibility). Remove a test that depends on get_response_status() being called after check_request(). The extension point must be before check_request() so it can handle regular HTTP requests. Fix #116. Supersedes #202 #154, #137.

aaugustin · 2017-08-12T08:37:08Z

Given the very large overlap between the goals of get_response_status() and the extension point discussed here, I think it makes sense to change get_response_status() into a more generic API, which I drafted in #248 with a new API: process_request(). Dropping the get_response_status() keeps things simple.

process_request() only has access to path (== self.path) and request_headers (== self.request_headers). It may also poke into self.raw_request_headers if needed. That's sufficient to access origin with self.request_headers['Origin'] if needed.

I agree with postponing the documentation changes to #199, after we make a decision on the API.

aaugustin · 2017-08-12T08:39:34Z

Everyone who contributed to this issue: thanks for you inputs; can you comment on #248 to tell me if it solves your use case? Thanks!

This replaces the get_response_status() API which never made it into a release (so there's no backwards incompatibility). Remove a test that depends on get_response_status() being called after check_request(). The extension point must be before check_request() so it can handle regular HTTP requests. Fix #116. Supersedes #202 #154, #137.

aaugustin added the documentation label Dec 23, 2016

aaugustin closed this as completed in d0d434a May 5, 2017

cjerdonek mentioned this issue Jul 9, 2017

Improve documentation around customizing handling of handshake #197

Closed

aaugustin reopened this Jul 9, 2017

This was referenced Jul 9, 2017

Address issue #197 re: handshake customization docs. #199

Closed

Address issue #116: make existing hook take place before the handshake #202

Closed

cjerdonek added enhancement and removed documentation labels Jul 24, 2017

aaugustin mentioned this issue Aug 12, 2017

Add process_request() to override request handling. #248

Merged

aaugustin closed this as completed in #248 Aug 20, 2017

aaugustin mentioned this issue Jun 23, 2019

Add a FAQ #621

Closed

18 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WebSockets and HTTP sharing the same port using request headers #116

WebSockets and HTTP sharing the same port using request headers #116

pshaughn commented May 25, 2016

aaugustin commented May 25, 2016

ghostplant commented Jun 4, 2016

ghostplant commented Jun 4, 2016 •

edited

Loading

aaugustin commented Jun 4, 2016

pshaughn commented Jun 4, 2016

ghostplant commented Jun 4, 2016

ghostplant commented Jun 4, 2016

pshaughn commented Jun 4, 2016

aaugustin commented Jun 4, 2016

ghostplant commented Jun 5, 2016

aaugustin commented Jun 5, 2016

tgwizard commented Sep 3, 2016

Badg commented Oct 15, 2016

aaugustin commented Oct 15, 2016

aaugustin commented May 5, 2017

cjerdonek commented Jul 9, 2017

aaugustin commented Jul 10, 2017

cjerdonek commented Jul 10, 2017

cjerdonek commented Jul 14, 2017 •

edited

Loading

aaugustin commented Aug 12, 2017

aaugustin commented Aug 12, 2017

WebSockets and HTTP sharing the same port using request headers #116

WebSockets and HTTP sharing the same port using request headers #116

Comments

pshaughn commented May 25, 2016

aaugustin commented May 25, 2016

ghostplant commented Jun 4, 2016

ghostplant commented Jun 4, 2016 • edited Loading

aaugustin commented Jun 4, 2016

pshaughn commented Jun 4, 2016

ghostplant commented Jun 4, 2016

ghostplant commented Jun 4, 2016

pshaughn commented Jun 4, 2016

aaugustin commented Jun 4, 2016

ghostplant commented Jun 5, 2016

aaugustin commented Jun 5, 2016

tgwizard commented Sep 3, 2016

Badg commented Oct 15, 2016

aaugustin commented Oct 15, 2016

aaugustin commented May 5, 2017

cjerdonek commented Jul 9, 2017

aaugustin commented Jul 10, 2017

cjerdonek commented Jul 10, 2017

cjerdonek commented Jul 14, 2017 • edited Loading

aaugustin commented Aug 12, 2017

aaugustin commented Aug 12, 2017

ghostplant commented Jun 4, 2016 •

edited

Loading

cjerdonek commented Jul 14, 2017 •

edited

Loading