socket.io redis store leaks channels #663
Comments
ping @dshaw ^
On it.
The third leaked subscription is this one: https://github.com/LearnBoost/socket.io/blob/master/lib/manager.js#L433 It looks like this is because the other processes never get a disconnect message, and it looks to me as though that's because the connected process's manager never publishes a message on the 'disconnect' channel -- so that probably means we're leaking memory in the client processes as well as channels in the Redis instance. I'll issue a pull; if you want it against some repo other than this one, let me know which.
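A standalone sketch of the mechanism being described (this is not the socket.io source or the patch that landed; the channel names and client id are assumptions for illustration): the process that owns the client publishes on a per-client disconnect channel, and every other process drops its per-client subscriptions when it hears that message.

```js
// Demonstration of the idea with node_redis directly, not socket.io internals.
var redis = require('redis');

// --- in each non-owning process ---
var sub = redis.createClient();
var id = 'some-client-id';                    // hypothetical client id

sub.subscribe('dispatch:' + id);              // per-client channels this
sub.subscribe('disconnect:' + id);            // process is listening on

sub.on('message', function (channel, message) {
  if (channel === 'disconnect:' + id) {
    // Tear everything down so the channels don't linger in Redis.
    sub.unsubscribe('dispatch:' + id);
    sub.unsubscribe('disconnect:' + id);
  }
});

// --- in the process that owns the client, on disconnect ---
var pub = redis.createClient();
pub.publish('disconnect:' + id, 'client gone');
```

Without that publish, the subscribing processes never get a signal to unsubscribe, which is how both the Redis channels and the per-client state in each process can leak.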
@brettkiefer Pull Request #665 is not going to handle it correctly. Sorry, I've been tied up with Thanksgiving holidays. Landing patch(es) now.
…s not getting passed through onClientDisconnect().
…s not getting passed through onClientDisconnect()." This reverts commit d5ab46d.
@dshaw Cool, and I see that you picked up the disconnect publish from c742735 in c110036 to get rid of the other leaked channel, so that looks like it ought to fix our issue; I'll give your changes a try. Thanks! I did it the way I did because it looked to me as though onClientDisconnect would only ever be called when 'local' should be true, so there was no need for the param -- can you explain if that reasoning was wrong? @guille Uh oh, looks like this broke something for you, hence the reverts?
It might or might not have broken our tests :D
Tests run clean for me.
@brettkiefer onClientDisconnect is called in 3 places, only one of which is called with the local flag.
Issue is with Travis, not dshaw. ;-P
@dshaw Okay, got it - so a Socket.prototype.disconnect and a Manager.prototype.handleClient can happen in processes where the client isn't 'local'?
Running with dshaw's patch applied to 0.8.7, it's certainly not leaking 3 channels per closed connection (thanks!). It's still leaking channels, but very slowly, and I haven't yet figured out when or why. Also, just in case this rings any bells, it LOOKS like we're seeing an issue in production where one of our processes will start creating socket.io pubsub channels so quickly that it will flood all of the space available for Redis in under a minute, so I've added logging to try to figure that out, too. It looks like it happens about once per 10000 open websockets per hour.
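A rough sketch of one way to log the channel count over time with node_redis (this is not the actual logging referenced above; the sampling interval and output format are arbitrary choices):

```js
// Sample Redis's pubsub_channels counter once a minute and log it, so a slow
// leak or a sudden channel flood shows up as a trend in the log.
var redis  = require('redis')
  , client = redis.createClient();

setInterval(function () {
  client.info(function (err, info) {
    if (err) return console.error('INFO failed:', err);
    var m = /pubsub_channels:(\d+)/.exec(info.toString());
    console.log(new Date().toISOString(),
                'pubsub_channels =', m ? m[1] : 'unknown');
  });
}, 60 * 1000);
```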
If you have reproducible steps for the additional leaks, please post them. I'd like to squash that too. I've encountered an issue like that intermittently myself. Seems to be related to XHR transports getting in a weird state. Are you only using pure websockets?
Thanks, and of course I'll post more info on the other issues as I get it. Yes, pure websockets only -- we use a homegrown and fairly efficient short xhr poll as the fallback for Trello because that approach ALWAYS works, and we had to turn Socket.io websockets off right after launch due to some difficult-to-diagnose, sudden, and catastrophic problems we were having with it under load. So we're trying to get all of our websocket-capable browsers back on websockets now.
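For reference, restricting socket.io to pure websockets is just the transports setting; a minimal sketch, assuming a 0.8.x-style setup (the port is an assumption):

```js
// Serve only the websocket transport, so no XHR-polling fallback
// ever kicks in on the socket.io side.
var io = require('socket.io').listen(8080);

io.configure(function () {
  io.set('transports', ['websocket']);
});
```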
Okay, it looks like the issue with Redis consuming all of its available memory in a very short time (issuecomment 3010815, three comments up) is NOT due to a bunch of pubsub channels being opened -- measurement error on my part, my apologies. |
The issue with Redis going down the tubes and evicting all of its keys turned out to be redis/redis#91, see that case for more info. It LOOKED like it was a problem with socket.io because without socket.io running the offending process would die quickly (and was almost always the first to err on a SET), whereas with socket.io we try to close all of our websocket connections on process exit to work around issue #495, and the extra time doing that kept the Redis connection (and thus its output_list) around long enough to do more damage. |
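A sketch of the kind of "close all websocket connections on process exit" handler described above (not the exact code in question; the signal choice, the timeout, and the 0.8.x io.sockets.clients() call are assumptions):

```js
var io = require('socket.io').listen(8080);

process.on('SIGTERM', function () {
  // Disconnect every client this process owns so the transports (and the
  // store's per-client channels) close cleanly -- the #495 workaround.
  io.sockets.clients().forEach(function (socket) {
    socket.disconnect();
  });
  // Give the disconnect packets a moment to flush before exiting.
  setTimeout(function () {
    process.exit(0);
  }, 1000);
});
```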
Repro:
Socket.io 0.8.7
Run socket.io with the redis backing store (a minimal setup sketch follows these steps).
Run 'info' on your redis instance, noting the number of pubsub_channels.
Create a websocket connection to socket.io.
Run 'info' again: you should now have 5 more pubsub_channels than before.
Close the websocket connection.
Run 'info' again: you now have 3 more pubsub_channels than when you started.
Repeat -- it leaks 3 channels each time.
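A minimal setup for the first step, assuming the socket.io 0.8.x RedisStore API (the require path, option names, and port follow the commonly documented usage and should be treated as assumptions for your exact version):

```js
// Sketch: socket.io 0.8.x listening with the Redis backing store.
var sio        = require('socket.io')
  , RedisStore = require('socket.io/lib/stores/redis')
  , redis      = require('redis')
  , io         = sio.listen(8080);

io.configure(function () {
  io.set('store', new RedisStore({
      redisPub    : redis.createClient()   // publishes events to other processes
    , redisSub    : redis.createClient()   // receives events from other processes
    , redisClient : redis.createClient()   // plain key/value use by the store
  }));
});
```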
Is this intended?
This line would cause it to leak 2 channels, because we're not unsubscribing in the non-local case: https://github.com/LearnBoost/socket.io/blob/master/lib/manager.js#L516
But even if I remove the if (local) logic, I still leak one channel per opened and closed connection.
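An approximation of the pattern at the linked line (the function shape and channel names here are guesses, not the actual lib/manager.js source): the per-client unsubscribes only run when local is set, so a process that learns about the disconnect remotely never releases its two per-client channels.

```js
// Hypothetical paraphrase of the guarded cleanup, for illustration only.
function onClientDisconnect(store, id, local) {
  if (local) {
    store.unsubscribe('dispatch:' + id);
    store.unsubscribe('message:' + id);
  }
  // ...rest of the disconnect handling...
}
```

Removing the guard would let remote processes release those two channels, which matches the observation above that one leaked channel per connection still remains (the disconnect channel discussed later in the thread).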