
All of a sudden, I'm receiving this error #390

Closed
thelinuxlich opened this issue Nov 2, 2016 · 31 comments

Comments

@thelinuxlich

[Error: Too many Cluster redirections. Last error: ReplyError: MOVED 6686 10.240.100.31:7000]

I have a 6 nodes setup, I saw this in the log of a node:

Oct 31 19:04:54 aidax-redis-2 redis-server[1501]: 1501:M 31 Oct 19:04:54.996 * Marking node bd1251b171bf467313c4300d761d92417cbb586f as failing (quorum reached).
Oct 31 19:04:55 aidax-redis-2 redis-server[1501]: 1501:M 31 Oct 19:04:55.439 * Clear FAIL state for node bd1251b171bf467313c4300d761d92417cbb586f: slave is reachable again.

I think ioredis is not recovering well alongside the Redis cluster (3.2).

@luin
Collaborator

luin commented Nov 3, 2016

Could you please enable debug mode (DEBUG=ioredis:* node yourapp.js) and post the logs from when this problem happens?

@thelinuxlich
Author

Ok, I will keep you updated

@thelinuxlich
Author

So today it happened again and this is the log with DEBUG:

https://gist.github.com/thelinuxlich/365b8cafc98295b14077a38a44a949a7

@luin
Collaborator

luin commented Nov 4, 2016

When a command is rejected with a MOVED error, ioredis updates the local slot cache and resends the command to the new node. It's strange that the command is being sent to the same node multiple times.
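As a rough sketch of the behavior described above (illustrative only, not ioredis's actual code; the execute helper, the fake nodes, and their names are made up for this example), a cluster client follows MOVED redirections while refreshing its slot cache, and caps the number of redirections so a loop eventually fails with the error from this issue. ioredis exposes a similar cap via the maxRedirections cluster option (default 16):

```javascript
'use strict';
// Illustrative sketch -- not ioredis internals. Each "node" is modeled as a
// function that takes a slot and returns either a reply or a MOVED string.
const MAX_REDIRECTIONS = 16;

function execute(nodes, slotCache, nodeName, slot, redirections) {
  redirections = redirections || 0;
  if (redirections > MAX_REDIRECTIONS) {
    throw new Error('Too many Cluster redirections');
  }
  const reply = nodes[nodeName](slot);
  if (typeof reply === 'string' && reply.indexOf('MOVED') === 0) {
    const target = reply.split(' ')[2]; // "MOVED <slot> <host:port>"
    slotCache[slot] = target;           // refresh the local slot cache
    return execute(nodes, slotCache, target, slot, redirections + 1);
  }
  return reply;
}

// A healthy redirect: node A says the slot moved to B, and B answers.
const healthy = { A: () => 'MOVED 0 B', B: () => 'OK' };
console.log(execute(healthy, {}, 'A', 0)); // 'OK'

// A redirection loop (A -> B -> A -> ...) exhausts the cap and throws,
// which is the failure mode reported in this issue.
const loop = { A: () => 'MOVED 0 B', B: () => 'MOVED 0 A' };
try {
  execute(loop, {}, 'A', 0);
} catch (e) {
  console.log(e.message); // 'Too many Cluster redirections'
}
```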

@thelinuxlich
Author

I'm configuring it to connect to all 6 nodes, and when this happens, I have to restart the whole process :(

@thelinuxlich
Author

Some more info: I'm using Node 5.5, and the Redis operations consist of GET, SET, SETEX, and EVALSHA scripts (using defineCommand).
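For readers unfamiliar with defineCommand: it registers a Lua script as a first-class command that ioredis runs via EVALSHA. The author's actual scripts are not shown in the thread, so the following is a hypothetical example of the shape such a definition takes (the setIfHigher name and its logic are invented for illustration):

```javascript
'use strict';
// Hypothetical script spec for ioredis's defineCommand (illustrative only).
// numberOfKeys tells ioredis how many leading arguments are keys, which it
// also uses to compute the hash slot in cluster mode.
const setIfHigherSpec = {
  numberOfKeys: 1,
  lua: [
    "local current = tonumber(redis.call('GET', KEYS[1]) or '-1')",
    "if tonumber(ARGV[1]) > current then",
    "  redis.call('SET', KEYS[1], ARGV[1])",
    "  return 1",
    "end",
    "return 0"
  ].join('\n')
};

// With a live connection one would register and call it like this:
// const Redis = require('ioredis');
// const redis = new Redis();
// redis.defineCommand('setIfHigher', setIfHigherSpec);
// redis.setIfHigher('score', 42, (err, changed) => { /* ... */ });

console.log(setIfHigherSpec.numberOfKeys); // 1
```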

@thelinuxlich
Author

Is there any more info I can provide to help pinpoint the issue?

@luin
Collaborator

luin commented Nov 7, 2016

@thelinuxlich Is this issue related to your environment? If it can't be reproduced with a general setup, it would be very helpful if you could provide the Docker files so I can debug the problem locally in exactly the same environment as yours. Otherwise, is it possible to help me narrow down where the problem happens?

@thelinuxlich
Author

I'm testing with older ioredis versions, let's see

@thelinuxlich
Author

Interesting, ioredis 2.0.0 is doing okay

@luin
Collaborator

luin commented Nov 8, 2016

That's strange since there's not much difference between the two versions: v2.0.0...v2.4.0. Hmm...

@thelinuxlich
Author

Now I can confirm that switching to ioredis 2.0.0 totally mitigates the problem

@luin
Collaborator

luin commented Nov 8, 2016

@thelinuxlich Could you please also test with ioredis v2.1.0 & v2.2.0 so we can find out in which version the issue was introduced? Thanks a lot 😆

@thelinuxlich
Author

After days of testing, I can confirm this happens in ioredis 2.2+.

@luin
Collaborator

luin commented Nov 17, 2016

That means 2.1.0 doesn't have the problem? Looks like there's not much difference in cluster mode between 2.1.0 & 2.2.0: v2.1.0...v2.2.0. Hmm...

@AVVS
Collaborator

AVVS commented Nov 17, 2016

To me it looks like a cluster misconfiguration. The difference is that ioredis creates a new node based on a report from a cluster node that tells it to ask some new node not in the list. It asks that node, gets another redirect, and so on. One would assume the nodes report wrong IP addresses for some reason, and that leads to redirection loops.

Another possibility is that the response is parsed incorrectly, but I assume that's not the case.

@thelinuxlich
Author

Well, it can't be misconfiguration, because I've been using ioredis for more than a year without any configuration changes, and this began happening all of a sudden when I updated from 2.0 to 2.4.

@AVVS
Collaborator

AVVS commented Nov 18, 2016

@thelinuxlich maybe you are right. The place it can happen is

v2.1.0...v2.2.0#diff-ded17b5f6b1633a10701ad5e98a4e3c8R428

@Aditya-Chowdhry

I recently used an ioredis cluster in production and got the above-mentioned errors. Just want to check on the status of this issue. Is it resolved in the latest version?

@luin
Collaborator

luin commented Oct 2, 2017

@Aditya-Chowdhry No, the issue hasn't been reproduced in my environment.
@thelinuxlich How did you solve the problem in the end?
@AVVS The link you posted seems to be broken. Which part of the code did you mean?

@AVVS
Collaborator

AVVS commented Oct 2, 2017

https://github.com/luin/ioredis/compare/v2.1.0...v2.2.0#diff-ded17b5f6b1633a10701ad5e98a4e3c8R428

Fixed link

@luin
Collaborator

luin commented Oct 3, 2017

@AVVS Thanks for the update. How is this line of code related to the issue?

@Aditya-Chowdhry

In my case, I am repeatedly getting exactly the same error:

MOVED 0 xxx.xx.xx.xx:6389

where the cluster consists of only 4 masters (ports 6389, 6390, 6391, 6392).

I have another cluster, consisting of masters & slaves, for which I am not getting any errors.

@luin
Collaborator

luin commented Oct 3, 2017

@Aditya-Chowdhry Was 6389 down while getting the error? Is the issue reproducible?

@Aditya-Chowdhry

Aditya-Chowdhry commented Oct 3, 2017

@luin No, 6389 was not down. I am not able to reproduce it locally. Debugging further; will get back ASAP if I find something.

@Aditya-Chowdhry

@luin Got the issue. The error I posted occurs when a null cache key is passed. I wrote a small script to reproduce the issue.

'use strict';
const Redis = require('ioredis');
const CLUSTER_HOST = '127.0.0.1';

const cluster = new Redis.Cluster([
    { port: 6383, host: CLUSTER_HOST },
    { port: 6384, host: CLUSTER_HOST }
]);

const cacheKey = null; // passing a null key triggers the error
const data = 'Hey this is test data';
console.log('CacheKey', cacheKey);

for (let i = 0; i < 100; ++i) {
    cluster.set(cacheKey, data, function (err, reply) {
        if (err) console.log('Error', err);
        if (reply) console.log('Okay set done', reply);
        cluster.expire(cacheKey, 1234, function (err, reply) {
            if (err) console.log('Error', err);
            if (reply) console.log('Okay expire done', reply);
            cluster.get(cacheKey, function (err, reply) {
                if (err) console.log('Error', err);
                if (reply) console.log('Okay get done', reply);
            });
        });
    });
}

> Error: Too many Cluster redirections. Last error: ReplyError: MOVED 0 127.0.0.1:6380
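For applications stuck on an affected version, a simple workaround is to validate keys before they reach the cluster. A minimal sketch (the withValidKey helper is made up for this example; it is not part of ioredis):

```javascript
'use strict';
// Illustrative workaround: reject empty/null/undefined keys up front instead
// of letting them reach the cluster and trigger the redirection error.
function withValidKey(key, cb, run) {
  if (key === null || key === undefined || key === '') {
    return cb(new Error('invalid cache key: ' + String(key)));
  }
  return run();
}

// Usage with a cluster instance (not created here):
// withValidKey(cacheKey, done, () => cluster.set(cacheKey, data, done));

withValidKey(null, err => console.log(err.message), () => {});
// prints "invalid cache key: null"
```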

luin added a commit that referenced this issue Oct 4, 2017
Before this fix, empty key names (""/null/undefined) were
sent to random nodes. That should not be the case when the
key name is an empty string.

Related to #390.

Thanks to Aditya-Chowdhry for addressing this issue!
@luin
Collaborator

luin commented Oct 4, 2017

@Aditya-Chowdhry Just fixed this issue in v3.2.1. Thank you for the example, that helps!

@tuananh
Contributor

tuananh commented Oct 4, 2017

@luin One question: when the key is undefined, why do you want to calculate the slot anyway?

@luin
Collaborator

luin commented Oct 4, 2017

@tuananh null and undefined are converted to empty strings in ioredis, and it makes sense to calculate the slot for an empty string.
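For context, a sketch of the standard Redis Cluster key-to-slot mapping (not ioredis's actual code; hash-tag handling of {...} substrings is omitted for brevity): the slot is CRC16 (XModem variant) of the key, mod 16384. CRC16 of the empty string is 0, which is why every null/undefined key converges on slot 0 and the errors above all read "MOVED 0 ...":

```javascript
'use strict';
// CRC16-XModem (poly 0x1021, init 0x0000), the checksum Redis Cluster uses
// for hash-slot assignment. Bitwise variant without a lookup table.
function crc16(str) {
  let crc = 0;
  for (const byte of Buffer.from(str)) {
    crc ^= byte << 8;
    for (let i = 0; i < 8; i++) {
      crc = (crc & 0x8000)
        ? ((crc << 1) ^ 0x1021) & 0xffff
        : (crc << 1) & 0xffff;
    }
  }
  return crc;
}

function keySlot(key) {
  return crc16(key) % 16384; // 16384 slots in a Redis Cluster
}

console.log(keySlot(''));    // 0 -- null/undefined coerced to '' lands here
console.log(keySlot('foo')); // 12182, matching CLUSTER KEYSLOT foo
```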

@stale

stale bot commented Nov 3, 2017

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 7 days if no further activity occurs, but feel free to re-open a closed issue if needed.

@stale stale bot added the wontfix label Nov 3, 2017
@stale stale bot closed this as completed Nov 10, 2017
@thenitai

I'm seeing this issue with 3.2.2 in our production environment with 3 Redis nodes (different IPs).
