
[Urgent!!!!!!] worker will report cross slot error when working with redis cluster #93

Closed
steven-zou opened this issue Apr 12, 2018 · 21 comments

Comments

@steven-zou

Apr 11 22:53:03 172.18.0.1 jobservice[971]: message repeated 439 times: [ ERROR: worker.fetch - CROSSSLOT Keys in request don't hash to the same slot]
Apr 11 22:53:03 172.18.0.1 jobservice[971]: ERROR: requeuer.process - CROSSSLOT Keys in request don't hash to the same slot
Apr 11 22:53:03 172.18.0.1 jobservice[971]: ERROR: requeuer.process - CROSSSLOT Keys in request don't hash to the same slot
Apr 11 22:53:03 172.18.0.1 jobservice[971]: ERROR: worker.fetch - CROSSSLOT Keys in request don't hash to the same slot

https://stackoverflow.com/questions/38042629/redis-cross-slot-error

NOTES: jobservice is our component name.

@steven-zou steven-zou changed the title [Urgent!!!!!!] worker will report cross slot error when work with redis cluster [Urgent!!!!!!] worker will report cross slot error when working with redis cluster Apr 12, 2018
@steven-zou
Author

@shdunning any comments?

@shdunning
Collaborator

I suspect this has something to do with the Lua scripts referencing keys that aren't defined on the node where the Lua script is running -- like our Lua script for fetching the next job, which requires six keys: (1) job queue, (2) in-progress queue, (3) pause key, (4) lock key, (5) lock info key, (6) set concurrency key.

We'll likely need to investigate how to make these Lua scripts cluster-safe.
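For illustration, here's a minimal redigo sketch (not gocraft/work's actual script) of why a multi-key EVAL trips this error on a cluster: Redis Cluster requires every key a script touches to hash to the same slot, and a shared {hash-tag} is what forces that. The address and key names are placeholders.

```go
package main

import (
	"fmt"

	"github.com/gomodule/redigo/redis"
)

func main() {
	// Assumes a Redis Cluster node is reachable at this placeholder address.
	conn, err := redis.Dial("tcp", "127.0.0.1:7000")
	if err != nil {
		panic(err)
	}
	defer conn.Close()

	// A trivial two-key script, standing in for a multi-key fetch script.
	script := redis.NewScript(2,
		`return redis.call("EXISTS", KEYS[1]) + redis.call("EXISTS", KEYS[2])`)

	// Without a hash tag the two keys almost certainly map to different
	// slots, so a cluster node rejects the call with a CROSSSLOT error.
	if _, err := script.Do(conn, "work:jobs", "work:jobs:inprogress"); err != nil {
		fmt.Println("no hash tag:", err)
	}

	// With a shared "{work}" hash tag only the text inside the braces is
	// hashed, so both keys land in the same slot and the script runs.
	if _, err := script.Do(conn, "{work}:jobs", "{work}:jobs:inprogress"); err != nil {
		fmt.Println("hash tag:", err)
	}
}
```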

@austintaylor
Contributor

@steven-zou What commit are you running? I believe we were constructing keys in a Lua script at one point, which would cause this error, but we fixed that in #52.

@steven-zou
Author

Thanks, @austintaylor. After a rough check, there might be a mistake in how the work library was imported into our project with dep; an older version was probably pulled in. I'll do more verification to confirm whether the above issue still exists.

@shdunning
Collaborator

@steven-zou if it's any help, we've been having better luck with https://github.com/kardianos/govendor instead of dep.

@stefanoschrs

I have the same issue on 0.5.1; when using an ElastiCache instance I get the following:

ERROR: requeuer.process - CROSSSLOT Keys in request don't hash to the same slot
ERROR: worker.fetch - CROSSSLOT Keys in request don't hash to the same slot

@sebcoetzee

@stefanoschrs any luck fixing this on 0.5.1?

@stefanoschrs

Nope, nothing..

@sebcoetzee

@stefanoschrs I ended up switching out the clustered Redis instance for a normal one to get around this.

@stefanoschrs

That's what I did also, but it's not a solution..

@steven-zou
Author

Yes, I checked with v0.5.1 and the issue is still there. Any advice? @shdunning @austintaylor

@steven-zou
Author

Just bumping this up:

@shdunning @austintaylor

@shdunning
Collaborator

@steven-zou I would need to do some digging. This (normally) occurs if we are dynamically referencing a key in the Lua scripts instead of explicitly passing it in via the KEYS array. I'll try to spend some time this week digging into the Lua scripts to see if I can track down where this might be occurring.
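To illustrate the distinction with two hypothetical fetch scripts (not the library's real ones): a key name built inside the Lua body from ARGV is invisible to the cluster's slot check, whereas a key passed through KEYS can be validated up front.

```go
// Package sketch contrasts two hypothetical fetch scripts; they only
// illustrate the pattern described above, nothing more.
package sketch

import "github.com/gomodule/redigo/redis"

// Problematic on a cluster: the in-progress key is built inside the Lua body
// from ARGV, so Redis cannot check that it shares a slot with KEYS[1].
var fetchUnsafe = redis.NewScript(1, `
	local job = redis.call("RPOP", KEYS[1])
	if job then
		redis.call("LPUSH", ARGV[1] .. ":inprogress", job)
	end
	return job
`)

// Cluster-friendly: every key the script touches is passed explicitly via
// KEYS, so the server can verify they all hash to the same slot before
// running the script.
var fetchSafe = redis.NewScript(2, `
	local job = redis.call("RPOP", KEYS[1])
	if job then
		redis.call("LPUSH", KEYS[2], job)
	end
	return job
`)
```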

@steven-zou
Author

@shdunning
Thanks a lot! Hoping we can locate the root cause and fix it.

@shdunning
Collaborator

@steven-zou when you're initializing your worker pool, can you try setting your namespace to begin and end with { and } characters? E.g., if your namespace is work, try setting it to {work} in the call to NewWorkerPool. Our theory is that this will force all of the gocraft/work keys into the same hash slot on one of the nodes in the cluster (see docs).

Hopefully this is easy enough for you to try out and get back to us.
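For anyone following along, a minimal sketch of that suggestion; the endpoint, pool sizes, and the empty Context type are placeholders, not values from this thread.

```go
package main

import (
	"time"

	"github.com/gocraft/work"
	"github.com/gomodule/redigo/redis"
)

// Context is the job context type required by NewWorkerPool (empty here).
type Context struct{}

func main() {
	// Placeholder endpoint; point this at your Redis cluster endpoint.
	redisPool := &redis.Pool{
		MaxActive:   5,
		MaxIdle:     5,
		IdleTimeout: 240 * time.Second,
		Dial: func() (redis.Conn, error) {
			return redis.Dial("tcp", "my-redis-endpoint:6379")
		},
	}

	// "{work}" instead of "work": only the text inside the braces is hashed
	// for slot assignment, so every gocraft/work key stays in one slot.
	pool := work.NewWorkerPool(Context{}, 10, "{work}", redisPool)
	// pool.Job("my_job", (*Context).MyJobHandler) // register jobs as usual
	pool.Start()

	// ... wait for a shutdown signal, then:
	pool.Stop()
}
```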

@steven-zou
Author

@shdunning

I'll give the approach you mentioned a try and let you know the results later. Thanks.

@shdunning
Collaborator

Cool. If that works, we can update the README for now with this info.

I'll also create a follow-up issue to see what it would take to make this lib Redis Cluster compatible (we use Redis Sentinel and have no issues, but that doesn't do key distribution across nodes the way Redis Cluster does).

@steven-zou
Author

@shdunning

We tried it, and putting the namespace in {} works well. Thanks.

I'll close this issue since the approach you mentioned fixes the problem in an easy way.

@shdunning
Collaborator

@steven-zou nice! I'm glad this worked. Note that this isn't an ideal solution because it forces all of the gocraft/work keys onto one node in the cluster. I need to think some more about how we would solve this problem for real; that is, allow the gocraft/work Lua scripts to take advantage of multi-node Redis clusters. I'll create a separate issue for that.

@ifraixedes

ifraixedes commented Feb 27, 2019

Hi folks,

We have tried to run work against a Redis cluster hosted on AWS ElastiCache and have run into some issues.

The first issue was the one discussed above (ERROR: requeuer.process - CROSSSLOT), which was mostly solved with the workaround mentioned here and in the README; we were aware of it, but we forgot to add the curly braces after we changed our AWS ElastiCache Redis from a single node to a cluster.

The second issue is that we're getting ERROR: requeuer.process - MOVED 3223 10.3.3.127:6379 and ERROR: worker.fetch - MOVED 3223 10.3.3.127:6379 non-stop.

If we understood correctly, it's the client that should follow the redirection when it gets MOVED errors.

We are wondering how we can configure work to use a Redis cluster, because it's based on github.com/gomodule/redigo, which as far as we can tell doesn't support Redis Cluster.

Could you help us figure out what we are missing?

Thank you very much in advance.

EDIT

Our current AWS ElastiCache Redis is a 2-node Redis (v5.0.0) cluster in multi-AZ, using 1 shard with 1 master and 1 replica.

In case this information helps in giving us an answer.
