go-redis/v8 nil shard in the shards map #2126

alemrtv · 2022-06-21T14:42:00Z

Expected Behavior

no panic due to nil pointer dereference after retrieving a nil shard from the shards map

Current Behavior

There is no check if a shard is not nil before returning it.

It might cause panic due to nil pointer dereference here

Possible Solution

Check if shard is not nil before returning it.

Steps to Reproduce

We have a proprietary code which wraps go-redis/go and sometimes recreates the Ring structure (and hence repopulating the shards map), all while holding the mutex.
Despite it, sometimes after the ring is recreated, we get panic in the place linked above because the shard retrieved by the name is nil.

Possible Implementation

func (c *ringShards) GetByName(shardName string) (*ringShard, error) {
	if shardName == "" {
		return c.Random()
	}

	c.mu.RLock()
	shard := c.shards[shardName]
	c.mu.RUnlock()

	if shard == nil {
		return nil, fmt.Errorf("a shard named %q is nil", shardName)
	}

	return shard, nil
}

The text was updated successfully, but these errors were encountered:

vmihailenco · 2022-06-22T12:07:36Z

c.shards should not contain nil shards. Do you have an idea what conditions cause go-redis to have a nil shard in the map?

alemrtv · 2022-06-22T13:15:00Z

We use a structure S which (simplified) looks like this:

import "github.com/go-redis/redis/v8"
...
type S struct {
  mu sync.RWMutex
  ring *redis.Ring
  ...
}

`*redis.Ring` is the one from `package redis` in go-redis/redis@v8.11.4.

And function f which (simplified) does something like this:

func (s *S) f() {
  s.mu.Lock()
  defer s.mu.Unlock()
  ...
  // here we change opts.Addrs map
  var newOpts redis.RingOptions = ... 
  // here the shards map gets changed and we substitute the old ring with the new one
  prev, s.ring = s.ring, redis.NewRing(&newOpts) the//new one
  ...
  defer func() {
    if prev != nil {
      go func() {
        if err := prev.Close(); err != nil {
          log(err)
        }
      }()    
    }
  }()
}

At some random points of time we call f() to reinitialize S in order to change opts.Addrs. We take the lock while doing this to avoid any race conditions.

However, as I can see in the source code, between the moment when the hash is calculated and the moment when we retrieve a shard by it a bit later, there is no lock held.

I suppose that we might have a situation, when things happen in the following order:

The hash is calculated in generalProcessPipeline()
Our s.f() is called, it does its work and changes the shards map
GetByName() tries to retrieve a shard with an invalid hash and gets a shard without checking if it is nil. This is where the nil pointer dereference happens.

Fixes redis#2126

vmihailenco added the bug label Jun 27, 2022

kavu pushed a commit to kavu/redis that referenced this issue Jul 13, 2022

fix: Handle panic in ringShards Hash function when Ring got closed

1d4456b

Fixes redis#2126

kavu mentioned this issue Jul 13, 2022

fix: Handle panic in ringShards Hash function when Ring got closed #2153

Merged

kavu pushed a commit to kavu/redis that referenced this issue Jul 13, 2022

fix: handle panic in ringShards Hash function when Ring got closed

ffbab22

Fixes redis#2126

kavu added a commit to kavu/redis that referenced this issue Jul 13, 2022

fix: handle panic in ringShards Hash function when Ring got closed

a80b84f

Fixes redis#2126

kavu added a commit to kavu/redis that referenced this issue Jul 13, 2022

fix: handle panic in ringShards Hash function when Ring got closed

a9a5aff

Fixes redis#2126

kavu mentioned this issue Jul 14, 2022

fix: provide a signal channel to end heartbeat goroutine #2157

Merged

vmihailenco closed this as completed in #2153 Jul 28, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

go-redis/v8 nil shard in the shards map #2126

go-redis/v8 nil shard in the shards map #2126

alemrtv commented Jun 21, 2022 •

edited

Loading

vmihailenco commented Jun 22, 2022

alemrtv commented Jun 22, 2022

go-redis/v8 nil shard in the shards map #2126

go-redis/v8 nil shard in the shards map #2126

Comments

alemrtv commented Jun 21, 2022 • edited Loading

Expected Behavior

Current Behavior

Possible Solution

Steps to Reproduce

Possible Implementation

vmihailenco commented Jun 22, 2022

alemrtv commented Jun 22, 2022

alemrtv commented Jun 21, 2022 •

edited

Loading