Add retry policy and fix documentation for Cassandra storage backend #10467
Conversation
fix docs for connection_timeout (force-pushed from b34bcef to fdcab88)
@ncabatoff, Hi! Could you take a look, please?
The enhancement request, along with the accompanying documentation improvements, does seem relevant. Can this be reviewed for release in the near future?
Likely related to #15899; on merge, the user on that issue should be advised to retest before closing that issue too.
lgtm
The changelog check is ok -- I triggered the CI run off another branch and PR, so the check is looking for the wrong changelog entry.
LGTM. We gave it a test run and it all seems good! 👍
In short, this PR fixes two problems:
More details below...
The current documentation says that the default value for `connection_timeout` is 0, so you might think there is no timeout by default. In fact, if we don't set this option, the timeout will be 600ms; see:
vault/physical/cassandra/cassandra.go, lines 127 to 133 at 0e8c6c2. `cluster.Timeout` is not changed anywhere else, so it keeps gocql's default value: vault/vendor/github.com/gocql/gocql/cluster.go, line 49 at 665d668.
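To make the behavior concrete, here is a minimal sketch (not the exact Vault code; the override logic is paraphrased, and the configured value of 5 is hypothetical):

```go
package main

import (
	"fmt"
	"time"

	"github.com/gocql/gocql"
)

func main() {
	cluster := gocql.NewCluster("127.0.0.1")

	// gocql's NewCluster already initializes Timeout to 600ms,
	// so an unset connection_timeout does not mean "no timeout".
	fmt.Println(cluster.Timeout) // 600ms

	// Roughly what the backend does when connection_timeout is set
	// (the value is interpreted as seconds):
	connectionTimeout := 5 // hypothetical configured value
	cluster.Timeout = time.Duration(connectionTimeout) * time.Second
	fmt.Println(cluster.Timeout) // 5s
}
```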
If we have a Cassandra cluster with several nodes, we don't want to get an error when one of the nodes goes down. To support this behavior with gocql (which is used in the Cassandra backend), we must set a RetryPolicy on the cluster: https://github.com/gocql/gocql/blob/5913df4d474e0b2492a129d17bbb3c04537a15cd/policies.go#L158
By default (the current behavior), the cluster uses the "retry on the same connection" policy, so if the current active node is down, the client will get an error:
https://github.com/gocql/gocql/blob/5913df4d474e0b2492a129d17bbb3c04537a15cd/policies.go#L141
We can easily fix this by using SimpleRetryPolicy (retry on another connection) when creating the cluster.
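Below is a minimal sketch of that fix against plain gocql; the NumRetries value of 3 is illustrative, not necessarily the PR's default:

```go
package main

import (
	"fmt"

	"github.com/gocql/gocql"
)

// newCluster builds a cluster config whose failed queries are retried
// on other connections instead of erroring out on the first dead node.
func newCluster(hosts ...string) *gocql.ClusterConfig {
	cluster := gocql.NewCluster(hosts...)

	// Replace the default "retry on the same connection" behavior:
	// with SimpleRetryPolicy, gocql re-runs the query, picking another
	// connection for each attempt.
	cluster.RetryPolicy = &gocql.SimpleRetryPolicy{NumRetries: 3}
	return cluster
}

func main() {
	cluster := newCluster("cassandra1", "cassandra2", "cassandra3")
	fmt.Println(cluster.RetryPolicy)
}
```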
Also, one new option is added to set the timeout for the initial connection.
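For illustration, this presumably maps onto gocql's ConnectTimeout (the dial/handshake timeout for establishing a connection), as opposed to the per-query Timeout above; the 5-second values are illustrative:

```go
package main

import (
	"fmt"
	"time"

	"github.com/gocql/gocql"
)

func main() {
	cluster := gocql.NewCluster("127.0.0.1")
	cluster.Timeout = 5 * time.Second        // per-query timeout (connection_timeout)
	cluster.ConnectTimeout = 5 * time.Second // timeout for the initial connection
	fmt.Println(cluster.Timeout, cluster.ConnectTimeout)
}
```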
I tried to keep the changes as small as possible.
This PR doesn't change the behavior of current installations.