Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Amazon Cassandra : Column family TableIdentifier(ksName=system, cfName=local) is not writable #1389

Closed
nikhiljangi opened this issue Jan 13, 2020 · 28 comments · Fixed by #1412

Comments

@nikhiljangi
Copy link

Please answer these questions before submitting your issue. Thanks!

What version of Cassandra are you using?

Amazon Managed Cassandra - version 3.11.2

What version of Gocql are you using?

v0.0.0-20200103014340-68f928edb90a

What did you do?

Tried to connect to Amazon Managed Cassandra using gocql

clusterConfig := gocql.NewCluster("cassandra.us-east-1.amazonaws.com:9142")
clusterConfig.Authenticator = gocql.PasswordAuthenticator{Username: username, Password: password}
clusterConfig.SslOpts = &gocql.SslOptions{
		CaPath: "AmazonRootCA1.pem",
	}

clusterConfig.Keyspace = "test"
session, err := clusterConfig.CreateSession()

What did you expect to see?

Successful gocql Session creation

What did you see instead?

{"error": "gocql: unable to create session: control: unable to setup connection: Column family TableIdentifier(ksName=system, cfName=local) is not writable"}


If you are having connectivity related issues please share the following additional information

Describe your Cassandra cluster

please provide the following information

  • output of nodetool status
    nodetool: Failed to connect to 'cassandra.us-east-1.amazonaws.com:9142' - ConnectIOException: 'non-JRMP server at remote endpoint'.

  • output of SELECT peer, rpc_address FROM system.peers

peer | rpc_address
3.83.170.140 | null
3.83.168.142 | null
3.83.169.141 | null
3.83.170.152 | null
3.83.168.154 | null
3.83.169.153 | null
3.83.168.143 | null
3.83.170.151 | null
3.83.171.143 | null

  • rebuild your application with the gocql_debug tag and post the output

2020-01-13T16:53:55.664-0500 ERROR filter/filter.go:65 error: {"error": "gocql: unable to create session: control: unable to setup connection: Column family TableIdentifier(ksName=system, cfName=local) is not writable"}

@t2y
Copy link

t2y commented Jan 17, 2020

I also had the same issue.

@Zariel
Copy link
Contributor

Zariel commented Jan 17, 2020

This is odd, we don't write to system.local only read from it.

@t2y
Copy link

t2y commented Jan 20, 2020

According to my debugging, when gocql create prepared statement and communicate with MCS, then it failed as below.

row, err := c.query(ctx, "SELECT * FROM system.local WHERE key='local'").rowMap()
=> info, err = c.prepareStatement(ctx, qry.stmt, qry.trace)

However, I'm not sure why creating prepare statement will cause "not writable" error.

@Zariel
Copy link
Contributor

Zariel commented Jan 20, 2020

I think thats a bug in AWS, this works in Cassandra.

@tschirmer
Copy link

tschirmer commented Jan 21, 2020

same issue here. I don't think this is a bug in AWS the system.local query works within the AWS Query editor

image

@mattmassicotte
Copy link

Running into this problem as well. And, made an interesting discovery on the AWS documentation site for MCS. Looks like they may have found this problem themselves during testing.

Currently, Go client drivers are not supported.

https://docs.aws.amazon.com/mcs/latest/devguide/cqlsh.html#using_driver

@tschirmer
Copy link

Yeah I see that, but they've probably tried to use this driver and then marked it as incompatible because of the problems. At the end of the day the driver is just preparing and communicating packets via TCP. If it works via python, and C#, C++. we should be able to make it work in golang . It'd be good to see why the prepare statement is trying to write to the system table.

@dahankzter
Copy link
Contributor

Could it be the events registration on the controlConn @Zariel ? It seems to be the only thing possible at this point in the path.

@dahankzter
Copy link
Contributor

The Java drivers control connection does this as well however so it seems unlikely.

@dahankzter
Copy link
Contributor

Anyway perhaps the errors from the lower levels also should be decorated to pin exactly where it happens. The newer Go error facilities could perhaps be used to wrap/unwrap them.

@dahankzter
Copy link
Contributor

The server rejects prepared statements to the system tables and the driver always prepares when it can. Thx @slivne for finding it and testing it. Seems to be a server side issue.

@Zariel
Copy link
Contributor

Zariel commented Jan 22, 2020

Can someone try in Java/cpp doing a prepare + execute on MCS with the query SELECT * from system.local and see if it works?

If they are blocking creating prepared statements on system tables that is very strange. I don't have any contacts at AWS to talk about this.

@dahankzter
Copy link
Contributor

That would be great but unless I am lost in the callback code of the Java driver it does indeed look like it doesn't prepare any of the maintenance queries to system.* tables.

@dahankzter
Copy link
Contributor

The patch that made it work for for us was:

--- a/session.go
+++ b/session.go
@@ -1016,6 +1016,9 @@ func (q *Query) GetRoutingKey() ([]byte, error) {
 }
 
 func (q *Query) shouldPrepare() bool {
+        if strings.Contains(q.stmt, "system.") {
+           return false;
+        }

So it seems clear that there is something on the server stopping them.

@tschirmer
Copy link

tschirmer commented Jan 23, 2020

@dahankzter That seems like a reasonable solution.

Prepared Statements make sense as a default, but for this instance turning them off for system calls might be our only option. We could always set it as an option in the config to turn off/on and document it for AWS Managed Cassandra?

@dahankzter
Copy link
Contributor

I don't know if that's the way to go @tschirmer. If it's supported by Cassandra then AMC should also support it. Don't you think? This snippet was just something that was tried to diagnose the issue rather than as a proposed fix. What do you think @Zariel?

@tschirmer
Copy link

@dahankzter
tl;dr My opinion is to add a config option to disable prepared statements for system tables; but default it to enabled, so we don't exclude managed / serverless technologies.

AWS have a couple managed services (Serverless MySQL for example) where they've locked down system variable changes so that they can create a service that's easily scalable. The cost of that, is that users can't change or tune things themselves, but the upside is they don't manage it.

I don't think we should be creating a driver that locks people out of serverless architecture because certain system values can't be written. In many cases where I've run SaaS infrastructure, we've had to lock out users from certain features so that we could easily manage it.

My opinion is that we make it an option that people can turn on or off; with the default of being on. Then add a comment that there are security risks if people start using it to write to system tables without bound variables (I'm thinking injection attacks).

@dahankzter
Copy link
Contributor

I don't know the implementers motivations for this limitation @tschirmer. It would be nice if someone from the AMC team could chime in to explain. We are not explicitly writing to the system tables just preparing a query so the argument seems a little strange and a server side implementation detail.

It's up to @Zariel to decide if we should accommodate special cases for implementations other than Cassandra itself which really is the reference.

@slivne
Copy link

slivne commented Jan 23, 2020

@tschirmer the issue is that the driver is not generating a write (CQL INSERT/UPDATE) the driver is generating a read (CQL SELECT). So while I do understand the value in blocking writes to system tables blocking reads done via prepared statements seems like a bug not a feature.

@Zariel
Copy link
Contributor

Zariel commented Jan 23, 2020

I would rather not add the somewhat hacky check in shouldPrepare to work around this, or a config option, I'm also somewhat surprised that the driver continued to work. I would like to hear from AWS as to why, as @slivne says that creating a prepared statement on a system table counts as a write even though we only execute a read.

@tschirmer
Copy link

tschirmer commented Jan 23, 2020

@slivne it'd be nice to get more info from either the Cassandra team themselves, or the AWS Team. I'd like to know if Cassandra is executing a write while doing a prepared statement in the background. I don't like hacky jobs either; and in principle it shouldn't be conducting a write.

That said, personally, I'd rather have a working driver with a rather non-impacting edge case exclusion than not be able to use a managed service; the business payoff is too large to use them (I also understand this is a security concern, so I totally understand the hesitation).

I'm meeting with our AWS rep in a couple weeks; I can bring it up. Maybe someone else has contacts they can bring into this convo?

@gaurish
Copy link

gaurish commented Jan 24, 2020

We spoke to AWS about this. they said:

We have reached out to the service team regarding this issue and would like to resolve this issue as soon as we can.

Also, would it be possible for you to provide us a stack trace? That error message is ours, but it's correct. A client cannot and should not modify system tables, so we really need to know where the driver is doing that.

will keep you guys updated with the the aws cassandra service team has to say.

@danilop
Copy link

danilop commented Jan 28, 2020

Hi, I am from AWS! As @gaurish said, we are aware of the issue and working on a solution. We will update the GitHub issue when the fix has been deployed.

@Zariel
Copy link
Contributor

Zariel commented Jan 28, 2020

Thanks @danilop!

@sagarp-webonise
Copy link

@danilop any update on issue fix? when can we expect the deployment of fix?

@danilop
Copy link

danilop commented Feb 12, 2020

@sagarp-webonise Thank you for following up! I don’t have new information to share at the moment, the team is actively working on the driver support. We will update this issue when the driver support is available.

@Dhaval08
Copy link

Meanwhile, is there an alternative way to access AWS MCS using golang? Is there some API that we could call in order to perform various operations?

@danilop
Copy link

danilop commented Mar 10, 2020

We identified the cause of the warning messages customers were seeing when connecting to MCS using the gocql driver. We made a change that enables the gocql driver to connect to MCS without generating the warning messages about invalid peers. The change will apply to versions cd4b606 and newer of the gocql driver.

Please let us know if you see any further issues using the gocql driver with MCS.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.