Hi,
I am trying to connect Superset to Spark 2.1.0, but the documentation on this is sparse and not well covered yet.
I have tried going through the Thrift server using the hive:// SQLAlchemy URI, but with no success.
Do I need to set up Druid, or can it work without it?
Can you please share what is supported (if anything) and how to achieve it?
My Spark cluster is standalone, and Superset is running on the same network as Spark and the Thrift server.
Thanks!
I can connect using the hive:// SQLAlchemy URI, but Superset throws a "Could not locate column in row for column 'database_name'" error when I save the connection, because there are no tables in the default (or any other) database. All of our tables live in global_temp to allow multi-session Thrift usage for resource-management purposes, and Thrift does not let you set global_temp as the default database since it is a system-preserved database.

The problem with multi-session mode is that each connection gets a fresh session with no tables loaded, so I am trying to figure out how to create a (dummy) table when the connection is initiated (using metadata_params?) so that Superset finds a table and the save succeeds. After that I should be able to query the global_temp tables.
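One way to sketch the per-connection bootstrap described above, assuming PyHive's DBAPI connection is available and using a hypothetical table name (`default.superset_bootstrap`) purely as a placeholder. The function is shaped as a SQLAlchemy `connect`-event listener, which is one plausible hook for "run this when the connection is initiated":

```python
# Sketch: DDL to run whenever a new Thrift session is opened, so that
# SQLAlchemy reflection finds at least one table in the default database.
# The table name is a made-up placeholder, not anything Superset requires.
BOOTSTRAP_DDL = "CREATE TABLE IF NOT EXISTS default.superset_bootstrap (id INT)"

def bootstrap_session(dbapi_conn, connection_record):
    """SQLAlchemy 'connect' event hook: create a dummy table per session.

    dbapi_conn is the raw PyHive DBAPI connection; connection_record is
    SQLAlchemy's pool bookkeeping object (unused here).
    """
    cursor = dbapi_conn.cursor()
    cursor.execute(BOOTSTRAP_DDL)
    cursor.close()
```

Registering it would look like `event.listen(engine, "connect", bootstrap_session)` on the engine Superset creates; whether Superset exposes a clean place to do that (as opposed to metadata_params) is exactly the open question in this thread.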
If you run Thrift in single-session mode and create some tables or views before connecting Superset, it should work. I did have to install some SASL dependencies in Superset's Python environment to get it to connect.
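For reference, single-session mode is a Spark SQL configuration flag on the Thrift server. A minimal launch sketch, assuming the stock `start-thriftserver.sh` script from the Spark distribution (the master URL is a placeholder for your standalone cluster):

```shell
# Start the Spark Thrift server in single-session mode, so that temp views
# created in one JDBC connection remain visible to later connections
# (such as the one Superset opens).
./sbin/start-thriftserver.sh \
  --conf spark.sql.hive.thriftServer.singleSession=true \
  --master spark://your-master-host:7077
```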
If you get a SASL error on connect, start Thrift in NOSASL mode and specify "auth": "NOSASL" in engine_params in Superset.
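As a sketch of what that looks like in practice: Superset's database edit screen has an "Extra" JSON field, and keys under `engine_params` are forwarded to SQLAlchemy's `create_engine()`; for the PyHive dialect, `connect_args` are in turn passed to the DBAPI connect call, which is where `auth` belongs. Building and printing the JSON in Python just to show the exact shape:

```python
import json

# "Extra" field contents for a NOSASL Thrift server. engine_params goes to
# create_engine(); connect_args reaches PyHive's hive.connect().
extra = {
    "engine_params": {
        "connect_args": {"auth": "NOSASL"}
    }
}

print(json.dumps(extra, indent=2))
```

Paste the printed JSON into the database's "Extra" field alongside the hive:// URI.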