We have plenty of open pull requests updating query runners that we are hesitant to merge, because there is no way to verify they don't introduce a regression. We have also had our share of regressions introduced by changes that seemed safe and were merged...
I think it's about time we start looking at setting up integration tests for data sources. We will probably start with the most used ones and move forward from there. The community's help here will be instrumental.
As a first step, I want to document which data sources have a Docker image we can use:
(Y = Docker image available, X = no Docker image, ? = needs checking)

| Data Source | Docker Image | Notes |
| --- | --- | --- |
| Amazon Elasticsearch | Y | We can create a test AWS account |
| Athena | X | We can create a test AWS account |
| Axibase | ? | |
| BigQuery | X | We can create a test account |
| Cassandra | Y | Not sure how straightforward the Docker image is to use |
| Clickhouse | Y | |
| Couchbase | ? | |
| Databricks | X | We can create a test account, but testing the Hive query runner should be enough |
| IBM DB2 | ? | |
| Apache Drill | ? | It's HTTP based, so at the very least we can have mocks |
| Druid | Y | |
| DynamoDB | X | We can create a test AWS account |
| Elasticsearch | Y | |
| Google Analytics | X | We can create a test account |
| Google Spreadsheets | X | We can create a test account |
| Graphite | ? | |
| Hive | ? | |
| Impala | ? | |
| InfluxDB | Y | |
| JIRA | X | We can create a test account |
| Kylin | ? | |
| MapD | ? | |
| MemSQL | ? | |
| MongoDB | Y | |
| Microsoft SQL Server | ? | |
| MySQL | Y | |
| Oracle | ? | |
| PostgreSQL | Y | |
| Phoenix | ? | |
| Presto | ? | |
| Prometheus | ? | |
| Qubole | ? | |
| Rockset | X | We can create a test account |
| Salesforce | X | |
| Snowflake | X | We can create a test account |
| SQLite | Y | |
| TreasureData | X | |
| Uptycs | ? | |
| Vertica | ? | |
| Yandex Metrica | ? | |
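To make this concrete, here's a rough sketch of what a single integration test could look like once one of the `Y` images is running. It assumes a throwaway Postgres container (e.g. `docker run -d -e POSTGRES_PASSWORD=postgres -p 15432:5432 postgres:11`) and exercises the existing query runner through `run_query`; the connection details and assertions are placeholders, not a final design:

```python
import json

from redash.query_runner.pg import PostgreSQL


def test_postgres_simple_select():
    # Placeholder connection details -- in a real harness these would come
    # from the test environment / container setup.
    runner = PostgreSQL({
        "host": "127.0.0.1",
        "port": 15432,
        "user": "postgres",
        "password": "postgres",
        "dbname": "postgres",
    })

    data, error = runner.run_query("SELECT 1 AS answer", None)

    assert error is None
    rows = json.loads(data)["rows"]
    assert rows == [{"answer": 1}]
```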
Some notes:

- Because some (most?) tests will require seed data, it might make sense to have all the data sources set up on a remote server and test against that. This introduces its own set of issues, though, so it needs looking into.
- For each data source we need to test against multiple versions, and the versions we test against/support should be documented (see the sketch below for one way to parametrize this).
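For the multiple-versions point, one option is to parametrize the tests over Docker image tags so each runner's suite runs once per version. A rough sketch using pytest and the Docker CLI (the version list, port, and sleep-based readiness wait are illustrative only):

```python
import subprocess
import time

import pytest

# Illustrative version list; the actual supported versions are still TBD.
POSTGRES_VERSIONS = ["9.6", "10", "11"]


@pytest.fixture(params=POSTGRES_VERSIONS)
def postgres_server(request):
    # Start a disposable container for this version and stop it afterwards
    # (--rm removes it on stop).
    container_id = subprocess.check_output([
        "docker", "run", "-d", "--rm",
        "-e", "POSTGRES_PASSWORD=postgres",
        "-p", "15432:5432",
        "postgres:{}".format(request.param),
    ]).decode().strip()

    # Crude readiness wait; a real harness would poll `pg_isready` instead.
    time.sleep(10)

    yield {"host": "127.0.0.1", "port": 15432, "user": "postgres",
           "password": "postgres", "dbname": "postgres"}

    subprocess.check_call(["docker", "stop", container_id])
```

Each version then shows up as a separate test case in the report, which also documents the tested versions in one place.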