Allow models to execute on different warehouses #488

rcypher-databricks · 2023-10-26T22:52:50Z

Resolves PECO-1182

Description

Allows models in a dbt project to run on different SQL warehouses.
The available warehouses are specified as named values in the dbt profile.
The configuration for a model can specify by name which warehouse it should run on.
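
As a rough illustration of the lookup described above, here is a minimal sketch, not the adapter's actual code. The function name `resolve_http_path`, the exception class `UnknownComputeError`, the `http_path` key, and the example paths are all hypothetical, chosen for illustration only. It assumes each named warehouse in the profile maps to a connection path, that a model without an override runs on the profile's default warehouse, and that naming an undefined compute resource raises an exception.

```python
from typing import Mapping, Optional


class UnknownComputeError(Exception):
    """Raised when a model names a compute resource the profile does not define."""


def resolve_http_path(
    default_http_path: str,
    named_compute: Mapping[str, Mapping[str, str]],
    model_compute: Optional[str],
) -> str:
    """Return the HTTP path of the warehouse a model should run on."""
    # No override on the model: fall back to the profile's default warehouse.
    if not model_compute:
        return default_http_path

    # Look the name up among the warehouses declared in the profile.
    entry = named_compute.get(model_compute)
    if entry is None or "http_path" not in entry:
        raise UnknownComputeError(
            f"Compute resource '{model_compute}' is not defined in the profile"
        )
    return entry["http_path"]


# Hypothetical profile values, for illustration only.
profile_compute = {
    "heavy_compute": {"http_path": "/sql/1.0/warehouses/heavy"},
}
default_path = "/sql/1.0/warehouses/default"
```

With these values, `resolve_http_path(default_path, profile_compute, "heavy_compute")` returns the path of the named warehouse, while passing `None` returns the default path. Failing loudly on an unknown name, rather than silently falling back to the default, keeps a typo in a model's configuration from running the model on the wrong warehouse.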

Checklist

I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have updated the CHANGELOG.md and added information about my change to the "dbt-databricks next" section.

benc-db · 2023-10-26T23:02:38Z

Out of curiosity, how did you validate? Do you have tests that are uncommitted, or did you look at the logs of some existing tests?

rcypher-databricks · 2023-10-26T23:07:49Z

I verified by modifying one of the integration tests and stepping through the code as well as adding some temporary print statements. I validated the open session parameters for a model that specifies a different warehouse than the one in the profile.

benc-db · 2023-10-26T23:10:33Z

Stylistically looks good, and to the extent I can follow, the logic looks right. Assuming we go this way, please add unit tests for the new logic, and at least 1 functional test that uses the compute definitions from the existing profiles.

benc-db · 2023-10-26T23:11:46Z

Stylistically looks good, and to the extent I can follow, the logic looks right. Assuming we go this way, please add unit tests for the new logic, and at least 1 functional test that uses the compute definitions from the existing profiles.

Per discussion with dbt, we're going to try to finish migrating to their functional tests, so no new integration tests.

rcypher-databricks · 2023-10-26T23:12:04Z

Definitely going to add tests if this is the implementation we decide to use. That's why it's a draft PR at this point.

benc-db · 2023-11-10T17:46:40Z

tests/functional/adapter/warehouse_per_model/test_warehouse_per_model.py

+        return {"model_names": ["target3"]}
+
+
+class TestSpecifyingForProjectModelsInFolder(BaseSpecifyingCompute):


Do we know whether we can specify the compute to use for models with a particular tag? This came up in a customer call where they wanted to tag certain models as heavy_compute, for example.

After reading the dbt docs on tags, I don't think that would work, which is probably fine. I think having the named compute approach gets us 95% of the way to what it would be if they could target compute to tags.

rcypher-databricks requested review from andrefurlan-db, susodapop and benc-db as code owners October 26, 2023 22:52

rcypher-databricks force-pushed the main branch from 135d4e6 to ef89e18 Compare October 26, 2023 22:56

rcypher-databricks marked this pull request as draft October 26, 2023 22:56

rcypher-databricks added 3 commits November 8, 2023 16:48

Allow models to execute on different warehouses

183897c

Signed-off-by: Raymond Cypher <[email protected]>

Raise exception on missing compute resource

81fa7ba

Raise an exception if a model specifies a compute resource that is not defined in the profile.

Signed-off-by: Raymond Cypher <[email protected]>

Tests for warehouse-per-model

e1f89c0

Signed-off-by: Raymond Cypher <[email protected]>

rcypher-databricks force-pushed the main branch from fbe3f25 to e1f89c0 Compare November 8, 2023 23:48

rcypher-databricks marked this pull request as ready for review November 9, 2023 00:33

benc-db reviewed Nov 10, 2023

benc-db approved these changes Nov 10, 2023

rcypher-databricks merged commit 7c9fc66 into databricks:main Nov 10, 2023
18 checks passed
