[CT-3529] [Unit Testing] Unit Testing Versioned Models #9344

emmyoop · 2024-01-05T21:22:24Z

Housekeeping

I am a maintainer of dbt-core

Short description

This is the outcome of the spike+ #8799. Exact details of what input should look like can be found there.

Outcomes of the spike

We need to patch unit tests so they can determine which versions of a model exist, since versions are defined in the schema files and schema files are parsed at the end. This is largely done in spike unit test versions #9302.
There will still only be one unit test node, even though we may be executing multiple unit tests. The versioned models that the unit test can run against will be listed in the depends_on of the UnitTestDefinition.
We may need to do something similar to what @gshank did in the build command here, where there are two 'selected' lists, one with unit tests and one without. We would instead need two selection lists, one with models and one without, in order to account for run results.

Acceptance criteria

A unit test definition can define the model versions to include or exclude from a test
If no versions are defined in the unit test definition, but the target model is versioned, a unit test will be run for all versions of the model
When dbt build is run with a select for a versioned model, only the unit test for that specific model version will run, even if no version is defined in the schema file
When a command selects on a unit test that is for a versioned model, unit tests for all versions of that model will be run

Impact to Other Teams

no

Will backports be required?

no

Context

Suggested Tests

test with no version specified, should create a separate unit test for each version
test with an exclude version specified, should create a separate unit test for each version except the excluded version
test with an include version specified, should create a single unit test for only the version specified
test with an include and exclude version specified, should get ValidationError
test with an include for an unversioned model, should error
partial parsing test: test with no version specified, then add an exclude version, then switch to include version and make sure the right unit tests are generated for each
test with no version specified in the schema file and use selection logic on a versioned model for a specific version
test with no version specified in the schema file and use selection logic on a unit test - expect unit tests for all versioned models
test specifying the fixture version with {{ ref(name, version) }}
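
The include/exclude cases in the list above can be sketched as a small resolution function. This is a minimal illustration and not dbt-core's implementation: `resolve_versions` and `VersionConfigError` are hypothetical names, and `VersionConfigError` merely stands in for the ValidationError mentioned in the list.

```python
from typing import List, Optional, Sequence


class VersionConfigError(ValueError):
    """Hypothetical stand-in for the ValidationError named in the suggested tests."""


def resolve_versions(
    model_versions: Sequence[int],
    include: Optional[Sequence[int]] = None,
    exclude: Optional[Sequence[int]] = None,
) -> List[int]:
    """Return the model versions a unit test definition should target.

    model_versions is empty for an unversioned model.
    """
    if include and exclude:
        # include and exclude together should be rejected
        raise VersionConfigError("specify either include or exclude, not both")
    if include and not model_versions:
        # include on an unversioned model should error
        raise VersionConfigError("include given for an unversioned model")
    if include:
        return [v for v in model_versions if v in include]
    if exclude:
        # the issue does not say what exclude means for an unversioned
        # model; this sketch simply returns an empty list in that case
        return [v for v in model_versions if v not in exclude]
    # no versions specified: target every version of the model
    return list(model_versions)
```

For a model with versions 1, 2 and 3, `resolve_versions([1, 2, 3], exclude=[2])` yields versions 1 and 3, while passing both `include` and `exclude` raises the error.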

graciegoheen · 2024-01-08T20:36:17Z

This would create some known :( funky behavior:

dbt retry: if only one version of the tests had failed, all versions would be retried
if the test fails on any version, we'd block on all versions

Are we ok with that?

Alt.

unit test only applies to a single version
if you don't supply a version, applies to latest
otherwise you must supply an explicit version
not as DRY
not automatic to catch breaking changes to unit testing logic when creating a new version

graciegoheen · 2024-01-09T02:00:17Z

After discussing internally, we've decided we are not ok with this funky behavior.

We are going to try again with making one node per unit test run (instead of bundling them together). This is consistent with how we treat data tests that are configured on a model with multiple versions.

Example: a uniqueness test with a versioned model.

models:
  - name: my_model
    columns:
      - name: id
        tests:
          - unique
    versions:    
      - v: 1
      - v: 2

Command:

dbt list -s my_model

Output:

20:28:35  Running with dbt=1.7.4
20:28:36  Registered adapter: duckdb=1.7.0
20:28:36  Found 3 models, 1 snapshot, 1 analysis, 1 seed, 2 tests, 1 source, 0 exposures, 1 metric, 391 macros, 0 groups, 1 semantic model
my_project.my_model.v1
my_project.my_model.v2
my_project.unique_my_model_v1_id
my_project.unique_my_model_v2_id

Note where it says “2 tests” and that it shows those 2 tests.
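
To show why one node per version avoids the retry problem raised earlier, here is a rough sketch. It is not dbt-core code: `UnitTestNode`, `expand_unit_test`, `nodes_to_retry` and the `_v<N>` naming are all hypothetical, chosen only to mirror the data test names in the output above.

```python
from dataclasses import dataclass
from typing import List, Sequence, Set


@dataclass(frozen=True)
class UnitTestNode:
    """Hypothetical node: one unit test bound to one model version."""
    name: str
    model: str
    version: int


def expand_unit_test(
    test_name: str, model: str, versions: Sequence[int]
) -> List[UnitTestNode]:
    # One node per model version, instead of one bundled node for all versions.
    return [UnitTestNode(f"{test_name}_v{v}", model, v) for v in versions]


def nodes_to_retry(
    nodes: Sequence[UnitTestNode], failed: Set[str]
) -> List[UnitTestNode]:
    # With separate nodes, only the versions that actually failed are retried.
    return [n for n in nodes if n.name in failed]
```

With `nodes = expand_unit_test("test_my_model", "my_model", [1, 2])`, a failure recorded only for `test_my_model_v2` leaves the v1 node out of the retry set, which a single bundled node cannot express.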

If we are unable to overcome the partial parsing issues with the above solution, we will have a known restriction: a unit test can only apply to 1 model version. We will then not allow folks to specify multiple versions of a model for a unit test to apply to. If no version is specified, we will use the latest version.

emmyoop added the user docs [docs.getdbt.com] Needs better documentation label Jan 5, 2024

github-actions bot changed the title ~~[Unit Testing] Unit Testing Versioned Models~~ [CT-3529] [Unit Testing] Unit Testing Versioned Models Jan 5, 2024

emmyoop mentioned this issue Jan 5, 2024

[CT-3195] [spike+] unit testing versioned models #8799

Closed

3 tasks

graciegoheen mentioned this issue Jan 5, 2024

[CT-2911] [Epic] Unit testing dbt models #8283

Closed

dbeatty10 mentioned this issue Jan 10, 2024

[CT-3039] Ensure that list command works for unit_tests and has reasonable output #8508

Closed

emmyoop assigned gshank Jan 16, 2024

gshank mentioned this issue Jan 22, 2024

Enable unit testing versioned models #9421

Merged

5 tasks

gshank closed this as completed in #9421 Jan 29, 2024
