[CT-2537] Include "compiled" node attributes in run results #7519

jtcohen6 · 2023-05-05T12:56:47Z

Proposed change

Let's include attributes that can vary at compile/runtime. Those attributes are:

compiled: bool = True (always False after parsing)
compiled_code (always empty after parsing) - this is most important
relation_name - this is set after parsing, but it can be updated for tests that have --store-failures enabled
depends_on.macros ([CT-500] Investigate differences in manifest.json output for 'compile' and 'run' commands #5079) - since additional macro dependencies are required/registered while a model is being materialized, versus just compiled (e.g. create_table_as)

(Here is where we set many of those attributes, during compilation)

The run_results.json produced by compile, docs generate, run/build/test/etc should include these fields (and only these fields) in addition to the node's unique_id. All other fields can be accessed from the manifest, and represent logical state at a point in time.

Background

Way back in v0.19, we removed the full node entry from run_results.json, and moved the compiled_code attribute into the manifest, so as to power dbt-docs from only manifest + catalog.
Starting in v1.4 (CT 1604 remove compiled classes #6384), we narrowed down the set of fields that differ between "uncompiled" and "compiled" nodes.

Rationale

The manifest represents "logical" state (parsed from project code + configs)
Run results + catalog represent "applied" state (as materialized by dbt, as the objects exist in the data warehouse)

Compiled code can be a form of "applied" state. If a model's SQL depends on the results of an introspective query, it can vary given different inputs from the data warehouse.

Acceptance Criteria

The run_results.json produced by compile, docs generate, run, build, and test should include the compiled, compiled_code, relation_name, and depends_on.macros fields (and only these fields) in addition to the node's unique_id.
The same information should also be included in the results returned by dbtRunner when is used to invoke dbt as a library.

The text was updated successfully, but these errors were encountered:

iknox-fa · 2023-05-15T18:58:00Z

@jtcohen6 per BLG: We're estimating this as written but have some concerns and would like to understand the use-case better-- Maybe a situation that would call for a new runtime artifact, compile_results.json?

jtcohen6 added enhancement New feature or request artifacts Team:Execution labels May 5, 2023

github-actions bot changed the title ~~Include "compiled" node attributes in run results~~ [CT-2537] Include "compiled" node attributes in run results May 5, 2023

jtcohen6 removed the Team:Execution label Jul 19, 2023

peterallenwebb mentioned this issue Aug 7, 2023

[Epic] Applied State (part 1) #8316

Closed

jtcohen6 assigned peterallenwebb Aug 16, 2023

peterallenwebb mentioned this issue Aug 24, 2023

Include Compiled Node Attributes in run_results.json #8492

Merged

4 tasks

peterallenwebb closed this as completed in #8492 Aug 30, 2023

jtcohen6 mentioned this issue Nov 6, 2023

Fix back compat for run_results pre-v5 #9009

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CT-2537] Include "compiled" node attributes in run results #7519

[CT-2537] Include "compiled" node attributes in run results #7519

jtcohen6 commented May 5, 2023 •

edited by peterallenwebb

Loading

iknox-fa commented May 15, 2023

[CT-2537] Include "compiled" node attributes in run results #7519

[CT-2537] Include "compiled" node attributes in run results #7519

Comments

jtcohen6 commented May 5, 2023 • edited by peterallenwebb Loading

Proposed change

Background

Rationale

Acceptance Criteria

iknox-fa commented May 15, 2023

jtcohen6 commented May 5, 2023 •

edited by peterallenwebb

Loading