[CT-1582] [Feature] Refactor tasks to breakout parsing the manifest into a separate piece #6357

ChenyuLInx · 2022-12-02T00:46:20Z

Is this your first time submitting a feature request?

I have read the expectations for open source contributors
I have searched the existing issues, and I could not find an existing issue for this feature
I am requesting a straightforward extension of existing dbt functionality, rather than a Big Idea better suited to a discussion

Describe the feature

Right now we are doing the parsing and generating a manifest object as part of the runtime_initialization under task.
This makes this important step of dbt hidden under multiple layer of inheritance.
This also makes use need to do hacky solutions to fit the usecases of dbt-server where we want to have some endpoint do parsing upon files being modified, and saves some kind of manifest object for faster dbt invocation.

In order to resolve this, we should refactor out the ManifestTask out of the inheritance chain of tasks into it's own module. Then we will initialize tasks that need parsing of the project with a constructed manifest that generated by the refactored out Manifest loader.

One thing tbd is where we want the compile step to happen after this refactor. (Stu: created #6708 to address abstract graph generation as well)

The text was updated successfully, but these errors were encountered:

jtcohen6 · 2022-12-02T10:17:33Z

This sounds right to me! dbt-server wants to be able to reuse an already-parsed manifest. We should enable that by creating a clean split between:

Steps for creating a manifest, i.e. ManifestLoader.get_full_manifest(self.config)
- This is called in the ManifestTask and in lib.parse_to_manifest
- Rather than taking the entire RuntimeConfig as its argument, we should pass in just the components we need, per [CT-1586] [Feature] Refactor tasks to be initialized using Profile, Project, and Flags instead of RuntimeConfig #6360)
Everything that happens after a full manifest is provided

One thing tbd is where we want the compile step to happen after this refactor.

For now, let's keep compilation as a step that happens separately from & subsequent to parsing. The important steps of compilation are:

Interpolating ephemeral model CTEs into models that ref() them — this actually mutates the manifest
Building a networkx graph from the manifest — which is different for dbt build, versus other commands, because build wants additional test edges

We can think in the future about whether we want to perform that first step (ephemeral model interpolation), and mutate the manifest, before caching it. We can also think about whether we want to additionally create & cache the graph object, created by compilation, for additional performance speedup. We might want to create two graph objects, one for build and one for non-build commands.

jtcohen6 · 2023-01-08T19:26:18Z

Related issues:

Prerequisite: [CT-926] dbt parse works in click #5550
Proposal: [CT-1767] [Feature] dbt parse should return a manifest #6547

jtcohen6 · 2023-01-25T09:49:41Z

Resolved by #6565

ChenyuLInx added enhancement New feature or request triage labels Dec 2, 2022

github-actions bot changed the title ~~[Feature] Refactor tasks to breakout parsing the manifest into a separate piece~~ [CT-1582] [Feature] Refactor tasks to breakout parsing the manifest into a separate piece Dec 2, 2022

This was referenced Dec 2, 2022

[CT-1583][Feature] Utilize new way of providing manifest for tasks dbt-labs/dbt-server#127

Closed

[CT-1581] [Epic] dbt-core as a library: first steps #6356

Closed

dbeatty10 added Refinement Maintainer input needed and removed triage labels Dec 2, 2022

jtcohen6 added python_api Issues related to dbtRunner Python entry point Team:Execution labels Dec 2, 2022

jtcohen6 removed the Refinement Maintainer input needed label Dec 2, 2022

leahwicz assigned stu-k Dec 7, 2022

iknox-fa mentioned this issue Jan 9, 2023

[CT-926] dbt parse works in click #5550

Closed

stu-k mentioned this issue Jan 10, 2023

Abstract manifest generation from tasks #6565

Merged

6 tasks

This was referenced Jan 11, 2023

[CT-901] [Epic] API-ification + CLI - Phase 1 #5527

Closed

[CT-1834] Merge feature/click-cli feature branch into main #6631

Closed

jtcohen6 closed this as completed Jan 25, 2023

jtcohen6 mentioned this issue Jan 26, 2023

[CT-1889] [Epic] API-ification + CLI - Phase 2 #6706

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CT-1582] [Feature] Refactor tasks to breakout parsing the manifest into a separate piece #6357

[CT-1582] [Feature] Refactor tasks to breakout parsing the manifest into a separate piece #6357

ChenyuLInx commented Dec 2, 2022 •

edited by stu-k

Loading

jtcohen6 commented Dec 2, 2022

jtcohen6 commented Jan 8, 2023

jtcohen6 commented Jan 25, 2023

[CT-1582] [Feature] Refactor tasks to breakout parsing the manifest into a separate piece #6357

[CT-1582] [Feature] Refactor tasks to breakout parsing the manifest into a separate piece #6357

Comments

ChenyuLInx commented Dec 2, 2022 • edited by stu-k Loading

Is this your first time submitting a feature request?

Describe the feature

jtcohen6 commented Dec 2, 2022

jtcohen6 commented Jan 8, 2023

jtcohen6 commented Jan 25, 2023

ChenyuLInx commented Dec 2, 2022 •

edited by stu-k

Loading