tasks that define their own arguments and types #2381

beckjake · 2020-04-30T18:03:05Z

Describe the feature

dbt should define task-specific argument parsing behavior in the tasks that use those arguments, instead of main.py. The output of argument parsing should be a typed object so tasks can be sure of what to rely on.

Currently, dbt-core defines tasks in the task/ folder and ties argument parsing to tasks in main.py. The argument parsing code in main.py all has explicit references to tasks and task names (and their RPC names, if they exist). It's difficult to know for sure out what args are safe to access in a given task or what they are. We are fortunate that argparse results in generally-reasonable and intuitive behavior, and that we've mostly kept the arguments available to various commands sane. But wouldn't it be nice if flake8/mypy just did that for us?

So I propose a couple related changes:

tie some sort of argparse-setting method to classes themselves
make the args object that ends up on configs be a typed dataclass
have main.py look at the tasks and use those to generate the argument parser

I think a cool and kind of fun thing that's probably not too hard would be to write "hologram for argparse", but that's not a requirement for this by any means.

Motivation

There are two big motivations here:
One is that it would be nice for RPC and CLI tasks to converge more. RPC tasks already define their arguments as typed objects. It doesn't make much sense to push those back into an argparse.Namespace, but going the other way does make a lot of sense because type systems are a nice way to find bugs before the people running your program do.

The second is that in the future, it would be great to provide some sort of pluggable tasks. One way of doing that is to make tasks a PEP 420 namespace package, like adapters and include already are, and then make tasks discoverable like adapters. That would be very nice for development. We could write some sort of dbt-dev-tools plugin that supplies useful tasks that are useful for developing dbt ("connected adapter in a repl", "render me this jinja file in this context", "jinja repl"). I can also imagine prototyping a dbt run+test that way, as an external package.

Describe alternatives you've considered

I could set up complicated and brittle local framework with git tag and git cherry-pick and a local-only branch that has debugging tasks. I kind of have that now, but I don't use it much because maintaining it when main.py changes is not really fun and getting it set up/torn down is a pain every time I use it.

Who will this benefit?

This is pretty developer-centric!

The text was updated successfully, but these errors were encountered:

drewbanin · 2020-05-01T14:08:53Z

I think there are a lot of people out there that would like the ability to register their own dbt tasks too :D

I'm into it!

github-actions · 2021-11-20T01:47:01Z

This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please remove the stale label or comment on the issue, or it will be closed in 7 days.

jtcohen6 · 2021-11-29T10:15:25Z

I'd still like to do this, or something like it. When we get there, we'll open a new issue :)

beckjake added enhancement New feature or request triage labels Apr 30, 2020

drewbanin removed the triage label May 1, 2020

jtcohen6 mentioned this issue Dec 22, 2020

Spark External Table Bugs dbt-labs/dbt-external-tables#53

Closed

3 tasks

jtcohen6 mentioned this issue Jun 17, 2021

Jinja context variable for selected resources #3471

Closed

jtcohen6 mentioned this issue Sep 2, 2021

dbt extensions / dbt jinja extensions #3830

Closed

jtcohen6 mentioned this issue Sep 28, 2021

Overriding/Extending default adapters in dbt project #3962

Closed

github-actions bot added the stale Issues that have gone stale label Nov 20, 2021

github-actions bot closed this as completed Nov 28, 2021

ChenyuLInx mentioned this issue Jan 26, 2022

[Feature] dbt hooks #4333

Closed

1 task

jtcohen6 mentioned this issue Apr 7, 2022

[CT-466] [Feature] Make run-operation accept selectors to be able to use the selected_resources Jinja variable #5005

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tasks that define their own arguments and types #2381

tasks that define their own arguments and types #2381

beckjake commented Apr 30, 2020 •

edited

Loading

drewbanin commented May 1, 2020

github-actions bot commented Nov 20, 2021

jtcohen6 commented Nov 29, 2021

tasks that define their own arguments and types #2381

tasks that define their own arguments and types #2381

Comments

beckjake commented Apr 30, 2020 • edited Loading

Describe the feature

Motivation

Describe alternatives you've considered

Who will this benefit?

drewbanin commented May 1, 2020

github-actions bot commented Nov 20, 2021

jtcohen6 commented Nov 29, 2021

beckjake commented Apr 30, 2020 •

edited

Loading