Skip to content

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mapping / Integrating DVC pipeline into production (Luigi/Airflow, for example) #1099

Closed
Casyfill opened this issue Sep 6, 2018 · 5 comments
Labels
enhancement Enhances DVC feature request Requesting a new feature

Comments

@Casyfill
Copy link

Casyfill commented Sep 6, 2018

it would be nice to have some options (or even a tutorial) on how to inject/map DVC pipelines onto popular frameworks, so that ideally ther won't be any need to work with the "production" code directly.

main problem here, in my humble opinion, is mapping and unifying parametets and paths

@efiop efiop added enhancement Enhances DVC feature request Requesting a new feature labels Sep 6, 2018
@efiop
Copy link
Contributor

efiop commented Sep 6, 2018

It is not very clear how to achieve it in an elegant way in the current state of dvc. We will have to take a closer look at it to see if there are any additions for dvc that will help with the integration. Going to investigate it pretty soon.

Thank you for the feedback!

@Casyfill
Copy link
Author

Casyfill commented Sep 6, 2018

True, and I am not terribly convinced this SHOULD be part of DVC per se (you know better though), but would love to see/think/talk out loud /

@villasv
Copy link
Contributor

villasv commented Nov 13, 2018

I do intend to work on this, but it would be done outside of DVC.

DVC is pretty agnostic as far as things are going to be executed. I imagine it wouldn't bee too complex to convert a DVC DAG into an Airflow DAG of bash operators that just do dvc repro on output files. I'm guessing this would be a DagBuilder function that doesn't quite fit in a PR for both projects.

I have my hands full at the moment improving my airflow aws stack, but perhaps I'll have some time by the end of the year.

@i2
Copy link

i2 commented Nov 20, 2018

I like DVC and I think this could be a very integral feature for DVC. I vote for it.

@nraw
Copy link

nraw commented Nov 13, 2019

It seems like the issue might not be as straight forward as one would assume at first, due to the point that dvc repro or dvc run would keep re-running previous parts of the code for each step in case these are not in the same environment or new commits are not being made every time. This is being discussed in as well #2212

@efiop efiop closed this as completed May 3, 2021
@iterative iterative locked and limited conversation to collaborators May 3, 2021

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Labels
enhancement Enhances DVC feature request Requesting a new feature
Projects
None yet
Development

No branches or pull requests

5 participants