-
Notifications
You must be signed in to change notification settings - Fork 910
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Load data from intermediate after processing? #517
Comments
So I believe that at the moment, this isn't something supported (at least out of the box) with Kedro. There is
Then You can then do |
Hi @jmrichardson, Unfortunately, this feature hasn't been supported by Kedro's high level API (Kedro context or CLI) although several Kedro users have requested: #30 @gotin I have seen 3 approaches by Kedro users.
Hope Kedro supports this feature as other tools such as Spotify's |
There's an ongoing discussion in https://discourse.kedro.community/t/speeding-up-pipeline-processing-with-change-detection/90 for how this could be supported. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Hi, I am new to Kedro and have been looking through the documentation and can't find a reference for automatically loading the intermediate (already processed dataset) vs processing each time I run a pipeline. In other words, I would like to pre process a file, save to intermediate location:
The above does that, but the next time I run "kedro run" it does the whole pipeline again even though the original source data file hasn't changed. Is there a way to enable caching when node hasn't changed and the data itself hasn't changed?
The text was updated successfully, but these errors were encountered: