Skip to content

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Debugging] Show which datasets are outdated #1704

Closed
2 tasks
francisduval opened this issue Jan 15, 2024 · 2 comments
Closed
2 tasks

[Debugging] Show which datasets are outdated #1704

francisduval opened this issue Jan 15, 2024 · 2 comments

Comments

@francisduval
Copy link

Description

When running kedro viz run, there is no way to know which datasets are up to date and which ones are outdated. A dataset is said to be outdated if the code upstream has changed since the dataset was run for the last time. This feature exists with the Targets package in R. Also, when you run the targets pipeline, only nodes that are outdated are run, which saves computing time.

Context

This could be a nice feature since without it, there is no effective way to tell which parts of the pipeline you should rerun when changes have been made to the code. Sometimes, you are unsure if a dataset is up to date or not, and then you have to rerun it to be sure, which can take a long time.

Possible Implementation

Color datasets that are outdated with another color. Also, it would be nice to have a kedro command that would only run outdated datasets, such as kedro run --only_outdated or kedro run --pipeline pipeline_name --only_outdated.

Checklist

  • In kedro viz, color outdated datasets with another color.
  • In kedro, include an option which ensures that only outdated datasets are run.
@astrojuanlu
Copy link
Member

astrojuanlu commented Jan 15, 2024

Somewhat related: kedro-org/kedro#221, kedro-org/kedro#2307

@rashidakanchwala rashidakanchwala moved this to Inbox in Kedro-Viz Jan 30, 2024
@NeroOkwa NeroOkwa moved this from Inbox to Backlog in Kedro-Viz Feb 12, 2024
@NeroOkwa
Copy link
Contributor

NeroOkwa commented Mar 25, 2024

Backlog grooming notes:

This was also highlighted in #1750, and would build on the dataset preview and debugging work stream. We should consider implementing this.
Next step - investigation of technical feasibility.

@rashidakanchwala rashidakanchwala changed the title Show which datasets are outdated [Debugging] Show which datasets are outdated Jul 18, 2024
@kedro-org kedro-org locked and limited conversation to collaborators Oct 23, 2024
@rashidakanchwala rashidakanchwala converted this issue into discussion #2151 Oct 23, 2024
@github-project-automation github-project-automation bot moved this from Backlog to Done in Kedro-Viz Oct 23, 2024

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Projects
Status: Done
Development

No branches or pull requests

3 participants