-
Notifications
You must be signed in to change notification settings - Fork 212
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ensure data refresh can still be run if no initial index exist #2727
Conversation
Thanks for submitting this fix @sruthiv98! You may want to look at our general setup guide for instructions on how to set up On testing, this particular case is pretty specific since it technically shouldn't happen during normal processing. I've confirmed that you can replicate the original problem using the following steps:
After your change, the behavior of step 4 should change so that it skips, but the rest of the DAG runs (it may fail, because there's no data, but as long as it runs even if Hope this helps! |
Thanks for the detailed testing notes!! Super helpful. Just ran this and |
Oh, I remember you mentioning something about running the initialization (via |
Oh this is how I tested it originally! But yes, just tried it again and the run still stalls with |
Sorry it's taken me so long to get back to this @sruthiv98! Just to confirm, did you also mark the |
c76c611
to
379c8a0
Compare
@sruthiv98 this required a few more changes than I anticipated, so in the interest of getting this merged I applied those changes and rebased. Thanks for your help on the initial portion! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for your contribution, @sruthiv98 🎉
Fixes
Fixes #2725 by @AetherUnbound
Description
This PR adds a
none_failed
trigger rule to thegenerate_index_suffix
task in the primary data refresh DAG. It ties together the old index retrieval and old index deletion tasks so both are either run or skipped together. It also adds anensure_downstream_runs
task at the end of the data refresh task group to ensure that the task state from promote is reflected downstream (rather than the task state fromtrigger_delete_index
which can now be skipped).Before
After _(ignore the failures, they're expected and should propagate all the way through to the end of the DAG)
Testing Instructions
See this comment below: #2727 (comment)
When these steps are run on this branch, both the
get_current_index
andtrigger_delete_index
steps should be skipped, but all other data refresh steps should run as intended (in the case provided, this means that all tasks after the data refresh task group should remain "upstream failed").Checklist
Update index.md
).main
) or a parent feature branch.Developer Certificate of Origin
Developer Certificate of Origin