Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use DagRun.run_id instead of execution_date when updating state of TIs(UI & REST API) #18724

Merged
merged 7 commits into from
Jan 14, 2022

Conversation

ephraimbuddy
Copy link
Contributor

@ephraimbuddy ephraimbuddy commented Oct 4, 2021

We can now use run_id as well as execution_date to update states
of task instances

Depends on #18642
Closes: #18672


^ Add meaningful description above

Read the Pull Request Guidelines for more information.
In case of fundamental code change, Airflow Improvement Proposal (AIP) is needed.
In case of a new dependency, check compliance with the ASF 3rd Party License Policy.
In case of backwards incompatible changes please leave a note in UPDATING.md.

@ephraimbuddy
Copy link
Contributor Author

I took the easiest way for this PR. There are a set of functions that uses the execution date underneath this PR but I limited the PR to the REST API. Let me know if those functions need to be touched too

@github-actions
Copy link

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 5 days if no further activity occurs. Thank you for your contributions.

@github-actions github-actions bot added the stale Stale PRs per the .github/workflows/stale.yml policy file label Nov 20, 2021
@uranusjr uranusjr removed the stale Stale PRs per the .github/workflows/stale.yml policy file label Nov 20, 2021
airflow/api/common/experimental/mark_tasks.py Outdated Show resolved Hide resolved
airflow/api_connexion/endpoints/task_instance_endpoint.py Outdated Show resolved Hide resolved
airflow/api_connexion/endpoints/task_instance_endpoint.py Outdated Show resolved Hide resolved
airflow/api_connexion/endpoints/task_instance_endpoint.py Outdated Show resolved Hide resolved
airflow/models/dag.py Outdated Show resolved Hide resolved
@ephraimbuddy ephraimbuddy force-pushed the add-run-id branch 2 times, most recently from 6c74ac7 to fa4ca9e Compare November 22, 2021 13:20
@ephraimbuddy ephraimbuddy marked this pull request as draft December 3, 2021 18:00
airflow/models/taskinstance.py Outdated Show resolved Hide resolved
airflow/models/taskinstance.py Outdated Show resolved Hide resolved
@ephraimbuddy ephraimbuddy force-pushed the add-run-id branch 4 times, most recently from c7b46d8 to acd7db4 Compare January 13, 2022 07:48
Comment on lines 1671 to 1680
input_execution_date = execution_date
if dag_run_id:
dag_run = (
session.query(DagRun).filter(DagRun.run_id == dag_run_id, DagRun.dag_id == self.dag_id).one()
) # Raises an error if not found
resolve_execution_date = dag_run.execution_date
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How about

if execution_date is None:
    dag_run = ...
    resolved_execution_date = dag_run.execution_date
else:
    resolved_execution_date = execution_date

This would simplify the later code to

end_date = resolve_execution_date if not future else None
start_date = resolve_execution_date if not future else None

Comment on lines +332 to 336
dag_run_id=run_id,
execution_date=execution_date,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This combined with the exactly_one check in set_task_instance_state means the request would error (with 500) if the user passes both execution_date and run_id.=

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We have schema level validation. So it won't allow both values to be supplied

@uranusjr uranusjr requested a review from ashb January 13, 2022 09:06
description: |
The task instance's DAG run ID. Either set this or execution_date but not both.

*New in version 2.2.3*
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

2.2.3 is already out now isn't it, So 2.2.4 or 2.3.0 (Probably 2.3, as this one doesn't feel like a bug fix so shouldn't be backported.)

ashb
ashb previously requested changes Jan 13, 2022
@@ -742,6 +742,34 @@ def error(self, session=NEW_SESSION):
session.merge(self)
session.commit()

@classmethod
@provide_session
def find(
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This function is only used for this one test, right? If so lets not add it (I'm not sure the pattern we have for DagRun.find etc is good)

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah its used one place -- but still, I don't think we need it.

ephraimbuddy and others added 6 commits January 13, 2022 12:55
Error when the task instance does not exist

Update airflow/api_connexion/endpoints/task_instance_endpoint.py

Co-authored-by: Ash Berlin-Taylor <[email protected]>

remove mock

use try/accept

Support the use of run_id in set_task_instance_state(REST API)

We can now use run_id as well as execution_date to update states
of task instances

deprecate execution_date

improve error message

add run_id to set_state function

require either the run_id or the execution_date

fixup! require either the run_id or the execution_date

Update airflow/api_connexion/endpoints/task_instance_endpoint.py

Co-authored-by: Tzu-ping Chung <[email protected]>

fixup! Update airflow/api_connexion/endpoints/task_instance_endpoint.py

Ensure supplying both execution_date and run_id raises for dag.set_task_instance_state

fixup! Ensure supplying both execution_date and run_id raises for dag.set_task_instance_state

Do not depreccate execution_date

fixup! Do not depreccate execution_date

Use func.count

use run_id instead of execution_date

fix tests and deprecate mark_tasks

fixup! fix tests and deprecate mark_tasks

fixup! fixup! fix tests and deprecate mark_tasks

fixup! fixup! fixup! fix tests and deprecate mark_tasks

fixup! fixup! fixup! fix tests and deprecate mark_tasks

fixup! fixup! fixup! fixup! fix tests and deprecate mark_tasks

fixup! Ensure task_instance exists before running update on its state(REST API)

Use execution_date for run/clear

fixup! Use execution_date for run/clear

constrain trino

Apply suggestions from code review

Co-authored-by: Ash Berlin-Taylor <[email protected]>

add execution_date

remove lib

fix tests

fixup! fix tests

apply reviews from code review

Fix typing

Fix tests

Apply suggestions from code review

Co-authored-by: Tzu-ping Chung <[email protected]>

fixup! Apply suggestions from code review

add find method to task instance

name execution_date explicitly

fix mypy

accept only run_id

accept only run_id

fixup! accept only run_id

Update airflow/models/taskinstance.py

Co-authored-by: Tzu-ping Chung <[email protected]>

update docstring
@ashb ashb dismissed their stale review January 13, 2022 14:06

Change made.

@ephraimbuddy ephraimbuddy added this to the Airflow 2.3.0 milestone Jan 13, 2022
@ephraimbuddy ephraimbuddy merged commit 2b4bf7f into apache:main Jan 14, 2022
@ephraimbuddy ephraimbuddy deleted the add-run-id branch January 14, 2022 08:48
ephraimbuddy added a commit to astronomer/airflow that referenced this pull request Jan 14, 2022
When I merged apache#18724 the jobs ran successfully but it's now failing in main.
This PR fixes it
@ephraimbuddy ephraimbuddy mentioned this pull request Jan 14, 2022
ephraimbuddy added a commit that referenced this pull request Jan 14, 2022
When I merged #18724 the jobs ran successfully but it's now failing in main.
This PR fixes it

Co-authored-by: Kaxil Naik <[email protected]>
Co-authored-by: Ash Berlin-Taylor <[email protected]>
@ephraimbuddy ephraimbuddy added the type:misc/internal Changelog: Misc changes that should appear in change log label Apr 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area:API Airflow's REST/HTTP API type:misc/internal Changelog: Misc changes that should appear in change log
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add run_id to this /dags/{dag_id}/updateTaskInstancesState API endpoint
3 participants