feat(sdk): add retry policy support to kfp v2 #7867

Merged

Conversation

connor-mccarthy
Member

@connor-mccarthy connor-mccarthy commented Jun 8, 2022

Description of your changes:
Adds retry policy support to KFP v2 SDK.

Checklist:

@connor-mccarthy connor-mccarthy requested a review from chensun June 8, 2022 21:03
@google-oss-prow

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@google-oss-prow google-oss-prow bot requested a review from zijianjoy June 8, 2022 21:04
@connor-mccarthy connor-mccarthy force-pushed the support-task-level-retry-v2 branch 2 times, most recently from 6b4b62e to e28dd1a Compare June 9, 2022 18:19
@connor-mccarthy
Member Author

/test all

@connor-mccarthy connor-mccarthy marked this pull request as ready for review June 9, 2022 18:26
@connor-mccarthy connor-mccarthy force-pushed the support-task-level-retry-v2 branch from e28dd1a to 64b026d Compare June 9, 2022 18:29
@connor-mccarthy connor-mccarthy marked this pull request as draft June 9, 2022 18:47
@connor-mccarthy connor-mccarthy force-pushed the support-task-level-retry-v2 branch from c0b1aa3 to 6f39253 Compare June 9, 2022 18:54
num_retries: int,
backoff_duration: Optional[str] = None,
backoff_factor: Optional[float] = None,
backoff_max_duration: Optional[str] = None) -> 'PipelineTask':
Member Author

Note: The v1 ContainerOp.set_retry method included a policy parameter. policy is specific to the Argo workflow spec and is not part of the IR RetryPolicy message.

Should we add it here but raise a DeprecationWarning indicating that policy is a no-op, or omit the parameter in v2 as a small breaking change?

cc: @chensun
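
The first option raised above could look roughly like the following. This is a minimal sketch, not the KFP implementation: `FakeTask` and its `retry` dict are hypothetical stand-ins for the real task class, and the warning text is made up for illustration.

```python
import warnings
from typing import Optional


class FakeTask:
    """Hypothetical stand-in for a pipeline task; not the KFP class."""

    def __init__(self) -> None:
        self.retry: dict = {}

    def set_retry(self,
                  num_retries: int,
                  backoff_duration: Optional[str] = None,
                  backoff_factor: Optional[float] = None,
                  backoff_max_duration: Optional[str] = None,
                  policy: Optional[str] = None) -> 'FakeTask':
        # Assumption: `policy` is accepted only for v1 compatibility and ignored.
        if policy is not None:
            warnings.warn(
                "'policy' is a no-op in this sketch and is ignored.",
                DeprecationWarning,
                stacklevel=2,
            )
        # Only the fields that map onto a retry policy are stored.
        self.retry = {
            'num_retries': num_retries,
            'backoff_duration': backoff_duration,
            'backoff_factor': backoff_factor,
            'backoff_max_duration': backoff_max_duration,
        }
        return self
```

The second option, omitting `policy` entirely, would instead make any v1-style call that passes `policy=` fail with a `TypeError`, which is the "small breaking change" being weighed.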

@connor-mccarthy connor-mccarthy marked this pull request as ready for review June 9, 2022 18:55
@connor-mccarthy connor-mccarthy force-pushed the support-task-level-retry-v2 branch from 6f39253 to e4dc3d2 Compare June 9, 2022 18:56
@connor-mccarthy
Member Author

/retest

3 similar comments

@connor-mccarthy connor-mccarthy force-pushed the support-task-level-retry-v2 branch from e4dc3d2 to c583250 Compare June 15, 2022 19:37
@connor-mccarthy
Member Author

/retest

4 similar comments

@connor-mccarthy connor-mccarthy force-pushed the support-task-level-retry-v2 branch from c583250 to 4105908 Compare June 17, 2022 15:39
sdk/RELEASE.md Outdated
@@ -4,7 +4,8 @@

## Breaking Changes

### For Pipeline Authors
### For Pipeline Authors]
Member

nit: extra ]


hello_world(text=text).set_retry(
    num_retries=3,
    backoff_duration='30s',
Member

As discussed, remove the parameters that don't work on Vertex.

Member

On a second look, it's fine to keep this as-is, as long as the samples under test_data/pipelines don't show these no-op usages.
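
For readers unfamiliar with the parameters in the sample above, here is one plausible reading of how `num_retries`, `backoff_duration`, `backoff_factor`, and `backoff_max_duration` could combine into a wait schedule. It is a sketch under stated assumptions, not the behavior of any particular backend: the helper names are hypothetical, the default values are placeholders chosen for illustration, only the `'<number>s'` duration form is parsed, and `backoff_max_duration` is treated as a budget on total waiting time. Per the discussion above, some of these parameters may be no-ops on a given backend.

```python
import re
from typing import List


def parse_seconds(duration: str) -> float:
    """Parse a duration such as '30s' into seconds (only the 's' suffix is handled)."""
    match = re.fullmatch(r'(\d+(?:\.\d+)?)s', duration)
    if match is None:
        raise ValueError(f'unsupported duration: {duration!r}')
    return float(match.group(1))


def backoff_schedule(num_retries: int,
                     backoff_duration: str = '0s',
                     backoff_factor: float = 2.0,
                     backoff_max_duration: str = '3600s') -> List[float]:
    """Return the wait (in seconds) before each retry under the assumed semantics."""
    delay = parse_seconds(backoff_duration)
    budget = parse_seconds(backoff_max_duration)
    waits: List[float] = []
    elapsed = 0.0
    for _ in range(num_retries):
        # Assumption: stop retrying once total waiting would exceed the budget.
        if elapsed + delay > budget:
            break
        waits.append(delay)
        elapsed += delay
        # Each subsequent wait grows by the backoff factor.
        delay *= backoff_factor
    return waits
```

Under these assumptions, `backoff_schedule(3, '30s')` yields waits of 30, 60, and 120 seconds before the three retries.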

@connor-mccarthy
Member Author

/retest

@connor-mccarthy
Member Author

/retest

@connor-mccarthy
Member Author

/test all

@connor-mccarthy connor-mccarthy requested a review from chensun July 6, 2022 15:55
@connor-mccarthy connor-mccarthy force-pushed the support-task-level-retry-v2 branch from 583677a to bd76c4d Compare July 8, 2022 15:49
@connor-mccarthy
Member Author

/retest

@connor-mccarthy
Member Author

/lgtm

@google-oss-prow

@connor-mccarthy: you cannot LGTM your own PR.

In response to this:

/lgtm

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@connor-mccarthy
Member Author

@chensun, can you re-LGTM after rebase and one-character cleanup commit?

Member

@chensun chensun left a comment

/lgtm

@google-oss-prow google-oss-prow bot added the lgtm label Jul 8, 2022
@google-oss-prow

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: chensun

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@connor-mccarthy
Member Author

/retest

@google-oss-prow google-oss-prow bot merged commit 850a750 into kubeflow:master Jul 11, 2022