Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP]add mpi-operator to the unified operator #1413

Closed
wants to merge 2 commits into from
Closed

[WIP]add mpi-operator to the unified operator #1413

wants to merge 2 commits into from

Conversation

hackerboy01
Copy link
Member

add mpi-operator to the unified operator

@google-cla
Copy link

google-cla bot commented Sep 22, 2021

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.


What to do if you already signed the CLA

Individual signers
Corporate signers

ℹ️ Googlers: Go here for more info.

@aws-kf-ci-bot
Copy link
Contributor

Hi @hackerboy01. Thanks for your PR.

I'm waiting for a kubeflow member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@hackerboy01
Copy link
Member Author

@googlebot I signed it!

@google-cla
Copy link

google-cla bot commented Sep 22, 2021

We found a Contributor License Agreement for you (the sender of this pull request), but were unable to find agreements for all the commit author(s) or Co-authors. If you authored these, maybe you used a different email address in the git commits than was used to sign the CLA (login here to double check)? If these were authored by someone else, then they will need to sign a CLA as well, and confirm that they're okay with these being contributed to Google.
In order to pass this check, please resolve this problem and then comment @googlebot I fixed it.. If the bot doesn't comment, it means it doesn't think anything has changed.

ℹ️ Googlers: Go here for more info.

@johnugeorge
Copy link
Member

/assign @terrytangyuan
/assign @alculquicondor

@gaocegege
Copy link
Member

Thanks for your contribution! 🎉 👍

Please deal with the CLA issues.

cla/google — Need a CLA for one or more commit authors

/cc @carmark

@hackerboy01 hackerboy01 changed the title add mpi-operator to the unified operator [WIP]add mpi-operator to the unified operator Sep 22, 2021
@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@google-cla
Copy link

google-cla bot commented Sep 22, 2021

All (the pull request submitter and all commit authors) CLAs are signed, but one or more commits were authored or co-authored by someone other than the pull request submitter.

We need to confirm that all authors are ok with their commits being contributed to this project. Please have them confirm that by leaving a comment that contains only @googlebot I consent. in this pull request.

Note to project maintainer: There may be cases where the author cannot leave a comment, or the comment is not properly detected as consent. In those cases, you can manually confirm consent of the commit author(s), and set the cla label to yes (if enabled on your project).

ℹ️ Googlers: Go here for more info.

@gaocegege
Copy link
Member

gaocegege commented Sep 22, 2021

Hi, there is no need to close the PR, you can just use your github verified email to sign the commit.

git commit -s

@hackerboy01 hackerboy01 deleted the test-pr branch September 22, 2021 09:09
@alculquicondor
Copy link

We actually want to migrate only the v2 controller: https://github.com/kubeflow/mpi-operator/tree/master/v2

@alculquicondor
Copy link

/hold

@Jeffwan
Copy link
Member

Jeffwan commented Sep 22, 2021

Address #1345

have you decided which version to support? Technically, if the crd version is different. We can support both because in the future, we will bump TF.PyTorch as well for elastic training support and there will be multiple version running

@alculquicondor
Copy link

alculquicondor commented Sep 22, 2021

We could support multiple API versions. But we should be running a single controller version.

EDIT: notably, we are lacking a conversion webhook to properly translate some fields.

@terrytangyuan
Copy link
Member

Ideally both versions can be supported and users can choose to opt-in a particular version (they don't necessarily need to serve two versions at the same time).

@alculquicondor
Copy link

That sounds like a maintenance nightmare. The new controller supports the same features as the old controller and more, it's more robust and has greater coverage. I don't see why we should migrate the old controller. If people really want to use it, they can refer to the implementation in the old repository.

Copy link
Member

@gaocegege gaocegege left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/ok-to-test

@aws-kf-ci-bot
Copy link
Contributor

@hackerboy01: The following test failed, say /retest to rerun all failed tests:

Test name Commit Details Rerun command
kubeflow-tf-operator-presubmit 4aa8372 link /test kubeflow-tf-operator-presubmit

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@alculquicondor
Copy link

This shouldn't merge without tests

@google-cla
Copy link

google-cla bot commented Nov 1, 2021

We found a Contributor License Agreement for you (the sender of this pull request), but were unable to find agreements for all the commit author(s) or Co-authors. If you authored these, maybe you used a different email address in the git commits than was used to sign the CLA (login here to double check)? If these were authored by someone else, then they will need to sign a CLA as well, and confirm that they're okay with these being contributed to Google.
In order to pass this check, please resolve this problem and then comment @googlebot I fixed it.. If the bot doesn't comment, it means it doesn't think anything has changed.

ℹ️ Googlers: Go here for more info.

@google-oss-robot
Copy link

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: hackerboy01
To complete the pull request process, please ask for approval from terrytangyuan after the PR has been reviewed.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@hackerboy01 hackerboy01 closed this Nov 1, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants