Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Skip flaky e2e MultiKueue test for XGBoost #2861

Merged

Conversation

mszadkow
Copy link
Contributor

@mszadkow mszadkow commented Aug 20, 2024

What type of PR is this?

/kind bug

What this PR does / why we need it:

Skips flaky test for e2e MultiKueue XGBoost, due to transitioning bug in training-operator.
Found here: kubeflow/training-operator#1711

Which issue(s) this PR fixes:

Part of #2838

Special notes for your reviewer:

Does this PR introduce a user-facing change?

NONE

@k8s-ci-robot
Copy link
Contributor

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@k8s-ci-robot k8s-ci-robot added release-note-none Denotes a PR that doesn't merit a release note. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. kind/bug Categorizes issue or PR as related to a bug. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Aug 20, 2024
Copy link

netlify bot commented Aug 20, 2024

Deploy Preview for kubernetes-sigs-kueue canceled.

Name Link
🔨 Latest commit 2f3240e
🔍 Latest deploy log https://app.netlify.com/sites/kubernetes-sigs-kueue/deploys/66c5a3ddf75a620008a15823

@k8s-ci-robot k8s-ci-robot added the size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. label Aug 20, 2024
@mbobrovskyi
Copy link
Contributor

mbobrovskyi commented Aug 20, 2024

Please rename "Fixes #2838" to "Part of #2838" or something like that to don't close issue.

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 20, 2024
@k8s-ci-robot
Copy link
Contributor

LGTM label has been added.

Git tree hash: 4fd59aee4ac5eaee2e826f0b64b4cb38ba2a73a5

@@ -633,6 +633,7 @@ var _ = ginkgo.Describe("MultiKueue", func() {
})

ginkgo.It("Should run a kubeflow XGBoostJob on worker if admitted", func() {
ginkgo.Skip("Skipped due to state transitioning bug in training-operator")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Put a comment with a link to the open issue in the training-operator repo

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree and I would have done that, but they don't have actual issue open, just the reference - kubeflow/training-operator#1711

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's use that issue

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe that only PyTorchJob e2e would be enough since we already commonized the KFJobs MK adapters into the kubeflowjob.MKAdapter in #2795.

So, after we merge this PR, can you refactor the e2e and integration tests for the KFJob MK adapter?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok, then I will use this.
@tenzen-y Yes I will publish the test refactor - I pretty much have it.

@k8s-ci-robot k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 21, 2024
@mszadkow mszadkow marked this pull request as ready for review August 21, 2024 08:23
@k8s-ci-robot k8s-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Aug 21, 2024
@k8s-ci-robot k8s-ci-robot requested a review from denkensk August 21, 2024 08:23
@mbobrovskyi mbobrovskyi removed their assignment Aug 21, 2024
@alculquicondor
Copy link
Contributor

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 21, 2024
@k8s-ci-robot
Copy link
Contributor

LGTM label has been added.

Git tree hash: 5af49054b2e5f745ccd882c565a1700d93ff9186

@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: alculquicondor, mszadkow

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 21, 2024
@k8s-ci-robot k8s-ci-robot merged commit 60cc84e into kubernetes-sigs:main Aug 21, 2024
16 checks passed
@k8s-ci-robot k8s-ci-robot added this to the v0.9 milestone Aug 21, 2024
@mszadkow mszadkow deleted the flaky/xgboost-e2e-multikueue-test branch August 23, 2024 07:08
kannon92 pushed a commit to openshift-kannon92/kubernetes-sigs-kueue that referenced this pull request Nov 19, 2024
* Skip flaky e2e multikueue test for XGBoost

* add comment with the link issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/bug Categorizes issue or PR as related to a bug. lgtm "Looks good to me", indicates that a PR is ready to be merged. release-note-none Denotes a PR that doesn't merit a release note. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants