Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Skip flaky e2e MultiKueue test for XGBoost #2861

Merged
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions test/e2e/multikueue/e2e_test.go
Original file line number Diff line number Diff line change
Expand Up @@ -633,6 +633,8 @@ var _ = ginkgo.Describe("MultiKueue", func() {
})

ginkgo.It("Should run a kubeflow XGBoostJob on worker if admitted", func() {
// Skipped due to known bug - https://github.com/kubeflow/training-operator/issues/1711
ginkgo.Skip("Skipped due to state transitioning bug in training-operator")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Put a comment with a link to the open issue in the training-operator repo

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree and I would have done that, but they don't have actual issue open, just the reference - kubeflow/training-operator#1711

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's use that issue

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe that only PyTorchJob e2e would be enough since we already commonized the KFJobs MK adapters into the kubeflowjob.MKAdapter in #2795.

So, after we merge this PR, can you refactor the e2e and integration tests for the KFJob MK adapter?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok, then I will use this.
@tenzen-y Yes I will publish the test refactor - I pretty much have it.

// Since it requires 1.5 CPU, this job can only be admitted in worker 1.
xgboostJob := testingxgboostjob.MakeXGBoostJob("xgboostjob1", managerNs.Name).
Queue(managerLq.Name).
Expand Down