[Core feature] Add support for KubeRay 1.0 #4244
Comments
I am running Flyte While looking for ways to debug it (also on Flyte Slack), I have noticed that kuberay version
FYI, after update to Flyte
Checked with It looks like it was caused by #4389 and that it is getting fixed with #4656.
I think this is now complete!
Motivation: Why do you think this is important?
KubeRay v0.3 through v0.6 are not stable and have many bugs and performance issues. People have run into tons of problems when using the KubeRay operator.
KubeRay issues:
0.3.0: KubeRay may sometimes create two RayJobs.
0.4.0: KubeRay fails to run the RayJob when multiple RayJobs (2+) run at the same time.
0.5.0: KubeRay adds an init container, but this init container doesn't have default requests and limits, so it can't run in the project-domain namespace.
0.5.1: Same issue as 0.5.
0.5.2: We can run the RayJob successfully but still have the problem in 0.3.
0.6.0: KubeRay added a Kubernetes Job to the RayJob, which is used to submit the Ray remote task, but it also doesn't have default requests and limits.
We should
Goal: What should the final outcome look like, ideally?
Flyte can run Ray tasks without any problems when using KubeRay 1.0.
Describe alternatives you've considered
NA
Propose: Link/Inline OR Additional context
No response