-
Notifications
You must be signed in to change notification settings - Fork 716
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[scalability testing] large number of jobs (100?) running concurrently? #829
Comments
/priority p1 |
/priority p2 Don't have large users hitting this as far as I'm aware |
/remove-priority p1 |
/assign |
Anything else left to do here? Otherwise we can close this. |
Results can be found at https://bit.ly/2CtWFn3. Closing this for now. |
@richardsliu: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
As a step towards reaching v1, I think we should consider doing a scalability test to see if we can correctly handle a large number 100-1000 of jobs running concurrently.
I think what we want to look at is:
For the latter part we might need to add instrumentation to TFOperator to report metrics about event processing to Prometheus. Would be good to sync with folks in K8s community to see what they do.
/cc @richardsliu
/cc @johnugeorge
The text was updated successfully, but these errors were encountered: