-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ensure ImportJobTest is not flaky by checking WriteToStore metric and requesting adequate resources for testing #332
Ensure ImportJobTest is not flaky by checking WriteToStore metric and requesting adequate resources for testing #332
Conversation
Also print random Redis element to debug that some FeatureRow has been ingested properly
Some tests (like ingestion test) expect the operation to complete in certain amount of time. This can only be guaranteed if the process have adequate CPU and memory. Without it, when the test cluster is overloaded, the test process may get little CPU time allocated and the expected completion time is no longer valid
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: davidheryanto The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Thanks for this @davidheryanto Should we perhaps rename the PR to be more descriptive? Its not clear that this ensures any less flakyness. |
/hold |
So we can obtain information about no of elements have been written in the pipeline without resorting to external metrics collector This method makes use built in metrics util in Apache Beam
…he ingestion result in ImportJobTest
/hold cancel |
/lgtm |
/retest |
@davidheryanto: Updated the
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
feast.ingestion.ImportJobTest sometimes fail unpredictably because not all the keys ingested in Redis are found during retrieval, but the test log does not provide useful info for debugging the failure.
Example of such error:
This pull request:
Adds resources request for the Pod running the test. Previously, the resources that the test Pod get assigned depends on how overloaded the test cluster is. By having a guaranteed amount of CPU and memory, the duration of tests should be more predictable.
Adds more debugging info when such test fails:
These should ensure
ImportJobTest
test result is reproducible.