[CI] TransportMonitoringMigrateAlertsActionTests.testLocalAlertsRemoval #66586
Pinging @elastic/es-core-features (Team:Core/Features) |
These have been failing more recently on Some possibly-relevant logs from the failures:
See elastic#66586 for details.
The stack trace mentioned above occurs because the monitoring test doesn't load the data stream plugin, so the watcher history index template can't be added: it uses a data stream, and the data stream field type is missing. I also noticed that other plugins that monitoring requires in order to run are not loaded (enrich for |
Adds the data-streams plugin to LocalStateMonitoring, along with dummy stats actions for ccr and enrich. The data stream plugin and the dummy transport actions added to LocalStateMonitoring allow the monitoring Java integration tests to function properly without printing error messages that make debugging harder. For example, the data stream plugin was added so that index templates with data streams can be added without constantly failing in the background, and the enrich stats dummy transport action was added so that the EnrichStatsCollector doesn't fail. Also unmutes the tests that were muted via #66586, to have another opportunity to look at the logs without all the noise; perhaps all these errors contributed to the test failures.
I'm muting |
There are a couple of fresh failures for this test among others that were unmuted. I was looking into this before the end of last year (turns out there was a duplicate issue #66391). From everything I was able to gather, something is delaying the setup of the
Another instance: https://gradle-enterprise.elastic.co/s/jcpiyio2mmcv6 |
I've muted both |
Apologies, I missed the updates to this issue. I've looked into the latest failure and I can confirm what @jbaiera has observed. In the case of the |
Relates to elastic#66586
unmute TransportMonitoringMigrateAlertsActionTests#testLocalAlertsRemoval and TransportMonitoringMigrateAlertsActionTests#testRepeatedLocalAlertsRemoval tests. Somehow the monitor watches are not installed during these tests. Both tests use the local exporter, and this exporter only installs the watches under specific conditions via the elected master node. I suspect the conditions are never met. The http exporter is more relaxed when attempting to install monitor watches, and the tests using the http exporter do not seem prone to failing because monitor watches have not been installed. Relates to elastic#66586
After adding more logging via #68752 and this test failure, that the setup of watcher is never attempted, however the local exporter setup is marked as completed and because of this during the The problem is
Update: Turns out that watches are not installed, because initially I will open a pr. |
Change tests to use the monitor bulk API on the elected master node before verifying that the watcher index exists. Sometimes the monitor service on the elected master doesn't yet export monitor documents, which causes tests that use the `ensureInitialLocalResources(...)` method to fail. Cluster alert watches are only installed when the local exporter tries to resolve a local bulk. Relates to elastic#66586
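The timing problem described above can be modelled in a few lines. What follows is a minimal, self-contained sketch and not Elasticsearch code: `LazyExporterSketch`, `openBulk`, `watchesInstalled` and `waitFor` are hypothetical names invented for illustration. It assumes only what the thread states, namely that watches are installed as a side effect of the local exporter opening a bulk on the elected master, so a test that merely polls for the watches can time out unless it triggers a bulk first.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.function.BooleanSupplier;

// Hypothetical stand-in for an exporter that installs its watches lazily.
// None of these names are real Elasticsearch classes or methods.
final class LazyExporterSketch {
    private final boolean electedMaster;
    private final AtomicBoolean installed = new AtomicBoolean(false);

    LazyExporterSketch(boolean electedMaster) {
        this.electedMaster = electedMaster;
    }

    // Assumption from the thread: watches are only installed when a bulk is
    // opened, and only on the elected master node.
    void openBulk() {
        if (electedMaster) {
            installed.set(true);
        }
    }

    boolean watchesInstalled() {
        return installed.get();
    }

    // Simplified polling check: retries until the condition holds or the
    // attempts run out. Returns false on timeout or interruption.
    static boolean waitFor(BooleanSupplier check, int attempts) {
        for (int i = 0; i < attempts; i++) {
            if (check.getAsBoolean()) {
                return true;
            }
            try {
                Thread.sleep(10);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return false;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Flaky shape: nothing opens a bulk, so polling alone never succeeds.
        LazyExporterSketch flaky = new LazyExporterSketch(true);
        System.out.println("without trigger: " + waitFor(flaky::watchesInstalled, 5));

        // Fixed shape: trigger a bulk on the elected master, then verify.
        LazyExporterSketch fixed = new LazyExporterSketch(true);
        fixed.openBulk();
        System.out.println("with trigger: " + waitFor(fixed::watchesInstalled, 5));
    }
}
```

In this toy model the first check times out and the second passes immediately, which mirrors the reasoning behind triggering the monitor bulk API before asserting that the watcher index exists.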
…69326) Re-enabled TransportMonitoringMigrateAlertsActionTests#testLocalAlertsRemoval and TransportMonitoringMigrateAlertsActionTests#testRepeatedLocalAlertsRemoval on 7.x branch. Includes changes from #69139 and #68752. Relates to #66586
Included commits:
* Add more trace logging when installing monitor watches and (#68752) unmute TransportMonitoringMigrateAlertsActionTests#testLocalAlertsRemoval and TransportMonitoringMigrateAlertsActionTests#testRepeatedLocalAlertsRemoval tests
* Manually trigger local exporter to open a bulk in some monitor tests. (#69139)
The tests haven't failed in almost 5 days. Also re-enabled these tests in 7.x branch. |
Encountered on a PR build https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+pull-request-2/15205/ (multiple tests failing: `testLocalExporterWithAlertingDisabled`, `testLocalAlertsRemoval` and `testDisabledLocalExporterAlertsRemoval`)
Build scan:
https://gradle-enterprise.elastic.co/s/or4i6aw2kxx6u
Repro line:
Reproduces locally?: No
Applicable branches: 7.11
Failure history:
https://gradle-enterprise.elastic.co/scans/tests?search.relativeStartTime=P7D&search.tags=CI&search.timeZoneId=America/Los_Angeles&tests.container=org.elasticsearch.xpack.monitoring.action.TransportMonitoringMigrateAlertsActionTests&tests.sortField=FAILED&tests.test=testLocalAlertsRemoval&tests.unstableOnly=true
Failure excerpt: