-
Notifications
You must be signed in to change notification settings - Fork 24.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SettingsBasedHostProviderIT#testClusterFormsByScanningPorts couldn't find a free port #34781
Comments
Pinging @elastic/es-distributed |
Muted the test in c5a0739. |
I think this failure is actually because the nodes failed to start up, and the message about leaking threads is because the test then bailed out without doing any cleanup:
Looking at elasticsearch/server/src/test/java/org/elasticsearch/discovery/zen/SettingsBasedHostProviderIT.java Lines 69 to 70 in 18e004c
We didn't, however, discuss what to do if this happened, so I'm marking this for team discussion. |
We discussed this in today's team sync. This test is testing a property that's important for the OOTB experience of using Elasticsearch and starting multiple nodes on the same host, even though it's not that useful in real production deployments. It's therefore something we want to continue to test. However it's always going to be tricky to reliably find two free ports that are ≤5 apart in an integration test, and there's no option to increase the "5" in this statement. We therefore decided that we should try and test this in a packaging test, since these run in a much cleaner environment. I think this specific test can be deleted and I will open an issue about adding this kind of packaging test after speaking to some people about it. |
Closed by #34813 |
It looks like the cluster from
SettingsBasedHostProviderIT#testClusterFormsWithSingleSeedHostInSettings
isn't always properly shut down (we can seeSEVERE: There are still zombie threads that couldn't be terminated: ...
in the logs). This causes the suite to fail as a whole because threads are leaked, and also the next caseSettingsBasedHostProviderIT#testClusterFormsByScanningPorts
fails with a port conflict. I haven't been able to reproduce this locally.Link to the build: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-unix-compatibility/os=centos/14/console
Example reproduction line:
Relevant excerpts from the logs:
Full build log: build_log.txt
The text was updated successfully, but these errors were encountered: