Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Many multijob-unix-compatibility CI runs fail due to absence of Docker #43096

Closed
droberts195 opened this issue Jun 11, 2019 · 6 comments
Closed
Labels
:Delivery/Build Build or test infrastructure Team:Delivery Meta label for Delivery team >test-failure Triaged test failures from CI

Comments

@droberts195
Copy link
Contributor

droberts195 commented Jun 11, 2019

Many multijob-unix-compatibility builds are failing with the error:

* What went wrong:
Docker (checked [/usr/bin/docker,/usr/local/bin/docker]) is required to run the following tasks: 
 :distribution:docker:buildDockerImage
 :distribution:docker:buildOssDockerImage
you can address this by attending to the reported issue, removing the offending tasks from being executed, or by passing -Dbuild.docker=false

Some examples are:

Those same 4 OSs fail like this every 8 hours. Either Docker needs to be added to the OS images used by CI or maybe the multijob-unix-compatibility jobs should explicitly exclude building the Docker distribution?

@droberts195 droberts195 added :Delivery/Build Build or test infrastructure >test-failure Triaged test failures from CI labels Jun 11, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-core-infra

@droberts195
Copy link
Contributor Author

@alpar-t
Copy link
Contributor

alpar-t commented Jun 11, 2019

Thanks for opening the issue @droberts195 this works as designed, we should bail when we try to build the docker images and no docker is available ( we have technical reasons for the not having it on the workers that don't have it btw ).
The test tasks that depend on docker are auto disabled when docker is not present, but not the image build itself. So check doesn't trigger the image build, but we have build in CI which depends on assemble and thus triggers the image builds.

The fix for this one is to switch to using check instead of build in CI for the support matrix specific tests.

@olksdr olksdr closed this as completed Jun 12, 2019
@bizybot
Copy link
Contributor

bizybot commented Jun 14, 2019

Reopening this issue based on the discussions in core infra channel.
Looks like we need to do something more here, as we think the fixture does not start and then causing failures in the cluster start. This needs more investigation.

https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-unix-compatibility/os=oraclelinux-7/37/console

> Configure project :test:fixtures:krb5kdc-fixture
Tests for :test:fixtures:krb5kdc-fixture require docker-compose at /usr/local/bin/docker-compose or /usr/bin/docker-compose but none could be found so these will be skipped

and then

* What went wrong:
Execution failed for task ':x-pack:qa:kerberos-tests:integTestCluster#wait'.
> Failed to locate seed node transport file [/var/lib/jenkins/workspace/elastic+elasticsearch+master+multijob-unix-compatibility/os/oraclelinux-7/x-pack/qa/kerberos-tests/build/cluster/integTestCluster node0/elasticsearch-8.0.0-SNAPSHOT/logs/transport.ports]: timed out waiting for it to be created after 40 seconds

@bizybot bizybot reopened this Jun 14, 2019
@alpar-t
Copy link
Contributor

alpar-t commented Jun 14, 2019

The problem here is that the build fixture task is explicitly enabled, overriding the configuration done in the plugin. The plugin disables it because no docker is available but the build scripts turns it back on.

@alpar-t
Copy link
Contributor

alpar-t commented Jun 14, 2019

Since this is a simple one liner change that's not covered by PR checks I pushed 83182d6 directly to fix it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Delivery/Build Build or test infrastructure Team:Delivery Meta label for Delivery team >test-failure Triaged test failures from CI
Projects
None yet
Development

No branches or pull requests

5 participants