org.elasticsearch.repositories.s3.RepositoryS3ClientYamlTestSuiteIT failures #40726

Closed
jkakavas opened this issue Apr 2, 2019 · 3 comments
Labels
:Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs >test-failure Triaged test failures from CI

jkakavas commented Apr 2, 2019

This failed in a PR build https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+pull-request-1/11310/console. I won't be muting it as it doesn't seem to reproduce locally.

https://gradle.com/s/mc2fttwxkboa2
https://gradle.com/s/of3fjbh6gv4kq

REPRODUCE WITH: ./gradlew :plugins:repository-s3:integTestMinioRunner \
-Dtests.seed=1C404B332F7967AD \
-Dtests.class=org.elasticsearch.repositories.s3.RepositoryS3ClientYamlTestSuiteIT \
-Dtests.method="test {yaml=repository_s3/20_repository_permanent_credentials/Register a repository with a non existing client}" -Dtests.security.manager=true -Dtests.locale=ar-LB -Dtests.timezone=America/Chihuahua -Dcompiler.java=12 -Druntime.java=8 -Dtests.rest.blacklist=repository_s3/30_repository_temporary_credentials/*,repository_s3/40_repository_ec2_credentials/*,repository_s3/50_repository_ecs_credentials/*
java.lang.AssertionError: Failure at [repository_s3/20_repository_permanent_credentials:21]: expected [2xx] status code but api [snapshot.delete] returned [429 Too Many Requests] [{"error":{"root_cause":[{"type":"circuit_breaking_exception","reason":"[parent] Data too large, data for [<http_request>] would be [498234232/475.1mb], which is larger than the limit of [493030604/470.1mb], real usage: [498234232/475.1mb], new bytes reserved: [0/0b]","bytes_wanted":498234232,"bytes_limit":493030604,"durability":"TRANSIENT","stack_trace":"CircuitBreakingException[[parent] Data too large, data for [<http_request>] would be [498234232/475.1mb], which is larger than the limit of [493030604/470.1mb], real usage: [498234232/475.1mb], new bytes reserved: [0/0b]]

Looks like a request to delete a snapshot returned a very large response that tripped the circuit breaker. There seem to have been quite a few failures in this class last week, but not this one exactly.

@elasticmachine

Pinging @elastic/es-distributed

original-brownbear commented Apr 2, 2019

Seems we just tripped the parent circuit breaker here (so it may not even be a huge request in the snapshot delete?), and it only happened this once. The other failures I see in the build stats were the result of an issue with the third-party tests that was fixed in #40588. Maybe we just saw some random slowness here and it tripped the circuit breaker?
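
To make the point about the parent breaker concrete, here is a rough sketch of the arithmetic as it can be read off the error message above. It is not the Elasticsearch implementation: the class `ParentBreakerCheck` and its field names are hypothetical, and the rule "would be = real usage + new bytes reserved, trip if that exceeds the limit" is an assumption inferred from the message text. Only the three byte counts come from the failure.

```python
from dataclasses import dataclass


@dataclass
class ParentBreakerCheck:
    """Hypothetical, simplified model of the parent breaker check.

    Assumption: the breaker compares (real usage + new bytes reserved)
    against the limit, as the wording of the error message suggests.
    """

    real_usage: int          # "real usage" from the error message, in bytes
    new_bytes_reserved: int  # "new bytes reserved" from the error message
    limit: int               # "limit of" from the error message

    @property
    def would_be(self) -> int:
        # Total the breaker would account for if the request were admitted.
        return self.real_usage + self.new_bytes_reserved

    @property
    def trips(self) -> bool:
        return self.would_be > self.limit

    @property
    def tripped_without_request(self) -> bool:
        # True when the usage alone already exceeds the limit, i.e. the
        # size of the incoming request did not matter.
        return self.real_usage > self.limit


# Byte counts copied from the failure reported in this issue.
check = ParentBreakerCheck(
    real_usage=498234232,
    new_bytes_reserved=0,
    limit=493030604,
)

overshoot_mb = (check.would_be - check.limit) / (1024 * 1024)

print(f"would be: {check.would_be} bytes, limit: {check.limit} bytes")
print(f"trips: {check.trips}, regardless of request size: {check.tripped_without_request}")
print(f"overshoot: {overshoot_mb:.2f} MiB")
```

Under that assumption, the request reserved 0 new bytes and the reported usage alone was about 5 MiB over the limit, which is consistent with the snapshot delete not being a huge request.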

@original-brownbear

I haven't seen this again since this issue was opened 3 weeks ago, and I think it's likely not an issue specific to the S3 plugin, as explained above -> closing here.
