Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] AzureSnapshotBasedRecoveryIT testRecoveryUsingSnapshots failing #111377

Closed
elasticsearchmachine opened this issue Jul 27, 2024 · 3 comments · Fixed by #111397 or #111569
Closed

[CI] AzureSnapshotBasedRecoveryIT testRecoveryUsingSnapshots failing #111377

elasticsearchmachine opened this issue Jul 27, 2024 · 3 comments · Fixed by #111397 or #111569
Labels
:Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs needs:risk Requires assignment of a risk label (low, medium, blocker) Team:Distributed Meta label for distributed team (obsolete) >test-failure Triaged test failures from CI

Comments

@elasticsearchmachine
Copy link
Collaborator

elasticsearchmachine commented Jul 27, 2024

Build Scans:

Reproduction Line:

./gradlew ":x-pack:plugin:snapshot-based-recoveries:qa:azure:javaRestTest" --tests "org.elasticsearch.xpack.snapshotbasedrecoveries.recovery.AzureSnapshotBasedRecoveryIT.testRecoveryUsingSnapshots" -Dtests.seed=9A7F28CC7BD51B6D -Dtests.locale=pl -Dtests.timezone=MET -Druntime.java=17

Applicable branches:
main

Reproduces locally?:
N/A

Failure History:
See dashboard

Failure Message:

org.elasticsearch.client.ResponseException: method [PUT], host [http://[::1]:42705], URI [/_snapshot/repository?verify=true], status line [HTTP/1.1 500 Internal Server Error]
---
error:
  root_cause:
  - type: "invalid_algorithm_parameter_exception"
    reason: "invalid_algorithm_parameter_exception: the trustAnchors parameter must\
      \ be non-empty"
  type: "repository_verification_exception"
  reason: "[repository] path [_snapshot_based_recoveries_tests_9A7F28CC7BD51B6D] is\
    \ not accessible on master node"
  caused_by:
    type: "runtime_exception"
    reason: "runtime_exception: Unexpected error: java.security.InvalidAlgorithmParameterException:\
      \ the trustAnchors parameter must be non-empty"
    caused_by:
      type: "invalid_algorithm_parameter_exception"
      reason: "invalid_algorithm_parameter_exception: the trustAnchors parameter must\
        \ be non-empty"
    suppressed:
    - type: "exception"
      reason: "exception: #block terminated with an error"
status: 500

Issue Reasons:

  • [main] 2 consecutive failures in test testRecoveryUsingSnapshots
  • [main] 3 consecutive failures in step openjdk17_checkpart2_java-matrix
  • [main] 2 consecutive failures in step graalvm-ce17_checkpart2_java-matrix
  • [main] 16 failures in test testRecoveryUsingSnapshots (3.1% fail rate in 517 executions)
  • [main] 3 failures in step openjdk17_checkpart2_java-matrix (100.0% fail rate in 3 executions)
  • [main] 2 failures in step graalvm-ce17_checkpart2_java-matrix (100.0% fail rate in 2 executions)
  • [main] 6 failures in step openjdk17_checkpart2_java-fips-matrix (85.7% fail rate in 7 executions)
  • [main] 5 failures in step part-2-fips (83.3% fail rate in 6 executions)
  • [main] 8 failures in pipeline elasticsearch-periodic (66.7% fail rate in 12 executions)

Note:
This issue was created using new test triage automation. Please report issues or feedback to es-delivery.

@elasticsearchmachine elasticsearchmachine added :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs >test-failure Triaged test failures from CI labels Jul 27, 2024
@elasticsearchmachine
Copy link
Collaborator Author

This has been muted on branch main

Mute Reasons:

  • [main] 5 consecutive failures in step openjdk17_checkpart2_java-fips-matrix
  • [main] 5 consecutive failures in step part-2-fips
  • [main] 12 failures in test testRecoveryUsingSnapshots (3.2% fail rate in 372 executions)
  • [main] 5 failures in step openjdk17_checkpart2_java-fips-matrix (100.0% fail rate in 5 executions)
  • [main] 5 failures in step part-2-fips (100.0% fail rate in 5 executions)
  • [main] 7 failures in pipeline elasticsearch-periodic (77.8% fail rate in 9 executions)

Build Scans:

elasticsearchmachine added a commit that referenced this issue Jul 27, 2024
…apshotBasedRecoveryIT testRecoveryUsingSnapshots #111377
@elasticsearchmachine elasticsearchmachine added Team:Distributed Meta label for distributed team (obsolete) needs:risk Requires assignment of a risk label (low, medium, blocker) labels Jul 27, 2024
@elasticsearchmachine
Copy link
Collaborator Author

Pinging @elastic/es-distributed (Team:Distributed)

DaveCTurner added a commit to DaveCTurner/elasticsearch that referenced this issue Jul 29, 2024
We cannot (easily) override the JVM trust store in FIPS tests. This
commit falls back to using HTTP in these cases.

Closes elastic#111279
Closes elastic#111345
Closes elastic#111307
Closes elastic#111377
Closes elastic#111280
elasticsearchmachine added a commit that referenced this issue Jul 30, 2024
…apshotBasedRecoveryIT testRecoveryUsingSnapshots #111377
@elasticsearchmachine
Copy link
Collaborator Author

This has been muted on branch main

Mute Reasons:

  • [main] 2 consecutive failures in step graalvm-ce17_checkpart2_java-matrix
  • [main] 2 consecutive failures in step openjdk17_checkpart2_java-matrix
  • [main] 15 failures in test testRecoveryUsingSnapshots (2.9% fail rate in 516 executions)
  • [main] 2 failures in step graalvm-ce17_checkpart2_java-matrix (100.0% fail rate in 2 executions)
  • [main] 6 failures in step openjdk17_checkpart2_java-fips-matrix (85.7% fail rate in 7 executions)
  • [main] 5 failures in step part-2-fips (83.3% fail rate in 6 executions)
  • [main] 2 failures in step openjdk17_checkpart2_java-matrix (100.0% fail rate in 2 executions)
  • [main] 8 failures in pipeline elasticsearch-periodic (66.7% fail rate in 12 executions)

Build Scans:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs needs:risk Requires assignment of a risk label (low, medium, blocker) Team:Distributed Meta label for distributed team (obsolete) >test-failure Triaged test failures from CI
Projects
None yet
1 participant