[CI] :qa:full-cluster-restart:v7.0.1#upgradedClusterTest failing #91470

pxsalehi · 2022-11-09T15:37:19Z

CI Link

https://gradle-enterprise.elastic.co/s/ptwgurvvtzx5s

Repro line

Probably with ./gradlew :qa:full-cluster-restart:v7.0.1#upgradedClusterTest

Does it reproduce?

Didn't try

Applicable branches

main

Failure history

No response

Failure excerpt

See https://gradle-enterprise.elastic.co/s/ptwgurvvtzx5s/console-log/raw?task=:qa:full-cluster-restart:v7.0.1%23upgradedClusterTest

It also fails for other BWC versions: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+main+periodic+bwc/
(7.0.0 to 7.1.1)

» [2022-11-09T13:11:28,587][ERROR][o.e.b.ElasticsearchUncaughtExceptionHandler] [v7.0.1-1] fatal error in thread [elasticsearch[v7.0.1-1][masterService#updateTask][T#1]], exiting java.lang.AssertionError: {[testclosedindices][0]=2}
»  	at [email protected]/org.elasticsearch.cluster.routing.allocation.allocator.DesiredBalanceReconciler.allocateUnassignedInvariant(DesiredBalanceReconciler.java:118)
»  	at [email protected]/org.elasticsearch.cluster.routing.allocation.allocator.DesiredBalanceReconciler.run(DesiredBalanceReconciler.java:81)
»  	at [email protected]/org.elasticsearch.cluster.routing.allocation.allocator.DesiredBalanceShardsAllocator.recordTime(DesiredBalanceShardsAllocator.java:299)
»  	at [email protected]/org.elasticsearch.cluster.routing.allocation.allocator.DesiredBalanceShardsAllocator.reconcile(DesiredBalanceShardsAllocator.java:213)
»  	at [email protected]/org.elasticsearch.cluster.routing.allocation.allocator.DesiredBalanceShardsAllocator$ReconcileDesiredBalanceExecutor.lambda$applyBalance$1(DesiredBalanceShardsAllocator.java:277)
»  	at [email protected]/org.elasticsearch.cluster.routing.allocation.AllocationService.reroute(AllocationService.java:518)
»  	at [email protected]/org.elasticsearch.cluster.routing.allocation.AllocationService.executeWithRoutingAllocation(AllocationService.java:444)
»  	at [email protected]/org.elasticsearch.cluster.ClusterModule.reconcile(ClusterModule.java:145)
»  	at [email protected]/org.elasticsearch.cluster.routing.allocation.allocator.DesiredBalanceShardsAllocator$ReconcileDesiredBalanceExecutor.applyBalance(DesiredBalanceShardsAllocator.java:275)
»  	at [email protected]/org.elasticsearch.cluster.routing.allocation.allocator.DesiredBalanceShardsAllocator$ReconcileDesiredBalanceExecutor.execute(DesiredBalanceShardsAllocator.java:261)
»  	at [email protected]/org.elasticsearch.cluster.service.MasterService.innerExecuteTasks(MasterService.java:1052)
»  	at [email protected]/org.elasticsearch.cluster.service.MasterService.executeTasks(MasterService.java:1017)
»  	at [email protected]/org.elasticsearch.cluster.service.MasterService.runTasks(MasterService.java:278)
»  	at [email protected]/org.elasticsearch.cluster.service.MasterService$Batcher.run(MasterService.java:170)
»  	at [email protected]/org.elasticsearch.cluster.service.TaskBatcher.runIfNotProcessed(TaskBatcher.java:110)
»  	at [email protected]/org.elasticsearch.cluster.service.TaskBatcher$BatchedTask.run(TaskBatcher.java:148)
»  	at [email protected]/org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:850)
»  	at [email protected]/org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.runAndClean(PrioritizedEsThreadPoolExecutor.java:257)
»  	at [email protected]/org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.run(PrioritizedEsThreadPoolExecutor.java:223)
»  	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
»  	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
»  	at java.base/java.lang.Thread.run(Thread.java:833)

The text was updated successfully, but these errors were encountered:

elasticsearchmachine · 2022-11-09T15:37:42Z

Pinging @elastic/es-distributed (Team:Distributed)

mark-vieira · 2022-11-10T22:58:28Z

@DaveCTurner Given when this started failing and the stacktraces could #91343 be the culprit here? Looks like this is limited to upgrading from nodes earlier than 7.2.0.

DaveCTurner · 2022-11-21T13:48:57Z

Yeah, well at least #91343 introduced the assertion that's tripping here. Not yet clear why it would be tripping tho, but closed indices from pre-7.2 nodes are a special and interesting corner case (#33888). If I don't see an obvious problem soon I'll try and work out a way to mute these tests.

This assertion fails in the presence of pre-7.2.0 closed indices because such indices don't even have routing table entries. Relates elastic#33888 Closes elastic#91470

This assertion fails in the presence of pre-7.2.0 closed indices because such indices don't even have routing table entries. Relates #33888 Closes #91470

This assertion fails in the presence of pre-7.2.0 closed indices because such indices don't even have routing table entries. Relates elastic#33888 Closes elastic#91470

This assertion fails in the presence of pre-7.2.0 closed indices because such indices don't even have routing table entries. Relates #33888 Closes #91470

pxsalehi added >test-failure Triaged test failures from CI :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) labels Nov 9, 2022

elasticsearchmachine added the Team:Distributed Meta label for distributed team (obsolete) label Nov 9, 2022

jdconrad mentioned this issue Nov 10, 2022

Add fielddata and scripting support for byte-sized vectors #91184

Merged

DaveCTurner self-assigned this Nov 21, 2022

DaveCTurner mentioned this issue Nov 21, 2022

Skip ancient closed indices in desired balance #91765

Merged

elasticsearchmachine closed this as completed in #91765 Nov 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI] :qa:full-cluster-restart:v7.0.1#upgradedClusterTest failing #91470

[CI] :qa:full-cluster-restart:v7.0.1#upgradedClusterTest failing #91470

pxsalehi commented Nov 9, 2022 •

edited

Loading

elasticsearchmachine commented Nov 9, 2022

mark-vieira commented Nov 10, 2022 •

edited

Loading

DaveCTurner commented Nov 21, 2022

[CI] :qa:full-cluster-restart:v7.0.1#upgradedClusterTest failing #91470

[CI] :qa:full-cluster-restart:v7.0.1#upgradedClusterTest failing #91470

Comments

pxsalehi commented Nov 9, 2022 • edited Loading

CI Link

Repro line

Does it reproduce?

Applicable branches

Failure history

Failure excerpt

elasticsearchmachine commented Nov 9, 2022

mark-vieira commented Nov 10, 2022 • edited Loading

DaveCTurner commented Nov 21, 2022

pxsalehi commented Nov 9, 2022 •

edited

Loading

mark-vieira commented Nov 10, 2022 •

edited

Loading