[BUG] Stuck segments upload leads to high refresh lag #11020
Labels
bug (Something isn't working)
Storage:Durability (Issues and PRs related to the durability framework)
Storage:Remote
Describe the bug
When segments are being uploaded to the remote store and a flush or force merge happens, the segment tracker's data can be erased while an async refresh retry is in progress. As a result, the latch is never counted down.
OpenSearch/server/src/main/java/org/opensearch/index/shard/RemoteStoreRefreshListener.java
Lines 388 to 403 in a2febe9
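To make the failure mode easier to discuss, here is a minimal, self-contained sketch of the general pattern. It is not the code from `RemoteStoreRefreshListener`: the `Tracker` class, its methods, and the file name `_0.cfs` are hypothetical stand-ins, and it assumes the completion path finds the latch through tracker state. If that state is cleared between the start of an upload and its completion callback, nothing counts the latch down and the waiter stays blocked.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

final class StuckLatchSketch {

    // Hypothetical stand-in for the segment tracker's per-file upload state.
    static final class Tracker {
        private final Map<String, CountDownLatch> pendingUploads = new ConcurrentHashMap<>();

        // Registers an in-flight upload and returns the latch a waiter blocks on.
        CountDownLatch startUpload(String file) {
            CountDownLatch latch = new CountDownLatch(1);
            pendingUploads.put(file, latch);
            return latch;
        }

        // Completion callback: it can only release the latch it finds in the map.
        void onUploadComplete(String file) {
            CountDownLatch latch = pendingUploads.remove(file);
            if (latch != null) {
                latch.countDown();
            }
            // If the entry was erased in the meantime, nothing is counted down.
        }

        // Simulates tracker data being erased (assumed here to happen on flush/force merge).
        void clear() {
            pendingUploads.clear();
        }
    }

    // Returns true if the waiter was released, false if it timed out.
    static boolean runScenario(boolean clearTrackerMidUpload) {
        Tracker tracker = new Tracker();
        CountDownLatch latch = tracker.startUpload("_0.cfs");
        if (clearTrackerMidUpload) {
            tracker.clear();
        }
        tracker.onUploadComplete("_0.cfs");
        try {
            return latch.await(100, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("no clear: released=" + runScenario(false));           // released=true
        System.out.println("cleared mid-upload: released=" + runScenario(true));  // released=false
    }
}
```

In the sketch the waiter uses a timed `await`, so the demo terminates. A waiter that blocks without a timeout would stay stuck indefinitely, which matches the symptom of refreshes not making progress.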
To Reproduce
Run indexing for a long time.
Expected behavior
Refreshes should never get stuck.