-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: [benchmark][cluster] search raises error fail to search on QueryNode 7: distribution is not servcieable: channel not available
in concurrent dql and dml(multi-partition) scene
#33550
Comments
/assign @zhagnlu |
same error, different sceneargo task: multi-vector-corn-1-1718892000 server:
client pod name: multi-vector-corn-1-1718892000-3432833946
test steps:
test result:
|
Recurrentargo task: multi-vector-corn-1-1722866400 server:
test result:
|
Recurrent on 2.5 branchargo task: fouramf-85cth server:
client log:
test steps:
test result:
|
) issue: #33550 cause concurrent issue may occur between remove parition in target manager and sync segment list to delegator. when it happens, some segment may be released in delegator, and those segment may also be synced to delegator, which cause delegator become unserviceable due to lack of necessary segments, then search/query fails. this PR make sure that all write access to target_manager will be executed in serial to avoid the concurrent issues. Signed-off-by: Wei Liu <[email protected]>
…vus-io#35919) issue: milvus-io#33550 cause concurrent issue may occur between remove parition in target manager and sync segment list to delegator. when it happens, some segment may be released in delegator, and those segment may also be synced to delegator, which cause delegator become unserviceable due to lack of necessary segments, then search/query fails. this PR make sure that all write access to target_manager will be executed in serial to avoid the concurrent issues. Signed-off-by: Wei Liu <[email protected]>
) (#36019) issue: #33550 pr: #35919 cause concurrent issue may occur between remove parition in target manager and sync segment list to delegator. when it happens, some segment may be released in delegator, and those segment may also be synced to delegator, which cause delegator become unserviceable due to lack of necessary segments, then search/query fails. this PR make sure that all write access to target_manager will be executed in serial to avoid the concurrent issues. Signed-off-by: Wei Liu <[email protected]>
issue: #33550 cause wrong impl of UpdateCollectionNextTarget, if ReleaseCollection and UpdateCollectionNextTarget happens at same time, the the released partition's segment list may be add to target again, and delegator will be marked as unserviceable due to lack of segment. This PR fix the impl of UpdateCollectionNextTarget Signed-off-by: Wei Liu <[email protected]>
issue: milvus-io#33550 cause wrong impl of UpdateCollectionNextTarget, if ReleaseCollection and UpdateCollectionNextTarget happens at same time, the the released partition's segment list may be add to target again, and delegator will be marked as unserviceable due to lack of segment. This PR fix the impl of UpdateCollectionNextTarget Signed-off-by: Wei Liu <[email protected]>
issue: #33550 pr: #37049 cause wrong impl of UpdateCollectionNextTarget, if ReleaseCollection and UpdateCollectionNextTarget happens at same time, the the released partition's segment list may be add to target again, and delegator will be marked as unserviceable due to lack of segment. This PR fix the impl of UpdateCollectionNextTarget Signed-off-by: Wei Liu <[email protected]>
reproduceargo task: memory-opt-scenes-2x7j4 server:
client logs:
test steps:
@weiliu1031 please help to check |
should be fixed by #37694, please verify this with latest image |
/assign @wangting0128 |
Recurrent on 2.5 branchargo task: fouramf-bitmap-scenes-h7r46 server:
client log:
test step:
/assign @weiliu1031 please help on this |
issue: #33550 balance segment and balance segment execute at same time, which will cause bounch of corner case. This PR disable simultaneous balance of segments and channels Signed-off-by: Wei Liu <[email protected]>
…#37850) issue: milvus-io#33550 balance segment and balance segment execute at same time, which will cause bounch of corner case. This PR disable simultaneous balance of segments and channels Signed-off-by: Wei Liu <[email protected]>
…37939) issue: #33550 pr: #37850 balance segment and balance segment execute at same time, which will cause bounch of corner case. This PR disable simultaneous balance of segments and channels Signed-off-by: Wei Liu <[email protected]>
…#37850) issue: milvus-io#33550 balance segment and balance segment execute at same time, which will cause bounch of corner case. This PR disable simultaneous balance of segments and channels Signed-off-by: Wei Liu <[email protected]>
Did you verify it on master? |
verification passed, not reproduce recentlyargo task: fouramf-concurrent-xcq |
Is there an existing issue for this?
Environment
Current Behavior
argo task: multi-vector-corn-1-1717336800
test case name: test_hybrid_search_locust_dql_dml_partition_hybrid_search_cluster
server:
client pod name: multi-vector-corn-1-1717336800-161976730
client logs:
hybrid search request raises error
Expected Behavior
No response
Steps To Reproduce
Milvus Log
No response
Anything else?
test result:
The text was updated successfully, but these errors were encountered: