Compatibility with segment replication #974
Comments
Request owners to add |
IMO the following changes will be needed for the alerting plugin to be compatible with segrep
Tagging @lezzago to review |
Hi Plugin Owners, |
Discussed this further with @dreamer-89. Currently, the system/hidden indices that the plugin uses to maintain state (Alerting Config, Findings, Alerts, etc.) will continue to work as is in release 2.9, as we are not onboarding them to start using the
|
I don't think so since it is stated that I believe the only data that we write and immediately try to read are docs in the doc_level_queries index for document level alerting. We would need to modify the code that reads that index so that it fetches from primary shards only, to ensure the guarantee. |
Thanks @lezzago for the update. Yes, using |
@dreamer-89, I have noticed one potential issue for customers by enabling segment replication. If the customer is using percolate queries, they need to index their queries before running the percolate query. By design, that will require a strong read on the recently indexed queries. Has core made changes to ensure that it could handle this use case and just query on the primary shards of the indexed queries? |
Closing as once core makes the necessary changes to handle seg rep, the alerting plugin will be fine. |
Reopening until opensearch-project/OpenSearch#9669 is resolved. |
@lezzago : Can you please share more details why percolate queries may not work as intended with segment replication feature on core issue opensearch-project/OpenSearch#9669 ? |
In the plugin, we use percolate queries for our Document Level Monitor. Whenever the monitor runs, we update an Alerting index with the updated queries and schema mappings, with an immediate refresh policy set on the write. Then we run a percolate query search that uses the Alerting index as the query index store and searches the data that the monitor needs to evaluate. With the seg rep changes, the percolate query code inside OpenSearch core needs to ensure that the query index store it is searching is up to date, by searching on the primary shards. If this doesn't happen, the Document Level Monitor could fail to fetch all the data and miss generating alerts when it should have. That could have a big impact on the customer and is very bad, as this would be a silent error that they would not know about. Additionally, if Document Level Monitors cannot ensure they are fetching all the data, it would have big repercussions for the Security-Analytics plugin, which relies heavily on Document Level Monitors for its detectors and could miss security issues for the users of that plugin. |
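For illustration only, the read half of the flow described above can be sketched as a percolate search that carries a primary-shards-only preference. This is a minimal sketch, not the plugin's code: the index name `doc-level-queries-example`, the percolator field name `query`, and the helper `build_percolate_request` are all hypothetical, and whether a search preference alone is enough for the percolate path is exactly the open question tracked in opensearch-project/OpenSearch#9669.

```python
import json


def build_percolate_request(query_index, percolator_field, document):
    """Return (path, params, body) for a percolate search.

    Assumption: the search runs against the index that stores the
    percolator queries, so a shard preference on this request applies
    to that index's shards.
    """
    path = f"/{query_index}/_search"
    # Ask for primary shards only, so recently indexed queries are visible.
    params = {"preference": "_primary"}
    body = {
        "query": {
            "percolate": {
                "field": percolator_field,  # field mapped with type "percolator"
                "document": document,       # the document to match against stored queries
            }
        }
    }
    return path, params, body


# Hypothetical index, field, and document, for illustration only.
path, params, body = build_percolate_request(
    "doc-level-queries-example", "query", {"message": "login failed"}
)
print(path, params)
print(json.dumps(body, indent=2))
```

The helper only builds the request, so it can be handed to whichever HTTP or OpenSearch client the caller already uses.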
Summary
With the 2.9.0 release, there are a lot of enhancements going in for the segment replication[1][2] feature (which went GA in 2.7.0), and we need to ensure that plugins are compatible with the current state of this feature. Previously, we ran tests on plugin repos to verify this compatibility, but we want plugin owners to be aware of these changes so that required updates (if any) can be made. With the 2.10.0 release, the remote store feature is going GA; it internally uses only the SEGMENT replication strategy, i.e. it enforces all indices to use the SEGMENT replication strategy. So, it is important to validate that plugins are compatible with the segment replication feature.
What changed
1. Refresh policy behavior
2. Refresh lag on replicas
With segment replication, there is an inherent delay before documents become searchable on replica shard copies, because a replica shard copies data (segment) files over from the primary. Thus, compared to document replication, replica shards will on average take longer to become consistent with primaries.
3. System/hidden indices support
With opensearch-project/OpenSearch#8200, system and hidden indices are now supported with the SEGMENT replication strategy. We need to ensure there are no bottlenecks that prevent system/hidden indices from working with segment replication.
Next steps
With segment replication, strong reads are not guaranteed. Thus, if the plugin needs strong read guarantees, especially to work around the change in refresh policy behavior and the refresh lag on replicas (points 1 and 2 above), we need to update search requests to target primary shards only. With opensearch-project/OpenSearch#7375, core now supports searching primary shards only. Please follow the documentation for examples and details.
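As a rough sketch of what targeting primary shards looks like from a client, the snippet below builds a `_search` URL with the `preference=_primary` query parameter. The host, the index name `my-plugin-index`, and the helper name `primary_only_search_url` are assumptions made for illustration; the linked documentation remains the reference for the exact options.

```python
from urllib.parse import quote, urlencode


def primary_only_search_url(base_url: str, index: str) -> str:
    """Build a _search URL that asks OpenSearch to query primary shards only."""
    query = urlencode({"preference": "_primary"})
    return f"{base_url.rstrip('/')}/{quote(index, safe='')}/_search?{query}"


# Hypothetical host and index name, for illustration only.
url = primary_only_search_url("http://localhost:9200", "my-plugin-index")
print(url)  # http://localhost:9200/my-plugin-index/_search?preference=_primary
```

Routing every read to primaries trades away some read scalability, so it is probably best reserved for the searches that really do need to see a write immediately.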
Open questions
In case of any questions or issues, please post them in the core issue.
Reference
[1] Design
[2] Documentation