Add documentation for new bloom filter settings #6449

Merged
merged 6 commits into from
Feb 20, 2024

Conversation

mgodwan
Member

@mgodwan mgodwan commented Feb 20, 2024

Description

Add documentation for new bloom filter settings

Issues Resolved

Closes #6434

Checklist

  • By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and subject to the Developer Certificate of Origin.
    For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@hdhalter added the 4 - Doc review (PR: Doc review in progress), release-notes (PR: Include this PR in the automated release notes), and v2.12.0 labels on Feb 20, 2024
Contributor

@vagimeli vagimeli left a comment

Please see doc review changes. We can move this to editorial once these changes are addressed. Thank you.

@bbarani
Member

bbarani commented Feb 20, 2024

Thanks for reviewing this PR so quickly! @vagimeli

Signed-off-by: mgodwan <[email protected]>
@mgodwan
Member Author

mgodwan commented Feb 20, 2024

Thanks @vagimeli for the review. I've addressed the comments based on your suggestions.
Could you please check?

@vagimeli
Contributor

> Thanks @vagimeli for the review. I've addressed the comments based on your suggestions. Could you please check?

@mgodwan It's in editorial review. I'll address any of the editor's feedback and merge the PR. If any tech questions come up, I'll reach out to you. Thank you!

Collaborator

@natebower natebower left a comment

@vagimeli @mgodwan Please see my comments and changes and let me know if you have any questions. Thanks!

@@ -182,6 +182,10 @@ OpenSearch supports the following dynamic index-level index settings:

- `index.final_pipeline` (String): The final ingest node pipeline for the index. If the final pipeline is set and the pipeline does not exist, then index requests fail. The pipeline name `_none` specifies that the index does not have an ingest pipeline.

- `index.optimize_doc_id_lookup.fuzzy_set.enabled` (Boolean): This setting controls whether `fuzzy_set` should be enabled for optimizing document ID lookups in indexing or searching calls by using an additional data structure, in this case, the Bloom filter data structure. Enabling this setting improves performance for upsert and search operations that rely on document ID by creating a new data structure (Bloom filter). The Bloom filter allows for the handling of negative cases (that is, IDs being absent in the existing index) through faster off-heap look-ups. Default is `false`. This setting can only be used if the feature flag `opensearch.experimental.optimize_doc_id_lookup.fuzzy_set.enabled` is set to `true`.

"in order to optimize" instead of "for optimizing"? "lookups in index or search calls"?
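
For readers unfamiliar with the data structure, the negative-case behavior described in the setting text can be illustrated with a toy Bloom filter. This is a minimal sketch, not OpenSearch's implementation: the `BloomFilter` class, the SHA-256 double hashing, and the `lookup` helper are all assumptions made for illustration. The point it shows is that a "no" from the filter is always correct, so an absent ID can be rejected without consulting the ID index at all.

```python
import hashlib


class BloomFilter:
    """Toy Bloom filter: k hash probes into a fixed-size bit array."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, doc_id: str):
        # Double hashing from one SHA-256 digest. This is an illustrative
        # choice, not the hash function OpenSearch uses.
        digest = hashlib.sha256(doc_id.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # keep the step odd
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, doc_id: str) -> None:
        for pos in self._positions(doc_id):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, doc_id: str) -> bool:
        # False means "definitely absent"; True means "possibly present".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(doc_id)
        )


def lookup(doc_id: str, bloom: BloomFilter, id_index: set) -> tuple:
    """Return (found, consulted_index) for a document ID lookup.

    `id_index` stands in for the more expensive ID lookup structure.
    """
    if not bloom.might_contain(doc_id):
        return False, False  # negative case answered by the filter alone
    return doc_id in id_index, True  # possible hit: confirm against the index
```

A false positive only costs an extra check against the index; it never produces a wrong answer, because every "possibly present" result is still confirmed.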

@@ -182,6 +182,10 @@ OpenSearch supports the following dynamic index-level index settings:

- `index.final_pipeline` (String): The final ingest node pipeline for the index. If the final pipeline is set and the pipeline does not exist, then index requests fail. The pipeline name `_none` specifies that the index does not have an ingest pipeline.

- `index.optimize_doc_id_lookup.fuzzy_set.enabled` (Boolean): This setting controls whether `fuzzy_set` should be enabled for optimizing document ID lookups in indexing or searching calls by using an additional data structure, in this case, the Bloom filter data structure. Enabling this setting improves performance for upsert and search operations that rely on document ID by creating a new data structure (Bloom filter). The Bloom filter allows for the handling of negative cases (that is, IDs being absent in the existing index) through faster off-heap look-ups. Default is `false`. This setting can only be used if the feature flag `opensearch.experimental.optimize_doc_id_lookup.fuzzy_set.enabled` is set to `true`.

- `index.optimize_doc_id_lookup.fuzzy_set.false_positive_probability` (Double): Set the false-positive probability for the underlying `fuzzy_set` (that is, the Bloom filter). A lower false-positive probability ensures higher throughput improvement for `UPSERT` or `GET` operations. Allowed values range between `0.01` and `0.50`. Default is `0.20`. This setting can only be used if the feature flag `opensearch.experimental.optimize_doc_id_lookup.fuzzy_set.enabled` is set to `true`.

"Sets"?
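
To make the trade-off behind `false_positive_probability` concrete, the textbook Bloom filter sizing formulas can be sketched as follows. This assumes the standard relations (bits per element = -ln p / (ln 2)^2, hash count ≈ -log2 p); how OpenSearch actually sizes its `fuzzy_set` is not stated in this PR. The `fuzzy_set_settings` helper is hypothetical: it only builds a settings dictionary from the two setting names and the allowed range quoted in the diff, which could then be sent as the body of an update index settings request.

```python
import math

# Allowed range and default quoted in the documentation text under review.
MIN_FPP = 0.01
MAX_FPP = 0.50
DEFAULT_FPP = 0.20


def bits_per_id(fpp: float) -> float:
    """Textbook optimal Bloom filter size, in bits per stored ID."""
    return -math.log(fpp) / (math.log(2) ** 2)


def optimal_hashes(fpp: float) -> int:
    """Textbook optimal number of hash functions for a target probability."""
    return max(1, round(-math.log2(fpp)))


def fuzzy_set_settings(enabled: bool, fpp: float = DEFAULT_FPP) -> dict:
    """Build a settings body (hypothetical helper, not an OpenSearch API)."""
    if not MIN_FPP <= fpp <= MAX_FPP:
        raise ValueError(
            f"false_positive_probability must be in [{MIN_FPP}, {MAX_FPP}]"
        )
    return {
        "index.optimize_doc_id_lookup.fuzzy_set.enabled": enabled,
        "index.optimize_doc_id_lookup.fuzzy_set.false_positive_probability": fpp,
    }


if __name__ == "__main__":
    for p in (MIN_FPP, DEFAULT_FPP, MAX_FPP):
        print(f"p={p:.2f}: {bits_per_id(p):.2f} bits/ID, {optimal_hashes(p)} hashes")
```

Under these formulas, lowering the probability filters out more absent IDs but makes the filter larger, so the setting trades memory for fewer unnecessary index lookups.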

@vagimeli vagimeli merged commit 2c43562 into opensearch-project:main Feb 20, 2024
3 checks passed
oeyh pushed a commit to oeyh/documentation-website that referenced this pull request Mar 14, 2024

* Add documentation for new bloom filter settings

Signed-off-by: mgodwan <[email protected]>

* Address PR comments

Signed-off-by: mgodwan <[email protected]>

* Update _install-and-configure/configuring-opensearch/index-settings.md

Co-authored-by: Nathan Bower <[email protected]>
Signed-off-by: Melissa Vagi <[email protected]>

* Update _install-and-configure/configuring-opensearch/index-settings.md

Co-authored-by: Nathan Bower <[email protected]>
Signed-off-by: Melissa Vagi <[email protected]>

* Update index-settings.md

Signed-off-by: Melissa Vagi <[email protected]>

Signed-off-by: Melissa Vagi <[email protected]>

---------

Signed-off-by: mgodwan <[email protected]>
Signed-off-by: Melissa Vagi <[email protected]>
Co-authored-by: Melissa Vagi <[email protected]>
Co-authored-by: Nathan Bower <[email protected]>
Labels: 4 - Doc review (PR: Doc review in progress), release-notes (PR: Include this PR in the automated release notes), v2.12.0

Development: Successfully merging this pull request may close these issues.

[DOC] Documentation for new bloom filter settings

5 participants