Extend support of allowedFields to getMatchingFieldNames and getAllFields #106862

javanna · 2024-03-28T10:20:16Z

The SearchExecutionContext supports the notion of allowed fields, provided via a specific setter method. However, fields are only filtered in the getFieldType method. getMatchingFieldNames and getFieldType need to be consistent: there are places in the code where getMatchingFieldNames is called to resolve field name patterns, and getFieldType is later called on each of the resolved fields. If the former resolves to a field that we can't retrieve a field type for, that is unexpected and should be considered a bug.

In addition, this commit adds consistency for getAllFields. It is only called by field caps, a different code path that does not seem to set allowed fields for now. Still, it's important for the context to provide consistent field access, especially for methods as broad as getAllFields, despite their currently very specific usage.

This surfaced as we are trying to move fetching of the _ignored field to value fetchers, which use a search execution context and resolve the field type. Until now, its values were retrieved directly via StoredFieldsPhase, completely bypassing that check.

This commit also adds a missing test verifying that SearchExecutionContext applies the allowedFields predicate when provided.
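
The consistency requirement described above can be illustrated with a small, self-contained sketch. This is not the Elasticsearch implementation: the class `AllowedFieldsSketch`, the simplified `Map<String, String>` of field names to type names, and the prefix-wildcard matching are all assumptions made for illustration. Only the method names getFieldType, getMatchingFieldNames, getAllFields and setAllowedFields mirror the ones named in the description.

```java
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.function.Predicate;
import java.util.stream.Collectors;

// Hypothetical stand-in for the context; NOT the Elasticsearch class.
public class AllowedFieldsSketch {
    // Simplified mapping: field name -> type name (assumption for illustration).
    private final Map<String, String> fieldTypes;
    // null means that no restriction was set.
    private Predicate<String> allowedFields;

    public AllowedFieldsSketch(Map<String, String> fieldTypes) {
        this.fieldTypes = new TreeMap<>(fieldTypes);
    }

    public void setAllowedFields(Predicate<String> allowedFields) {
        this.allowedFields = allowedFields;
    }

    private boolean isAllowed(String name) {
        return allowedFields == null || allowedFields.test(name);
    }

    // A field that is not allowed behaves as if it were not mapped.
    public String getFieldType(String name) {
        return isAllowed(name) ? fieldTypes.get(name) : null;
    }

    // Applies the same predicate, so every returned name resolves via getFieldType.
    public Set<String> getMatchingFieldNames(String pattern) {
        return fieldTypes.keySet().stream()
            .filter(name -> matches(pattern, name))
            .filter(this::isAllowed)
            .collect(Collectors.toSet());
    }

    public Set<String> getAllFields() {
        return fieldTypes.keySet().stream().filter(this::isAllowed).collect(Collectors.toSet());
    }

    // Simplified pattern support: exact name, or a prefix followed by a trailing '*'.
    private static boolean matches(String pattern, String name) {
        if (pattern.endsWith("*")) {
            return name.startsWith(pattern.substring(0, pattern.length() - 1));
        }
        return pattern.equals(name);
    }
}
```

With all three accessors going through the same check, a caller that resolves a pattern and then looks up each resolved name never receives a name whose type lookup returns null because of the predicate.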

elasticsearchmachine · 2024-03-28T10:20:40Z

Hi @javanna, I've created a changelog YAML for you.

elasticsearchmachine · 2024-03-28T10:20:40Z

Pinging @elastic/es-security (Team:Security)

elasticsearchmachine · 2024-03-28T10:20:40Z

Pinging @elastic/es-search (Team:Search)

salvatore-campagna · 2024-03-28T12:57:43Z

server/src/main/java/org/elasticsearch/index/query/QueryRewriteContext.java

                }
            }
        }
-        return matches;
+        // If the field is not allowed, behave as if it is not mapped
+        return allowedFields == null ? matches : matches.stream().filter(allowedFields).collect(Collectors.toSet());


I see here and below we always need to check for allowedFields to be null or not to decide if we need to filter one or more fields. I am wondering if we should require it to be not null and force the caller to use an identity predicate lambda (fieldName -> true) in case no field is filtered out.

I am not too sure what is best here: skip the filtering entirely when the predicate is null, or always filter even though the predicate always returns true? I went for the former, which leaves things as close as possible to how they are outside of API key queries (when setAllowedFields is not called).

I agree that, maybe, performance-wise not applying the filter at all might be better if there is nothing to filter out...which is the reason why I said it looks good to me...it is just that all those null checks are annoying for me.
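
For readers following the trade-off discussed in this thread, here is a minimal sketch of the two options side by side. The class and method names (`PredicateChoiceSketch`, `filterNullable`, `filterNonNull`) are hypothetical and do not appear in the PR; only the body of `filterNullable` mirrors the line added in the diff.

```java
import java.util.Objects;
import java.util.Set;
import java.util.function.Predicate;
import java.util.stream.Collectors;

// Hypothetical helper class used only to compare the two approaches.
public class PredicateChoiceSketch {

    // Approach taken in the diff: a null predicate means "no restriction",
    // so the original set is returned untouched and no stream is created.
    static Set<String> filterNullable(Set<String> matches, Predicate<String> allowedFields) {
        return allowedFields == null
            ? matches
            : matches.stream().filter(allowedFields).collect(Collectors.toSet());
    }

    // Alternative raised in review: the predicate must never be null, and
    // callers without restrictions pass an always-true lambda (fieldName -> true).
    static Set<String> filterNonNull(Set<String> matches, Predicate<String> allowedFields) {
        Objects.requireNonNull(allowedFields, "allowedFields");
        return matches.stream().filter(allowedFields).collect(Collectors.toSet());
    }
}
```

Both variants return equal sets when nothing is restricted. The difference is that the nullable variant hands back the same set instance, while the non-null variant always streams and copies, in exchange for removing the null checks at each call site.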

salvatore-campagna · 2024-03-28T13:01:38Z

LGTM...I just left a couple of comments.

tteofili

the changes look good, Luca ;)

salvatore-campagna · 2024-03-28T13:04:30Z

server/src/main/java/org/elasticsearch/index/query/QueryRewriteContext.java

                }
            }
        }
-        return matches;
+        // If the field is not allowed, behave as if it is not mapped
+        return allowedFields == null ? matches : matches.stream().filter(allowedFields).collect(Collectors.toSet());


If I understand correctly, this is just the same as what we were doing before plus an additional filter on allowed fields based on field names...maybe we can just extract the existing method into something like a private getMatchingFieldNamesInternal to which we apply the filter?

what is the gain compared to a single method?

javanna added the >bug, :Search Foundations/Mapping, :Security/Security and v8.14.0 labels Mar 28, 2024

elasticsearchmachine added the Team:Search and Team:Security labels Mar 28, 2024

Update docs/changelog/106862.yaml

21b2690

javanna removed the :Security/Security and Team:Security labels Mar 28, 2024

update changelog

fdabfff

salvatore-campagna reviewed Mar 28, 2024

tteofili approved these changes Mar 28, 2024

salvatore-campagna approved these changes Mar 28, 2024

javanna merged commit 917f54a into elastic:main Mar 28, 2024
14 checks passed

javanna deleted the fix/allowed_fields_support branch March 28, 2024 14:10

salvatore-campagna added a commit to salvatore-campagna/elasticsearch that referenced this pull request Apr 2, 2024

fix: null check not required after elastic#106862

ab96ec1
