-
Notifications
You must be signed in to change notification settings - Fork 24.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
.fdt file still exist when disable _source #69584
Comments
Thanks for the report. One thing that looks to be missing is the file size comparison between the source enabled and source disabled indices. Could you provide that? Otherwise all you may be showing is that there is some residual use of stored values (possibly not related to source). |
I think we need to know exactly what version this is about too, the OP says In some versions we temporarily retain the source for replica recovery (and CCR) even if |
Relates #41628 |
Pinging @elastic/es-distributed (Team:Distributed) |
I tested it in 7.9.0, 7.10.0, 7.11.0 |
just like @DaveCTurner said, I have to disable soft_delete
then, the .fdt file is the proper size: |
I said nothing about disabling soft deletes, and I certainly don't recommend that either. What I did say was that the source is only kept while it's needed for recovery or CCR, and it's removed by a later merge. It's therefore the expected behaviour for there to be stored fields in some segments: there's only a bug here if we are retaining stored fields in merges even though there's no reason to retain it, and so far we haven't seen any evidence of that. |
No further response after a few weeks so I'm closing this. If you can find and share evidence of a bug (see my previous message) then we can reopen this. |
Elasticsearch version (
bin/elasticsearch --version
):7.11
Plugins installed: []
no
JVM version (
java -version
):jdk 12
OS version (
uname -a
if on a Unix-like system):mac
Description of the problem including expected versus actual behavior:
When I disable the _source, It is expected lower storage usage, but it didn't.
After set
_source: { enabled: false}
, Elasticsearch doesn't return_source
in get/search request, but it still keep the .fdt file in segmentElasticsearch are wildly use in OLAP case, when we just need the analysis report without show the documents, we could get rid of the stored field
Steps to reproduce:
1, add sample data:
2, reindex it with
_source
disable3, new index return without _source
4, new index with the same size
The text was updated successfully, but these errors were encountered: