Replies: 2 comments 1 reply
-
Hi @gplechuck, |
Beta Was this translation helpful? Give feedback.
-
By the way, I think it is more an issue than a discussion ;) |
Beta Was this translation helpful? Give feedback.
-
Hi @gplechuck, |
Beta Was this translation helpful? Give feedback.
-
By the way, I think it is more an issue than a discussion ;) |
Beta Was this translation helpful? Give feedback.
-
Hey @fbaligand ,
Very nice plugin, have been using the 'Document Table' visualization for some time and am very happy with it as a lightweight alternative to the 'Data Table' visualization. Have noticed a problem when querying large datasets however!
It seems that when the 'Max Hits' is set higher than 10000 , a sort by '_id' field is introduced into the resulting query. Sorting by '_id' is not recommended by Elasticsearch and can lead to a large amount of cached _id field data occupying the heap (I think use of _id field data will actually be disabled as default in future versions , see elastic/elasticsearch#64511 ). I recently ran a query over a couple years worth of our data, and the overall heap usage spiked by 91GB, almost exclusively _id field data.
Use of the '_id' field does not appear to be configurable in the Document Table 'Query Parameters'
I had a very quick look at the source files and can see where the _id sort is introduced -
Is this behaviour essential for anything and would there be any adverse effects in removing that snippet of code from the plugin as a workaround, so we could continue to use the table for large datasets ?
Cheers
Beta Was this translation helpful? Give feedback.
All reactions