Can't build visualizations on text fields #6769

Bargs · 2016-04-04T20:15:50Z

Selecting a text field as the target for an aggregation returns the following error:

Fielddata is disabled on text fields by default. Set fielddata=true on [agent] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory.

Another thing that's odd is that the http response from ES containing the error has a 200 status code. I'm not sure if that's an intentional change in ES, but it doesn't seem right to me.

clintongormley · 2016-04-05T08:06:38Z

> Another thing that's odd is that the http response from ES containing the error has a 200 status code. I'm not sure if that's an intentional change in ES, but it doesn't seem right to me.

I presume you're talking about the msearch response? It is correct that msearch will return 200 - the msearch request completed correctly, you have to look at the individual items to see if they executed correctly or not. This is just like the bulk API.
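
To illustrate the point about inspecting individual items, here is a minimal Python sketch. It assumes the msearch body has a top-level `responses` array in which a failed search carries an `error` key; the helper name `failed_searches` and the sample payload are made up for illustration.

```python
from typing import Any


def failed_searches(msearch_body: dict[str, Any]) -> list[tuple[int, Any]]:
    """Return (position, error) pairs for every sub-search that failed.

    Assumes each entry in "responses" is a dict that contains an
    "error" key only when that individual search did not succeed.
    """
    failures = []
    for position, item in enumerate(msearch_body.get("responses", [])):
        if "error" in item:
            failures.append((position, item["error"]))
    return failures


# Hypothetical payload: the HTTP status was 200, but the second search failed.
sample = {
    "responses": [
        {"hits": {"total": 3, "hits": []}},
        {
            "error": {
                "type": "illegal_argument_exception",
                "reason": "Fielddata is disabled on text fields by default.",
            }
        },
    ]
}

for position, error in failed_searches(sample):
    print(f"search #{position} failed: {error['reason']}")
```

A client that only checks the HTTP status would miss the second failure, which is why each item has to be examined.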

spalger · 2016-04-05T23:08:37Z

Not sure why this would be a P1 blocker. Until we have a way to ask elasticsearch what fields are "aggregatable" we simply have to give the user errors. Ideally the error would say something about using the not-analyzed variant of the chosen field (should it exist), but perhaps this is an enhancement we can bring to this UI issue.

Bargs · 2016-04-06T00:39:00Z

@spalger Text fields replaced analyzed string fields. Aggregating on analyzed string fields didn't use to throw an error. It wasn't recommended, but it didn't throw an error.

spalger · 2016-04-06T06:51:01Z

Sure, but text fields not being aggregatable by default is a new behavior in Elasticsearch that bubbles up to Kibana. How is that a bug in Kibana?

If the user wants to continue to aggregate on these value types they should do as the error message suggests and "set fielddata=true".

spalger · 2016-04-06T06:52:41Z

What type of solution do you imagine here?

Bargs · 2016-04-06T14:34:51Z

TBH when I read the error message, I didn't know enough about fielddata and how it relates to aggregations to understand that it essentially meant "this field is not aggregatable". I would guess most users would have the same reaction. So if text fields can't be aggregated on by default, we should hide them from the field list in the vis editor by default.

spalger · 2016-04-07T00:21:06Z

Yeah, this is why I said:

> Until we have a way to ask elasticsearch what fields are "aggregatable" we simply have to give the user errors.

spalger · 2016-04-07T00:29:48Z

We used to try to guess which fields were aggregatable, but the details about what qualifies or disqualifies a field are quite complex and have changed in the past without actually causing any breaks in Kibana. Users then started filing issues (#3335, #5914) about how Elasticsearch had added the ability to aggregate on fields in some new scenario and we had no workaround for them; Kibana was simply going to prevent them from aggregating on that field until the next version was released.

This is why we did #5806, and why we fall back to the error message that elasticsearch chooses to explain the issue.

Bargs · 2016-04-07T14:33:11Z

The historical context helps... I understand what you're saying. But this change in ES defaults, combined with our policy to simply throw an ES error if we get one, is going to lead to a really terrible user experience. By default, half of a user's string fields (all the non-raw fields) are going to throw a really cryptic error in their face. If this happened to me as a brand new Kibana user, I might just assume the app is broken. This is worse than previous versions where the defaults worked, and the user would only get an error if they intentionally messed with advanced mapping options.

I don't know what an acceptable solution would be since I don't know all the details of the previous discussions about removing the bucketable property, but I feel like at the very least we need to give the user some sort of warning or more friendly error message. Longer term, getting this into ES becomes much more important.

Bargs · 2016-05-02T17:01:35Z

Now that elastic/elasticsearch#17980 is merged, we should be able to fix this.

streamnsight · 2016-06-27T14:47:33Z

Just tried out Kibana 5 alpha and running into this issue as well.

I have to agree with @Bargs that the error is cryptic and I don't know what to do from here.
Since the error message suggests an option to fix the problem, it would be nice to have a way to do so in the UI, but I don't see any obvious one (no option to set fielddata=true in options for the mappings.)

streamnsight · 2016-06-27T14:59:15Z

Just realized there is a new .keyword extension after the text field that can be used to build visualizations...

Seems to work, but it raises a question: is this a 'representation' for the UI or an actual new field?
What if I have a nested field ending with .keyword? Is it going to be interpreted as the field that can be aggregated, or am I going to see two fields with the same name?

Bargs · 2016-06-27T15:08:38Z

@streamnsight in 5.0 strings are mapped as multi fields with text and keyword versions by default: https://www.elastic.co/guide/en/elasticsearch/reference/master/breaking_50_mapping_changes.html#_default_string_mappings. So .keyword isn't a UI only construct, it's coming from elasticsearch.
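
As a rough sketch of what that default looks like, the Python snippet below writes out the multi-field mapping for a dynamically mapped string and picks the sub-field to aggregate on. The field name `agent` is taken from the error message in this issue; the helper `aggregatable_name` is hypothetical and only for illustration.

```python
import json

# Assumed shape of the mapping Elasticsearch 5.0 generates by default for a
# dynamically mapped string field (shown here for a field named "agent").
default_string_mapping = {
    "agent": {
        "type": "text",
        "fields": {
            "keyword": {"type": "keyword", "ignore_above": 256}
        },
    }
}


def aggregatable_name(field: str, mapping: dict) -> str:
    """Return "<field>.<sub>" for the first keyword sub-field, else the field."""
    subfields = mapping[field].get("fields", {})
    for name, spec in subfields.items():
        if spec.get("type") == "keyword":
            return f"{field}.{name}"
    return field


print(json.dumps(default_string_mapping, indent=2))
print(aggregatable_name("agent", default_string_mapping))
```

So `agent` is the analyzed `text` field used for full-text search, and `agent.keyword` is a real, separately indexed sub-field that can be used in aggregations.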

streamnsight · 2016-06-27T15:49:17Z

@Bargs thanks for the link...
Can you confirm: does that mean keyword is now a reserved field name, and I can't have a nested key called mytextfield.keyword?

Bargs · 2016-06-27T16:45:35Z

@streamnsight It's not reserved, it's just a default. You can override that default by creating your own mappings for the field in your index, or index template.

Or if you want to disable the automatic multi-field entirely, you can edit the default mappings for all indices.
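
As a rough illustration of overriding the default, the sketch below builds a mapping body with a dynamic template that maps every new string field to a plain `keyword` (no `text`/`keyword` multi-field). The mapping type name `my_type` and the template name `strings_as_keyword` are placeholders, and this is only one possible way to do it; the same `dynamic_templates` entry could presumably be placed in an index template to apply it across indices.

```python
import json

# Placeholder names: "my_type" and "strings_as_keyword" are illustrative only.
keyword_only_mappings = {
    "mappings": {
        "my_type": {
            "dynamic_templates": [
                {
                    "strings_as_keyword": {
                        # Applies to any newly seen JSON string value.
                        "match_mapping_type": "string",
                        "mapping": {"type": "keyword"},
                    }
                }
            ]
        }
    }
}

# Request body that would be sent when creating the index.
print(json.dumps(keyword_only_mappings, indent=2))
```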

clintongormley · 2016-06-27T19:33:21Z

Once Kibana starts using the feature added in elastic/elasticsearch#17980, this problem should go away as the text field won't be shown as aggregatable

LeeDr · 2016-06-29T21:48:55Z

This has an even uglier result in the Graph UI. If you use the text field there, you get a server 500 error. Elasticsearch and Kibana are showing the same error:

Caused by: java.lang.IllegalArgumentException: Fielddata is disabled on text fields by default. Set fielddata=true on [agent] in order to load fielddata in memory by uninverting the inverted index. Note that this can however use significant memory.

but Graph is showing a 500 error.
cc @markharwood

irab · 2016-08-16T01:50:29Z

Still getting this error on fresh install of Elasticsearch 5 alpha 5 and Kibana 5 alpha 5. elastic/elasticsearch#17980 has not fixed this.

clintongormley · 2016-08-16T12:51:19Z

@irab Kibana needs to start using the feature added in elastic/elasticsearch#17980 before you'll see any difference

irab · 2016-08-17T05:31:11Z

Hi @clintongormley. I took a look at that issue - it's tagged "v5.0.0-alpha3" and was committed to master back in April. I'm assuming it's in the version I'm using - v5.0.0-alpha5, released Aug 9th.

clintongormley · 2016-08-17T10:43:57Z

@irab I repeat: Kibana needs to start using the feature, which will mean not showing fields that shouldn't be used in aggregations.

irab · 2016-08-17T12:15:25Z

Thanks for the clarification. Hard to tell what is enabled...

---------

**Commit 1:**
Add field_capabilities API

This adds a simple API for getting the searchable/aggregatable status of a list of fields in a given index, list of indices, or index pattern. In the future this will probably evolve into a full blown fields info API that we can use when removing the index pattern mapping cache. For now though it's built to provide the minimum info needed to fix #6769.

Usage:

The API exposes a single GET endpoint.

```
GET /api/kibana/{indices}/field_capabilities
```

`indices` can be a single index, a comma-delimited list, or a wildcard pattern.

Example response:

```
{
  "fields": {
    "imsearchable": {
      "searchable": true,
      "aggregatable": false
    },
    "imaggregatable": {
      "searchable": true,
      "aggregatable": true
    }
  }
}
```

* Original sha: 1af6b76
* Authored by Matthew Bargar <[email protected]> on 2016-09-21T18:38:34Z

**Commit 2:**
Filter non-aggregatable fields from vis editor UI

Using the field_capabilities API added in the previous commit, this commit enhances the client side index pattern object with information about the searchable and aggregatable status of each field in the index pattern. We then use this information to filter out non-aggregatable fields from the vis editor so that users won't accidentally select them and get nasty errors. An example of a non-aggregatable field would be a `text` field without fielddata enabled (which is the default).

I also added the searchable and aggregatable flags to the index pattern page so users can see the status of their fields. I removed the `indexed` column because it was mostly redundant with `searchable` and I needed the horizontal space.

The addition of the searchable and aggregatable properties for index pattern fields would require users to manually refresh their field list when upgrading to 5.0. This commit also adds a check for those properties and if they're missing it automatically refreshes the field list for the user in a seamless manner.

* Original sha: 4a906f3
* Authored by Matthew Bargar <[email protected]> on 2016-09-21T19:18:10Z
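
The filtering step described in Commit 2 can be sketched in a few lines of Python. This is not the Kibana implementation (which is client-side JavaScript); it only assumes a response in the shape shown in the commit message above, and the function name `aggregatable_fields` is hypothetical.

```python
# Sample response in the shape shown in the commit message.
field_capabilities = {
    "fields": {
        "imsearchable": {"searchable": True, "aggregatable": False},
        "imaggregatable": {"searchable": True, "aggregatable": True},
    }
}


def aggregatable_fields(response: dict) -> list[str]:
    """Return the names of fields that may be offered for aggregations."""
    fields = response.get("fields", {})
    return sorted(
        name for name, caps in fields.items() if caps.get("aggregatable")
    )


print(aggregatable_fields(field_capabilities))  # ['imaggregatable']
```

A field that is missing the `aggregatable` flag is treated as non-aggregatable here; that is a choice made for this sketch, not something stated in the commit.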

zjost · 2017-02-03T22:03:44Z

I'm not sure I understand how to proceed. I see published tutorials like this which use the text of tweets to do what I want to do (the "Graphing Tweet Text Contents" section).

What do I need to change to allow this sort of analysis and where do I need to change it? I'm trying to recreate the example using twitter data.

Bargs · 2017-02-03T22:41:41Z

@zjost I wasn't able to find the data set this blog post is using, but I suspect entities.hashtags.text is one giant string in the source JSON. It would be better to split that string into an array prior to indexing and then select the keyword version of the field. The other option is to turn on fielddata for the text version of the field to make it aggregatable, which would be fine if you're just playing around with things in a local environment, but it can suck up a lot of memory so you generally don't want to use it in production.
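
A minimal sketch of the "split before indexing" idea, assuming the hashtags arrive as one whitespace-separated string (the actual delimiter in that data set is unknown). The document shape and the `hashtags` field name are made up for illustration.

```python
def split_hashtags(raw: str) -> list[str]:
    """Split a whitespace-separated hashtag string into individual tags."""
    return raw.split()


# Hypothetical source document with all hashtags in a single string.
doc = {"hashtags": "#DataScience #elasticsearch #kibana"}

# Replace the string with an array so that each tag becomes its own
# value in the keyword version of the field.
doc["hashtags"] = split_hashtags(doc["hashtags"])
print(doc)
```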

pavankumarb · 2017-02-04T07:51:10Z

Using the keyword version of the fields doesn't work in the Kibana Graph workspace.

The Graph UI makes a REST call to http://localhost:5601/api/graph/graphExplore, which returns an empty response: {"ok":true,"resp":{"took":0,"timed_out":false,"failures":[],"vertices":[],"connections":[]}}.

ES and Kibana versions being used: 5.1.2

zjost · 2017-02-05T17:37:15Z

@Bargs thanks! Is there a way to run the text through the standard analyzer before using the keyword method? I like the keyword functionality, but it only makes sense if you can first standardize the text strings or #DataScience != #datascience

markharwood · 2017-02-06T10:56:40Z

> Is there a way to run the text through the standard analyzer before using the keyword method?

See normalizers in 5.2
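
For the `#DataScience` vs `#datascience` case, a sketch of what a lowercase normalizer on a keyword field could look like, together with a plain-Python simulation of its effect on term counts. The normalizer name `lowercase_normalizer`, the type name `tweet`, and the field name `hashtags` are all placeholders.

```python
from collections import Counter

# Assumed index settings/mappings for Elasticsearch 5.2+; all names are
# placeholders chosen for this example.
index_body = {
    "settings": {
        "analysis": {
            "normalizer": {
                "lowercase_normalizer": {
                    "type": "custom",
                    "filter": ["lowercase"],
                }
            }
        }
    },
    "mappings": {
        "tweet": {
            "properties": {
                "hashtags": {
                    "type": "keyword",
                    "normalizer": "lowercase_normalizer",
                }
            }
        }
    },
}

# Simulation only: lowercasing each value before counting mimics how a
# terms aggregation would bucket the normalized keyword values.
tags = ["#DataScience", "#datascience", "#Kibana"]
buckets = Counter(tag.lower() for tag in tags)
print(buckets.most_common())
```

Note that a normalizer keeps each value as a single term; it does not tokenize or stem the way an analyzer on a `text` field does.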

markharwood · 2017-02-06T10:58:49Z

@pavankumarb Check out the troubleshooting docs for no results

zjost · 2017-02-06T16:15:20Z

So there's no way to use an analyzer and then index the tokens? The whole point is to do stemming, etc., and find patterns in documents. It seems that's exactly what many of the old tutorials do, but there are new defaults that make this either difficult or impossible. Is there any way to recreate the result where, e.g., given the text field of a tweet, one can use Graph on the field so that tokens that are significantly related are represented by the graph? Not the full tweet text, but the tokens within. Thanks again for the help.

clintongormley · 2017-02-07T11:11:09Z

@zjost the only alternative would be to enable fielddata on the text field - just be aware that it is going to use a lot of memory
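
A sketch of the mapping body that enables fielddata on a text field, built in Python. The field name `text` is a placeholder for the tweet text field, and the helper `fielddata_mapping` is hypothetical; the body would be sent as a mapping update for the index and type in question.

```python
import json


def fielddata_mapping(field: str) -> dict:
    """Build a mapping body that turns on fielddata for one text field."""
    return {
        "properties": {
            field: {
                "type": "text",
                # Loads the field's terms into memory by uninverting the
                # inverted index, which can use significant heap.
                "fielddata": True,
            }
        }
    }


print(json.dumps(fielddata_mapping("text"), indent=2))
```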

Bargs added v5.0.0 v5.0.0-alpha1 labels Apr 4, 2016

epixa added bug P1 blocker and removed v5.0.0-alpha1 labels Apr 4, 2016

Bargs mentioned this issue Apr 7, 2016

Return an aggregated view of all mappings/properties of all types elastic/elasticsearch#15728

Closed

epixa added the v5.0.0-rc1 label Sep 13, 2016

epixa added v5.0.0 and removed v5.0.0 v5.0.0-rc1 labels Sep 13, 2016

epixa assigned Bargs Sep 19, 2016

Bargs mentioned this issue Sep 21, 2016

Filter non-aggregatable fields from vis editor UI #8421

Merged

Bargs closed this as completed in #8421 Sep 23, 2016

This was referenced Sep 23, 2016

[backport] PR #8421 to 5.0 #8462

Merged

[backport] PR #8421 to 5.x #8463

Merged

kronthto mentioned this issue Oct 28, 2017

Scout orderBy() not supported ErickTamayo/laravel-scout-elastic#86

Closed

elasticmachine mentioned this issue Apr 25, 2018

Error 500 Internal Server Error: An internal server error occurred - shown for a user who has insufficient permissions to access certain indices . #18240

Closed
