Query failing for large volume

Running OpenSearch 2.11.1 as the search backend for the Graylog logging tool, I have a query that first groups documents by a keyword field, then sorts them by an integer field named “myField”, and shows the result in a table. If I run it over 6 hours of data, it works fine. If I run it over one day, it errors out within a second, printing this:

Unable to perform search query: OpenSearch exception [type=illegal_argument_exception, reason=Text fields are not optimised for operations that require per-document field data like aggregations and sorting, so these operations are disabled by default. Please use a keyword field instead. Alternatively, set fielddata=true on [myField] in order to load field data by uninverting the inverted index. Note that this can use significant memory.].

So the error message is rather misleading: it claims the field is badly configured, when in reality the volume of data is to blame. OpenSearch runs on 3 nodes, each process with 4 GB of heap.

Can someone please suggest a solution here? Does it make sense to set “fielddata=true”, and if so, on the keyword field or on myField?
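
For reference, enabling fielddata would be a mapping update along these lines (a sketch only; “my-index” is a placeholder, the setting applies to text fields, and as the error itself warns, it can use significant heap):

PUT my-index/_mapping
{
  "properties": {
    "myField": {
      "type": "text",
      "fielddata": true
    }
  }
}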

Could you check the mapping of the field myField? It looks like it is a text type, not a keyword:

GET {index}/_mapping
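
Since Graylog writes into rotated indices, it can also be worth checking just that one field across all of them with the field-level mapping endpoint (the graylog_* pattern is an assumption about your index naming); an older index may carry a different mapping than the current one:

GET graylog_*/_mapping/field/myField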

As I already wrote, it is mapped as integer. It has to be, because we need to take the maximum value in this query.
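
For context, the aggregation behind the widget is conceptually something like the following (a sketch only, not the exact request Graylog generates; “groupField” and the graylog_* pattern are placeholders):

GET graylog_*/_search
{
  "size": 0,
  "aggs": {
    "by_group": {
      "terms": {
        "field": "groupField",
        "order": { "max_myField": "desc" }
      },
      "aggs": {
        "max_myField": {
          "max": { "field": "myField" }
        }
      }
    }
  }
}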