Row Count limit hit - which rows returned?

dangelic0 · July 14, 2024, 4:43pm

**Versions**: OpenSearch Discover 2.7, AWS, Chrome

I prepare and run a query against an index. I get back 10000 rows/documents. OK I can live with the limit as I understand the implications.
But which rows are returned? If my filter/query would have returned 1mm records, but I only get back 10K, are they the first ones? Sorted how? It looks like a random sampling amidst the time interval.

Thanks for any help
Dean

dangelic0 · July 14, 2024, 7:38pm

For clarification, I am specifically asking about a query that matches 1mm documents, but when I export a CSV report I get 10000 rows. I know why the report is limited to 10K. But which rows are returned? The first 10K in order of match score? Date? Random?
Thanks
Dean

pablo · July 15, 2024, 10:39am

@dangelic0 By default, search results in OpenSearch are sorted by relevance score (`_score`).
Why would you need to return all the documents? You should narrow your results by building efficient queries.
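If it matters which documents land inside the cap, an explicit sort in the request makes that deterministic. A minimal sketch of such a search body, assuming a date field named `@timestamp` (the field name and the helper function are illustrative, not taken from this thread):

```python
import json

def build_sorted_search(start, end, time_field="@timestamp", size=10000):
    """Build a search body that returns the earliest `size` matches by time.

    The helper name and the default field name are assumptions for
    illustration; substitute the date field of your own index.
    """
    return {
        "size": size,
        # Restrict the search to the time interval of interest.
        "query": {"range": {time_field: {"gte": start, "lte": end}}},
        # An explicit sort replaces the default ordering by _score.
        "sort": [{time_field: {"order": "asc"}}],
    }

sorted_body = build_sorted_search("now-7d", "now")
print(json.dumps(sorted_body, indent=2))
```

With this body the hits returned are the first 10000 by timestamp rather than by score.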

dangelic0 · July 15, 2024, 2:34pm

I don’t really need all those rows. I need to perform aggregation across a large set of rows/documents. Like a bucket aggregation, or a cardinality aggregation. But I found that even the aggregations only accept 10000 rows from the query.

So I was going to do the aggregation outside of OpenSearch by exporting the rows. So wondering which rows are returned when limited by the 10K limit.

How can I do an aggregation on 1mm rows? I see the doc count in the aggregation set to 10K…

pablo · July 15, 2024, 3:21pm

@dangelic0 The query processes all the matching documents in the index. The limit applies only to the output, that is, to the number of hits returned.
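To illustrate, here is a rough sketch of a search body that asks only for an aggregation. Setting `"size": 0` returns no hits, while the aggregation is still computed over every document that matches the query. The field names `user_id` and `@timestamp` and the helper function are assumptions for the example:

```python
import json

def build_cardinality_request(field, start, end, time_field="@timestamp"):
    """Build a search body that aggregates over every matching document.

    The helper and the field names are hypothetical. "size": 0 suppresses
    the hits list; the aggregation still covers all matches.
    """
    return {
        "size": 0,
        # Report the exact number of matches instead of a capped count.
        "track_total_hits": True,
        "query": {"range": {time_field: {"gte": start, "lte": end}}},
        "aggs": {"distinct_values": {"cardinality": {"field": field}}},
    }

agg_body = build_cardinality_request("user_id", "now-7d", "now")
print(json.dumps(agg_body, indent=2))
```

Sent to the index's `_search` endpoint, a body like this should report an aggregation result based on the full match set rather than on 10000 documents.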

If you’d like to increase the 10k cap, modify the `index.max_result_window` index setting.
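As a sketch, the setting can be changed with a `PUT` to the index's `_settings` endpoint. The snippet below only builds the request and does not send it. The host `https://localhost:9200`, the index name `my-logs`, and the value `50000` are placeholders, and authentication is left out:

```python
import json
import urllib.request

def build_max_result_window_request(base_url, index, window):
    """Build (but do not send) a PUT /<index>/_settings request.

    base_url, index, and window are placeholder values for illustration.
    """
    payload = json.dumps({"index": {"max_result_window": window}}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/{index}/_settings",
        data=payload,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )

req = build_max_result_window_request("https://localhost:9200", "my-logs", 50000)
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
# To apply it, pass req to urllib.request.urlopen with credentials configured.
```

Keep in mind that a larger window lets a single request pull more hits into memory, so raise it only as far as you need.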

dangelic0 · July 16, 2024, 12:24am

Excellent. Thanks very much. This solved it.
Dean
