Any best practices for warming up indices on node startup

frejonb · January 5, 2023, 10:56am

The current behavior of knn queries is that they will first check if there are unloaded native library indices and load them to RAM before the query is performed. OpenSearch provides the warmup endpoint (/_plugins/_knn/warmup/index1,index2,index3) which makes it easier to find strategies that reduce the initial query latency.

I’m wondering if there are any best practices for implementing this endpoint in kubernetes. Our current scenario is as follows:

We have an OpenSearch cluster with knn indices whose shards live in different nodes. Some times a node may go down, but by having multiple replicated shards, this results in no reduced availability. The issue appears when the node comes back online. What we see is the following:

A node containing a replica shard of a knn index goes down. All knn queries on this index execute without any extra latency or downtime, because all native library indices are already loaded
The node that was down comes back online, and now any knn query to the index will first try to load the native library indices of the node that just connected.

We would like to find a strategy for removing the latency caused by the node coming back online. One idea is to prevent the newly connected node from receiving any traffic from the masters, until it has warmed up. Though I don’t know if this is technically possible; the warmup endpoint works on all shards, but is it possible for it to run on a shard that is not yet available? or what would be a way of letting the warmup endpoint run on the new shard but preventing the standard knn queries from trying to load the native library indices of the shard?

Navneet · January 5, 2023, 7:06pm

Hi,
There is no way currently we can do the warmup of the k-NN index on a node which is coming back on the cluster. This can be a good feature request. I would request you to create a github issue for this here: k-NN Feature request.

frejonb · January 10, 2023, 12:30pm

Thanks, I added it [FEATURE] Prevent slow knn queries when nodes restart · Issue #711 · opensearch-project/k-NN · GitHub

Topic		Replies	Views
Getting latency and timeouts for initial knn search queries even after using warmup api k-NN	3	1083	March 31, 2021
KNN Queries Are Slow And Not Cached k-NN	1	175	December 6, 2024
Moving KNN index from hot to ultrawarm by updating it to non-KNN k-NN discuss	1	153	August 5, 2024
Migrating and using kNN indexes from OpenDistro to OpenSearch OpenSearch	8	649	March 27, 2023
Problems with kNN-searches OpenSearch troubleshoot , configure	1	485	March 21, 2024

Any best practices for warming up indices on node startup

Related topics