k-NN search with different ef_construction, ef_search and m return same results

Garance · November 14, 2024, 5:51pm

Versions (relevant - OpenSearch/Dashboard/Server OS/Browser):
OpenSearch 2.11.0 on AWS

Describe the issue:
We have 2 k-NN indices of the type nmslib HNSW, each index contains more than 30 millions of vectors, the vectors are the same, with the dimension 256. They are generated outside of OpenSearch.

Index 1:
ef_construction: 256
ef_search: 100
m: 16
space type: l2

Index 2:
ef_construction: 1024
ef_search: 1024
m: 64

On k-NN search with k=100, we fetch 100 results from both indexes, for the same queries the results are always the same with the same score.

Why the parameters ef_construction, ef_search and m don’t change the results? How to improve the search relevance?

Configuration:

Relevant Logs or Screenshots:

Navneet · November 14, 2024, 8:38pm

Hi @Garance

On k-NN search with k=100, we fetch 100 results from both indexes, for the same queries the results are always the same with the same score.

If same documents are retrieved then score of the documents will be same as they have same vectors.

Why the parameters ef_construction, ef_search and m don’t change the results? How to improve the search relevance?

This is a very good question. The simple answer here is, if with lower values of efs and m you are getting a very high recall then even when you increase the values for efs and m you won’t see a change in accuracy. So, can you please clarify what recall you are getting with different values of efs.

Garance · November 17, 2024, 1:13pm

Hi @Navneet, thank you for the response. After some tests I found that efs and m don’t change the result score, as your explanation.
Otherwise we haven’t measured the recall, I think the recall is high, because I don’t find a relevant document outside of the result list. By the way, how to measure the recall?

I tested the faiss engine with the same vectors, the search results are very similar to HNSW ones.

I continue to look for the ways to improve accuracy and relevance for k-NN search, such as change LLM models, space type, k… Are there any efficient methods?

yeonghyeonKo · November 18, 2024, 1:27am

In your case, taking manual way or setting relevance score of retrived documents from search API can be used as measurement.

Garance · November 19, 2024, 4:56pm

Thanks, I read these documents and I will calcul relevance score.

system · January 18, 2025, 4:57pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Different results for Nmslib and Elastic Knn Search k-NN	21	4448	August 6, 2020
Knn search is too slow OpenSearch troubleshoot	1	939	June 29, 2023
Opendistro KNN score giving different scores on the same query vector k-NN	3	1404	October 13, 2020
Reindexing Produces Different Result On The Same Query Vector k-NN	9	1426	May 12, 2021
Cannot explain knn search results k-NN	2	389	August 5, 2024

k-NN search with different ef_construction, ef_search and m return same results

Related topics