Normalisation in Hybrid Search

Praveen · March 7, 2023, 12:37pm

What is the best way to normalise scores of BM25 and ANN results. I am trying to build a hybrid search system where the results are ranked by combining the scores of keyword search and Neural search in a linear way.

To achieve this combination, the scores should be normalised on a same scale. I tried min-max normalisation of BM25 scores but that involves additional query to find the max score first and then use script scoring to normalise the BM25 scores (score/max-score) and sum them with respective KNN cosine similarity score in the subsequent query. Any other better ways to achieve this ?

vamshin · March 7, 2023, 4:07pm

Hi Praveen. We are actively working on this problem and we put RFC out recently. [RFC] High Level Approach and Design For Normalization and Score Combination · Issue #126 · opensearch-project/neural-search · GitHub

Your inputs will be helpful. Please feel free to provide feedback in the above RFC

system · May 6, 2023, 4:08pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
RFC: High Level Approach and Design For Normalization and Score Combination Request For Comments feature-request	1	398	March 2, 2023
Hybrid Search Score Normalisation OpenSearch discuss	3	787	May 18, 2024
Hybrid query to be combined with function score k-NN	15	1524	January 2, 2024
Hybrid search and normalization processor k-NN	1	263	May 19, 2024
What's the combination in normalization processor? k-NN	1	38	January 18, 2025

Normalisation in Hybrid Search

Related topics