Does OpenSearch support, or plan to support, Matryoshka Representation Learning (MRL) embedding models? Jina AI recently released such a model with excellent performance at a very small dimension. I have noticed significant effort in OpenSearch to improve vector compression, and I believe that supporting MRL would greatly contribute to those efforts.
It is more about how the model is called. Once we upload a model to ml-commons, we should be able to configure it to return only the top N values of the embedding it normally generates. For example, if the model generates 1024 values, then the ml-commons register/deploy API should accept a config that sets the returned dimension. If we set that dimension to 64, then whenever the model is called it will always return 64 values, whether the call comes from the _predict API, an ingest pipeline, or a neural-search query.
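To make the truncation step concrete, here is a minimal sketch (not the ml-commons implementation, just an illustration of what such a config would do under the hood): MRL models are trained so that a prefix of the full embedding is itself a usable embedding, so "returning 64 values" amounts to keeping the first 64 dimensions and re-normalizing. The helper name and the re-normalization choice are assumptions for illustration.

```python
import numpy as np

def truncate_embedding(embedding, dim):
    """Keep the first `dim` values of a Matryoshka embedding and
    re-normalize so the shorter vector is still unit length.
    (Hypothetical helper; sketches what a 'return dimension'
    config on the model could do.)"""
    truncated = np.asarray(embedding[:dim], dtype=np.float64)
    norm = np.linalg.norm(truncated)
    return truncated / norm if norm > 0 else truncated

# A stand-in for a full 1024-d embedding from the model...
full = np.random.default_rng(0).normal(size=1024)
# ...with the return dimension configured to 64:
short = truncate_embedding(full, 64)
print(short.shape)                                # (64,)
print(round(float(np.linalg.norm(short)), 6))     # 1.0
```

Because the truncation happens at the model-call boundary, every caller (_predict, ingest pipeline, neural-search query) would see the same 64-value output.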
In all of the above, there is only one vector per document. So if the model is configured to return 64 values, then the k-NN index should have a single vector field of dimension 64.
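The matching index side can be sketched as follows: a `knn_vector` field whose `dimension` agrees with the configured return dimension. The field name `my_embedding` is illustrative, not prescribed anywhere above.

```python
# Sketch of an index body whose vector field matches a model
# configured to return 64 values. Built as a plain dict so the
# dimension agreement is easy to check programmatically.
RETURN_DIMENSION = 64  # the value set in the (hypothetical) model config

index_body = {
    "settings": {"index.knn": True},
    "mappings": {
        "properties": {
            "my_embedding": {          # illustrative field name
                "type": "knn_vector",
                "dimension": RETURN_DIMENSION,
            }
        }
    },
}

field = index_body["mappings"]["properties"]["my_embedding"]
print(field["dimension"] == RETURN_DIMENSION)  # True
```

If the two numbers diverge (say the model is re-registered at 128 but the index still declares 64), ingestion would fail on a dimension mismatch, so keeping them in one shared constant or config is the safer design.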