How should we handle updating ML/Sparse Encoding Models?

Given the fast pace of changes in the ML world right now, I suspect we will see frequent updates to the pre-trained models being offered which improve on the results.

Given that the inference process is the slowest part of indexing content, we're planning to use the Predict API to cache the embedding vectors outside of OpenSearch. That way, we only have to run inference on content that has actually changed when we index.
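To make the plan concrete, here's a minimal sketch of the caching layer we have in mind. The cache is keyed on a hash of the content plus the model version, so unchanged documents never hit inference again. `run_predict` is a hypothetical stand-in for a call to the ML Commons Predict API, and the in-memory dict stands in for whatever external store we end up using:

```python
import hashlib

def run_predict(model_id: str, text: str) -> dict:
    # Hypothetical stand-in: real code would POST to the ML Commons
    # Predict API and return the sparse token/weight map.
    return {"token_weights": {text: 1.0}}

class EmbeddingCache:
    """Cache sparse embeddings keyed by (content hash, model version)."""

    def __init__(self, model_id: str, model_version: str):
        self.model_id = model_id
        self.model_version = model_version
        self._store: dict[tuple[str, str], dict] = {}
        self.inference_calls = 0  # counter for illustration only

    def _key(self, text: str) -> tuple[str, str]:
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
        return (digest, self.model_version)

    def get_embedding(self, text: str) -> dict:
        key = self._key(text)
        if key not in self._store:
            # Only unseen (or changed) content triggers inference.
            self.inference_calls += 1
            self._store[key] = run_predict(self.model_id, text)
        return self._store[key]
```

Since the model version is part of the cache key, bumping the version naturally invalidates every cached vector, which ties into the migration question below.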

How should we plan to manage updates to the models we are using?

If we're just using an updated version of a model (e.g., moving from opensearch-neural-sparse-encoding-doc-v1 v1.0.0 to v1.0.1), is it safe to keep using the embeddings created with the older model? Or will this create issues?

My thinking is that I probably need to track a model version alongside our embeddings. That way, when I prepare to switch to a newer model, I can first go through all our documents and create embeddings using the new model; once that has finished, I can update my indexes with the new embeddings and delete the old ones.
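That migration step could be sketched roughly as follows (all names here are hypothetical; `encode` stands in for whatever calls the Predict API). Embeddings carry a `model_version` field, and only entries made by a different version get re-encoded before the swap:

```python
def refresh_embeddings(cached: dict[str, dict], docs: dict[str, str],
                       target_version: str, encode) -> dict[str, dict]:
    """Return a complete embedding set for target_version, re-encoding
    only documents whose cached embedding came from a different model
    version (or that have no cached embedding at all)."""
    out = {}
    for doc_id, text in docs.items():
        entry = cached.get(doc_id)
        if entry is not None and entry["model_version"] == target_version:
            out[doc_id] = entry  # still valid for this version; reuse it
        else:
            out[doc_id] = {"model_version": target_version,
                           "embedding": encode(text)}
    return out
```

Once `refresh_embeddings` finishes, the new set can be written to the index and the old embeddings deleted, so search never sees a mix of vectors from two model versions.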

However, if the old embeddings can basically be expected to keep working, it would certainly be more straightforward to just update the embeddings in place and then trigger a refresh once the embedding caches have been updated.

I just want to make sure I plan accordingly, but I also want to make sure I’m not over engineering a solution.

Generally, you cannot reuse the older vectors, since we update the model weights between versions. You need to embed the text again.

@xinyual,

Thanks for the quick reply. That was my assumption that I’d need to re-encode everything, but just wanted to verify that it was going to be a necessary step.

This helps me tremendously in my planning, so thanks!

-Dan