How to deploy Model2Vec Embedding Models

asfoorial · October 11, 2024, 3:39pm

Hi all,

I was wondering how to deploy embedding models generated using MinishLab/model2vec: Model2Vec: Distill a Small Fast Model from any Sentence Transformer (github.com) . They are supported by SentenceTransformers Release v3.2.0 - ONNX and OpenVINO backends offering 2-3x speedup; Static Embeddings offering 50x-500x speedups at ~10-20% performance cost · UKPLab/sentence-transformers (github.com) and they are very efficient in terms of speed and with small loss in accuracy.

dhrubo · October 31, 2024, 8:18pm

We don’t have any pre-trained traced model support for this model. You can use follow this notebook to trace your model and register in opensearch. If you want any pre-trained traced model support from opensearch, please cut an issue to either ml-commons or opensearch-py-ml repos

Topic		Replies	Views
Which rerank models are supported by OS 2.12.0 and how to deploy them? Machine Learning	5	524	May 2, 2024
Help Needed: Fine-Tuning and Deploying a Model into OpenSearch Machine Learning discuss , troubleshoot , configure , install	2	177	August 12, 2024
The new sparse encoding model is not deployable Machine Learning	4	42	October 30, 2024
How can we deploy ML model (.zip) to nodes locally, not via SSL or the firewall OpenSearch discuss , troubleshoot , configure , install	8	194	August 19, 2024
Uploading a sentence transformer model of medical domain to OpenSearch Machine Learning troubleshoot	2	371	January 29, 2024

How to deploy Model2Vec Embedding Models

Related topics