Hi all,
I was wondering how to deploy embedding models generated using MinishLab/model2vec: Model2Vec: Distill a Small Fast Model from any Sentence Transformer (github.com) . They are supported by SentenceTransformers Release v3.2.0 - ONNX and OpenVINO backends offering 2-3x speedup; Static Embeddings offering 50x-500x speedups at ~10-20% performance cost · UKPLab/sentence-transformers (github.com) and they are very efficient in terms of speed and with small loss in accuracy.