[RFC] neural sparse models improvement plan

grunt-solaces.0h · May 23, 2024, 8:25am

Our main pain points are ingestion throughput (1) and search latency (3). Both are mainly due to the shared thread pool in OpenSearch (OS).

Another pain point was 4b: it was not obvious from the docs that the model cannot be deployed inside OS and that we need to deploy it in SageMaker instead.

Relevant forum threads with additional details:

Thanks for working on this!

Topic	Category	Replies	Views	Activity
How to scale neural sparse ingestion pipeline	OpenSearch	3	340	May 7, 2024
Default search pipeline support with sparse encoding indices	Machine Learning	3	335	December 26, 2023
How to register sparse encoding model in AWS OpenSearch	Machine Learning (troubleshoot)	15	1987	April 21, 2024
Model weights for sparse encoders	Machine Learning	20	629	February 9, 2024
Provided Text Chunking Example fails with Neural Sparse!	OpenSearch	0	59	May 9, 2025