How to register a sparse encoding model in AWS OpenSearch

Hi @xinyual,

Thanks for the answer!

We were wondering about the thread number for the following reasons:

  1. We switched to a larger machine with more CPUs, and as a result the ingestion pipeline seems to send more documents for inference in parallel
  2. At first we used a CPU instance for inference, and it was underutilised: it was processing only X documents at a time, where X seemed to match the number of CPUs in the OpenSearch instance
  3. We then switched to a GPU instance for inference, which is again underutilised. It does the inference faster per document, but the bottleneck still seems to be the ingestion pipeline, which doesn’t send as many documents to the SageMaker instance as it is capable of handling
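In case it helps, one knob we have been looking at is the connector's HTTP client configuration. If we have read the connector blueprint docs correctly, `client_config.max_connection` caps how many concurrent requests ML Commons will send to the remote endpoint, which could explain the underutilisation we are seeing. A rough sketch of a SageMaker connector body with that setting raised (region, role ARN, endpoint name, and request body are placeholders, not our actual values):

```json
POST /_plugins/_ml/connectors/_create
{
  "name": "sagemaker-sparse-encoding-connector",
  "description": "Connector to a SageMaker sparse encoding endpoint",
  "version": 1,
  "protocol": "aws_sigv4",
  "credential": {
    "roleArn": "arn:aws:iam::<account-id>:role/<role-name>"
  },
  "parameters": {
    "region": "us-east-1",
    "service_name": "sagemaker"
  },
  "client_config": {
    "max_connection": 100,
    "connection_timeout": 5000,
    "read_timeout": 60000
  },
  "actions": [
    {
      "action_type": "predict",
      "method": "POST",
      "url": "https://runtime.sagemaker.us-east-1.amazonaws.com/endpoints/<endpoint-name>/invocations",
      "headers": {
        "content-type": "application/json"
      },
      "request_body": "..."
    }
  ]
}
```

We haven't confirmed yet whether raising `max_connection` alone unblocks the pipeline, or whether the ingest-side parallelism (bulk request concurrency on the OpenSearch node) is the real limit, so treat this as a sketch rather than a verified fix.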