How to separate ML ingestion workload from search workload

OS 2.11.0

How can I separate the ML ingestion workload from the search workload on the same index? I have 2 master nodes, 2 data nodes, and 2 (ingest+ML) nodes.

I created both sparse and dense indices and noticed very high CPU utilization during ingestion, which drastically lowered search speed on the index.

Any recommendations?

Also, is there a way to force ingestion pipelines to use specific ML nodes so that I always have free ML nodes for search workloads?

Thanks

Hi @asfoorial, you can set the cluster setting plugins.ml_commons.allow_custom_deployment_plan to true, and then deploy the model to specific nodes. That can help shift some of the ML model workload. But the ingestion and search workloads seem hard to split completely, since they depend on the same index.
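
For reference, a rough sketch of the two calls involved (the model ID and node ID below are placeholders you would replace with your own values):

```
# Allow custom deployment plans so node_ids can be passed at deploy time
PUT _cluster/settings
{
  "persistent": {
    "plugins.ml_commons.allow_custom_deployment_plan": true
  }
}

# Find the full IDs of your ML nodes
GET _cat/nodes?v&full_id=true&h=id,name,node.roles

# Deploy the model only to the chosen ML node(s)
POST /_plugins/_ml/models/<your_model_id>/_deploy
{
  "node_ids": ["<ml-node-id-1>"]
}
```

Since inference requests are routed to the nodes where the model is deployed, an ingest pipeline that calls this model should then only load the node(s) you picked, leaving the other ML node free.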