High rate of queries to .plugins-ml-model. Is this normal?

sousu · June 20, 2025, 8:24pm

Versions :

OpenSearch: 2.19
Server: Amazon OpenSearch Server

Describe the issue:

Since we enabled the ml-plugin and deployed a model we see a high usage of 8 query per minute even on inactive hours like when no user is performing actions. Before enabling the plugin the search rate would be 0 or near 0 on inactive hours.

Checking the Query Insight plugin I can see multiple queries per minute to the .plugins-ml-model index. All of them are the same:

{
  "query": {
    "size": 10000,
    "query": {
      "bool": {
        "filter": [
          {
            "terms": {
              "model_state": [
                "LOADING",
                "PARTIALLY_LOADED",
                "LOADED",
                "LOAD_FAILED",
                "DEPLOYING",
                "PARTIALLY_DEPLOYED",
                "DEPLOYED",
                "DEPLOY_FAILED"
              ],
              "boost": 1
            }
          }
        ],
        "adjust_pure_negative": true,
        "boost": 1
      }
    },
    "_source": {
      "includes": [
        "tenant_id",
        "model_state",
        "algorithm",
        "deploy_to_all_nodes",
        "planning_worker_nodes",
        "planning_worker_node_count",
        "last_updated_time",
        "current_worker_node_count"
      ],
      "excludes": []
    }
  }
}

Is this behavior normal? Is there a way to change the rate of this query? I don’t understand why would it be necessary even on a not so used cluster to inquiry something so regularly. If it’s not normal behavior what could be causing it?

Relevant Logs or Screenshots:

(Sorry for the clumped screenshot but as a new user I’m only permitted to post a single image per post… Which undermines my ability to provide good information and get help… But okay…)

dhrubo · June 20, 2025, 8:55pm

@pybot could you please take a look into this? Thanks.

pybot · June 20, 2025, 9:01pm

@sousu

Do you have the exact query? This would be helpful to debug. This is definitely not normal or expected.

I’m also curious what actions are you using the model for and if you triggered a batch_predict?

sousu · June 23, 2025, 2:48pm

sousu:

{
  "query": {
    "size": 10000,
    "query": {
      "bool": {
        "filter": [
          {
            "terms": {
              "model_state": [
                "LOADING",
                "PARTIALLY_LOADED",
                "LOADED",
                "LOAD_FAILED",
                "DEPLOYING",
                "PARTIALLY_DEPLOYED",
                "DEPLOYED",
                "DEPLOY_FAILED"
              ],
              "boost": 1
            }
          }
        ],
        "adjust_pure_negative": true,
        "boost": 1
      }
    },
    "_source": {
      "includes": [
        "tenant_id",
        "model_state",
        "algorithm",
        "deploy_to_all_nodes",
        "planning_worker_nodes",
        "planning_worker_node_count",
        "last_updated_time",
        "current_worker_node_count"
      ],
      "excludes": []
    }
  }
}

@pybot The query that is being performed repetitively is this one.

The model is used to vectorize query on search and ingest pipeline for indexing the vectors. We use a opensearch available pre-trained model. We did not explicitly use batch_predict I’m not sure if anything could trigger it indirectly though.

pablo · June 24, 2025, 4:54pm

@sousu This could be caused by a sync-up job.

sousu · June 24, 2025, 7:23pm

I have not configured this manually when deploying the model nor did I made any change to my amazon opensearch config. As this have a default value of 3 shouldn’t it be a problem for every user? So is this normal behavior?

also what are the impacts of disabling it? Should I not disable it?

pybot · June 24, 2025, 8:08pm

@sousu Can confirm it is the syncup job, looping in @Xun to share more details

PS: the query runs every 10 secs, the documentation is wrong, will get this fixed

(ml-commons/plugin/src/main/java/org/opensearch/ml/settings/MLCommonsSettings.java at e681c58ad88402b8528b05bfe0674e6ae9c4b529 · opensearch-project/ml-commons · GitHub)

Xun · June 24, 2025, 8:13pm

@sousu - this SyncUp job is running to maintain your ml model status across the cluster in data nodes. By default it’s running every 10 seconds. In each run, it would query the ml-model index to get the model status and sync up the status for all the nodes. This is an expected behavior by design. If this query bothers you, please increase the interval through this setting plugins.ml_commons.sync_up_job_interval_in_seconds. ML Commons cluster settings - OpenSearch Documentation

sousu · June 24, 2025, 8:19pm

I configured it to 0 and can confirm that query rate immediately dropped. I have a 1 node cluster so as I understand I don’t need to sync it? So I could have it disable with no drawbacks? Maybe it could be disabled by default when discovery.type is set to single-node?

dhrubo · June 25, 2025, 4:36pm

@sousu if you don’t mind may be you can cut an issue: GitHub · Where software is built

Topic		Replies	Views
[Feedback] Machine Learning Model Serving Framework - Experimental Release General Feedback releases	48	3004	July 12, 2023
Model is Partially responding in case of non-ML node restart Machine Learning	7	452	February 16, 2024
Opensearch remote model connection error Machine Learning	4	387	September 25, 2024
Performance and scaling of ML models and dense vector data Machine Learning discuss	6	715	May 12, 2023
[Feedback] ML Commons: ML Model Health Dashboard for Admins - Experimental Release Request For Comments releases	5	1123	May 4, 2023

High rate of queries to .plugins-ml-model. Is this normal?

Related topics