Muti-variate Anomaly Detection

Kenton · September 20, 2022, 5:20pm

Hi all,

My team and I are looking into OpenSearch anomaly detection and have some questions we hoped the community could help with.

We have event data with a structure similar to:

{"timestamp": "2022-09-20T15:00:00.000Z", "eventType": "click", "component": "some.api.name"}

We had intended on defining eventType and component as OpenSearch anomaly detection features but realized that the values contained for each field are not considered.

Am I right in saying that we would need to define a detector and model for each of the following:

{"timestamp": "...", "eventType": "click", "component": "component.A"}

{"timestamp": "...", "eventType": "search", "component": "component.A"}

{"timestamp": "...", "eventType": "click", "component": "component.B"}

{"timestamp": "...", "eventType": "search", "component": "component.B"}

Are there other approaches that are more scalable that aggregate on distinct values for each feature?

Thanks for the help!

kris · October 17, 2022, 4:04pm

(admin) Moved to machine-learning sub-category.

@ylwu - could you or the team help on this?

ylwu · October 17, 2022, 7:24pm

@Kenton Thanks for your question. Not sure if I understand this correctly “defining eventType and component as OpenSearch anomaly detection features”. As the “eventType” and “component” are not numeric type, so you want to use count of these values as features?

a.emrekaraman · January 29, 2023, 9:35pm

Hi,

Did you find any solution? I’m looking same kind of solution

kaituo · January 30, 2023, 11:25pm

You can specify eventType and component as categorical fields. We will aggregate on distinct values of eventType and component and create separate models.

Topic		Replies	Views
Derivative Features for Anomaly Detection Plugins Machine Learning	2	393	August 11, 2023
Anomaly Detection and Pattern Analysis with Categorical Data in OpenSearch General Feedback	3	59	October 18, 2024
Anomaly detection with term aggregations Machine Learning	2	1349	June 4, 2020
Real Time Anomaly Detection in Open Distro for Elasticsearch \| Open Distro General Feedback	2	956	April 7, 2020
Anomaly detection not working with simple sample data Machine Learning anomaly-detection	0	19	April 10, 2025

Muti-variate Anomaly Detection

Related topics