Versions (relevant - OpenSearch/Dashboard/Server OS/Browser):
Opensearch 2.8 / Red Hat 8.8
Describe the issue:
Hello
I have discovered a strange behaviour of the alerting module of Opensearch 2.8. We have a monitor “per bucket” that check the status of an item on every monitored hosts of our system. Every host can have multiple different items. We want to have an alert by item and by host, that’s why we choose the “per bucket” monitor.
When the number of couple host/item (a bucket if i’m correct) goes over 500 at a time (execution of the monitor every minute), then the number of active alerts increase indefinitely.
When there is less than 500 couples, the number of active alerts is exactly the number of couples (which is correct, what we want) and the number doesn’t evolve (until the number of couples evolve too)
So, under 500 couples (buckets), the behaviour is correct, but over 500 buckets, the alerting module start to get mad and increase the number of alerts without explainations (and without error messages).
Is there a parameter somewhere that set that limit of 500 buckets ?
I tried to find something in the advanced settings of the stack management module, wihout success. I also changed the max_compilations_rate (1000/1m) and script.cache.max_size to 1000, but it doesn’t change the limit of 500 buckets max
Does anyone have a solution for this ?