I have recently moved from elasticsearch to opensearch and have noticed about 25% of documents are getting lost on a high throughput cluster. I also changed from indexing to alias to using streams. I let document IDs be autogenerated.
Lower cluster volume seems to be fine. There are occasional logs missing there too.
I do occasionally see failures due to duplicate IDs which should not be happening as I don’t define the ID usually.
@doug_f I’ve seen this ID issue before when Security Analytics monitors had over 30 mappings. Plugin was creating plenty of fan_outs tasks and cluster was falling apart even when Java heap size was at 32 GB and CPU nodes weren’t too busy.
Also plugin was creating a lot of long running fanout tasks.