Versions: OpenSearch 2.0.6
Describe the issue:
Hi all, after a system crash of the VMs hosting 4 of the 8 nodes in our OpenSearch cluster, the cluster cannot be restored. Every node keeps logging a message that it cannot find an eligible master (cluster-manager) node, even though the crashed nodes are already back up. Is there a way to reset and re-initialize the cluster without losing index data?
Configuration:
8 nodes on Docker Swarm, hosted on 8 Red Hat virtual machines
Relevant Logs or Screenshots:
[2023-12-20T17:06:12,526][WARN ][o.o.c.c.ClusterFormationFailureHelper] [opensearch-node138] cluster-manager not discovered or elected yet, an election requires at least 3 nodes with ids from [DLX4Sb2NQVuUJsk0mki_Cw, MJOjIGghSY2tDvgSKzsXow, C-RQbYb_SEWelFWkfIZ4TA, pRAaw5rVRtyyI12PdpkxBQ, lKWeKuRST8q_PdvHZkbmAQ], have discovered [{opensearch-node138}{MJOjIGghSY2tDvgSKzsXow}{Srqu7VSgSOG5Ica1mw1xYQ}{10.0.4.87}{10.0.4.87:9300}{dimr}{zone=zoneA, shard_indexing_pressure_enabled=true}, {opensearch-node192}{C-RQbYb_SEWelFWkfIZ4TA}{msr8_cebSt2NLzIN0MNhFA}{10.0.4.92}{10.0.4.92:9300}{dimr}{zone=zoneB, shard_indexing_pressure_enabled=true}, {opensearch-node193}{pRAaw5rVRtyyI12PdpkxBQ}{wvDIA94qSPqEhO6vOv-v9A}{10.0.4.96}{10.0.4.96:9300}{dimr}{zone=zoneB, shard_indexing_pressure_enabled=true}, {opensearch-node194}{DLX4Sb2NQVuUJsk0mki_Cw}{JhxEioE6QbmIUY7uRjibOg}{10.0.4.81}{10.0.4.81:9300}{dimr}{zone=zoneB, shard_indexing_pressure_enabled=true}, {opensearch-node195}{lKWeKuRST8q_PdvHZkbmAQ}{7ct5u7F1Sd-ZoQL_7TWpfg}{10.0.4.76}{10.0.4.76:9300}{dimr}{zone=zoneB, shard_indexing_pressure_enabled=true}, {opensearch-node137}{0DEpK09tR4ayvq2IehcSgg}{SyHyfl1bQaCzEjmCoQxx9A}{10.0.4.94}{10.0.4.94:9300}{dimr}{zone=zoneA, shard_indexing_pressure_enabled=true}, {opensearch-node135}{qAxlwiG5Qd6hgObvsE436g}{oycqYEhsTMmyOUy7VteXbg}{10.0.4.85}{10.0.4.85:9300}{dimr}{zone=zoneA, shard_indexing_pressure_enabled=true}, {opensearch-node136}{dnp53JsUQ0qfaRi6y82IkQ}{32uYFHwKTkGO4I026sMqLw}{10.0.4.83}{10.0.4.83:9300}{dimr}{zone=zoneA, shard_indexing_pressure_enabled=true}] which is a quorum; discovery will continue using [10.0.4.84:9300, 10.0.4.82:9300, 10.0.4.93:9300, 10.0.4.86:9300, 10.0.4.91:9300, 10.0.4.95:9300, 10.0.4.80:9300, 10.0.4.75:9300] from hosts providers and [{opensearch-node138}{MJOjIGghSY2tDvgSKzsXow}{Srqu7VSgSOG5Ica1mw1xYQ}{10.0.4.87}{10.0.4.87:9300}{dimr}{zone=zoneA, shard_indexing_pressure_enabled=true}] from last-known cluster state; node term 77, last-accepted version 1708 in term 77
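To sanity-check the "which is a quorum" part of the log above, here is a minimal Python sketch. It assumes an election needs a simple majority of the listed voting IDs, which matches the "at least 3 nodes" out of 5 in the message; the helper name `has_quorum` is hypothetical, and all IDs are copied from the log line.

```python
# Voting configuration: the IDs listed after "with ids from" in the log.
voting_ids = {
    "DLX4Sb2NQVuUJsk0mki_Cw",
    "MJOjIGghSY2tDvgSKzsXow",
    "C-RQbYb_SEWelFWkfIZ4TA",
    "pRAaw5rVRtyyI12PdpkxBQ",
    "lKWeKuRST8q_PdvHZkbmAQ",
}

# Node IDs listed after "have discovered" in the log.
discovered_ids = {
    "MJOjIGghSY2tDvgSKzsXow",  # opensearch-node138
    "C-RQbYb_SEWelFWkfIZ4TA",  # opensearch-node192
    "pRAaw5rVRtyyI12PdpkxBQ",  # opensearch-node193
    "DLX4Sb2NQVuUJsk0mki_Cw",  # opensearch-node194
    "lKWeKuRST8q_PdvHZkbmAQ",  # opensearch-node195
    "0DEpK09tR4ayvq2IehcSgg",  # opensearch-node137
    "qAxlwiG5Qd6hgObvsE436g",  # opensearch-node135
    "dnp53JsUQ0qfaRi6y82IkQ",  # opensearch-node136
}


def has_quorum(voting, discovered):
    """Return (ok, required, present) under an assumed simple-majority rule."""
    required = len(voting) // 2 + 1      # 5 voting nodes -> 3 required
    present = voting & discovered        # voting nodes that were discovered
    return len(present) >= required, required, sorted(present)


ok, required, present = has_quorum(voting_ids, discovered_ids)
print(f"required={required}, present={len(present)}, quorum={ok}")
```

By this check all 5 voting nodes are among the 8 discovered ones, so the node count alone does not explain why no cluster-manager gets elected.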