Versions (relevant - OpenSearch/Dashboard/Server OS/Browser):
OpenSearch 2.8.0 and OpenSearch Dashboards 2.8.0
Describe the issue:
Hi All,
We are trying to deploy OpenSearch on our multi-node Docker Swarm cluster. There are three physical hosts in total, but no matter what I do, I can't get the nodes to join the cluster. If I deploy all three containers on a single host, everything works as expected, but once I split them out across the hosts to balance the load, they will not form a quorum. I have looked over some of the other threads on this, namely "Opensearch with multiple nodes on different servers not working", and tried the solutions suggested there, but with no success.
Can someone look over my config below and let me know where I'm going wrong?
The stack is deployed with:
docker stack deploy opensearch -c opensearch_docker_compose.yml
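If it's useful, here is how the placement can be verified after deploying (standard Swarm commands; the opensearch_ service prefix comes from the stack name above):

# list the services in the stack and their replica counts
docker stack services opensearch

# show which swarm node each task was scheduled on
docker service ps opensearch_os-node1
docker service ps opensearch_os-node2
docker service ps opensearch_os-node3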
Configuration:
---
version: "3.4"
services:
  ###########################################
  # OpenSearch Start
  ###########################################
  os-node1:
    image: opensearchproject/opensearch:2.8.0
    container_name: opensearch-node1
    environment:
      - cluster.name=opensearch-cluster
      - node.name=os-node1
      - discovery.seed_hosts=os-node1,os-node2,os-node3
      - cluster.initial_master_nodes=os-node1,os-node2,os-node3
      - bootstrap.memory_lock=true
      - "OPENSEARCH_JAVA_OPTS=-Xms1g -Xmx1g"
      - "DISABLE_INSTALL_DEMO_CONFIG=true"
      - "DISABLE_SECURITY_PLUGIN=true"
    ulimits:
      memlock:
        soft: -1
        hard: -1
      nofile:
        soft: 65536
        hard: 65536
    volumes:
      - opensearch-data1:/usr/share/opensearch/data
    ports:
      - 9200:9200
      - 9300:9300
      - 9600:9600
    networks:
      - opensearch_net
      - internal_prod
    deploy:
      placement:
        constraints:
          - node.hostname == cube-node-1
  os-node2:
    image: opensearchproject/opensearch:2.8.0
    container_name: opensearch-node2
    environment:
      - cluster.name=opensearch-cluster
      - node.name=os-node2
      - discovery.seed_hosts=os-node1,os-node2,os-node3
      - cluster.initial_master_nodes=os-node1,os-node2,os-node3
      - bootstrap.memory_lock=true
      - "OPENSEARCH_JAVA_OPTS=-Xms1g -Xmx1g"
      - "DISABLE_INSTALL_DEMO_CONFIG=true"
      - "DISABLE_SECURITY_PLUGIN=true"
    ulimits:
      memlock:
        soft: -1
        hard: -1
      nofile:
        soft: 65536
        hard: 65536
    volumes:
      - opensearch-data2:/usr/share/opensearch/data
    networks:
      - opensearch_net
    deploy:
      placement:
        constraints:
          - node.hostname == cube-node-2
  os-node3:
    image: opensearchproject/opensearch:2.8.0
    container_name: opensearch-node3
    environment:
      - cluster.name=opensearch-cluster
      - node.name=os-node3
      - discovery.seed_hosts=os-node1,os-node2,os-node3
      - cluster.initial_master_nodes=os-node1,os-node2,os-node3
      - bootstrap.memory_lock=true
      - "OPENSEARCH_JAVA_OPTS=-Xms1g -Xmx1g"
      - "DISABLE_INSTALL_DEMO_CONFIG=true"
      - "DISABLE_SECURITY_PLUGIN=true"
    ulimits:
      memlock:
        soft: -1
        hard: -1
      nofile:
        soft: 65536
        hard: 65536
    volumes:
      - opensearch-data3:/usr/share/opensearch/data
    networks:
      - opensearch_net
    deploy:
      placement:
        constraints:
          - node.hostname == cube-node-3
  ###########################################
  # OpenSearch End
  ###########################################
  ###########################################
  # OpenSearch Dashboards Start
  ###########################################
  opensearch-dashboards:
    image: opensearchproject/opensearch-dashboards:2.8.0
    container_name: opensearch-dashboards
    ports:
      - 5601:5601
    expose:
      - "5601"
    environment:
      - 'OPENSEARCH_HOSTS=["http://os-node1:9200","http://os-node2:9200","http://os-node3:9200"]'
      - "DISABLE_SECURITY_DASHBOARDS_PLUGIN=false"
    networks:
      - opensearch_net
      - internal_prod
  ###########################################
  # OpenSearch Dashboards End
  ###########################################
volumes:
  opensearch-data1:
  opensearch-data2:
  opensearch-data3:
networks:
  opensearch_net:
    external: true
  internal_prod:
    external: true
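One thing that stands out in the logs below: os-node1 advertises a 10.0.0.x transport address while os-node2 and os-node3 advertise 10.0.10.x addresses, and the seed hosts resolve to yet other addresses (10.0.12.24, 10.0.10.21, 10.0.10.13). os-node1 is the only OpenSearch service attached to both overlay networks, and its published ports also attach it to the swarm ingress network, so it has several interfaces to choose from when publishing. A possible adjustment would be to pin the publish address per node, something like the sketch below (untested; the dotted setting is passed through to opensearch.yml the same way as discovery.seed_hosts above, but _eth0_ is a guess, since overlay interface ordering inside the container isn't guaranteed):

    environment:
      # Untested sketch: pin the transport publish address to the
      # interface on opensearch_net so peers dial a reachable IP.
      # _eth0_ is an assumption -- check `ip addr` inside the task.
      - network.publish_host=_eth0_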
Relevant Logs or Screenshots:
Node 1
[2023-07-25T02:56:49,650][WARN ][o.o.c.c.ClusterFormationFailureHelper] [os-node1] cluster-manager not discovered or elected yet, an election requires 2 nodes with ids [I4dTdjuzRrC_spgVMveBHw, P9h3-UXnSTGnsdPSw8IzYQ], have discovered [{os-node1}{I4dTdjuzRrC_spgVMveBHw}{wfIBFc5dS1mGVmbjqQUFUw}{10.0.0.24}{10.0.0.24:9300}{dimr}{shard_indexing_pressure_enabled=true}, {os-node2}{P9h3-UXnSTGnsdPSw8IzYQ}{GLuWLKrDRYK9QIAlIQnezA}{10.0.10.22}{10.0.10.22:9300}{dimr}{shard_indexing_pressure_enabled=true}, {os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true}] which is a quorum; discovery will continue using [10.0.12.24:9300, 10.0.10.21:9300, 10.0.10.13:9300] from hosts providers and [{os-node1}{I4dTdjuzRrC_spgVMveBHw}{wfIBFc5dS1mGVmbjqQUFUw}{10.0.0.24}{10.0.0.24:9300}{dimr}{shard_indexing_pressure_enabled=true}] from last-known cluster state; node term 2, last-accepted version 0 in term 0
[2023-07-25T02:56:59,650][WARN ][o.o.c.c.ClusterFormationFailureHelper] [os-node1] cluster-manager not discovered or elected yet, an election requires 2 nodes with ids [I4dTdjuzRrC_spgVMveBHw, P9h3-UXnSTGnsdPSw8IzYQ], have discovered [{os-node1}{I4dTdjuzRrC_spgVMveBHw}{wfIBFc5dS1mGVmbjqQUFUw}{10.0.0.24}{10.0.0.24:9300}{dimr}{shard_indexing_pressure_enabled=true}, {os-node2}{P9h3-UXnSTGnsdPSw8IzYQ}{GLuWLKrDRYK9QIAlIQnezA}{10.0.10.22}{10.0.10.22:9300}{dimr}{shard_indexing_pressure_enabled=true}, {os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true}] which is a quorum; discovery will continue using [10.0.12.24:9300, 10.0.10.21:9300, 10.0.10.13:9300] from hosts providers and [{os-node1}{I4dTdjuzRrC_spgVMveBHw}{wfIBFc5dS1mGVmbjqQUFUw}{10.0.0.24}{10.0.0.24:9300}{dimr}{shard_indexing_pressure_enabled=true}] from last-known cluster state; node term 2, last-accepted version 0 in term 0
[2023-07-25T02:57:09,651][WARN ][o.o.c.c.ClusterFormationFailureHelper] [os-node1] cluster-manager not discovered or elected yet, an election requires 2 nodes with ids [I4dTdjuzRrC_spgVMveBHw, P9h3-UXnSTGnsdPSw8IzYQ], have discovered [{os-node1}{I4dTdjuzRrC_spgVMveBHw}{wfIBFc5dS1mGVmbjqQUFUw}{10.0.0.24}{10.0.0.24:9300}{dimr}{shard_indexing_pressure_enabled=true}, {os-node2}{P9h3-UXnSTGnsdPSw8IzYQ}{GLuWLKrDRYK9QIAlIQnezA}{10.0.10.22}{10.0.10.22:9300}{dimr}{shard_indexing_pressure_enabled=true}, {os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true}] which is a quorum; discovery will continue using [10.0.12.24:9300, 10.0.10.21:9300, 10.0.10.13:9300] from hosts providers and [{os-node1}{I4dTdjuzRrC_spgVMveBHw}{wfIBFc5dS1mGVmbjqQUFUw}{10.0.0.24}{10.0.0.24:9300}{dimr}{shard_indexing_pressure_enabled=true}] from last-known cluster state; node term 2, last-accepted version 0 in term 0
[2023-07-25T02:57:11,341][INFO ][o.o.c.c.JoinHelper ] [os-node1] failed to join {os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true} with JoinRequest{sourceNode={os-node1}{I4dTdjuzRrC_spgVMveBHw}{wfIBFc5dS1mGVmbjqQUFUw}{10.0.0.24}{10.0.0.24:9300}{dimr}{shard_indexing_pressure_enabled=true}, minimumTerm=2, optionalJoin=Optional.empty}
org.opensearch.transport.RemoteTransportException: [os-node3][10.0.10.14:9300][internal:cluster/coordination/join]
Caused by: org.opensearch.transport.ConnectTransportException: [os-node1][10.0.0.24:9300] connect_timeout[30s]
at org.opensearch.transport.TcpTransport$ChannelsConnectedListener.onTimeout(TcpTransport.java:1082) ~[opensearch-2.8.0.jar:2.8.0]
at org.opensearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:747) ~[opensearch-2.8.0.jar:2.8.0]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) ~[?:?]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) ~[?:?]
at java.lang.Thread.run(Thread.java:833) [?:?]
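The connect_timeout above is against node 1's advertised address, 10.0.0.24:9300. A raw TCP check from inside one of the other node containers would confirm whether that address is reachable at all over the overlay (hypothetical container lookup; assuming curl is present in the image):

# find the task container for os-node3 on its host
docker ps --filter name=os-node3

# curl's telnet scheme just opens the socket; a hang here
# reproduces the connect timeout seen in the log
docker exec -it <container-id> curl -v telnet://10.0.0.24:9300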
Node 2
[2023-07-25T02:14:10,813][INFO ][o.o.n.Node ] [os-node2] initialized
[2023-07-25T02:14:10,813][INFO ][o.o.n.Node ] [os-node2] starting ...
[2023-07-25T02:14:10,895][INFO ][o.o.t.TransportService ] [os-node2] publish_address {10.0.10.22:9300}, bound_addresses {0.0.0.0:9300}
[2023-07-25T02:14:10,997][INFO ][o.o.b.BootstrapChecks ] [os-node2] bound or publishing to a non-loopback address, enforcing bootstrap checks
[2023-07-25T02:14:15,227][INFO ][o.o.c.c.JoinHelper ] [os-node2] failed to join {os-node1}{I4dTdjuzRrC_spgVMveBHw}{A8n71U64R0iwJmaV5S92hw}{10.0.0.23}{10.0.0.23:9300}{dimr}{shard_indexing_pressure_enabled=true} with JoinRequest{sourceNode={os-node2}{P9h3-UXnSTGnsdPSw8IzYQ}{GLuWLKrDRYK9QIAlIQnezA}{10.0.10.22}{10.0.10.22:9300}{dimr}{shard_indexing_pressure_enabled=true}, minimumTerm=0, optionalJoin=Optional[Join{term=1, lastAcceptedTerm=0, lastAcceptedVersion=0, sourceNode={os-node2}{P9h3-UXnSTGnsdPSw8IzYQ}{GLuWLKrDRYK9QIAlIQnezA}{10.0.10.22}{10.0.10.22:9300}{dimr}{shard_indexing_pressure_enabled=true}, targetNode={os-node1}{I4dTdjuzRrC_spgVMveBHw}{A8n71U64R0iwJmaV5S92hw}{10.0.0.23}{10.0.0.23:9300}{dimr}{shard_indexing_pressure_enabled=true}}]}
org.opensearch.transport.NodeNotConnectedException: [os-node1][10.0.0.23:9300] Node not connected
at org.opensearch.transport.ClusterConnectionManager.getConnection(ClusterConnectionManager.java:206) ~[opensearch-2.8.0.jar:2.8.0]
at org.opensearch.transport.TransportService.getConnection(TransportService.java:904) ~[opensearch-2.8.0.jar:2.8.0]
at org.opensearch.transport.TransportService.sendRequest(TransportService.java:820) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.cluster.coordination.JoinHelper.sendJoinRequest(JoinHelper.java:335) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.cluster.coordination.JoinHelper.sendJoinRequest(JoinHelper.java:263) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.cluster.coordination.JoinHelper.lambda$new$2(JoinHelper.java:201) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.indexmanagement.rollup.interceptor.RollupInterceptor$interceptHandler$1.messageReceived(RollupInterceptor.kt:113) [opensearch-index-management-2.8.0.0.jar:2.8.0.0]
at org.opensearch.performanceanalyzer.transport.PerformanceAnalyzerTransportRequestHandler.messageReceived(PerformanceAnalyzerTransportRequestHandler.java:43) [opensearch-performance-analyzer-2.8.0.0.jar:2.8.0.0]
at org.opensearch.transport.RequestHandlerRegistry.processMessageReceived(RequestHandlerRegistry.java:106) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.transport.InboundHandler$RequestHandler.doRun(InboundHandler.java:453) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:806) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:52) [opensearch-2.8.0.jar:2.8.0]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) [?:?]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) [?:?]
at java.lang.Thread.run(Thread.java:833) [?:?]
[2023-07-25T02:14:15,238][INFO ][o.o.c.c.JoinHelper ] [os-node2] failed to join {os-node1}{I4dTdjuzRrC_spgVMveBHw}{A8n71U64R0iwJmaV5S92hw}{10.0.0.23}{10.0.0.23:9300}{dimr}{shard_indexing_pressure_enabled=true} with JoinRequest{sourceNode={os-node2}{P9h3-UXnSTGnsdPSw8IzYQ}{GLuWLKrDRYK9QIAlIQnezA}{10.0.10.22}{10.0.10.22:9300}{dimr}{shard_indexing_pressure_enabled=true}, minimumTerm=0, optionalJoin=Optional[Join{term=1, lastAcceptedTerm=0, lastAcceptedVersion=0, sourceNode={os-node2}{P9h3-UXnSTGnsdPSw8IzYQ}{GLuWLKrDRYK9QIAlIQnezA}{10.0.10.22}{10.0.10.22:9300}{dimr}{shard_indexing_pressure_enabled=true}, targetNode={os-node1}{I4dTdjuzRrC_spgVMveBHw}{A8n71U64R0iwJmaV5S92hw}{10.0.0.23}{10.0.0.23:9300}{dimr}{shard_indexing_pressure_enabled=true}}]}
org.opensearch.transport.NodeNotConnectedException: [os-node1][10.0.0.23:9300] Node not connected
at org.opensearch.transport.ClusterConnectionManager.getConnection(ClusterConnectionManager.java:206) ~[opensearch-2.8.0.jar:2.8.0]
at org.opensearch.transport.TransportService.getConnection(TransportService.java:904) ~[opensearch-2.8.0.jar:2.8.0]
at org.opensearch.transport.TransportService.sendRequest(TransportService.java:820) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.cluster.coordination.JoinHelper.sendJoinRequest(JoinHelper.java:335) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.cluster.coordination.JoinHelper.sendJoinRequest(JoinHelper.java:263) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.cluster.coordination.JoinHelper.lambda$new$2(JoinHelper.java:201) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.indexmanagement.rollup.interceptor.RollupInterceptor$interceptHandler$1.messageReceived(RollupInterceptor.kt:113) [opensearch-index-management-2.8.0.0.jar:2.8.0.0]
at org.opensearch.performanceanalyzer.transport.PerformanceAnalyzerTransportRequestHandler.messageReceived(PerformanceAnalyzerTransportRequestHandler.java:43) [opensearch-performance-analyzer-2.8.0.0.jar:2.8.0.0]
at org.opensearch.transport.RequestHandlerRegistry.processMessageReceived(RequestHandlerRegistry.java:106) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.transport.InboundHandler$RequestHandler.doRun(InboundHandler.java:453) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:806) [opensearch-2.8.0.jar:2.8.0]
at org.opensearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:52) [opensearch-2.8.0.jar:2.8.0]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) [?:?]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) [?:?]
at java.lang.Thread.run(Thread.java:833) [?:?]
[2023-07-25T02:14:15,482][INFO ][o.o.c.c.Coordinator ] [os-node2] setting initial configuration to VotingConfiguration{{bootstrap-placeholder}-os-node1,bVYnmh1NRA-wo7jxLqcSGA,P9h3-UXnSTGnsdPSw8IzYQ}
[2023-07-25T02:14:15,576][INFO ][o.o.c.c.CoordinationState] [os-node2] cluster UUID set to [vPTgjomxR4OufpYvcyz8RA]
[2023-07-25T02:14:15,597][INFO ][o.o.c.s.ClusterApplierService] [os-node2] cluster-manager node changed {previous [], current [{os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true}]}, added {{os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true}}, term: 2, version: 1, reason: ApplyCommitRequest{term=2, version=1, sourceNode={os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true}}
[2023-07-25T02:14:15,600][INFO ][o.o.a.c.ADClusterEventListener] [os-node2] Cluster is not recovered yet.
Node 3
[2023-07-25T02:14:15,223][INFO ][o.o.n.Node ] [os-node3] initialized
[2023-07-25T02:14:15,223][INFO ][o.o.n.Node ] [os-node3] starting ...
[2023-07-25T02:14:15,299][INFO ][o.o.t.TransportService ] [os-node3] publish_address {10.0.10.14:9300}, bound_addresses {0.0.0.0:9300}
[2023-07-25T02:14:15,402][INFO ][o.o.b.BootstrapChecks ] [os-node3] bound or publishing to a non-loopback address, enforcing bootstrap checks
[2023-07-25T02:14:15,461][INFO ][o.o.c.c.Coordinator ] [os-node3] setting initial configuration to VotingConfiguration{{bootstrap-placeholder}-os-node1,bVYnmh1NRA-wo7jxLqcSGA,P9h3-UXnSTGnsdPSw8IzYQ}
[2023-07-25T02:14:15,536][INFO ][o.o.c.s.MasterService ] [os-node3] elected-as-cluster-manager ([2] nodes joined)[{os-node2}{P9h3-UXnSTGnsdPSw8IzYQ}{GLuWLKrDRYK9QIAlIQnezA}{10.0.10.22}{10.0.10.22:9300}{dimr}{shard_indexing_pressure_enabled=true} elect leader, {os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true} elect leader, _BECOME_CLUSTER_MANAGER_TASK_, _FINISH_ELECTION_], term: 2, version: 1, delta: cluster-manager node changed {previous [], current [{os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true}]}, added {{os-node2}{P9h3-UXnSTGnsdPSw8IzYQ}{GLuWLKrDRYK9QIAlIQnezA}{10.0.10.22}{10.0.10.22:9300}{dimr}{shard_indexing_pressure_enabled=true}}
[2023-07-25T02:14:15,574][INFO ][o.o.c.c.CoordinationState] [os-node3] cluster UUID set to [vPTgjomxR4OufpYvcyz8RA]
[2023-07-25T02:14:15,622][INFO ][o.o.c.s.ClusterApplierService] [os-node3] cluster-manager node changed {previous [], current [{os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true}]}, added {{os-node2}{P9h3-UXnSTGnsdPSw8IzYQ}{GLuWLKrDRYK9QIAlIQnezA}{10.0.10.22}{10.0.10.22:9300}{dimr}{shard_indexing_pressure_enabled=true}}, term: 2, version: 1, reason: Publication{term=2, version=1}
[2023-07-25T02:14:15,629][INFO ][o.o.a.c.ADClusterEventListener] [os-node3] Cluster is not recovered yet.
[2023-07-25T02:14:15,633][INFO ][o.o.a.u.d.DestinationMigrationCoordinator] [os-node3] Detected cluster change event for destination migration
[2023-07-25T02:14:15,639][INFO ][o.o.c.r.a.DiskThresholdMonitor] [os-node3] skipping monitor as a check is already in progress
[2023-07-25T02:14:15,648][INFO ][o.o.i.i.ManagedIndexCoordinator] [os-node3] Cache cluster manager node onClusterManager time: 1690251255648
[2023-07-25T02:14:15,650][INFO ][o.o.m.a.MLModelAutoReDeployer] [os-node3] Model auto reload configuration is false, not performing auto reloading!
[2023-07-25T02:14:15,653][WARN ][o.o.p.c.s.h.ConfigOverridesClusterSettingHandler] [os-node3] Config override setting update called with empty string. Ignoring.
[2023-07-25T02:14:15,658][INFO ][o.o.d.PeerFinder ] [os-node3] setting findPeersInterval to [1s] as node commission status = [true] for local node [{os-node3}{bVYnmh1NRA-wo7jxLqcSGA}{ONfMrMEiQzSHiuYXeVmI0w}{10.0.10.14}{10.0.10.14:9300}{dimr}{shard_indexing_pressure_enabled=true}]
[2023-07-25T02:14:15,660][INFO ][o.o.h.AbstractHttpServerTransport] [os-node3] publish_address {10.0.10.14:9200}, bound_addresses {0.0.0.0:9200}
[2023-07-25T02:14:15,661][INFO ][o.o.n.Node ] [os-node3] started
[2023-07-25T02:14:15,661][INFO ][o.o.s.OpenSearchSecurityPlugin] [os-node3] Node started
[2023-07-25T02:14:15,661][INFO ][o.o.s.OpenSearchSecurityPlugin] [os-node3] 0 OpenSearch Security modules loaded so far: []
[2023-07-25T02:14:15,686][INFO ][o.o.a.c.HashRing ] [os-node3] Node added: [bVYnmh1NRA-wo7jxLqcSGA, P9h3-UXnSTGnsdPSw8IzYQ]
[2023-07-25T02:14:15,687][INFO ][o.o.a.u.d.DestinationMigrationCoordinator] [os-node3] Detected cluster change event for destination migration
[2023-07-25T02:14:18,615][INFO ][o.o.c.r.a.AllocationService] [os-node3] Cluster health status changed from [YELLOW] to [GREEN] (reason: [shards started [[.kibana_1][0]]]).
[2023-07-25T02:14:18,646][INFO ][o.o.a.u.d.DestinationMigrationCoordinator] [os-node3] Detected cluster change event for destination migration
[2023-07-25T02:14:45,456][WARN ][o.o.d.HandshakingTransportAddressConnector] [os-node3] [connectToRemoteMasterNode[10.0.10.19:9300]] completed handshake with [{os-node1}{I4dTdjuzRrC_spgVMveBHw}{A8n71U64R0iwJmaV5S92hw}{10.0.0.23}{10.0.0.23:9300}{dimr}{shard_indexing_pressure_enabled=true}] but followup connection failed
org.opensearch.transport.ConnectTransportException: [os-node1][10.0.0.23:9300] connect_exception
at org.opensearch.transport.TcpTransport$ChannelsConnectedListener.onFailure(TcpTransport.java:1076) ~[opensearch-2.8.0.jar:2.8.0]
at org.opensearch.action.ActionListener.lambda$toBiConsumer$2(ActionListener.java:215) ~[opensearch-2.8.0.jar:2.8.0]
at org.opensearch.common.concurrent.CompletableContext.lambda$addListener$0(CompletableContext.java:57) ~[opensearch-common-2.8.0.jar:2.8.0]
at java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863) ~[?:?]
at java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841) ~[?:?]
at java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510) ~[?:?]
at java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2162) ~[?:?]
at org.opensearch.common.concurrent.CompletableContext.completeExceptionally(CompletableContext.java:72) ~[opensearch-common-2.8.0.jar:2.8.0]
at org.opensearch.transport.netty4.Netty4TcpChannel.lambda$addListener$0(Netty4TcpChannel.java:81) ~[?:?]
at io.netty.util.concurrent.DefaultPromise.notifyListener0(DefaultPromise.java:590) ~[?:?]
at io.netty.util.concurrent.DefaultPromise.notifyListeners0(DefaultPromise.java:583) ~[?:?]
at io.netty.util.concurrent.DefaultPromise.notifyListenersNow(DefaultPromise.java:559) ~[?:?]
at io.netty.util.concurrent.DefaultPromise.notifyListeners(DefaultPromise.java:492) ~[?:?]
at io.netty.util.concurrent.DefaultPromise.setValue0(DefaultPromise.java:636) ~[?:?]
at io.netty.util.concurrent.DefaultPromise.setFailure0(DefaultPromise.java:629) ~[?:?]
at io.netty.util.concurrent.DefaultPromise.tryFailure(DefaultPromise.java:118) ~[?:?]
at io.netty.channel.nio.AbstractNioChannel$AbstractNioUnsafe$1.run(AbstractNioChannel.java:262) ~[?:?]
at io.netty.util.concurrent.PromiseTask.runTask(PromiseTask.java:98) ~[?:?]
at io.netty.util.concurrent.ScheduledFutureTask.run(ScheduledFutureTask.java:153) ~[?:?]
at io.netty.util.concurrent.AbstractEventExecutor.runTask(AbstractEventExecutor.java:174) ~[?:?]
at io.netty.util.concurrent.AbstractEventExecutor.safeExecute(AbstractEventExecutor.java:167) ~[?:?]
at io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(SingleThreadEventExecutor.java:470) ~[?:?]
at io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:569) ~[?:?]
at io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:997) ~[?:?]
at io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74) ~[?:?]
at java.lang.Thread.run(Thread.java:833) [?:?]
Caused by: io.netty.channel.ConnectTimeoutException: connection timed out: 10.0.0.23/10.0.0.23:9300
at io.netty.channel.nio.AbstractNioChannel$AbstractNioUnsafe$1.run(AbstractNioChannel.java:261) ~[?:?]
... 9 more
Docker Network Configuration
docker network ls | grep opensearch_net
59k0hn8zxn6k opensearch_net overlay swarm
docker network ls | grep internal_prod
6ebe0bze3ztm internal_prod overlay swarm
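For reference, the subnets behind each network, and which containers are attached on a given host, can be pulled with docker network inspect:

# full details: subnet, attached containers on this host
docker network inspect opensearch_net

# just the IPAM subnets
docker network inspect opensearch_net --format '{{json .IPAM.Config}}'
docker network inspect internal_prod --format '{{json .IPAM.Config}}'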