Opensearch Cluster

Sanjayraja1211 · June 23, 2025, 10:24am

Versions (relevant - OpenSearch/Dashboard/Server OS/Browser): 2.0

Describe the issue:
I am using OpenSearch Pipeline version 2. My data source is an OCI Streaming topic, and I have configured the pipeline to sink data into an OpenSearch index. However, I am observing that millions of duplicate records are being ingested into the OpenSearch index.

In my setup, I build multiple OpenSearch dashboards that rely on SQL queries executed via Metricbeat. The data flow is as follows: Metricbeat queries Oracle using SQL, pushes results to Kafka (OCI Streaming), and the OpenSearch pipeline consumes these streams to index the data.

Currently, I have designed the architecture such that each pipeline handles one stream and writes to one corresponding OpenSearch index. I am wondering if this is the right approach or if there is a more efficient design. Specifically, I want to know whether I can have one pipeline that consumes multiple streams and routes the data to multiple OpenSearch indices, instead of maintaining separate pipelines for each stream.

Configuration:

Relevant Logs or Screenshots:

Topic		Replies	Views
OpenSearch cannot start because of 'Duplicate field 'path.data'" OpenSearch configure	2	17	May 14, 2025
Data duplication in Opensearch Dashboard OpenSearch	1	887	October 18, 2022
Logstash config for Multiple pipelines usage Open Source Elasticsearch and Kibana configure , install	7	578	June 2, 2024
Duplicated .kibana indices in each Opendistro update Security	1	583	February 7, 2023
_reindex in "Dev Tools" OSD 2.7 not working as expected OpenSearch Dashboards	0	250	May 15, 2023

Opensearch Cluster

Related topics